CN101868545B - 具有改变的根构造的植物、涉及编码富含亮氨酸重复序列激酶(llrk)多肽及其同源物的基因的相关构建体和方法 - Google Patents
具有改变的根构造的植物、涉及编码富含亮氨酸重复序列激酶(llrk)多肽及其同源物的基因的相关构建体和方法 Download PDFInfo
- Publication number
- CN101868545B CN101868545B CN2008801169274A CN200880116927A CN101868545B CN 101868545 B CN101868545 B CN 101868545B CN 2008801169274 A CN2008801169274 A CN 2008801169274A CN 200880116927 A CN200880116927 A CN 200880116927A CN 101868545 B CN101868545 B CN 101868545B
- Authority
- CN
- China
- Prior art keywords
- plant
- recombinant dna
- sequence
- plants
- seq
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims abstract description 270
- 108090000765 processed proteins & peptides Proteins 0.000 title claims abstract description 109
- 229920001184 polypeptide Polymers 0.000 title claims abstract description 107
- 102000004196 processed proteins & peptides Human genes 0.000 title claims abstract description 107
- 108090000623 proteins and genes Proteins 0.000 title description 246
- 108010006444 Leucine-Rich Repeat Proteins Proteins 0.000 title description 6
- 108091000080 Phosphotransferase Proteins 0.000 title description 6
- 210000004901 leucine-rich repeat Anatomy 0.000 title description 6
- 102000020233 phosphotransferase Human genes 0.000 title description 6
- 241000196324 Embryophyta Species 0.000 claims abstract description 740
- 108020004511 Recombinant DNA Proteins 0.000 claims abstract description 150
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 83
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 83
- 239000002157 polynucleotide Substances 0.000 claims abstract description 83
- 230000009261 transgenic effect Effects 0.000 claims description 164
- 230000001105 regulatory effect Effects 0.000 claims description 65
- 239000002773 nucleotide Substances 0.000 claims description 61
- 125000003729 nucleotide group Chemical group 0.000 claims description 60
- 238000004458 analytical method Methods 0.000 claims description 39
- 238000010276 construction Methods 0.000 claims description 39
- 230000000295 complement effect Effects 0.000 claims description 38
- 230000008859 change Effects 0.000 claims description 23
- 238000007894 restriction fragment length polymorphism technique Methods 0.000 claims description 14
- 230000001276 controlling effect Effects 0.000 claims description 11
- 238000011156 evaluation Methods 0.000 claims description 9
- 238000009395 breeding Methods 0.000 claims description 7
- 230000001131 transforming effect Effects 0.000 claims description 6
- 238000004519 manufacturing process Methods 0.000 claims description 5
- 238000009396 hybridization Methods 0.000 claims description 3
- 238000002156 mixing Methods 0.000 claims description 3
- 235000013311 vegetables Nutrition 0.000 claims 11
- 125000003275 alpha amino acid group Chemical group 0.000 claims 6
- 108020004414 DNA Proteins 0.000 description 166
- 240000008042 Zea mays Species 0.000 description 143
- 210000004027 cell Anatomy 0.000 description 143
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 140
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 130
- 150000007523 nucleic acids Chemical group 0.000 description 109
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 108
- 235000009973 maize Nutrition 0.000 description 108
- 108091028043 Nucleic acid sequence Proteins 0.000 description 87
- 241000219194 Arabidopsis Species 0.000 description 76
- 150000001413 amino acids Chemical group 0.000 description 71
- 229910052757 nitrogen Inorganic materials 0.000 description 70
- 230000001629 suppression Effects 0.000 description 64
- 239000013598 vector Substances 0.000 description 64
- 230000009418 agronomic effect Effects 0.000 description 61
- 230000009466 transformation Effects 0.000 description 57
- 210000001519 tissue Anatomy 0.000 description 53
- 230000014509 gene expression Effects 0.000 description 49
- 239000002609 medium Substances 0.000 description 47
- 102000004169 proteins and genes Human genes 0.000 description 41
- 239000002299 complementary DNA Substances 0.000 description 40
- 235000018102 proteins Nutrition 0.000 description 39
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 36
- 230000000692 anti-sense effect Effects 0.000 description 35
- 244000068988 Glycine max Species 0.000 description 32
- 108700011259 MicroRNAs Proteins 0.000 description 30
- 239000002679 microRNA Substances 0.000 description 30
- 230000004075 alteration Effects 0.000 description 28
- 210000002257 embryonic structure Anatomy 0.000 description 28
- 239000013612 plasmid Substances 0.000 description 28
- 238000003384 imaging method Methods 0.000 description 27
- 241000589158 Agrobacterium Species 0.000 description 25
- 229940024606 amino acid Drugs 0.000 description 24
- 235000001014 amino acid Nutrition 0.000 description 24
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 22
- 235000005822 corn Nutrition 0.000 description 22
- 230000001172 regenerating effect Effects 0.000 description 22
- 230000007613 environmental effect Effects 0.000 description 21
- 230000012010 growth Effects 0.000 description 21
- 235000010469 Glycine max Nutrition 0.000 description 20
- 108700019146 Transgenes Proteins 0.000 description 20
- 239000013604 expression vector Substances 0.000 description 20
- 108010050848 glycylleucine Proteins 0.000 description 20
- 238000013507 mapping Methods 0.000 description 20
- 239000012634 fragment Substances 0.000 description 19
- 230000009368 gene silencing by RNA Effects 0.000 description 18
- 239000003550 marker Substances 0.000 description 18
- 108020004999 messenger RNA Proteins 0.000 description 18
- 102000039446 nucleic acids Human genes 0.000 description 18
- 108020004707 nucleic acids Proteins 0.000 description 18
- 240000007594 Oryza sativa Species 0.000 description 17
- 235000007164 Oryza sativa Nutrition 0.000 description 17
- 238000012228 RNA interference-mediated gene silencing Methods 0.000 description 17
- 230000000694 effects Effects 0.000 description 17
- 230000008569 process Effects 0.000 description 17
- 235000009566 rice Nutrition 0.000 description 17
- 241000894007 species Species 0.000 description 17
- 238000002869 basic local alignment search tool Methods 0.000 description 16
- 230000000875 corresponding effect Effects 0.000 description 16
- 238000005516 engineering process Methods 0.000 description 16
- 230000008929 regeneration Effects 0.000 description 16
- 238000011069 regeneration method Methods 0.000 description 16
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 15
- 230000001404 mediated effect Effects 0.000 description 15
- 239000000047 product Substances 0.000 description 15
- 239000002028 Biomass Substances 0.000 description 14
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 14
- 239000000203 mixture Substances 0.000 description 14
- FGIUAXJPYTZDNR-UHFFFAOYSA-N potassium nitrate Chemical compound [K+].[O-][N+]([O-])=O FGIUAXJPYTZDNR-UHFFFAOYSA-N 0.000 description 14
- 238000011282 treatment Methods 0.000 description 14
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 14
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 13
- 108020004459 Small interfering RNA Proteins 0.000 description 13
- 238000003780 insertion Methods 0.000 description 13
- 230000037431 insertion Effects 0.000 description 13
- 239000002245 particle Substances 0.000 description 13
- 206010020649 Hyperkeratosis Diseases 0.000 description 12
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 12
- 238000006243 chemical reaction Methods 0.000 description 12
- 108010061238 threonyl-glycine Proteins 0.000 description 12
- 108091026890 Coding region Proteins 0.000 description 11
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 11
- 239000002585 base Substances 0.000 description 11
- 238000004520 electroporation Methods 0.000 description 11
- 238000002474 experimental method Methods 0.000 description 11
- 238000012216 screening Methods 0.000 description 11
- 239000000243 solution Substances 0.000 description 11
- 238000013519 translation Methods 0.000 description 11
- 108091032955 Bacterial small RNA Proteins 0.000 description 10
- 108090000848 Ubiquitin Proteins 0.000 description 10
- 102000044159 Ubiquitin Human genes 0.000 description 10
- 238000013459 approach Methods 0.000 description 10
- 230000018109 developmental process Effects 0.000 description 10
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 10
- 108010037850 glycylvaline Proteins 0.000 description 10
- 230000006798 recombination Effects 0.000 description 10
- 238000005215 recombination Methods 0.000 description 10
- 230000004044 response Effects 0.000 description 10
- 108700028369 Alleles Proteins 0.000 description 9
- 241001465754 Metazoa Species 0.000 description 9
- 230000007022 RNA scission Effects 0.000 description 9
- 229930006000 Sucrose Natural products 0.000 description 9
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 9
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 9
- 108010047495 alanylglycine Proteins 0.000 description 9
- 238000004422 calculation algorithm Methods 0.000 description 9
- 238000011161 development Methods 0.000 description 9
- 235000013399 edible fruits Nutrition 0.000 description 9
- 238000002360 preparation method Methods 0.000 description 9
- 108010031719 prolyl-serine Proteins 0.000 description 9
- 239000002689 soil Substances 0.000 description 9
- 239000000126 substance Substances 0.000 description 9
- 239000005720 sucrose Substances 0.000 description 9
- 238000013518 transcription Methods 0.000 description 9
- 230000035897 transcription Effects 0.000 description 9
- 230000017105 transposition Effects 0.000 description 9
- 101150104463 GOS2 gene Proteins 0.000 description 8
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 8
- 240000005979 Hordeum vulgare Species 0.000 description 8
- 235000007340 Hordeum vulgare Nutrition 0.000 description 8
- 241000880493 Leptailurus serval Species 0.000 description 8
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 8
- 108091081021 Sense strand Proteins 0.000 description 8
- 235000021307 Triticum Nutrition 0.000 description 8
- 241000209140 Triticum Species 0.000 description 8
- 230000004913 activation Effects 0.000 description 8
- 230000030279 gene silencing Effects 0.000 description 8
- 230000007614 genetic variation Effects 0.000 description 8
- 108010078144 glutaminyl-glycine Proteins 0.000 description 8
- 108010079547 glutamylmethionine Proteins 0.000 description 8
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 8
- 108010057821 leucylproline Proteins 0.000 description 8
- 230000000670 limiting effect Effects 0.000 description 8
- 230000035772 mutation Effects 0.000 description 8
- 101150105138 nas2 gene Proteins 0.000 description 8
- 108010051242 phenylalanylserine Proteins 0.000 description 8
- 230000002786 root growth Effects 0.000 description 8
- 230000000392 somatic effect Effects 0.000 description 8
- 239000000725 suspension Substances 0.000 description 8
- 108010090461 DFG peptide Proteins 0.000 description 7
- 102100023387 Endoribonuclease Dicer Human genes 0.000 description 7
- 108091092195 Intron Proteins 0.000 description 7
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 7
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 7
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 7
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 7
- 108010005233 alanylglutamic acid Proteins 0.000 description 7
- 238000003556 assay Methods 0.000 description 7
- 210000000349 chromosome Anatomy 0.000 description 7
- 230000000408 embryogenic effect Effects 0.000 description 7
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 7
- 239000007788 liquid Substances 0.000 description 7
- 230000032361 posttranscriptional gene silencing Effects 0.000 description 7
- 235000010333 potassium nitrate Nutrition 0.000 description 7
- 230000002829 reductive effect Effects 0.000 description 7
- 239000000523 sample Substances 0.000 description 7
- 238000012163 sequencing technique Methods 0.000 description 7
- 108010048818 seryl-histidine Proteins 0.000 description 7
- 239000004055 small Interfering RNA Substances 0.000 description 7
- 108010000700 Acetolactate synthase Proteins 0.000 description 6
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 6
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 6
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 6
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 6
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 6
- 101000907904 Homo sapiens Endoribonuclease Dicer Proteins 0.000 description 6
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 6
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 6
- 229910002651 NO3 Inorganic materials 0.000 description 6
- NHNBFGGVMKEFGY-UHFFFAOYSA-N Nitrate Chemical compound [O-][N+]([O-])=O NHNBFGGVMKEFGY-UHFFFAOYSA-N 0.000 description 6
- 108700026244 Open Reading Frames Proteins 0.000 description 6
- 238000000692 Student's t-test Methods 0.000 description 6
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 6
- 108010081404 acein-2 Proteins 0.000 description 6
- OJOBTAOGJIWAGB-UHFFFAOYSA-N acetosyringone Chemical compound COC1=CC(C(C)=O)=CC(OC)=C1O OJOBTAOGJIWAGB-UHFFFAOYSA-N 0.000 description 6
- GINJFDRNADDBIN-FXQIFTODSA-N bilanafos Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCP(C)(O)=O GINJFDRNADDBIN-FXQIFTODSA-N 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 230000002068 genetic effect Effects 0.000 description 6
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 6
- 210000004209 hair Anatomy 0.000 description 6
- 108010036413 histidylglycine Proteins 0.000 description 6
- SEOVTRFCIGRIMH-UHFFFAOYSA-N indole-3-acetic acid Chemical compound C1=CC=C2C(CC(=O)O)=CNC2=C1 SEOVTRFCIGRIMH-UHFFFAOYSA-N 0.000 description 6
- 208000015181 infectious disease Diseases 0.000 description 6
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 6
- 238000005259 measurement Methods 0.000 description 6
- 230000004048 modification Effects 0.000 description 6
- 238000012986 modification Methods 0.000 description 6
- ZWLUXSQADUDCSB-UHFFFAOYSA-N phthalaldehyde Chemical compound O=CC1=CC=CC=C1C=O ZWLUXSQADUDCSB-UHFFFAOYSA-N 0.000 description 6
- 230000008635 plant growth Effects 0.000 description 6
- 230000010152 pollination Effects 0.000 description 6
- 239000002243 precursor Substances 0.000 description 6
- 108091008146 restriction endonucleases Proteins 0.000 description 6
- 238000004114 suspension culture Methods 0.000 description 6
- 238000012360 testing method Methods 0.000 description 6
- 238000012546 transfer Methods 0.000 description 6
- 238000011144 upstream manufacturing Methods 0.000 description 6
- 239000005631 2,4-Dichlorophenoxyacetic acid Substances 0.000 description 5
- IAJOBQBIJHVGMQ-UHFFFAOYSA-N 2-amino-4-[hydroxy(methyl)phosphoryl]butanoic acid Chemical compound CP(O)(=O)CCC(N)C(O)=O IAJOBQBIJHVGMQ-UHFFFAOYSA-N 0.000 description 5
- 235000014698 Brassica juncea var multisecta Nutrition 0.000 description 5
- 235000006008 Brassica napus var napus Nutrition 0.000 description 5
- 240000000385 Brassica napus var. napus Species 0.000 description 5
- 235000006618 Brassica rapa subsp oleifera Nutrition 0.000 description 5
- 235000004977 Brassica sinapistrum Nutrition 0.000 description 5
- 229920000742 Cotton Polymers 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- 108091060211 Expressed sequence tag Proteins 0.000 description 5
- 229920002148 Gellan gum Polymers 0.000 description 5
- 239000005561 Glufosinate Substances 0.000 description 5
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 5
- 244000299507 Gossypium hirsutum Species 0.000 description 5
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 5
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 5
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 5
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 5
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 5
- 241000219823 Medicago Species 0.000 description 5
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 description 5
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 5
- 108091034117 Oligonucleotide Proteins 0.000 description 5
- 102000000574 RNA-Induced Silencing Complex Human genes 0.000 description 5
- 108010016790 RNA-Induced Silencing Complex Proteins 0.000 description 5
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 5
- 244000062793 Sorghum vulgare Species 0.000 description 5
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 5
- 238000012098 association analyses Methods 0.000 description 5
- 230000001488 breeding effect Effects 0.000 description 5
- 238000004364 calculation method Methods 0.000 description 5
- 238000003776 cleavage reaction Methods 0.000 description 5
- 239000003623 enhancer Substances 0.000 description 5
- 108010087823 glycyltyrosine Proteins 0.000 description 5
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 5
- 229910052737 gold Inorganic materials 0.000 description 5
- 239000010931 gold Substances 0.000 description 5
- 230000002363 herbicidal effect Effects 0.000 description 5
- 238000010191 image analysis Methods 0.000 description 5
- 230000002401 inhibitory effect Effects 0.000 description 5
- 108010034529 leucyl-lysine Proteins 0.000 description 5
- 210000001161 mammalian embryo Anatomy 0.000 description 5
- 230000007246 mechanism Effects 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 230000009467 reduction Effects 0.000 description 5
- 230000002441 reversible effect Effects 0.000 description 5
- 230000007017 scission Effects 0.000 description 5
- 238000002864 sequence alignment Methods 0.000 description 5
- 108010026333 seryl-proline Proteins 0.000 description 5
- 229960000268 spectinomycin Drugs 0.000 description 5
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 5
- 238000006467 substitution reaction Methods 0.000 description 5
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 5
- 101150085703 vir gene Proteins 0.000 description 5
- 230000003612 virological effect Effects 0.000 description 5
- LWTDZKXXJRRKDG-KXBFYZLASA-N (-)-phaseollin Chemical compound C1OC2=CC(O)=CC=C2[C@H]2[C@@H]1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-KXBFYZLASA-N 0.000 description 4
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 4
- 229920001817 Agar Polymers 0.000 description 4
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 4
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 4
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 4
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 4
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 4
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 4
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 4
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 4
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 4
- 108020004705 Codon Proteins 0.000 description 4
- ZMWOJVAXTOUHAP-ZKWXMUAHSA-N Cys-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N ZMWOJVAXTOUHAP-ZKWXMUAHSA-N 0.000 description 4
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 4
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 4
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 4
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 4
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 4
- MTBIKIMYHUWBRX-QWRGUYRKSA-N Gly-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN MTBIKIMYHUWBRX-QWRGUYRKSA-N 0.000 description 4
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 4
- 239000004471 Glycine Substances 0.000 description 4
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 4
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 4
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 4
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 4
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 4
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 4
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 4
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 4
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 4
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 4
- 241000209510 Liliopsida Species 0.000 description 4
- OIFHHODAXVWKJN-ULQDDVLXSA-N Met-Phe-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 OIFHHODAXVWKJN-ULQDDVLXSA-N 0.000 description 4
- 101001000581 Oryza sativa subsp. japonica Glucose-6-phosphate isomerase, cytosolic A Proteins 0.000 description 4
- 108700001094 Plant Genes Proteins 0.000 description 4
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 4
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 4
- 108010079005 RDV peptide Proteins 0.000 description 4
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 4
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 4
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 4
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 4
- 244000061456 Solanum tuberosum Species 0.000 description 4
- 235000002595 Solanum tuberosum Nutrition 0.000 description 4
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 4
- MDXLPNRXCFOBTL-BZSNNMDCSA-N Tyr-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MDXLPNRXCFOBTL-BZSNNMDCSA-N 0.000 description 4
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 4
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 4
- 239000008272 agar Substances 0.000 description 4
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 4
- 108010087924 alanylproline Proteins 0.000 description 4
- 108010068380 arginylarginine Proteins 0.000 description 4
- 108010092854 aspartyllysine Proteins 0.000 description 4
- 108010068265 aspartyltyrosine Proteins 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 239000003086 colorant Substances 0.000 description 4
- 239000003184 complementary RNA Substances 0.000 description 4
- 238000012226 gene silencing method Methods 0.000 description 4
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 4
- 108010077515 glycylproline Proteins 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- 238000003306 harvesting Methods 0.000 description 4
- 230000036541 health Effects 0.000 description 4
- 239000001307 helium Substances 0.000 description 4
- 229910052734 helium Inorganic materials 0.000 description 4
- SWQJXJOGLNCZEY-UHFFFAOYSA-N helium atom Chemical compound [He] SWQJXJOGLNCZEY-UHFFFAOYSA-N 0.000 description 4
- 239000004009 herbicide Substances 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 239000005556 hormone Substances 0.000 description 4
- 229940088597 hormone Drugs 0.000 description 4
- 230000001939 inductive effect Effects 0.000 description 4
- 230000010354 integration Effects 0.000 description 4
- 108010017391 lysylvaline Proteins 0.000 description 4
- 108010058731 nopaline synthase Proteins 0.000 description 4
- 235000015097 nutrients Nutrition 0.000 description 4
- 230000037361 pathway Effects 0.000 description 4
- 239000008188 pellet Substances 0.000 description 4
- 230000008488 polyadenylation Effects 0.000 description 4
- 229920000642 polymer Polymers 0.000 description 4
- 102000054765 polymorphisms of proteins Human genes 0.000 description 4
- 238000003757 reverse transcription PCR Methods 0.000 description 4
- ATHGHQPFGPMSJY-UHFFFAOYSA-N spermidine Chemical compound NCCCCNCCCN ATHGHQPFGPMSJY-UHFFFAOYSA-N 0.000 description 4
- 230000002269 spontaneous effect Effects 0.000 description 4
- 238000007619 statistical method Methods 0.000 description 4
- 230000035882 stress Effects 0.000 description 4
- 238000012353 t test Methods 0.000 description 4
- 108010073969 valyllysine Proteins 0.000 description 4
- 230000017260 vegetative to reproductive phase transition of meristem Effects 0.000 description 4
- 108091005957 yellow fluorescent proteins Proteins 0.000 description 4
- HXUVTXPOZRFMOY-NSHDSACASA-N 2-[[(2s)-2-[[2-[(2-aminoacetyl)amino]acetyl]amino]-3-phenylpropanoyl]amino]acetic acid Chemical compound NCC(=O)NCC(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 HXUVTXPOZRFMOY-NSHDSACASA-N 0.000 description 3
- OCKGFTQIICXDQW-ZEQRLZLVSA-N 5-[(1r)-1-hydroxy-2-[4-[(2r)-2-hydroxy-2-(4-methyl-1-oxo-3h-2-benzofuran-5-yl)ethyl]piperazin-1-yl]ethyl]-4-methyl-3h-2-benzofuran-1-one Chemical compound C1=C2C(=O)OCC2=C(C)C([C@@H](O)CN2CCN(CC2)C[C@H](O)C2=CC=C3C(=O)OCC3=C2C)=C1 OCKGFTQIICXDQW-ZEQRLZLVSA-N 0.000 description 3
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 3
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 3
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 3
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 3
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 3
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 3
- 108020005544 Antisense RNA Proteins 0.000 description 3
- 101100194010 Arabidopsis thaliana RD29A gene Proteins 0.000 description 3
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 3
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 3
- JEXPNDORFYHJTM-IHRRRGAJSA-N Arg-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCN=C(N)N JEXPNDORFYHJTM-IHRRRGAJSA-N 0.000 description 3
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 3
- VEAIMHJZTIDCIH-KKUMJFAQSA-N Arg-Phe-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEAIMHJZTIDCIH-KKUMJFAQSA-N 0.000 description 3
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 3
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 3
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 3
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 3
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 3
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 3
- 241000219198 Brassica Species 0.000 description 3
- 241000701489 Cauliflower mosaic virus Species 0.000 description 3
- 244000298479 Cichorium intybus Species 0.000 description 3
- 240000001980 Cucurbita pepo Species 0.000 description 3
- XGIAHEUULGOZHH-GUBZILKMSA-N Cys-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N XGIAHEUULGOZHH-GUBZILKMSA-N 0.000 description 3
- 108010002537 Fruit Proteins Proteins 0.000 description 3
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 3
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 3
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 3
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 3
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 3
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 3
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 3
- DJTXYXZNNDDEOU-WHFBIAKZSA-N Gly-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)C(=O)N DJTXYXZNNDDEOU-WHFBIAKZSA-N 0.000 description 3
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 3
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 3
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 3
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 3
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 3
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 3
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 3
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 3
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 3
- 244000020551 Helianthus annuus Species 0.000 description 3
- 235000003222 Helianthus annuus Nutrition 0.000 description 3
- LBCAQRFTWMMWRR-CIUDSAMLSA-N His-Cys-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O LBCAQRFTWMMWRR-CIUDSAMLSA-N 0.000 description 3
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 3
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 3
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 3
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 3
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 3
- 235000002678 Ipomoea batatas Nutrition 0.000 description 3
- 244000017020 Ipomoea batatas Species 0.000 description 3
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 3
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 3
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 3
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 3
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 3
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 3
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 3
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 3
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 3
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 3
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 3
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 3
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 3
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 3
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 3
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 108091092878 Microsatellite Proteins 0.000 description 3
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 3
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 3
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 3
- 102100023609 Nucleoside diphosphate kinase, mitochondrial Human genes 0.000 description 3
- 101000893863 Oryza sativa subsp. japonica Glucose-6-phosphate isomerase, cytosolic B Proteins 0.000 description 3
- 235000010627 Phaseolus vulgaris Nutrition 0.000 description 3
- 244000046052 Phaseolus vulgaris Species 0.000 description 3
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 3
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 3
- 235000010582 Pisum sativum Nutrition 0.000 description 3
- 240000004713 Pisum sativum Species 0.000 description 3
- 108010064851 Plant Proteins Proteins 0.000 description 3
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 3
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 3
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 3
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 3
- 235000004443 Ricinus communis Nutrition 0.000 description 3
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 3
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 3
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 3
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 3
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 3
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 3
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 3
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 3
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 3
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 235000011684 Sorghum saccharatum Nutrition 0.000 description 3
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 3
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 3
- SIMKLINEDYOTKL-MBLNEYKQSA-N Thr-His-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C)C(=O)O)N)O SIMKLINEDYOTKL-MBLNEYKQSA-N 0.000 description 3
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 3
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 3
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 3
- WMIUTJPFHMMUGY-ZFWWWQNUSA-N Trp-Pro-Gly Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)NCC(=O)O WMIUTJPFHMMUGY-ZFWWWQNUSA-N 0.000 description 3
- CDRYEAWHKJSGAF-BPNCWPANSA-N Tyr-Ala-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O CDRYEAWHKJSGAF-BPNCWPANSA-N 0.000 description 3
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 3
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 3
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 3
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 3
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 3
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 3
- 208000036142 Viral infection Diseases 0.000 description 3
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- 108010070783 alanyltyrosine Proteins 0.000 description 3
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 3
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 3
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 3
- 108010060035 arginylproline Proteins 0.000 description 3
- 230000033228 biological regulation Effects 0.000 description 3
- 101150102092 ccdB gene Proteins 0.000 description 3
- 238000012512 characterization method Methods 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 229960005091 chloramphenicol Drugs 0.000 description 3
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 3
- 230000002759 chromosomal effect Effects 0.000 description 3
- 230000009089 cytolysis Effects 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 108010054812 diprotin A Proteins 0.000 description 3
- 238000009826 distribution Methods 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 239000003814 drug Substances 0.000 description 3
- 230000013020 embryo development Effects 0.000 description 3
- 241001233957 eudicotyledons Species 0.000 description 3
- 102000054766 genetic haplotypes Human genes 0.000 description 3
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 3
- 108010089804 glycyl-threonine Proteins 0.000 description 3
- 108010081551 glycylphenylalanine Proteins 0.000 description 3
- 108010084389 glycyltryptophan Proteins 0.000 description 3
- 108010085325 histidylproline Proteins 0.000 description 3
- 238000005286 illumination Methods 0.000 description 3
- 239000003617 indole-3-acetic acid Substances 0.000 description 3
- 230000005764 inhibitory process Effects 0.000 description 3
- 230000014601 lateral root branching Effects 0.000 description 3
- 108091023663 let-7 stem-loop Proteins 0.000 description 3
- 108091063478 let-7-1 stem-loop Proteins 0.000 description 3
- 108091049777 let-7-2 stem-loop Proteins 0.000 description 3
- 108091053735 lin-4 stem-loop Proteins 0.000 description 3
- 108091032363 lin-4-1 stem-loop Proteins 0.000 description 3
- 108091028008 lin-4-2 stem-loop Proteins 0.000 description 3
- 108010064235 lysylglycine Proteins 0.000 description 3
- 108010054155 lysyllysine Proteins 0.000 description 3
- 210000003739 neck Anatomy 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 235000021232 nutrient availability Nutrition 0.000 description 3
- 230000002018 overexpression Effects 0.000 description 3
- 108010018625 phenylalanylarginine Proteins 0.000 description 3
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 3
- 108010082527 phosphinothricin N-acetyltransferase Proteins 0.000 description 3
- 238000003976 plant breeding Methods 0.000 description 3
- 235000021118 plant-derived protein Nutrition 0.000 description 3
- 239000013600 plasmid vector Substances 0.000 description 3
- 108010054624 red fluorescent protein Proteins 0.000 description 3
- 230000021749 root development Effects 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 108010007375 seryl-seryl-seryl-arginine Proteins 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 230000032258 transport Effects 0.000 description 3
- 108010038745 tryptophylglycine Proteins 0.000 description 3
- 238000010200 validation analysis Methods 0.000 description 3
- 230000009385 viral infection Effects 0.000 description 3
- JLIDBLDQVAYHNE-YKALOCIXSA-N (+)-Abscisic acid Chemical compound OC(=O)/C=C(/C)\C=C\[C@@]1(O)C(C)=CC(=O)CC1(C)C JLIDBLDQVAYHNE-YKALOCIXSA-N 0.000 description 2
- WOJJIRYPFAZEPF-YFKPBYRVSA-N 2-[[(2s)-2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]propanoyl]amino]acetate Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)CNC(=O)CN WOJJIRYPFAZEPF-YFKPBYRVSA-N 0.000 description 2
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 2
- 102000007469 Actins Human genes 0.000 description 2
- 108010085238 Actins Proteins 0.000 description 2
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 2
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 2
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 description 2
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 2
- XYKDZXKKYOOTGC-FXQIFTODSA-N Ala-Cys-Met Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(=O)O)N XYKDZXKKYOOTGC-FXQIFTODSA-N 0.000 description 2
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 2
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 2
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 2
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 2
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 2
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 2
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 2
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 2
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 2
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 2
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 2
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 2
- 244000144730 Amygdalus persica Species 0.000 description 2
- 244000105624 Arachis hypogaea Species 0.000 description 2
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 2
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 2
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 2
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 2
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 2
- HJAICMSAKODKRF-GUBZILKMSA-N Arg-Cys-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O HJAICMSAKODKRF-GUBZILKMSA-N 0.000 description 2
- JTWOBPNAVBESFW-FXQIFTODSA-N Arg-Cys-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N JTWOBPNAVBESFW-FXQIFTODSA-N 0.000 description 2
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 2
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 2
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 2
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 2
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 2
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 2
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 2
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 2
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 2
- GIMTZGADWZTZGV-DCAQKATOSA-N Arg-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GIMTZGADWZTZGV-DCAQKATOSA-N 0.000 description 2
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 2
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 2
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 2
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 2
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 2
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 2
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 2
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 2
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 2
- BDMIFVIWCNLDCT-CIUDSAMLSA-N Asn-Arg-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O BDMIFVIWCNLDCT-CIUDSAMLSA-N 0.000 description 2
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 2
- PTNFNTOBUDWHNZ-GUBZILKMSA-N Asn-Arg-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O PTNFNTOBUDWHNZ-GUBZILKMSA-N 0.000 description 2
- DXZNJWFECGJCQR-FXQIFTODSA-N Asn-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N DXZNJWFECGJCQR-FXQIFTODSA-N 0.000 description 2
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 2
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 2
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 2
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 2
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 2
- LWXJVHTUEDHDLG-XUXIUFHCSA-N Asn-Leu-Leu-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LWXJVHTUEDHDLG-XUXIUFHCSA-N 0.000 description 2
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 2
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 2
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 2
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 2
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 2
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 2
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 2
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 2
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 2
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 2
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 2
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 2
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 2
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 2
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 2
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 2
- VXEORMGBKTUUCM-KWBADKCTSA-N Asp-Val-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O VXEORMGBKTUUCM-KWBADKCTSA-N 0.000 description 2
- 244000003416 Asparagus officinalis Species 0.000 description 2
- 235000005340 Asparagus officinalis Nutrition 0.000 description 2
- 235000007319 Avena orientalis Nutrition 0.000 description 2
- 244000075850 Avena orientalis Species 0.000 description 2
- 235000000832 Ayote Nutrition 0.000 description 2
- 235000011331 Brassica Nutrition 0.000 description 2
- 240000002791 Brassica napus Species 0.000 description 2
- 235000011293 Brassica napus Nutrition 0.000 description 2
- 235000011299 Brassica oleracea var botrytis Nutrition 0.000 description 2
- 240000003259 Brassica oleracea var. botrytis Species 0.000 description 2
- 235000009467 Carica papaya Nutrition 0.000 description 2
- 240000006432 Carica papaya Species 0.000 description 2
- 235000007542 Cichorium intybus Nutrition 0.000 description 2
- 241000207199 Citrus Species 0.000 description 2
- 108091035707 Consensus sequence Proteins 0.000 description 2
- 244000241257 Cucumis melo Species 0.000 description 2
- 235000009854 Cucurbita moschata Nutrition 0.000 description 2
- 235000009804 Cucurbita pepo subsp pepo Nutrition 0.000 description 2
- CPTUXCUWQIBZIF-ZLUOBGJFSA-N Cys-Asn-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CPTUXCUWQIBZIF-ZLUOBGJFSA-N 0.000 description 2
- LBOLGUYQEPZSKM-YUMQZZPRSA-N Cys-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N LBOLGUYQEPZSKM-YUMQZZPRSA-N 0.000 description 2
- MTNJRNQDDSWQQA-GQGQLFGLSA-N Cys-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CS)N MTNJRNQDDSWQQA-GQGQLFGLSA-N 0.000 description 2
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 2
- 102000004594 DNA Polymerase I Human genes 0.000 description 2
- 108010017826 DNA Polymerase I Proteins 0.000 description 2
- 101001031598 Dictyostelium discoideum Probable serine/threonine-protein kinase fhkC Proteins 0.000 description 2
- 102100031780 Endonuclease Human genes 0.000 description 2
- 241000234643 Festuca arundinacea Species 0.000 description 2
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 2
- BLOXULLYFRGYKZ-GUBZILKMSA-N Gln-Glu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BLOXULLYFRGYKZ-GUBZILKMSA-N 0.000 description 2
- HHRAEXBUNGTOGZ-IHRRRGAJSA-N Gln-Phe-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O HHRAEXBUNGTOGZ-IHRRRGAJSA-N 0.000 description 2
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 2
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 2
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 2
- SRZLHYPAOXBBSB-HJGDQZAQSA-N Glu-Arg-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SRZLHYPAOXBBSB-HJGDQZAQSA-N 0.000 description 2
- XMVLTPMCUJTJQP-FXQIFTODSA-N Glu-Gln-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N XMVLTPMCUJTJQP-FXQIFTODSA-N 0.000 description 2
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 2
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 2
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 2
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 2
- ZCFNZTVIDMLUQC-SXNHZJKMSA-N Glu-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZCFNZTVIDMLUQC-SXNHZJKMSA-N 0.000 description 2
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 2
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 2
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 2
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 2
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 2
- HGJREIGJLUQBTJ-SZMVWBNQSA-N Glu-Trp-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O HGJREIGJLUQBTJ-SZMVWBNQSA-N 0.000 description 2
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- 108010068370 Glutens Proteins 0.000 description 2
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 2
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 2
- PYUCNHJQQVSPGN-BQBZGAKWSA-N Gly-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)CN=C(N)N PYUCNHJQQVSPGN-BQBZGAKWSA-N 0.000 description 2
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 2
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 2
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 2
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 2
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 2
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 2
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 2
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 2
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 2
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 2
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 2
- JPAACTMBBBGAAR-HOTGVXAUSA-N Gly-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)CC(C)C)C(O)=O)=CNC2=C1 JPAACTMBBBGAAR-HOTGVXAUSA-N 0.000 description 2
- WRFOZIJRODPLIA-QWRGUYRKSA-N Gly-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O WRFOZIJRODPLIA-QWRGUYRKSA-N 0.000 description 2
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 2
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 2
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- QQJMARNOLHSJCQ-DCAQKATOSA-N His-Cys-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N QQJMARNOLHSJCQ-DCAQKATOSA-N 0.000 description 2
- ZRSJXIKQXUGKRB-TUBUOCAGSA-N His-Ile-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZRSJXIKQXUGKRB-TUBUOCAGSA-N 0.000 description 2
- VYUXYMRNGALHEA-DLOVCJGASA-N His-Leu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O VYUXYMRNGALHEA-DLOVCJGASA-N 0.000 description 2
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 2
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 2
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 2
- KIAOPHMUNPPGEN-PEXQALLHSA-N Ile-Gly-His Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KIAOPHMUNPPGEN-PEXQALLHSA-N 0.000 description 2
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 2
- DBXXASNNDTXOLU-MXAVVETBSA-N Ile-Leu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DBXXASNNDTXOLU-MXAVVETBSA-N 0.000 description 2
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 2
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 2
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 2
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- 239000004201 L-cysteine Substances 0.000 description 2
- 235000013878 L-cysteine Nutrition 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 2
- NTRAGDHVSGKUSF-AVGNSLFASA-N Leu-Arg-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NTRAGDHVSGKUSF-AVGNSLFASA-N 0.000 description 2
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 2
- GPXFZVUVPCFTMG-AVGNSLFASA-N Leu-Arg-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C GPXFZVUVPCFTMG-AVGNSLFASA-N 0.000 description 2
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 2
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 2
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 2
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 2
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 2
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 2
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 2
- AUNMOHYWTAPQLA-XUXIUFHCSA-N Leu-Met-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AUNMOHYWTAPQLA-XUXIUFHCSA-N 0.000 description 2
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 2
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 2
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 2
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 2
- CNWDWAMPKVYJJB-NUTKFTJISA-N Leu-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CNWDWAMPKVYJJB-NUTKFTJISA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 2
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 2
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 2
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 2
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 2
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 2
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 2
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 2
- DAHQKYYIXPBESV-UWVGGRQHSA-N Lys-Met-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O DAHQKYYIXPBESV-UWVGGRQHSA-N 0.000 description 2
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 2
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 2
- XMMWDTUFTZMQFD-GMOBBJLQSA-N Met-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCSC XMMWDTUFTZMQFD-GMOBBJLQSA-N 0.000 description 2
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 2
- RKIIYGUHIQJCBW-SRVKXCTJSA-N Met-His-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RKIIYGUHIQJCBW-SRVKXCTJSA-N 0.000 description 2
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 2
- MSSJHBAKDDIRMJ-SRVKXCTJSA-N Met-Lys-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MSSJHBAKDDIRMJ-SRVKXCTJSA-N 0.000 description 2
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 2
- YGNUDKAPJARTEM-GUBZILKMSA-N Met-Val-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O YGNUDKAPJARTEM-GUBZILKMSA-N 0.000 description 2
- 240000005561 Musa balbisiana Species 0.000 description 2
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 101710163504 Phaseolin Proteins 0.000 description 2
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 2
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 2
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 2
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 2
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 2
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 2
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 2
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 2
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 2
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 2
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 2
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 2
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 2
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 2
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 2
- QGOZJLYCGRYYRW-KKUMJFAQSA-N Pro-Glu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QGOZJLYCGRYYRW-KKUMJFAQSA-N 0.000 description 2
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 2
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 2
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 2
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 2
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 2
- OOZJHTXCLJUODH-QXEWZRGKSA-N Pro-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 OOZJHTXCLJUODH-QXEWZRGKSA-N 0.000 description 2
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 2
- 102000011195 Profilin Human genes 0.000 description 2
- 108050001408 Profilin Proteins 0.000 description 2
- 235000006040 Prunus persica var persica Nutrition 0.000 description 2
- 240000000111 Saccharum officinarum Species 0.000 description 2
- 235000007201 Saccharum officinarum Nutrition 0.000 description 2
- 101100094582 Schizosaccharomyces pombe (strain 972 / ATCC 24843) rum1 gene Proteins 0.000 description 2
- 235000007238 Secale cereale Nutrition 0.000 description 2
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 2
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 2
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 2
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 2
- UCXDHBORXLVBNC-ZLUOBGJFSA-N Ser-Asn-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O UCXDHBORXLVBNC-ZLUOBGJFSA-N 0.000 description 2
- CTLVSHXLRVEILB-UBHSHLNASA-N Ser-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N CTLVSHXLRVEILB-UBHSHLNASA-N 0.000 description 2
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 2
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 2
- GRSLLFZTTLBOQX-CIUDSAMLSA-N Ser-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N GRSLLFZTTLBOQX-CIUDSAMLSA-N 0.000 description 2
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 2
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 2
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 2
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 2
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 2
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 2
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 2
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 2
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 2
- SDFUZKIAHWRUCS-QEJZJMRPSA-N Ser-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N SDFUZKIAHWRUCS-QEJZJMRPSA-N 0.000 description 2
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 2
- CDBYLPFSWZWCQE-UHFFFAOYSA-L Sodium Carbonate Chemical compound [Na+].[Na+].[O-]C([O-])=O CDBYLPFSWZWCQE-UHFFFAOYSA-L 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 2
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 2
- NRUPKQSXTJNQGD-XGEHTFHBSA-N Thr-Cys-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NRUPKQSXTJNQGD-XGEHTFHBSA-N 0.000 description 2
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 2
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 2
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 2
- UBDDORVPVLEECX-FJXKBIBVSA-N Thr-Gly-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UBDDORVPVLEECX-FJXKBIBVSA-N 0.000 description 2
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 2
- KDGBLMDAPJTQIW-RHYQMDGZSA-N Thr-Met-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N)O KDGBLMDAPJTQIW-RHYQMDGZSA-N 0.000 description 2
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 2
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 2
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 2
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 2
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 2
- 108091036066 Three prime untranslated region Proteins 0.000 description 2
- GHXXDFDIDHIEIL-WFBYXXMGSA-N Trp-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GHXXDFDIDHIEIL-WFBYXXMGSA-N 0.000 description 2
- OZUJUVFWMHTWCZ-HOCLYGCPSA-N Trp-Gly-His Chemical compound N[C@@H](Cc1c[nH]c2ccccc12)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OZUJUVFWMHTWCZ-HOCLYGCPSA-N 0.000 description 2
- IQXWAJUIAQLZNX-IHPCNDPISA-N Trp-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N IQXWAJUIAQLZNX-IHPCNDPISA-N 0.000 description 2
- JONPRIHUYSPIMA-UWJYBYFXSA-N Tyr-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JONPRIHUYSPIMA-UWJYBYFXSA-N 0.000 description 2
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 2
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 2
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 2
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 2
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 2
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 2
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- 108010064997 VPY tripeptide Proteins 0.000 description 2
- 244000078534 Vaccinium myrtillus Species 0.000 description 2
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 2
- VJOWWOGRNXRQMF-UVBJJODRSA-N Val-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 VJOWWOGRNXRQMF-UVBJJODRSA-N 0.000 description 2
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 2
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 2
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 2
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 2
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 2
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 2
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 2
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 2
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 2
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 2
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 2
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 2
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 2
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 2
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 2
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 2
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 235000009754 Vitis X bourquina Nutrition 0.000 description 2
- 235000012333 Vitis X labruscana Nutrition 0.000 description 2
- 240000006365 Vitis vinifera Species 0.000 description 2
- 235000014787 Vitis vinifera Nutrition 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 108010050181 aleurone Proteins 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 238000000540 analysis of variance Methods 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 230000011681 asexual reproduction Effects 0.000 description 2
- 238000013465 asexual reproduction Methods 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 230000010310 bacterial transformation Effects 0.000 description 2
- 230000004888 barrier function Effects 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- FPPNZSSZRUTDAP-UWFZAAFLSA-N carbenicillin Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)C(C(O)=O)C1=CC=CC=C1 FPPNZSSZRUTDAP-UWFZAAFLSA-N 0.000 description 2
- 229960003669 carbenicillin Drugs 0.000 description 2
- 230000007910 cell fusion Effects 0.000 description 2
- 235000020971 citrus fruits Nutrition 0.000 description 2
- 238000003053 completely randomized design Methods 0.000 description 2
- 238000012790 confirmation Methods 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- 238000007405 data analysis Methods 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 230000003828 downregulation Effects 0.000 description 2
- 230000005284 excitation Effects 0.000 description 2
- 239000012737 fresh medium Substances 0.000 description 2
- 230000035784 germination Effects 0.000 description 2
- 239000008103 glucose Substances 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010050792 glutenin Proteins 0.000 description 2
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 239000005090 green fluorescent protein Substances 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 125000001165 hydrophobic group Chemical group 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 239000003112 inhibitor Substances 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- 231100000518 lethal Toxicity 0.000 description 2
- 230000001665 lethal effect Effects 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 230000035800 maturation Effects 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- QSHDDOUJBYECFT-UHFFFAOYSA-N mercury Chemical compound [Hg] QSHDDOUJBYECFT-UHFFFAOYSA-N 0.000 description 2
- 229910052753 mercury Inorganic materials 0.000 description 2
- 230000000442 meristematic effect Effects 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 238000000520 microinjection Methods 0.000 description 2
- 235000019713 millet Nutrition 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 230000001338 necrotic effect Effects 0.000 description 2
- 239000000618 nitrogen fertilizer Substances 0.000 description 2
- 210000004940 nucleus Anatomy 0.000 description 2
- 238000001543 one-way ANOVA Methods 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 210000003463 organelle Anatomy 0.000 description 2
- 101150113864 pat gene Proteins 0.000 description 2
- 235000020232 peanut Nutrition 0.000 description 2
- LWTDZKXXJRRKDG-UHFFFAOYSA-N phaseollin Natural products C1OC2=CC(O)=CC=C2C2C1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-UHFFFAOYSA-N 0.000 description 2
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 2
- 238000003752 polymerase chain reaction Methods 0.000 description 2
- 229960002429 proline Drugs 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010020432 prolyl-prolylisoleucine Proteins 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 235000021251 pulses Nutrition 0.000 description 2
- 235000015136 pumpkin Nutrition 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- JQXXHWHPUNPDRT-WLSIYKJHSA-N rifampicin Chemical compound O([C@](C1=O)(C)O/C=C/[C@@H]([C@H]([C@@H](OC(C)=O)[C@H](C)[C@H](O)[C@H](C)[C@@H](O)[C@@H](C)\C=C\C=C(C)/C(=O)NC=2C(O)=C3C([O-])=C4C)C)OC)C4=C1C3=C(O)C=2\C=N\N1CC[NH+](C)CC1 JQXXHWHPUNPDRT-WLSIYKJHSA-N 0.000 description 2
- 229960001225 rifampicin Drugs 0.000 description 2
- YGSDEFSMJLZEOE-UHFFFAOYSA-N salicylic acid Chemical compound OC(=O)C1=CC=CC=C1O YGSDEFSMJLZEOE-UHFFFAOYSA-N 0.000 description 2
- 230000008117 seed development Effects 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 2
- 230000001568 sexual effect Effects 0.000 description 2
- 230000035939 shock Effects 0.000 description 2
- 230000003584 silencer Effects 0.000 description 2
- SQGYOTSLMSWVJD-UHFFFAOYSA-N silver(1+) nitrate Chemical compound [Ag+].[O-]N(=O)=O SQGYOTSLMSWVJD-UHFFFAOYSA-N 0.000 description 2
- 230000003595 spectral effect Effects 0.000 description 2
- 229940063673 spermidine Drugs 0.000 description 2
- 239000011550 stock solution Substances 0.000 description 2
- DPJRMOMPQZCRJU-UHFFFAOYSA-M thiamine hydrochloride Chemical compound Cl.[Cl-].CC1=C(CCO)SC=[N+]1CC1=CN=C(C)N=C1N DPJRMOMPQZCRJU-UHFFFAOYSA-M 0.000 description 2
- 229960000344 thiamine hydrochloride Drugs 0.000 description 2
- 235000019190 thiamine hydrochloride Nutrition 0.000 description 2
- 239000011747 thiamine hydrochloride Substances 0.000 description 2
- UZKQTCBAMSWPJD-UQCOIBPSSA-N trans-Zeatin Natural products OCC(/C)=C\CNC1=NC=NC2=C1N=CN2 UZKQTCBAMSWPJD-UQCOIBPSSA-N 0.000 description 2
- UZKQTCBAMSWPJD-FARCUNLSSA-N trans-zeatin Chemical compound OCC(/C)=C/CNC1=NC=NC2=C1N=CN2 UZKQTCBAMSWPJD-FARCUNLSSA-N 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 238000010361 transduction Methods 0.000 description 2
- 230000026683 transduction Effects 0.000 description 2
- 230000001052 transient effect Effects 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- 230000002792 vascular Effects 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- 108010027345 wheylin-1 peptide Proteins 0.000 description 2
- 229940023877 zeatin Drugs 0.000 description 2
- MJYQFWSXKFLTAY-OVEQLNGDSA-N (2r,3r)-2,3-bis[(4-hydroxy-3-methoxyphenyl)methyl]butane-1,4-diol;(2r,3r,4s,5s,6r)-6-(hydroxymethyl)oxane-2,3,4,5-tetrol Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O.C1=C(O)C(OC)=CC(C[C@@H](CO)[C@H](CO)CC=2C=C(OC)C(O)=CC=2)=C1 MJYQFWSXKFLTAY-OVEQLNGDSA-N 0.000 description 1
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- NCMVOABPESMRCP-SHYZEUOFSA-N 2'-deoxycytosine 5'-monophosphate Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)C1 NCMVOABPESMRCP-SHYZEUOFSA-N 0.000 description 1
- LTFMZDNNPPEQNG-KVQBGUIXSA-N 2'-deoxyguanosine 5'-monophosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@H]1C[C@H](O)[C@@H](COP(O)(O)=O)O1 LTFMZDNNPPEQNG-KVQBGUIXSA-N 0.000 description 1
- YEJQWBFDKKTPNO-UHFFFAOYSA-N 2-[[2-[[1-(2-amino-3-methylbutanoyl)pyrrolidine-2-carbonyl]amino]acetyl]amino]-3-methylbutanoic acid Chemical compound CC(C)C(N)C(=O)N1CCCC1C(=O)NCC(=O)NC(C(C)C)C(O)=O YEJQWBFDKKTPNO-UHFFFAOYSA-N 0.000 description 1
- OYCLSQDXZMROJK-UHFFFAOYSA-N 2-bromo-4-[3-(3-bromo-4-hydroxyphenyl)-1,1-dioxo-2,1$l^{6}-benzoxathiol-3-yl]phenol Chemical compound C1=C(Br)C(O)=CC=C1C1(C=2C=C(Br)C(O)=CC=2)C2=CC=CC=C2S(=O)(=O)O1 OYCLSQDXZMROJK-UHFFFAOYSA-N 0.000 description 1
- VADKRMSMGWJZCF-UHFFFAOYSA-N 2-bromophenol Chemical compound OC1=CC=CC=C1Br VADKRMSMGWJZCF-UHFFFAOYSA-N 0.000 description 1
- JLIDBLDQVAYHNE-LXGGSRJLSA-N 2-cis-abscisic acid Chemical compound OC(=O)/C=C(/C)\C=C\C1(O)C(C)=CC(=O)CC1(C)C JLIDBLDQVAYHNE-LXGGSRJLSA-N 0.000 description 1
- 101710140048 2S seed storage protein Proteins 0.000 description 1
- 108020005345 3' Untranslated Regions Proteins 0.000 description 1
- 108020003589 5' Untranslated Regions Proteins 0.000 description 1
- 108010036211 5-HT-moduline Proteins 0.000 description 1
- 230000005730 ADP ribosylation Effects 0.000 description 1
- 101150001232 ALS gene Proteins 0.000 description 1
- 240000004507 Abelmoschus esculentus Species 0.000 description 1
- 241001133760 Acoelorraphe Species 0.000 description 1
- 235000009434 Actinidia chinensis Nutrition 0.000 description 1
- 244000298697 Actinidia deliciosa Species 0.000 description 1
- 235000009436 Actinidia deliciosa Nutrition 0.000 description 1
- 235000001674 Agaricus brunnescens Nutrition 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- ZFXQNADNEBRERM-BJDJZHNGSA-N Ala-Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 ZFXQNADNEBRERM-BJDJZHNGSA-N 0.000 description 1
- SDMAQFGBPOJFOM-GUBZILKMSA-N Ala-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SDMAQFGBPOJFOM-GUBZILKMSA-N 0.000 description 1
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- WJRXVTCKASUIFF-FXQIFTODSA-N Ala-Cys-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WJRXVTCKASUIFF-FXQIFTODSA-N 0.000 description 1
- HFBFSOAKPUZCCO-ZLUOBGJFSA-N Ala-Cys-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HFBFSOAKPUZCCO-ZLUOBGJFSA-N 0.000 description 1
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 1
- CSAHOYQKNHGDHX-ACZMJKKPSA-N Ala-Gln-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CSAHOYQKNHGDHX-ACZMJKKPSA-N 0.000 description 1
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 1
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 1
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- HYIDEIQUCBKIPL-CQDKDKBSSA-N Ala-Phe-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N HYIDEIQUCBKIPL-CQDKDKBSSA-N 0.000 description 1
- XAXHGSOBFPIRFG-LSJOCFKGSA-N Ala-Pro-His Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XAXHGSOBFPIRFG-LSJOCFKGSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- 241000234282 Allium Species 0.000 description 1
- 235000005254 Allium ampeloprasum Nutrition 0.000 description 1
- 240000006108 Allium ampeloprasum Species 0.000 description 1
- 235000002732 Allium cepa var. cepa Nutrition 0.000 description 1
- 240000002234 Allium sativum Species 0.000 description 1
- 244000099147 Ananas comosus Species 0.000 description 1
- 235000007119 Ananas comosus Nutrition 0.000 description 1
- 101710117679 Anthocyanidin 3-O-glucosyltransferase Proteins 0.000 description 1
- 240000007087 Apium graveolens Species 0.000 description 1
- 235000015849 Apium graveolens Dulce Group Nutrition 0.000 description 1
- 235000010591 Appio Nutrition 0.000 description 1
- 108700025100 Arabidopsis RPK1 Proteins 0.000 description 1
- 241000219195 Arabidopsis thaliana Species 0.000 description 1
- 101100306081 Arabidopsis thaliana RPK1 gene Proteins 0.000 description 1
- 235000017060 Arachis glabrata Nutrition 0.000 description 1
- 235000010777 Arachis hypogaea Nutrition 0.000 description 1
- 235000018262 Arachis monticola Nutrition 0.000 description 1
- YYOVLDPHIJAOSY-DCAQKATOSA-N Arg-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N YYOVLDPHIJAOSY-DCAQKATOSA-N 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- GHNDBBVSWOWYII-LPEHRKFASA-N Arg-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GHNDBBVSWOWYII-LPEHRKFASA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- RYRQZJVFDVWURI-SRVKXCTJSA-N Arg-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N RYRQZJVFDVWURI-SRVKXCTJSA-N 0.000 description 1
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 1
- BQBPFMNVOWDLHO-XIRDDKMYSA-N Arg-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N BQBPFMNVOWDLHO-XIRDDKMYSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- HPSVTWMFWCHKFN-GARJFASQSA-N Arg-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O HPSVTWMFWCHKFN-GARJFASQSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- PPPXVIBMLFWNSK-BQBZGAKWSA-N Arg-Gly-Cys Chemical compound C(C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N PPPXVIBMLFWNSK-BQBZGAKWSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- VRZDJJWOFXMFRO-ZFWWWQNUSA-N Arg-Gly-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O VRZDJJWOFXMFRO-ZFWWWQNUSA-N 0.000 description 1
- RKQRHMKFNBYOTN-IHRRRGAJSA-N Arg-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N RKQRHMKFNBYOTN-IHRRRGAJSA-N 0.000 description 1
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 1
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- XKDYWGLNSCNRGW-WDSOQIARSA-N Arg-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCN=C(N)N)CCCCN)C(O)=O)=CNC2=C1 XKDYWGLNSCNRGW-WDSOQIARSA-N 0.000 description 1
- AFNHFVVOJZBIJD-GUBZILKMSA-N Arg-Met-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O AFNHFVVOJZBIJD-GUBZILKMSA-N 0.000 description 1
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- HAJWYALLJIATCX-FXQIFTODSA-N Asn-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N HAJWYALLJIATCX-FXQIFTODSA-N 0.000 description 1
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 1
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 1
- WQSCVMQDZYTFQU-FXQIFTODSA-N Asn-Cys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WQSCVMQDZYTFQU-FXQIFTODSA-N 0.000 description 1
- ZPMNECSEJXXNBE-CIUDSAMLSA-N Asn-Cys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZPMNECSEJXXNBE-CIUDSAMLSA-N 0.000 description 1
- SPIPSJXLZVTXJL-ZLUOBGJFSA-N Asn-Cys-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O SPIPSJXLZVTXJL-ZLUOBGJFSA-N 0.000 description 1
- VJTWLBMESLDOMK-WDSKDSINSA-N Asn-Gln-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VJTWLBMESLDOMK-WDSKDSINSA-N 0.000 description 1
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 1
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- IKLAUGBIDCDFOY-SRVKXCTJSA-N Asn-His-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IKLAUGBIDCDFOY-SRVKXCTJSA-N 0.000 description 1
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 1
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 1
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 1
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- RLHANKIRBONJBK-IHRRRGAJSA-N Asn-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N RLHANKIRBONJBK-IHRRRGAJSA-N 0.000 description 1
- CDGHMJJJHYKMPA-DLOVCJGASA-N Asn-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)N)N CDGHMJJJHYKMPA-DLOVCJGASA-N 0.000 description 1
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 description 1
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 1
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 1
- RDLYUKRPEJERMM-XIRDDKMYSA-N Asn-Trp-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O RDLYUKRPEJERMM-XIRDDKMYSA-N 0.000 description 1
- QUCCLIXMVPIVOB-BZSNNMDCSA-N Asn-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N QUCCLIXMVPIVOB-BZSNNMDCSA-N 0.000 description 1
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 1
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 1
- SLHOOKXYTYAJGQ-XVYDVKMFSA-N Asp-Ala-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 SLHOOKXYTYAJGQ-XVYDVKMFSA-N 0.000 description 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 1
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 1
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 1
- MUWDILPCTSMUHI-ZLUOBGJFSA-N Asp-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)O MUWDILPCTSMUHI-ZLUOBGJFSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- WKGJGVGTEZGFSW-FXQIFTODSA-N Asp-Asn-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O WKGJGVGTEZGFSW-FXQIFTODSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- VHWNKSJHQFZJTH-FXQIFTODSA-N Asp-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N VHWNKSJHQFZJTH-FXQIFTODSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 1
- RATOMFTUDRYMKX-ACZMJKKPSA-N Asp-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N RATOMFTUDRYMKX-ACZMJKKPSA-N 0.000 description 1
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 1
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 1
- CMCIMCAQIULNDJ-CIUDSAMLSA-N Asp-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N CMCIMCAQIULNDJ-CIUDSAMLSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- AITKTFCQOBRJTG-CIUDSAMLSA-N Asp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N AITKTFCQOBRJTG-CIUDSAMLSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- CZECQDPEMSVPDH-MNXVOIDGSA-N Asp-Leu-Val-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CZECQDPEMSVPDH-MNXVOIDGSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- AKKUDRZKFZWPBH-SRVKXCTJSA-N Asp-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N AKKUDRZKFZWPBH-SRVKXCTJSA-N 0.000 description 1
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 1
- FIAKNCXQFFKSSI-ZLUOBGJFSA-N Asp-Ser-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O FIAKNCXQFFKSSI-ZLUOBGJFSA-N 0.000 description 1
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- YODBPLSWNJMZOJ-BPUTZDHNSA-N Asp-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N YODBPLSWNJMZOJ-BPUTZDHNSA-N 0.000 description 1
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- JGLWFWXGOINXEA-YDHLFZDLSA-N Asp-Val-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JGLWFWXGOINXEA-YDHLFZDLSA-N 0.000 description 1
- 235000016068 Berberis vulgaris Nutrition 0.000 description 1
- 241000335053 Beta vulgaris Species 0.000 description 1
- 241000219310 Beta vulgaris subsp. vulgaris Species 0.000 description 1
- 235000018185 Betula X alpestris Nutrition 0.000 description 1
- 235000018212 Betula X uliginosa Nutrition 0.000 description 1
- BTBUEUYNUDRHOZ-UHFFFAOYSA-N Borate Chemical compound [O-]B([O-])[O-] BTBUEUYNUDRHOZ-UHFFFAOYSA-N 0.000 description 1
- 241000167854 Bourreria succulenta Species 0.000 description 1
- 235000003351 Brassica cretica Nutrition 0.000 description 1
- 240000007124 Brassica oleracea Species 0.000 description 1
- 235000003899 Brassica oleracea var acephala Nutrition 0.000 description 1
- 235000011301 Brassica oleracea var capitata Nutrition 0.000 description 1
- 235000004221 Brassica oleracea var gemmifera Nutrition 0.000 description 1
- 235000017647 Brassica oleracea var italica Nutrition 0.000 description 1
- 235000001169 Brassica oleracea var oleracea Nutrition 0.000 description 1
- 244000308368 Brassica oleracea var. gemmifera Species 0.000 description 1
- 235000008744 Brassica perviridis Nutrition 0.000 description 1
- 244000233513 Brassica perviridis Species 0.000 description 1
- 235000000540 Brassica rapa subsp rapa Nutrition 0.000 description 1
- 235000003343 Brassica rupestris Nutrition 0.000 description 1
- 235000004936 Bromus mango Nutrition 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- 102100025238 CD302 antigen Human genes 0.000 description 1
- 241000244203 Caenorhabditis elegans Species 0.000 description 1
- 101100268056 Caenorhabditis elegans zag-1 gene Proteins 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 235000002566 Capsicum Nutrition 0.000 description 1
- 102100027668 Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 1 Human genes 0.000 description 1
- 101710134395 Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 1 Proteins 0.000 description 1
- 108010035563 Chloramphenicol O-acetyltransferase Proteins 0.000 description 1
- KZBUYRJDOAKODT-UHFFFAOYSA-N Chlorine Chemical compound ClCl KZBUYRJDOAKODT-UHFFFAOYSA-N 0.000 description 1
- 108010077544 Chromatin Proteins 0.000 description 1
- 244000241235 Citrullus lanatus Species 0.000 description 1
- 235000012828 Citrullus lanatus var citroides Nutrition 0.000 description 1
- 235000008733 Citrus aurantifolia Nutrition 0.000 description 1
- 235000005979 Citrus limon Nutrition 0.000 description 1
- 244000131522 Citrus pyriformis Species 0.000 description 1
- 241001672694 Citrus reticulata Species 0.000 description 1
- 240000000560 Citrus x paradisi Species 0.000 description 1
- 244000060011 Cocos nucifera Species 0.000 description 1
- 235000013162 Cocos nucifera Nutrition 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- 240000007154 Coffea arabica Species 0.000 description 1
- 101100287598 Colletotrichum orbiculare (strain 104-T / ATCC 96160 / CBS 514.97 / LARS 414 / MAFF 240422) PKAR gene Proteins 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 108020004394 Complementary RNA Proteins 0.000 description 1
- 101710091838 Convicilin Proteins 0.000 description 1
- 235000002787 Coriandrum sativum Nutrition 0.000 description 1
- 244000018436 Coriandrum sativum Species 0.000 description 1
- 235000015510 Cucumis melo subsp melo Nutrition 0.000 description 1
- 235000009847 Cucumis melo var cantalupensis Nutrition 0.000 description 1
- 240000008067 Cucumis sativus Species 0.000 description 1
- 235000010799 Cucumis sativus var sativus Nutrition 0.000 description 1
- 235000009852 Cucurbita pepo Nutrition 0.000 description 1
- 241000219130 Cucurbita pepo subsp. pepo Species 0.000 description 1
- 235000003954 Cucurbita pepo var melopepo Nutrition 0.000 description 1
- 235000017788 Cydonia oblonga Nutrition 0.000 description 1
- 244000019459 Cynara cardunculus Species 0.000 description 1
- 235000019106 Cynara scolymus Nutrition 0.000 description 1
- PRVVCRZLTJNPCS-FXQIFTODSA-N Cys-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N PRVVCRZLTJNPCS-FXQIFTODSA-N 0.000 description 1
- GEEXORWTBTUOHC-FXQIFTODSA-N Cys-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N GEEXORWTBTUOHC-FXQIFTODSA-N 0.000 description 1
- HRJLVSQKBLZHSR-ZLUOBGJFSA-N Cys-Asn-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O HRJLVSQKBLZHSR-ZLUOBGJFSA-N 0.000 description 1
- KIQKJXYVGSYDFS-ZLUOBGJFSA-N Cys-Asn-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KIQKJXYVGSYDFS-ZLUOBGJFSA-N 0.000 description 1
- NDUSUIGBMZCOIL-ZKWXMUAHSA-N Cys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N NDUSUIGBMZCOIL-ZKWXMUAHSA-N 0.000 description 1
- RWGDABDXVXRLLH-ACZMJKKPSA-N Cys-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N RWGDABDXVXRLLH-ACZMJKKPSA-N 0.000 description 1
- WPXPYZPGSGWQSC-DCAQKATOSA-N Cys-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N WPXPYZPGSGWQSC-DCAQKATOSA-N 0.000 description 1
- PJWIPBIMSKJTIE-DCAQKATOSA-N Cys-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N PJWIPBIMSKJTIE-DCAQKATOSA-N 0.000 description 1
- LYSHSHHDBVKJRN-JBDRJPRFSA-N Cys-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N LYSHSHHDBVKJRN-JBDRJPRFSA-N 0.000 description 1
- ABLJDBFJPUWQQB-DCAQKATOSA-N Cys-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N ABLJDBFJPUWQQB-DCAQKATOSA-N 0.000 description 1
- IZUNQDRIAOLWCN-YUMQZZPRSA-N Cys-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N IZUNQDRIAOLWCN-YUMQZZPRSA-N 0.000 description 1
- LWYKPOCGGTYAIH-FXQIFTODSA-N Cys-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N LWYKPOCGGTYAIH-FXQIFTODSA-N 0.000 description 1
- UBHPUQAWSSNQLQ-DCAQKATOSA-N Cys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O UBHPUQAWSSNQLQ-DCAQKATOSA-N 0.000 description 1
- CMYVIUWVYHOLRD-ZLUOBGJFSA-N Cys-Ser-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CMYVIUWVYHOLRD-ZLUOBGJFSA-N 0.000 description 1
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 1
- XWTGTTNUCCEFJI-UBHSHLNASA-N Cys-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N XWTGTTNUCCEFJI-UBHSHLNASA-N 0.000 description 1
- NDNZRWUDUMTITL-FXQIFTODSA-N Cys-Ser-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NDNZRWUDUMTITL-FXQIFTODSA-N 0.000 description 1
- WTXCNOPZMQRTNN-BWBBJGPYSA-N Cys-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)O WTXCNOPZMQRTNN-BWBBJGPYSA-N 0.000 description 1
- KFYPRIGJTICABD-XGEHTFHBSA-N Cys-Thr-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N)O KFYPRIGJTICABD-XGEHTFHBSA-N 0.000 description 1
- DGQJGBDBFVGLGL-ZKWXMUAHSA-N Cys-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DGQJGBDBFVGLGL-ZKWXMUAHSA-N 0.000 description 1
- VIOQRFNAZDMVLO-NRPADANISA-N Cys-Val-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIOQRFNAZDMVLO-NRPADANISA-N 0.000 description 1
- 229930183912 Cytidylic acid Natural products 0.000 description 1
- 108010066133 D-octopine dehydrogenase Proteins 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 230000007067 DNA methylation Effects 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 235000002767 Daucus carota Nutrition 0.000 description 1
- 244000000626 Daucus carota Species 0.000 description 1
- 208000005156 Dehydration Diseases 0.000 description 1
- 102100024746 Dihydrofolate reductase Human genes 0.000 description 1
- 235000011511 Diospyros Nutrition 0.000 description 1
- 244000236655 Diospyros kaki Species 0.000 description 1
- 235000014466 Douglas bleu Nutrition 0.000 description 1
- 241001057636 Dracaena deremensis Species 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 108010093502 E2F Transcription Factors Proteins 0.000 description 1
- 102000001388 E2F Transcription Factors Human genes 0.000 description 1
- 235000001950 Elaeis guineensis Nutrition 0.000 description 1
- 244000127993 Elaeis melanococca Species 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 108010092674 Enkephalins Proteins 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 244000024675 Eruca sativa Species 0.000 description 1
- 235000014755 Eruca sativa Nutrition 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 244000004281 Eucalyptus maculata Species 0.000 description 1
- 238000000729 Fisher's exact test Methods 0.000 description 1
- 240000006927 Foeniculum vulgare Species 0.000 description 1
- 235000004204 Foeniculum vulgare Nutrition 0.000 description 1
- 235000016623 Fragaria vesca Nutrition 0.000 description 1
- 240000009088 Fragaria x ananassa Species 0.000 description 1
- 235000011363 Fragaria x ananassa Nutrition 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 230000005526 G1 to G0 transition Effects 0.000 description 1
- 108010061711 Gliadin Proteins 0.000 description 1
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 1
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 1
- JKPGHIQCHIIRMS-AVGNSLFASA-N Gln-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N JKPGHIQCHIIRMS-AVGNSLFASA-N 0.000 description 1
- PCKOTDPDHIBGRW-CIUDSAMLSA-N Gln-Cys-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N PCKOTDPDHIBGRW-CIUDSAMLSA-N 0.000 description 1
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 1
- XSBGUANSZDGULP-IUCAKERBSA-N Gln-Gly-Lys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O XSBGUANSZDGULP-IUCAKERBSA-N 0.000 description 1
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 1
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 1
- KFHASAPTUOASQN-JYJNAYRXSA-N Gln-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KFHASAPTUOASQN-JYJNAYRXSA-N 0.000 description 1
- DOQUICBEISTQHE-CIUDSAMLSA-N Gln-Pro-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O DOQUICBEISTQHE-CIUDSAMLSA-N 0.000 description 1
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- KVBPDJIFRQUQFY-ACZMJKKPSA-N Glu-Cys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O KVBPDJIFRQUQFY-ACZMJKKPSA-N 0.000 description 1
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 1
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 1
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 1
- CAVMESABQIKFKT-IUCAKERBSA-N Glu-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N CAVMESABQIKFKT-IUCAKERBSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 1
- GMAGZGCAYLQBKF-NHCYSSNCSA-N Glu-Met-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GMAGZGCAYLQBKF-NHCYSSNCSA-N 0.000 description 1
- FQFWFZWOHOEVMZ-IHRRRGAJSA-N Glu-Phe-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FQFWFZWOHOEVMZ-IHRRRGAJSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- DDXZHOHEABQXSE-NKIYYHGXSA-N Glu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O DDXZHOHEABQXSE-NKIYYHGXSA-N 0.000 description 1
- UCZXXMREFIETQW-AVGNSLFASA-N Glu-Tyr-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O UCZXXMREFIETQW-AVGNSLFASA-N 0.000 description 1
- VXEFAWJTFAUDJK-AVGNSLFASA-N Glu-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O VXEFAWJTFAUDJK-AVGNSLFASA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 1
- WJZLEENECIOOSA-WDSKDSINSA-N Gly-Asn-Gln Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)O WJZLEENECIOOSA-WDSKDSINSA-N 0.000 description 1
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- GNBMOZPQUXTCRW-STQMWFEESA-N Gly-Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)CN)C(O)=O)=CNC2=C1 GNBMOZPQUXTCRW-STQMWFEESA-N 0.000 description 1
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 1
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- UEGIPZAXNBYCCP-NKWVEPMBSA-N Gly-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)CN)C(=O)O UEGIPZAXNBYCCP-NKWVEPMBSA-N 0.000 description 1
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 1
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- IDOGEHIWMJMAHT-BYPYZUCNSA-N Gly-Gly-Cys Chemical compound NCC(=O)NCC(=O)N[C@@H](CS)C(O)=O IDOGEHIWMJMAHT-BYPYZUCNSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 1
- LIXWIUAORXJNBH-QWRGUYRKSA-N Gly-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN LIXWIUAORXJNBH-QWRGUYRKSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- IFHJOBKVXBESRE-YUMQZZPRSA-N Gly-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN IFHJOBKVXBESRE-YUMQZZPRSA-N 0.000 description 1
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 1
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 1
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 1
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 1
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- KBBFOULZCHWGJX-KBPBESRZSA-N Gly-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN)O KBBFOULZCHWGJX-KBPBESRZSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- 235000005335 Glycine tabacina Nutrition 0.000 description 1
- 240000003082 Glycine tabacina Species 0.000 description 1
- SQUHHTBVTRBESD-UHFFFAOYSA-N Hexa-Ac-myo-Inositol Natural products CC(=O)OC1C(OC(C)=O)C(OC(C)=O)C(OC(C)=O)C(OC(C)=O)C1OC(C)=O SQUHHTBVTRBESD-UHFFFAOYSA-N 0.000 description 1
- DZMVESFTHXSSPZ-XVYDVKMFSA-N His-Ala-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DZMVESFTHXSSPZ-XVYDVKMFSA-N 0.000 description 1
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 1
- NIKBMHGRNAPJFW-IUCAKERBSA-N His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 NIKBMHGRNAPJFW-IUCAKERBSA-N 0.000 description 1
- WZOGEMJIZBNFBK-CIUDSAMLSA-N His-Asp-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WZOGEMJIZBNFBK-CIUDSAMLSA-N 0.000 description 1
- MVADCDSCFTXCBT-CIUDSAMLSA-N His-Asp-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MVADCDSCFTXCBT-CIUDSAMLSA-N 0.000 description 1
- LIEIYPBMQJLASB-SRVKXCTJSA-N His-Gln-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LIEIYPBMQJLASB-SRVKXCTJSA-N 0.000 description 1
- TVRMJKNELJKNRS-GUBZILKMSA-N His-Glu-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N TVRMJKNELJKNRS-GUBZILKMSA-N 0.000 description 1
- WEIYKCOEVBUJQC-JYJNAYRXSA-N His-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WEIYKCOEVBUJQC-JYJNAYRXSA-N 0.000 description 1
- HAPWZEVRQYGLSG-IUCAKERBSA-N His-Gly-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O HAPWZEVRQYGLSG-IUCAKERBSA-N 0.000 description 1
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 1
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 1
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 1
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 1
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 1
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 1
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 1
- YVCGJPIKRMGNPA-LSJOCFKGSA-N His-Met-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O YVCGJPIKRMGNPA-LSJOCFKGSA-N 0.000 description 1
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 1
- XIGFLVCAVQQGNS-IHRRRGAJSA-N His-Pro-His Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 XIGFLVCAVQQGNS-IHRRRGAJSA-N 0.000 description 1
- CWSZWFILCNSNEX-CIUDSAMLSA-N His-Ser-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CWSZWFILCNSNEX-CIUDSAMLSA-N 0.000 description 1
- IAYPZSHNZQHQNO-KKUMJFAQSA-N His-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N IAYPZSHNZQHQNO-KKUMJFAQSA-N 0.000 description 1
- 108700005087 Homeobox Genes Proteins 0.000 description 1
- 102000009331 Homeodomain Proteins Human genes 0.000 description 1
- 108010048671 Homeodomain Proteins Proteins 0.000 description 1
- 101100273718 Homo sapiens CD302 gene Proteins 0.000 description 1
- 101001109137 Homo sapiens Receptor-interacting serine/threonine-protein kinase 2 Proteins 0.000 description 1
- 101000733257 Homo sapiens Rho guanine nucleotide exchange factor 28 Proteins 0.000 description 1
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 1
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 1
- VQUCKIAECLVLAD-SVSWQMSJSA-N Ile-Cys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VQUCKIAECLVLAD-SVSWQMSJSA-N 0.000 description 1
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 1
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 1
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 1
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 1
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 1
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 1
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 1
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 1
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 1
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 1
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 1
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 1
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 1
- RFMDODRWJZHZCR-BJDJZHNGSA-N Ile-Lys-Cys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(O)=O RFMDODRWJZHZCR-BJDJZHNGSA-N 0.000 description 1
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- XHBYEMIUENPZLY-GMOBBJLQSA-N Ile-Pro-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O XHBYEMIUENPZLY-GMOBBJLQSA-N 0.000 description 1
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 1
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 108010044467 Isoenzymes Proteins 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- 229930182821 L-proline Natural products 0.000 description 1
- 235000003228 Lactuca sativa Nutrition 0.000 description 1
- 240000008415 Lactuca sativa Species 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- 101710094902 Legumin Proteins 0.000 description 1
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- DUBAVOVZNZKEQQ-AVGNSLFASA-N Leu-Arg-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCN=C(N)N DUBAVOVZNZKEQQ-AVGNSLFASA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 1
- RIMMMMYKGIBOSN-DCAQKATOSA-N Leu-Asn-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O RIMMMMYKGIBOSN-DCAQKATOSA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 1
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 1
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 1
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 1
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 1
- WXDRGWBQZIMJDE-ULQDDVLXSA-N Leu-Phe-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O WXDRGWBQZIMJDE-ULQDDVLXSA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- URHJPNHRQMQGOZ-RHYQMDGZSA-N Leu-Thr-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O URHJPNHRQMQGOZ-RHYQMDGZSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- IDGRADDMTTWOQC-WDSOQIARSA-N Leu-Trp-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IDGRADDMTTWOQC-WDSOQIARSA-N 0.000 description 1
- LSLUTXRANSUGFY-XIRDDKMYSA-N Leu-Trp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O LSLUTXRANSUGFY-XIRDDKMYSA-N 0.000 description 1
- HQBOMRTVKVKFMN-WDSOQIARSA-N Leu-Trp-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O HQBOMRTVKVKFMN-WDSOQIARSA-N 0.000 description 1
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 1
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 1
- URLZCHNOLZSCCA-VABKMULXSA-N Leu-enkephalin Chemical class C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)CNC(=O)CNC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=CC=C1 URLZCHNOLZSCCA-VABKMULXSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 241000208682 Liquidambar Species 0.000 description 1
- 235000006552 Liquidambar styraciflua Nutrition 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 1
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 1
- ZAWOJFFMBANLGE-CIUDSAMLSA-N Lys-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N ZAWOJFFMBANLGE-CIUDSAMLSA-N 0.000 description 1
- XTONYTDATVADQH-CIUDSAMLSA-N Lys-Cys-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O XTONYTDATVADQH-CIUDSAMLSA-N 0.000 description 1
- XFBBBRDEQIPGNR-KATARQTJSA-N Lys-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O XFBBBRDEQIPGNR-KATARQTJSA-N 0.000 description 1
- LXNPMPIQDNSMTA-AVGNSLFASA-N Lys-Gln-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 LXNPMPIQDNSMTA-AVGNSLFASA-N 0.000 description 1
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- CBNMHRCLYBJIIZ-XUXIUFHCSA-N Lys-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N CBNMHRCLYBJIIZ-XUXIUFHCSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 1
- UDXSLGLHFUBRRM-OEAJRASXSA-N Lys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCCN)N)O UDXSLGLHFUBRRM-OEAJRASXSA-N 0.000 description 1
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 1
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- WAAZECNCPVGPIV-RHYQMDGZSA-N Lys-Thr-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O WAAZECNCPVGPIV-RHYQMDGZSA-N 0.000 description 1
- KXYLFJIQDIMURW-IHPCNDPISA-N Lys-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCCN)=CNC2=C1 KXYLFJIQDIMURW-IHPCNDPISA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 241000220225 Malus Species 0.000 description 1
- 235000011430 Malus pumila Nutrition 0.000 description 1
- 235000015103 Malus silvestris Nutrition 0.000 description 1
- 229910021380 Manganese Chloride Inorganic materials 0.000 description 1
- GLFNIEUTAYBVOC-UHFFFAOYSA-L Manganese chloride Chemical compound Cl[Mn]Cl GLFNIEUTAYBVOC-UHFFFAOYSA-L 0.000 description 1
- 235000014826 Mangifera indica Nutrition 0.000 description 1
- 240000007228 Mangifera indica Species 0.000 description 1
- 240000003183 Manihot esculenta Species 0.000 description 1
- 235000016735 Manihot esculenta subsp esculenta Nutrition 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 1
- ZMYHJISLFYTQGK-FXQIFTODSA-N Met-Asp-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMYHJISLFYTQGK-FXQIFTODSA-N 0.000 description 1
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 1
- FBQMBZLJHOQAIH-GUBZILKMSA-N Met-Asp-Met Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O FBQMBZLJHOQAIH-GUBZILKMSA-N 0.000 description 1
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 1
- KQBJYJXPZBNEIK-DCAQKATOSA-N Met-Glu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQBJYJXPZBNEIK-DCAQKATOSA-N 0.000 description 1
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 1
- LQMHZERGCQJKAH-STQMWFEESA-N Met-Gly-Phe Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LQMHZERGCQJKAH-STQMWFEESA-N 0.000 description 1
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 1
- HZLSUXCMSIBCRV-RVMXOQNASA-N Met-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N HZLSUXCMSIBCRV-RVMXOQNASA-N 0.000 description 1
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 description 1
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 1
- UFOWQBYMUILSRK-IHRRRGAJSA-N Met-Lys-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 UFOWQBYMUILSRK-IHRRRGAJSA-N 0.000 description 1
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 1
- OBPCXINRFKHSRY-SDDRHHMPSA-N Met-Met-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N OBPCXINRFKHSRY-SDDRHHMPSA-N 0.000 description 1
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 1
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 1
- ZDJICAUBMUKVEJ-CIUDSAMLSA-N Met-Ser-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O ZDJICAUBMUKVEJ-CIUDSAMLSA-N 0.000 description 1
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 1
- WSPQHZOMTFFWGH-XGEHTFHBSA-N Met-Thr-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(O)=O WSPQHZOMTFFWGH-XGEHTFHBSA-N 0.000 description 1
- ZBLSZPYQQRIHQU-RCWTZXSCSA-N Met-Thr-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ZBLSZPYQQRIHQU-RCWTZXSCSA-N 0.000 description 1
- VWFHWJGVLVZVIS-QXEWZRGKSA-N Met-Val-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O VWFHWJGVLVZVIS-QXEWZRGKSA-N 0.000 description 1
- 108060004795 Methyltransferase Proteins 0.000 description 1
- 108020005196 Mitochondrial DNA Proteins 0.000 description 1
- 101100496109 Mus musculus Clec2i gene Proteins 0.000 description 1
- 235000003805 Musa ABB Group Nutrition 0.000 description 1
- 235000018290 Musa x paradisiaca Nutrition 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- PVNIIMVLHYAWGP-UHFFFAOYSA-N Niacin Chemical compound OC(=O)C1=CC=CN=C1 PVNIIMVLHYAWGP-UHFFFAOYSA-N 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 108091005461 Nucleic proteins Proteins 0.000 description 1
- 240000007817 Olea europaea Species 0.000 description 1
- 235000002725 Olea europaea Nutrition 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 238000002944 PCR assay Methods 0.000 description 1
- 235000001591 Pachyrhizus erosus Nutrition 0.000 description 1
- 244000215747 Pachyrhizus erosus Species 0.000 description 1
- 235000018669 Pachyrhizus tuberosus Nutrition 0.000 description 1
- 240000004370 Pastinaca sativa Species 0.000 description 1
- 235000017769 Pastinaca sativa subsp sativa Nutrition 0.000 description 1
- 101710091688 Patatin Proteins 0.000 description 1
- 239000006002 Pepper Substances 0.000 description 1
- 244000025272 Persea americana Species 0.000 description 1
- 235000008673 Persea americana Nutrition 0.000 description 1
- 244000062780 Petroselinum sativum Species 0.000 description 1
- WSXKXSBOJXEZDV-DLOVCJGASA-N Phe-Ala-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@H](C)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 WSXKXSBOJXEZDV-DLOVCJGASA-N 0.000 description 1
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 1
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 1
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 1
- LXVFHIBXOWJTKZ-BZSNNMDCSA-N Phe-Asn-Tyr Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O LXVFHIBXOWJTKZ-BZSNNMDCSA-N 0.000 description 1
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 1
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 1
- OMHMIXFFRPMYHB-SRVKXCTJSA-N Phe-Cys-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OMHMIXFFRPMYHB-SRVKXCTJSA-N 0.000 description 1
- SXJGROGVINAYSH-AVGNSLFASA-N Phe-Gln-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SXJGROGVINAYSH-AVGNSLFASA-N 0.000 description 1
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- PMKIMKUGCSVFSV-CQDKDKBSSA-N Phe-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PMKIMKUGCSVFSV-CQDKDKBSSA-N 0.000 description 1
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 1
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- KPEIBEPEUAZWNS-ULQDDVLXSA-N Phe-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KPEIBEPEUAZWNS-ULQDDVLXSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 1
- FENSZYFJQOFSQR-FIRPJDEBSA-N Phe-Phe-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FENSZYFJQOFSQR-FIRPJDEBSA-N 0.000 description 1
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 1
- DEZCWWXTRAKZKJ-UFYCRDLUSA-N Phe-Phe-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DEZCWWXTRAKZKJ-UFYCRDLUSA-N 0.000 description 1
- GRVMHFCZUIYNKQ-UFYCRDLUSA-N Phe-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GRVMHFCZUIYNKQ-UFYCRDLUSA-N 0.000 description 1
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 1
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- BSTPNLNKHKBONJ-HTUGSXCWSA-N Phe-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O BSTPNLNKHKBONJ-HTUGSXCWSA-N 0.000 description 1
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 1
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 1
- 241000425347 Phyla <beetle> Species 0.000 description 1
- 108010047620 Phytohemagglutinins Proteins 0.000 description 1
- 235000008331 Pinus X rigitaeda Nutrition 0.000 description 1
- 241000018646 Pinus brutia Species 0.000 description 1
- 235000011613 Pinus brutia Nutrition 0.000 description 1
- 241001236219 Pinus echinata Species 0.000 description 1
- 235000005018 Pinus echinata Nutrition 0.000 description 1
- 235000017339 Pinus palustris Nutrition 0.000 description 1
- 241000218621 Pinus radiata Species 0.000 description 1
- 235000008577 Pinus radiata Nutrition 0.000 description 1
- 235000008566 Pinus taeda Nutrition 0.000 description 1
- 241000218679 Pinus taeda Species 0.000 description 1
- 235000016761 Piper aduncum Nutrition 0.000 description 1
- 240000003889 Piper guineense Species 0.000 description 1
- 235000017804 Piper guineense Nutrition 0.000 description 1
- 235000008184 Piper nigrum Nutrition 0.000 description 1
- 108020005089 Plant RNA Proteins 0.000 description 1
- 235000015266 Plantago major Nutrition 0.000 description 1
- 241000219000 Populus Species 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- ALJGSKMBIUEJOB-FXQIFTODSA-N Pro-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 ALJGSKMBIUEJOB-FXQIFTODSA-N 0.000 description 1
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- WWAQEUOYCYMGHB-FXQIFTODSA-N Pro-Asn-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 WWAQEUOYCYMGHB-FXQIFTODSA-N 0.000 description 1
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 1
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 1
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- GBRUQFBAJOKCTF-DCAQKATOSA-N Pro-His-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O GBRUQFBAJOKCTF-DCAQKATOSA-N 0.000 description 1
- DTQIXTOJHKVEOH-DCAQKATOSA-N Pro-His-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CS)C(=O)O DTQIXTOJHKVEOH-DCAQKATOSA-N 0.000 description 1
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 1
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 1
- RYJRPPUATSKNAY-STECZYCISA-N Pro-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@@H]2CCCN2 RYJRPPUATSKNAY-STECZYCISA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 1
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 1
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 1
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 1
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 1
- GXWRTSIVLSQACD-RCWTZXSCSA-N Pro-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1)O GXWRTSIVLSQACD-RCWTZXSCSA-N 0.000 description 1
- DLZBBDSPTJBOOD-BPNCWPANSA-N Pro-Tyr-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O DLZBBDSPTJBOOD-BPNCWPANSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 108010029485 Protein Isoforms Proteins 0.000 description 1
- 102000001708 Protein Isoforms Human genes 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 235000009827 Prunus armeniaca Nutrition 0.000 description 1
- 244000018633 Prunus armeniaca Species 0.000 description 1
- 235000006029 Prunus persica var nucipersica Nutrition 0.000 description 1
- 244000017714 Prunus persica var. nucipersica Species 0.000 description 1
- 240000001416 Pseudotsuga menziesii Species 0.000 description 1
- 235000005386 Pseudotsuga menziesii var menziesii Nutrition 0.000 description 1
- 244000294611 Punica granatum Species 0.000 description 1
- 235000014360 Punica granatum Nutrition 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 235000014443 Pyrus communis Nutrition 0.000 description 1
- 240000001987 Pyrus communis Species 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- 108700005075 Regulator Genes Proteins 0.000 description 1
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 1
- 102100033204 Rho guanine nucleotide exchange factor 28 Human genes 0.000 description 1
- 241000220010 Rhode Species 0.000 description 1
- 108010057163 Ribonuclease III Proteins 0.000 description 1
- 102000003661 Ribonuclease III Human genes 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 235000017848 Rubus fruticosus Nutrition 0.000 description 1
- 240000007651 Rubus glaucus Species 0.000 description 1
- 235000011034 Rubus glaucus Nutrition 0.000 description 1
- 235000009122 Rubus idaeus Nutrition 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 1
- 101100238379 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) MPS1 gene Proteins 0.000 description 1
- 241000209056 Secale Species 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 1
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 1
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 1
- CTRHXXXHUJTTRZ-ZLUOBGJFSA-N Ser-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O CTRHXXXHUJTTRZ-ZLUOBGJFSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- KCFKKAQKRZBWJB-ZLUOBGJFSA-N Ser-Cys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O KCFKKAQKRZBWJB-ZLUOBGJFSA-N 0.000 description 1
- ZHYMUFQVKGJNRM-ZLUOBGJFSA-N Ser-Cys-Asn Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O ZHYMUFQVKGJNRM-ZLUOBGJFSA-N 0.000 description 1
- DSSOYPJWSWFOLK-CIUDSAMLSA-N Ser-Cys-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O DSSOYPJWSWFOLK-CIUDSAMLSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- GWMXFEMMBHOKDX-AVGNSLFASA-N Ser-Gln-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GWMXFEMMBHOKDX-AVGNSLFASA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 1
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 1
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- AMRRYKHCILPAKD-FXQIFTODSA-N Ser-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N AMRRYKHCILPAKD-FXQIFTODSA-N 0.000 description 1
- KJKQUQXDEKMPDK-FXQIFTODSA-N Ser-Met-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O KJKQUQXDEKMPDK-FXQIFTODSA-N 0.000 description 1
- FOOZNBRFRWGBNU-DCAQKATOSA-N Ser-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N FOOZNBRFRWGBNU-DCAQKATOSA-N 0.000 description 1
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 1
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 1
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- WMZVVNLPHFSUPA-BPUTZDHNSA-N Ser-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 WMZVVNLPHFSUPA-BPUTZDHNSA-N 0.000 description 1
- HAUVENOGHPECML-BPUTZDHNSA-N Ser-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 HAUVENOGHPECML-BPUTZDHNSA-N 0.000 description 1
- VEVYMLNYMULSMS-AVGNSLFASA-N Ser-Tyr-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEVYMLNYMULSMS-AVGNSLFASA-N 0.000 description 1
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 1
- 240000003768 Solanum lycopersicum Species 0.000 description 1
- 235000002597 Solanum melongena Nutrition 0.000 description 1
- 244000061458 Solanum melongena Species 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 235000009337 Spinacia oleracea Nutrition 0.000 description 1
- 244000300264 Spinacia oleracea Species 0.000 description 1
- 235000009184 Spondias indica Nutrition 0.000 description 1
- 229910000831 Steel Inorganic materials 0.000 description 1
- 108010043934 Sucrose synthase Proteins 0.000 description 1
- 235000021536 Sugar beet Nutrition 0.000 description 1
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 244000269722 Thea sinensis Species 0.000 description 1
- IRKWVRSEQFTGGV-VEVYYDQMSA-N Thr-Asn-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IRKWVRSEQFTGGV-VEVYYDQMSA-N 0.000 description 1
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 1
- ODSAPYVQSLDRSR-LKXGYXEUSA-N Thr-Cys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O ODSAPYVQSLDRSR-LKXGYXEUSA-N 0.000 description 1
- YAAPRMFURSENOZ-KATARQTJSA-N Thr-Cys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O YAAPRMFURSENOZ-KATARQTJSA-N 0.000 description 1
- DSLHSTIUAPKERR-XGEHTFHBSA-N Thr-Cys-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O DSLHSTIUAPKERR-XGEHTFHBSA-N 0.000 description 1
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 1
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 1
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- ODXKUIGEPAGKKV-KATARQTJSA-N Thr-Leu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O ODXKUIGEPAGKKV-KATARQTJSA-N 0.000 description 1
- XIULAFZYEKSGAJ-IXOXFDKPSA-N Thr-Leu-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XIULAFZYEKSGAJ-IXOXFDKPSA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 1
- CJEHCEOXPLASCK-MEYUZBJRSA-N Thr-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=C(O)C=C1 CJEHCEOXPLASCK-MEYUZBJRSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- 235000011941 Tilia x europaea Nutrition 0.000 description 1
- 240000006909 Tilia x europaea Species 0.000 description 1
- 241000566916 Tomentella Species 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 241000219793 Trifolium Species 0.000 description 1
- 235000019714 Triticale Nutrition 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- 239000013504 Triton X-100 Substances 0.000 description 1
- OFNPHOGOJLNVLL-KCTSRDHCSA-N Trp-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N OFNPHOGOJLNVLL-KCTSRDHCSA-N 0.000 description 1
- BIJDDZBDSJLWJY-PJODQICGSA-N Trp-Ala-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O BIJDDZBDSJLWJY-PJODQICGSA-N 0.000 description 1
- RERIQEJUYCLJQI-QRTARXTBSA-N Trp-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RERIQEJUYCLJQI-QRTARXTBSA-N 0.000 description 1
- OBAMASZCXDIXSS-SZMVWBNQSA-N Trp-Glu-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N OBAMASZCXDIXSS-SZMVWBNQSA-N 0.000 description 1
- UJRIVCPPPMYCNA-HOCLYGCPSA-N Trp-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UJRIVCPPPMYCNA-HOCLYGCPSA-N 0.000 description 1
- ZHZLQVLQBDBQCQ-WDSOQIARSA-N Trp-Lys-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N ZHZLQVLQBDBQCQ-WDSOQIARSA-N 0.000 description 1
- FXHOCONKLLUOCF-WDSOQIARSA-N Trp-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N FXHOCONKLLUOCF-WDSOQIARSA-N 0.000 description 1
- UEFHVUQBYNRNQC-SFJXLCSZSA-N Trp-Phe-Thr Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=CC=C1 UEFHVUQBYNRNQC-SFJXLCSZSA-N 0.000 description 1
- IKUMWSDCGQVGHC-UMPQAUOISA-N Trp-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O IKUMWSDCGQVGHC-UMPQAUOISA-N 0.000 description 1
- SUEGAFMNTXXNLR-WFBYXXMGSA-N Trp-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O SUEGAFMNTXXNLR-WFBYXXMGSA-N 0.000 description 1
- GEGYPBOPIGNZIF-CWRNSKLLSA-N Trp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O GEGYPBOPIGNZIF-CWRNSKLLSA-N 0.000 description 1
- 101710162629 Trypsin inhibitor Proteins 0.000 description 1
- 229940122618 Trypsin inhibitor Drugs 0.000 description 1
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 1
- FFCRCJZJARTYCG-KKUMJFAQSA-N Tyr-Cys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O FFCRCJZJARTYCG-KKUMJFAQSA-N 0.000 description 1
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 1
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 1
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 1
- YIKDYZDNRCNFQB-KKUMJFAQSA-N Tyr-His-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O YIKDYZDNRCNFQB-KKUMJFAQSA-N 0.000 description 1
- WPXKRJVHBXYLDT-JUKXBJQTSA-N Tyr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N WPXKRJVHBXYLDT-JUKXBJQTSA-N 0.000 description 1
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 1
- JHORGUYURUBVOM-KKUMJFAQSA-N Tyr-His-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O JHORGUYURUBVOM-KKUMJFAQSA-N 0.000 description 1
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 1
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 1
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 1
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 1
- YYLHVUCSTXXKBS-IHRRRGAJSA-N Tyr-Pro-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YYLHVUCSTXXKBS-IHRRRGAJSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 1
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 1
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 1
- 235000003095 Vaccinium corymbosum Nutrition 0.000 description 1
- 240000001717 Vaccinium macrocarpon Species 0.000 description 1
- 235000012545 Vaccinium macrocarpon Nutrition 0.000 description 1
- 235000017537 Vaccinium myrtillus Nutrition 0.000 description 1
- 235000002118 Vaccinium oxycoccus Nutrition 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- JOQSQZFKFYJKKJ-GUBZILKMSA-N Val-Arg-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N JOQSQZFKFYJKKJ-GUBZILKMSA-N 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 1
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 1
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- PFMAFMPJJSHNDW-ZKWXMUAHSA-N Val-Cys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N PFMAFMPJJSHNDW-ZKWXMUAHSA-N 0.000 description 1
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 1
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 1
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 1
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 1
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 1
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- IOETTZIEIBVWBZ-GUBZILKMSA-N Val-Met-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N IOETTZIEIBVWBZ-GUBZILKMSA-N 0.000 description 1
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- QWCZXKIFPWPQHR-JYJNAYRXSA-N Val-Pro-Tyr Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QWCZXKIFPWPQHR-JYJNAYRXSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 1
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- JPBGMZDTPVGGMQ-ULQDDVLXSA-N Val-Tyr-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JPBGMZDTPVGGMQ-ULQDDVLXSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- 101710196023 Vicilin Proteins 0.000 description 1
- 235000007244 Zea mays Nutrition 0.000 description 1
- 229920002494 Zein Polymers 0.000 description 1
- FJJCIZWZNKZHII-UHFFFAOYSA-N [4,6-bis(cyanoamino)-1,3,5-triazin-2-yl]cyanamide Chemical compound N#CNC1=NC(NC#N)=NC(NC#N)=N1 FJJCIZWZNKZHII-UHFFFAOYSA-N 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 230000032683 aging Effects 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 229910021529 ammonia Inorganic materials 0.000 description 1
- 238000004873 anchoring Methods 0.000 description 1
- 239000004410 anthocyanin Substances 0.000 description 1
- 229930002877 anthocyanin Natural products 0.000 description 1
- 235000010208 anthocyanin Nutrition 0.000 description 1
- 150000004636 anthocyanins Chemical class 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 235000016520 artichoke thistle Nutrition 0.000 description 1
- 235000000183 arugula Nutrition 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 239000012911 assay medium Substances 0.000 description 1
- 238000012093 association test Methods 0.000 description 1
- 101150103518 bar gene Proteins 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 238000003766 bioinformatics method Methods 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 230000009141 biological interaction Effects 0.000 description 1
- QKSKPIVNLNLAAV-UHFFFAOYSA-N bis(2-chloroethyl) sulfide Chemical compound ClCCSCCCl QKSKPIVNLNLAAV-UHFFFAOYSA-N 0.000 description 1
- 235000021029 blackberry Nutrition 0.000 description 1
- 238000004061 bleaching Methods 0.000 description 1
- 239000007844 bleaching agent Substances 0.000 description 1
- 235000021014 blueberries Nutrition 0.000 description 1
- 150000005693 branched-chain amino acids Chemical class 0.000 description 1
- HOZOZZFCZRXYEK-GSWUYBTGSA-M butylscopolamine bromide Chemical compound [Br-].C1([C@@H](CO)C(=O)O[C@H]2C[C@@H]3[N+]([C@H](C2)[C@@H]2[C@H]3O2)(C)CCCC)=CC=CC=C1 HOZOZZFCZRXYEK-GSWUYBTGSA-M 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 239000005018 casein Substances 0.000 description 1
- BECPQYXYKAMYBN-UHFFFAOYSA-N casein, tech. Chemical compound NCCCCC(C(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(CC(C)C)N=C(O)C(CCC(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(C(C)O)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(COP(O)(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(N)CC1=CC=CC=C1 BECPQYXYKAMYBN-UHFFFAOYSA-N 0.000 description 1
- 235000021240 caseins Nutrition 0.000 description 1
- 230000022131 cell cycle Effects 0.000 description 1
- 230000024245 cell differentiation Effects 0.000 description 1
- 230000032823 cell division Effects 0.000 description 1
- 230000002032 cellular defenses Effects 0.000 description 1
- 230000030570 cellular localization Effects 0.000 description 1
- 230000007248 cellular mechanism Effects 0.000 description 1
- 230000036755 cellular response Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 210000002230 centromere Anatomy 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 238000002144 chemical decomposition reaction Methods 0.000 description 1
- 235000019693 cherries Nutrition 0.000 description 1
- 239000000460 chlorine Substances 0.000 description 1
- 229930002875 chlorophyll Natural products 0.000 description 1
- 235000019804 chlorophyll Nutrition 0.000 description 1
- ATNHDLDRLWWWCB-AENOIHSZSA-M chlorophyll a Chemical compound C1([C@@H](C(=O)OC)C(=O)C2=C3C)=C2N2C3=CC(C(CC)=C3C)=[N+]4C3=CC3=C(C=C)C(C)=C5N3[Mg-2]42[N+]2=C1[C@@H](CCC(=O)OC\C=C(/C)CCC[C@H](C)CCC[C@H](C)CCCC(C)C)[C@H](C)C2=C5 ATNHDLDRLWWWCB-AENOIHSZSA-M 0.000 description 1
- 210000003483 chromatin Anatomy 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 230000014107 chromosome localization Effects 0.000 description 1
- 108010069282 cis-prenyl transferase Proteins 0.000 description 1
- 238000010224 classification analysis Methods 0.000 description 1
- 238000003501 co-culture Methods 0.000 description 1
- 235000016213 coffee Nutrition 0.000 description 1
- 235000013353 coffee beverage Nutrition 0.000 description 1
- 238000010835 comparative analysis Methods 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000000205 computational method Methods 0.000 description 1
- 108091036078 conserved sequence Proteins 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 235000004634 cranberry Nutrition 0.000 description 1
- 230000001351 cycling effect Effects 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- IERHLVCPSMICTF-XVFCMESISA-N cytidine 5'-monophosphate Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(O)=O)O1 IERHLVCPSMICTF-XVFCMESISA-N 0.000 description 1
- IERHLVCPSMICTF-UHFFFAOYSA-N cytidine monophosphate Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(COP(O)(O)=O)O1 IERHLVCPSMICTF-UHFFFAOYSA-N 0.000 description 1
- 239000004062 cytokinin Substances 0.000 description 1
- UQHKFADEQIVWID-UHFFFAOYSA-N cytokinin Natural products C1=NC=2C(NCC=C(CO)C)=NC=NC=2N1C1CC(O)C(CO)O1 UQHKFADEQIVWID-UHFFFAOYSA-N 0.000 description 1
- 108010088245 cytokinin oxidase Proteins 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- GYOZYWVXFNDGLU-XLPZGREQSA-N dTMP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)C1 GYOZYWVXFNDGLU-XLPZGREQSA-N 0.000 description 1
- 238000013481 data capture Methods 0.000 description 1
- 238000013480 data collection Methods 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 101150012655 dcl1 gene Proteins 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- LTFMZDNNPPEQNG-UHFFFAOYSA-N deoxyguanylic acid Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1CC(O)C(COP(O)(O)=O)O1 LTFMZDNNPPEQNG-UHFFFAOYSA-N 0.000 description 1
- FCRACOPGPMPSHN-UHFFFAOYSA-N desoxyabscisic acid Natural products OC(=O)C=C(C)C=CC1C(C)=CC(=O)CC1(C)C FCRACOPGPMPSHN-UHFFFAOYSA-N 0.000 description 1
- 230000001066 destructive effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000010252 digital analysis Methods 0.000 description 1
- 108020001096 dihydrofolate reductase Proteins 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 235000004879 dioscorea Nutrition 0.000 description 1
- 235000021186 dishes Nutrition 0.000 description 1
- NEKNNCABDXGBEN-UHFFFAOYSA-L disodium;4-(4-chloro-2-methylphenoxy)butanoate;4-(2,4-dichlorophenoxy)butanoate Chemical compound [Na+].[Na+].CC1=CC(Cl)=CC=C1OCCCC([O-])=O.[O-]C(=O)CCCOC1=CC=C(Cl)C=C1Cl NEKNNCABDXGBEN-UHFFFAOYSA-L 0.000 description 1
- 231100000673 dose–response relationship Toxicity 0.000 description 1
- 230000002222 downregulating effect Effects 0.000 description 1
- 230000024346 drought recovery Effects 0.000 description 1
- 210000005069 ears Anatomy 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 210000002242 embryoid body Anatomy 0.000 description 1
- CCIVGXIOQKPBKL-UHFFFAOYSA-M ethanesulfonate Chemical compound CCS([O-])(=O)=O CCIVGXIOQKPBKL-UHFFFAOYSA-M 0.000 description 1
- DNJIEGIFACGWOD-UHFFFAOYSA-N ethyl mercaptane Natural products CCS DNJIEGIFACGWOD-UHFFFAOYSA-N 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000012854 evaluation process Methods 0.000 description 1
- 238000013401 experimental design Methods 0.000 description 1
- 235000004426 flaxseed Nutrition 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- 239000012458 free base Substances 0.000 description 1
- 210000000973 gametocyte Anatomy 0.000 description 1
- 230000006251 gamma-carboxylation Effects 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 235000004611 garlic Nutrition 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 239000003349 gelling agent Substances 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 238000012268 genome sequencing Methods 0.000 description 1
- 238000003205 genotyping method Methods 0.000 description 1
- 210000004602 germ cell Anatomy 0.000 description 1
- 239000012869 germination medium Substances 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 description 1
- 102000005396 glutamine synthetase Human genes 0.000 description 1
- 108020002326 glutamine synthetase Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 108010083391 glycinin Proteins 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010081985 glycyl-cystinyl-aspartic acid Proteins 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 1
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- RQFCJASXJCIDSX-UUOKFMHZSA-N guanosine 5'-monophosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H]1O RQFCJASXJCIDSX-UUOKFMHZSA-N 0.000 description 1
- 235000013928 guanylic acid Nutrition 0.000 description 1
- 239000004226 guanylic acid Substances 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 230000033444 hydroxylation Effects 0.000 description 1
- 238000005805 hydroxylation reaction Methods 0.000 description 1
- 108010002685 hygromycin-B kinase Proteins 0.000 description 1
- 208000006278 hypochromic anemia Diseases 0.000 description 1
- 230000001771 impaired effect Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 208000021005 inheritance pattern Diseases 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 229960000367 inositol Drugs 0.000 description 1
- CDAISMWEOUEBRE-GPIVLXJGSA-N inositol Chemical compound O[C@H]1[C@H](O)[C@@H](O)[C@H](O)[C@H](O)[C@@H]1O CDAISMWEOUEBRE-GPIVLXJGSA-N 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 239000010977 jade Substances 0.000 description 1
- ZNJFBWYDHIGLCU-HWKXXFMVSA-N jasmonic acid Chemical compound CC\C=C/C[C@@H]1[C@@H](CC(O)=O)CCC1=O ZNJFBWYDHIGLCU-HWKXXFMVSA-N 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 230000002015 leaf growth Effects 0.000 description 1
- 235000021374 legumes Nutrition 0.000 description 1
- 125000001909 leucine group Chemical group [H]N(*)C(C(*)=O)C([H])([H])C(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 238000007834 ligase chain reaction Methods 0.000 description 1
- 238000007854 ligation-mediated PCR Methods 0.000 description 1
- 239000004571 lime Substances 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 125000003588 lysine group Chemical class [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 239000011565 manganese chloride Substances 0.000 description 1
- 235000002867 manganese chloride Nutrition 0.000 description 1
- 230000000873 masking effect Effects 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000010534 mechanism of action Effects 0.000 description 1
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 210000003470 mitochondria Anatomy 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 235000010460 mustard Nutrition 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- PUPNJSIFIXXJCH-UHFFFAOYSA-N n-(4-hydroxyphenyl)-2-(1,1,3-trioxo-1,2-benzothiazol-2-yl)acetamide Chemical compound C1=CC(O)=CC=C1NC(=O)CN1S(=O)(=O)C2=CC=CC=C2C1=O PUPNJSIFIXXJCH-UHFFFAOYSA-N 0.000 description 1
- 230000017074 necrotic cell death Effects 0.000 description 1
- 229960003512 nicotinic acid Drugs 0.000 description 1
- 235000001968 nicotinic acid Nutrition 0.000 description 1
- 239000011664 nicotinic acid Substances 0.000 description 1
- 230000014075 nitrogen utilization Effects 0.000 description 1
- 108091027963 non-coding RNA Proteins 0.000 description 1
- 102000042567 non-coding RNA Human genes 0.000 description 1
- 238000010899 nucleation Methods 0.000 description 1
- 238000001821 nucleic acid purification Methods 0.000 description 1
- 230000031787 nutrient reservoir activity Effects 0.000 description 1
- 235000021231 nutrient uptake Nutrition 0.000 description 1
- 235000014571 nuts Nutrition 0.000 description 1
- 229940054441 o-phthalaldehyde Drugs 0.000 description 1
- 238000009401 outcrossing Methods 0.000 description 1
- 239000003973 paint Substances 0.000 description 1
- FJKROLUGYXJWQN-UHFFFAOYSA-N papa-hydroxy-benzoic acid Natural products OC(=O)C1=CC=C(O)C=C1 FJKROLUGYXJWQN-UHFFFAOYSA-N 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 235000011197 perejil Nutrition 0.000 description 1
- 239000007793 ph indicator Substances 0.000 description 1
- 238000005191 phase separation Methods 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 238000000053 physical method Methods 0.000 description 1
- 230000001885 phytohemagglutinin Effects 0.000 description 1
- 230000008121 plant development Effects 0.000 description 1
- 239000003375 plant hormone Substances 0.000 description 1
- 210000002706 plastid Anatomy 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 239000004323 potassium nitrate Substances 0.000 description 1
- 238000004382 potting Methods 0.000 description 1
- 238000003825 pressing Methods 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 230000012743 protein tagging Effects 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- ZUFQODAHGAHPFQ-UHFFFAOYSA-N pyridoxine hydrochloride Chemical compound Cl.CC1=NC=C(CO)C(CO)=C1O ZUFQODAHGAHPFQ-UHFFFAOYSA-N 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000000754 repressing effect Effects 0.000 description 1
- 230000003938 response to stress Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 229960004889 salicylic acid Drugs 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- CDAISMWEOUEBRE-UHFFFAOYSA-N scyllo-inosotol Natural products OC1C(O)C(O)C(O)C(O)C1O CDAISMWEOUEBRE-UHFFFAOYSA-N 0.000 description 1
- 230000001932 seasonal effect Effects 0.000 description 1
- 230000005562 seed maturation Effects 0.000 description 1
- 239000006152 selective media Substances 0.000 description 1
- 229910001961 silver nitrate Inorganic materials 0.000 description 1
- 235000017550 sodium carbonate Nutrition 0.000 description 1
- 229910000029 sodium carbonate Inorganic materials 0.000 description 1
- 238000009331 sowing Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 238000011272 standard treatment Methods 0.000 description 1
- 238000000528 statistical test Methods 0.000 description 1
- 239000010959 steel Substances 0.000 description 1
- 230000019635 sulfation Effects 0.000 description 1
- 238000005670 sulfation reaction Methods 0.000 description 1
- 235000011149 sulphuric acid Nutrition 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 235000013616 tea Nutrition 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 238000000759 time-resolved fluorescence anisotropy Methods 0.000 description 1
- 230000008467 tissue growth Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 230000009752 translational inhibition Effects 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- 239000002753 trypsin inhibitor Substances 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 230000009452 underexpressoin Effects 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 238000011179 visual inspection Methods 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
- 230000003442 weekly effect Effects 0.000 description 1
- -1 wounding Substances 0.000 description 1
- 241000228158 x Triticosecale Species 0.000 description 1
- 239000005019 zein Substances 0.000 description 1
- 229940093612 zein Drugs 0.000 description 1
- NWONKYPBYAMBJT-UHFFFAOYSA-L zinc sulfate Chemical compound [Zn+2].[O-]S([O-])(=O)=O NWONKYPBYAMBJT-UHFFFAOYSA-L 0.000 description 1
- 229910000368 zinc sulfate Inorganic materials 0.000 description 1
- 239000011686 zinc sulphate Substances 0.000 description 1
- 235000009529 zinc sulphate Nutrition 0.000 description 1
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8262—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield involving plant development
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8201—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y207/00—Transferases transferring phosphorus-containing groups (2.7)
- C12Y207/11—Protein-serine/threonine kinases (2.7.11)
- C12Y207/11001—Non-specific serine/threonine protein kinase (2.7.11.1), i.e. casein kinase or checkpoint kinase
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Plant Pathology (AREA)
- Medicinal Chemistry (AREA)
- Botany (AREA)
- Gastroenterology & Hepatology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
本发明描述了尤其可用于改变植物的根结构的分离的多核苷酸和多肽及重组DNA构建体、包含这些重组DNA构建体的组分(例如植物或种子)、以及利用这些重组DNA构建体的方法。所述重组DNA构建体包含可操作地连接在植物中有功能的启动子的多核苷酸,其中所述多核苷酸编码可用于改变植物根构造的多肽。
Description
本专利申请要求提交于2007年11月20日的美国临时申请60/989,161和提交于2008年4月1日的美国临时申请60/041,281的优先权,其全文以引用方式并入本文。
发明领域
本发明领域涉及植物育种和基因学,并且具体地讲涉及用于改变植物根构造的重组DNA构建体。
发明背景
在除了非常少的几个之外的所有其他自然生态系统中,水和营养物质的可用性限制了植物生长。它们在大多数农业生态系统中限制产量。植物根部起到重要作用,如水和营养物质摄取、在土壤中固定植物以及在根围建立生物相互作用。因此阐明植物根发育和功能的基因调控是农学和生态学中相当受关注的课题。
根系发源于在胚胎形成期间发育的初生根。初生根产生次生根,次生根继而产生三生根。所有次生、三生、四生以及更进一步分生的根均被称为侧根。包括玉米在内的许多植物也可从连续的地下节位(冠根)或地上节位(支柱根)处产生不定根。有三个主要过程影响根系的总体构造。第一个是在初生根分裂组织中的细胞分裂过程,该过程通过加入新生细胞到根中使得根不定生长。第二个是侧根形成过程,该过程增加根系的探索能力。第三个是根毛形成过程,该过程增加初生根和侧根的总表面(Lopez-Bucio等人,Current Opinion in Plant Biology(2003)6:280-287)。在已经分离出的玉米突变体中仅仅缺少根型的一个亚型。已经鉴定了拟南芥属的根形态基因突变体如SHORTROOT和SCARECROW,它显示初生根和侧根的发育缺陷(J.E.Malamy,Plant,Cell and Environment(2005)28:67-77)。
已经鉴定了许多特异性影响根发育的玉米突变体(Hochholdinger等人,2004,Annals of Botany 93:359-368)。隐性突变体rtcs和rt1不形成或形成较少的冠根和支柱根,然而初生根和侧根不受影响。在隐性突变体des21中,缺失侧生种子根和根毛。隐性突变体rthl-3缺失根毛。突变体lrt1和rum1在侧根开始产生之前受影响,而突变体slr1和slr2的侧根伸长能力受到削弱。决定根系构造的内源响应途径包括激素、细胞循环调节子和调节基因。水分胁迫和营养物质可用性属于决定根系构造的环境响应途径。
提交于2005年2月14日的美国专利申请2005-57473(美国专利公开公布2005/223429A1,公布于2005年10月6日)涉及使用拟南芥属细胞分裂素氧化酶基因来改变植物中的细胞分裂素含量并刺激根生长。
美国专利公开6,344,601(公布于2002年2月5日)涉及在植物细胞中低表达或超表达肌动蛋白抑制蛋白(profilin)以改变植物生长习性,例如减少根系或根毛系统会使花期推迟。
WO2004/US16432(提交于2004年5月21日(WO2004/106531,公布于2004年12月9日)涉及使用超表达顺式-异戊烯基转移酶的方法来操纵生长速率和/或产量和/或构造。
提交于2004年9月30日的美国专利申请2004/489500(美国专利公开公布2005/059154A1,公布于2005年3月13日)涉及使用在植物中超表达转录因子E2F的方法来改变细胞数量、构造和产量。
可利用激活标记来鉴定能影响性状的基因。已经在模型植物拟南芥属中使用该方法(Weigel等人,2000,Plant Physiol.122:1003-1013)。
插入转录增强子元件能够显著激活和/或提高附近内源基因的表达。
发明内容
本发明包括:
在一个实施方案中,分离的多核苷酸,所述分离的多核苷酸包含编码LLRK或LLRK样多肽的核酸序列或所述核酸序列的全长互补序列,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、或19比较时具有至少80%的序列同一性,或在与SEQ ID NO:17进行比较时具有至少85%的序列同一性。
在第二实施方案中,分离的多核苷酸,所述分离的多核苷酸包含编码LLRK或LLRK样多肽的核酸序列或所述核酸序列的全长互补序列,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、或19比较时具有至少85%的序列同一性,或在与SEQ ID NO:17进行比较时具有至少90%的序列同一性。
在第三实施方案中,分离的多核苷酸,所述分离的多核苷酸包含编码LLRK或LLRK样多肽的核酸序列或所述核酸序列的全长互补序列,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15或19比较时具有至少90%的序列同一性,或在与SEQ ID NO:17进行比较时具有至少95%的序列同一性。
在第四实施方案中,分离的多核苷酸,所述分离的多核苷酸包含编码LLRK或LLRK样多肽的核酸序列或所述核酸序列的全长互补序列,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15或19比较时具有至少95%的序列同一性。
在第五实施方案中,分离的多核苷酸,所述分离的多核苷酸包含编码LLRK或LLRK样多肽的核酸序列,其中所述多肽的氨基酸序列包含SEQ ID NO:15、17、19、或22。
在第六实施方案中,分离的多核苷酸,所述分离的多核苷酸包含编码LLRK或LLRK样多肽的核酸序列,其中所述核酸序列包含SEQ IDNO:14、16、18、或20。
在另一个实施方案中,包含任何前述多核苷酸的载体和重组构建体,以及包含所述重组构建体的细胞。
在另一个实施方案中,用任一前述多核苷酸来转化细胞的方法,以及用于生产和再生包含任一前述多核苷酸的转化植物的方法。
在另一个实施方案中,在基因组中包含重组DNA构建体的植物,该重组DNA构建体包含可操作地连接至少一种调控元件的多核苷酸,其中所述多核苷酸编码多肽,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、17、19、22、或36进行比较时具有至少50%的序列同一性,并且其中所述植物在与未包含所述重组DNA构建体的对照植物进行比较时表现出改变的根构造。
在另一个实施方案中,在基因组中包含重组DNA构建体的植物,该重组DNA构建体包含:
(a)可操作地连接至少一种调控元件的多核苷酸,其中所述多核苷酸编码多肽,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQID NO:15、17、19、22、或36进行比较时具有至少50%的序列同一性,或
(b)抑制DNA构建体,所述抑制DNA构建体包含至少一种调控元件,所述调控元件可操作地连接至:(i)以下序列的全部或部分:(A)编码多肽的核酸序列,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、17、19、22、或36进行比较时具有至少50%的序列同一性,或(B)所述(b)(i)(A)的核酸序列的全长互补序列;或(ii)源自所关注的靶基因的有义链或反义链的全部或部分的区域,当与所述区域所来源的有义链或反义链的全部或部分比较时,基于ClustalV比对方法,所述区域的核酸序列具有至少50%的序列同一性,并且其中所述所关注的靶基因编码LLRK或LLRK样多肽,并且其中在与未包含所述重组构建体的对照植物比较时,所述植物表现出至少一种农学特性的改变。
在另一个实施方案中,改变植物根构造的方法,该方法包括(a)将重组DNA构建体引入到可再生的植物细胞中,该重组DNA构建体包含可操作地连接至少一种调控序列的多核苷酸,其中该多核苷酸编码多肽,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、17、19、22、或36进行比较时具有至少50%的序列同一性;以及(b)在步骤(a)之后从该可再生的植物细胞再生出转基因植物,其中该转基因植物在其基因组中包含该重组DNA构建体并且在与未包含该DNA构建体的对照植物比较时表现出改变的根构造;并且任选地,(c)获得源自该转基因植物的子代植物,其中所述子代植物在其基因组中包含该重组DNA构建体并且在与未包含该DNA构建体的对照植物比较时表现出改变的根构造。
在另一个实施方案中,评价植物根构造的方法,该方法包括(a)将重组DNA构建体引入到可再生的植物细胞中,该重组DNA构建体包含可操作地连接至少一种调控序列的多核苷酸,其中该多核苷酸编码多肽,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、17、19、22、或36进行比较时具有至少50%的序列同一性,(b)在步骤(a)之后从该可再生的植物细胞再生出转基因植物,其中该转基因植物在其基因组中包含该重组DNA构建体;以及(c)评价与未包含该重组DNA构建体的对照植物比较时该转基因植物的根构造;以及任选地,(d)获得源自该转基因植物的子代植物,其中该子代植物在其基因组中包含该重组DNA构建体;以及任选地,(e)评价与未包含该重组DNA构建体的对照植物比较时该子代植物的根构造。
在另一个实施方案中,评价植物根构造的方法,该方法包括(a)将重组DNA构建体引入到可再生的植物细胞中,该重组DNA构建体包含可操作地连接至少一种调控序列的多核苷酸,其中该多核苷酸编码多肽,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、17、19、22、或36进行比较时具有至少50%的序列同一性,(b)在步骤(a)之后从该可再生的植物细胞再生出转基因植物,其中该转基因植物在其基因组中包含该重组DNA构建体;(c)获得源自该转基因植物的子代植物,其中该子代植物在其基因组中包含该重组DNA构建体;以及(d)评价与未包含该重组DNA构建体的对照植物比较时该子代植物的根构造。
在另一个实施方案中,测定植物农学特性改变的方法,该方法包括(a)将重组DNA构建体引入到可再生的植物细胞中,该重组DNA构建体包含可操作地连接至少一种调控序列的多核苷酸,其中该多核苷酸编码多肽,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ IDNO:15、17、19、22、或36进行比较时具有至少50%的序列同一性,(b)在步骤(a)之后从该可再生的植物细胞再生出转基因植物,其中该转基因植物在其基因组中包含该重组DNA构建体;以及(c)测定该转基因植物在与未包含该重组DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变;以及任选地,(d)获得源自该转基因植物的子代植物,其中该子代植物在其基因组中包含该重组DNA构建体;以及任选地,(e)测定该子代植物在与未包含该重组DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变。
在另一个实施方案中,测定植物农学特性改变的方法,该方法包括(a)将重组DNA构建体引入到可再生的植物细胞中,该重组DNA构建体包含可操作地连接至少一种调控序列的多核苷酸,其中该多核苷酸编码多肽,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ IDNO:15、17、19、22、或36进行比较时具有至少50%的序列同一性,(b)在步骤(a)之后从该可再生的植物细胞再生出转基因植物,其中该转基因植物在其基因组中包含该重组DNA构建体;(c)获得源自该转基因植物的子代植物,其中该子代植物在其基因组中包含该重组DNA构建体;以及(d)测定该子代植物在与未包含该重组DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变。
在另一个实施方案中,测定植物农学特征改变的方法,该方法包括:
(a)将抑制DNA构建体引入到可再生的植物细胞中,所述抑制DNA构建体包含至少一种调控元件,所述调控元件可操作地连接至:
(i)以下序列的全部或部分:(A)编码多肽的核酸序列,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、17、19、22、或36进行比较时具有至少50%的序列同一性,或(B)所述(b)(i)(A)的核酸序列的全长互补序列;或
(ii)源自所关注的靶基因的有义链或反义链的全部或部分的区域,当与所述区域所来源的有义链或反义链的全部或部分比较时,基于Clustal V比对方法,所述区域的核酸序列具有至少50%的序列同一性,并且其中所述所关注的靶基因编码LLRK或LLRK样多肽;
(b)在步骤(a)之后,从所述可再生的植物细胞再生出转基因植物,其中所述转基因植物在其基因组中包含所述抑制DNA构建体;以及
(c)测定该转基因植物在与未包含该抑制DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变;
以及(d)获得源自该转基因植物的子代植物,其中该子代植物在其基因组中包含该抑制DNA构建体;以及任选地,(e)测定该子代植物在与未包含该抑制DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变。
在另一个实施方案中,测定植物农学特征改变的方法,该方法包括:
(a)将抑制DNA构建体引入到可再生的植物细胞中,所述抑制DNA构建体包含至少一种调控元件,所述调控元件可操作地连接至:
(i)以下序列的全部或部分:(A)编码多肽的核酸序列,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、17、19、22、或36进行比较时具有至少50%的序列同一性,或(B)所述(b)(i)(A)的核酸序列的全长互补序列;或
(ii)源自所关注的靶基因的有义链或反义链的全部或部分的区域,当与所述区域所来源的有义链或反义链的全部或部分比较时,基于Clustal V比对方法,所述区域的核酸序列具有至少50%的序列同一性,并且其中所述所关注的靶基因编码LLRK或LLRK样多肽;
(b)在步骤(a)之后,从所述可再生的植物细胞再生出转基因植物,其中所述转基因植物在其基因组中包含所述抑制DNA构建体,并且当与未包含所述抑制DNA构建体的对照植物比较时表现出改变的根构造;
(c)获得源自所述转基因植物的子代植物,其中所述子代植物在其基因组中包含所述抑制DNA构建体;以及
(d)测定所述子代植物在与未包含所述抑制DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变。
在另一个实施方案中,改变植物根构造的方法,该方法包括:
(a)将抑制DNA构建体引入到可再生的植物细胞中,所述抑制DNA构建体包含至少一种调控元件,所述调控元件可操作地连接至:
(i)以下序列的全部或部分:(A)编码多肽的核酸序列,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、17、19、22、或36进行比较时具有至少50%的序列同一性;或(B)所述(b)(i)(A)的核酸序列的全长互补序列;或
(ii)源自所关注的靶基因的有义链或反义链的全部或部分的区域,当与所述区域所来源的有义链或反义链的全部或部分比较时,基于Clustal V比对方法,所述区域的核酸序列具有至少50%的序列同一性,并且其中所述所关注的靶基因编码LLRK或LLRK样多肽;以及
(b)在步骤(a)之后从该可再生的植物细胞再生出转基因植物,其中该转基因植物在其基因组中包含该抑制DNA构建体,并且其中该转基因植物在与未包含该抑制DNA构建体的对照植物比较时表现出改变的根构造;以及
任选地,(c)获得源自该转基因植物的子代植物,其中所述子代植物在其基因组中包含该重组DNA构建体,并且其中该子代植物在与未包含该抑制DNA构建体的对照植物比较时表现出改变的农学特性。
在另一个实施方案中,评价植物根构造的方法,该方法包括:
(a)将抑制DNA构建体引入到可再生的植物细胞中,所述抑制DNA构建体包含至少一种调控元件,所述调控元件可操作地连接至:
(i)以下序列的全部或部分:(A)编码多肽的核酸序列,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、17、19、22、或36进行比较时具有至少50%的序列同一性,或(B)所述(b)(i)(A)的核酸序列的全长互补序列;或
(ii)源自所关注的靶基因的有义链或反义链的全部或部分的区域,当与所述区域所来源的有义链或反义链的全部或部分比较时,基于Clustal V比对方法,所述区域的核酸序列具有至少50%的序列同一性,并且其中所述所关注的靶基因编码LLRK或LLRK样多肽;
(b)在步骤(a)之后,从所述可再生的植物细胞再生出转基因植物,其中所述转基因植物在其基因组中包含所述抑制DNA构建体;以及
(c)评价与未包含该抑制DNA构建体的对照植物比较时该转基因植物的根构造;
以及(d)获得源自该转基因植物的子代植物,其中该子代植物在其基因组中包含该抑制DNA构建体;以及任选地,(e)评价与未包含该抑制DNA构建体的对照植物比较时该子代植物的根构造。
在另一个实施方案中,评价植物根构造的方法,该方法包括:
(a)将抑制DNA构建体引入到可再生的植物细胞中,所述抑制DNA构建体包含至少一种调控元件,所述调控元件可操作地连接至:
(i)以下序列的全部或部分:(A)编码多肽的核酸序列,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、17、19、22、或36进行比较时具有至少50%的序列同一性,或(B)所述(b)(i)(A)的核酸序列的全长互补序列;或
(ii)源自所关注的靶基因的有义链或反义链的全部或部分的区域,当与所述区域所来源的有义链或反义链的全部或部分比较时,基于Clustal V比对方法,所述区域的核酸序列具有至少50%的序列同一性,并且其中所述所关注的靶基因编码LLRK或LLRK样多肽;
(b)在步骤(a)之后,从所述可再生的植物细胞再生出转基因植物,其中所述转基因植物在其基因组中包含所述抑制DNA构建体;
(c)获得源自所述转基因植物的子代植物,其中所述子代植物在其基因组中包含所述抑制DNA构建体;以及
(d)评价当与未包含所述抑制DNA构建体的对照植物比较时所述子代植物的根构造。
本发明中还包括上述植物的任何子代、上述植物的任何种子以及来自任一上述植物和子代植物的细胞。
生产可作为产品销售的种子的方法,该种子提供改变的根构造,该方法包括任一前述优选的方法,并且还包括从所述子代植物获得种子,其中所述种子在其基因组中包含所述重组DNA构建体。
在另一个实施方案中,本发明也涉及对基因变异进行作图的方法,所述基因变异与控制种子发育期间的胚芽/胚乳大小和/或改变植物中的含油表型有关,所述方法包括:
(a)使两种植物品种杂交;以及
(b)在得自步骤(a)的杂交的子代植物中针对以下序列来评价基因变异:
(i)选自SEQ ID NO:14、16、18、21、或35的核酸序列;或
(ii)编码选自SEQ ID NO:15、17、19、22、或36的多肽的核酸序列,
其中所述评价是使用选自下组方法进行的:RFLP分析、SNP分析、和基于PCR的分析。
在另一个实施方案中,本发明涉及改变种子发育期间的胚芽/胚乳尺寸和/或改变植物中含油表型的分子育种方法,所述方法包括:
(a)使两种植物品种杂交;以及
(b)在得自步骤(a)的杂交的子代植物中针对以下序列来评价基因变异:
(i)选自SEQ ID NO:14、16、18、21、或35的核酸序列;或
(ii)编码选自SEQ ID NO:15、17、19、22、或36的多肽的核酸序列;
其中所述评价是使用选自下组方法进行的:RFLP分析、SNP分析、和基于PCR的分析。
附图以及序列表的说明
根据以下的详细描述和附图以及序列表,可更全面地理解本发明,以下的详细描述和附图以及序列表形成本申请的一部分。
图1示出pHSbarENDs激活标记构建体(SEQ ID NO:1)的图谱,该构建体用于制备拟南芥属种群。
图2示出载体pDONRTM/Zeo(SEQ ID NO:2)的图谱。attP1位点位于核苷酸570-801;attP2位点位于核苷酸2754-2985(互补链)。
图3示出载体pDONRTM221(SEQ ID NO:3)的图谱。attP1位点位于核苷酸570-801;attP2位点位于核苷酸2754-2985(互补链)。
图4示出载体pBC-yelloW(SEQ ID NO:4)的图谱,该载体是用于构建拟南芥属表达载体的目的载体。attR1位点位于核苷酸11276-11399(互补链);attR2位点位于核苷酸9695-9819(互补链)。
图5示出PHP27840(SEQ ID NO:5)的图谱,该载体是用于构建大豆表达载体的目的载体。attR1位点位于核苷酸7310-7434;attR2位点位于核苷酸8890-9014。
图6示出PHP23236(SEQ ID NO:6)的图谱,该载体是用于构建Gaspe Bay Flint衍生的玉米品系的表达载体的目的载体。attR1位点位于核苷酸2006-2130;attR2位点位于核苷酸2899-3023。
图7示出PHP10523(SEQ ID NO:7)的图谱,它是存在于农杆菌菌株LBA4404中的质粒DNA。
图8示出PHP23235(SEQ ID NO:8)的图谱,它是用于构建目的载体PHP23236的载体。
图9示出了入门克隆PHP20234(SEQ ID NO:9)的图谱,它是转运PINII终止子的载体。attR2位点位于核苷酸591-747;attL3位点位于核苷酸1100-1195。
图10示出PHP28529(SEQ ID NO:10)的图谱,该载体是用于构建玉米品系表达载体的目的载体。attR3位点位于核苷酸3613-3737;attR4位点位于核苷酸2035-2159。
图11示出了入门克隆PHP28408(SEQ ID NO:11)的图谱,它是转运组成型玉米GOS2启动子的载体。attL4位点位于核苷酸160-255;attR1位点位于核苷酸2301-2447。
图12示出了入门克隆PHP22020(SEQ ID NO:12)的图谱,它是转运玉米根NAS2启动子的载体。attR1位点位于核苷酸31-187;attL4位点位于核苷酸2578-2673。
图13示出PHP29635(SEQ ID NO:13)的图谱,该载体是用于构建Gaspe Flint衍生的玉米品系的表达载体的目的载体。attR1位点位于核苷酸40786-40910;attR2位点位于核苷酸41679-41803。
图14示出PIIOXS2a-FRT87(ni)m(SEQ ID NO:29)的图谱,该载体用于构建目的载体PHP29635。
图15A至15D示出以下全长氨基酸序列的多重比对:SEQ ID NO:15、17、19、22和36,以及SEQ ID NO:23和24。完全匹配共有序列的残基显示为暗色。将共有序列显示于每个比对上部。共有残基通过直接取多数来测定。
图16示出图15A至15D中示出的NDK同源物的每对氨基酸序列的序列同一性百分比和趋异值图表。
图17是实施例17中用于半水耕玉米生长的培养基。
图18是列出实施例17中与不同硝酸盐浓度对Gaspe Bay Flint衍生的玉米系生长和发育的影响相关的数据的图表。
序列描述以及所附序列表遵循如37 C.F.R.§1.821-1.825所列出的关于专利申请中核苷酸和/或氨基酸序列公开的规定。
序列表包含用于核苷酸序列字符的单字母码和用于氨基酸的三字母码,如遵照IUPAC-IUBMB标准所定义的,该标准在Nucleic Acids Res.13:3021-3030(1985)以及在Biochemical J.219(2):345-373(1984)中有所描述,这两篇文献以引用的方式并入本文。用于核苷酸和氨基酸序列数据的符号和格式遵循在37 C.F.R.§1.822中所列出的规定。
SEQ ID NO:1,pHSbarENDs
SEQ ID NO:2,pDONRTM/Zeo
SEQ ID NO:3,pDONRTM221
SEQ ID NO:4,pBC-yellow
SEQ ID NO:5,PHP27840
SEQ ID NO:6,PHP23236
SEQ ID NO:7,PHP10523
SEQ ID NO:8,PHP23235
SEQ ID NO:9,PHP20234
SEQ ID NO:10,PHP28529
SEQ ID NO:11,PHP28408
SEQ ID NO:12,PHP22020
SEQ ID NO:13,PHP29635
表1和SEQ ID NO:36列出了本文所述的多肽、包含编码多肽全部或其主要部分的核酸片段的cDNA克隆的命名、以及在所附序列表中使用的对应标识符(SEQ ID NO:)。
表1
富含亮氨酸重复序列激酶(LLRK)
蛋白质 | 克隆命名 | SEQ ID NO:(核苷酸) | SEQ ID NO:(氨基酸) |
LLRK | p0128.cpiae89r:fis | 14 | 15 |
LLRK | cfp1n.pk018.k5b:fis | 16 | 17 |
LLRK | 重叠群:my.cco1n.pk077.a19cfp2n.pk056.k23a.fcfp2n.pk056.k23b | 18 | 19 |
SEQ ID NO:20对应于拟南芥富含亮氨酸重复序列激酶(LLRK)(AT1G69270)位点。
SEQ ID NO:21对应于由SEQ ID NO:20的核苷酸204-1826(终止)编码的拟南芥LLRK的编码序列。
SEQ ID NO:22对应于由SEQ ID NO:21编码的氨基酸序列,并且是NCBI GI NO:6730642的较短版本。
SEQ ID NO:23对应于NCBI GI NO:115455429(稻)
SEQ ID NO:24对应于NCBI GI No:125600990(稻)
SEQ ID NO:25是attB1序列。
SEQ ID NO:26是attB2序列。
SEQ ID NO:27是实施例9中的正向引物VC062。
SEQ ID NO:28是实施例9中的反向引物VC063。
SEQ ID NO:29PIIOXS2a-FRT87(ni)m。
SEQ ID NO:30是玉米NAS2启动子。
SEQ ID NO:31是GOS2启动子。
SEQ ID NO:32是泛素启动子。
SEQ ID NO:33是S2A启动子。
SEQ ID NO:34是PINII终止子。
SEQ ID NO:35是玉米克隆cfp5n.pk002.f22:fis的DNA序列,它是拟南芥属LLRK的同源物。
SEQ ID NO:36是由SEQ ID NO:35的核苷酸371至1909(终止)编码的氨基酸序列。
SEQ ID NO:37对应于NCBI GI No:115438737(稻)。
优选实施方案的详细描述
本文中所列出的每篇参考文献的公开内容的全文以引用的方式并入本文。
如本文所用的并在所附的权利要求书中的单数形式“一个”和“所述”包括复数涵义,除非上下文中清楚地另有指明。因此,例如,“一株植物”的涵义包括多株该类植物。“一个细胞”的涵义包括一个或多个细胞及其本领域的技术人员已知的等同物,等等。
术语“根构造”指构成根的不同部分的布置方式。术语“根构造”、“根结构”、“根系”或“根系构造”在这里可互换使用。
一般来讲,植物由胚发育成的第一种根称为初生根。在大多数双子叶植物中,初生根被称为主根。这种主根向下生长并产生分枝根(侧根)。在单子叶植物中,植物的初生根发生分枝,生成须根系。
术语“改变的根构造”指与参照植株或对照植株比较,在其不同发育阶段构成根系的不同部分的改变状况。应当理解,改变的根构造涵盖了一种或多种可测量参数(包括但不限于一个或多个根系部分的直径、长度、数目、角度或表面)的改变,所述根系部分包括但不限于初生根、侧根或分枝根、不定根和根毛,所有这些均在本发明的范围内。这些改变可导致根所占的面积或空间的整体改变。参照植株或对照植株在其基因组中不含重组DNA构建体或异源构建体。
“表达序列标签”(“EST”)是得自cDNA文库的DNA序列,并且因此是已经被转录的序列。EST通常通过cDNA插入序列单程测序获取。将完整的cDNA插入序列称为“全长插入序列”(“FIS”)。“重叠群”序列是由选自,但不限于EST、FIS和PCR序列的两个或更多个序列装配成的序列。将编码完整或功能性蛋白的序列称为“完全基因序列”(“CGS”),该序列能得自FIS或重叠群。
术语“V”阶段是指玉米植物的叶片生长阶段;例如V4=四片、V5=五片具有可见叶颈的叶片。叶颈是浅色领状“带”,位于暴露的叶片底部,靠近叶片接触植物茎部的区域。进行叶片计数,开始时计数最底下的、短的、圆顶真叶,最后计数具有可见叶颈的最上面的叶片。
“农学特性”是可测量的参数,包括但不限于绿度、产量、生长速率、生物量、成熟时的鲜重、成熟时的干重、果实产量、种子产量、总植物含氮量、果实含氮量、种子含氮量、营养组织含氮量、总植物游离氨基酸含量、果实游离氨基酸含量、种子游离氨基酸含量、营养组织游离氨基酸含量、总植物蛋白质含量、果实蛋白质含量、种子蛋白质含量、营养组织蛋白质含量、耐旱性、氮摄取、根倒伏、茎倒伏、植株高度、穗长和收获指数。
“llrk”(富含亮氨酸重复序列激酶)、rpk1、“at-llrk”、“at-rpk1”(拟南芥-受体蛋白激酶1)本文可互换使用,指拟南芥位点AT1G69270(SEQ ID NO:20)。
“LLRK”、“RPK1”(受体蛋白激酶1)或“AT-RPK1”指由AT1G69270(SEQ ID NO:20)编码的蛋白(SEQ ID NO:22)。
“llrk样”指拟南芥“LLRK”位点AT1G69270(SEQ ID NO:20)的来自不同物种的核苷酸同源物,如玉米,并且不受限制地包括任何以下核苷酸序列:SEQ ID NO:14、16、18和35。
“LLRK样”指拟南芥“LLRK”(SEQ ID NO:22)的来自不同物种的蛋白同源物,如玉米,并且不受限制地包括任何以下氨基酸序列:SEQ ID NO:15、17、19和36。
“环境条件”指植物生长的条件,例如水的可用性、营养物质(例如氮)的可用性或者病害的存在。
“转基因”指其基因组因异源核酸(如重组DNA构建体)的存在而发生改变的任何细胞、细胞系、愈伤组织、组织、植物部分或植物,包括那些最初的转基因事件以及从最初的转基因事件通过有性杂交或无性生殖而产生的那些。如本文所用的术语“转基因”不涵盖通过常规植物育种方法或通过诸如随机异花受精、非重组病毒感染、非重组细菌转化、非重组转座或自发突变之类的自然发生事件导致的基因组(染色体基因组或染色体外基因组)改变。
“基因组”在用于植物细胞时不仅涵盖存在于细胞核中的染色体DNA,而且还包括存在于细胞的亚细胞组分(如线粒体、质粒)中的细胞器DNA。
“植物”包括整个植株、植物器官、植物组织、种子和植物细胞以及同一植株的子代。植物细胞包括但不限于得自下列物质的细胞:种子、悬浮培养物、胚、分生区域、愈伤组织、叶、根、芽、配子体、孢子体、花粉和小孢子。
“子代”包括植物的任何后续世代。
“转基因”指其基因组因异源核酸(如重组DNA构建体)的存在而发生改变的任何细胞、细胞系、愈伤组织、组织、植物部分或植物,包括那些最初的转基因事件以及从最初的转基因事件通过有性杂交或无性生殖而产生的那些。如本文所用的术语“转基因”不涵盖通过常规植物育种方法或通过诸如随机异花受精、非重组病毒感染、非重组细菌转化、非重组转座或自发突变之类的自然发生事件导致的基因组(染色体基因组或染色体外基因组)改变。
“转基因植物”包括在其基因组内包含异源多核苷酸的植物。优选的是,异源多核苷酸被稳定地整合进基因组中,使得该多核苷酸传递至连续的世代。异源多核苷酸可单独地或作为重组DNA构建体的部分整合进基因组中。
针对序列而言的“异源”意指来自外来物种的序列,或者如果来自相同物种,则指通过蓄意的人为干预而从其天然形式发生了组成和/或基因座的显著改变的序列。
“多核苷酸”、“核酸序列”、“核苷酸序列”或“核酸片段”可互换使用并且是任选含有合成的、非天然的或改变的核苷酸碱基的单链或双链RNA或DNA聚合物。核苷酸(通常以它们的5′-单磷酸形式存在)通过如下它们的单个字母名称来指代:“A”为腺苷酸或脱氧腺苷酸(分别对应RNA或DNA),“C”表示胞苷酸或脱氧胞苷酸,“G”表示鸟苷酸或脱氧鸟苷酸,“U”表示尿苷酸,“T”表示脱氧胸苷酸,“R”表示嘌呤(A或G),“Y”表示嘧啶(C或T),“K”表示G或T,“H”表示A或C或T,“I”表示肌苷,并且“N”表示任何核苷酸。
“多肽”、“肽”、“氨基酸序列”和“蛋白质”在本文中可互换使用,指氨基酸残基的聚合物。该术语适用于其中一个或多个氨基酸残基是相应的天然存在的氨基酸的人工化学类似物的氨基酸聚合物,以及适用于天然存在的氨基酸聚合物。术语“多肽”、“肽”、“氨基酸序列”和“蛋白质”还可包括修饰,包括但不限于糖基化、脂质连接、硫酸盐化、谷氨酸残基的γ羧化、羟化和ADP-核糖基化。
“信使RNA(mRNA)”指无内含子并且可通过细胞翻译成蛋白质的RNA。
“cDNA”指与mRNA模板互补并且利用逆转录酶从mRNA模板合成的DNA。cDNA可以是单链的或者可用DNA聚合成酶I的Klenow片段转化成双链形式。
“成熟”蛋白质指经翻译后加工的多肽;即已经去除了存在于初级翻译产物中的任何前肽或原肽的多肽。
“前体”蛋白质指mRNA的翻译初级产物;即具有仍然存在的前肽和原肽。前肽和原肽可以是并且不限于细胞内定位信号。
“分离的”指物质,例如核酸和/或蛋白质,该物质基本上不含在天然存在的环境中通常伴随该物质或与其反应的组分,或者说是该物质被从所述组分移出。分离的多核苷酸可从它们天然存在于其中的宿主细胞纯化。技术人员已知的常规核酸纯化方法可用于获得分离的多核苷酸。该术语也涵盖重组多核苷酸和化学合成的多核苷酸。
“重组体”指(例如)通过化学合成或者通过用基因工程技术操纵分离的核酸片段来实现的两个原本分离的序列片段的人工组合。“重组体”也包括指已经通过引入异源核酸而进行了修饰的细胞或载体,或源于经这样修饰的细胞的细胞,但不涵盖由天然发生的事件(如自发突变、自然转化/转导/转座)对细胞或载体的改变,例如没有蓄意人为干扰而发生的那些。
“重组DNA构建体”指在自然界中通常不会一起存在的核酸片段的组合。因此,重组DNA构建体可包含源于不同来源的调控序列和编码序列,或源于相同来源但以不同于通常天然存在的方式排列的调控序列和编码序列。
术语“入门克隆”和“入门载体”本文可互换使用。
“调控序列”指位于编码序列的上游(5′非编码序列)、中间或下游(3′非编码序列),并且影响相关编码序列的转录、RNA加工或稳定性或者翻译的核苷酸序列。调控序列可包括但不限于启动子、翻译前导序列、内含子和多腺苷酸化识别序列。
“启动子”指能够控制另一核酸片段转录的核酸片段。
“在植物中有功能的启动子”指能够控制植物细胞中的转录的启动子,无论其是否来源于植物细胞。
“组织特异性启动子”和“组织优选启动子”可互换使用,并且指主要但非必须专一地在一种组织或器官中表达,而是也可在一种特定细胞中表达的启动子。
“发育调控启动子”指其活性由发育事件决定的启动子。
术语“可操作地连接”指核酸片段联合成单一片段,使得其中一个核酸片段的功能受到另一个核酸片段的调控。例如,在启动子能够调节核酸片段的转录时,该启动子与该核酸片段进行了可操作地连接。
“表达”指功能产物的产生。因此,核酸片段的表达可指核酸片段的转录(如生成mRNA或功能RNA的转录)和/或RNA翻译成前体或成熟蛋白质。
“表型”意指细胞或生物体的可检测的特征。
有关将核酸片段(例如重组DNA构建体)插入细胞内的“导入”意指“转染”或“转化”或“转导”,并且包括指将核酸片段整合进真核或原核细胞中,在该细胞中核酸片段可整合进细胞的基因组(如染色体、质粒、质体或线粒体DNA)内,转变成自主的复制子或瞬时表达(如转染的mRNA)。
“转化细胞”是将核酸片段(如重组DNA构建体)引入其中的任何细胞。
在此所用的“转化”指稳定转化和瞬时转化两者。
“稳定转化”指将核酸片段引入宿主生物体的基因组中,导致基因稳定遗传。一旦稳定转化,核酸片段稳定地整合进宿主生物体和任何连续世代的基因组中。
“瞬时转化”指将核酸片段引入宿主生物体的核中或包含DNA的细胞器中,引起基因表达而没有基因稳定遗传。
“等位基因”是占据染色体上给定位点的基因的几种供选择形式的其中一种。当二倍体植物中一对同源染色体上给定基因座上存在的等位基因相同时,该植物在该基因座处是纯合的。如果二倍体植物中一对同源染色体上给定基因座上存在的等位基因不同,则该植物在该基因座处是杂合的。如果转基因存在于二倍体植物中一对同源染色体中的其中之一上,则该植物在该基因座处是半合子的。
序列比对和同一性百分比可用设计用于检测同源序列的多种比较方法来测定,这些方法包括但不限于生物信息计算包(Inc.,Madison,WI)的程序。除非另外说明,本文提供的序列的多重比对用Clustal V比对方法(Higgins和Sharp,1989,CABIOS.5:151-153)采用默认参数(空位罚分=10,空位长度罚分=10)执行。用Clustal V方法进行成对比对和蛋白质序列的同一性百分比计算的默认参数为KTUPLE=1、缺口罚分=3、窗口(WINDOW)=5和DIAGONALS SAVED=5。而对于核酸,这些参数为KTUPLE=2,空位罚分=5,窗口=4和DIAGONALS SAVED=4。用Clustal V程序比对序列后,可通过查看同一程序中的“序列距离”表来获得“同一性百分比”和“趋异度”值。除非另外说明,本文提供的和申明的同一性百分比和趋异度是以该方式计算。
本文使用的标准重组DNA和分子克隆技术是本领域所熟知的并且在如下文献中有更全面的描述:Sambrook,J.,Fritsch,E.F.和Maniatis,T.,Molecular Cloning:A Laboratory Manual;Cold Spring HarborLaboratory Press:Cold Spring Harbor,1989(下文称为“Sambrook”)。
基因关联作图
基因作图的目的是为了鉴定产生受关注的表型的基因。作图的第一阶段通常是定位染色体的普通区域,该区域与受关注的表型的传送有关。接下来鉴定该基因和最后的特定等位基因是否具有诱发作用。
一种用于基因作图(关联分析)的方法使用具有已知谱系结构的玉米品系。在遍布基因组的随机标记上进行个体基因分型。然后如果谱系内的疾病基因靠近其中一个标记,标记的遗传模式将与受关注的表型的遗传模式相似。关联分析已经是在寻找与受关注的表型相关的简单基因方面高度成功的方法,即,寻找那些其中单个主基因在给定谱系中导致表型、并且环境因素不是非常重要的那些基因。
基因作图(关联或不平衡作图)的第二种方法在种群水平上使用关联。该方法是受关注的表型由特定单体型引起,这样一来遗传受关注表型的个体将也遗传在标记位点附近的相同等位基因。重组和突变使得该方法复杂化。在某种意义上,关联作图与关联分析并无根本的不同,但是它不使用家族谱系,而是使用未知的种群谱系。因为种群谱系比家族谱系要深的多,不平衡作图能进行比关联分析更精细的作图。关联作图方法和分析的综述参见“Association Mapping in Plants,”Oraguzie,N.C.;Rikkerink,E.H.A.;Gardiner,S.E.;Silva,H.N.D.Springer,2007年1月。
现在来看优选的实施方案:
优选的实施方案包括分离的多核苷酸和多肽、重组DNA构建体、包含这些重组DNA构建体的组分(例如植株或种子)以及利用这些重组DNA构建体的方法。
优选的分离的多核苷酸和多肽
本发明包括如下优选的分离的多核苷酸和多肽:
分离的多核苷酸,所述多核苷酸包含:(i)编码多肽的核酸序列,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、17、19、22、或36进行比较时具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、或100%的序列同一性;或(ii)所述(i)的核酸序列的全长互补序列;任一上述分离的多核苷酸可用于本发明的任何重组DNA构建体(包括抑制DNA构建体)。所述多肽优选地是LLRK或LLRK样蛋白。
分离的多肽,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、17、19、22、或36进行比较时具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、或100%的序列同一性。所述多肽优选地是LLRK或LLRK样蛋白。
分离的多核苷酸,该多核苷酸包含(i)核酸序列,所述核酸序列基于Clustal V比对方法在与SEQ ID NO:14、16、18或20进行比较时具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、或100%的序列同一性,或(ii)所述(i)的核酸序列的全长互补序列;任一上述分离的多核苷酸可用于本发明的任何重组DNA构建体(包括抑制DNA构建体)。该分离的多核苷酸编码LLRK或LLRK样蛋白。
优选的重组DNA构建体和抑制DNA构建体
在一个方面,本发明包括重组DNA构建体(包括抑制DNA构建体)。
在一个优选的实施方案中,重组DNA构建体包含可操作地连接至少一种调控序列(如,在植物中有功能的启动子)的多核苷酸,其中该多核苷酸包含(i)编码氨基酸序列的核酸序列,所述氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、17、19、22、或36进行比较时具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、或100%的序列同一性,或(ii)所述(i)的核酸序列的全长互补序列。
在另一个优选的实施方案中,重组DNA构建体包含可操作地连接至少一种调控序列(如,在植物中有功能的启动子)的多核苷酸,其中所述多核苷酸包含(i)核酸序列,所述核酸序列在与SEQ ID NO:14、16、18、或20进行比较时具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、或100%的序列同一性,或(ii)所述(i)的核酸序列的全长互补序列。
图15A至15D示出以下全长氨基酸序列的多重比对:SEQ ID NO:15、17、19、22和36,以及SEQ ID NO:23和24。用生物信息计算包(Inc.,Madison,WI)的程序进行序列多重比对。具体地讲,使用Clustal V比对方法(Higgins和Sharp(1989)CABIOS.5:151-153),多重比对预设参数为空位罚分=10,空位长度罚分=10,成对比对预设参数为KTUPLE=1,空位罚分=3,窗口=5以及DIAGONALS SAVED=5。
图16示出图15A至15D中示出的NDK同源物的每对氨基酸序列的序列同一性百分比和趋异值。
在另一个优选的实施方案中,重组DNA构建体包含可操作地连接至少一种调控序列(如,在植物中有功能的启动子)的多核苷酸,其中所述多核苷酸编码LLRK或LLRK样蛋白。
在另一方面,本发明包括抑制DNA构建体。
抑制DNA构建体优选包含至少一种调控序列(优选在植物中有功能的启动子),该调控序列可操作地连接至:(a)以下序列的全部或部分:(i)编码多肽的核酸序列,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、17、19、22、或36进行比较时具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、或100%的序列同一性,或(ii)所述(a)(i)的核酸序列的全长互补序列;或者(b)源自所关注的靶基因的有义链或反义链的区域,当与所述区域所来源的有义链或反义链的全部或部分比较时,基于Clustal V比对方法,所述区域的核酸序列具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或100%的序列同一性,并且其中所述所关注的靶基因编码LLRK或LLRK样蛋白;或(c)以下序列的全部或部分:(i)核酸序列,所述核酸序列基于Clustal V比对方法在与SEQ ID NO:14、16、18、或20进行比较时具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、或100%的序列同一性,或(ii)所述(c)(i)的核酸序列的全长互补序列。该抑制DNA构建体优选包含共抑制构建体、反义构建体、病毒抑制构建体、发夹抑制构建体、茎环抑制构建体、产生双链RNA的构建体、RNAi构建体或小RNA构建体(如,siRNA构建体或miRNA构建体)。
应当理解(正如本领域技术人员将会理解的),本发明不仅仅涵盖这些具体的示例性序列。导致给定位点处产生化学上等价的氨基酸但不影响所编码多肽的功能特性的核酸片段中的改变是本领域众所周知的。因此,氨基酸丙氨酸(一种疏水性氨基酸)的密码子可被编码另一个疏水性较弱的残基(例如甘氨酸)或疏水性较强的残基(例如缬氨酸、亮氨酸或异亮氨酸)的密码子取代。类似地,导致一个带负电荷的残基替换为另一个带负电荷的残基(例如,天冬氨酸替代谷氨酸)或者一个带正电荷的残基替换为另一个带正电荷的残基(例如,赖氨酸替换精氨酸)的改变也可预期产生功能上等价的产物。导致多肽分子的N-末端和C-末端部分改变的核苷酸变化也将预计不会改变多肽的活性。所提出的修饰中的每一种均完全在本领域常规技术内,如测定所编码的产物的生物活性的保留。
“抑制DNA构建体”是在转化或稳定整合进植物基因组时,导致该植物中的靶基因“沉默”的重组DNA构建体。对该植物来说,该靶基因可以是内源性的或是转基因的。如本文针对靶基因所使用的,“沉默”通常指在由靶基因表达的mRNA或蛋白质/酶的水平上的抑制,和/或在酶活性或蛋白质功能性的水平上的抑制。术语“抑制”包括降低、减少、下调、减弱、抑制、消除或阻止。“沉默”或“基因沉默”不测定机理并且包括(并且不限于)反义、共抑制、病毒抑制、发夹抑制、茎环抑制、基于RNAi的方法以及基于小RNAi的方法。
抑制DNA构建体可包含源自所关注的靶基因的区域并且可包含所关注的靶基因的有义链(或反义链)的核酸序列的全部或部分。取决于所要利用的方法,该区域可与所关注基因的有义链(或反义链)的全部或部分100%相同或者具有少于100%同一性的同一性(如,具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%的同一性)。
抑制DNA构建体是本领域所熟知的,一旦选定所关注的靶基因就很容易构建,并且包括但不限于共抑制构建体、反义构建体、病毒抑制构建体、发夹抑制构建体、茎环抑制构建体、产生双链RNA的构建体,以及更通常的是,RNAi(RNA干扰)构建体和小RNA构建体,例如siRNA(短干扰RNA)构建体和miRNA(微RNA)构建体。
“反义抑制”指产生能够抑制靶蛋白表达的反义RNA转录物。
“反义RNA”指与靶初级转录物或mRNA的全部或部分互补,并阻断分离的靶核酸片段表达的RNA转录物(美国专利号:5,107,065)。反义RNA可与特定基因转录物的任何部分,即5′非编码序列、3′非编码序列、内含子或编码序列互补。
“共抑制”指产生能够抑制靶蛋白表达的有义RNA转录物。“有义”RNA指包括mRNA和在细胞内或体外能被翻译成蛋白质的RNA在内的RNA转录物。此前,已通过着眼于以有义方向过表达与内源mRNA具有同源性的核酸序列(其导致与过表达的序列具有同源性的所有RNA减少)设计出了植物中的共抑制构建体(参见Vaucheret等人,1998,Plant J.,16:651-659;以及Gura,2000 Nature 404:804-808)。
另一种变型描述了将植物病毒序列用于引导对近端mRNA编码序列的抑制(于1998年8月20日公开的PCT专利公开WO 98/36083)。
此前描述的是“发夹”结构的利用,该结构以互补方向整合mRNA编码序列的全部或部分,导致已表达的RNA形成潜在的“茎环”结构(于1999年10月21日公开的PCT专利公开WO99/53050)。在这种情况下,茎由对应相对于启动子以有义或反义方向插入的相关基因的多核苷酸形成,并且环由一些相关基因的多核苷酸形成,在构建体中该多核苷酸不具有互补序列。这增加了获得的转基因植物中的共抑制或沉默频率。关于发夹抑制的综述,参见Wesley,S.V.等人,2003,Methods inMolecular Biology,Plant Functional Genomics:Methods and Protocols236:273-286。
其中茎由至少30个来自待抑制基因的核苷酸形成而环由任何的核苷酸序列形成的构建体也已经有效地用于抑制(于1999年12月2日公开的PCT专利公开WO 99/61632)。
使用聚-T和聚-A序列产生茎环结构中的茎已经有所描述(于2002年1月3日公开的PCT专利公开WO 02/00894)。
然而另一种变型涉及使用合成的重复序列来促进茎环结构中的茎的形成。用这种重组DNA片段产生的转基因生物体已经显示由形成茎环结构的核苷酸片段编码的蛋白质的水平降低,如于2002年1月3日公开的PCT专利公开WO 02/00904中所述。
RNA干扰是指由短干扰性RNA(siRNA)介导的动物中序列特异性转录后基因沉默的过程(Fire等人,Nature 391:806 1998)。在植物中的对应过程通常称为转录后基因沉默(PTGS)或RNA沉默,并且在真菌中也称为阻抑作用(quelling)。据信转录后基因沉默过程是用于防止外来基因表达的进化保守性细胞防御机制,并且通常由不同植物区系和门所共有(Fire等人,Trends Genet.15:358 1999)。这种防止外来基因表达的保护作用可能是通过特异性破坏病毒基因组RNA的同源单链RNA的细胞反应,响应源自病毒感染或源自转座因子随机整合到宿主基因组内的双链RNA(dsRNA)的生成而进化而来。dsRNA在细胞中的存在通过还没有完全表征的机制引发了RNAi反应。
细胞中长dsRNA的存在刺激了称为dicer的核糖核酸酶III的活性。Dicer涉及使dsRNA加工成称为短干扰RNA(siRNA)的短dsRNA片段(Berstein等人,Nature 409:363 2001)。源自dicer活性的短干扰RNA的长度通常是约21至约23个核苷酸,并且包含约19个碱基对的双链体(Elbashir等人,Genes Dev.15:188,2001)。Dicer还涉及从保守结构的前体RNA上切下21个和22个核苷酸的小时序RNA(stRNA),该小时序RNA参与翻译控制(Hutvagner等人,2001,Science 293:834)。RNAi响应还涉及内切核酸酶复合物,通常称为RNA诱导沉默复合物(RISC),其介导具有与siRNA双链体的反义链互补的序列的单链RNA的裂解。靶RNA的裂解在与siRNA双链体的反义链互补的区域中间发生(Elbashir等人,Genes Dev.15:188,2001)。此外,RNA干扰还涉及小RNA(如miRNA)介导的基因沉默,可推定是通过调节染色质结构并由此防止靶基因序列转录的细胞机制(参见(例如)Allshire,Science297:1818-18192002;Volpe等人,Science 297:1833-1837 2002;Jenuwein,Science 297:2215-2218 2002;和Hall等人,Science 297:2232-2237 2002)。这样,本发明的miRNA分子可用于通过与RNA转录物相互作用或者作为另一种选择通过与特定基因序列相互作用来介导基因沉默,其中这样的相互作用导致在转录或转录后水平上的基因沉默。
已经在多种系统中研究了RNAi。Fire等人(Nature 391:806,1998)首次在秀丽隐杆线虫(C.elegans)中观察到RNAi。Wianny和Goetz(Nature Cell Biol.2:70,1999)描述了在小鼠胚胎中由dsRNA介导的RNAi。Hammond等人(Nature 404:293,2000)描述了在用dsRNA转染的果蝇(Drosophila)细胞中的RNAi。Elbashir等人(Nature 411:4942001)描述了通过将合成的21-核苷酸RNA的双链体引入包括人胚肾和HeLa细胞在内的培养的哺乳动物细胞中而诱导的RNAi。
小RNA在控制基因表达中起重要作用。很多发育过程(包括开花)的调节是由小RNA控制的。现在有可能通过使用在植物中产生小RNA的转基因构建体来以工程手段改变植物基因的基因表达。
小RNA似乎是通过与互补RNA或DNA靶序列碱基配对来行使功能的。当与RNA结合时,小RNA或者引发靶序列的RNA裂解或者引发翻译抑制。当与DNA靶序列结合时,据信小RNA可介导靶序列的DNA甲基化。无论具体机制是什么,这些事件的后果是基因表达受到抑制。
据认为,小RNA和它们的RNA靶标之间的序列互补性有助于测定采用了哪种机制(RNA裂解或翻译抑制)。据信,优选与它们的靶标互补的siRNA通过RNA裂解起作用。一些miRNA与它们的靶基因具有完全或几乎完全的互补性,并且对于至少一些这样的miRNA,已经证实了RNA裂解。其他miRNA与它们的靶标具有若干错配,并且在翻译水平上明显抑制了它们的靶标。同样,无需坚持特定的作用机理,出现了这样一种一般规律:完全或几乎完全的互补性引起RNA裂解,而当miRNA/靶标双链体含有许多错配时倾向于翻译抑制。对于此规律的一个明显例外是植物中微RNA 172(miR172)。miR172的其中一个靶标是APETALA2(AP2),尽管miR172与AP2具有几乎完全的互补性,但其表现出引起AP2的翻译抑制而不是引起RNA裂解。
微RNA(miRNA)是长度为约19至约24个核苷酸(nt)的已经在动物和植物中鉴定出的非编码RNA(Lagos-Quintana等人,Science294:853-858 2001,Lagos-Quintana等人,Curr.Biol.12:735-739 2002;Lau等人,Science 294:858-862,2001;Lee和Ambros,Science 294:862-864,2001;Llave等人,Plant Cell 14:1605-1619,2002;Mourelatos等人,Genes.Dev.16:720-728,2002;Park等人,Curr.Biol.12:1484-1495,2002;Reinhart等人,Genes.Dev.16:1616-1626,2002)。它们是由大小为大约70至200nt的较长的前体转录物加工生成的,并且这些前体转录物能够形成稳定的发夹结构。在动物中,涉及加工miRNA前体的酶称为Dicer,这是一种核糖核酸酶III样蛋白(Grishok等人,Cell 106:23-34 2001;Hutvagner等人,Science 293:834-838 2001;Ketting等人,Genes.Dev.15:2654-2659,2001)。植物也具有Dicer样酶,即DCL1(以前称为CARPEL FACTORY/SHORT INTEGUMENTS1/SUSPENSOR1),并且最近有证据表明,其像Dicer一样,也涉及发夹前体的加工以产生成熟miRNA(Park等人,Curr.Biol.12:1484-1495,2002;Reinhart等人,Genes.Dev.16:1616-1626,2002)。此外,最近的研究已经清楚地表明,至少某些miRNA发夹前体最初是作为较长的聚腺苷酸化转录物存在,并且在单个转录物中可存在几种不同的miRNA以及相关发夹(Lagos-Quintana等人,Science 294:853-858 2001;Lee等人,EMBO J 21:4663-4670 2002)。最近的研究还测定了从dsRNA产物的miRNA链选择,所述dsRNA产物是通过DICER加工发夹而产生的(Schwartz等人,2003,Cell 115:199-208)。看起来,经加工的dsRNA的两端的稳定性(即G∶C与A∶U的含量比,和/或错配)影响链选择,具有低稳定性的末端更容易因解旋酶活性而解旋。低稳定性末端的5′末端链被整合至RISC复合物内,而另一条链被降解。
微RNA看起来通过与位于由这些基因产生的转录物中的互补序列结合来调节靶基因。就lin-4和let-7而言,靶位点位于靶mRNA的3′非翻译区中(Lee等人,Cell 75:843-854,1993;Wightman等人,Cell 75:855-862,1993;Reinhart等人,Nature 403:901-906,2000;Slack等人,Mol.Cell 5:659-669 2000),并且在lin-4和let-7miRNA与其靶位点之间有几个错配。lin-4或let-7miRNA的结合看起来引起了由靶mRNA编码的蛋白质的稳态水平下调,而不影响转录物自身(Olsen和Ambros,Dev.Biol.216:671-680,1999)。另一方面,最近有证据表明,在某些情况下,miRNA可引起靶转录物在靶位点内特异性RNA裂解,并且该裂解步骤看起来需要miRNA与靶转录物之间具有100%的互补性(Hutvagner和Zamore,Science 297:2056-2060 2002;Llave等人,PlantCell 14:1605-1619 2002)。看起来有可能miRNA可进入至少两条靶基因调控途径:当靶互补性<100%时,蛋白下调,当靶互补性是100%时,RNA裂解。进入RNA裂解途径的微RNA与在动物中RNA干扰(RNAi)期间以及在植物中转录后基因沉默(PTGS)期间产生的21-25nt短干扰RNA(siRNA)类似(Hamilton和Baulcombe 1999;Hammond等人,2000;Zamore等人,2000;Elbashir等人,2001),并且可能整合进与在RNAi情况中观察到的复合物类似或相同的RNA-诱导的沉默复合物(RISC)内。
用生物信息学鉴定miRNA的靶标在动物中没有成功,这可能是因为动物miRNA与它们的靶标具有低水平的互补性。另一方面,生物信息学方法已经成功地用于预测植物miRNA的靶标(Llave等人,Plant Cell14:1605-1619 2002;Park等人,Curr.Biol.12:1484-1495 2002;Rhoades等人,Cell 110:513-520 2002),因此,看起来植物miRNA与它们的推定靶标的整体互补性高于动物miRNA。植物miRNA的这些预测靶标中的大部分编码涉及植物发育模式或细胞分化的转录因子家族的成员。
本发明的重组DNA构建体(包括抑制DNA构建体)优选包含至少一种调控序列。
优选的调控序列是启动子。
多种启动子可用于本发明的重组DNA构建体(及抑制DNA构建体)中。可根据所需结果来选择启动子,并且可包括用于在宿主生物体中表达的组成型启动子、组织特异性启动子、细胞特异性启动子、诱导型启动子或其他启动子。
虽然候选基因当通过组成型启动子驱动表达时可预测其效应,但候选基因在35S或UBI启动子控制下的高水平、组成型表达可具有多重效应。
使用组织特异表达和/或胁迫特异表达可消除不需要的效应但保留改变根构造的能力。在拟南芥属中已经观察到了该效应(Kasuga等人(1999)Nature Biotechnol.17:287-291)。
适用于植物宿主细胞的组成型启动子包括(例如)Rsyn7启动子的核心启动子和在WO 99/43838和美国专利6,072,050中公开的其他组成型启动子;CaMV 35S核心启动子(Odell等人,Nature 313:810-812(1985));稻肌动蛋白(McElroy等人,Plant Cell 2:163-171(1990));泛素(UBI)(Christensen等人,Plant Mol.Biol.12:619-632(1989)和Christensen等人,Plant Mol.Biol.18:675-689(1992));pEMU(Last等人,Theor.Appl.Genet.81:581-588(1991));MAS(Velten等人,EMBO J.3:2723-2730(1984));ALS启动子(美国专利公开5,659,026)、玉米GOS2启动子(WO0020571A2,公布于2000年4月1日)等。其他组成型启动子包括例如在美国专利5,608,149、5,608,144、5,604,121、5,569,597、5,466,785、5,399,680、5,268,463、5,608,142和6,177,611中公开的那些启动子。
在选择启动子用于本发明方法时,可能有利的是使用组织特异性启动子或发育调控启动子。
优选的组织特异性启动子或发育调控启动子是这样的DNA序列,该序列调节DNA序列选择性地在对雄穗发育、结籽或两者重要的植物细胞/组织中的表达,并限制这种DNA序列只在植物的雄穗发育或种子成熟期间表达。任何引起所需时空表达的可鉴定启动子均可用于本发明的方法中。
种子或胚特异性的并且可用于本发明的启动子包括大豆Kunitz胰蛋白酶抑制剂(Kti3,Jofuku和Goldberg,Plant Cell 1:1079-1093(1989))、马铃薯块茎特异蛋白(patatin)(马铃薯块茎)(Rocha-Sosa,M.等人,1989,EMBO J.8:23-29)、convicilin、豌豆球蛋白和豆球蛋白(豌豆子叶)(Rerie,W.G.等人,1991,Mol.Gen.Genet.259:149-157;Newbigin,E.J.等人,1990,Planta 180:461-470;Higgins,T.J.V.等人,1988,Plant.Mol.Biol.11:683-695)、玉米蛋白(玉米胚乳)(Schemthaner,J.P.等人,1988,EMBO J.7:1249-1255)、菜豆蛋白(菜豆子叶)(Segupta-Gopalan,C.等人,1985,Proc.Natl.Acad.Sci.U.S.A.82:3320-3324)、植物血球凝集素(菜豆子叶)(Voelker,T.等人,1987,EMBO J.6:3571-3577)、B-伴球蛋白和大豆球蛋白(大豆子叶)(Chen,Z-L等人,1988,EMBO J.7:297-302)、谷蛋白(稻胚乳)、大麦醇溶蛋白(大麦胚乳)(Marris,C.等人,1988,Plant Mol.Biol.10:359-366)、麦谷蛋白和麦醇溶蛋白(小麦胚乳)(Colot,V.等人,1987,EMBO J.6:3559-3564)和甘薯贮藏蛋白(sporamin)(甘薯块根)(Hattori,T.等人,1990,Plant Mol.Biol.14:595-604)。可操作地连接至嵌合基因构建体异源编码区的种子特异性基因的启动子在转基因植物中保持它们的时空表达模式。这样的实例包括在拟南芥属和甘蓝型油菜种子中表达脑啡肽的拟南芥属2S种子储藏蛋白基因启动子(Vanderkerckhove等人,Bio/Technology 7:L929-932(1989))、表达荧光素酶的菜豆凝集素和β-菜豆蛋白启动子(Riggs等人,Plant Sci.63:47-57(1989)),以及表达氯霉素乙酰转移酶的小麦谷蛋白启动子(Colot等人,EMBO J 6:3559-3564(1987))。
可诱导启动子响应内源性或外源性刺激的存在,例如,通过化合物(化学诱导剂),或响应环境、激素、化学信号和/或发育信号而选择性表达可操作地连接的DNA序列。可诱导的或受调控的启动子包括(例如)受光、热、胁迫、水涝或干旱、植物激素、创伤或诸如乙醇、茉莉酮酸酯、水杨酸或安全剂之类的化学品调控的启动子。
优选的启动子包括如下启动子:1)胁迫诱导型RD29A启动子(Kasuga等人,1999,Nature Biotechnol.17:287-91);2)大麦启动子B22E;B22E的表达是发育中的玉米籽粒中的柄所特异性的(“PrimaryStructure of a Novel Barley Gene Differentially Expressed in ImmatureAleurone Layers(在未成熟糊粉层中差异表达的新大麦基因的一级结构)”。Klemsdal,S.S.等人,Mol.Gen.Genet.228(1/2):9-16(1991));以及3)玉米启动子Zag2(“Identification and molecular characterizationof ZAG1,the maize homolog of the Arabidopsis floral homeotic geneAGAMOUS(ZAG1-拟南芥属花同源异形基因AGAMOUS的玉米同系物的鉴定和分子表征)”,Schmidt,R.J.等人,Plant Cell 5(7):729-737(1993))。“Structural characterization,chromosomallocalization andphylogenetic evaluation of two pairs of AGAMOUS-like MADS-box genesfrom maize(两对来自玉米的AGAMOUS样MADS-box基因的结构表征、染色体定位及系统发育评价)”,Theissen等人,Gene 156(2):155-166(1995);NCBI GenBank Accession X80206))。Zag2转录物可在授粉前5天至授粉后(DAP)7至8天被检测到,并且引导Ciml在发育中的雌花序心皮中表达,Ciml对发育中的玉米籽粒的籽仁而言是特异性的。Ciml转录物在授粉前4至5天至授粉后6至8天被检测到。其他可用的启动子包括可源自其表达与发育中的雌小花母系相关的基因的任何启动子。
用于在植物中调节本发明的核苷酸序列表达的其他优选启动子是维管元件特异性启动子或茎优选启动子。这种茎优选启动子包括苜蓿S2A启动子(GenBank登录号:EF030816;Abrahams等人,Plant Mol.Biol.27:513-528(1995))和S2B启动子(GenBank登录号:EF030817)等等,将这些文献以引用的方式并入本文。
启动子可整个源于天然基因,或者由源于天然存在的不同启动子的不同元件构成,或者甚至包含合成的DNA片段。本领域内的技术人员应当理解,不同的启动子可在不同的组织或细胞类型中,或者在不同的发育阶段,或者响应不同的环境条件而引导基因的表达。还应认识到,由于在大多数情况下还不能完全测定调控序列的确切范围,一些变型的DNA片段可能具有相同的启动子活性。在多数情况下引起基因在大多数细胞型中表达的启动子通常称为“组成型启动子”。目前不断在发现可用于植物细胞中的不同类型的新启动子;在Okamuro,J.K.和Goldberg,R.B.,Biochemistry of Plants 15:1-82(1989)的汇编中可找到许多实例。(将其与其他组成型启动子描述放在一起。)
优选的启动子可包括:RIP2、mLIP15、ZmCOR1、Rab17、CaMV 35S、RD29A、B22E、Zag2、SAM合成酶启动子、泛素启动子(SEQ ID NO:32)、CaMV 19S、nos、Adh、蔗糖合成酶启动子、R-等位基因启动子、根细胞启动子、维管组织特异性启动子S2A(Genbank登录号EF030816;SEQ ID NO:33)和S2B(Genbank登录号EF030817)及来自玉米的组成型启动子GOS2(SEQ ID NO:31)。其他优选的启动子包括根优选的启动子,例如玉米NAS2启动子(SEQ ID NO:30)、玉米Cyclo启动子(US 2006/0156439,公开于2006年7月13日)、玉米ROOTMET2启动子(WO05063998,公开于2005年7月14日)、CR1BIO启动子(WO06055487,公开于2006年5月26日)、CRWAQ81(WO05035770,公开于2005年4月21日)和玉米ZRP2.47启动子(NCBI保藏号:U38790,gi:1063664)。
核苷酸序列的“主要部分”包含的核苷酸序列足以提供其包含的启动子的推定鉴定。核苷酸序列可由本领域技术人员来人工评估,或使用基于计算机的序列比较和鉴定工具进行评估,所述工具使用算法如BLAST(Basic Local Alignment Search Tool;Altschul等人(1993)J.Mol.Biol.215:403-410)。一般来讲,为了推定鉴定启动子核酸序列是否与已知启动子同源,包含三十或更多个邻接核苷酸的序列是必需的。具有如本文报道序列的有益效果,技术人员现在可使用全部公布序列或它们的主要部分用于本领域技术人员已知的目的。因此,本发明包括在附随序列表中报道的完全序列,以及那些上述序列的主要部分。
本发明的重组DNA构建体(及抑制DNA构建体)也可包括其他调控序列,包括但不限于翻译前导序列、内含子和多腺苷酸化识别序列。在本发明的另一个优选的实施方案中,本发明的重组DNA构建体还包括增强子或沉默子。
内含子序列可加入到部分编码序列的5’非翻译区或编码序列以增加积聚在胞浆中的成熟信息的量。已经显示,在植物和动物两者的表达构建体的转录单位中包含可剪接内含子可使基因表达在mRNA和蛋白质水平上均增强高达1000倍。参见Buchman和Berg,Mol.Cell Biol.8:4395-4405(1988);Callis等人,Genes Dev.1:1183-1200(1987)。这种内含子对基因表达的增强通常在将其设置接近转录单位的5’端时为最大。玉米内含子Adh1-S内含子1、2和6、Bronze-1内含子的使用是本领域已知的。通常参见The Maize Handbook,第116章,Freeling和Walbot(编辑),Springer,纽约(1994)。
如果期望进行多肽表达,则通常希望在多核苷酸编码区的3′-端处包含有多腺苷酸化区。该多腺苷酸化区可源自天然基因,源自多种其他植物基因或源自T-DNA。要加入的3′端序列可源自(例如)胭脂碱合成酶或章鱼碱合成酶基因,或作为选择源自另外的植物基因,或较不优选的是源自任何其他真核基因。
“翻译前导序列”指位于基因启动子序列和编码序列之间的DNA序列。翻译前导序列存在于翻译起始序列的经完全加工后的mRNA上游。翻译前导序列可影响mRNA的初级转录过程、mRNA稳定性或翻译效率。翻译前导序列的实例已经有所描述(Turner,R.和Foster,G.Molecular Biotechnology 3:225(1995))。
在本发明的另一个优选的实施方案中,本发明的重组DNA构建体还包括增强子或沉默子。
任何植物均可选择用来鉴定将用于产生本发明重组DNA构建体和抑制DNA构建体的调控序列和基因。适用于分离基因和调控序列的靶植物的实例应该包括但不限于苜蓿、苹果、杏、拟南芥属植物、洋蓟、芝麻菜、芦笋、鳄梨、香蕉、大麦、豆类、甜菜、黑莓、蓝莓、西兰花、抱子甘蓝、卷心菜、卡诺拉、香瓜、胡萝卜、木薯、蓖麻、菜花、芹菜、樱桃、菊苣、芫荽、柑桔类、克莱门氏小柑橘类、三叶草、椰子、咖啡、玉米、棉、蔓越莓、黄瓜、花旗松、茄子、菊苣、茅菜、桉树、茴香、无花果、大蒜、葫芦、葡萄、柚子树、白兰瓜、豆薯、猕猴桃、生菜、韭葱、柠檬、酸橙、火炬松、亚麻子、芒果、甜瓜、蘑菇、油桃、坚果、燕麦、油棕、油菜、秋葵、橄榄树、洋葱、橙、观赏植物、棕榈、木瓜树、欧芹、欧洲防风草、豌豆、桃树、花生、梨树、胡椒、柿树、松树、菠萝、大蕉、李树、石榴树、白杨、马铃薯、南瓜、温柏、辐射松、红菊苣、萝卜、油菜、树莓、稻、黑麦、高粱、南方松、大豆、菠菜、南瓜、草莓、甜菜、甘蔗、向日葵、甘薯、枫香树、柑橘、茶、烟草、蕃茄、黑小麦、草皮草、芜菁、葡萄树、西瓜、小麦、薯蓣和西葫芦。用于鉴定调控序列的特别优选的植物是拟南芥属植物、玉米、小麦、大豆和棉。
优选的组合物
本发明的优选组合物是其基因组中包含本发明的任何重组DNA构建体(包括任何抑制DNA构建体)(例如上面所讨论的那些优选构建体)的植物。优选的组合物也包括任何植物的子代,以及获取自植物或其子代的任何种子,其中所述子代或种子在其基因组中包含重组DNA构建体(或抑制DNA构建体)。子代包括通过植物的自花授粉或异型杂交而获得的连续世代。子代也包括杂交种和近交系。
优选地,在杂交种子繁殖的农作物中,成熟的转基因植物可自花授粉而产生纯合的近交系植物。该近交系植物产生含有新引入的重组DNA构建体(或抑制DNA构建体)的种子。这些种子可生长而产生将会表现出改变的根(或植物)构造,或者可用于育种程序以产生杂交种子,这些杂交种子可生长而产生将会表现出改变的根(或植物)构造的植物。优选地,种子是玉米。
优选地,植物是单子叶植物或双子叶植物,更优选地,是玉米或大豆植物,甚至更优选的是玉米植物,例如玉米杂交种植物或玉米近交系植物。植物还可以是向日葵、高梁、蓖麻、葡萄、卡诺拉、小麦、苜蓿、棉、稻、大麦或小米。
优选地,重组DNA构建体稳定地整合进植物的基因组中。
尤其优选的实施方案包括但不限于如下优选的实施方案:
1.在基因组中包含重组DNA构建体的植物(优选玉米或大豆植物),该重组DNA构建体包含可操作地连接至少一种调控序列的多核苷酸,其中所述多核苷酸编码多肽,所述多肽的氨基酸序列基于ClustalV比对方法在与SEQ ID NO:15、17、19、22、或36进行比较时具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、或100%的序列同一性,并且其中所述植物在与未包含所述重组DNA构建体的对照植物进行比较时表现出改变的根构造。优选地,在与该对照植物比较时,该植物还表现出至少一种农学特性的改变。
2.植物(优选地玉米或大豆植物),所述植物在其基因组中包含:
重组DNA构建体,所述重组DNA构建体包含:
(a)可操作地连接至少一种调控元件的多核苷酸,其中所述多核苷酸编码多肽,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQID NO:15、17、19、22、或36进行比较时具有至少50%的序列同一性;或
(b)抑制DNA构建体,所述抑制DNA构建体包含至少一种调控元件,所述调控元件可操作地连接至:
(i)以下序列的全部或部分:(A)编码多肽的核酸序列,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、17、19、22、或36进行比较时具有至少50%的序列同一性,或(B)所述(b)(i)(A)的核酸序列的全长互补序列;或
(ii)源自所关注的靶基因的有义链或反义链的全部或部分的区域,当与所述区域所来源的有义链或反义链的全部或部分比较时,基于Clustal V比对方法,所述区域的核酸序列具有至少50%的序列同一性,并且其中所述所关注的靶基因编码LLRK或LLRK样多肽,并且其中在与未包含所述重组构建体的对照植物比较时,所述植物表现出至少一种农学特性的改变。
3.在基因组中包含重组DNA构建体的植物(优选玉米或大豆植物),该重组DNA构建体包含可操作地连接至少一种调控序列的多核苷酸,其中所述多核苷酸编码LLRK或LLRK样蛋白,并且其中在与未包含所述重组DNA构建体的对照植物比较时,所述植物表现出改变的根构造。优选地,该植物还表现出至少一种农学特性的改变。
优选地,该LLRK或LLRK样蛋白来自拟南芥属(Arabidopsisthaliana)、玉米(Zea mays)、大豆(Glycine max)、烟豆(Glycine tabacina)、野大豆(Glycine soia)和短绒野大豆(Glycine tomentella)。
4.在基因组中包含抑制DNA构建体的植物(优选玉米或大豆植物),该抑制DNA构建体包含至少一个可操作地连接至源自所关注的靶基因的有义链或反义链的全部或部分的区域的调控元件,当与所述区域所来源的有义链或反义链的全部或部分比较时,基于Clustal V比对方法,所述区域的核酸序列具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或100%的序列同一性,并且其中所述所关注的靶基因编码LLRK或LLRK样蛋白,并且其中在与未包含所述重组DNA构建体的对照植物比较时,所述植物表现出至少一种农学特性的改变。
5.在基因组中包含抑制DNA构建体的植物(优选玉米或大豆植物),该抑制DNA构建体包含至少一个可操作地连接至以下序列的全部或部分的调控元件:(a)编码多肽的核酸序列,在与SEQ ID NO:15、17、19、22、或36比较时,基于Clustal V比对方法,该多肽的氨基酸序列具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或100%的序列同一性,或(b)所述(a)的核酸序列的完全互补序列,并且其中在与未包含所述重组·DNA构建体的对照植物比较时,所述植物表现出至少一种农学特性的改变。
6.上述优选实施方案1-5中的植物的任何子代、上述优选实施方案1-5中的植物的任何种子、上述优选实施方案1-5中的植物的子代的任何种子以及来自上述优选实施方案1-5中的植物以及它们的子代的细胞。
在上述优选的实施方案1-6或本发明的任何其他实施方案中的任一项中,重组DNA构建体(或抑制DNA构建体)优选包含至少一种在植物中有功能的启动子作为优选的调控序列。
在上述优选的实施方案1-6或本发明的任意其他实施方案中的任一项中,至少一种农学特性的改变是增加或减少,优选增加。
在任一前述的优选实施方案1-6或本发明的任何其他实施方案中,至少一种农学特性优选选自:绿度、产量、生长速率、生物量、成熟时的鲜重、成熟时的干重、果实产量、种子产量、总植物含氮量、果实含氮量、种子含氮量、营养组织中的含氮量、总植物游离氨基酸含量、果实游离氨基酸含量、种子游离氨基酸含量、营养组织游离氨基酸含量、总植物蛋白质含量、果实蛋白质含量、种子蛋白质含量、营养组织蛋白质含量、抗涝性、氮摄取、根倒伏、茎倒伏、植株高度、穗长以及收获指数;产量、绿度、生物量和根倒伏是尤其优选进行改变的农学特性(优选增加)。
在任一前述的优选实施方案1-6或本发明的任何其他实施方案中,在与对照植物比较时,植物优选表现出至少一种与环境条件例如水和营养物质的可用性无关的农学特性的改变。
本领域的普通技术人员熟悉测定植物根构造改变的规程。例如,可检测分析转基因玉米植物的根构造在幼苗期、花期或成熟期的改变。根构造的改变可通过统计温室培育的植物顶部第3或第4节的节根数目或根带的宽度来测定。“根带”指成熟期植物在花盆底部的根丛宽度。植物根构造变化的其他量度包括但不限于侧根的数量、节根的平均根直径、侧根的平均根直径、根毛的数量和长度。侧根分枝的程度(如侧根数量、侧根长度)可通过这样测定:从完整的根系进行二次取样,将样本用平面扫描器或数码相机成像并用WinRHIZOTM软件(RegentInstruments Inc.)分析。
对提取的有关根表型的数据进行统计分析(通常为t检验),以将转基因根与非转基因姊妹株植株的根进行比较。在多个事件和/或构建体涉及该分析的情况下,还可使用单因素方差分析。
下面的实施例描述了一些用于检测根构造改变的代表性规程和技术。
也可通过在田间测试中,在相同条件下比较植物与对照或参照植物提高产量的能力,来评价植物根构造的改变。
也可通过在田间测试中比较植物在胁迫条件下(例如营养物质过剩或受限、水过剩或受限、存在病害)保持基本产量(优选地至少75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或100%产量)的能力,与非胁迫条件下的对照或参照植物的产量,来评价根构造改变。
根构造的改变可通过测定转基因植物较于参照植物或对照植物的抗根倒伏性来测量。
在评估或测量其中利用了对照或参照植物的本发明任何实施方案(如,如本文描述的组合物或方法)中的转基因植物的农学特性或表型时,本领域的普通技术人员将很容易认识到要利用的合适对照或参照植物。例如,通过如下非限制性示例来说明:
1.转化过的植物的子代,该转化过的植物对于重组DNA构建体(或抑制DNA构建体)来说是半合子的,使得该子代分离成包含或不包含该DNA构建体(或抑制DNA构建体)的植株:包含该重组DNA构建体(或抑制DNA构建体)的子代将通常相对于未包含该重组DNA构建体(或抑制DNA构建体)的子代来进行测量(即,未包含该重组DNA构建体(或抑制DNA构建体)的子代是对照或参照植株)。
2.重组DNA构建体(或抑制DNA构建体)基因渗入至近交系中,例如在玉米中,或基因渗入进变种中,例如在大豆中:基因渗入品系将通常相对于亲本近交系或变种品系进行测量(即,亲本近交系或品种品系是对照或参照植物)。
3.双杂交系,其中第一杂交系由两个亲本近交系产生,而第二杂交系由相同的两个亲本近交系产生,不同的是其中一个亲本近交系含有重组DNA构建体(或抑制DNA构建体):第二杂交系通常将相对于第一杂交系进行测量(即亲本近交系或变种品系为对照植物或参照植物)。
4.包含重组DNA构建体(或抑制DNA构建体)的植株:该植株可相对于这样的对照植株进行评估或测量,该对照植株不包含重组DNA构建体(或抑制DNA构建体),但具有与该植株相当的遗传背景(例如,与包含重组DNA构建体(或抑制DNA构建体)的植株相比较,核遗传物质具有至少90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或100%的序列同一性)。存在许多可用于分析、比较和表征植物遗传背景的基于实验室的技术;其中这些技术是同工酶电泳、限制性片段长度多态性(RFLP)、随机扩增多态性DNA(RAPD)、任何引物聚合成酶链反应(AP-PCR)、DNA扩增指纹(DAF)、序列特异扩增区域(SCAR)、扩增片段长度多态性和也称为微卫星的简单序列重复(SSR)。
此外,本领域的普通技术人员将容易认识到,评估或测量转基因植物的农学特性或表型时合适的对照或参照植物将不包括先前已经针对所需的农学特性或表型,通过诱变或转化而选择的植物。
优选的方法
优选的方法包括但不限于用于改变植物根构造的方法、用于评价植物根构造改变的方法、用于改变植物农学特性的方法、用于测定植物农学特性改变的方法以及用于产生种子的方法。优选地,植物是单子叶植物或双子叶植物,更优选地,是玉米或大豆植物,甚至更优选地,是玉米植物。植物还可以是向日葵、高梁、蓖麻、卡诺拉、小麦、苜蓿、棉、稻、大麦或小米。种子优选的是玉米或大豆种子,更优选的是玉米种子,并且甚至更优选的是玉米杂交种种子或玉米近交系种子。
特别优选的方法包括但不限于如下方法:
改变植物根构造的方法,该方法包括:(a)将重组DNA构建体引入到可再生的植物细胞中,该重组DNA构建体包含可操作地连接至少一种调控序列(优选在植物中有功能的启动子)的多核苷酸,其中该多核苷酸编码多肽,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、17、19、22、或36进行比较时具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、或100%的序列同一性;以及(b)在步骤(a)之后从该可再生的植物细胞再生出转基因植物,其中该转基因植物在其基因组中包含该重组DNA构建体并且在与未包含该重组DNA构建体的对照植物比较时表现出改变的根构造。所述方法可进一步包括(c)获得源自该转基因植物的子代植物,其中所述子代植物在其基因组中包含该重组DNA构建体并且在与未包含该重组DNA构建体的对照植物比较时表现出改变的根构造。
改变植物根构造的方法,该方法包括:(a)将包含至少一种调控序列(优选在植物中有功能的启动子)的抑制DNA构建体导入可再生的植物细胞中,该调控序列可操作地连接至:
(i)以下序列的全部或部分:(A)编码多肽的核酸序列,在与SEQID NO:15、17、19、22、或36比较时,基于Clustal V比对方法,该多肽的氨基酸序列具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或100%的序列同一性,或(B)所述(a)(i)(A)的核酸序列的全长互补序列;或
(ii)源自所关注的靶基因的有义链或反义链的区域,当与所述区域所来源的有义链或反义链的全部或部分比较时,基于Clustal V比对方法,所述区域的核酸序列具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或100%的序列同一性,并且其中所述所关注的靶基因编码LLRK或LLRK样多肽;以及
(b)在步骤(a)之后从该可再生的植物细胞再生出转基因植物,其中该转基因植物在其基因组中包含该重组DNA构建体并且在与未包含该抑制DNA构建体的对照植物比较时表现出改变的根构造。所述方法可进一步包括(c)获得源自该转基因植物的子代植物,其中所述子代植物在其基因组中包含该重组DNA构建体并且在与未包含该抑制DNA构建体的对照植物比较时表现出改变的根构造;
评价植物根构造改变的方法,该方法包括:(a)将重组DNA构建体引入到可再生的植物细胞中,该重组DNA构建体包含可操作地连接至少一种调控序列(优选在植物中有功能的启动子)的多核苷酸,其中所述多核苷酸编码多肽,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、17、19、22、或36进行比较时具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、或100%的序列同一性,或(b)在步骤(a)之后从该可再生的植物细胞再生出转基因植物,其中该转基因植物在其基因组中包含该重组DNA构建体;以及(c)评价与未包含该重组DNA构建体的对照植物比较时该转基因植物的根构造;该方法还可包括:(d)获得源自该转基因植物的子代植物,其中该子代植物在其基因组中包含该重组DNA构建体;以及(e)评价与未包含该重组DNA构建体的对照植物比较时该子代植物的根构造。
评价植物根构造改变的方法,所述方法包括:(a)将包含至少一种调控序列(优选在植物中有功能的启动子)的抑制DNA构建体导入可再生的植物细胞中,该调控序列可操作地连接至:
(i)以下序列的全部或部分:(A)编码多肽的核酸序列,在与SEQID NO:15、17、19、22、或36比较时,基于Clustal V比对方法,该多肽的氨基酸序列具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或100%的序列同一性,或(B)所述(a)(i)(A)的核酸序列的全长互补序列;或者(ii)源自所关注的靶基因的有义链或反义链的区域,当与所述区域所来源的有义链或反义链的全部或部分比较时,基于Clustal V比对方法,所述区域的核酸序列具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、或100%的序列同一性,并且其中所述所关注的靶基因编码LLRK或LLRK样多肽;以及
(b)在步骤(a)之后,从所述可再生的植物细胞再生出转基因植物,其中所述转基因植物在其基因组中包含所述抑制DNA构建体;以及(c)评价该转基因植物在与未包含该抑制DNA构建体的对照植物比较时改变的根构造。该方法可另外包括:(d)获得源自该转基因植物的子代植物,其中该子代植物在其基因组中包含该抑制DNA构建体;以及(e)评价该子代植物在与未包含该抑制DNA构建体的对照植物比较时改变的根构造。
评价植物根构造改变的方法,该方法包括:(a)将重组DNA构建体引入到可再生的植物细胞中,该重组DNA构建体包含可操作地连接至少一种调控序列(优选在植物中有功能的启动子)的多核苷酸,其中所述多核苷酸编码多肽,所述多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、17、19、22、或36进行比较时具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或100%的序列同一性,(b)在步骤(a)之后从该可再生的植物细胞再生出转基因植物,其中该转基因植物在其基因组中包含该重组DNA构建体;(c)获得源自所述转基因植物的子代植物,其中该子代植物在其基因组中包含该重组DNA构建体;以及(d)评价该子代植物在与未包含该重组DNA构建体的对照植物比较时改变的根构造。
评价植物根构造的方法,所述方法包括:
(a)将抑制DNA构建体引入到可再生的植物细胞中,所述抑制DNA构建体包含至少一种调控元件,所述调控元件可操作地连接至:(i)以下序列的全部或部分:(A)编码多肽的核酸序列,在与SEQ IDNO:15、17、19、22、或36比较时,基于Clustal V比对方法,该多肽的氨基酸序列具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或100%的序列同一性,或(B)所述(a)(i)(A)的核酸序列的全长互补序列;或者(ii)源自所关注的靶基因的有义链或反义链的区域,当与所述区域所来源的有义链或反义链的全部或部分比较时,基于Clustal V比对方法,所述区域的核酸序列具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%、或100%的序列同一性,并且其中所述所关注的靶基因编码LLRK或LLRK样多肽;(b)在步骤(a)之后从该可再生的植物细胞再生出转基因植物,其中该转基因植物在其基因组中包含该抑制DNA构建体;
(c)获得源自所述转基因植物的子代植物,其中所述子代植物在其基因组中包含所述抑制DNA构建体;以及(d)评价与未包含该抑制DNA构建体的对照植物比较时该子代植物的根构造。
测定植物农学特性改变的方法,该方法包括:(a)将重组DNA构建体引入到可再生的植物细胞中,该重组DNA构建体包含可操作地连接至少一种调控序列(优选在植物中有功能的启动子)的多核苷酸,其中所述多核苷酸编码多肽,在与SEQ ID NO:15、17、19、22、或36比较时,基于Clustal V比对方法,该多肽的氨基酸序列具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或100%的序列同一性,(b)在步骤(a)之后从该可再生的植物细胞再生出转基因植物,其中该转基因植物在其基因组中包含所述重组DNA构建体;以及(c)测定该转基因植物在与未包含该重组DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变。该方法还可包括:(d)获得源自该转基因植物的子代植物,其中该子代植物在其基因组中包含该重组DNA构建体;以及(e)测定该子代植物在与未包含该重组DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变。
测定植物农学特性改变的方法,该方法包括:(a)将抑制DNA构建体引入到可再生的植物细胞中,该抑制DNA构建体包含至少一种调控序列(优选在植物中有功能的启动子),所述调控序列可操作地连接以下序列的全部或部分:(i)核酸序列,该核酸序列编码多肽,该多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、17、19、22、或36进行比较时具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或100%的序列同一性,或(ii)所述(i)的核酸序列的全长互补序列;(b)在步骤(a)之后从该可再生的植物细胞再生出转基因植物,其中该转基因植物在其基因组中包含该抑制DNA构建体;以及(c)测定该转基因植物在与未包含该抑制DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变。该方法可另外包括:(d)获得源自该转基因植物的子代植物,其中该子代植物在其基因组中包含该抑制DNA构建体;以及(e)测定该子代植物在与未包含该抑制DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变。
测定植物农学特性改变的方法,该方法包括:(a)将重组DNA构建体引入到可再生的植物细胞中,该重组DNA构建体包含可操作地连接至少一种调控序列(优选在植物中有功能的启动子)的多核苷酸,其中所述多核苷酸编码多肽,在与SEQ ID NO:15、17、19、22、或36比较时,基于Clustal V比对方法,该多肽的氨基酸序列具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或100%的序列同一性,(b)在步骤(a)之后从该可再生的植物细胞再生出转基因植物,其中该转基因植物在其基因组中包含所述重组DNA构建体;(c)获得源自所述转基因植物的子代植物,其中该子代植物在其基因组中包含该重组DNA构建体;以及(d)测定该子代植物在与未包含该重组DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变。测定植物中农学特性改变的方法可进一步包括:测定所述转基因植物在不同的环境条件下与未包含所述重组DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变。
测定植物农学特性改变的方法,该方法包括:(a)将抑制DNA构建体引入到可再生的植物细胞中,该抑制DNA构建体包含至少一种调控序列(优选在植物中有功能的启动子),所述调控序列可操作地连接以下序列的全部或部分:(i)核酸序列,该核酸序列编码多肽,该多肽的氨基酸序列基于Clustal V比对方法在与SEQ ID NO:15、17、19、22、或36进行比较时具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或100%的序列同一性,或(ii)所述(i)的核酸序列的全长互补序列;
(b)在步骤(a)之后从该可再生的植物细胞再生出转基因植物,其中该转基因植物在其基因组中包含该抑制DNA构建体;(c)获得源自所述转基因植物的子代植物,其中该子代植物在其基因组中包含该抑制DNA构建体;以及(d)测定该子代植物在与未包含该重组DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变。
测定植物农学特性改变的方法,所述方法包括:(a)将抑制DNA构建体引入到可再生的植物细胞,该抑制DNA构建体包括至少一种调控元件,该调控元件可操作地连接至源自所关注的靶基因的有义链或反义链的全部或部分的区域,当与所述区域所来源的有义链或反义链的全部或部分比较时,基于Clustal V比对方法,所述区域的核酸序列具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或100%的序列同一性,并且其中所述所关注的靶基因编码LLRK或LLRK样多肽;(b)在步骤(a)之后从该可再生的植物细胞再生出转基因植物,其中该转基因植物在其基因组中包含该抑制DNA构建体;以及(c)测定所述转基因植物在与未包含所述抑制DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变。所述方法可进一步包括:(d)获得源自所述转基因植物的子代植物,其中所述子代植物在其基因组中包含所述抑制DNA构建体;以及(e)测定该子代植物在与未包含该抑制DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变。
测定植物农学特性改变的方法,所述方法包括:(a)将抑制DNA构建体引入到可再生的植物细胞中,该抑制DNA构建体包括至少一种调控元件,该调控元件可操作地连接至源自所关注的靶基因的有义链或反义链的全部或部分的区域,当与所述区域所来源的有义链或反义链的全部或部分比较时,基于Clustal V比对方法,所述区域的核酸序列具有至少50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、56%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、99%或100%的序列同一性,并且其中所述所关注的靶基因编码LLRK或LLRK样蛋白;(b)在步骤(a)之后,从所述可再生的植物细胞再生出转基因植物,其中所述转基因植物在其基因组中包含所述抑制DNA构建体;(c)获得源自所述转基因植物的子代植物,其中所述子代植物在其基因组中包含所述抑制DNA构建体;以及(d)测定该子代植物在与未包含该抑制DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变。
产生种子(优选可作为提供改变的根构造的产品销售的种子)的方法,该方法包括任一上述的优选方法,并且还包括从所述子代植物获得种子,其中所述种子在其基因组中包含所述重组DNA构建体(或抑制DNA构建体)。
在任一前述的优选方法或本发明方法的任何其他实施方案中,测定转基因植物中农学特性改变的步骤(如果适用的话)可优选地包括测定在改变的环境条件下与不包含重组DNA构建体的对照植物进行比较时该转基因植物是否表现出至少一种农学特性的改变。
在任一前述的优选方法或本发明方法的任何其他实施方案中,测定子代植物中农学特性改变的步骤(如果适用的话)可优选地包括测定在改变的环境条件下与不包含重组DNA构建体的对照植物进行比较时该子代植物是否表现出至少一种农学特性的改变。
在任何前述的优选方法或本发明方法的任何其他实施方案中,在所述导入步骤中所述可再生的植物细胞优选地包括愈伤组织细胞(优选胚胎)、配子细胞、分生细胞或未成熟胚芽细胞。可再生的植物细胞优选来自近交系玉米植物。
在任一上述的优选方法或本发明方法的任何其他实施方案中,所述再生步骤优选包括:(i)在包含促进胚发生的激素的培养基中培育所述转化的植物细胞直至观察到愈伤组织;(ii)将所述步骤(i)的转化的植物细胞转移至包含促进组织机体形成的激素的第一培养基;以及(iii)在第二培养基上传代培养步骤(ii)后的所述转化的植物细胞,以允许嫩芽伸长、根发育或这两者同时发生。
在任一前述的优选方法或本发明方法的任何其他实施方案中,存在供选择的替代方案用于将包含可操作地连接至少一种调控序列上的多核苷酸的重组DNA构建体导入可再生的植物细胞。例如,可将调控序列(例如一种或多种增强子、优选地作为转位因子的部件)导入可再生的植物细胞,然后筛选其中将所述调控序列可操作地连接至编码本发明多肽的内源基因的事件。
将本发明的重组DNA构建体引入植物可通过任何合适的技术来进行,这些技术包括但不限于引导DNA摄取、化学处理、电穿孔、显微注射、细胞融合、感染、载体介导的DNA转移、轰击或农杆菌介导转化。
在任一上述的优选方法或本发明方法的任何其他实施方案中,至少一种农学特性优选选自:绿度、产量、生长速率、生物量、成熟时的鲜重、成熟时的干重、果实产量、种子产量、总植物含氮量、果实含氮量、种子含氮量、营养组织含氮量、总植物游离氨基酸含量、果实游离氨基酸含量、种子游离氨基酸含量、营养组织游离氨基酸含量、总植物蛋白质含量、果实蛋白质含量、种子蛋白质含量、营养组织蛋白质含量、抗涝性、氮摄取、根倒伏、茎倒伏、植株高度、穗长、茎倒伏以及收获指数。产量、绿度、生物量和根倒伏是尤其优选进行改变的农学特性(优选增加)。
在任一上述的优选方法或本发明方法的任何其他实施方案中,在与对照植物比较时,植物优选表现出至少一种与环境条件无关的农学特性的改变。
将本发明的重组DNA构建体引入植物可通过任何合适的技术来进行,这些技术包括但不限于引导DNA摄取、化学处理、电穿孔、显微注射、细胞融合、感染、载体介导的DNA转移、轰击或农杆菌介导转化。
优选的技术如下文实施例所示,用于转化玉米植物细胞和大豆植物细胞。
用于转化双子叶植物(主要通过利用根癌农杆菌以及获得转基因植物的其他优选方法包括公开的用于棉的那些(美国专利5,004,863、美国专利5,159,135、美国专利5,518,908);用于大豆的那些(美国专利5,569,834、美国专利5,416,011、McCabe等人,Bio/Technology 6:923(1988),Christou等人,Plant Physiol.87:671674(1988));用于芸苔属植物(Brassica)的那些(美国专利5,463,174);用于花生的那些(Cheng等人,Plant Cell Rep.15:653 657(1996),McKently等人,Plant Cell Rep.14:699 703(1995));用于番木瓜的那些;以及用于豌豆的那些(Grant等人,Plant Cell Rep.15:254 258,(1995))。
用电穿孔、粒子轰击和农杆菌转化单子叶植物也已有报道并且作为优选的方法包括(例如)如在天门冬属(asparagus)中实现的转化和植物再生(Bytebier等人,Proc.Natl.Acad.Sci.U.S.A.84:5354,(1987));在大麦中实现的转化和植物再生(Wan和Lemaux,Plant Physiol.104:37(1994));在玉米中实现的转化和植物再生(Rhodes等人,Science 240:204(1988);Gordon-Kamm等人,Plant Cell 2:603 618(1990);Fromm等人,Bio/Technology 8:833(1990);Koziel等人,Bio/Technology11:194,(1993);Armstrong等人,Crop Science 35:550-557(1995));在燕麦中实现的转化和植物再生(Somers等人,Bio/Technology 10:1589(1992));在鸭茅中实现的转化和植物再生(Horn等人,Plant Cell Rep.7:469(1988));在稻中实现的转化和植物再生(Toriyama等人,Theor.Appl.Genet.205:34,(1986);Part等人,Plant Mol.Biol.32:1135 1148,(1996);Abedinia等人,Aust.J.Plant Physiol.24:133 141(1997);Zhang和Wu,Theor.Appl.Genet.76:835(1988);Zhang等人,PlantCell Rep.7:379,(1988);Battraw和Hall,Plant Sci.86:191 202(1992);Christou等人,Bio/Technology 9:957(1991));裸麦(De la Pena等人,Nature 325:274(1987));在甘蔗中实现的转化和植物再生(Bower和Birch,Plant J.2:409(1992));在高羊茅(tall fescue)中实现的转化和植物再生(Wang等人,Bio/Technology 10:691(1992))和在小麦中实现的转化和植物再生(Vasil等人,Bio/Technology 10:667(1992);美国专利5,631,152)。
存在多种用于从植物组织再生植物的方法。再生的具体方法将取决于起始植物组织以及待再生的具体植物物种。
从单植物原生质体转化体或从多种经转化的外植体再生、发育和培育植物是本领域所熟知的(Weissbach和Weissbach(编辑),载于:Methods for Plant Molecular Biology,Academic Press,Inc.San Diego,CA,(1988))。该再生和生长方法通常包括如下步骤:选择转化的细胞、培养这些单独化的细胞通过胚发育的通常阶段以及通过生根小植株阶段。转基因胚以及种子以类似的方式再生。随后将所得的转基因的生根小苗种植在诸如土壤之类的合适植物生长培养基中。
含有编码所关注蛋白质的外来的外源性分离核酸片段的植物的发育或再生是本领域所熟知的。优选地,将再生的植物进行自花授粉以产生纯合的转基因植物。或者,将得自再生植物的花粉与农学上重要的品系的产生种子的植株进行杂交。相反,将来自这些重要品系的植物用于给再生植物授粉。利用本领域技术人员所熟知的方法培育含有所需多肽的本发明的转基因植物。
在另一方面,本发明也涉及与改变根构造和/或改变植物至少一种农学特性有关的对基因变异进行作图的方法,所述方法包括:
(a)使两种植物品种杂交;以及
(b)在得自步骤(a)的杂交的子代植物中针对以下序列来评价基因变异:
(i)选自SEQ ID NO:14、16、18、20、22、24、26、28、30或33的核酸序列;或
(ii)编码选自SEQ ID NO:15;17、19、21、23、25、27、29、31或34的多肽的核酸序列,
所述评价是使用选自下组的方法进行的:RFLP分析、SNP分析、和基于PCR的分析。
在另一个实施方案中,本发明涉及改变根构造和/或改变植物至少一种农学特性的分子育种方法,所述方法包括:
(a)使两种植物品种杂交;以及
(b)在得自步骤(a)的杂交的子代植物中针对以下序列来评价基因变异:
(i)选自SEQ ID NO:14、16、18、20、22、24、26、28、30或33的核酸序列;或
(ii)编码选自SEQ ID NO:15;17、19、21、23、25、27、29、31或34的多肽的核酸序列;
所述评价是使用选自下组的方法进行的:RFLP分析、SNP分析、和基于PCR的分析。
术语“对基因变异进行作图”或“对基因变异性进行作图”可互换使用,并且定义了鉴定DNA序列中在不同植物品系、栽培变种、品种、家族、或物种之间存在差异的基因区域内天然发生或诱导发生的变化的方法。甚至由于极少的碱基变化导致的在特定位点(基因)的基因变异性能够改变可能生成的限制酶消化片段的形式。病原体变成基因型可能是由于被分析基因内的删除或插入导致,或者甚至只是由于能够产生或删除限制酶识别位点的单个核苷酸取代导致。RFLP分析利用这一方法并且利用具有对应于受关注的分离核酸片段的探针的Southern印迹法。
因此,如果多态性(即,基因或DNA片段中普遍存在的变型;也指在相同物种内存在若干形式的基因(等位基因))产生或破坏一个限制性核酸内切酶裂解位点,或者如果它导致DNA丢失或插入(例如可变的核苷酸串联重复(variable nucleotide tandem repeat,VNTR)多态性),将改变能用限制性核酸内切酶消化生成的DNA片段的大小或特征。同样地,通过限制性片段分析能将具有变异序列的个体与那些具有初始序列的个体区分开来。将能用该方法鉴定的多态性称为“限制性片段长度多态性(“RFLP”)。RFLP已经被广泛应用于人类和植物基因分析(Glassberg,UK专利申请2135774;Skolnick等人,Cytogen.Cell Genet.32:58-67(1982);Botstein等人,Ann.J.Hum.Genet.32:314-331(1980);Fischer等人(PCT专利申请WO 90/13668;Uhlen,PCT专利申请WO90/11369)。
“单核苷酸多态性”或“SNP”的中心属性是多态性位点位于单个核苷酸上。据报道,SNP具有对RFLP或VNTR的某些优点。首先,SNP比其它类型的多态性更稳定。它们的自发突变率为大约10-9(Kornberg,DNA Replication,W.H.Freeman & Co.,San Francisco,1980),比VNTR(美国专利公开5,679,524)的频率低了大约1,000倍。其次,SNP的发生频率更高,并且比RFLP和VNTR更一致。因为SNP由序列变异产生,可通过随机基因组测序或cDNA分子测序来鉴定新的多态性。SNP也可由删除、点突变和插入产生。任何单个碱基改变,无论其起因,都可能是SNP。SNP的更高频率意味着它们可能比其它类别的多态性更易于鉴定。
SNP可使用多种方法中的任何一种进行鉴定。此类方法包括直接的或间接的位点测序、使用限制性酶(其中位点相应的等位基因产生或破坏限制性位点)、使用等位基因特异性杂交探针、使用由多态性的不同等位基因或者其它生化物质编码的蛋白的特异抗体。SNP可通过许多方法进行测序。可使用两种基本方法进行DNA测序,Sanger等人的链终止法,Proc.Natl.Acad.Sci.(U.S.A.)74:5463-5467(1977),以及Maxam和Gilbert的化学降解法,Proc.Natl.Acad.Sci.(U.S.A.)74:560-564(1977)。
此外,可通过改进的PCR技术来检测单点突变,例如连接酶链反应(“LCR”)和PCR-单链构象多态性(“PCR-SSCP”)分析。也可使用PCR技术来鉴定极少量样本中的基因表达水平,例如来自主体的组织或细胞。该技术称为反转录-PCR(“RT-PCR”)。
术语“分子育种”定义了在育种过程期间跟踪分子标记的方法。将分子标记与所期望的表型特征相连是普遍的。通过跟踪分离的分子标记或基因特征,而不是对表型进行评分,育种过程能通过种植更少的植物并免于对表型变型进行检测分析或目测而加快。该过程中有用的分子标记包括但不限于鉴定前文提到的可作图的基因变异时可用的任何标记,以及显示跨植物物种的同线性的任何紧接的基因。术语“同线性”指不同生物之间的基因组上的保守基因位置/顺序。这意味着发现两个或更多个基因位点(可以是紧接的,或者可以是不紧接的)位于不同物种间的相同基因组上。同线性的另一个术语是“基因组共线性”。
实施例
本发明将在下面的实施例中进一步说明,其中份数和百分比是以重量计并且度数是摄氏度,除非另外说明。应当理解,尽管这些实施例说明了本发明的优选实施方案,但仅是以例证的方式给出的。根据上面的论述和这些实施例,本领域的技术人员可以测定本发明的基本特征,并在不脱离本发明的精神和范围的情况下,可对本发明做出多种改变和修饰,以使其适用于多种用法和条件。因此,除了那些本文所示和描述的那些之外,根据前文所述,本发明的各种修改形式对本领域的技术人员来说将是显而易见的。这些修改形式也旨在属于所附权利要求书的范围内。
实施例1
制备具有激活标记基因的拟南芥种群
构建18.4kb的T-DNA基二元构建体,pHSbarENDs(图1;SEQ IDNO:1;)包含四个来源于花椰菜花叶病毒35S启动子的四个多聚增强子元件,对应于序列-341至-64,如Odell等人(1985)Nature 313:810-812所述。该构建体也包含允许质粒救援的载体序列(pUC9)、再动员T-DNA的转座子序列(Ds)、以及允许草胺磷选择转基因植物的bar基因。仅将从右边界(RB)至左边界(LB)包含的10.8kb片段转移到寄主植物基因组中。因为增强子元件位于靠近RB处,它们可诱导T-DNA整合后的基因组位点顺式激活。
将pHSbarENDs构建体转化到根癌农杆菌菌株C58中,在25℃下在LB中培养至OD600~1.0。然后离心沉淀细胞,并重悬在相等体积的5%蔗糖/0.05%Silwet L-77(OSI Specialties,Inc)中。在早期抽薹时,培育拟南芥属生态型Col-0的土壤使用农杆菌悬浮液进行顶部灌溉。一周后,相同植株再次用在蔗糖/Silwet中的相同农杆菌菌株进行顶部灌溉。然后将该植物的种子设为标准。所得T1种子在土壤中播种,通过喷洒草胺磷(AgrEvo;Bayer Environmental Science)选择转基因幼苗。从大约35,000个单个草胺磷抗性T1植株中收集T2种子。培养T2植株并收集来自96个分离T2品系的相同体积的T3种子。这组成了360个亚群。
农杆菌菌株和整个植株的转化如上所述进行。
选择了总计100,000个草胺磷抗性T1幼苗。分开保存来自每个品系的T2种子。
实施例2A
筛选以鉴定具有改变根构造的品系(非限制性氮条件)
在与早期发育期间来自如实施例1所述的种群的对照幼苗进行比较时,可分析在不限制氮条件下培养的具有激活标记的拟南芥属幼苗的根系构造。
来自每个96,000个分离T1激活标记品系的十个T2种子可用氯气进行灭菌并种植在培养皿上,培养皿包含以下培养基:0.5x N-FreeHoagland’s,60mM KNO3,0.1%蔗糖,1mM MES和1%PhytagelTM。通常将10个平板置于架子中。平板在4℃下保存三天以使种子分层,然后在22℃光照和20℃黑暗垂直保持11天。光周期为16h;8h黑暗,平均光照强度为~180μmol/m2/s。架子(通常每个装有10个平板)在每个搁板中每日旋转。在第14天,评估平板的幼苗状态,拍摄整个平板的数字图像并分析根面积。将平板随机分成10个水平区域。在板上10个水平区域的每个区域中的根面积以总面积百分比表示。仅仅使用区域3至9的面积进行品系根总面积计算。可使用ICORIA开发的Rootbot图像分析工具(专有)来评估根面积。根总面积以mm2表示。
期望具有增加的根生长特性的品系位于根分布区域的上端。假定架子有最多两个异常值,可使用滑动窗方法评估给定架子的根区域的变化。包括生长培养基、温度、和湿度在内的多个因素的环境变量可引起根生长的显著改变,尤其是在播种期间更是如此。因此根据播种日期和搁板来将所述品系分组以用于数据分析。然后通过平均根面积来拣选特定播种日期/搁板组中的架子。通过将架子ri的数据与来自下一个最低架子(ri-1,以及下一个最高平均根面积,ri+1)的数据进行合并来执行滑动窗根面积分布然后使用Grubbs型方法(Barnett等人,Outliers in StatisticalData,John Wiley & Sons,第3版(1994)来分析组合分布的变量以鉴定ri中的异常值。
将通过上文所述方法测定的具有显著增加的根生长的品系命名为Phase 1 hits。在相同分析条件下进行Phase 1 hits的重复试样再筛选。当任一个或两个Phase 2重复试样显示与平均值的显著差异时,认为该品系是经过验证的根构造品系。
在Phase 2的至少一个平板中再次发现是异常值的那些品系经过室内进行的Phase 3筛选以验证Phase 1和Phase 2中获得的结果。使用下文所述的Rootboot图像分析(如上所述)和验证Phase 3的结果。在第一轮筛选中进行相同方式的确认。T2种子用50%家用漂白剂,0.01%triton X-100溶液灭菌,并以10颗种子/平板的密度置于与第一轮筛选所述的相同平板培养基上。在4℃下保存平板三天以使种子分层,并在与首次实验相同的温度和光周期下培养种子,光照强度为~160μmol/m2/s。将平板垂直放入10平板架的八个中心位置,第一个和最后一个位置放空白平板。每隔一天旋转架子和架子中的平板。每隔平板拍摄两组照片。第一组在14-16天拍摄,此时大多数品系的初生根已经到达平板底部,第二组照片在发育出更多侧根两天后拍摄。通常用后面的一组照片进行数据分析。用软件(Regent Instruments Inc)分析在垂直平板上生长的这些幼苗的根生长,该软件是特别设计的一种进行根测量的图像分析系统。利用像素对照来从较暗的背景辨别出根构造。为了在不拍摄背景情况下鉴定的根的最大量,所述像素级别是150-170,并且使用滤光器去除长度/宽度比率小于10.0的物体。进行分析的平板上的面积为从植物叶片边缘至距离平板底部约1cm处。使用完全相同的设置和分析面积分析一批的所有平板。给出的一个平板的总根长度得分除以已经萌发并沿平板生长一半的植物数目。每个品系培养三个平板,取它们的得分均值。然后将该平均值与同时培养的包含野生型种子的三个平板的平均值比较。
然后使用通过与野生型相比具有更高根生长数值进行再确认的拟南芥激活标记品系,用于分子鉴定侧接T-DNA插入序列的DNA。
实施例2B
在突变种群中鉴定具有改变的根表型的突变品系(氮限制条件)
可使用两步筛选程序,该程序包括:
(1)用垂直平板检测分析法鉴定改变的根生长表型;
(2)在拯救的突变体品系中确认抗除草剂性和根表型;
初次筛选基于垂直平板,该平板包含无氮的Hoagland盐,0.3%蔗糖和1mM KNO3。该培养基也包含0.8%至1.0%PhytaGel作胶凝剂。具有1.0%Phytagel的培养基有时难以灌注,因为它凝固迅速,然而低于0.8%时当垂直放置时培养基将滑出平板。来自激活标记种群的突变体,其中100个单个品系的集合可用于总计36000个品系的筛选。在每个平板上,种植12个突变体和2个野生型Columbia种子。平板置于具有26℃恒温的培养室中,培养室为16小时-日循环,平板顶部的平均光照强度为110μE/m2s。这些平板在2.5周期限内拍照3-4次。当观察到清楚的根表型时拯救单个幼苗。拯救的幼苗在生长室(24℃,每日16小时,250至300μE/m2s)中生长至成熟以采集种子。
就次级筛选而言,将来自在初次筛选中鉴定的推定hits的种子播种于包含与上文相同的培养基(加上6mg/L双丙氨磷)的平板上。野生型Columbia种子在相同时间、但无双丙氨磷的相同培养基上播种。每个平板具有10个种子。每个突变体品系有3个平板,而野生型Columbia有2个平板作为重复试样。这些平板置于培养室中,生长条件与上文所述相同。剔除那些认为是假阳性的不具有抗除草剂性或无明显的根表型的品系。保存次级筛选验证的品系用于进一步研究。
实施例3
鉴定激活标记基因
使用下述两个标准程序中的一个或两个来鉴定侧接导致根构造改变的T-DNA插入序列的基因:(1)热不对称交错PCR(TAIL)PCR(Liu等人,(1995),Plant J.8:457-63);以及(2)SAIFF PCR(Siebert等人,(1995)Nucleic Acids Res.23:1087-1088)。至于复杂的多聚T-DNA插入序列,TAIL PCR和SAIFF PCR可能均不足以鉴定候选基因。在这些情况下,可使用包括反式PCR、质粒拯救和/或基因组文库构建在内的其他程序。
成功的结果是其中单个TAIL或SAIFF PCR片段包含T-DNA边界序列和拟南芥属基因组序列。
一旦获取侧接T-DNA插入序列的基因组序列标记,通过与公开可用的拟南芥属基因组的序列比对来鉴定候选基因。
具体地讲,最靠近35S增强子元件/T-DNA RB的注释基因是激活的基因的候选基因。
为了验证鉴定的基因真的靠近T-DNA并排除TAIL/SAIFF片段是嵌合伪克隆的可能性,用一个T-DNA中的寡核苷酸和一个候选基因特异性的寡核苷酸进行对基因组DNA的诊断PCR。将提供PCR产品的基因组DNA样本理解为表示T-DNA插入序列。该分析也验证了其中一种以上的插入事件发生在相同品系中的情况,例如,在TAIL和/或SAIFF PCR分析中鉴定是否有多个不同基因组片段。
实施例4
鉴定激活标记llrk基因
通过如实施例2B所述的筛选程序,以及随后经过如实施例2A所述的阶段3(室内)筛选来获取llrk基因。如实施例3所述进行激活标记基因的鉴定。
进一步分析显示具有改变的根构造的一个品系(5至7)。提取来自品系的DNA,使用T-DNA左边界内的引物通过连接介导PCR(Siebert等人,(1995)Nucleic Acids Res.23:1087-1088)建立T-DNA插入序列。一旦获取侧接T-DNA插入序列的基因组序列标记,通过与完全拟南芥属基因组的序列比对来鉴定候选基因。将其中一个鉴定的插入位点鉴定为嵌合插入;左边界的T-DNA序列经测定位于T-DNA插入序列的两端。这仍然是可能的:位于靠近T-DNA右边界的增强子元件足够靠近以对附近的候选基因产生效应。在这种情况下,假定右边界位置位于插入位点,并将侧接插入位点的两个基因选作候选基因。最靠近嵌合基因35S增强子的其中一个基因是AT1G69270(SEQ ID NO:20);拟南芥富含亮氨酸重复序列激酶),编码LLRK蛋白(SEQ ID NO:22),本文称为富含亮氨酸重复序列激酶,或LLRK。
实施例5A
验证候选拟南芥属基因(AT1G69270)经由转化到拟南芥属中增强植物
根构造的能力
可将候选基因转化到拟南芥属中并在35S启动子作用下过表达。如果在转基因品系中观察到与亲本激活标记品系相同或相似的表型,则将该候选基因认为是拟南芥属中验证过的“前导基因”。
可直接测试拟南芥属AT1G69270基因促进拟南芥属中的根构造的能力。
拟南芥AT1G69270cDNA用寡核苷酸进行PCR扩增,寡核苷酸导入attB1(SEQ ID NO:25)序列,其上游为ATG起始密码子的共有起始序列(CAACA)和AT1G69270 cDNA(SEQ ID NO:50的核苷酸51-764(终止))蛋白编码区的前23个核苷酸,以及attB2(SEQ ID NO:26)序列和包括所述cDNA终止密码子的蛋白编码区的最后21个核苷酸。使用InvitrogenTM 技术,用pDONRTM/Zeo(InvitrogenTM,图2;SEQ ID NO:2)进行MultiSiteBP重组反应。这种方法将细菌致死ccdB基因以及氯霉素抗性基因(CAM)从pDONRTM/Zeo移除并定向地克隆了该在旁侧具有attB1(SEQ ID NO:25)和attB2(SEQ ID NO:26)位点的PCR产物而得到入门克隆PHP28738。
用紧接InvitrogenTM C1转化插入序列上游的1.3-kb35S启动子构建称为pBC-yellow(图4,SEQ ID NO:4)的16.8-kb T-DNA基的二元载体,所述插入序列包含侧接attR1和attR2序列的ccdB基因和氯霉素抗性基因(CAM)。该载体也包含在Rd29a启动子控制下的YFP标记用于选择转化过的种子。
使用InvitrogenTM 技术,对包含定向克隆PCR产物和pBC-yellow的入门克隆进行MultiSiteLR重组反应。这使得能够迅速地和定向地克隆pBC-yellow中在35S启动子后的AT1G69270基因。
使用如实施例1所述的相同农杆菌介导的转化程序将35S-AT1G69270基因构建体导入野生型拟南芥属生态型Col-0中。
通过存在的荧光YFP标记选择转基因T1种子。按照如实施例2A所述的程序对荧光种子进行根构造检测分析。每个构建体使用6个平板对转基因T1种子进行再筛选。包含从荧光种子中分选出的未转化的Columbia种子的两个平板(每个架子)作为对照。
实施例5B
在氮限制条件下筛选候选基因
也可筛选如上文实施例5A所述通过存在的荧光标记YFP选择的转基因T1种子在氮限制条件下生长的抗性。为此目的,32个转基因个体可在一个有0.4mM KNO3或60mM KNO3的平板上紧邻着32个野生型个体生长。如果一个品系显示与对照的统计意义上的显著差异,可认为该品系是经验证的氮缺乏抗性品系。在掩蔽该平板图像以去除背景颜色后,每个个体收集两个不同的测量数据:总罗赛塔面积和进入绿色区的颜色百分比。使用色调、饱和度和强度数据(HIS),绿色区由色调50至66组成。总罗赛塔面积用作植物生物量的量度,而绿色区通过剂量-响应研究已经显示指示氮同化作用。
实施例5C
验证候选拟南芥属基因(AT1G69270)经由转化进入拟南芥属后改善植
物氮利用率的能力
如实施例5B所述来筛选能够在氮限制条件下生长的转基因种子。
在第10、11、12和13天评估植物。与野生型植物相比,表达拟南芥候选基因(AT1G69270)的转基因个体在低氮(0.4mM KNO3)和不限制氮条件(60mM KNO3)下进行验证;例如,转基因植物在两种条件下的生物量都增加。
实施例5D
筛选以鉴定具有改善的硝酸盐摄取的品系
就每个过表达品系而言,将十二个T2植株播种在96孔微滴定板上,所述微滴定板包含2mM MgSO4,0.5mM KH2PO4,1mM CaCl2,2.5mMKCl,0.15mM Sprint 330,0.06mM FeSO4,1μM MnCl2·4H2O,1μMZnSO4·7H2O,3μM H3BO3,0.1μM NaMoO4,0.1μM CuSO4·5H2O,
0.8mM硝酸钾,0.1%蔗糖,1mM MES,200μM溴酚红和0.40%PhytagelTM(pH测定培养基)。培养基pH使得溴酚呈红色,pH指示染料是黄色的。
将四个品系种植于每个平板中,每个平板上包含12个野生型个体和来自某一已经显示具有改善的硝酸盐摄取(阳性对照)的品系的12个个体,在每个96孔微滴定板上总计有72个个体。可使用基于网络的随机序列发生器测定每个平板上的品系顺序。不将种子种植在96孔微滴定板上的Row A或Row H中。每个实验使用四个平板,使得每个品系分析最多48株植物。在暗处、4℃条件下保持平板三天以使种子分层,然后在22℃,光照和黑暗交替条件下水平放置六天。光周期为16小时光照;8小时黑暗,平均光照强度为~200mmol/m2/s。旋转并振动每个架子中的平板。在第八或第九天(生长五天或六天),通过记录培养基颜色为粉红色、桃色、黄色或无发芽来评估幼苗状态。然后移除每孔上的植物和/或种子。将每个培养基块状物转移到1.2mL微滴定管中,并置于96孔深微滴定板中的相应孔中。将包含2μM荧光素的等体积水加入到每个1.2mL微滴定管中。用土壤覆盖平板并用液体循环高压灭菌。将每个管充分混合,从每个管中取出等分试样并分析培养基中保留的硝酸盐的量。如果t检验显示某个品系与野生型对照具有显著差异(p<0.05),则可认为所述品系是验证过的具有改善的硝酸盐摄取品系。
实施例5E
验证包含候选拟南芥属基因(AT1G69270)的转基因品系氮摄取增加
如实施例5D所述筛选氮摄取增加的转基因种子。
与不过表达拟南芥属候选基因的野生型植物相比,过表达拟南芥属候选基因(AT1G69270)的转基因个体经验证不是具有改善的硝酸盐摄取品系。
实施例6
cDNA文库的组成;
cDNA克隆的分离和测序
cDNA文库可通过许多可用的方法中的任一种制备。例如,通过首先根据生产商的说明书(Stratagene Cloning Systems,La Jolla,CA)制备Uni-ZAPTMXR载体中的cDNA文库,可将cDNA引入质粒载体中。根据Stratagene提供的说明书,将Uni-ZAPTM XR文库转换成质粒文库。转换后,cDNA插入序列将会包含在质粒载体pBluescript中。此外,可用T4 DNA连接酶(New England Biolabs)将cDNA直接引入预切的Bluescript II SK(+)载体(Stratagene)中,然后根据生产商的说明书(GIBCO BRL Products)将其转染进DH10B细胞中。一旦cDNA插入序列处于质粒载体中,从随机选取的含重组pBluescript质粒的细菌菌落制备质粒DNA,或者用对插入的cDNA序列旁侧的载体序列特异性的引物,通过聚合酶链式反应来扩增插入的cDNA序列。将扩增的DNA插入序列或质粒DNA在引物标记法测序反应(dye-primer sequencingreaction)中进行测序,以产生部分cDNA序列(表达序列标记或“EST”;参见Adams等人,1991,Science 252:1651-1656)。用Perkin Elmer Model377荧光测序仪分析所得的EST。
用改进的转座规程来产生全长插入序列(FIS)数据。从归档的甘油原种作为单一菌落来回收鉴定了FIS的克隆,并通过碱性裂解来分离质粒DNA。将分离的DNA模板在基于PCR的测序反应中与载体引物M13正向和反向寡核苷酸反应并上样至自动化的测序仪上。通过与对其进行FIS查询的初始EST序列进行序列比对来确认克隆鉴定。
将确认的模板通过基于酿酒酵母(Saccharomyces cerevisiae)Ty1转座因子(Devine和Boeke,1994,Nucleic Acids Res.22:3765-3772)的Primer Island转座试剂盒(PE Applied Biosystems,Foster City,CA)进行转座。该体外转座系统在整个一组大DNA分子中随机地放入独特的结合位点。随后将转座的DNA用于通过电穿孔转化DH10B电-感受态细胞(Gibco BRL/Life Technologies,Rockville,MD)。转座因子含有另外的可选标记(称为DHFR;Fling和Richards,1983,Nucleic AcidsRes.11:5147-5158),使得能在琼脂平板上仅双重筛选含有整合的转座子的那些亚克隆。从每次转座反应随机地选择多个亚克隆,通过碱性裂解制备质粒DNA,并用对转座子内的结合位点特异性的独特引物从转座事件位点向外进行测序(ABI Prism dye-terminator ReadyReaction mix)。
收集序列数据(ABI Prism Collections)并用Phred和Phrap(Ewing,等人,1998,Genome Res.8:175-185;Ewing和Green,1998,Genome Res.8:186-194)进行装配。Phred是一种公用软件程序,该程序再次读取ABI序列数据,再次调出(recall)碱基,赋质量值,并将碱基序列(base call)和质量值写入可编辑的输出文件中。Phrap序列组装程序使用这些质量值来增加组装的序列重叠群的准确度。通过Consed序列编辑器(Gordon等人,1998,Genome Res.8:195-202)检查装配序列。
在一些克隆中,cDNA片段对应基因的3’-端的一部分并且不会涵盖整个开放阅读框。为了获得上游信息,使用两种不同规程中的一者。这两种方法中的第一种方法导致产生含有所需基因序列的部分的DNA片段,而第二种方法导致产生含有整个开放阅读框的片段。这两种方法均使用两轮PCR扩增以从一个或多个文库获得片段。有时基于以前的知识(特定的基因应该存在于某些组织中)选择文库,有时则进行随机地选择。获得相同基因的反应可平行地在若干文库中进行,或者在文库池中进行。文库池通常用3至5个不同的文库制备并且使其归一化而成为一致的稀释度。在第一轮扩增中,两种方法均使用载体特异性的(正向)引物,同时还使用基因特异性的(反向)引物,该正向引物对应位于克隆5’-端处的载体的一部分。第一种方法使用与已知基因序列的一部分互补的序列,而第二种方法使用与3’-非翻译区(也称为UTR)的一部分互补的基因特异性引物。在第二轮扩增中,两种方法均使用套式引物组。按照生产商的说明书,用市售试剂盒将所得DNA片段连接进pBluescript载体中。该试剂盒选自可得自包括InvitrogenTM(Carlsbad,CA)、Promega Biotech(Madison,WI)和Gibco-BRL(Gaithersburg,MD)在内的一些供应商的许多试剂盒。如上所述,将质粒DNA通过碱性裂解方法分离并进行测序和用Phred/Phrap进行装配。
实施例7
cDNA克隆的鉴定
编码NDK样多肽的cDNA克隆通过这样鉴定:进行BLAST(基本的局部比对搜索工具);Altschul等人,1993,J.Mol.Biol.215:403-410;还可参见国立卫生研究院国家医学图书馆的国家生物技术信息中心的万维网址上对BLAST算法的解释),寻找与BLAST“nr”数据库中所包含序列(包括所有非冗余GenBank CDS翻译序列、源自3-维结构Brookhaven蛋白质数据银行(Protein Data Bank)、SWISS-PROT蛋白质序列数据库的最新的主要版本、EMBL和DDBJ数据库的序列)的相似性。采用国家生物技术信息中心(NCBI)提供的BLASTN算法,分析如实施例6中获得的cDNA序列与包含在“nr”数据库中的所有可公开获得的DNA序列的相似性。在所有的阅读框中翻译DNA并用NCBI提供的BLASTX算法(Gish和States,1993,Nat.Genet.3:266-272)比较与“nr”数据库中包含的所有可公开获得的蛋白质序列的相似性。为方便起见,通过BLAST计算仅仅偶然观察到cDNA序列与所搜索的数据库中所包含序列的匹配的P值-(概率)在本文报导为“pLog”值,它代表所报导的P值的负对数。因此,pLog值越大,cDNA序列和BLAST的“匹配”代表同源蛋白的可能性就越大。
将受分析的EST与上述Genbank数据库进行比较。通过使用BLASTn算法(Altschul等人,1997,Nucleic Acids Res.25:3389-3402.)对杜邦专利数据库比较具有序列同源共有区域或重叠区域的核苷酸序列,可找到含更5′端或3′端序列的EST。在两个或更多个核酸片段之间存在共有或重叠序列时,该序列可装配成单一的连续核苷酸序列,从而使最初的片段在5′或3′初始方向上延伸。一旦测定了最5′的EST后,可如实施例6中所述,通过全长插入序列来测定其完整的序列。可用tBLASTn算法,通过将已知基因(来自专有来源或公开数据库的已知基因)的氨基酸序列对EST数据库进行比较,可找到属于不同物种的同源基因。tBLASTn算法对所有6个阅读框都翻译了的核苷酸数据库进行氨基酸查询的搜索。该搜索允许不同物种之间的核苷酸密码子使用的差异,并且允许密码子简并。
实施例8
制备cDNA文库并且鉴定编码LLRK样多肽的cDNA克隆
如实施例6所述制备提供来自玉米不同组织的mRNA的cDNA文库。表2描述了该文库的特性。
表2
来自玉米的cDNA文库
文库 | 组织 | 克隆 |
p0128 | 收集的初生和次生幼穗 | p0128.cpiae89r:fis |
cfp1n | 收集的玉米雄穗V7至V12,全长富集的,标准化的。 | cfp1n.pk018.k5b:fis |
cfp2n | 授粉的和未授粉的玉米穗,集中的,全长富集的,标准化的。 | cfp2n.pk056.k23a.fcfp2n.pk056.k23b |
my | Myriad项目提交。 | my.cco1n.pk077.a19 |
cfp5n | 玉米粒,收集阶段,全长富集的,标准化的 | cfp5n.pk002.f22:fis |
使用来自表1中列出的克隆的EST序列和SEQ ID NO:36进行的BLASTX搜索揭示所述cDNA编码的多肽与来自稻的LLRK样多肽的相似性(GI No.115455429、12500990和115438737分别对应于SEQ IDNO:23、24、和37)。表3示出了表2列出的克隆的全长cDNA插入序列(“全长插入序列”或“FIS”)的BLAST结果。每个cDNA插入序列编码完整蛋白或功能性蛋白(“完全基因序列”或“CGS”)。表3和表4也示出了使用Clustal V比对方法、使用默认参数计算的每对氨基酸序列的序列同一性百分比值:
表3
编码NDK样多肽同源物的多肽序列的BLAST结果和同一性百分比
序列 | 状况 | NCBI GI | BLAST pLog打分 | %同一性 |
p0128.cpiae89r:fisSEQ ID NO:14 | CGS | 115455429(稻)SEQ ID NO:23 | >180 | 75.7 |
cfp1n.pk018.k5b:fisSEQ ID NO:16 | CGS | 125600990(稻)SEQ ID NO:24 | >180 | 83.4 |
重叠群:my.cco1n.pk077.a19cfp2n.pk056.k23a.fcfp2n.pk056.k23b(SEQ ID NO:18) | CGS | 115455429(稻)SEQ ID NO:23 | 164 | 60.6 |
cfp5n.pk002.f22:fis(SEQ ID NO:35) | CGS | 115438737(稻)SEQ ID NO:37 | >180 | 90.2 |
图15A至15D示出以下全长氨基酸序列的多重比对:SEQ ID NO:15、17、19、22、和36,以及SEQ ID NO:23和24。图16给出图15A-15D中给出的每对序列的序列同一性百分比和趋异值。
用LASERGENE生物信息计算包(DNASTAR Inc.,Madison,WI)的Megalign程序进行序列比对和同一性百分比计算。用带默认参数(空位罚分=10,空位长度罚分=10)的Clustal比对方法(Higgins和Sharp,1989,CABIOS.5:151-153)进行序列的多重比对。使用Clustal方法的成对比对的默认参数为KTUPLE 1,空位罚分=3,窗口=5,DIAGONALSSAVED=5。
序列比对和BLAST打分以及概率显示包含本发明cDNA克隆的核酸片段编码LLRK样多肽。
表4
编码与NDK和NDK样多肽同源的多肽的序列的BLAST结果
序列 | 状况 | 参照序列 | Blast pLog打分 | %同一性 |
p0128.cpiae89r:fisSEQ ID NO:14 | CGS | SEQ ID NO:4815,在WO2003008540-A2中 | >180 | 74.6 |
cfp1n.pk018.k5b:fisSEQ ID NO:16 | CGS | SEQ ID NO:52915,在JP2005185101中 | >180 | 83.4 |
重叠群:my.cco1n.pk077.a19cfp2n.pk056.k23a.fcfp2n.pk056.k23b(SEQ ID NO:18 | CGS | SEQ ID NO:47357,在US2004034888-A1中 | >180 | 60.6 |
cfp5n.pk002.f22:fis(SEQ ID NO:35) | CGS | SEQ ID NO:64281,在US2007011783中 | >180 | 99.8 |
实施例9
制备含有拟南芥属前导基因(AT1G69270)的同源物的植物表达载体
可使用诸如BLAST(基本的局部比对搜索工具(Basic LocalAlignment Search Tool);Altschul等人,J.Mol.Biol.215:403-410,1993;也参见美国国家卫生研究院(NationalInstitutes of Health)国立医学图书馆(National Library of Medicine)的国家生物技术信息中心(NationalCenter for Biotechnology Information)的万维网网址上对BLAST算法的解释)之类的序列比较算法,鉴定与前导llrk基因同源的序列。同源llrk样序列,如实施例8所述的序列,可通过任何一种以下方法进行PCR扩增。
方法1(基于RNA的方法):如果llrk同源物的蛋白编码区域的5’和3’序列信息是可用的,可如实施例5A所述设计基因特异性引物。可以将RT-PCR用于植物RNA来获得含有RUM1蛋白编码区的核酸片段,该llrk蛋白编码区旁侧为attB1(SEQ ID NO:25)和attB2(SEQ ID NO:26)序列。引物可含有起始密码子上游的共有Kozak序列(CAACA)。
方法2(基于DNA的方法):作为另外一种选择,如果编码LLRK多肽同源物的基因的cDNA克隆是可用的,可以PCR扩增完整cDNA插入序列(含有5′和3′非编码区)。可设计正向引物和反向引物,使它们分别或者含有attB1序列和在该cDNA插入序列前面的载体特异性序列或者含有attB2序列和在该cDNA插入序列后面的载体特异性序列。对于克隆进载体pBluescript SK+中的cDNA插入序列,可以使用正向引物VC062(SEQ ID NO:27)和反向引物VC063(SEQ ID NO:28)。
方法1和方法2可根据本领域技术人员已知的步骤进行修改。例如,方法1的引物可含有限制性酶切位点而不是attB1和attB2位点,用于后来将PCR产物克隆进含有attB1和attB2位点的载体内。另外,方法2可涉及从cDNA克隆、λ克隆、BAC克隆或基因组DNA扩增。
可利用BP重组反应将通过任一种上述方法获得的PCR产物与供体载体(例如pDONRTM/Zeo(InvitrogenTM,图2;SEQ ID NO:2)或pDONRTM221(InvitrogenTM,图3;SEQ ID NO:3)组合。这种方法将细菌致死ccdB基因以及氯霉素抗性基因(CAM)从pDONRTM221移除并定向地克隆了该在旁侧具有attB1和attB2位点的PCR产物而得到入门克隆(entry clone)。使用InvitrogenTM ClonaseTM技术,然后可将来自入门克隆的同源llrk样基因转移到合适的目的载体中以获得植物表达载体,所述载体用于拟南芥属、玉米和大豆,如pBC-Yellow(图4;SEQ ID NO:4)、PHP27840(图5;SEQ ID NO:5)或PHP23236(图6;SEQ ID NO:6),以获取植物表达载体,分别用于拟南芥属、大豆和玉米。
实施例10
用验证过的拟南芥属前导基因及其同源物制备大豆表达载体并转化大
豆
为了检查所得表型,可将大豆植株转化以过表达验证过的拟南芥属基因(AT1G69270)和来自不同物种的对应同源物。
可将实施例5A和9中所述的入门克隆用于将每个基因定向克隆进PHP27840载体(图5,SEQ ID NO:5)中,使得该基因的表达处于SCP1启动子的控制下。
然后可用包含编码本多肽的序列的表达载体转化大豆胚。
为了诱导体细胞胚,可将子叶(长度为3-5mm,从大豆品种A2872的表面灭菌的未成熟种子解剖出来)于26℃在光下或黑暗下培养6-10周。然后切取体细胞胚(其产生次生胚)并将其置于合适的液体培养基内。在重复选择增殖为早期球形阶段胚的体细胞胚的簇后,按下面的描述保持该悬浮液。
可将大豆胚发生悬浮培养物在26℃下在摇床(150rpm)上的35mL液体培养基中保持,荧光光照采用16:8小时(白天/黑夜)的时间表。通过将大约35mg组织移植进35ml液体培养基中,每两周将培养物进行传代培养。
然后可通过基因枪轰击方法(Klein等人,Nature(London)327:70-73,1987;美国专利4,945,050)转化大豆胚发生悬浮培养物。杜邦公司的BiolisticTM PDS1000/HE仪器(氦气改进型)可用于这些转化。
可用于帮助大豆转化的可选标记基因是由来自花椰菜花叶病毒的35S启动子(Odell等人,Nature 313:810-812,1985)、来自质粒pJR225(来自大肠杆菌;Gritz等人,Gene 25:179-188,1983)的潮霉素磷酸转移酶基因以及胭脂碱合成酶基因的3′区构成的嵌合基因,该胭脂碱合成酶基因来自根癌农杆菌(Agrobacterium tumefaciens)Ti质粒的T-DNA。可用于帮助大豆转化的另一种可选标记基因是来自大豆或拟南芥属的除草剂抗性乙酰乳酸合成酶(ALS)基因。ALS是支链氨基酸缬氨酸、亮氨酸和异亮氨酸的生物合成中的第一共用酶。已经鉴定出ALS中的突变导致对三类ALS抑制剂中的某些或全部具有抗性(美国专利5,013,659;其全部内容以引用的方式并入本文)。除草剂抗性ALS基因的表达可处于SAM合成酶启动子(美国专利申请US-2003-0226166-A1;其全部内容以引用方式并入本文)的控制下。
将如下物质(依次)加入50μL 60mg/mL的1μm金颗粒悬浮液:5μLDNA(1μg/μL)、20μL亚精胺(0.1M)和50μL CaCl2(2.5M)。然后搅拌该颗粒制备物三分钟,在微量离心机(microfuge)中离心10秒并除去上清液。然后将DNA包覆的颗粒在400μL 70%乙醇中洗涤一次并再悬浮于40μL无水乙醇中。可将DNA/颗粒悬浮液用超声波处理三次,每次一秒钟。然后将5μL该DNA-包覆的金颗粒装载至每个宏载体盘上。
将大约300-400mg两周大的悬浮培养物置于60×15mm的空培养皿中并用吸管将残留的液体从组织除去。对于每次转化实验,大约5-10板的组织受到正常轰击。膜破裂压力设定为1100psi并将腔室抽成28英寸汞柱的真空。将组织置于离阻挡网大约3.5英寸的地方并轰击三次。轰击后,可将组织分成两份并放回液体培养基中,如上所述进行培养。
轰击后五至七天,用新鲜培养基更换该液体培养基,并在轰击后七至十二天,用含有50mg/mL潮霉素的新鲜培养基更换。可每周更换这种选择培养基。轰击后七至八周,可观察到绿色的转化组织从未转化的坏死的胚芽发生簇长出来。移出分离的绿色组织并将其移植进单独的烧瓶中以产生新的、无性繁殖的、转化的胚发生悬浮培养物。可将每一新品系当成是独立的转化事件。然后可将这些悬浮培养物作为未成熟胚进行传代培养和维持,或者通过使单独体细胞胚成熟并萌发而再生成整株植株。
然后可分析用验证过的基因转化大豆植株以研究相对于对照或参照植株的农学特性。例如,在多种环境条件(如氮限制条件、干旱等)下的氮利用效率、产量增强和/或稳定性。
实施例11
使用颗粒轰击用验证过的拟南芥属前导基因转化玉米
为了检查所得表型,可将大豆植株转化以过表达验证过的拟南芥属前导基因或来自不同物种的对应同源物。
可将实施例5A中所述的入门克隆用于将每种基因定向克隆进玉米转化载体中。玉米基因的表达可处于组成型启动子的控制下,例如玉米泛素启动子(Christensen等人,Plant Mol.Biol.12:619-632,1989,以及Christensen等人,Plant Mol.Biol.18:675-689,1992)。
然后可通过下面的方法将上述重组DNA构建体引入玉米细胞中。可从源于近交玉米系H99和LH132杂交的发育中的颖果切取未成熟的玉米胚。在授粉后10至11天分离胚,这时它们长为1.0至1.5mm。然后将胚以轴线侧朝下放置并与琼脂糖硬化的N6培养基(Chu等人,Sci.Sin.Peking 18:659-668,1975)接触。将胚在27℃下保持在黑暗中。从这些未成熟胚的胚鳞增生出易脆的胚发生愈伤组织,该愈伤组织由未分化的细胞块构成,在胚柄结构上长有体细胞原胚状体和胚状体。可将从该原外植体分离的胚发生愈伤组织在N6培养基上培养,并每两至三周在这种培养基上进行传代培养。
可将质粒p35S/Ac(得自Peter Eckes博士,Hoechst Ag,Frankfurt,Germany)用于转化实验以便提供可选标记。该质粒含有pat基因(见欧洲专利公布0 242 236),该基因编码草胺膦乙酰转移酶(PAT)。酶PAT赋予对除草性谷氨酰胺合成酶抑制剂例如草胺膦的抗性。p35S/Ac的pat基因处于来自花椰菜花叶病毒的35S启动子(Odell等人,Nature313:810-812(1985))和胭脂碱合成酶基因的3′区的控制下,该胭脂碱合成酶基因来自根癌农杆菌Ti质粒的T-DNA。
可将粒子轰击法(Klein等人,Nature 327:70-73(1987))用于将基因转移至愈伤组织培养细胞。根据该方法,利用下面的技术用DNA包覆金颗粒(直径1μm)。将10μg质粒DNA加入到50μL金颗粒悬浮液(每mL 60mg)中。将氯化钙(50μL的2.5M溶液)和亚精胺游离碱(20μL的1.0M溶液)加入到该颗粒中。在加入这些溶液过程中涡旋该悬浮液。10分钟后,将试管粗略地离心(以15,000rpm进行5秒钟)并除去上清液。将该颗粒再悬浮于200μL的无水乙醇中,再次离心并除去上清液。再次进行乙醇冲洗并将颗粒再悬浮于终体积为30μL的乙醇中。可将DNA包覆的金颗粒等分试样(5μL)置于KaptonTM飞行圆盘(Bio-RadLabs)的中心。然后使用PDS-1000/He(Bio-Rad Instruments,Hercules CA),采用1000psi的氦气压、0.5cm的间隙距离以及1.0cm的飞行距离,将颗粒加速射入玉米组织中。
对于轰击,将胚发生组织置于琼脂糖硬化的N6培养基上的滤纸上。组织布置成薄薄一层,并覆盖直径为约5cm的圆形区域。然后可将包含组织的培养皿置于离阻挡网大约8cm的PDS-1000/He的腔室内。然后将该腔室中的空气抽出至28英寸汞柱的真空。利用在击波管中氦气压力达到1000psi时破裂的可破裂膜,宏载体被氦气冲击波加速。
轰击后七天,可将组织转移至N6培养基中,该培养基含有双丙氨磷(每升5mg)并缺少酪蛋白或脯氨酸。组织继续在这种培养基上缓慢生长。另外两周后,可将组织转移至含有bialaphos的新鲜N6培养基上。六周后,在某些装有补充了双丙氨磷的培养基的盘上,可辨别直径约1cm的区域上有活性生长的愈伤组织。当在选择培养基上传代培养时,这些愈伤组织可继续生长。
通过首先将组织簇转移到补充有0.2mg每升的2,4-D的N6培养基中,可从该转基因愈伤组织再生出植物。两周后,可将组织转移至再生培养基中(Fromm等人,Bio/Technology 8:833-839(1990))。
可再生出转基因的T0植株并按照下面的HTP步骤测定它们的表型。可收集T1种子。
可栽培T1植株并分析表型变化。利用图像分析可定量下面的参数:可收集并定量植株面积、体积、生长速率以及颜色分析。与合适的对照植物比较,导致根构造改变或上文列出的任何一种农学特性改变的表达构建体可被认为是拟南芥属前导基因在玉米中发挥功能以改变根构造或植物构造的证据。
此外,可通过直接转化或者从单独转化的品系基因渗入而将含有证实的拟南芥基因的重组DNA构建体导入玉米品系内。
可对转基因植株(或者是近交的或者是杂交的)进行更有力的基于田间的实验来研究在多种环境条件下(如营养物质的改变和水的可利用性)的根构造或植物构造、产量提高和/或抗根倒伏性。
也可进行后续的产量分析,以测定含有验证过的拟南芥属前导基因的植物与不包含验证过的拟南芥属前导基因的对照(或参照)植物相比较时是否具有改善的产量表现。包含验证过的拟南芥属前导基因的植物相对于对照植物将具有改善的产量,优选地在不利环境条件下产量损失减少50%,或在不同环境条件下相对于对照植物将具有提高的产量。
实施例12
电穿孔根癌农杆菌LBA4404
将电穿孔感受态细胞(40μl),例如根癌农杆菌(Agrobacteriumtumefaciens)LBA4404(含有PHP10523)在冰上解冻(20-30分钟)。PHP10523含有用于T-DNA转移的VIR基因、农杆菌属的低拷贝数质粒复制起始区、四环素抗性基因以及用于体内DNA生物分子重组的cos位点。同时,将电穿孔管(electroporation cuvette)在冰上冷却。将该电穿孔仪的设置调节至2.1kV。
将DNA等分试样(0.5μL JT(US 7,087,812)亲代DNA,在低盐缓冲液或双蒸H2O中的浓度为0.2μg-1.0μg)与解冻的农杆菌细胞混合,同时仍然保持在冰上。将该混合物转移至电穿孔管的底部并静止保持在冰上1-2分钟。通过按下“pulse(脉冲)”键两次(理想的是获得4.0毫秒的脉冲)对细胞进行电穿孔(Eppendorf电穿孔仪2510)。随后,将0.5ml 2xYT培养基(或SOCmedium)加入到电穿孔管并转移至15mlFalcon管中。将细胞在28-30℃、200-250rpm下培养3小时。
将250μl的等分试样散布在#30B(YM+50μg/mL奇放线菌素)板上并在28-30℃下培养3天。为了增加转化体的数目,可进行如下两个可选步骤中的其中一个:
选择1:用30μl 15mg/ml的利福平覆盖平板。LBA4404具有针对利福平的染色体抗性基因。这种附加的选择消除了在使用较差的LBA4404感受态细胞制备物时观察到的一些污染克隆。
选择2:进行两次重复的电穿孔以补偿较差的电感受态细胞。
转化体的鉴定:
选取四个独立的克隆并划痕接种在AB基本培养基+50mg/mL奇放线菌素的平板(#12S培养基)上用于分离单个克隆。将平板在28℃下孵育2至3天。
对于每个推定的共整合体,选取单个克隆并将其接种在4ml具有50mg/l的奇放线菌素的#60A中。将该混合物在28℃下摇动培养24小时。采用Qiagen Miniprep+可选的PB洗涤,从4ml培养物分离出质粒DNA。将DNA在30μl中洗提。如上所述,将2μl的等分试样用于电穿孔20μlDH10b+20μl ddH2O。
可任选地,可将15μl等分试样用于转化75-100μl的InvitrogenTMLibrary Efficiency DH5α。将细胞散布在LB培养基+50mg/mL奇放线菌素的平板(#34T培养基)上并将其在37℃下培养过夜。
对于每个推定的共整合体,选取3至4个独立的克隆并将其接种在4ml具有50μg/ml奇放线菌素的2xYT(#60A)上。将细胞在37℃下摇晃培养过夜。
对于4个质粒,利用限制性内切酶BamHI、EcoRI和HindIII再进行三次消化(使用亲代DNA和PHP10523作为对照),这4个质粒代表2种具有正确SalI消化模式的推定共整合体。推荐电凝胶(Electronic gel)用于比较。
作为另一种选择,对于高通量应用,例如针对Gaspe Bay Flint衍生的玉米品系(实施例15-17)所描述的,代替通过限制性酶切分析来评价所得的共整合载体,可将三个克隆同时用于如实施例13所述的感染步骤。
实施例13
农杆菌介导的玉米的转化
为了检查所得表型,可将大豆植株转化以过表达验证过的拟南芥属前导基因或来自不同物种的对应同源物。
农杆菌介导的玉米转化基本上按照Zhao等人,Meth.Mol.Biol.318:315-323(2006)中描述的方法进行(还可参见Zhao等人,Mol.Breed.8:323-333(2001)和1999年11月9日公布的美国专利5,981,840,以引用的方式将该文献并入本文)。该转化过程涉及细菌接种、共培养、静止期、选择以及植株再生。
1.未成熟胚芽制备
从颖果切取未成熟胚并置于装有2mL PHI-A培养基的2mL微型管中。
2.中农杆菌对胚胎的感染和共培养
2.1感染步骤
用1mL微量吸移管移出PHI-A培养基并加入1mL农杆菌悬浮液。轻轻倒置该管进行混合。将该混合物在室温下培养5分钟。
2.2共培养步骤
用1mL微量吸移管将农杆菌悬浮液从感染步骤中移出。使用无菌刮刀将胚从管中刮出并转移到100×15mm培养皿中的PHI-B培养基的平板中。测定胚的朝向,使得胚轴在培养基表面上朝下。将具有胚的平板在20℃下于黑暗中培养3天。L-半胱氨酸可用于共培养阶段。采用标准二元载体,补充有100-400mg/L L-半胱氨酸的共培养培养基对于回收稳定的转基因事件是至关重要的。
3.推定的转基因事件的选择
向在100×15mm培养皿中的PHI-D培养基的平板中转移10个胚芽,保持朝向,并且用parafilm将培养皿密封。将平板在黑暗中于28℃下培养。预计在6-8周将看见活性生长的推定事件(作为浅黄色胚组织)。不产生事件的胚可能是棕色和坏死的,并且几乎看不见脆性组织生长。取决于生长速率,以2-3周的间隔将推定的转基因胚组织转移到新鲜的PHI-D平板上进行传代培养。记录事件。
4.T0植株的再生
将在PHI-D培养基上增殖的胚组织转移至100×25mm培养皿中的PHI-E培养基(体细胞胚成熟培养基)进行传代培养并在28℃下,在黑暗中培养约10至18天,直至体细胞胚成熟。将具有良好限定的盾片和胚芽鞘的个体成熟体细胞胚芽转移到PHI-F胚芽发芽培养基中,并且在28℃下于光中(约80μE,来自冷光灯或同等荧光灯)培养。在7-10天,将约10cm高的再生植株盆载于园艺混合物中,并使用标准园艺方法使其受冷而变得耐寒。
用于植物转化的培养基
1.PHI-A:4g/L的CHU基础盐、1.0mL/L的1000×Eriksson维生素混合物、0.5mg/L的盐酸硫胺素、1.5mg/L的2,4-D、0.69g/L的L-脯氨酸、68.5g/L的蔗糖、36g/L的葡萄糖,pH为5.2。在使用前加入100μM的乙酰丁香酮,过滤灭菌。
2.PHI-B:无葡萄糖的PHI-A,2,4-D增加至2mg/L,蔗糖减少至30g/L并且补充有0.85mg/L的硝酸银(过滤灭菌),3.0g/L的固化剂(gelrite),100μM的乙酰丁香酮(过滤灭菌),pH为5.8。
3.PHI-C:无固化剂和乙酰丁香酮的PHI-B,2,4-D减少至1.5mg/L并且补充有8.0g/L的琼脂,0.5g/L的Ms-吗啉乙磺酸(MES)缓冲液,100mg/L的羧苄青霉素(过滤灭菌)。
4.PHI-D:PHI-C补充3mg/L bialaphos(过滤灭菌的)。
5.PHI-E:4.3g/L的Murashige and Skoog(MS)盐(Gibco,BRL11117-074)、0.5mg/L的烟酸、0.1mg/L的盐酸硫胺素、0.5mg/L的盐酸吡哆醇、2.0mg/L的甘氨酸、0.1g/L的肌醇、0.5mg/L的玉米素(Sigma,商品目录号:Z-0164)、1mg/L的吲哚乙酸(IAA)、26.4μg/L的脱落酸(ABA)、60g/L的蔗糖、3mg/L的双丙氨磷(过滤灭菌)、100mg/L的羧苄青霉素(过滤灭菌)、8g/L的琼脂,pH为5.6。
6.PHI-F:不含玉米素、IAA、ABA的PHI-E;蔗糖减少至40g/L;用1.5g/L的固化剂代替琼脂;pH为5.6。
通过首先将组织簇转移到补充有0.2mg每升的2,4-D的N6培养基中,可从该转基因愈伤组织再生出植物。两周后,可将组织转移至再生培养基(Fromm等人,(1990)Bio/Technology 8:833-839)中。
可进行对转基因T0植株和T1植株的表型分析。
可分析T1植株表型的变化。利用图像分析,可在植株生长过程中在多个时间点,分析T1植株在植株面积、体积、生长速率方面的表型变化并且可进行颜色分析。可如实施例20中所述分析根构造的改变。
可对农学特性的改变进行后续分析,以测定含有验证过的拟南芥属前导基因的植株在与不含有验证过的拟南芥属前导基因的对照(或参照)植株比较时是否具有至少一种农学特性的改善。还可在多种环境条件下研究改变。
导致根构造显著改变的表达构建体将被认为是拟南芥属基因在玉米中发挥功能以改变根构造的证据。
实施例14A
利用农杆菌介导的转化构建具有拟南芥属前导基因(AT1G69270)的玉
米表达载体
用拟南芥属llrk基因(AT1G69270)在NAS2(SEQ ID NO:30)和GOS2(SEQ ID NO:31)启动子控制下制备玉米表达载体。PINII是终止子(SEQ ID NO:34)使用InvitrogenTM 技术,如实施例5A所述制备的、包含拟南芥属llrk基因(AT1G69270)的入门克隆PHP28738被用于独立的LR反应:
1)组成型玉米GOS2启动子入门克隆(PHP28408,图11,SEQ IDNO:11)和PinII终止子入门克隆(PHP20234,图9,SEQ ID NO:9)形成目的载体PHP28529(图10,SEQ ID NO:10)。将所得载体命名为PHP29302。
2)根玉米NAS2启动子入门克隆(PHP22020,图12,SEQ ID NO:12)和PinII终止子入门克隆(PHP20234,图9,SEQ ID NO:9)形成目的载体PHP28529(图10,SEQ ID NO:10)。将所得载体命名为PHP29303。
目的载体PHP28529被加到每个最终载体(PHP28911和PHP28912)中,也是:
1)RD29A启动子::黄色荧光蛋白::PinII终止子盒,用于拟南芥属种子分选。
2)泛素启动子::moPAT/红色荧光蛋白融合基因::PinII终止子盒,用于转化选择和玉米种子分选。
实施例14B
制备包含拟南芥属llrk基因及其同源物的玉米表达构建体
可使用实施例5A和14A所述的程序将拟南芥属基因及其来自玉米和其他物种的对应同源物(表1和SEQ ID NO:35)转化到玉米品系中。能如实施例5A和14A所述制备具有拟南芥属llrk基因及其来自玉米和其他物种的对应同源物(表1和SEQ ID NO:35)的玉米表达载体。除了GOS2或NAS2启动子以外,其他启动子,例如(但不限于)泛素启动子、S2A和S2B启动子、玉米ROOTMET2启动子、玉米Cyclo、CR1BIO、CRWAQ81以及玉米ZRP2.4447,也可用于引导llrk和llrk样基因在玉米中的表达。此外,多种终止子,例如但不限于PINII终止子,可用于完成所关注基因在玉米中的表达。
实施例14C
使用农杆菌介导转化,用拟南芥属前导基因(AT1G69270)和来自其他
物种的对应同源物来转化玉米品系
然后可将最终载体(玉米中表达的载体,实施例14A和B)分别电穿孔进入包含PHP10523的LBA4404农杆菌(图7;SEQ ID NO:7,Komari等人Plant J 10:165-174(1996),NCBI GI:59797027)中,以产生用于玉米转化的共整合载体。该共整合载体是通过最终载体(玉米表达载体)与PHP10523的重组(通过每个载体上含有的COS重组位点)而形成。除了实施例14A-B中所述的表达盒以外,该共整合载体还含有农杆菌菌株以及农杆菌介导转化所需的基因(TET、TET、TRFA、ORI终止子、CTL、ORI V、VIR C1、VIR C2、VIR G、VIR B)。转化玉米品系可如实施例13所述进行。
实施例15
用于转化Gaspe Bay Flint衍生的玉米品系的目的载体PHP23236和
PHP29635的制备
目的载体PHP23236(图6,SEQ ID NO:6)是通过用质粒PHP23235(图8,SEQ ID NO:8)转化包含质粒PHP10523(图7,SEQ ID NO:7)的农杆菌菌株LBA4404并分离所得的共整合产物而获得。目的载体PHP23236可被用于如实施例16所述的与入门克隆的重组反应,以产生用于转化Gaspe Bay Flint衍生的玉米品系的玉米表达载体。所关注的基因的表达是处于泛素启动子(SEQ ID NO:32)的控制之下。
PHP29635(图13,SEQ ID NO:13)是通过用质粒PIIOXS2a-FRT87(ni)m(图14,SEQ ID NO:29)转化包含质粒PHP10523的农杆菌菌株LBA4404并分离所得的共整合产物而获得。目的载体PHP29635可被用于如实施例16所述的与入门克隆的重组反应,以产生用于转化Gaspe Bay Flint衍生的玉米品系的玉米表达载体。所关注的基因的表达是处于S2A启动子(SEQ ID NO:33)的控制之下。
实施例16
用于转化Gaspe Bay Flint衍生的玉米品系的质粒的制备
使用InvitrogenTM 重组技术,可如实施例5A和9所述制备包含拟南芥属llrk基因(AT1G69270)或玉米llrk样同源物的入门克隆,该克隆用于定向克隆每个基因进入目的载体PHP23236(实施例15)用于在泛素启动子下表达,或进入目的载体PHP29635(实施例15)用于在S2A启动子下表达。每一种表达载体都是用于农杆菌介导玉米转化的T-DNA二元载体。
Gaspe Bay Flint衍生的玉米品系可如实施例17中所述用表达构建体转化。
实施例17
用验证过的拟南芥属前导基因和来自其他物种的对应同源物转化Gaspe
Bay Flint衍生的玉米品系
为了检查所得表型,玉米植株可如实施例16所述进行转化以过表达拟南芥属AT1G69270基因和来自其他物种的同源物,如表1和SEQ IDNO:35中列出的基因。除了如实施例16所述的启动子之外,其他启动子,例如S2A和S2B启动子、玉米ROOTMET2启动子、玉米Cyclo、CRlBIO、CRWAQ81以及玉米ZRP2.4447,也可用于引导llrk和llrk样基因在玉米中的表达。此外,多种终止子,例如但不限于PINII终止子,可用于完成所关注基因在Gaspe Bay Flint衍生的玉米品系中的表达。
受体植株
受体植株细胞可来自具有短的生活周期(“快速循环”)、大小减少以及转化潜能高的单一玉米品系。对玉米典型的这些植株细胞是来自可公开获得的Gaspe Bay Flint(GBF)品系品种的植株细胞。一种可能的候选植株品系品种是GBF×QTM(Quick Turnaround Maize(快速周转玉米),选择用于在温室条件下生长的Gaspe Bay Flint的可公开获得形式)的F1杂交种,其在Tomes等人的美国专利申请公开2003/0221212中有所公开。从该品系获得的转基因植株具有如此小的大小使得它们可在4英寸的盆中生长(是正常大小的玉米植株所需空间的1/4)并且它们在少于2.5个月时间内成熟。(传统上,一旦转基因植株适应温室后需要3.5个月来获得转基因T0种子。)另一合适的品系是GS3(高度可转化的品系)×Gaspe Flint的双单倍体品系。还有另一种合适的品系是携带引起较早开花、高度减小或这两者的转基因的可转化的优良近交系。
转化规程
任何合适的方法可用于将转基因引入玉米细胞中,包括但不限于利用基于农杆菌载体的接种类型的步骤,如实施例9所述。转化可在受体(靶标)植株的未成熟胚上进行。
精确的生长和植株跟踪
将由转化的玉米胚产生的转基因(T0)植株的事件群体在受控的温室环境中栽培,该温室使用改良的随机分块(block)设计以降低或消除环境误差。随机分块设计是这样一种植株布局,在该布局中,实验植株被分成组(如,每组30株植株),称为块,而每株植株随块被随机分配一个位置。
对于一组30株植株,24株转化的实验植株和6株对照植株(具有设定好的表型的植株)(总起来说称为“重复组”)被置于盆中,这些盆在位于温室内的桌子上布置成阵列(也叫做重复组或块)。每株植株(对照植株或实验植株)随块被随机分配一个位置,所述的块映射一个唯一的、温室物理位置以及映射该重复组。在单次实验中多个30株植株的重复组中的每一个可栽培在相同的温室中。应该测定重复组的布局(布置方式)以使对空间的要求最小以及温室内的环境影响最小。这样一种布局可称为压缩的温室布局。
对于加入特定的对照组的一种替代方法是鉴定不表达所关注基因的那些转基因植株。可将诸如RT-PCR之类的多种技术应用于定量评估引入基因的表达水平。可将不表达转基因的T0植株与表达转基因的那些植株进行比较。
在整个评价过程中鉴定和跟踪事件群体中的每株植株,并且从那些植株收集的数据自动与那些植株相关联,使得所搜集的数据可与由该植株携带的转基因关联。例如,每个植株容器具有机器可读的标签(例如通用货单代码(UPC)条形码),该标签包含了关于植物身份的信息,身份信息继而又与温室位置相关,使得从植物获得的数据可自动与该植物相关联。
作为另外一种选择,可使用任何有效的、机器可读的植物识别系统,例如二维矩阵代码或甚至是射频识别标签(RFID),其中数据被接收并由射频接收器/处理器进行翻译。参见美国公布的专利申请2004/0122592,其以引用方式并入本文。
利用三维成像进行表型分析
对T0事件群体中的每株温室植株(包括任何对照植株)分析所关注的农学特性,并且以这样一种方式记录或存储每株植株的农学数据,该方式使得数据与该植株的辨识数据(见上面)相关联。可利用与上述类似的实验设计,可在T1代中完成对表型(基因效应)的确认。
在植物的整个温室生活周期中,利用定量的非破坏性成像技术在表型水平上来分析T0植株以评估所关注的性状。优选的是,将数字成像分析仪用于整株植物的自动多维分析。成像可在温室内进行。将两个摄像系统(位于顶部和侧面)和用于旋转植物的装置用于从所有侧面观察植物和成像。从每株植物的顶部、前面和侧面采集图像。所有的三个图像一起提供了足够的信息用于评价每株植物的生物量、大小和形态。
由于植物在第一片叶片从土壤显现出来时到植物处于它们发育的末期时大小的改变,最好是从顶部以较高的放大倍率记录植物发育的早期。这可通过利用完全由成像软件控制的自动变焦镜头系统来完成。
在单次成像分析操纵中,进行如下事件:(1)将植株传送至分析仪区域内,旋转360度以便其机器可读标签可被读取,并且让其保持静止直至其叶片停止移动;(2)获取侧面图像并将其输入数据库;(3)将植株旋转90度并再次让其保持静止直至其叶片停止移动,以及(4)将该植株传送出分析仪。
每24小时的周期让植物至少6个小时处于黑暗以便具有正常的白天/黑夜周期。
成像仪器
可使用任何合适的成像仪器,包括但不限于可从LemnaTec GmbH(Wurselen,Germany)商购获得的光谱数字成像仪。获取图像并用具有1/2″IT Progressive Scan IEE CCD成像设备的LemnaTec ScanalyzerHTS LT-0001-2进行分析。该成像照相机可配备有自动变焦、自动调节光圈和自动聚焦。可利用LemnaTec软件设定所有的照相机设置。优选的是,对于主要组成成像分析仪的仪器差异小于约5%,对于次要组成成像分析仪的仪器差异小于约10%。
软件
成像分析系统包括用于颜色和构造分析的LemnaTec HTS Bonit软件程序和用于存储约500,000次分析的数据(包括分析数据)的服务器数据库。原始图像和分析过的图像储存在一起以允许用户根据需要进行再次分析。可将数据库连接至成像硬件用于自动的数据收集和存储。多种可商购获得的软件系统(例如Matlab和其它软件)可用于定量解释图像数据,并且可将这些软件体系中的任何一种应用于所述图像数据集。
传送系统
具有植物旋转装置的传送系统可用于将植物传送至成像区域并在成像过程中选择植物。例如,将最多4株植物(每株最高高度为1.5m)装上汽车,该汽车在循环的传送系统上行进并通过成像测量区域。在这种情况下,该单位(成像分析仪和传送环线)的总占有面积为约5m×5m。
可扩大传送系统以同时容纳更多植物。将植物沿传送环线传送至成像区域并对每株植物分析最多50秒。获取植物的三个视图。传送系统以及成像设备应该能够用于温室环境条件。
照明
任何合适的照明模式可用于图像采集。例如,可在暗背景上使用顶部照明。作为另外一种选择,可采用使用白色背景的顶部照明和背部照明的组合。应该将被照亮的区域围起来以确保恒定的照明条件。遮蔽物应该长于测量区域使得能保持恒定的光条件而不需要打开和关闭门。作为另一种选择,可以变化照明以引起转基因(如,绿色荧光蛋白(GFP)、红色荧光蛋白(RFP))的激发或者引起内源性(如叶绿素)荧光基团 的激发。
基于三维成像的生物量评价
为了更好地评价生物量,应该从至少三个轴(优选顶部视图和两个侧面(侧面1和侧面2)视图)来获取植物图像。然后分析这些图像以将植物从背景(盆和花粉控制袋(如果适用的话))分离。可通过如下计算评价植物的体积:
在上面的等式中,体积和面积的单位是“任意单位”。在该系统中,任意单位完全足以检测基因对植物大小和生长影响,因为所需的是检测与实验平均值或对照平均值的差值(正较大和负较小两者)。大小(如面积)的任意单位可通过将物理参照加入到成像过程而轻易地转化成物理量度。例如,可在顶部成像过程和侧面成像过程两者中均包括已知面积的物理参照。基于这些物理参照的面积,可测定转换因子以允许从像素转换为面积单位,例如平方厘米(cm2)。物理参照可以是或可以不是独立的样本。例如,具有已知直径和高度的盆足可用作物理参照。
颜色分类
成像技术还可用于测定植物颜色以及用于将植物颜色归为各种衍生类型。将图像颜色归属于颜色类型是LemnaTec软件的固有特色。使用其他图像分析软件系统,可通过多种计算方法测定颜色分类。
对于植物大小和生长参数的测定,一种有用的分类方案是定义一种单一颜色方案,包括绿色的两种或三种色调,此外,还有关于缺绿病、坏死和漂白(在这些条件出现时)的颜色类型。还使用了背景颜色类型,其包括图像中的非植物颜色(例如盆和土壤颜色),并将这些像素特别地从测定大小中排除。在受控的恒定照明下分析植物,使得可以定量一株植物内随时间推移的任何改变,或者植物之间或植物不同分枝之间的任何改变(如季节差异)。
除了其在测定植物的大小、生长中的有效性以外,颜色分类还可用于评估其他产量构成性状。对于这些其他产量构成性状,可使用另外的颜色分离方案。例如,称为“保绿度(staygreen)”的性状(已经将其与产量的提高相关联)可通过颜色分类来评估,该颜色分类将绿色色调与黄色和棕色色调(其指示老化的组织)相分离。通过将这种颜色分类应用于在T0或T1植物生活周期末获取的图像,可鉴定绿色的量相对于黄色和棕色(例如,可表示为绿色/黄色比率)增加的植物。这种绿色/黄色比率具有显著差异的植物可被鉴定为携带影响这种重要农学特性的转基因。
熟练的植物学家将认识到可指示植物健康或应激反应的其他植物颜色(花青素)的出现,以及认识到其他颜色分类方案可提供对基因在与这些响应相关的性状方面的作用的进一步度量。
植物结构分析
改变植物构造参数的转基因也可用本发明鉴定,包括诸如最大高度和宽度、节间距离、叶与茎之间的角度、在节处开始的叶片数以及叶片长度。LemnaTec系统软件可如下用于测定植物构造。在第一成像步骤中将植物简化至其主要的几何构造,并且随后基于该图像可进行不同构造参数的参数化鉴定。或者是单独地或者是组合地修改任何这些构造参数的转基因可通过应用此前所述的统计方法来鉴定。
花粉脱落日期
花粉脱落日期是转基因植物中要分析的一个重要参数,并且可通过活性雄花第一次出现在植物上来测定。为了找到雄花目标,通过颜色对茎的上端进行分类以检测黄色或紫色花药。然后将这种颜色分类分析用于定义活性花,活性花继而可用于计算花粉脱落日期。
作为另外一种选择,花粉脱落日期和其他易于在视觉上检测到的植物属性(如授粉日期、第一穗丝日期)可由负责进行植物看护的工作任人员来记录。为了使数据完整性和过程效率最大化,通过利用相同的由LemnaTec光谱数字分析设备利用的条形码来跟踪该数据。可将具有条形码阅读器的电脑、掌上设备或笔记本电脑用于使记录观察时间、植物标识符的数据捕捉变得容易,以及使捕捉数据的操作者感觉舒适。
植物的取向
以接近商业栽培的密度种植的成熟玉米植物通常具有平面的构造。也就是说,植物具有一可清晰分辨的宽的侧面和窄的侧面。对来自植物宽侧的图像进行测定。对于每株植物,给其赋予一个明确界定的基本取向以获得宽侧图像与窄侧(edgewise)图像之间的最大差别。将顶部图像用于测定植物的主轴,而将额外的旋转装置用于在开始主图像采集前将植物转至合适的取向。
实施例18
在氮限制条件下筛选Gaspe Bav Flint衍生的玉米品系
一些转基因植物将含有两个或三个剂量的Gaspe Flint-3与一个剂量的GS3(GS3/(Gaspe-3)2X或GS3/(Gaspe-3)3X),并且对于显性转基因将会以1∶1分离。其他转基因植物将是常规近交系,并将被用于顶交以生成测试杂交体。将植物在Turface中栽培,每天用1mM KNO3生长培养基和2mM KNO3或更高的生长培养基浇洒四次(见图17)。在1mM KNO3培养基中培养的对照植物的绿度较小,产生较少的生物量并且在开花期具有较小的穗。Gaspe衍生的品系将生长至开花期,然而常规杂交种和近交系将生长至V4和V5阶段。
用统计学测定处理株之间所观察到的差异是否是显著差异。图18示出了一种方法,该方法将字母放在数值后面。同一列中其后具有相同字母(不是字母组)的那些值不具有显著的差异。使用该方法,如果在一列中的值的后面没有字母,则该列中的这些值的任何之间不存在显著的差异,换句话讲,该列中的所有这些值是均等的。与无效转基因相比较,转基因的表达将导致植物在1mM KNO3中具有改善的植物生长。因此生物量和绿度数据(如实施例17所述)将在取样(Gaspe在开花期,而其他的品系在V4-V5阶段)时间采集,并与无效转基因植物比较。此外,将在基本组织中分析植物中的总氮。在开花期的生长、绿度、氮积聚和穗大小的改善将指示氮利用效率提高。
实施例19
具有经验证的拟南芥属前导基因(AT1G69270)的玉米品系的产量分析
可通过直接转化或者从单独转化的品系基因渗入而将含有证实的拟南芥属基因的重组DNA构建体导入玉米品系内。
可以将转基因植物(自交系或杂种)进行更强的基于田间的试验,以研究在不同环境条件(例如改变水和营养物质可利用性)下的产量增加和/或稳定性。
可对产量进行后续分析以测定含有验证过的拟南芥属前导基因的植株在与不含有验证过的拟南芥属前导基因的对照植株比较时,在不同环境条件下是否具有产量的改善。可以测得这两种植物的产量都有所减少。包含验证过的拟南芥属前导基因的植物具有相对于对照植物更少的产量损失,优选50%更少的产量损失。
实施例20
测定玉米根构造改变的测定法
测定转基因玉米植物在幼苗期、花期或成熟期的根构造改变。测量玉米植物的根构造改变的测定法包括但不限于下面概述的方法。为了便于手动或自动地测定根构造改变,可让玉米植物在透明的盆中生长。
1)根质量(干重)。让植物在Turface中生长。将烘干的根和根组织称重并计算根冠比。
2)侧根分枝的程度。侧根分枝的程度(如侧根数量、侧根长度)通过这样测定:从完整的根系进行二次取样,将样本用平面扫描器或数码相机成像并用WinRHIZOTM软件(Regent Instruments Inc.)分析。
3)根带宽度测量。根带是植物成熟时在温室栽培盆的底部形成的根带或根量。测量成熟时根带的厚度(以mm为单位),作为对根量的粗略评价。
4)节生根的计数。从支持培养基(support medium)(如盆栽混合物(potting mix))中分离出根后,可测定上部节位处出现的冠根数。另外,可测量冠根和/或支柱根的角度。对节生根和节生根的分枝量的数值分析形成对上述手动方法的另一种延伸。
对提取的有关根表型的所有数据进行统计分析(通常为t检验),以将转基因根与非转基因姊妹株植株的根进行比较。在多个事件和/或构建体涉及该分析的情况下,还可使用单因素方差分析。
实施例21
氮利用效率幼苗检测分析法
使用颜色标记物将转基因事件的种子分成转基因种子(杂合子)和无效转基因种子。进行两组不同的随机分配处理,使用所有处理的9个平行测定,使每个随机分块(block)有排列成6排9列的54个盆。
将每个处理的两个种子种在4英寸的方盆中,盆中包含在8英寸交错中心上的Turface,每天用包含以下营养物质的溶液浇灌四次:
1mM CaCl2 2mM MgSO4 0.5mM KH2PO4 83ppm Sprint330
3mM KCl 1mM KNO3 1μM ZnSO4 1μM MnCl2
3μM H3BO4 1μM MnCl2 0.1μM CuSO4 0.1μM NaMoO4
植物出苗后,将其减少到每盆一个种子。在收获时从盆中移除植株,并且将Turface从根部洗脱。使根与苗分开,把根置于纸袋中并且在70℃干燥70小时。将干燥后的植株部分(根和苗)称重并置于50mL的圆锥管中,管中有大约20 5/32英寸的钢球,在涂料振荡器中进行振荡研磨。将大约30mg研磨组织(记录重量用于后续的调节)在2mL 20%H2O2和6MH2SO4中水解30分钟,水解温度为170℃。冷却后加水至20mL,充分混合,取出50μL等分试样并将其加入950μL 1M Na2CO3中。通过将100μL该溶液置于96孔板的每个孔中,然后加入50μL OPA溶液,使用该溶液中的氨评估减少的总植株氮。测定荧光强度,激发(excitation)=360nM/发射(emission)=530nM,并且与溶解在相似溶液并用OPA溶液处理过的NH4Cl标准品进行比较。
OPA溶液-5μL巯基乙醇+1mL OPA储备液
OPA储备液-50mg邻苯二醛(OPA-Sigma#P0657)溶解于1.5mL甲醇+4.4mL 1MBorate缓冲液pH9.5(3.09g H3BO4+1g NaOH,溶于50mL水中)+0.55mL 20%SDS
使用这些数据,测量以下参数,并且使用Student t检验比较参数平均值与无效参数平均值:
植株生物重量
根生物量
苗生物量
根/苗比率
植株氮浓度
植株总氮
在每个随机分块中使用最近邻计算以及使用完全随机设计(CRD)模型的方差分析(Analysis of Variance,ANOVA)计算差异。使用F统计,通过将总随机分块处理平均面积除以总随机分块误差平均面积计算每个随机分块的总处理效应。计算更大的Student t检验的概率用于比较每个转基因平均值与合适的无效转基因(或者批构建体或单个事件的无效转基因平均值)平均值。使用最小值(P<t)0.1作为临界值。
实施例22
包含拟南芥属llrk基因的玉米幼苗的根与来自不包含llrk基因的幼苗的
根的比较分析
如实施例14A所述制备玉米表达载体,该载体包含玉米NAS2启动子(SEQ ID NO:30)或玉米GOS2启动子(SEQ ID NO:31)以及拟南芥属llrk基因(AT1G69270)。
如实施例14C所述经由农杆菌介导的转化,通过制备共合体载体(分别是PHP29405和PHP29414)完成玉米转化,并使用如实施例20所述的幼苗检测分析法对根进行检测。
在温室实验中检测分析所有10个来自构建体PHP29405((ZM-NAS2::AT-RPK1)的事件,其中每个事件使9个植株在Turface培养基中生长至V4阶段。种子来自顶交杂交种子。实验中的对照物是9株生长至相同阶段的批无效植株(非转基因的分离植株)。使用完全随机分组设计种植种子。在种植后3周收获植株,此时它们达到V4阶段。洗涤根部并从苗中分开收集。在用天平称量干重之前,所有样本进行烘干。
发现在与批无效对照植株进行比较时,总计三(3)个事件具有显著的根干重变化、3个事件具有显著的苗干重变化、3个事件具有显著的总植株干重变化,P值小于0.1。
表5示出在每个转基因事件和对照之间的t检验分析的显著性。示出每个特性的P值,如根干重、苗干重、和根苗比率。粗体字指示转基因事件具有比对照更高的值,也用星号(*)指示。那些在与对照进行比较时P值大于0.1的事件用“NS”表示(不显著)。
表5
事件 | 根干重 | 苗干重 | 植株干重 |
1 | 0.038 | NS | NS |
2 | NS | NS | NS |
3 | NS | NS | NS |
4 | 0.076* | 0.076* | 0.070* |
5 | 0.063* | NS | NS |
6 | NS | NS | NS |
7 | <0.001* | <0.001* | <0.001* |
8 | NS | NS | NS |
9 | NS | NS | NS |
10 | NS | 0.047* | 0.094* |
实施例23
在田间标准氮和低氮条件下的转基因杂交体产量测试
在2008季,在Johnston,Iowa的农场中进行田间实验。测试十个(10)递送控制拟南芥属rpk1基因(AT1G69270)的玉米NAS2启动子和对照的转基因事件。对照是批杂交所有10个事件的非无效转基因与无效转基因。所有事件是由常见自交系受试者生成的IntroEF09B ETX品系的顶交杂交体。
施加两次处理,一次是在标准氮条件下进行处理,另一次是在氮耗尽(stress)环境下进行处理。在标准处理中,以260lb每英亩的比率施加氮肥。氮耗尽处理通过在其中土壤含氮量已经在以前多年的无肥料条件下被作物耗尽的土地上进行。氮耗尽处理以60至80lb每英亩的比率施加氮肥,在与标准氮处理进行比较时这导致6%的产量减少。用2排小块土地进行实验,其密度为每英亩32000株植物。重复4次标准氮处理和6次氮耗尽处理。该实验在2008年5月15日种植,并且在2008年10月18日一起收获。
下表6以相对于无效转基因对照植物的产量增加百分比综述了实验的以每英亩蒲式耳计算的谷物产量数据。总体上,在氮耗尽条件下的4个事件和在标准氮条件下的5个事件显示与批无效转基因对照相比,产量显著增加(α=0.2,2尾分析)。所有测试的事件都显示了相对于无效转基因植物的产量增加趋势。
表6
事件 | 产量相对于无效转基因的增加 | 显著性 | 处理 |
1 | 8.18% | * | 低氮 |
2 | 3.65% | 低氮 | |
3 | 3.67% | 低氮 | |
4 | 2.82% | 低氮 | |
5 | 5.64% | 低氮 | |
6 | 5.36% | 低氮 | |
7 | 7.79% | * | 低氮 |
8 | 4.91% | 低氮 | |
9 | 11.28% | * | 低氮 |
10 | 9.88% | * | 低氮 |
1 | 11.09% | * | 标准氮 |
2 | 1.59% | 标准氮 | |
3 | 15.22% | * | 标准氮 |
4 | 14.06% | * | 标准氮 |
5 | 10.64% | * | 标准氮 |
6 | 3.43% | 标准氮 | |
7 | 13.24% | * | 标准氮 |
8 | 0.52% | 标准氮 | |
9 | 4.01% | 标准氮 | |
10 | 7.85% | 标准氮 |
实施例24
关联作图分析
可使用关联作图方法来鉴定与玉米根构造改变有关的标记。
将获得对根构造改变或至少一种农学特性改变的表型评分。具有极端表型的品系将相对于全基因组关联测试(使用2×2相依表的Fisher精确检验)中的基因型进行测试。将使用基于结构的关联分析,其中使用标记数据控制群体结构。将使用基于模型的集分析软件,Structure,该软件由Pritchard等人开发,(Genetics 155:945-959(2000)),同时使用几百个标记的数百个玉米优良自交系的单倍型数据评估混合系数并且将自交系分配成许多亚群。这降低了假阳性的发生概率,假阳性可能由对关联作图统计的群体结构效应引发。使用Kuiper统计测试两个分配是否相同,并且测试给定标记与在给定亚群中的单倍型和表型之间的关联(Press等人,Numerical Recipes in C,第二版,Cambridge UniversityPress,NY(2002))。
在至少一个亚群中的至少一个强峰值指示显著标记-性状关联(例如p<0.001)。cM中给出了标记位点,零点是染色体起始处的第一个(距着丝点最远端)已知标记。这些图谱位点不是绝对的,而是提供基于内源遗传图谱的图谱位点预测。
序列表
<110>E.I.du Pont de Nemours and Company and
Pioneer Hi-Bred International
<120>具有改变的根构造的植物、涉及编码富含亮氨酸重复序列激酶(LLRK)多肽及其同源
物的基因的相关构建体和方法
<130>BB1632
<150>US 60/989,161
<151>2007-11-20
<150>US 60/041,281
<151>2008-04-01
<160>37
<170>PatentIn 3.5版本
<210>1
<211>18444
<212>DNA
<213>人工序列
<220>
<223>载体
<400>1
catgaatcaa acaaacatac acagcgactt attcacacga gctcaaatta caacggtata 60
tatcctgccg tcgacaacca tggtctagac aggatccccg ggtaccgagc tcgaatttgc 120
aggtcgactg cgtcatccct tacgtcagtg gagatatcac atcaatccac ttgctttgaa 180
gacgtggttg gaacgtcttc tttttccacg atgctcctcg tgggtggggg tccatctttg 240
ggaccactgt cggcagaggc atcttgaacg atagcctttc ctttatcgca atgatggcat 300
ttgtaggtgc caccttcctt ttctactgtc cttttgatga agtgacagat agctgggcaa 360
tggaatccga ggaggtttcc cgatattacc ctttgttgaa aagtctcaat tgccctttgg 420
tcttctgaga ctgttgcgtc atcccttacg tcagtggaga tatcacatca atccacttgc 480
tttgaagacg tggttggaac gtcttctttt tccacgatgc tcctcgtggg tgggggtcca 540
tctttgggac cactgtcggc agaggcatct tgaacgatag cctttccttt atcgcaatga 600
tggcatttgt aggtgccacc ttccttttct actgtccttt tgatgaagtg acagatagct 660
gggcaatgga atccgaggag gtttcccgat attacccttt gttgaaaagt ctcagttaac 720
ccgcgatcct gcgtcatccc ttacgtcagt ggagatatca catcaatcca cttgctttga 780
agacgtggtt ggaacgtctt ctttttccac gatgctcctc gtgggtgggg gtccatcttt 840
gggaccactg tcggcagagg catcttgaac gatagccttt cctttatcgc aatgatggca 900
tttgtaggtg ccaccttcct tttctactgt ccttttgatg aagtgacaga tagctgggca 960
atggaatccg aggaggtttc ccgatattac cctttgttga aaagtctcaa ttgccctttg 1020
gtcttctgag actgttgcgt catcccttac gtcagtggag atatcacatc aatccacttg 1080
ctttgaagac gtggttggaa cgtcttcttt ttccacgatg ctcctcgtgg gtgggggtcc 1140
atctttggga ccactgtcgg cagaggcatc ttgaacgata gcctttcctt tatcgcaatg 1200
atggcatttg taggtgccac cttccttttc tactgtcctt ttgatgaagt gacagatagc 1260
tgggcaatgg aatccgagga ggtttcccga tattaccctt tgttgaaaag tctcagttaa 1320
cccgcaattc actggccgtc gttttacaac gtcgtgactg ggaaaaccct ggcgttaccc 1380
aacttaatcg ccttgcagca catccccctt tcgccagctg gcgtaatagc gaagaggccc 1440
gcaccgatcg cccttcccaa cagttgcgca gcctgaatgg cgaatggatc gatccgtcga 1500
tcgaccaaag cggccatcgt gcctccccac tcctgcagtt cgggggcatg gatgcgcgga 1560
tagccgctgc tggtttcctg gatgccgacg gatttgcact gccggtagaa ctccgcgagg 1620
tcgtccagcc tcaggcagca gctgaaccaa ctcgcgaggg gatcgagccc ctgctgagcc 1680
tcgacatgtt gtcgcaaaat tcgccctgga cccgcccaac gatttgtcgt cactgtcaag 1740
gtttgacctg cacttcattt ggggcccaca tacaccaaaa aaatgctgca taattctcgg 1800
ggcagcaagt cggttacccg gccgccgtgc tggaccgggt tgaatggtgc ccgtaacttt 1860
cggtagagcg gacggccaat actcaacttc aaggaatctc acccatgcgc gccggcgggg 1920
aaccggagtt cccttcagtg aacgttatta gttcgccgct cggtgtgtcg tagatactag 1980
cccctggggc cttttgaaat ttgaataaga tttatgtaat cagtctttta ggtttgaccg 2040
gttctgccgc tttttttaaa attggatttg taataataaa acgcaattgt ttgttattgt 2100
ggcgctctat catagatgtc gctataaacc tattcagcac aatatattgt tttcatttta 2160
atattgtaca tataagtagt agggtacaat cagtaaattg aacggagaat attattcata 2220
aaaatacgat agtaacgggt gatatattca ttagaatgaa ccgaaaccgg cggtaaggat 2280
ctgagctaca catgctcagg ttttttacaa cgtgcacaac agaattgaaa gcaaatatca 2340
tgcgatcata ggcgtctcgc atatctcatt aaagcagggg gtgggcgaag aactccagca 2400
tgagatcccc gcgctggagg atcatccagc cggcgtcccg gaaaacgatt ccgaagccca 2460
acctttcata gaaggcggcg gtggaatcga aatctcgtga tggcaggttg ggcgtcgctt 2520
ggtcggtcat ttcgaacccc agagtcccgc tcagaagaac tcgtcaagaa ggcgatagaa 2580
ggcgatgcgc tgcgaatcgg gagcggcgat accgtaaagc acgaggaagc ggtcagccca 2640
ttcgccgcca agctcttcag caatatcacg ggtagccaac gctatgtcct gatagcggtc 2700
cgccacaccc agccggccac agtcgatgaa tccagaaaag cggccatttt ccaccatgat 2760
attcggcaag caggcatcgc catgggtcac gacgagatcc tcgccgtcgg gcatgccccc 2820
caattcactg gccgtcgttt tacaacgtcg tgactgggaa aaccctggcg ttacccaact 2880
taatcgcctt gcagcacatc cccctttcgc cagctggcgt aatagcgaag aggcccgcac 2940
cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa tggcgcctga tgcggtattt 3000
tctccttacg catctgtgcg gtatttcaca ccgcatatgg tgcactctca gtacaatctg 3060
ctctgatgcc gcatagttaa gccagccccg acacccgcca acacccgctg acgcgccctg 3120
acgggcttgt ctgctcccgg catccgctta cagacaagct gtgaccgtct ccgggagctg 3180
catgtgtcag aggttttcac cgtcatcacc gaaacgcgcg agacgaaagg gcctcgtgat 3240
acgcctattt ttataggtta atgtcatgat aataatggtt tcttagacgt caggtggcac 3300
ttttcgggga aatgtgcgcg gaacccctat ttgtttattt ttctaaatac attcaaatat 3360
gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa aaaggaagag 3420
tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat tttgccttcc 3480
tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc agttgggtgc 3540
acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga gttttcgccc 3600
cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg cggtattatc 3660
ccgtattgac gccgggcaag agcaactcgg tcgccgcata cactattctc agaatgactt 3720
ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag taagagaatt 3780
atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc tgacaacgat 3840
cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg taactcgcct 3900
tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg acaccacgat 3960
gcctgtagca atggcaacaa cgttgcgcaa actattaact ggcgaactac ttactctagc 4020
ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac cacttctgcg 4080
ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg agcgtgggtc 4140
tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg tagttatcta 4200
cacgacgggg agtcaggcaa ctatggatga acgaaataga cagatcgctg agataggtgc 4260
ctcactgatt aagcattggt aactgtcaga ccaagtttac tcatatatac tttagattga 4320
tttaaaactt catttttaat ttaaaaggat ctaggtgaag atcctttttg ataatctcat 4380
gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg tagaaaagat 4440
caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa 4500
accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc tttttccgaa 4560
ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt agccgtagtt 4620
aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc taatcctgtt 4680
accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact caagacgata 4740
gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac agcccagctt 4800
ggagcgaacg acctacaccg aactgagata cctacagcgt gagcattgag aaagcgccac 4860
gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg gaacaggaga 4920
gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg 4980
ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga gcctatggaa 5040
aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt ttgctcacat 5100
gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct ttgagtgagc 5160
tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga 5220
agagcgccca atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt aatgcagctg 5280
gcacgacagg tttcccgact ggaaagcggg cagtgagcgc aacgcaatta atgtgagtta 5340
gctcactcat taggcacccc aggctttaca ctttatgctt ccggctcgta tgttgtgtgg 5400
aattgtgagc ggataacaat ttcacacagg aaacagctat gaccatgatt acgccaagct 5460
ttctaggggg ggggtaccga tctgagatcg gtaacgaaaa cgaacgggta gggatgaaaa 5520
cggtcggtaa cggtcggtaa aatacctcta ccgttttcat tttcatattt aacttgcggg 5580
acggaaacga aaacgggata taccggtaac gaaaacgaac gggataaata cggtaatcga 5640
aaaccgatac gatccggtcg ggttaaagtc gaaatcggac gggaaccggt atttttgttc 5700
ggtaaaatca cacatgaaaa catatattca aaacttaaaa acaaatataa aaaattgtaa 5760
acacaagtct taattaaaca tagataaaat ccatataaat ctggagcaca catagtttaa 5820
tgtagcacat aagtgataag tcttgggctc ttggctaaca taagaagcca tataagtcta 5880
ctagcacaca tgacacaata taaagtttaa aacacatatt cataatcact tgctcacatc 5940
tggatcactt agcatgctac agctagtgca atattagaca ctttccaata tttctcaaac 6000
ttttcactca ttgcaacggc cattctccta atgacaaatt tttcatgaac acaccattgg 6060
tcaatcaaat cctttatctc acagaaacct ttgtaaaata aatttgcagt ggaatattga 6120
gtaccagata ggagttcagt gagatcaaaa aacttcttca aacacttaaa aagagttaat 6180
gccatcttcc actcctcggc tttaggacaa attgcatcgt acctacaata attgacattt 6240
gattaattga gaatttataa tgatgacatg tacaacaatt gagacaaaca tacctgcgag 6300
gatcacttgt tttaagccgt gttagtgcag gcttataata taaggcatcc ctcaacatca 6360
aataggttga attccatcta gttgagacat catatgagat ccctttagat ttatccaagt 6420
cacattcact agcacacttc attagttctt cccactgcaa aggagaagat tttacagcaa 6480
gaacaatcgc tttgattttc tcaattgttc ctgcaattac agccaagcca tcctttgcaa 6540
ccaagttcag tatgtgacaa gcacacctca catgaaagaa agcaccatca caaactagat 6600
ttgaatcagt gtcctgcaaa tcctcaatta tatcgtgcac agctacttca tttgcactag 6660
cattatccaa agacaaggca aacaattttt tctcaatgtt ccacttaacc atgattgcag 6720
tgaaggtttg tgataacctt tggccagtgt ggcgcccttc aacatgaaaa aagccaacaa 6780
ttcttttttg gagacaccaa tcatcatcaa tccaatggat ggtgacacac atgtatgact 6840
tattttgaca agatgtccac atatccatag ttgtactgaa gcgagactga acatctttta 6900
gttttccata caacttttct ttttcttcca aatacaaatc catgatatat tttctagcag 6960
tgacacggga ctttattgga aagtgagggc gcagagactt aacaaactca acaaagtact 7020
catgttctac aatattgaaa ggatattcat gcatgattat tgccaaatga agcttcttta 7080
ggctaaccac ttcatcgtac ttataaggct caatgagatt tatgtctttg ccatgatcct 7140
tttcactttt tagacacaac tgacctttaa ctaaactatg tgatgttctc aagtgatttc 7200
gaaatccgct tgttccatga tgaccctcag ccctatactt agccttgcaa ttaggaaagt 7260
tgcaatgtcc ccatacctga acgtatttct ttccatcgac ctccacttca atttccttct 7320
tggtgaaatg ctgccataca tccgatgtgc acttctttgc cctcttctgt ggtgcttctt 7380
cttcgggttc aggttgtggc tgtggttgtg gttctggttg tggttgtggt tgtggttgtg 7440
gttcatgaac aatagccata tcatcttgac tcggatctgt agctgtacca tttgcattac 7500
tactgcttac actctgaata aaatgcctct cggcctcagc tgttgatgat gatggtgatg 7560
tgcggccaca tccatgccca cgcgcacgtg cacgtacatt ctgaatccga ctagaagagg 7620
cttcagcttt tcttttcaac cctgttataa acagattttt cgtattattc tacagtcaat 7680
atgatgcttc ccaatctaca accaattagt aatgctaatg ctattgctac tgtttttcta 7740
atatatacct tgagcatatg cagagaatac ggaatttgtt ttgcgagtag aaggcgctct 7800
tgtggtagac atcaacttgg ccaatcttat ggctgagcct gagggaggat tatttccaac 7860
cggaggcgtc atctgaggaa tggagtcgta gccggctagc cgaagtggag agcagagccc 7920
tggacagcag gtgttcagca atcagcttgg tgctgtactg ctgtgacttg tgagcacctg 7980
gacggctgga cagcaatcag caggtgttgc agagcccctg gacagcacac aaatgacaca 8040
acagcttggt gcaatggtgc tgacgtgctg tactgctaag tgctgtgagc ctgtgagcag 8100
ccgtggagac agggagaccg cggatggccg gatgggcgag cgccgagcag tggaggtctg 8160
gaggaccgct gaccgcagat ggcggatggc ggatgggcgg accgcggatg ggcgagcagt 8220
ggagtggagg tctgggcgga tgggcggacc gcggcgcgga tgggcgagtc gcgagcagtg 8280
gagtggaggg cggaccgtgg atggcggcgt ctgcgtccgg cgtgccgcgt cacggccgtc 8340
accgcgtgtg gtgcctggtg cagcccagcg gccggccggc tgggagacag ggagagtcgg 8400
agagagcagg cgagagcgag acgcgtcgcc ggcgtcggcg tgcggctggc ggcgtccgga 8460
ctccggcgtg ggcgcgtggc ggcgtgtgaa tgtgtgatgc tgttactcgt gtggtgcctg 8520
gccgcctggg agagaggcag agcagcgttc gctaggtatt tcttacatgg gctgggcctc 8580
agtggttatg gatgggagtt ggagctggcc atattgcagt catcccgaat tagaaaatac 8640
ggtaacgaaa cgggatcatc ccgattaaaa acgggatccc ggtgaaacgg tcgggaaact 8700
agctctaccg tttccgtttc cgtttaccgt tttgtatatc ccgtttccgt tccgttttcg 8760
ttttttacct cgggttcgaa atcgatcggg ataaaactaa caaaatcggt tatacgataa 8820
cggtcggtac gggattttcc catcctactt tcatccctga gattattgtc gtttctttcg 8880
cagatcggta ccccccccct agagtcgaca tcgatctagt aacatagatg acaccgcgcg 8940
cgataattta tcctagtttg cgcgctatat tttgttttct atcgcgtatt aaatgtataa 9000
ttgcgggact ctaatcataa aaacccatct cataaataac gtcatgcatt acatgttaat 9060
tattacatgc ttaacgtaat tcaacagaaa ttatatgata atcatcgcaa gaccggcaac 9120
aggattcaat cttaagaaac tttattgcca aatgtttgaa cgatctgctt cgacgcactc 9180
cttctttagg tacggactag atctcggtga cgggcaggac cggacggggc ggtaccggca 9240
ggctgaagtc cagctgccag aaacccacgt catgccagtt cccgtgcttg aagccggccg 9300
cccgcagcat gccgcggggg gcatatccga gcgcctcgtg catgcgcacg ctcgggtcgt 9360
tgggcagccc gatgacagcg accacgctct tgaagccctg tgcctccagg gacttcagca 9420
ggtgggtgta gagcgtggag cccagtcccg tccgctggtg gcggggggag acgtacacgg 9480
tcgactcggc cgtccagtcg taggcgttgc gtgccttcca ggggcccgcg taggcgatgc 9540
cggcgacctc gccgtccacc tcggcgacga gccagggata gcgctcccgc agacggacga 9600
ggtcgtccgt ccactcctgc ggttcctgcg gctcggtacg gaagttgacc gtgcttgtct 9660
cgatgtagtg gttgacgatg gtgcagaccg ccggcatgtc cgcctcggtg gcacggcgga 9720
tgtcggccgg gcgtcgttct gggctcatgg atctggattg agagtgaata tgagactcta 9780
attggatacc gaggggaatt tatggaacgt cagtggagca tttttgacaa gaaatatttg 9840
ctagctgata gtgaccttag gcgacttttg aacgcgcaat aatggtttct gacgtatgtg 9900
cttagctcat taaactccag aaacccgcgg ctgagtggct ccttcaatcg ttgcggttct 9960
gtcagttcca aacgtaaaac ggcttgtccc gcgtcatcgg cgggggtcat aacgtgactc 10020
ccttaattct ccgctcatga tccccgggta ccgagctcga attgcggctg agtggctcct 10080
tcaatcgttg cggttctgtc agttccaaac gtaaaacggc ttgtcccgcg tcatcggcgg 10140
gggtcataac gtgactccct taattctccg ctcatgatct tgatcccctg cgccatcaga 10200
tccttggcgg caagaaagcc atccagttta ctttgcaggg cttcccaacc ttaccagagg 10260
gcgccccagc tggcaattcc ggttcgcttg ctgtatcgat atggtggatt tatcacaaat 10320
gggacccgcc gccgacagag gtgtgatgtt aggccaggac tttgaaaatt tgcgcaacta 10380
tcgtatagtg gccgacaaat tgacgccgag ttgacagact gcctagcatt tgagtgaatt 10440
atgtgaggta atgggctaca ctgaattggt agctcaaact gtcagtattt atgtatatga 10500
gtgtatattt tcgcataatc tcagaccaat ctgaagatga aatgggtatc tgggaatggc 10560
gaaatcaagg catcgatcgt gaagtttctc atctaagccc ccatttggac gtgaatgtag 10620
acacgtcgaa ataaagattt ccgaattaga ataatttgtt tattgctttc gcctataaat 10680
acgacggatc gtaatttgtc gttttatcaa aatgtacttt cattttataa taacgctgcg 10740
gacatctaca tttttgaatt gaaaaaaaat tggtaattac tctttctttt tctccatatt 10800
gaccatcata ctcattgctg atccatgtag atttcccgga catgaagcca tttacaattg 10860
aatatatcct gccgccgctg ccgctttgca cccggtggag cttgcatgtt ggtttctacg 10920
cagaactgag ccggttaggc agataatttc cattgagaac tgagccatgt gcaccttccc 10980
cccaacacgg tgagcgacgg ggcaacggag tgatccacat gggactttta aacatcatcc 11040
gtcggatggc gttgcgagag aagcagtcga tccgtgagat cagccgacgc accgggcagg 11100
cgcgcaacac gatcgcaaag tatttgaacg caggtacaat cgagccgacg ttcaccgtca 11160
ccctggatgc tgtaggcata ggcttggtta tgccggtact gccgggcctc ttgcgggata 11220
tcgtccattc cgacagcatc gccagtcact atggcgtgct gctagcgcta tatgcgttga 11280
tgcaatttct atgcgcaccc gttctcggag cactgtccga ccgctttggc cgccgcccag 11340
tcctgctcgc ttcgctactt ggagccacta tcgactacgc gatcatggcg accacacccg 11400
tcctgtggtc caacccctcc gctgctatag tgcagtcggc ttctgacgtt cagtgcagcc 11460
gtcttctgaa aacgacatgt cgcacaagtc ctaagttacg cgacaggctg ccgccctgcc 11520
cttttcctgg cgttttcttg tcgcgtgttt tagtcgcata aagtagaata cttgcgacta 11580
gaaccggaga cattacgcca tgaacaagag cgccgccgct ggcctgctgg gctatgcccg 11640
cgtcagcacc gacgaccagg acttgaccaa ccaacgggcc gaactgcacg cggccggctg 11700
caccaagctg ttttccgaga agatcaccgg caccaggcgc gaccgcccgg agctggccag 11760
gatgcttgac cacctacgcc ctggcgacgt tgtgacagtg accaggctag accgcctggc 11820
ccgcagcacc cgcgacctac tggacattgc cgagcgcatc caggaggccg gcgcgggcct 11880
gcgtagcctg gcagagccgt gggccgacac caccacgccg gccggccgca tggtgttgac 11940
cgtgttcgcc ggcattgccg agttcgagcg ttccctaatc atcgaccgca cccggagcgg 12000
gcgcgaggcc gccaaggccc gaggcgtgaa gtttggcccc cgccctaccc tcaccccggc 12060
acagatcgcg cacgcccgcg agctgatcga ccaggaaggc cgcaccgtga aagaggcggc 12120
tgcactgctt ggcgtgcatc gctcgaccct gtaccgcgca cttgagcgca gcgaggaagt 12180
gacgcccacc gaggccaggc ggcgcggtgc cttccgtgag gacgcattga ccgaggccga 12240
cgccctggcg gccgccgaga atgaacgcca agaggaacaa gcatgaaacc gcaccaggac 12300
ggccaggacg aaccgttttt cattaccgaa gagatcgagg cggagatgat cgcggccggg 12360
tacgtgttcg agccgcccgc gcacgtctca accgtgcggc tgcatgaaat cctggccggt 12420
ttgtctgatg ccaagctggc ggcctggccg gccagcttgg ccgctgaaga aaccgagcgc 12480
cgccgtctaa aaaggtgatg tgtatttgag taaaacagct tgcgtcatgc ggtcgctgcg 12540
tatatgatgc gatgagtaaa taaacaaata cgcaagggaa cgcatgaagt tatcgctgta 12600
cttaaccaga aaggcgggtc aggcaagacg accatcgcaa cccatctagc ccgcgccctg 12660
caactcgccg gggccgatgt tctgttagtc gattccgatc cccagggcag tgcccgcgat 12720
tgggcggccg tgcgggaaga tcaaccgcta accgttgtcg gcatcgaccg cccgacgatt 12780
gaccgcgacg tgaaggccat cggccggcgc gacttcgtag tgatcgacgg agcgccccag 12840
gcggcggact tggctgtgtc cgcgatcaag gcagccgact tcgtgctgat tccggtgcag 12900
ccaagccctt acgacatatg ggccaccgcc gacctggtgg agctggttaa gcagcgcatt 12960
gaggtcacgg atggaaggct acaagcggcc tttgtcgtgt cgcgggcgat caaaggcacg 13020
cgcatcggcg gtgaggttgc cgaggcgctg gccgggtacg agctgcccat tcttgagtcc 13080
cgtatcacgc agcgcgtgag ctacccaggc actgccgccg ccggcacaac cgttcttgaa 13140
tcagaacccg agggcgacgc tgcccgcgag gtccaggcgc tggccgctga aattaaatca 13200
aaactcattt gagttaatga ggtaaagaga aaatgagcaa aagcacaaac acgctaagtg 13260
ccggccgtcc gagcgcacgc agcagcaagg ctgcaacgtt ggccagcctg gcagacacgc 13320
cagccatgaa gcgggtcaac tttcagttgc cggcggagga tcacaccaag ctgaagatgt 13380
acgcggtacg ccaaggcaag accattaccg agctgctatc tgaatacatc gcgcagctac 13440
cagagtaaat gagcaaatga ataaatgagt agatgaattt tagcggctaa aggaggcggc 13500
atggaaaatc aagaacaacc aggcaccgac gccgtggaat gccccatgtg tggaggaacg 13560
ggcggttggc caggcgtaag cggctgggtt gtctgccggc cctgcaatgg cactggaacc 13620
cccaagcccg aggaatcggc gtgagcggtc gcaaaccatc cggcccggta caaatcggcg 13680
cggcgctggg tgatgacctg gtggagaagt tgaaggccgc gcaggccgcc cagcggcaac 13740
gcatcgaggc agaagcacgc cccggtgaat cgtggcaagc ggccgctgat cgaatccgca 13800
aagaatcccg gcaaccgccg gcagccggtg cgccgtcgat taggaagccg cccaagggcg 13860
acgagcaacc agattttttc gttccgatgc tctatgacgt gggcacccgc gatagtcgca 13920
gcatcatgga cgtggccgtt ttccgtctgt cgaagcgtga ccgacgagct ggcgaggtga 13980
tccgctacga gcttccagac gggcacgtag aggtttccgc agggccggcc ggcatggcca 14040
gtgtgtggga ttacgacctg gtactgatgg cggtttccca tctaaccgaa tccatgaacc 14100
gataccggga agggaaggga gacaagcccg gccgcgtgtt ccgtccacac gttgcggacg 14160
tactcaagtt ctgccggcga gccgatggcg gaaagcagaa agacgacctg gtagaaacct 14220
gcattcggtt aaacaccacg cacgttgcca tgcagcgtac gaagaaggcc aagaacggcc 14280
gcctggtgac ggtatccgag ggtgaagcct tgattagccg ctacaagatc gtaaagagcg 14340
aaaccgggcg gccggagtac atcgagatcg agctagctga ttggatgtac cgcgagatca 14400
cagaaggcaa gaacccggac gtgctgacgg ttcaccccga ttactttttg atcgatcccg 14460
gcatcggccg ttttctctac cgcctggcac gccgcgccgc aggcaaggca gaagccagat 14520
ggttgttcaa gacgatctac gaacgcagtg gcagcgccgg agagttcaag aagttctgtt 14580
tcaccgtgcg caagctgatc gggtcaaatg acctgccgga gtacgatttg aaggaggagg 14640
cggggcaggc tggcccgatc ctagtcatgc gctaccgcaa cctgatcgag ggcgaagcat 14700
ccgccggttc ctaatgtacg gagcagatgc tagggcaaat tgccctagca ggggaaaaag 14760
gtcgaaaagg tctctttcct gtggatagca cgtacattgg gaacccaaag ccgtacattg 14820
ggaaccggaa cccgtacatt gggaacccaa agccgtacat tgggaaccgg tcacacatgt 14880
aagtgactga tataaaagag aaaaaaggcg atttttccgc ctaaaactct ttaaaactta 14940
ttaaaactct taaaacccgc ctggcctgtg cataactgtc tggccagcgc acagccgaag 15000
agctgcaaaa agcgcctacc cttcggtcgc tgcgctccct acgccccgcc gcttcgcgtc 15060
ggcctatcgc ggccgctggc cgctcaaaaa tggctggcct acggccaggc aatctaccag 15120
ggcgcggaca agccgcgccg tcgccactcg accgccggcg cccacatcaa ggcaccctgc 15180
ctcgcgcgtt tcggtgatga cggtgaaaac ctctgacaca tgcagctccc ggagacggtc 15240
acagcttgtc tgtaagcgga tgccgggagc agacaagccc gtcagggcgc gtcagcgggt 15300
gttggcgggt gtcggggcgc agccatgacc cagtcacgta gcgatagcgg agtgtatact 15360
ggcttaacta tgcggcatca gagcagattg tactgagagt gcaccatatg cggtgtgaaa 15420
taccgcacag atgcgtaagg agaaaatacc gcatcaggcg ctcttccgct tcctcgctca 15480
ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg 15540
taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc 15600
agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc 15660
cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac 15720
tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc 15780
tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata 15840
gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc 15900
acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca 15960
acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag 16020
cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta 16080
gaaggacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg 16140
gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc 16200
agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt 16260
ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa 16320
ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat 16380
atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga 16440
tctgtctatt tcgttcatcc atagttgcct gactccccgt cgtgtagata actacgatac 16500
gggagggctt accatctggc cccagtgctg caatgatacc gcgagaccca cgctcaccgg 16560
ctccagattt atcagcaata aaccagccag ccggaagggc cgagcgcaga agtggtcctg 16620
caactttatc cgcctccatc cagtctatta attgttgccg ggaagctaga gtaagtagtt 16680
cgccagttaa tagtttgcgc aacgttgttg ccattgctac aggcatcgtg gtgtcacgct 16740
cgtcgtttgg tatggcttca ttcagctccg gttcccaacg atcaaggcga gttacatgat 16800
cccccatgtt gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt gtcagaagta 16860
agttggccgc agtgttatca ctcatggtta tggcagcact gcataattct cttactgtca 16920
tgccatccgt aagatgcttt tctgtgactg gtgagtactc aaccaagtca ttctgagaat 16980
agtgtatgcg gcgaccgagt tgctcttgcc cggcgtcaac acgggataat accgcgccac 17040
atagcagaac tttaaaagtg ctcatcattg gaaaagacct gcaggggggg gggggaaagc 17100
cacgttgtgt ctcaaaatct ctgatgttac attgcacaag ataaaaatat atcatcatga 17160
acaataaaac tgtctgctta cataaacagt aatacaaggg gtgttatgag ccatattcaa 17220
cgggaaacgt cttgctcgag gccgcgatta aattccaaca tggatgctga tttatatggg 17280
tataaatggg ctcgcgataa tgtcgggcaa tcaggtgcga caatctatcg attgtatggg 17340
aagcccgatg cgccagagtt gtttctgaaa catggcaaag gtagcgttgc caatgatgtt 17400
acagatgaga tggtcagact aaactggctg acggaattta tgcctcttcc gaccatcaag 17460
cattttatcc gtactcctga tgatgcatgg ttactcacca ctgcgatccc cgggaaaaca 17520
gcattccagg tattagaaga atatcctgat tcaggtgaaa atattgttga tgcgctggca 17580
gtgttcctgc gccggttgca ttcgattcct gtttgtaatt gtccttttaa cagcgatcgc 17640
gtatttcgtc tcgctcaggc gcaatcacga atgaataacg gtttggttga tgcgagtgat 17700
tttgatgacg agcgtaatgg ctggcctgtt gaacaagtct ggaaagaaat gcataagctt 17760
ttgccattct caccggattc agtcgtcact catggtgatt tctcacttga taaccttatt 17820
tttgacgagg ggaaattaat aggttgtatt gatgttggac gagtcggaat cgcagaccga 17880
taccaggatc ttgccatcct atggaactgc ctcggtgagt tttctccttc attacagaaa 17940
cggctttttc aaaaatatgg tattgataat cctgatatga ataaattgca gtttcatttg 18000
atgctcgatg agtttttcta atcagaattg gttaattggt tgtaacactg gcagagcatt 18060
acgctgactt gacgggacgg cggctttgtt gaataaatcg aacttttgct gagttgaagg 18120
atcagatcac gcatcttccc gacaacgcag accgttccgt ggcaaagcaa aagttcaaaa 18180
tcaccaactg gtccacctac aacaaagctc tcatcaaccg tggctccctc actttctggc 18240
tggatgatgg ggcgattcag gcctggtatg agtcagcaac accttcttca cgaggcagac 18300
ctcagcgccc ccccccccct gcaggtcaat tcggtcgata tggctattac gaagaaggct 18360
cgtgcgcgga gtcccgtgaa ctttcccacg caacaagtga accgcaccgg gtttgccgga 18420
ggccatttcg ttaaaatgcg cagc 18444
<210>2
<211>4291
<212>DNA
<213>人工序列
<220>
<223>载体
<400>2
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 60
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 120
gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 180
cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaata cgcgtaccgc 240
tagccaggaa gagtttgtag aaacgcaaaa aggccatccg tcaggatggc cttctgctta 300
gtttgatgcc tggcagttta tggcgggcgt cctgcccgcc accctccggg ccgttgcttc 360
acaacgttca aatccgctcc cggcggattt gtcctactca ggagagcgtt caccgacaaa 420
caacagataa aacgaaaggc ccagtcttcc gactgagcct ttcgttttat ttgatgcctg 480
gcagttccct actctcgcgt taacgctagc atggatgttt tcccagtcac gacgttgtaa 540
aacgacggcc agtcttaagc tcgggcccca aataatgatt ttattttgac tgatagtgac 600
ctgttcgttg caacacattg atgagcaatg cttttttata atgccaactt tgtacaaaaa 660
agctgaacga gaaacgtaaa atgatataaa tatcaatata ttaaattaga ttttgcataa 720
aaaacagact acataatact gtaaaacaca acatatccag tcactatgaa tcaactactt 780
agatggtatt agtgacctgt agtcgaccga cagccttcca aatgttcttc gggtgatgct 840
gccaacttag tcgaccgaca gccttccaaa tgttcttctc aaacggaatc gtcgtatcca 900
gcctactcgc tattgtcctc aatgccgtat taaatcataa aaagaaataa gaaaaagagg 960
tgcgagcctc ttttttgtgt gacaaaataa aaacatctac ctattcatat acgctagtgt 1020
catagtcctg aaaatcatct gcatcaagaa caatttcaca actcttatac ttttctctta 1080
caagtcgttc ggcttcatct ggattttcag cctctatact tactaaacgt gataaagttt 1140
ctgtaatttc tactgtatcg acctgcagac tggctgtgta taagggagcc tgacatttat 1200
attccccaga acatcaggtt aatggcgttt ttgatgtcat tttcgcggtg gctgagatca 1260
gccacttctt ccccgataac ggagaccggc acactggcca tatcggtggt catcatgcgc 1320
cagctttcat ccccgatatg caccaccggg taaagttcac gggagacttt atctgacagc 1380
agacgtgcac tggccagggg gatcaccatc cgtcgcccgg gcgtgtcaat aatatcactc 1440
tgtacatcca caaacagacg ataacggctc tctcttttat aggtgtaaac cttaaactgc 1500
atttcaccag cccctgttct cgtcagcaaa agagccgttc atttcaataa accgggcgac 1560
ctcagccatc ccttcctgat tttccgcttt ccagcgttcg gcacgcagac gacgggcttc 1620
attctgcatg gttgtgctta ccagaccgga gatattgaca tcatatatgc cttgagcaac 1680
tgatagctgt cgctgtcaac tgtcactgta atacgctgct tcatagcata cctctttttg 1740
acatacttcg ggtatacata tcagtatata ttcttatacc gcaaaaatca gcgcgcaaat 1800
acgcatactg ttatctggct tttagtaagc cggatccacg cggcgtttac gccccgccct 1860
gccactcatc gcagtactgt tgtaattcat taagcattct gccgacatgg aagccatcac 1920
agacggcatg atgaacctga atcgccagcg gcatcagcac cttgtcgcct tgcgtataat 1980
atttgcccat ggtgaaaacg ggggcgaaga agttgtccat attggccacg tttaaatcaa 2040
aactggtgaa actcacccag ggattggctg agacgaaaaa catattctca ataaaccctt 2100
tagggaaata ggccaggttt tcaccgtaac acgccacatc ttgcgaatat atgtgtagaa 2160
actgccggaa atcgtcgtgg tattcactcc agagcgatga aaacgtttca gtttgctcat 2220
ggaaaacggt gtaacaaggg tgaacactat cccatatcac cagctcaccg tctttcattg 2280
ccatacggaa ttccggatga gcattcatca ggcgggcaag aatgtgaata aaggccggat 2340
aaaacttgtg cttatttttc tttacggtct ttaaaaaggc cgtaatatcc agctgaacgg 2400
tctggttata ggtacattga gcaactgact gaaatgcctc aaaatgttct ttacgatgcc 2460
attgggatat atcaacggtg gtatatccag tgattttttt ctccatttta gcttccttag 2520
ctcctgaaaa tctcgataac tcaaaaaata cgcccggtag tgatcttatt tcattatggt 2580
gaaagttgga acctcttacg tgccgatcaa cgtctcattt tcgccaaaag ttggcccagg 2640
gcttcccggt atcaacaggg acaccaggat ttatttattc tgcgaagtga tcttccgtca 2700
caggtattta ttcggcgcaa agtgcgtcgg gtgatgctgc caacttagtc gactacaggt 2760
cactaatacc atctaagtag ttgattcata gtgactggat atgttgtgtt ttacagtatt 2820
atgtagtctg ttttttatgc aaaatctaat ttaatatatt gatatttata tcattttacg 2880
tttctcgttc agctttcttg tacaaagttg gcattataag aaagcattgc ttatcaattt 2940
gttgcaacga acaggtcact atcagtcaaa ataaaatcat tatttgccat ccagctgata 3000
tcccctatag tgagtcgtat tacatggtca tagctgtttc ctggcagctc tggcccgtgt 3060
ctcaaaatct ctgatgttac attgcacaag ataaaataat atcatcatga tcagtcctgc 3120
tcctcggcca cgaagtgcac gcagttgccg gccgggtcgc gcagggcgaa ctcccgcccc 3180
cacggctgct cgccgatctc ggtcatggcc ggcccggagg cgtcccggaa gttcgtggac 3240
acgacctccg accactcggc gtacagctcg tccaggccgc gcacccacac ccaggccagg 3300
gtgttgtccg gcaccacctg gtcctggacc gcgctgatga acagggtcac gtcgtcccgg 3360
accacaccgg cgaagtcgtc ctccacgaag tcccgggaga acccgagccg gtcggtccag 3420
aactcgaccg ctccggcgac gtcgcgcgcg gtgagcaccg gaacggcact ggtcaacttg 3480
gccatggttt agttcctcac cttgtcgtat tatactatgc cgatatacta tgccgatgat 3540
taattgtcaa cacgtgctga tcatgaccaa aatcccttaa cgtgagttac gcgtcgttcc 3600
actgagcgtc agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc 3660
gcgtaatctg ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg 3720
atcaagagct accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa 3780
atactgttct tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc 3840
ctacatacct cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt 3900
gtcttaccgg gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa 3960
cggggggttc gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc 4020
tacagcgtga gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc 4080
cggtaagcgg cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct 4140
ggtatcttta tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat 4200
gctcgtcagg ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc 4260
tggccttttg ctggcctttt gctcacatgt t 4291
<210>3
<211>4762
<212>DNA
<213>人工序列
<220>
<223>载体
<400>3
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 60
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 120
gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 180
cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaata cgcgtaccgc 240
tagccaggaa gagtttgtag aaacgcaaaa aggccatccg tcaggatggc cttctgctta 300
gtttgatgcc tggcagttta tggcgggcgt cctgcccgcc accctccggg ccgttgcttc 360
acaacgttca aatccgctcc cggcggattt gtcctactca ggagagcgtt caccgacaaa 420
caacagataa aacgaaaggc ccagtcttcc gactgagcct ttcgttttat ttgatgcctg 480
gcagttccct actctcgcgt taacgctagc atggatgttt tcccagtcac gacgttgtaa 540
aacgacggcc agtcttaagc tcgggcccca aataatgatt ttattttgac tgatagtgac 600
ctgttcgttg caacacattg atgagcaatg cttttttata atgccaactt tgtacaaaaa 660
agctgaacga gaaacgtaaa atgatataaa tatcaatata ttaaattaga ttttgcataa 720
aaaacagact acataatact gtaaaacaca acatatccag tcactatgaa tcaactactt 780
agatggtatt agtgacctgt agtcgaccga cagccttcca aatgttcttc gggtgatgct 840
gccaacttag tcgaccgaca gccttccaaa tgttcttctc aaacggaatc gtcgtatcca 900
gcctactcgc tattgtcctc aatgccgtat taaatcataa aaagaaataa gaaaaagagg 960
tgcgagcctc ttttttgtgt gacaaaataa aaacatctac ctattcatat acgctagtgt 1020
catagtcctg aaaatcatct gcatcaagaa caatttcaca actcttatac ttttctctta 1080
caagtcgttc ggcttcatct ggattttcag cctctatact tactaaacgt gataaagttt 1140
ctgtaatttc tactgtatcg acctgcagac tggctgtgta taagggagcc tgacatttat 1200
attccccaga acatcaggtt aatggcgttt ttgatgtcat tttcgcggtg gctgagatca 1260
gccacttctt ccccgataac ggagaccggc acactggcca tatcggtggt catcatgcgc 1320
cagctttcat ccccgatatg caccaccggg taaagttcac gggagacttt atctgacagc 1380
agacgtgcac tggccagggg gatcaccatc cgtcgcccgg gcgtgtcaat aatatcactc 1440
tgtacatcca caaacagacg ataacggctc tctcttttat aggtgtaaac cttaaactgc 1500
atttcaccag cccctgttct cgtcagcaaa agagccgttc atttcaataa accgggcgac 1560
ctcagccatc ccttcctgat tttccgcttt ccagcgttcg gcacgcagac gacgggcttc 1620
attctgcatg gttgtgctta ccagaccgga gatattgaca tcatatatgc cttgagcaac 1680
tgatagctgt cgctgtcaac tgtcactgta atacgctgct tcatagcata cctctttttg 1740
acatacttcg ggtatacata tcagtatata ttcttatacc gcaaaaatca gcgcgcaaat 1800
acgcatactg ttatctggct tttagtaagc cggatccacg cggcgtttac gccccgccct 1860
gccactcatc gcagtactgt tgtaattcat taagcattct gccgacatgg aagccatcac 1920
agacggcatg atgaacctga atcgccagcg gcatcagcac cttgtcgcct tgcgtataat 1980
atttgcccat ggtgaaaacg ggggcgaaga agttgtccat attggccacg tttaaatcaa 2040
aactggtgaa actcacccag ggattggctg agacgaaaaa catattctca ataaaccctt 2100
tagggaaata ggccaggttt tcaccgtaac acgccacatc ttgcgaatat atgtgtagaa 2160
actgccggaa atcgtcgtgg tattcactcc agagcgatga aaacgtttca gtttgctcat 2220
ggaaaacggt gtaacaaggg tgaacactat cccatatcac cagctcaccg tctttcattg 2280
ccatacggaa ttccggatga gcattcatca ggcgggcaag aatgtgaata aaggccggat 2340
aaaacttgtg cttatttttc tttacggtct ttaaaaaggc cgtaatatcc agctgaacgg 2400
tctggttata ggtacattga gcaactgact gaaatgcctc aaaatgttct ttacgatgcc 2460
attgggatat atcaacggtg gtatatccag tgattttttt ctccatttta gcttccttag 2520
ctcctgaaaa tctcgataac tcaaaaaata cgcccggtag tgatcttatt tcattatggt 2580
gaaagttgga acctcttacg tgccgatcaa cgtctcattt tcgccaaaag ttggcccagg 2640
gcttcccggt atcaacaggg acaccaggat ttatttattc tgcgaagtga tcttccgtca 2700
caggtattta ttcggcgcaa agtgcgtcgg gtgatgctgc caacttagtc gactacaggt 2760
cactaatacc atctaagtag ttgattcata gtgactggat atgttgtgtt ttacagtatt 2820
atgtagtctg ttttttatgc aaaatctaat ttaatatatt gatatttata tcattttacg 2880
tttctcgttc agctttcttg tacaaagttg gcattataag aaagcattgc ttatcaattt 2940
gttgcaacga acaggtcact atcagtcaaa ataaaatcat tatttgccat ccagctgata 3000
tcccctatag tgagtcgtat tacatggtca tagctgtttc ctggcagctc tggcccgtgt 3060
ctcaaaatct ctgatgttac attgcacaag ataaaataat atcatcatga acaataaaac 3120
tgtctgctta cataaacagt aatacaaggg gtgttatgag ccatattcaa cgggaaacgt 3180
cgaggccgcg attaaattcc aacatggatg ctgatttata tgggtataaa tgggctcgcg 3240
ataatgtcgg gcaatcaggt gcgacaatct atcgcttgta tgggaagccc gatgcgccag 3300
agttgtttct gaaacatggc aaaggtagcg ttgccaatga tgttacagat gagatggtca 3360
gactaaactg gctgacggaa tttatgcctc ttccgaccat caagcatttt atccgtactc 3420
ctgatgatgc atggttactc accactgcga tccccggaaa aacagcattc caggtattag 3480
aagaatatcc tgattcaggt gaaaatattg ttgatgcgct ggcagtgttc ctgcgccggt 3540
tgcattcgat tcctgtttgt aattgtcctt ttaacagcga tcgcgtattt cgtctcgctc 3600
aggcgcaatc acgaatgaat aacggtttgg ttgatgcgag tgattttgat gacgagcgta 3660
atggctggcc tgttgaacaa gtctggaaag aaatgcataa acttttgcca ttctcaccgg 3720
attcagtcgt cactcatggt gatttctcac ttgataacct tatttttgac gaggggaaat 3780
taataggttg tattgatgtt ggacgagtcg gaatcgcaga ccgataccag gatcttgcca 3840
tcctatggaa ctgcctcggt gagttttctc cttcattaca gaaacggctt tttcaaaaat 3900
atggtattga taatcctgat atgaataaat tgcagtttca tttgatgctc gatgagtttt 3960
tctaatcaga attggttaat tggttgtaac actggcagag cattacgctg acttgacggg 4020
acggcgcaag ctcatgacca aaatccctta acgtgagtta cgcgtcgttc cactgagcgt 4080
cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct 4140
gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc 4200
taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc 4260
ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc 4320
tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg 4380
ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt 4440
cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg 4500
agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg 4560
gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt 4620
atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag 4680
gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt 4740
gctggccttt tgctcacatg tt 4762
<210>4
<211>16843
<212>DNA
<213>人工序列
<220>
<223>载体
<400>4
ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca aacgcgccag 60
aaacgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga tacctcgcgg 120
aaaacttggc cctcactgac agatgagggg cggacgttga cacttgaggg gccgactcac 180
ccggcgcggc gttgacagat gaggggcagg ctcgatttcg gccggcgacg tggagctggc 240
cagcctcgca aatcggcgaa aacgcctgat tttacgcgag tttcccacag atgatgtgga 300
caagcctggg gataagtgcc ctgcggtatt gacacttgag gggcgcgact actgacagat 360
gaggggcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg gcgcacctat 420
tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt ttccgcccgt 480
ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt ataaaccttg 540
tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gccgaagggg ggtgcccccc 600
cttctcgaac cctcccggcc cgctaacgcg ggcctcccat ccccccaggg gctgcgcccc 660
tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtccttgc cattgccggg 720
atcggggcag taacgggatg ggcgatcagc ccgagcgcga cgcccggaag cattgacgtg 780
ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg cggcggcctg 840
ggtggcggcc tgcccttcac ttcggccgtc ggggcattca cggacttcat ggcggggccg 900
gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct cgtgttcggg 960
ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg aggtatgaaa 1020
acgagaattg gacctttaca gaattactct atgaagcgcc atatttaaaa agctaccaag 1080
acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac aatactgata 1140
agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc 1200
ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 1260
ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca taattgggta 1320
atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcatgc 1380
agctccaccg attttgagaa cgacagcgac ttccgtccca gccgtgccag gtgctgcctc 1440
agattcaggt tatgccgctc aattcgctgc gtatatcgct tgctgattac gtgcagcttt 1500
cccttcaggc gggattcata cagcggccag ccatccgtca tccatatcac cacgtcaaag 1560
ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc gaatacgtgc 1620
gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattta 1680
gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc 1740
tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 1800
ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg gccatatcaa 1860
tgattttctg gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt tgccatgttt 1920
tacggcagtg agagcagaga tagcgctgat gtccggcggt gcttttgccg ttacgcacca 1980
ccccgtcagt agctgaacag gagggacagc tgatagacac agaagccact ggagcacctc 2040
aaaaacacca tcatacacta aatcagtaag ttggcagcat cacccataat tgtggtttca 2100
aaatcggctc cgtcgatact atgttatacg ccaactttga aaacaacttt gaaaaagctg 2160
ttttctggta tttaaggttt tagaatgcaa ggaacagtga attggagttc gtcttgttat 2220
aattagcttc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat aataaatggc 2280
taaaatgaga atatcaccgg aattgaaaaa actgatcgaa aaataccgct gcgtaaaaga 2340
tacggaagga atgtctcctg ctaaggtata taagctggtg ggagaaaatg aaaacctata 2400
tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac gggaaaagga 2460
catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcact ttgaacggca 2520
tgatggctgg agcaatctgc tcatgagtga ggccgatggc gtcctttgct cggaagagta 2580
tgaagatgaa caaagccctg aaaagattat cgagctgtat gcggagtgca tcaggctctt 2640
tcactccatc gacatatcgg attgtcccta tacgaatagc ttagacagcc gcttagccga 2700
attggattac ttactgaata acgatctggc cgatgtggat tgcgaaaact gggaagaaga 2760
cactccattt aaagatccgc gcgagctgta tgatttttta aagacggaaa agcccgaaga 2820
ggaacttgtc ttttcccacg gcgacctggg agacagcaac atctttgtga aagatggcaa 2880
agtaagtggc tttattgatc ttgggagaag cggcagggcg gacaagtggt atgacattgc 2940
cttctgcgtc cggtcgatca gggaggatat cggggaagaa cagtatgtcg agctattttt 3000
tgacttactg gggatcaagc ctgattggga gaaaataaaa tattatattt tactggatga 3060
attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag cgcaccgact 3120
tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat ttgggcaagg 3180
ggtcgctggt attcgtgcag ggcaagattc ggaataccaa gtacgagaag gacggccaga 3240
cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 3300
gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 3360
gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 3420
ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 3480
aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc gagcgcgaca 3540
gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 3600
gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660
tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720
agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780
tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3840
ccctgttcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggtcattt 3900
tccacgtcaa caaggacgtg aagatcacct acaccggcgt cgagctgcgg gccgacgatg 3960
acgaactggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc ggcgagccga 4020
tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat ggccggtatt 4080
acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg 4140
accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200
gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg 4260
gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4320
ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg gaaaccttcc 4380
gcctcatgtg cggatcggat tccacccgcg tgaagaagtg gcgcgagcag gtcggcgaag 4440
cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat gatgacctgg 4500
tgcattgcaa acgctagggc cttgtggggt cagttccggc tgggggttca gcagccagcg 4560
ctttactggc atttcaggaa caagcgggca ctgctcgacg cacttgcttc gctcagtatc 4620
gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa aattgacaat 4680
tgtgattaag gctcagattc gacggcttgg agcggccgac gtgcaggatt tccgcgagat 4740
ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg agcacgagga 4800
gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat tcggcgccta 4860
catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc ccaaggacgc 4920
tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtcgc 4980
cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg tccgacagat 5040
tccaacggga atctggtgga tgcgcatctt catcctcggc gcacttaata tttcgctatt 5100
ctggagcttg ttgtttattt cggtctaccg cctgccgggc ggggtcgcgg cgacggtagg 5160
cgctgtgcag ccgctgatgg tcgtgttcat ctctgccgct ctgctaggta gcccgatacg 5220
attgatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt tggtgttgac 5280
accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtttccat 5340
ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400
cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 5460
gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg gcctgatcgg 5520
agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac ctacagttgt 5580
ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga tgcatcaggc 5640
cgacagtcgg aacttcgggt ccccgacctg taccattcgg tgagcaatgg ataggggagt 5700
tgatatcgtc aacgttcact tctaaagaaa tagcgccact cagcttcctc agcggcttta 5760
tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgt cacggttaag 5820
cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5880
caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940
gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6000
tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060
cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120
tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 6180
taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240
accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300
aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360
ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta aagaacgtgg 6420
actccaacgt caaagggcga aaaaccgtct atcagggcga tggcccacta cctgtatggc 6480
cgcattcgca aaacacacct agactagatt tgttttgcta acccaattga tattaattat 6540
atatgattaa tatttatatg tatatggatt tggttaatga aatgcatctg gttcatcaaa 6600
gaattataaa gacacgtgac attcatttag gataagaaat atggatgatc tctttctctt 6660
ttattcagat aactagtaat tacacataac acacaacttt gatgcccaca ttatagtgat 6720
tagcatgtca ctatgtgtgc atccttttat ttcatacatt aattaagttg gccaatccag 6780
aagatggaca agtctaggtt aaccatgtgg tacctacgcg ttcgaatatc catgggccgc 6840
ttcaggccag ggcgctgggg aaggcgatgg cgtgctcggt cagctgccac ttctggttct 6900
tggcgtcgct ccggtcctcc cgcagcagct tgtgctggat gaagtgccac tcgggcatct 6960
tgctgggcac gctcttggcc ttgtacacgg tgtcgaactg gcaccggtac cggccgccgt 7020
ccttcagcag caggtacatg ctcacgtcgc ccttcaggat gccctgctta ggcacgggca 7080
tgatcttctc gcagctggcc tcccagttgg tggtcatctt cttcatcacg gggccgtcgg 7140
cggggaagtt cacgccgttg aagatgctct tgtggtagat gcagttctcc ttcacgctca 7200
cggtgatgtc cacgttacag atgcacacgg cgccgtcctc gaacaggaag ctccggcccc 7260
aggtgtagcc ggcggggcag ctgttcttga agtagtccac gatgtcctgg gggtactcgg 7320
tgaagatccg gtcgccgtac ttgaagccgg cgctcaggat gtcctcgctg aagggcaggg 7380
ggccgccctc gatcacgcac aggttgatgg tctgcttgcc cttgaagggg tagccgatgc 7440
cctcgccggt gatcacgaac ttgtggccgt tcacgcagcc ctccatgtgg tacttcatgg 7500
tcatctcctc cttcaggccg tgcttgctgt gggccatggt ggcgaccggt gaattcgagc 7560
tcggtacccg gggatcctga gtaaaacaga ggagggtctc actaagttta tagagagact 7620
gagagagata aagggacacg tatgaagcgt ctgttttcgt ggtgtgacgt caaagtcatt 7680
ttgctctcta cgcgtgtctg tgtcggcttg atcttttttt ttgctttttg gaactcatgt 7740
cggtagtata tcttttattt attttttctt tttttccctt ttctttcaaa ctgatgtcgg 7800
tatgatattt attccatcct aaaatgtaac ttactattat tagtagtcgg tccatgtcta 7860
ttggcccatc atgtggtcat tttacgttta cgtcgtgtgg ctgtttatta taacaaacgg 7920
cacatccttc tcattcgaat tgtatttctc cttaatcgtt ctaataggta tgatctttta 7980
ttttatacgt aaaattaaaa ttgaatgatg tcaagaacga aaattaattt gtatttacaa 8040
aggagctaaa tattgtttat tcctctactg gtagaagata aaagaagtag atgaaataat 8100
gatcttacta gagaatattc ctcatttaca ctagtcaaat ggaaatcttg taaactttta 8160
caataattta tcctgaaaat atgaaaaaat agaagaaaat gtttacctcc tctctcctct 8220
taattcacct acgatcggtg cgggcctctt cgctattacg ccagctggcg aaagggggat 8280
gtgctgcaag gcgattaagt tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa 8340
cgacggccag tgaattcgag ctcggtaccc ggggatcctc tagagtcgac ctgcaggcat 8400
gcaagcttgt tgaaacatcc ctgaagtgtc tcattttatt ttatttattc tttgctgata 8460
aaaaaataaa ataaaagaag ctaagcacac ggtcaaccat tgctctactg ctaaaagggt 8520
tatgtgtagt gttttactgc ataaattatg cagcaaacaa gacaactcaa attaaaaaat 8580
ttcctttgct tgtttttttg ttgtctctga cttgactttc ttgtggaagt tggttgtata 8640
aggattggga cacaccattg tccttcttaa tttaatttta tttctttgct gataaaaaaa 8700
aaaaatttca tatagtgtta aataataatt tgttaaataa ccaaaaagtc aaatatgttt 8760
actctcgttt aaataattga gagtcgtcca gcaaggctaa acgattgtat agatttatga 8820
caatatttac ttttttatag ataaatgtta tattataata aatttatata catatattat 8880
atgttattta ttatttatta ttattttaaa tccttcaata ttttatcaaa ccaactcata 8940
attttttttt tatctgtaag aagcaataaa attaaataga cccactttaa ggatgatcca 9000
acctttatac agagtaagag agttcaaata gtaccctttc atatacatat caactaaaat 9060
attagaaata tcatggatca aaccttataa agacattaaa taagtggata agtataatat 9120
ataaatgggt agtatataat atataaatgg atacaaactt ctctctttat aattgttatg 9180
tctccttaac atcctaatat aatacataag tgggtaatat ataatatata aatggagaca 9240
aacttcttcc attataattg ttatgtcttc ttaacactta tgtctcgttc acaatgctaa 9300
agttagaatt gtttagaaag tcttatagta cacatttgtt tttgtactat ttgaagcatt 9360
ccataagccg tcacgattca gatgatttat aataataaga ggaaatttat catagaacaa 9420
taaggtgcat agatagagtg ttaatatatc ataacatcct ttgtttattc atagaagaag 9480
tgagatggag ctcagttatt atactgttac atggtcggat acaatattcc atgctctcca 9540
tgagctctta cacctacatg cattttagtt catacttcat gcacgtggcc atcacagcta 9600
gctgcagcta catatttaca ttttacaaca ccaggagaac tgccctgtta gtgcataaca 9660
atcagaagat ggccgtggct actcgagtta tcgaaccact ttgtacaaga aagctgaacg 9720
agaaacgtaa aatgatataa atatcaatat attaaattag attttgcata aaaaacagac 9780
tacataatac tgtaaaacac aacatatcca gtcactatgg tcgacctgca gactggctgt 9840
gtataaggga gcctgacatt tatattcccc agaacatcag gttaatggcg tttttgatgt 9900
cattttcgcg gtggctgaga tcagccactt cttccccgat aacggagacc ggcacactgg 9960
ccatatcggt ggtcatcatg cgccagcttt catccccgat atgcaccacc gggtaaagtt 10020
cacgggagac tttatctgac agcagacgtg cactggccag ggggatcacc atccgtcgcc 10080
cgggcgtgtc aataatatca ctctgtacat ccacaaacag acgataacgg ctctctcttt 10140
tataggtgta aaccttaaac tgcatttcac cagtccctgt tctcgtcagc aaaagagccg 10200
ttcatttcaa taaaccgggc gacctcagcc atcccttcct gattttccgc tttccagcgt 10260
tcggcacgca gacgacgggc ttcattctgc atggttgtgc ttaccagacc ggagatattg 10320
acatcatata tgccttgagc aactgatagc tgtcgctgtc aactgtcact gtaatacgct 10380
gcttcatagc acacctcttt ttgacatact tcgggtatac atatcagtat atattcttat 10440
accgcaaaaa tcagcgcgca aatacgcata ctgttatctg gcttttagta agccggatcc 10500
tctagattac gccccgccct gccactcatc gcagtactgt tgtaattcat taagcattct 10560
gccgacatgg aagccatcac agacggcatg atgaacctga atcgccagcg gcatcagcac 10620
cttgtcgcct tgcgtataat atttgcccat ggtgaaaacg ggggcgaaga agttgtccat 10680
attggccacg tttaaatcaa aactggtgaa actcacccag ggattggctg agacgaaaaa 10740
catattctca ataaaccctt tagggaaata ggccaggttt tcaccgtaac acgccacatc 10800
ttgcgaatat atgtgtagaa actgccggaa atcgtcgtgg tattcactcc agagcgatga 10860
aaacgtttca gtttgctcat ggaaaacggt gtaacaaggg tgaacactat cccatatcac 10920
cagctcaccg tctttcattg ccatacggaa ttccggatga gcattcatca ggcgggcaag 10980
aatgtgaata aaggccggat aaaacttgtg cttatttttc tttacggtct ttaaaaaggc 11040
cgtaatatcc agctgaacgg tctggttata ggtacattga gcaactgact gaaatgcctc 11100
aaaatgttct ttacgatgcc attgggatat atcaacggtg gtatatccag tgattttttt 11160
ctccatttta gcttccttag ctcctgaaaa tctcgccgga tcctaactca aaatccacac 11220
attatacgag ccggaagcat aaagtgtaaa gcctggggtg cctaatgcgg ccgccatagt 11280
gactggatat gttgtgtttt acagtattat gtagtctgtt ttttatgcaa aatctaattt 11340
aatatattga tatttatatc attttacgtt tctcgttcag cttttttgta caaacttgtt 11400
tgataaccgg tactagtgtg cacgtcgagc gtgtcctctc caaatgaaat gaacttcctt 11460
atatagagga agggtcttgc gaaggatagt gggattgtgc gtcatccctt acgtcagtgg 11520
agatgtcaca tcaatccact tgctttgaag acgtggttgg aacgtcttct ttttccacga 11580
tgctcctcgt gggtgggggt ccatctttgg gaccactgtc ggcagaggca tcttgaatga 11640
tagcctttcc tttatcgcaa tgatggcatt tgtaggagcc accttccttt tctactgtcc 11700
tttcgatgaa gtgacagata gctgggcaat ggaatccgag gaggtttccc gaaattatcc 11760
tttgttgaaa agtctcaata gccctttggt cttctgagac tgtatctttg acatttttgg 11820
agtagaccag agtgtcgtgc tccaccatgt tgacgaagat tttcttcttg tcattgagtc 11880
gtaaaagact ctgtatgaac tgttcgccag tcttcacggc gagttctgtt agatcctcga 11940
tttgaatctt agactccatg catggcctta gattcagtag gaactacctt tttagagact 12000
ccaatctcta ttacttgcct tggtttatga agcaagcctt gaatcgtcca tactggaata 12060
gtacttctga tcttgagaaa tatgtctttc tctgtgttct tgatgcaatt agtcctgaat 12120
cttttgactg catctttaac cttcttggga aggtatttga tctcctggag attgttactc 12180
gggtagatcg tcttgatgag acctgctgcg taggcctctc taaccatctg tgggtcagca 12240
ttctttctga aattgaagag gctaaccttc tcattatcag tggtgaacat agtgtcgtca 12300
ccttcacctt cgaacttcct tcctagatcg taaagataga ggaaatcgtc cattgtaatc 12360
tccggggcaa aggagatctc ttttggggct ggatcactgc tgggcctttt ggttcctagc 12420
gtgagccagt gggctttttg ctttggtggg cttgttaggg ccttagcaaa gctcttgggc 12480
ttgagttgag cttctccttt ggggatgaag ttcaacctgt ctgtttgctg acttgttgtg 12540
tacgcgtcag ctgctgctct tgcctctgta atagtggcaa atttcttgtg tgcaactccg 12600
ggaacgccgt ttgttgccgc ctttgtacaa ccccagtcat cgtatatacc ggcatgtgga 12660
ccgttataca caacgtagta gttgatatga gggtgttgaa tacccgattc tgctctgaga 12720
ggagcaactg tgctgttaag ctcagatttt tgtgggattg gaattggatc ctctagagca 12780
aagcttggcg taatcatggt catagctgtt tcctgtgtga aattgttatc cgctcacaat 12840
tccacacaac atacgagccg gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag 12900
ctaactcaca ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg 12960
ccagctgcat taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta ttgggccaaa 13020
gacaaaaggg cgacattcaa ccgattgagg gagggaaggt aaatattgac ggaaattatt 13080
cattaaaggt gaattatcac cgtcaccgac ttgagccatt tgggaattag agccagcaaa 13140
atcaccagta gcaccattac cattagcaag gccggaaacg tcaccaatga aaccatcatc 13200
tagtaacata gatgacaccg cgcgcgataa tttatcctag tttgcgcgct atattttgtt 13260
ttctatcgcg tattaaatgt ataattgcgg gactctaatc ataaaaaccc atctcataaa 13320
taacgtcatg cattacatgt taattattac atgcttaacg taattcaaca gaaattatat 13380
gataatcatc gcaagaccgg caacaggatt caatcttaag aaactttatt gccaaatgtt 13440
tgaacgatct gcttcgacgc actccttctt taggtacgga ctagatctcg gtgacgggca 13500
ggaccggacg gggcggtacc ggcaggctga agtccagctg ccagaaaccc acgtcatgcc 13560
agttcccgtg cttgaagccg gccgcccgca gcatgccgcg gggggcatat ccgagcgcct 13620
cgtgcatgcg cacgctcggg tcgttgggca gcccgatgac agcgaccacg ctcttgaagc 13680
cctgtgcctc cagggacttc agcaggtggg tgtagagcgt ggagcccagt cccgtccgct 13740
ggtggcgggg ggagacgtac acggtcgact cggccgtcca gtcgtaggcg ttgcgtgcct 13800
tccaggggcc cgcgtaggcg atgccggcga cctcgccgtc cacctcggcg acgagccagg 13860
gatagcgctc ccgcagacgg acgaggtcgt ccgtccactc ctgcggttcc tgcggctcgg 13920
tacggaagtt gaccgtgctt gtctcgatgt agtggttgac gatggtgcag accgccggca 13980
tgtccgcctc ggtggcacgg cggatgtcgg ccgggcgtcg ttctgggctc atggatctgg 14040
attgagagtg aatatgagac tctaattgga taccgagggg aatttatgga acgtcagtgg 14100
agcatttttg acaagaaata tttgctagct gatagtgacc ttaggcgact tttgaacgcg 14160
caataatggt ttctgacgta tgtgcttagc tcattaaact ccagaaaccc gcggctgagt 14220
ggctccttca acgttgcggt tctgtcagtt ccaaacgtaa aacggcttgt cccgcgtcat 14280
cggcgggggt cataacgtga ctcccttaat tctccgctca tgatcagatt gtcgtttccc 14340
gccttcagtt taaactatca gtgtttgaca ggatatattg gcgggtaaac ctaagagaaa 14400
agagcgttta ttagaataat cggatattta aaagggcgtg aaaaggttta tccgttcgtc 14460
catttgtatg tgcatgccaa ccacagggtt ccccagatct ggcgccggcc agcgagacga 14520
gcaagattgg ccgccgcccg aaacgatccg acagcgcgcc cagcacaggt gcgcaggcaa 14580
attgcaccaa cgcatacagc gccagcagaa tgccatagtg ggcggtgacg tcgttcgagt 14640
gaaccagatc gcgcaggagg cccggcagca ccggcataat caggccgatg ccgacagcgt 14700
cgagcgcgac agtgctcaga attacgatca ggggtatgtt gggtttcacg tctggcctcc 14760
ggaccagcct ccgctggtcc gattgaacgc gcggattctt tatcactgat aagttggtgg 14820
acatattatg tttatcagtg ataaagtgtc aagcatgaca aagttgcagc cgaatacagt 14880
gatccgtgcc gccctggacc tgttgaacga ggtcggcgta gacggtctga cgacacgcaa 14940
actggcggaa cggttggggg ttcagcagcc ggcgctttac tggcacttca ggaacaagcg 15000
ggcgctgctc gacgcactgg ccgaagccat gctggcggag aatcatacgc attcggtgcc 15060
gagagccgac gacgactggc gctcatttct gatcgggaat gcccgcagct tcaggcaggc 15120
gctgctcgcc taccgcgatg gcgcgcgcat ccatgccggc acgcgaccgg gcgcaccgca 15180
gatggaaacg gccgacgcgc agcttcgctt cctctgcgag gcgggttttt cggccgggga 15240
cgccgtcaat gcgctgatga caatcagcta cttcactgtt ggggccgtgc ttgaggagca 15300
ggccggcgac agcgatgccg gcgagcgcgg cggcaccgtt gaacaggctc cgctctcgcc 15360
gctgttgcgg gccgcgatag acgccttcga cgaagccggt ccggacgcag cgttcgagca 15420
gggactcgcg gtgattgtcg atggattggc gaaaaggagg ctcgttgtca ggaacgttga 15480
aggaccgaga aagggtgacg attgatcagg accgctgccg gagcgcaacc cactcactac 15540
agcagagcca tgtagacaac atcccctccc cctttccacc gcgtcagacg cccgtagcag 15600
cccgctacgg gctttttcat gccctgccct agcgtccaag cctcacggcc gcgctcggcc 15660
tctctggcgg ccttctggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc 15720
gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa 15780
tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt 15840
aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa 15900
aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt 15960
ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg 16020
tccgcctttc tcccttcggg aagcgtggcg cttttccgct gcataaccct gcttcggggt 16080
cattatagcg attttttcgg tatatccatc ctttttcgca cgatatacag gattttgcca 16140
aagggttcgt gtagactttc cttggtgtat ccaacggcgt cagccgggca ggataggtga 16200
agtaggccca cccgcgagcg ggtgttcctt cttcactgtc ccttattcgc acctggcggt 16260
gctcaacggg aatcctgctc tgcgaggctg gccggctacc gccggcgtaa cagatgaggg 16320
caagcggatg gctgatgaaa ccaagccaac caggaagggc agcccaccta tcaaggtgta 16380
ctgccttcca gacgaacgaa gagcgattga ggaaaaggcg gcggcggccg gcatgagcct 16440
gtcggcctac ctgctggccg tcggccaggg ctacaaaatc acgggcgtcg tggactatga 16500
gcacgtccgc gagctggccc gcatcaatgg cgacctgggc cgcctgggcg gcctgctgaa 16560
actctggctc accgacgacc cgcgcacggc gcggttcggt gatgccacga tcctcgccct 16620
gctggcgaag atcgaagaga agcaggacga gcttggcaag gtcatgatgg gcgtggtccg 16680
cccgagggca gagccatgac ttttttagcc gctaaaacgg ccggggggtg cgcgtgattg 16740
ccaagcacgt ccccatgcgc tccatcaaga agagcgactt cgcggagctg gtgaagtaca 16800
tcaccgacga gcaaggcaag accgagcgcc tttgcgacgc tca 16843
<210>5
<211>9142
<212>DNA
<213>人工序列
<220>
<223>载体
<400>5
ctagttatct gaataaaaga gaaagagatc atccatattt cttatcctaa atgaatgtca 60
cgtgtcttta taattctttg atgaaccaga tgcatttcat taaccaaatc catatacata 120
taaatattaa tcatatataa ttaatatcaa ttgggttagc aaaacaaatc tagtctaggt 180
gtgttttgcg aattcgatat caagcttgat gggtaccggc gcgcccgatc atccggatat 240
agttcctcct ttcagcaaaa aacccctcaa gacccgttta gaggccccaa ggggttatgc 300
tagttattgc tcagcggtgg cagcagccaa ctcagcttcc tttcgggctt tgttagcagc 360
cggatcgatc caagctgtac ctcactattc ctttgccctc ggacgagtgc tggggcgtcg 420
gtttccacta tcggcgagta cttctacaca gccatcggtc cagacggccg cgcttctgcg 480
ggcgatttgt gtacgcccga cagtcccggc tccggatcgg acgattgcgt cgcatcgacc 540
ctgcgcccaa gctgcatcat cgaaattgcc gtcaaccaag ctctgataga gttggtcaag 600
accaatgcgg agcatatacg cccggagccg cggcgatcct gcaagctccg gatgcctccg 660
ctcgaagtag cgcgtctgct gctccataca agccaaccac ggcctccaga agaagatgtt 720
ggcgacctcg tattgggaat ccccgaacat cgcctcgctc cagtcaatga ccgctgttat 780
gcggccattg tccgtcagga cattgttgga gccgaaatcc gcgtgcacga ggtgccggac 840
ttcggggcag tcctcggccc aaagcatcag ctcatcgaga gcctgcgcga cggacgcact 900
gacggtgtcg tccatcacag tttgccagtg atacacatgg ggatcagcaa tcgcgcatat 960
gaaatcacgc catgtagtgt attgaccgat tccttgcggt ccgaatgggc cgaacccgct 1020
cgtctggcta agatcggccg cagcgatcgc atccatagcc tccgcgaccg gctgcagaac 1080
agcgggcagt tcggtttcag gcaggtcttg caacgtgaca ccctgtgcac ggcgggagat 1140
gcaataggtc aggctctcgc tgaattcccc aatgtcaagc acttccggaa tcgggagcgc 1200
ggccgatgca aagtgccgat aaacataacg atctttgtag aaaccatcgg cgcagctatt 1260
tacccgcagg acatatccac gccctcctac atcgaagctg aaagcacgag attcttcgcc 1320
ctccgagagc tgcatcaggt cggagacgct gtcgaacttt tcgatcagaa acttctcgac 1380
agacgtcgcg gtgagttcag gcttttccat gggtatatct ccttcttaaa gttaaacaaa 1440
attatttcta gagggaaacc gttgtggtct ccctatagtg agtcgtatta atttcgcggg 1500
atcgagatct gatcaacctg cattaatgaa tcggccaacg cgcggggaga ggcggtttgc 1560
gtattgggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc 1620
ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata 1680
acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 1740
cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct 1800
caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa 1860
gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc 1920
tcccttcggg aagcgtggcg ctttctcaat gctcacgctg taggtatctc agttcggtgt 1980
aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 2040
ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg 2100
cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct 2160
tgaagtggtg gcctaactac ggctacacta gaaggacagt atttggtatc tgcgctctgc 2220
tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg 2280
ctggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc 2340
aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt 2400
aagggatttt ggtcatgaca ttaacctata aaaataggcg tatcacgagg ccctttcgtc 2460
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 2520
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 2580
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 2640
accatatgga catattgtcg ttagaacgcg gctacaatta atacataacc ttatgtatca 2700
tacacatacg atttaggtga cactatagaa cggcgcgcca agctgggtct agaactagaa 2760
acgtgatgcc acttgttatt gaagtcgatt acagcatcta ttctgtttta ctatttataa 2820
ctttgccatt tctgactttt gaaaactatc tctggatttc ggtatcgctt tgtgaagatc 2880
gagcaaaaga gacgttttgt ggacgcaatg gtccaaatcc gttctacatg aacaaattgg 2940
tcacaatttc cactaaaagt aaataaatgg caagttaaaa aaggaatatg cattttactg 3000
attgcctagg tgagctccaa gagaagttga atctacacgt ctaccaaccg ctaaaaaaag 3060
aaaaacattg aatatgtaac ctgattccat tagcttttga cttcttcaac agattctcta 3120
cttagatttc taacagaaat attattacta gcacatcatt ttcagtctca ctacagcaaa 3180
aaatccaacg gcacaataca gacaacagga gatatcagac tacagagata gatagatgct 3240
actgcatgta gtaagttaaa taaaaggaaa ataaaatgtc ttgctaccaa aactactaca 3300
gactatgatg ctcaccacag gccaaatcct gcaactagga cagcattatc ttatatatat 3360
tgtacaaaac aagcatcaag gaacatttgg tctaggcaat cagtacctcg ttctaccatc 3420
accctcagtt atcacatcct tgaaggatcc attactggga atcatcggca acacatgctc 3480
ctgatggggc acaatgacat caagaaggta ggggccaggg gtgtccaaca ttctctgaat 3540
tgccgctcta agctcttcct tcttcgtcac tcgcgctgcc ggtatcccac aagcatcagc 3600
aaacttgagc atgtttggga atatctcgct ctcgctagac ggatctccaa gataggtgtg 3660
agctctattg gacttgtaga acctatcctc caactgaacc accataccca aatgctgatt 3720
gttcaacaac aatatcttaa ctgggagatt ctccactctt atagtggcca actcctgaac 3780
attcatgatg aaactaccat ccccatcaat gtcaaccaca acagccccag ggttagcaac 3840
agcagcacca atagccgcag gcaatccaaa acccatggct ccaagacccc ctgaggtcaa 3900
ccactgcctc ggtctcttgt acttgtaaaa ctgcgcagcc cacatttgat gctgcccaac 3960
cccagtacta acaatagcat ctccattagt caactcatca agaacctcga tagcatgctg 4020
cggagaaatc gcgtcctgga atgtcttgta acccaatgga aacttgtgtt tctgcacatt 4080
aatctcttct ctccaacctc caagatcaaa cttaccctcc actcctttct cctccaaaat 4140
catattaatt cccttcaagg ccaacttcaa atccgcgcaa accgacacgt gcgcctgctt 4200
gttcttccca atctcggcag aatcaatatc aatgtgaaca atcttagccc tactagcaaa 4260
agcctcaagc ttcccagtaa cacggtcatc aaaccttacc ccaaaggcaa gcaacaaatc 4320
actattgtca acagcatagt tagcataaac agtaccatgc atacccagca tctgaaggga 4380
atattcatca ccaataggaa aagttccaag acccattaaa gtgctagcaa cgggaatacc 4440
agtgagttca acaaagcgcc tcaattcagc actggaattc aaactgccac cgccgacgta 4500
gagaacgggc ttttgggcct ccatgatgag tctgacaatg tgttccaatt gggcctcggc 4560
ggggggcctg ggcagcctgg cgaggtaacc ggggaggtta acgggctcgt cccaattagg 4620
cacggcgagt tgctgctgaa cgtctttggg aatgtcgatg aggaccggac cggggcggcc 4680
ggaggtggcg acgaagaaag cctcggcgac gacgcggggg atgtcgtcga cgtcgaggat 4740
gaggtagttg tgcttcgtga tggatctgct cacctccacg atcggggttt cttggaaggc 4800
gtcggtgccg atcatccggc gggcgacctg gccggtgatg gcgacgactg ggacgctgtc 4860
cattaaagcg tcggcgaggc cgctcacgag gttggtggcg ccggggccgg aggtggcaat 4920
gcagacgccg gggaggccgg aggaacgcgc gtagccttcg gcggcgaaga cgccgccctg 4980
ctcgtggcgc gggagcacgt tgcggatggc ggcggagcgc gtgagcgcct ggtggatctc 5040
catcgacgca ccgccggggt acgcgaacac cgtcgtcacg ccctgcctct ccagcgcctc 5100
cacaaggatg tccgcgccct tgcgaggttc gccggaggcg aaccgtgaca cgaagggctc 5160
cgtggtcggc gcttccttgg tgaagggcgc cgccgtgggg ggtttggaga tggaacattt 5220
gattttgaga gcgtggttgg gtttggtgag ggtttgatga gagagaggga gggtggatct 5280
agtaatgcgt ttggggaagg tggggtgtga agaggaagaa gagaatcggg tggttctgga 5340
agcggtggcc gccattgtgt tgtgtggcat ggttatactt caaaaactgc acaacaagcc 5400
tagagttagt acctaaacag taaatttaca acagagagca aagacacatg caaaaatttc 5460
agccataaaa aaagttataa tagaatttaa agcaaaagtt tcatttttta aacatatata 5520
caaacaaact ggatttgaag gaagggatta attcccctgc tcaaagtttg aattcctatt 5580
gtgacctata ctcgaataaa attgaagcct aaggaatgta tgagaaacaa gaaaacaaaa 5640
caaaactaca gacaaacaag tacaattaca aaattcgcta aaattctgta atcaccaaac 5700
cccatctcag tcagcacaag gcccaaggtt tattttgaaa taaaaaaaaa gtgattttat 5760
ttctcataag ctaaaagaaa gaaaggcaat tatgaaatga tttcgactag atctgaaagt 5820
caaacgcgta ttccgcagat attaaagaaa gagtagagtt tcacatggat cctagatgga 5880
cccagttgag gaaaaagcaa ggcaaagcaa accagaagtg caagatccga aattgaacca 5940
cggaatctag gatttggtag agggagaaga aaagtacctt gagaggtaga agagaagaga 6000
agagcagaga gatatatgaa cgagtgtgtc ttggtctcaa ctctgaagcg atacgagttt 6060
agaggggagc attgagttcc aatttatagg gaaaccgggt ggcaggggtg agttaatgac 6120
ggaaaagccc ctaagtaacg agattggatt gtgggttaga ttcaaccgtt tgcatccgcg 6180
gcttagattg gggaagtcag agtgaatctc aaccgttgac tgagttgaaa attgaatgta 6240
gcaaccaatt gagccaaccc cagcctttgc cctttgattt tgatttgttt gttgcatact 6300
ttttatttgt cttctggttc tgactctctt tctctcgttt caatgccagg ttgcctactc 6360
ccacaccact cacaagaaga ttctactgtt agtattaaat attttttaat gtattaaatg 6420
atgaatgctt ttgtaaacag aacaagacta tgtctaataa gtgtcttgca acatttttta 6480
agaaattaaa aaaaatatat ttattatcaa aatcaaatgt atgaaaaatc atgaataata 6540
taattttata cattttttta aaaaatcttt taatttctta attaatatct taaaaataat 6600
gattaatatt taacccaaaa taattagtat gattggtaag gaagatatcc atgttatgtt 6660
tggatgtgag tttgatctag agcaaagctt actagagtcg acctgcagcc cctccaccgc 6720
ggtggcggcc gctctagaga tccgtcaaca tggtggagca cgacactctc gtctactcca 6780
agaatatcaa agatacagtc tcagaagacc aaagggctat tgagactttt caacaaaggg 6840
taatatcggg aaacctcctc ggattccatt gcccagctat ctgtcacttc atcaaaagga 6900
cagtagaaaa ggaaggtggc acctacaaat gccatcattg cgataaagga aaggctatcg 6960
ttcaagatgc ctctgccgac agtggtccca aagatggacc cccacccacg aggagcatcg 7020
tggaaaaaga agacgttcca accacgtctt caaagcaagt ggattgatgt gatgatccta 7080
tgcgtatggt atgacgtgtg ttcaagatga tgacttcaaa cctacctatg acgtatggta 7140
tgacgtgtgt cgactgatga cttagatcca ctcgagcggc tataaatacg tacctacgca 7200
ccctgcgcta ccatccctag agctgcagct tatttttaca acaattacca acaacaacaa 7260
acaacaaaca acattacaat tactatttac aattacagtc gacccatcaa caagtttgta 7320
caaaaaagct gaacgagaaa cgtaaaatga tataaatatc aatatattaa attagatttt 7380
gcataaaaaa cagactacat aatactgtaa aacacaacat atccagtcat attggcggcc 7440
gcattaggca ccccaggctt tacactttat gcttccggct cgtataatgt gtggattttg 7500
agttaggatc cgtcgagatt ttcaggagct aaggaagcta aaatggagaa aaaaatcact 7560
ggatatacca ccgttgatat atcccaatgg catcgtaaag aacattttga ggcatttcag 7620
tcagttgctc aatgtaccta taaccagacc gttcagctgg atattacggc ctttttaaag 7680
accgtaaaga aaaataagca caagttttat ccggccttta ttcacattct tgcccgcctg 7740
atgaatgctc atccggaatt ccgtatggca atgaaagacg gtgagctggt gatatgggat 7800
agtgttcacc cttgttacac cgttttccat gagcaaactg aaacgttttc atcgctctgg 7860
agtgaatacc acgacgattt ccggcagttt ctacacatat attcgcaaga tgtggcgtgt 7920
tacggtgaaa acctggccta tttccctaaa gggtttattg agaatatgtt tttcgtctca 7980
gccaatccct gggtgagttt caccagtttt gatttaaacg tggccaatat ggacaacttc 8040
ttcgcccccg ttttcaccat gggcaaatat tatacgcaag gcgacaaggt gctgatgccg 8100
ctggcgattc aggttcatca tgccgtttgt gatggcttcc atgtcggcag aatgcttaat 8160
gaattacaac agtactgcga tgagtggcag ggcggggcgt aaagatctgg atccggctta 8220
ctaaaagcca gataacagta tgcgtatttg cgcgctgatt tttgcggtat aagaatatat 8280
actgatatgt atacccgaag tatgtcaaaa agaggtatgc tatgaagcag cgtattacag 8340
tgacagttga cagcgacagc tatcagttgc tcaaggcata tatgatgtca atatctccgg 8400
tctggtaagc acaaccatgc agaatgaagc ccgtcgtctg cgtgccgaac gctggaaagc 8460
ggaaaatcag gaagggatgg ctgaggtcgc ccggtttatt gaaatgaacg gctcttttgc 8520
tgacgagaac aggggctggt gaaatgcagt ttaaggttta cacctataaa agagagagcc 8580
gttatcgtct gtttgtggat gtacagagtg atattattga cacgcccggg cgacggatgg 8640
tgatccccct ggccagtgca cgtctgctgt cagataaagt ctcccgtgaa ctttacccgg 8700
tggtgcatat cggggatgaa agctggcgca tgatgaccac cgatatggcc agtgtgccgg 8760
tctccgttat cggggaagaa gtggctgatc tcagccaccg cgaaaatgac atcaaaaacg 8820
ccattaacct gatgttctgg ggaatataaa tgtcaggctc ccttatacac agccagtctg 8880
caggtcgacc atagtgactg gatatgttgt gttttacagt attatgtagt ctgtttttta 8940
tgcaaaatct aatttaatat attgatattt atatcatttt acgtttctcg ttcagctttc 9000
ttgtacaaag tggttgataa cctagacttg tccatcttct ggattggcca acttaattaa 9060
tgtatgaaat aaaaggatgc acacatagtg acatgctaat cactataatg tgggcatcaa 9120
agttgtgtgt tatgtgtaat ta 9142
<210>6
<211>49911
<212>DNA
<213>人工序列
<220>
<223>载体
<400>6
gtgcagcgtg acccggtcgt gcccctctct agagataatg agcattgcat gtctaagtta 60
taaaaaatta ccacatattt tttttgtcac acttgtttga agtgcagttt atctatcttt 120
atacatatat ttaaacttta ctctacgaat aatataatct atagtactac aataatatca 180
gtgttttaga gaatcatata aatgaacagt tagacatggt ctaaaggaca attgagtatt 240
ttgacaacag gactctacag ttttatcttt ttagtgtgca tgtgttctcc tttttttttg 300
caaatagctt cacctatata atacttcatc cattttatta gtacatccat ttagggttta 360
gggttaatgg tttttataga ctaatttttt tagtacatct attttattct attttagcct 420
ctaaattaag aaaactaaaa ctctatttta gtttttttat ttaataattt agatataaaa 480
tagaataaaa taaagtgact aaaaattaaa caaataccct ttaagaaatt aaaaaaacta 540
aggaaacatt tttcttgttt cgagtagata atgccagcct gttaaacgcc gtcgacgagt 600
ctaacggaca ccaaccagcg aaccagcagc gtcgcgtcgg gccaagcgaa gcagacggca 660
cggcatctct gtcgctgcct ctggacccct ctcgagagtt ccgctccacc gttggacttg 720
ctccgctgtc ggcatccaga aattgcgtgg cggagcggca gacgtgagcc ggcacggcag 780
gcggcctcct cctcctctca cggcacggca gctacggggg attcctttcc caccgctcct 840
tcgctttccc ttcctcgccc gccgtaataa atagacaccc cctccacacc ctctttcccc 900
aacctcgtgt tgttcggagc gcacacacac acaaccagat ctcccccaaa tccacccgtc 960
ggcacctccg cttcaaggta cgccgctcgt cctccccccc cccccctctc taccttctct 1020
agatcggcgt tccggtccat ggttagggcc cggtagttct acttctgttc atgtttgtgt 1080
tagatccgtg tttgtgttag atccgtgctg ctagcgttcg tacacggatg cgacctgtac 1140
gtcagacacg ttctgattgc taacttgcca gtgtttctct ttggggaatc ctgggatggc 1200
tctagccgtt ccgcagacgg gatcgatttc atgatttttt ttgtttcgtt gcatagggtt 1260
tggtttgccc ttttccttta tttcaatata tgccgtgcac ttgtttgtcg ggtcatcttt 1320
tcatgctttt ttttgtcttg gttgtgatga tgtggtctgg ttgggcggtc gttctagatc 1380
ggagtagaat tctgtttcaa actacctggt ggatttatta attttggatc tgtatgtgtg 1440
tgccatacat attcatagtt acgaattgaa gatgatggat ggaaatatcg atctaggata 1500
ggtatacatg ttgatgcggg ttttactgat gcatatacag agatgctttt tgttcgcttg 1560
gttgtgatga tgtggtgtgg ttgggcggtc gttcattcgt tctagatcgg agtagaatac 1620
tgtttcaaac tacctggtgt atttattaat tttggaactg tatgtgtgtg tcatacatct 1680
tcatagttac gagtttaaga tggatggaaa tatcgatcta ggataggtat acatgttgat 1740
gtgggtttta ctgatgcata tacatgatgg catatgcagc atctattcat atgctctaac 1800
cttgagtacc tatctattat aataaacaag tatgttttat aattattttg atcttgatat 1860
acttggatga tggcatatgc agcagctata tgtggatttt tttagccctg ccttcatacg 1920
ctatttattt gcttggtact gtttcttttg tcgatgctca ccctgttgtt tggtgttact 1980
tctgcaggtc gactctagag gatccacaag tttgtacaaa aaagctgaac gagaaacgta 2040
aaatgatata aatatcaata tattaaatta gattttgcat aaaaaacaga ctacataata 2100
ctgtaaaaca caacatatcc agtcactatg gcggccgcat taggcacccc aggctttaca 2160
ctttatgctt ccggctcgta taatgtgtgg attttgagtt aggatttaaa tacgcgttga 2220
tccggcttac taaaagccag ataacagtat gcgtatttgc gcgctgattt ttgcggtata 2280
agaatatata ctgatatgta tacccgaagt atgtcaaaaa gaggtatgct atgaagcagc 2340
gtattacagt gacagttgac agcgacagct atcagttgct caaggcatat atgatgtcaa 2400
tatctccggt ctggtaagca caaccatgca gaatgaagcc cgtcgtctgc gtgccgaacg 2460
ctggaaagcg gaaaatcagg aagggatggc tgaggtcgcc cggtttattg aaatgaacgg 2520
ctcttttgct gacgagaaca ggggctggtg aaatgcagtt taaggtttac acctataaaa 2580
gagagagccg ttatcgtctg tttgtggatg tacagagtga tatcattgac acgcccggtc 2640
gacggatggt gatccccctg gccagtgcac gtctgctgtc agataaagtc tcccgtgaac 2700
tttacccggt ggtgcatatc ggggatgaaa gctggcgcat gatgaccacc gatatggcca 2760
gtgtgccggt ctccgttatc ggggaagaag tggctgatct cagccaccgc gaaaatgaca 2820
tcaaaaacgc cattaacctg atgttctggg gaatataaat gtcaggctcc cttatacaca 2880
gccagtctgc aggtcgacca tagtgactgg atatgttgtg ttttacagta ttatgtagtc 2940
tgttttttat gcaaaatcta atttaatata ttgatattta tatcatttta cgtttctcgt 3000
tcagctttct tgtacaaagt ggtgttaacc tagacttgtc catcttctgg attggccaac 3060
ttaattaatg tatgaaataa aaggatgcac acatagtgac atgctaatca ctataatgtg 3120
ggcatcaaag ttgtgtgtta tgtgtaatta ctagttatct gaataaaaga gaaagagatc 3180
atccatattt cttatcctaa atgaatgtca cgtgtcttta taattctttg atgaaccaga 3240
tgcatttcat taaccaaatc catatacata taaatattaa tcatatataa ttaatatcaa 3300
ttgggttagc aaaacaaatc tagtctaggt gtgttttgcg aattgcggcc gccaccgcgg 3360
tggagctcga attccggtcc gggtcacctt tgtccaccaa gatggaactg cggccgctca 3420
ttaattaagt caggcgcgcc tctagttgaa gacacgttca tgtcttcatc gtaagaagac 3480
actcagtagt cttcggccag aatggccatc tggattcagc aggcctagaa ggccatttaa 3540
atcctgagga tctggtcttc ctaaggaccc gggatatcgg accgattaaa ctttaattcg 3600
gtccgaagct tgcatgcctg cagtgcagcg tgacccggtc gtgcccctct ctagagataa 3660
tgagcattgc atgtctaagt tataaaaaat taccacatat tttttttgtc acacttgttt 3720
gaagtgcagt ttatctatct ttatacatat atttaaactt tactctacga ataatataat 3780
ctatagtact acaataatat cagtgtttta gagaatcata taaatgaaca gttagacatg 3840
gtctaaagga caattgagta ttttgacaac aggactctac agttttatct ttttagtgtg 3900
catgtgttct cctttttttt tgcaaatagc ttcacctata taatacttca tccattttat 3960
tagtacatcc atttagggtt tagggttaat ggtttttata gactaatttt tttagtacat 4020
ctattttatt ctattttagc ctctaaatta agaaaactaa aactctattt tagttttttt 4080
atttaataat ttagatataa aatagaataa aataaagtga ctaaaaatta aacaaatacc 4140
ctttaagaaa ttaaaaaaac taaggaaaca tttttcttgt ttcgagtaga taatgccagc 4200
ctgttaaacg ccgtcgacga gtctaacgga caccaaccag cgaaccagca gcgtcgcgtc 4260
gggccaagcg aagcagacgg cacggcatct ctgtcgctgc ctctggaccc ctctcgagag 4320
ttccgctcca ccgttggact tgctccgctg tcggcatcca gaaattgcgt ggcggagcgg 4380
cagacgtgag ccggcacggc aggcggcctc ctcctcctct cacggcaccg gcagctacgg 4440
gggattcctt tcccaccgct ccttcgcttt cccttcctcg cccgccgtaa taaatagaca 4500
ccccctccac accctctttc cccaacctcg tgttgttcgg agcgcacaca cacacaacca 4560
gatctccccc aaatccaccc gtcggcacct ccgcttcaag gtacgccgct cgtcctcccc 4620
cccccccctc tctaccttct ctagatcggc gttccggtcc atgcatggtt agggcccggt 4680
agttctactt ctgttcatgt ttgtgttaga tccgtgtttg tgttagatcc gtgctgctag 4740
cgttcgtaca cggatgcgac ctgtacgtca gacacgttct gattgctaac ttgccagtgt 4800
ttctctttgg ggaatcctgg gatggctcta gccgttccgc agacgggatc gatttcatga 4860
ttttttttgt ttcgttgcat agggtttggt ttgccctttt cctttatttc aatatatgcc 4920
gtgcacttgt ttgtcgggtc atcttttcat gctttttttt gtcttggttg tgatgatgtg 4980
gtctggttgg gcggtcgttc tagatcggag tagaattctg tttcaaacta cctggtggat 5040
ttattaattt tggatctgta tgtgtgtgcc atacatattc atagttacga attgaagatg 5100
atggatggaa atatcgatct aggataggta tacatgttga tgcgggtttt actgatgcat 5160
atacagagat gctttttgtt cgcttggttg tgatgatgtg gtgtggttgg gcggtcgttc 5220
attcgttcta gatcggagta gaatactgtt tcaaactacc tggtgtattt attaattttg 5280
gaactgtatg tgtgtgtcat acatcttcat agttacgagt ttaagatgga tggaaatatc 5340
gatctaggat aggtatacat gttgatgtgg gttttactga tgcatataca tgatggcata 5400
tgcagcatct attcatatgc tctaaccttg agtacctatc tattataata aacaagtatg 5460
ttttataatt attttgatct tgatatactt ggatgatggc atatgcagca gctatatgtg 5520
gattttttta gccctgcctt catacgctat ttatttgctt ggtactgttt cttttgtcga 5580
tgctcaccct gttgtttggt gttacttctg caggtcgact ttaacttagc ctaggatcca 5640
cacgacacca tgtcccccga gcgccgcccc gtcgagatcc gcccggccac cgccgccgac 5700
atggccgccg tgtgcgacat cgtgaaccac tacatcgaga cctccaccgt gaacttccgc 5760
accgagccgc agaccccgca ggagtggatc gacgacctgg agcgcctcca ggaccgctac 5820
ccgtggctcg tggccgaggt ggagggcgtg gtggccggca tcgcctacgc cggcccgtgg 5880
aaggcccgca acgcctacga ctggaccgtg gagtccaccg tgtacgtgtc ccaccgccac 5940
cagcgcctcg gcctcggctc caccctctac acccacctcc tcaagagcat ggaggcccag 6000
ggcttcaagt ccgtggtggc cgtgatcggc ctcccgaacg acccgtccgt gcgcctccac 6060
gaggccctcg gctacaccgc ccgcggcacc ctccgcgccg ccggctacaa gcacggcggc 6120
tggcacgacg tcggcttctg gcagcgcgac ttcgagctgc cggccccgcc gcgcccggtg 6180
cgcccggtga cgcagatctg agtcgaaacc tagacttgtc catcttctgg attggccaac 6240
ttaattaatg tatgaaataa aaggatgcac acatagtgac atgctaatca ctataatgtg 6300
ggcatcaaag ttgtgtgtta tgtgtaatta ctagttatct gaataaaaga gaaagagatc 6360
atccatattt cttatcctaa atgaatgtca cgtgtcttta taattctttg atgaaccaga 6420
tgcatttcat taaccaaatc catatacata taaatattaa tcatatataa ttaatatcaa 6480
ttgggttagc aaaacaaatc tagtctaggt gtgttttgcg aattgcggcc gccaccgcgg 6540
tggagctcga attcattccg attaatcgtg gcctcttgct cttcaggatg aagagctatg 6600
tttaaacgtg caagcgctac tagacaattc agtacattaa aaacgtccgc aatgtgttat 6660
taagttgtct aagcgtcaat ttggtttaca ccacaatata tcctgccacc agccagccaa 6720
cagctccccg accggcagct cggcacaaaa tcaccactcg atacaggcag cccatcagtc 6780
cgggacggcg tcagcgggag agccgttgta aggcggcaga ctttgctcat gttaccgatg 6840
ctattcggaa gaacggcaac taagctgccg ggtttgaaac acggatgatc tcgcggaggg 6900
tagcatgttg attgtaacga tgacagagcg ttgctgcctg tgatcaaata tcatctccct 6960
cgcagagatc cgaattatca gccttcttat tcatttctcg cttaaccgtg acaggctgtc 7020
gatcttgaga actatgccga cataatagga aatcgctgga taaagccgct gaggaagctg 7080
agtggcgcta tttctttaga agtgaacgtt gacgatcgtc gaccgtaccc cgatgaatta 7140
attcggacgt acgttctgaa cacagctgga tacttacttg ggcgattgtc atacatgaca 7200
tcaacaatgt acccgtttgt gtaaccgtct cttggaggtt cgtatgacac tagtggttcc 7260
cctcagcttg cgactagatg ttgaggccta acattttatt agagagcagg ctagttgctt 7320
agatacatga tcttcaggcc gttatctgtc agggcaagcg aaaattggcc atttatgacg 7380
accaatgccc cgcagaagct cccatctttg ccgccataga cgccgcgccc cccttttggg 7440
gtgtagaaca tccttttgcc agatgtggaa aagaagttcg ttgtcccatt gttggcaatg 7500
acgtagtagc cggcgaaagt gcgagaccca tttgcgctat atataagcct acgatttccg 7560
ttgcgactat tgtcgtaatt ggatgaacta ttatcgtagt tgctctcaga gttgtcgtaa 7620
tttgatggac tattgtcgta attgcttatg gagttgtcgt agttgcttgg agaaatgtcg 7680
tagttggatg gggagtagtc atagggaaga cgagcttcat ccactaaaac aattggcagg 7740
tcagcaagtg cctgccccga tgccatcgca agtacgaggc ttagaaccac cttcaacaga 7800
tcgcgcatag tcttccccag ctctctaacg cttgagttaa gccgcgccgc gaagcggcgt 7860
cggcttgaac gaattgttag acattatttg ccgactacct tggtgatctc gcctttcacg 7920
tagtgaacaa attcttccaa ctgatctgcg cgcgaggcca agcgatcttc ttgtccaaga 7980
taagcctgcc tagcttcaag tatgacgggc tgatactggg ccggcaggcg ctccattgcc 8040
cagtcggcag cgacatcctt cggcgcgatt ttgccggtta ctgcgctgta ccaaatgcgg 8100
gacaacgtaa gcactacatt tcgctcatcg ccagcccagt cgggcggcga gttccatagc 8160
gttaaggttt catttagcgc ctcaaataga tcctgttcag gaaccggatc aaagagttcc 8220
tccgccgctg gacctaccaa ggcaacgcta tgttctcttg cttttgtcag caagatagcc 8280
agatcaatgt cgatcgtggc tggctcgaag atacctgcaa gaatgtcatt gcgctgccat 8340
tctccaaatt gcagttcgcg cttagctgga taacgccacg gaatgatgtc gtcgtgcaca 8400
acaatggtga cttctacagc gcggagaatc tcgctctctc caggggaagc cgaagtttcc 8460
aaaaggtcgt tgatcaaagc tcgccgcgtt gtttcatcaa gccttacagt caccgtaacc 8520
agcaaatcaa tatcactgtg tggcttcagg ccgccatcca ctgcggagcc gtacaaatgt 8580
acggccagca acgtcggttc gagatggcgc tcgatgacgc caactacctc tgatagttga 8640
gtcgatactt cggcgatcac cgcttccctc atgatgttta actcctgaat taagccgcgc 8700
cgcgaagcgg tgtcggcttg aatgaattgt taggcgtcat cctgtgctcc cgagaaccag 8760
taccagtaca tcgctgtttc gttcgagact tgaggtctag ttttatacgt gaacaggtca 8820
atgccgccga gagtaaagcc acattttgcg tacaaattgc aggcaggtac attgttcgtt 8880
tgtgtctcta atcgtatgcc aaggagctgt ctgcttagtg cccacttttt cgcaaattcg 8940
atgagactgt gcgcgactcc tttgcctcgg tgcgtgtgcg acacaacaat gtgttcgata 9000
gaggctagat cgttccatgt tgagttgagt tcaatcttcc cgacaagctc ttggtcgatg 9060
aatgcgccat agcaagcaga gtcttcatca gagtcatcat ccgagatgta atccttccgg 9120
taggggctca cacttctggt agatagttca aagccttggt cggataggtg cacatcgaac 9180
acttcacgaa caatgaaatg gttctcagca tccaatgttt ccgccacctg ctcagggatc 9240
accgaaatct tcatatgacg cctaacgcct ggcacagcgg atcgcaaacc tggcgcggct 9300
tttggcacaa aaggcgtgac aggtttgcga atccgttgct gccacttgtt aacccttttg 9360
ccagatttgg taactataat ttatgttaga ggcgaagtct tgggtaaaaa ctggcctaaa 9420
attgctgggg atttcaggaa agtaaacatc accttccggc tcgatgtcta ttgtagatat 9480
atgtagtgta tctacttgat cgggggatct gctgcctcgc gcgtttcggt gatgacggtg 9540
aaaacctctg acacatgcag ctcccggaga cggtcacagc ttgtctgtaa gcggatgccg 9600
ggagcagaca agcccgtcag ggcgcgtcag cgggtgttgg cgggtgtcgg ggcgcagcca 9660
tgacccagtc acgtagcgat agcggagtgt atactggctt aactatgcgg catcagagca 9720
gattgtactg agagtgcacc atatgcggtg tgaaataccg cacagatgcg taaggagaaa 9780
ataccgcatc aggcgctctt ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg 9840
gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg 9900
ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa 9960
ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg 10020
acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc 10080
tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc 10140
ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc 10200
ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg 10260
ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc 10320
actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga 10380
gttcttgaag tggtggccta actacggcta cactagaagg acagtatttg gtatctgcgc 10440
tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac 10500
caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg 10560
atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc 10620
acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa 10680
ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta 10740
ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt 10800
tgcctgactc cccgtcgtgt agataactac gatacgggag ggcttaccat ctggccccag 10860
tgctgcaatg ataccgcgag acccacgctc accggctcca gatttatcag caataaacca 10920
gccagccgga agggccgagc gcagaagtgg tcctgcaact ttatccgcct ccatccagtc 10980
tattaattgt tgccgggaag ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt 11040
tgttgccatt gctgcagggg gggggggggg gggggacttc cattgttcat tccacggaca 11100
aaaacagaga aaggaaacga cagaggccaa aaagcctcgc tttcagcacc tgtcgtttcc 11160
tttcttttca gagggtattt taaataaaaa cattaagtta tgacgaagaa gaacggaaac 11220
gccttaaacc ggaaaatttt cataaatagc gaaaacccgc gaggtcgccg ccccgtaacc 11280
tacctgtcgg atcaccggaa aggacccgta aagtgataat gattatcatc tacatatcac 11340
aacgtgcgtg gaggccatca aaccacgtca aataatcaat tatgacgcag gtatcgtatt 11400
aattgatctg catcaactta acgtaaaaac aacttcagac aatacaaatc agcgacactg 11460
aatacggggc aacctcatgt cccccccccc cccccccctg caggcatcgt ggtgtcacgc 11520
tcgtcgtttg gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga 11580
tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt 11640
aagttggccg cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc 11700
atgccatccg taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa 11760
tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa cacgggataa taccgcgcca 11820
catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca 11880
aggatcttac cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct 11940
tcagcatctt ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc 12000
gcaaaaaagg gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa 12060
tattattgaa gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt 12120
tagaaaaata aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc 12180
taagaaacca ttattatcat gacattaacc tataaaaata ggcgtatcac gaggcccttt 12240
cgtcttcaag aattcggagc ttttgccatt ctcaccggat tcagtcgtca ctcatggtga 12300
tttctcactt gataacctta tttttgacga ggggaaatta ataggttgta ttgatgttgg 12360
acgagtcgga atcgcagacc gataccagga tcttgccatc ctatggaact gcctcggtga 12420
gttttctcct tcattacaga aacggctttt tcaaaaatat ggtattgata atcctgatat 12480
gaataaattg cagtttcatt tgatgctcga tgagtttttc taatcagaat tggttaattg 12540
gttgtaacac tggcagagca ttacgctgac ttgacgggac ggcggctttg ttgaataaat 12600
cgaacttttg ctgagttgaa ggatcagatc acgcatcttc ccgacaacgc agaccgttcc 12660
gtggcaaagc aaaagttcaa aatcaccaac tggtccacct acaacaaagc tctcatcaac 12720
cgtggctccc tcactttctg gctggatgat ggggcgattc aggcctggta tgagtcagca 12780
acaccttctt cacgaggcag acctcagcgc cagaaggccg ccagagaggc cgagcgcggc 12840
cgtgaggctt ggacgctagg gcagggcatg aaaaagcccg tagcgggctg ctacgggcgt 12900
ctgacgcggt ggaaaggggg aggggatgtt gtctacatgg ctctgctgta gtgagtgggt 12960
tgcgctccgg cagcggtcct gatcaatcgt caccctttct cggtccttca acgttcctga 13020
caacgagcct ccttttcgcc aatccatcga caatcaccgc gagtccctgc tcgaacgctg 13080
cgtccggacc ggcttcgtcg aaggcgtcta tcgcggcccg caacagcggc gagagcggag 13140
cctgttcaac ggtgccgccg cgctcgccgg catcgctgtc gccggcctgc tcctcaagca 13200
cggccccaac agtgaagtag ctgattgtca tcagcgcatt gacggcgtcc ccggccgaaa 13260
aacccgcctc gcagaggaag cgaagctgcg cgtcggccgt ttccatctgc ggtgcgcccg 13320
gtcgcgtgcc ggcatggatg cgcgcgccat cgcggtaggc gagcagcgcc tgcctgaagc 13380
tgcgggcatt cccgatcaga aatgagcgcc agtcgtcgtc ggctctcggc accgaatgcg 13440
tatgattctc cgccagcatg gcttcggcca gtgcgtcgag cagcgcccgc ttgttcctga 13500
agtgccagta aagcgccggc tgctgaaccc ccaaccgttc cgccagtttg cgtgtcgtca 13560
gaccgtctac gccgacctcg ttcaacaggt ccagggcggc acggatcact gtattcggct 13620
gcaactttgt catgcttgac actttatcac tgataaacat aatatgtcca ccaacttatc 13680
agtgataaag aatccgcgcg ttcaatcgga ccagcggagg ctggtccgga ggccagacgt 13740
gaaacccaac atacccctga tcgtaattct gagcactgtc gcgctcgacg ctgtcggcat 13800
cggcctgatt atgccggtgc tgccgggcct cctgcgcgat ctggttcact cgaacgacgt 13860
caccgcccac tatggcattc tgctggcgct gtatgcgttg gtgcaatttg cctgcgcacc 13920
tgtgctgggc gcgctgtcgg atcgtttcgg gcggcggcca atcttgctcg tctcgctggc 13980
cggcgccact gtcgactacg ccatcatggc gacagcgcct ttcctttggg ttctctatat 14040
cgggcggatc gtggccggca tcaccggggc gactggggcg gtagccggcg cttatattgc 14100
cgatatcact gatggcgatg agcgcgcgcg gcacttcggc ttcatgagcg cctgtttcgg 14160
gttcgggatg gtcgcgggac ctgtgctcgg tgggctgatg ggcggtttct ccccccacgc 14220
tccgttcttc gccgcggcag ccttgaacgg cctcaatttc ctgacgggct gtttcctttt 14280
gccggagtcg cacaaaggcg aacgccggcc gttacgccgg gaggctctca acccgctcgc 14340
ttcgttccgg tgggcccggg gcatgaccgt cgtcgccgcc ctgatggcgg tcttcttcat 14400
catgcaactt gtcggacagg tgccggccgc gctttgggtc attttcggcg aggatcgctt 14460
tcactgggac gcgaccacga tcggcatttc gcttgccgca tttggcattc tgcattcact 14520
cgcccaggca atgatcaccg gccctgtagc cgcccggctc ggcgaaaggc gggcactcat 14580
gctcggaatg attgccgacg gcacaggcta catcctgctt gccttcgcga cacggggatg 14640
gatggcgttc ccgatcatgg tcctgcttgc ttcgggtggc atcggaatgc cggcgctgca 14700
agcaatgttg tccaggcagg tggatgagga acgtcagggg cagctgcaag gctcactggc 14760
ggcgctcacc agcctgacct cgatcgtcgg acccctcctc ttcacggcga tctatgcggc 14820
ttctataaca acgtggaacg ggtgggcatg gattgcaggc gctgccctct acttgctctg 14880
cctgccggcg ctgcgtcgcg ggctttggag cggcgcaggg caacgagccg atcgctgatc 14940
gtggaaacga taggcctatg ccatgcgggt caaggcgact tccggcaagc tatacgcgcc 15000
ctaggagtgc ggttggaacg ttggcccagc cagatactcc cgatcacgag caggacgccg 15060
atgatttgaa gcgcactcag cgtctgatcc aagaacaacc atcctagcaa cacggcggtc 15120
cccgggctga gaaagcccag taaggaaaca actgtaggtt cgagtcgcga gatcccccgg 15180
aaccaaagga agtaggttaa acccgctccg atcaggccga gccacgccag gccgagaaca 15240
ttggttcctg taggcatcgg gattggcgga tcaaacacta aagctactgg aacgagcaga 15300
agtcctccgg ccgccagttg ccaggcggta aaggtgagca gaggcacggg aggttgccac 15360
ttgcgggtca gcacggttcc gaacgccatg gaaaccgccc ccgccaggcc cgctgcgacg 15420
ccgacaggat ctagcgctgc gtttggtgtc aacaccaaca gcgccacgcc cgcagttccg 15480
caaatagccc ccaggaccgc catcaatcgt atcgggctac ctagcagagc ggcagagatg 15540
aacacgacca tcagcggctg cacagcgcct accgtcgccg cgaccccgcc cggcaggcgg 15600
tagaccgaaa taaacaacaa gctccagaat agcgaaatat taagtgcgcc gaggatgaag 15660
atgcgcatcc accagattcc cgttggaatc tgtcggacga tcatcacgag caataaaccc 15720
gccggcaacg cccgcagcag cataccggcg acccctcggc ctcgctgttc gggctccacg 15780
aaaacgccgg acagatgcgc cttgtgagcg tccttggggc cgtcctcctg tttgaagacc 15840
gacagcccaa tgatctcgcc gtcgatgtag gcgccgaatg ccacggcatc tcgcaaccgt 15900
tcagcgaacg cctccatggg ctttttctcc tcgtgctcgt aaacggaccc gaacatctct 15960
ggagctttct tcagggccga caatcggatc tcgcggaaat cctgcacgtc ggccgctcca 16020
agccgtcgaa tctgagcctt aatcacaatt gtcaatttta atcctctgtt tatcggcagt 16080
tcgtagagcg cgccgtgcgt cccgagcgat actgagcgaa gcaagtgcgt cgagcagtgc 16140
ccgcttgttc ctgaaatgcc agtaaagcgc tggctgctga acccccagcc ggaactgacc 16200
ccacaaggcc ctagcgtttg caatgcacca ggtcatcatt gacccaggcg tgttccacca 16260
ggccgctgcc tcgcaactct tcgcaggctt cgccgacctg ctcgcgccac ttcttcacgc 16320
gggtggaatc cgatccgcac atgaggcgga aggtttccag cttgagcggg tacggctccc 16380
ggtgcgagct gaaatagtcg aacatccgtc gggccgtcgg cgacagcttg cggtacttct 16440
cccatatgaa tttcgtgtag tggtcgccag caaacagcac gacgatttcc tcgtcgatca 16500
ggacctggca acgggacgtt ttcttgccac ggtccaggac gcggaagcgg tgcagcagcg 16560
acaccgattc caggtgccca acgcggtcgg acgtgaagcc catcgccgtc gcctgtaggc 16620
gcgacaggca ttcctcggcc ttcgtgtaat accggccatt gatcgaccag cccaggtcct 16680
ggcaaagctc gtagaacgtg aaggtgatcg gctcgccgat aggggtgcgc ttcgcgtact 16740
ccaacacctg ctgccacacc agttcgtcat cgtcggcccg cagctcgacg ccggtgtagg 16800
tgatcttcac gtccttgttg acgtggaaaa tgaccttgtt ttgcagcgcc tcgcgcggga 16860
ttttcttgtt gcgcgtggtg aacagggcag agcgggccgt gtcgtttggc atcgctcgca 16920
tcgtgtccgg ccacggcgca atatcgaaca aggaaagctg catttccttg atctgctgct 16980
tcgtgtgttt cagcaacgcg gcctgcttgg cctcgctgac ctgttttgcc aggtcctcgc 17040
cggcggtttt tcgcttcttg gtcgtcatag ttcctcgcgt gtcgatggtc atcgacttcg 17100
ccaaacctgc cgcctcctgt tcgagacgac gcgaacgctc cacggcggcc gatggcgcgg 17160
gcagggcagg gggagccagt tgcacgctgt cgcgctcgat cttggccgta gcttgctgga 17220
ccatcgagcc gacggactgg aaggtttcgc ggggcgcacg catgacggtg cggcttgcga 17280
tggtttcggc atcctcggcg gaaaaccccg cgtcgatcag ttcttgcctg tatgccttcc 17340
ggtcaaacgt ccgattcatt caccctcctt gcgggattgc cccgactcac gccggggcaa 17400
tgtgccctta ttcctgattt gacccgcctg gtgccttggt gtccagataa tccaccttat 17460
cggcaatgaa gtcggtcccg tagaccgtct ggccgtcctt ctcgtacttg gtattccgaa 17520
tcttgccctg cacgaatacc agcgacccct tgcccaaata cttgccgtgg gcctcggcct 17580
gagagccaaa acacttgatg cggaagaagt cggtgcgctc ctgcttgtcg ccggcatcgt 17640
tgcgccactc ttcattaacc gctatatcga aaattgcttg cggcttgtta gaattgccat 17700
gacgtacctc ggtgtcacgg gtaagattac cgataaactg gaactgatta tggctcatat 17760
cgaaagtctc cttgagaaag gagactctag tttagctaaa cattggttcc gctgtcaaga 17820
actttagcgg ctaaaatttt gcgggccgcg accaaaggtg cgaggggcgg cttccgctgt 17880
gtacaaccag atatttttca ccaacatcct tcgtctgctc gatgagcggg gcatgacgaa 17940
acatgagctg tcggagaggg caggggtttc aatttcgttt ttatcagact taaccaacgg 18000
taaggccaac ccctcgttga aggtgatgga ggccattgcc gacgccctgg aaactcccct 18060
acctcttctc ctggagtcca ccgaccttga ccgcgaggca ctcgcggaga ttgcgggtca 18120
tcctttcaag agcagcgtgc cgcccggata cgaacgcatc agtgtggttt tgccgtcaca 18180
taaggcgttt atcgtaaaga aatggggcga cgacacccga aaaaagctgc gtggaaggct 18240
ctgacgccaa gggttagggc ttgcacttcc ttctttagcc gctaaaacgg ccccttctct 18300
gcgggccgtc ggctcgcgca tcatatcgac atcctcaacg gaagccgtgc cgcgaatggc 18360
atcgggcggg tgcgctttga cagttgtttt ctatcagaac ccctacgtcg tgcggttcga 18420
ttagctgttt gtcttgcagg ctaaacactt tcggtatatc gtttgcctgt gcgataatgt 18480
tgctaatgat ttgttgcgta ggggttactg aaaagtgagc gggaaagaag agtttcagac 18540
catcaaggag cgggccaagc gcaagctgga acgcgacatg ggtgcggacc tgttggccgc 18600
gctcaacgac ccgaaaaccg ttgaagtcat gctcaacgcg gacggcaagg tgtggcacga 18660
acgccttggc gagccgatgc ggtacatctg cgacatgcgg cccagccagt cgcaggcgat 18720
tatagaaacg gtggccggat tccacggcaa agaggtcacg cggcattcgc ccatcctgga 18780
aggcgagttc cccttggatg gcagccgctt tgccggccaa ttgccgccgg tcgtggccgc 18840
gccaaccttt gcgatccgca agcgcgcggt cgccatcttc acgctggaac agtacgtcga 18900
ggcgggcatc atgacccgcg agcaatacga ggtcattaaa agcgccgtcg cggcgcatcg 18960
aaacatcctc gtcattggcg gtactggctc gggcaagacc acgctcgtca acgcgatcat 19020
caatgaaatg gtcgccttca acccgtctga gcgcgtcgtc atcatcgagg acaccggcga 19080
aatccagtgc gccgcagaga acgccgtcca ataccacacc agcatcgacg tctcgatgac 19140
gctgctgctc aagacaacgc tgcgtatgcg ccccgaccgc atcctggtcg gtgaggtacg 19200
tggccccgaa gcccttgatc tgttgatggc ctggaacacc gggcatgaag gaggtgccgc 19260
caccctgcac gcaaacaacc ccaaagcggg cctgagccgg ctcgccatgc ttatcagcat 19320
gcacccggat tcaccgaaac ccattgagcc gctgattggc gaggcggttc atgtggtcgt 19380
ccatatcgcc aggaccccta gcggccgtcg agtgcaagaa attctcgaag ttcttggtta 19440
cgagaacggc cagtacatca ccaaaaccct gtaaggagta tttccaatga caacggctgt 19500
tccgttccgt ctgaccatga atcgcggcat tttgttctac cttgccgtgt tcttcgttct 19560
cgctctcgcg ttatccgcgc atccggcgat ggcctcggaa ggcaccggcg gcagcttgcc 19620
atatgagagc tggctgacga acctgcgcaa ctccgtaacc ggcccggtgg ccttcgcgct 19680
gtccatcatc ggcatcgtcg tcgccggcgg cgtgctgatc ttcggcggcg aactcaacgc 19740
cttcttccga accctgatct tcctggttct ggtgatggcg ctgctggtcg gcgcgcagaa 19800
cgtgatgagc accttcttcg gtcgtggtgc cgaaatcgcg gccctcggca acggggcgct 19860
gcaccaggtg caagtcgcgg cggcggatgc cgtgcgtgcg gtagcggctg gacggctcgc 19920
ctaatcatgg ctctgcgcac gatccccatc cgtcgcgcag gcaaccgaga aaacctgttc 19980
atgggtggtg atcgtgaact ggtgatgttc tcgggcctga tggcgtttgc gctgattttc 20040
agcgcccaag agctgcgggc caccgtggtc ggtctgatcc tgtggttcgg ggcgctctat 20100
gcgttccgaa tcatggcgaa ggccgatccg aagatgcggt tcgtgtacct gcgtcaccgc 20160
cggtacaagc cgtattaccc ggcccgctcg accccgttcc gcgagaacac caatagccaa 20220
gggaagcaat accgatgatc caagcaattg cgattgcaat cgcgggcctc ggcgcgcttc 20280
tgttgttcat cctctttgcc cgcatccgcg cggtcgatgc cgaactgaaa ctgaaaaagc 20340
atcgttccaa ggacgccggc ctggccgatc tgctcaacta cgccgctgtc gtcgatgacg 20400
gcgtaatcgt gggcaagaac ggcagcttta tggctgcctg gctgtacaag ggcgatgaca 20460
acgcaagcag caccgaccag cagcgcgaag tagtgtccgc ccgcatcaac caggccctcg 20520
cgggcctggg aagtgggtgg atgatccatg tggacgccgt gcggcgtcct gctccgaact 20580
acgcggagcg gggcctgtcg gcgttccctg accgtctgac ggcagcgatt gaagaagagc 20640
gctcggtctt gccttgctcg tcggtgatgt acttcaccag ctccgcgaag tcgctcttct 20700
tgatggagcg catggggacg tgcttggcaa tcacgcgcac cccccggccg ttttagcggc 20760
taaaaaagtc atggctctgc cctcgggcgg accacgccca tcatgacctt gccaagctcg 20820
tcctgcttct cttcgatctt cgccagcagg gcgaggatcg tggcatcacc gaaccgcgcc 20880
gtgcgcgggt cgtcggtgag ccagagtttc agcaggccgc ccaggcggcc caggtcgcca 20940
ttgatgcggg ccagctcgcg gacgtgctca tagtccacga cgcccgtgat tttgtagccc 21000
tggccgacgg ccagcaggta ggccgacagg ctcatgccgg ccgccgccgc cttttcctca 21060
atcgctcttc gttcgtctgg aaggcagtac accttgatag gtgggctgcc cttcctggtt 21120
ggcttggttt catcagccat ccgcttgccc tcatctgtta cgccggcggt agccggccag 21180
cctcgcagag caggattccc gttgagcacc gccaggtgcg aataagggac agtgaagaag 21240
gaacacccgc tcgcgggtgg gcctacttca cctatcctgc ccggctgacg ccgttggata 21300
caccaaggaa agtctacacg aaccctttgg caaaatcctg tatatcgtgc gaaaaaggat 21360
ggatataccg aaaaaatcgc tataatgacc ccgaagcagg gttatgcagc ggaaaagcgc 21420
tgcttccctg ctgttttgtg gaatatctac cgactggaaa caggcaaatg caggaaatta 21480
ctgaactgag gggacaggcg agagacgatg ccaaagagct acaccgacga gctggccgag 21540
tgggttgaat cccgcgcggc caagaagcgc cggcgtgatg aggctgcggt tgcgttcctg 21600
gcggtgaggg cggatgtcga ggcggcgtta gcgtccggct atgcgctcgt caccatttgg 21660
gagcacatgc gggaaacggg gaaggtcaag ttctcctacg agacgttccg ctcgcacgcc 21720
aggcggcaca tcaaggccaa gcccgccgat gtgcccgcac cgcaggccaa ggctgcggaa 21780
cccgcgccgg cacccaagac gccggagcca cggcggccga agcagggggg caaggctgaa 21840
aagccggccc ccgctgcggc cccgaccggc ttcaccttca acccaacacc ggacaaaaag 21900
gatctactgt aatggcgaaa attcacatgg ttttgcaggg caagggcggg gtcggcaagt 21960
cggccatcgc cgcgatcatt gcgcagtaca agatggacaa ggggcagaca cccttgtgca 22020
tcgacaccga cccggtgaac gcgacgttcg agggctacaa ggccctgaac gtccgccggc 22080
tgaacatcat ggccggcgac gaaattaact cgcgcaactt cgacaccctg gtcgagctga 22140
ttgcgccgac caaggatgac gtggtgatcg acaacggtgc cagctcgttc gtgcctctgt 22200
cgcattacct catcagcaac caggtgccgg ctctgctgca agaaatgggg catgagctgg 22260
tcatccatac cgtcgtcacc ggcggccagg ctctcctgga cacggtgagc ggcttcgccc 22320
agctcgccag ccagttcccg gccgaagcgc ttttcgtggt ctggctgaac ccgtattggg 22380
ggcctatcga gcatgagggc aagagctttg agcagatgaa ggcgtacacg gccaacaagg 22440
cccgcgtgtc gtccatcatc cagattccgg ccctcaagga agaaacctac ggccgcgatt 22500
tcagcgacat gctgcaagag cggctgacgt tcgaccaggc gctggccgat gaatcgctca 22560
cgatcatgac gcggcaacgc ctcaagatcg tgcggcgcgg cctgtttgaa cagctcgacg 22620
cggcggccgt gctatgagcg accagattga agagctgatc cgggagattg cggccaagca 22680
cggcatcgcc gtcggccgcg acgacccggt gctgatcctg cataccatca acgcccggct 22740
catggccgac agtgcggcca agcaagagga aatccttgcc gcgttcaagg aagagctgga 22800
agggatcgcc catcgttggg gcgaggacgc caaggccaaa gcggagcgga tgctgaacgc 22860
ggccctggcg gccagcaagg acgcaatggc gaaggtaatg aaggacagcg ccgcgcaggc 22920
ggccgaagcg atccgcaggg aaatcgacga cggccttggc cgccagctcg cggccaaggt 22980
cgcggacgcg cggcgcgtgg cgatgatgaa catgatcgcc ggcggcatgg tgttgttcgc 23040
ggccgccctg gtggtgtggg cctcgttatg aatcgcagag gcgcagatga aaaagcccgg 23100
cgttgccggg ctttgttttt gcgttagctg ggcttgtttg acaggcccaa gctctgactg 23160
cgcccgcgct cgcgctcctg ggcctgtttc ttctcctgct cctgcttgcg catcagggcc 23220
tggtgccgtc gggctgcttc acgcatcgaa tcccagtcgc cggccagctc gggatgctcc 23280
gcgcgcatct tgcgcgtcgc cagttcctcg atcttgggcg cgtgaatgcc catgccttcc 23340
ttgatttcgc gcaccatgtc cagccgcgtg tgcagggtct gcaagcgggc ttgctgttgg 23400
gcctgctgct gctgccaggc ggcctttgta cgcggcaggg acagcaagcc gggggcattg 23460
gactgtagct gctgcaaacg cgcctgctga cggtctacga gctgttctag gcggtcctcg 23520
atgcgctcca cctggtcatg ctttgcctgc acgtagagcg caagggtctg ctggtaggtc 23580
tgctcgatgg gcgcggattc taagagggcc tgctgttccg tctcggcctc ctgggccgcc 23640
tgtagcaaat cctcgccgct gttgccgctg gactgcttta ctgccgggga ctgctgttgc 23700
cctgctcgcg ccgtcgtcgc agttcggctt gcccccactc gattgactgc ttcatttcga 23760
gccgcagcga tgcgatctcg gattgcgtca acggacgggg cagcgcggag gtgtccggct 23820
tctccttggg tgagtcggtc gatgccatag ccaaaggttt ccttccaaaa tgcgtccatt 23880
gctggaccgt gtttctcatt gatgcccgca agcatcttcg gcttgaccgc caggtcaagc 23940
gcgccttcat gggcggtcat gacggacgcc gccatgacct tgccgccgtt gttctcgatg 24000
tagccgcgta atgaggcaat ggtgccgccc atcgtcagcg tgtcatcgac aacgatgtac 24060
ttctggccgg ggatcacctc cccctcgaaa gtcgggttga acgccaggcg atgatctgaa 24120
ccggctccgg ttcgggcgac cttctcccgc tgcacaatgt ccgtttcgac ctcaaggcca 24180
aggcggtcgg ccagaacgac cgccatcatg gccggaatct tgttgttccc cgccgcctcg 24240
acggcgagga ctggaacgat gcggggcttg tcgtcgccga tcagcgtctt gagctgggca 24300
acagtgtcgt ccgaaatcag gcgctcgacc aaattaagcg ccgcttccgc gtcgccctgc 24360
ttcgcagcct ggtattcagg ctcgttggtc aaagaaccaa ggtcgccgtt gcgaaccacc 24420
ttcgggaagt ctccccacgg tgcgcgctcg gctctgctgt agctgctcaa gacgcctccc 24480
tttttagccg ctaaaactct aacgagtgcg cccgcgactc aacttgacgc tttcggcact 24540
tacctgtgcc ttgccacttg cgtcataggt gatgcttttc gcactcccga tttcaggtac 24600
tttatcgaaa tctgaccggg cgtgcattac aaagttcttc cccacctgtt ggtaaatgct 24660
gccgctatct gcgtggacga tgctgccgtc gtggcgctgc gacttatcgg ccttttgggc 24720
catatagatg ttgtaaatgc caggtttcag ggccccggct ttatctacct tctggttcgt 24780
ccatgcgcct tggttctcgg tctggacaat tctttgccca ttcatgacca ggaggcggtg 24840
tttcattggg tgactcctga cggttgcctc tggtgttaaa cgtgtcctgg tcgcttgccg 24900
gctaaaaaaa agccgacctc ggcagttcga ggccggcttt ccctagagcc gggcgcgtca 24960
aggttgttcc atctatttta gtgaactgcg ttcgatttat cagttacttt cctcccgctt 25020
tgtgtttcct cccactcgtt tccgcgtcta gccgacccct caacatagcg gcctcttctt 25080
gggctgcctt tgcctcttgc cgcgcttcgt cacgctcggc ttgcaccgtc gtaaagcgct 25140
cggcctgcct ggccgcctct tgcgccgcca acttcctttg ctcctggtgg gcctcggcgt 25200
cggcctgcgc cttcgctttc accgctgcca actccgtgcg caaactctcc gcttcgcgcc 25260
tggtggcgtc gcgctcgccg cgaagcgcct gcatttcctg gttggccgcg tccagggtct 25320
tgcggctctc ttctttgaat gcgcgggcgt cctggtgagc gtagtccagc tcggcgcgca 25380
gctcctgcgc tcgacgctcc acctcgtcgg cccgctgcgt cgccagcgcg gcccgctgct 25440
cggctcctgc cagggcggtg cgtgcttcgg ccagggcttg ccgctggcgt gcggccagct 25500
cggccgcctc ggcggcctgc tgctctagca atgtaacgcg cgcctgggct tcttccagct 25560
cgcgggcctg cgcctcgaag gcgtcggcca gctccccgcg cacggcttcc aactcgttgc 25620
gctcacgatc ccagccggct tgcgctgcct gcaacgattc attggcaagg gcctgggcgg 25680
cttgccagag ggcggccacg gcctggttgc cggcctgctg caccgcgtcc ggcacctgga 25740
ctgccagcgg ggcggcctgc gccgtgcgct ggcgtcgcca ttcgcgcatg ccggcgctgg 25800
cgtcgttcat gttgacgcgg gcggccttac gcactgcatc cacggtcggg aagttctccc 25860
ggtcgccttg ctcgaacagc tcgtccgcag ccgcaaaaat gcggtcgcgc gtctctttgt 25920
tcagttccat gttggctccg gtaattggta agaataataa tactcttacc taccttatca 25980
gcgcaagagt ttagctgaac agttctcgac ttaacggcag gttttttagc ggctgaaggg 26040
caggcaaaaa aagccccgca cggtcggcgg gggcaaaggg tcagcgggaa ggggattagc 26100
gggcgtcggg cttcttcatg cgtcggggcc gcgcttcttg ggatggagca cgacgaagcg 26160
cgcacgcgca tcgtcctcgg ccctatcggc ccgcgtcgcg gtcaggaact tgtcgcgcgc 26220
taggtcctcc ctggtgggca ccaggggcat gaactcggcc tgctcgatgt aggtccactc 26280
catgaccgca tcgcagtcga ggccgcgttc cttcaccgtc tcttgcaggt cgcggtacgc 26340
ccgctcgttg agcggctggt aacgggccaa ttggtcgtaa atggctgtcg gccatgagcg 26400
gcctttcctg ttgagccagc agccgacgac gaagccggca atgcaggccc ctggcacaac 26460
caggccgacg ccgggggcag gggatggcag cagctcgcca accaggaacc ccgccgcgat 26520
gatgccgatg ccggtcaacc agcccttgaa actatccggc cccgaaacac ccctgcgcat 26580
tgcctggatg ctgcgccgga tagcttgcaa catcaggagc cgtttctttt gttcgtcagt 26640
catggtccgc cctcaccagt tgttcgtatc ggtgtcggac gaactgaaat cgcaagagct 26700
gccggtatcg gtccagccgc tgtccgtgtc gctgctgccg aagcacggcg aggggtccgc 26760
gaacgccgca gacggcgtat ccggccgcag cgcatcgccc agcatggccc cggtcagcga 26820
gccgccggcc aggtagccca gcatggtgct gttggtcgcc ccggccacca gggccgacgt 26880
gacgaaatcg ccgtcattcc ctctggattg ttcgctgctc ggcggggcag tgcgccgcgc 26940
cggcggcgtc gtggatggct cgggttggct ggcctgcgac ggccggcgaa aggtgcgcag 27000
cagctcgtta tcgaccggct gcggcgtcgg ggccgccgcc ttgcgctgcg gtcggtgttc 27060
cttcttcggc tcgcgcagct tgaacagcat gatcgcggaa accagcagca acgccgcgcc 27120
tacgcctccc gcgatgtaga acagcatcgg attcattctt cggtcctcct tgtagcggaa 27180
ccgttgtctg tgcggcgcgg gtggcccgcg ccgctgtctt tggggatcag ccctcgatga 27240
gcgcgaccag tttcacgtcg gcaaggttcg cctcgaactc ctggccgtcg tcctcgtact 27300
tcaaccaggc atagccttcc gccggcggcc gacggttgag gataaggcgg gcagggcgct 27360
cgtcgtgctc gacctggacg atggcctttt tcagcttgtc cgggtccggc tccttcgcgc 27420
ccttttcctt ggcgtcctta ccgtcctggt cgccgtcctc gccgtcctgg ccgtcgccgg 27480
cctccgcgtc acgctcggca tcagtctggc cgttgaaggc atcgacggtg ttgggatcgc 27540
ggcccttctc gtccaggaac tcgcgcagca gcttgaccgt gccgcgcgtg atttcctggg 27600
tgtcgtcgtc aagccacgcc tcgacttcct ccgggcgctt cttgaaggcc gtcaccagct 27660
cgttcaccac ggtcacgtcg cgcacgcggc cggtgttgaa cgcatcggcg atcttctccg 27720
gcaggtccag cagcgtgacg tgctgggtga tgaacgccgg cgacttgccg atttccttgg 27780
cgatatcgcc tttcttcttg cccttcgcca gctcgcggcc aatgaagtcg gcaatttcgc 27840
gcggggtcag ctcgttgcgt tgcaggttct cgataacctg gtcggcttcg ttgtagtcgt 27900
tgtcgatgaa cgccgggatg gacttcttgc cggcccactt cgagccacgg tagcggcggg 27960
cgccgtgatt gatgatatag cggcccggct gctcctggtt ctcgcgcacc gaaatgggtg 28020
acttcacccc gcgctctttg atcgtggcac cgatttccgc gatgctctcc ggggaaaagc 28080
cggggttgtc ggccgtccgc ggctgatgcg gatcttcgtc gatcaggtcc aggtccagct 28140
cgatagggcc ggaaccgccc tgagacgccg caggagcgtc caggaggctc gacaggtcgc 28200
cgatgctatc caaccccagg ccggacggct gcgccgcgcc tgcggcttcc tgagcggccg 28260
cagcggtgtt tttcttggtg gtcttggctt gagccgcagt cattgggaaa tctccatctt 28320
cgtgaacacg taatcagcca gggcgcgaac ctctttcgat gccttgcgcg cggccgtttt 28380
cttgatcttc cagaccggca caccggatgc gagggcatcg gcgatgctgc tgcgcaggcc 28440
aacggtggcc ggaatcatca tcttggggta cgcggccagc agctcggctt ggtggcgcgc 28500
gtggcgcgga ttccgcgcat cgaccttgct gggcaccatg ccaaggaatt gcagcttggc 28560
gttcttctgg cgcacgttcg caatggtcgt gaccatcttc ttgatgccct ggatgctgta 28620
cgcctcaagc tcgatggggg acagcacata gtcggccgcg aagagggcgg ccgccaggcc 28680
gacgccaagg gtcggggccg tgtcgatcag gcacacgtcg aagccttggt tcgccagggc 28740
cttgatgttc gccccgaaca gctcgcgggc gtcgtccagc gacagccgtt cggcgttcgc 28800
cagtaccggg ttggactcga tgagggcgag gcgcgcggcc tggccgtcgc cggctgcggg 28860
tgcggtttcg gtccagccgc cggcagggac agcgccgaac agcttgcttg catgcaggcc 28920
ggtagcaaag tccttgagcg tgtaggacgc attgccctgg gggtccaggt cgatcacggc 28980
aacccgcaag ccgcgctcga aaaagtcgaa ggcaagatgc acaagggtcg aagtcttgcc 29040
gacgccgcct ttctggttgg ccgtgaccaa agttttcatc gtttggtttc ctgttttttc 29100
ttggcgtccg cttcccactt ccggacgatg tacgcctgat gttccggcag aaccgccgtt 29160
acccgcgcgt acccctcggg caagttcttg tcctcgaacg cggcccacac gcgatgcacc 29220
gcttgcgaca ctgcgcccct ggtcagtccc agcgacgttg cgaacgtcgc ctgtggcttc 29280
ccatcgacta agacgccccg cgctatctcg atggtctgct gccccacttc cagcccctgg 29340
atcgcctcct ggaactggct ttcggtaagc cgtttcttca tggataacac ccataatttg 29400
ctccgcgcct tggttgaaca tagcggtgac agccgccagc acatgagaga agtttagcta 29460
aacatttctc gcacgtcaac acctttagcc gctaaaactc gtccttggcg taacaaaaca 29520
aaagcccgga aaccgggctt tcgtctcttg ccgcttatgg ctctgcaccc ggctccatca 29580
ccaacaggtc gcgcacgcgc ttcactcggt tgcggatcga cactgccagc ccaacaaagc 29640
cggttgccgc cgccgccagg atcgcgccga tgatgccggc cacaccggcc atcgcccacc 29700
aggtcgccgc cttccggttc cattcctgct ggtactgctt cgcaatgctg gacctcggct 29760
caccataggc tgaccgctcg atggcgtatg ccgcttctcc ccttggcgta aaacccagcg 29820
ccgcaggcgg cattgccatg ctgcccgccg ctttcccgac cacgacgcgc gcaccaggct 29880
tgcggtccag accttcggcc acggcgagct gcgcaaggac ataatcagcc gccgacttgg 29940
ctccacgcgc ctcgatcagc tcttgcactc gcgcgaaatc cttggcctcc acggccgcca 30000
tgaatcgcgc acgcggcgaa ggctccgcag ggccggcgtc gtgatcgccg ccgagaatgc 30060
ccttcaccaa gttcgacgac acgaaaatca tgctgacggc tatcaccatc atgcagacgg 30120
atcgcacgaa cccgctgaat tgaacacgag cacggcaccc gcgaccacta tgccaagaat 30180
gcccaaggta aaaattgccg gccccgccat gaagtccgtg aatgccccga cggccgaagt 30240
gaagggcagg ccgccaccca ggccgccgcc ctcactgccc ggcacctggt cgctgaatgt 30300
cgatgccagc acctgcggca cgtcaatgct tccgggcgtc gcgctcgggc tgatcgccca 30360
tcccgttact gccccgatcc cggcaatggc aaggactgcc agcgctgcca tttttggggt 30420
gaggccgttc gcggccgagg ggcgcagccc ctggggggat gggaggcccg cgttagcggg 30480
ccgggagggt tcgagaaggg ggggcacccc ccttcggcgt gcgcggtcac gcgcacaggg 30540
cgcagccctg gttaaaaaca aggtttataa atattggttt aaaagcaggt taaaagacag 30600
gttagcggtg gccgaaaaac gggcggaaac ccttgcaaat gctggatttt ctgcctgtgg 30660
acagcccctc aaatgtcaat aggtgcgccc ctcatctgtc agcactctgc ccctcaagtg 30720
tcaaggatcg cgcccctcat ctgtcagtag tcgcgcccct caagtgtcaa taccgcaggg 30780
cacttatccc caggcttgtc cacatcatct gtgggaaact cgcgtaaaat caggcgtttt 30840
cgccgatttg cgaggctggc cagctccacg tcgccggccg aaatcgagcc tgcccctcat 30900
ctgtcaacgc cgcgccgggt gagtcggccc ctcaagtgtc aacgtccgcc cctcatctgt 30960
cagtgagggc caagttttcc gcgaggtatc cacaacgccg gcggccgcgg tgtctcgcac 31020
acggcttcga cggcgtttct ggcgcgtttg cagggccata gacggccgcc agcccagcgg 31080
cgagggcaac cagcccggtg agcgtcggaa aggcgctgga agccccgtag cgacgcggag 31140
aggggcgaga caagccaagg gcgcaggctc gatgcgcagc acgacatagc cggttctcgc 31200
aaggacgaga atttccctgc ggtgcccctc aagtgtcaat gaaagtttcc aacgcgagcc 31260
attcgcgaga gccttgagtc cacgctagat gagagctttg ttgtaggtgg accagttggt 31320
gattttgaac ttttgctttg ccacggaacg gtctgcgttg tcgggaagat gcgtgatctg 31380
atccttcaac tcagcaaaag ttcgatttat tcaacaaagc cacgttgtgt ctcaaaatct 31440
ctgatgttac attgcacaag ataaaaatat atcatcatga acaataaaac tgtctgctta 31500
cataaacagt aatacaaggg gtgttatgag ccatattcaa cgggaaacgt cttgctcgac 31560
tctagagctc gttcctcgag gaacggtacc tgcggggaag cttacaataa tgtgtgttgt 31620
taagtcttgt tgcctgtcat cgtctgactg actttcgtca taaatcccgg cctccgtaac 31680
ccagctttgg gcaagctcac ggatttgatc cggcggaacg ggaatatcga gatgccgggc 31740
tgaacgctgc agttccagct ttccctttcg ggacaggtac tccagctgat tgattatctg 31800
ctgaagggtc ttggttccac ctcctggcac aatgcgaatg attacttgag cgcgatcggg 31860
catccaattt tctcccgtca ggtgcgtggt caagtgctac aaggcacctt tcagtaacga 31920
gcgaccgtcg atccgtcgcc gggatacgga caaaatggag cgcagtagtc catcgagggc 31980
ggcgaaagcc tcgccaaaag caatacgttc atctcgcaca gcctccagat ccgatcgagg 32040
gtcttcggcg taggcagata gaagcatgga tacattgctt gagagtattc cgatggactg 32100
aagtatggct tccatctttt ctcgtgtgtc tgcatctatt tcgagaaagc ccccgatgcg 32160
gcgcaccgca acgcgaattg ccatactatc cgaaagtccc agcaggcgcg cttgatagga 32220
aaaggtttca tactcggccg atcgcagacg ggcactcacg accttgaacc cttcaacttt 32280
cagggatcga tgctggttga tggtagtctc actcgacgtg gctctggtgt gttttgacat 32340
agcttcctcc aaagaaagcg gaaggtctgg atactccagc acgaaatgtg cccgggtaga 32400
cggatggaag tctagccctg ctcaatatga aatcaacagt acatttacag tcaatactga 32460
atatacttgc tacatttgca attgtcttat aacgaatgtg aaataaaaat agtgtaacaa 32520
cgcttttact catcgataat cacaaaaaca tttatacgaa caaaaataca aatgcactcc 32580
ggtttcacag gataggcggg atcagaatat gcaacttttg acgttttgtt ctttcaaagg 32640
gggtgctggc aaaaccaccg cactcatggg cctttgcgct gctttggcaa atgacggtaa 32700
acgagtggcc ctctttgatg ccgacgaaaa ccggcctctg acgcgatgga gagaaaacgc 32760
cttacaaagc agtactggga tcctcgctgt gaagtctatt ccgccgacga aatgcccctt 32820
cttgaagcag cctatgaaaa tgccgagctc gaaggatttg attatgcgtt ggccgatacg 32880
cgtggcggct cgagcgagct caacaacaca atcatcgcta gctcaaacct gcttctgatc 32940
cccaccatgc taacgccgct cgacatcgat gaggcactat ctacctaccg ctacgtcatc 33000
gagctgctgt tgagtgaaaa tttggcaatt cctacagctg ttttgcgcca acgcgtcccg 33060
gtcggccgat tgacaacatc gcaacgcagg atgtcagaga cgctagagag ccttccagtt 33120
gtaccgtctc ccatgcatga aagagatgca tttgccgcga tgaaagaacg cggcatgttg 33180
catcttacat tactaaacac gggaactgat ccgacgatgc gcctcataga gaggaatctt 33240
cggattgcga tggaggaagt cgtggtcatt tcgaaactga tcagcaaaat cttggaggct 33300
tgaagatggc aattcgcaag cccgcattgt cggtcggcga agcacggcgg cttgctggtg 33360
ctcgacccga gatccaccat cccaacccga cacttgttcc ccagaagctg gacctccagc 33420
acttgcctga aaaagccgac gagaaagacc agcaacgtga gcctctcgtc gccgatcaca 33480
tttacagtcc cgatcgacaa cttaagctaa ctgtggatgc ccttagtcca cctccgtccc 33540
cgaaaaagct ccaggttttt ctttcagcgc gaccgcccgc gcctcaagtg tcgaaaacat 33600
atgacaacct cgttcggcaa tacagtccct cgaagtcgct acaaatgatt ttaaggcgcg 33660
cgttggacga tttcgaaagc atgctggcag atggatcatt tcgcgtggcc ccgaaaagtt 33720
atccgatccc ttcaactaca gaaaaatccg ttctcgttca gacctcacgc atgttcccgg 33780
ttgcgttgct cgaggtcgct cgaagtcatt ttgatccgtt ggggttggag accgctcgag 33840
ctttcggcca caagctggct accgccgcgc tcgcgtcatt ctttgctgga gagaagccat 33900
cgagcaattg gtgaagaggg acctatcgga acccctcacc aaatattgag tgtaggtttg 33960
aggccgctgg ccgcgtcctc agtcaccttt tgagccagat aattaagagc caaatgcaat 34020
tggctcaggc tgccatcgtc cccccgtgcg aaacctgcac gtccgcgtca aagaaataac 34080
cggcacctct tgctgttttt atcagttgag ggcttgacgg atccgcctca agtttgcggc 34140
gcagccgcaa aatgagaaca tctatactcc tgtcgtaaac ctcctcgtcg cgtactcgac 34200
tggcaatgag aagttgctcg cgcgatagaa cgtcgcgggg tttctctaaa aacgcgagga 34260
gaagattgaa ctcacctgcc gtaagtttca cctcaccgcc agcttcggac atcaagcgac 34320
gttgcctgag attaagtgtc cagtcagtaa aacaaaaaga ccgtcggtct ttggagcgga 34380
caacgttggg gcgcacgcgc aaggcaaccc gaatgcgtgc aagaaactct ctcgtactaa 34440
acggcttagc gataaaatca cttgctccta gctcgagtgc aacaacttta tccgtctcct 34500
caaggcggtc gccactgata attatgattg gaatatcaga ctttgccgcc agatttcgaa 34560
cgatctcaag cccatcttca cgacctaaat ttagatcaac aaccacgaca tcgaccgtcg 34620
cggaagagag tactctagtg aactgggtgc tgtcggctac cgcggtcact ttgaaggcgt 34680
ggatcgtaag gtattcgata ataagatgcc gcatagcgac atcgtcatcg ataagaagaa 34740
cgtgtttcaa cggctcacct ttcaatctaa aatctgaacc cttgttcaca gcgcttgaga 34800
aattttcacg tgaaggatgt acaatcatct ccagctaaat gggcagttcg tcagaattgc 34860
ggctgaccgc ggatgacgaa aatgcgaacc aagtatttca attttatgac aaaagttctc 34920
aatcgttgtt acaagtgaaa cgcttcgagg ttacagctac tattgattaa ggagatcgcc 34980
tatggtctcg ccccggcgtc gtgcgtccgc cgcgagccag atctcgccta cttcataaac 35040
gtcctcatag gcacggaatg gaatgatgac atcgatcgcc gtagagagca tgtcaatcag 35100
tgtgcgatct tccaagctag caccttgggc gctacttttg acaagggaaa acagtttctt 35160
gaatccttgg attggattcg cgccgtgtat tgttgaaatc gatcccggat gtcccgagac 35220
gacttcactc agataagccc atgctgcatc gtcgcgcatc tcgccaagca atatccggtc 35280
cggccgcata cgcagacttg cttggagcaa gtgctcggcg ctcacagcac ccagcccagc 35340
accgttcttg gagtagagta gtctaacatg attatcgtgt ggaatgacga gttcgagcgt 35400
atcttctatg gtgattagcc tttcctgggg ggggatggcg ctgatcaagg tcttgctcat 35460
tgttgtcttg ccgcttccgg tagggccaca tagcaacatc gtcagtcggc tgacgacgca 35520
tgcgtgcaga aacgcttcca aatccccgtt gtcaaaatgc tgaaggatag cttcatcatc 35580
ctgattttgg cgtttccttc gtgtctgcca ctggttccac ctcgaagcat cataacggga 35640
ggagacttct ttaagaccag aaacacgcga gcttggccgt cgaatggtca agctgacggt 35700
gcccgaggga acggtcggcg gcagacagat ttgtagtcgt tcaccaccag gaagttcagt 35760
ggcgcagagg gggttacgtg gtccgacatc ctgctttctc agcgcgcccg ctaaaatagc 35820
gatatcttca agatcatcat aagagacggg caaaggcatc ttggtaaaaa tgccggcttg 35880
gcgcacaaat gcctctccag gtcgattgat cgcaatttct tcagtcttcg ggtcatcgag 35940
ccattccaaa atcggcttca gaagaaagcg tagttgcgga tccacttcca tttacaatgt 36000
atcctatctc taagcggaaa tttgaattca ttaagagcgg cggttcctcc cccgcgtggc 36060
gccgccagtc aggcggagct ggtaaacacc aaagaaatcg aggtcccgtg ctacgaaaat 36120
ggaaacggtg tcaccctgat tcttcttcag ggttggcggt atgttgatgg ttgccttaag 36180
ggctgtctca gttgtctgct caccgttatt ttgaaagctg ttgaagctca tcccgccacc 36240
cgagctgccg gcgtaggtgc tagctgcctg gaaggcgcct tgaacaacac tcaagagcat 36300
agctccgcta aaacgctgcc agaagtggct gtcgaccgag cccggcaatc ctgagcgacc 36360
gagttcgtcc gcgcttggcg atgttaacga gatcatcgca tggtcaggtg tctcggcgcg 36420
atcccacaac acaaaaacgc gcccatctcc ctgttgcaag ccacgctgta tttcgccaac 36480
aacggtggtg ccacgatcaa gaagcacgat attgttcgtt gttccacgaa tatcctgagg 36540
caagacacac tttacatagc ctgccaaatt tgtgtcgatt gcggtttgca agatgcacgg 36600
aattattgtc ccttgcgtta ccataaaatc ggggtgcggc aagagcgtgg cgctgctggg 36660
ctgcagctcg gtgggtttca tacgtatcga caaatcgttc tcgccggaca cttcgccatt 36720
cggcaaggag ttgtcgtcac gcttgccttc ttgtcttcgg cccgtgtcgc cctgaatggc 36780
gcgtttgctg accccttgat cgccgctgct atatgcaaaa atcggtgttt cttccggccg 36840
tggctcatgc cgctccggtt cgcccctcgg cggtagagga gcagcaggct gaacagcctc 36900
ttgaaccgct ggaggatccg gcggcacctc aatcggagct ggatgaaatg gcttggtgtt 36960
tgttgcgatc aaagttgacg gcgatgcgtt ctcattcacc ttcttttggc gcccacctag 37020
ccaaatgagg cttaatgata acgcgagaac gacacctccg acgatcaatt tctgagaccc 37080
cgaaagacgc cggcgatgtt tgtcggagac cagggatcca gatgcatcaa cctcatgtgc 37140
cgcttgctga ctatcgttat tcatcccttc gcccccttca ggacgcgttt cacatcgggc 37200
ctcaccgtgc ccgtttgcgg cctttggcca acgggatcgt aagcggtgtt ccagatacat 37260
agtactgtgt ggccatccct cagacgccaa cctcgggaaa ccgaagaaat ctcgacatcg 37320
ctccctttaa ctgaatagtt ggcaacagct tccttgccat caggattgat ggtgtagatg 37380
gagggtatgc gtacattgcc cggaaagtgg aataccgtcg taaatccatt gtcgaagact 37440
tcgagtggca acagcgaacg atcgccttgg gcgacgtagt gccaattact gtccgccgca 37500
ccaagggctg tgacaggctg atccaataaa ttctcagctt tccgttgata ttgtgcttcc 37560
gcgtgtagtc tgtccacaac agccttctgt tgtgcctccc ttcgccgagc cgccgcatcg 37620
tcggcggggt aggcgaattg gacgctgtaa tagagatcgg gctgctcttt atcgaggtgg 37680
gacagagtct tggaacttat actgaaaaca taacggcgca tcccggagtc gcttgcggtt 37740
agcacgatta ctggctgagg cgtgaggacc tggcttgcct tgaaaaatag ataatttccc 37800
cgcggtaggg ctgctagatc tttgctattt gaaacggcaa ccgctgtcac cgtttcgttc 37860
gtggcgaatg ttacgaccaa agtagctcca accgccgtcg agaggcgcac cacttgatcg 37920
ggattgtaag ccaaataacg catgcgcgga tctagcttgc ccgccattgg agtgtcttca 37980
gcctccgcac cagtcgcagc ggcaaataaa catgctaaaa tgaaaagtgc ttttctgatc 38040
atggttcgct gtggcctacg tttgaaacgg tatcttccga tgtctgatag gaggtgacaa 38100
ccagacctgc cgggttggtt agtctcaatc tgccgggcaa gctggtcacc ttttcgtagc 38160
gaactgtcgc ggtccacgta ctcaccacag gcattttgcc gtcaacgacg agggtccttt 38220
tatagcgaat ttgctgcgtg cttggagtta catcatttga agcgatgtgc tcgacctcca 38280
ccctgccgcg tttgccaaga atgacttgag gcgaactggg attgggatag ttgaagaatt 38340
gctggtaatc ctggcgcact gttggggcac tgaagttcga taccaggtcg taggcgtact 38400
gagcggtgtc ggcatcataa ctctcgcgca ggcgaacgta ctcccacaat gaggcgttaa 38460
cgacggcctc ctcttgagtt gcaggcaatc gcgagacaga cacctcgctg tcaacggtgc 38520
cgtccggccg tatccataga tatacgggca caagcctgct caacggcacc attgtggcta 38580
tagcgaacgc ttgagcaaca tttcccaaaa tcgcgatagc tgcgacagct gcaatgagtt 38640
tggagagacg tcgcgccgat ttcgctcgcg cggtttgaaa ggcttctact tccttatagt 38700
gctcggcaag gctttcgcgc gccactagca tggcatattc aggccccgtc atagcgtcca 38760
cccgaattgc cgagctgaag atctgacgga gtaggctgcc atcgccccac attcagcggg 38820
aagatcgggc ctttgcagct cgctaatgtg tcgtttgtct ggcagccgct caaagcgaca 38880
actaggcaca gcaggcaata cttcatagaa ttctccattg aggcgaattt ttgcgcgacc 38940
tagcctcgct caacctgagc gaagcgacgg tacaagctgc tggcagattg ggttgcgccg 39000
ctccagtaac tgcctccaat gttgccggcg atcgccggca aagcgacaat gagcgcatcc 39060
cctgtcagaa aaaacatatc gagttcgtaa agaccaatga tcttggccgc ggtcgtaccg 39120
gcgaaggtga ttacaccaag cataagggtg agcgcagtcg cttcggttag gatgacgatc 39180
gttgccacga ggtttaagag gagaagcaag agaccgtagg tgataagttg cccgatccac 39240
ttagctgcga tgtcccgcgt gcgatcaaaa atatatccga cgaggatcag aggcccgatc 39300
gcgagaagca ctttcgtgag aattccaacg gcgtcgtaaa ctccgaaggc agaccagagc 39360
gtgccgtaaa ggacccactg tgccccttgg aaagcaagga tgtcctggtc gttcatcgga 39420
ccgatttcgg atgcgatttt ctgaaaaacg gcctgggtca cggcgaacat tgtatccaac 39480
tgtgccggaa cagtctgcag aggcaagccg gttacactaa actgctgaac aaagtttggg 39540
accgtctttt cgaagatgga aaccacatag tcttggtagt tagcctgccc aacaattaga 39600
gcaacaacga tggtgaccgt gatcacccga gtgataccgc tacgggtatc gacttcgccg 39660
cgtatgacta aaataccctg aacaataatc caaagagtga cacaggcgat caatggcgca 39720
ctcaccgcct cctggatagt ctcaagcatc gagtccaagc ctgtcgtgaa ggctacatcg 39780
aagatcgtat gaatggccgt aaacggcgcc ggaatcgtga aattcatcga ttggacctga 39840
acttgactgg tttgtcgcat aatgttggat aaaatgagct cgcattcggc gaggatgcgg 39900
gcggatgaac aaatcgccca gccttagggg agggcaccaa agatgacagc ggtcttttga 39960
tgctccttgc gttgagcggc cgcctcttcc gcctcgtgaa ggccggcctg cgcggtagtc 40020
atcgttaata ggcttgtcgc ctgtacattt tgaatcattg cgtcatggat ctgcttgaga 40080
agcaaaccat tggtcacggt tgcctgcatg atattgcgag atcgggaaag ctgagcagac 40140
gtatcagcat tcgccgtcaa gcgtttgtcc atcgtttcca gattgtcagc cgcaatgcca 40200
gcgctgtttg cggaaccggt gatctgcgat cgcaacaggt ccgcttcagc atcactaccc 40260
acgactgcac gatctgtatc gctggtgatc gcacgtgccg tggtcgacat tggcattcgc 40320
ggcgaaaaca tttcattgtc taggtccttc gtcgaaggat actgattttt ctggttgagc 40380
gaagtcagta gtccagtaac gccgtaggcc gacgtcaaca tcgtaaccat cgctatagtc 40440
tgagtgagat tctccgcagt cgcgagcgca gtcgcgagcg tctcagcctc cgttgccggg 40500
tcgctaacaa caaactgcgc ccgcgcgggc tgaatatata gaaagctgca ggtcaaaact 40560
gttgcaataa gttgcgtcgt cttcatcgtt tcctacctta tcaatcttct gcctcgtggt 40620
gacgggccat gaattcgctg agccagccag atgagttgcc ttcttgtgcc tcgcgtagtc 40680
gagttgcaaa gcgcaccgtg ttggcacgcc ccgaaagcac ggcgacatat tcacgcatat 40740
cccgcagatc aaattcgcag atgacgcttc cactttctcg tttaagaaga aacttacggc 40800
tgccgaccgt catgtcttca cggatcgcct gaaattcctt ttcggtacat ttcagtccat 40860
cgacataagc cgatcgatct gcggttggtg atggatagaa aatcttcgtc atacattgcg 40920
caaccaagct ggctcctagc ggcgattcca gaacatgctc tggttgctgc gttgccagta 40980
ttagcatccc gttgtttttt cgaacggtca ggaggaattt gtcgacgaca gtcgaaaatt 41040
tagggtttaa caaataggcg cgaaactcat cgcagctcat cacaaaacgg cggccgtcga 41100
tcatggctcc aatccgatgc aggagatatg ctgcagcggg agcgcatact tcctcgtatt 41160
cgagaagatg cgtcatgtcg aagccggtaa tcgacggatc taactttact tcgtcaactt 41220
cgccgtcaaa tgcccagcca agcgcatggc cccggcacca gcgttggagc cgcgctcctg 41280
cgccttcggc gggcccatgc aacaaaaatt cacgtaaccc cgcgattgaa cgcatttgtg 41340
gatcaaacga gagctgacga tggataccac ggaccagacg gcggttctct tccggagaaa 41400
tcccaccccg accatcactc tcgatgagag ccacgatcca ttcgcgcaga aaatcgtgtg 41460
aggctgctgt gttttctagg ccacgcaacg gcgccaaccc gctgggtgtg cctctgtgaa 41520
gtgccaaata tgttcctcct gtggcgcgaa ccagcaattc gccaccccgg tccttgtcaa 41580
agaacacgac cgtacctgca cggtcgacca tgctctgttc gagcatggct agaacaaaca 41640
tcatgagcgt cgtcttaccc ctcccgatag gcccgaatat tgccgtcatg ccaacatcgt 41700
gctcatgcgg gatatagtcg aaaggcgttc cgccattggt acgaaatcgg gcaatcgcgt 41760
tgccccagtg gcctgagctg gcgccctctg gaaagttttc gaaagagaca aaccctgcga 41820
aattgcgtga agtgattgcg ccagggcgtg tgcgccactt aaaattcccc ggcaattggg 41880
accaataggc cgcttccata ccaatacctt cttggacaac cacggcacct gcatccgcca 41940
ttcgtgtccg agcccgcgcg cccctgtccc caagactatt gagatcgtct gcatagacgc 42000
aaaggctcaa atgatgtgag cccataacga attcgttgct cgcaagtgcg tcctcagcct 42060
cggataattt gccgatttga gtcacggctt tatcgccgga actcagcatc tggctcgatt 42120
tgaggctaag tttcgcgtgc gcttgcgggc gagtcaggaa cgaaaaactc tgcgtgagaa 42180
caagtggaaa atcgagggat agcagcgcgt tgagcatgcc cggccgtgtt tttgcagggt 42240
attcgcgaaa cgaatagatg gatccaacgt aactgtcttt tggcgttctg atctcgagtc 42300
ctcgcttgcc gcaaatgact ctgtcggtat aaatcgaagc gccgagtgag ccgctgacga 42360
ccggaaccgg tgtgaaccga ccagtcatga tcaaccgtag cgcttcgcca atttcggtga 42420
agagcacacc ctgcttctcg cggatgccaa gacgatgcag gccatacgct ttaagagagc 42480
cagcgacaac atgccaaaga tcttccatgt tcctgatctg gcccgtgaga tcgttttccc 42540
tttttccgct tagcttggtg aacctcctct ttaccttccc taaagccgcc tgtgggtaga 42600
caatcaacgt aaggaagtgt tcattgcgga ggagttggcc ggagagcacg cgctgttcaa 42660
aagcttcgtt caggctagcg gcgaaaacac tacggaagtg tcgcggcgcc gatgatggca 42720
cgtcggcatg acgtacgagg tgagcatata ttgacacatg atcatcagcg atattgcgca 42780
acagcgtgtt gaacgcacga caacgcgcat tgcgcatttc agtttcctca agctcgaatg 42840
caacgccatc aattctcgca atggtcatga tcgatccgtc ttcaagaagg acgatatggt 42900
cgctgaggtg gccaatataa gggagataga tctcaccgga tctttcggtc gttccactcg 42960
cgccgagcat cacaccattc ctctccctcg tgggggaacc ctaattggat ttgggctaac 43020
agtagcgccc ccccaaactg cactatcaat gcttcttccc gcggtccgca aaaatagcag 43080
gacgacgctc gccgcattgt agtctcgctc cacgatgagc cgggctgcaa accataacgg 43140
cacgagaacg acttcgtaga gcgggttctg aacgataacg atgacaaagc cggcgaacat 43200
catgaataac cctgccaatg tcagtggcac cccaagaaac aatgcgggcc gtgtggctgc 43260
gaggtaaagg gtcgattctt ccaaacgatc agccatcaac taccgccagt gagcgtttgg 43320
ccgaggaagc tcgccccaaa catgataaca atgccgccga cgacgccggc aaccagccca 43380
agcgaagccc gcccgaacat ccaggagatc ccgatagcga caatgccgag aacagcgagt 43440
gactggccga acggaccaag gataaacgtg catatattgt taaccattgt ggcggggtca 43500
gtgccgccac ccgcagattg cgctgcggcg ggtccggatg aggaaatgct ccatgcaatt 43560
gcaccgcaca agcttggggc gcagctcgat atcacgcgca tcatcgcatt cgagagcgag 43620
aggcgattta gatgtaaacg gtatctctca aagcatcgca tcaatgcgca cctccttagt 43680
ataagtcgaa taagacttga ttgtcgtctg cggatttgcc gttgtcctgg tgtggcggtg 43740
gcggagcgat taaaccgcca gcgccatcct cctgcgagcg gcgctgatat gacccccaaa 43800
catcccacgt ctcttcggat tttagcgcct cgtgatcgtc ttttggaggc tcgattaacg 43860
cgggcaccag cgattgagca gctgtttcaa cttttcgcac gtagccgttt gcaaaaccgc 43920
cgatgaaatt accggtgttg taagcggaga tcgcccgacg aagcgcaaat tgcttctcgt 43980
caatcgtttc gccgcctgca taacgacttt tcagcatgtt tgcagcggca gataatgatg 44040
tgcacgcctg gagcgcaccg tcaggtgtca gaccgagcat agaaaaattt cgagagttta 44100
tttgcatgag gccaacatcc agcgaatgcc gtgcatcgag acggtgcctg acgacttggg 44160
ttgcttggct gtgatcttgc cagtgaagcg tttcgccggt cgtgttgtca tgaatcgcta 44220
aaggatcaaa gcgactctcc accttagcta tcgccgcaag cgtagatgtc gcaactgatg 44280
gggcacactt gcgagcaaca tggtcaaact cagcagatga gagtggcgtg gcaaggctcg 44340
acgaacagaa ggagaccatc aaggcaagag aaagcgaccc cgatctctta agcatacctt 44400
atctccttag ctcgcaacta acaccgcctc tcccgttgga agaagtgcgt tgttttatgt 44460
tgaagattat cgggagggtc ggttactcga aaattttcaa ttgcttcttt atgatttcaa 44520
ttgaagcgag aaacctcgcc cggcgtcttg gaacgcaaca tggaccgaga accgcgcatc 44580
catgactaag caaccggatc gacctattca ggccgcagtt ggtcaggtca ggctcagaac 44640
gaaaatgctc ggcgaggtta cgctgtctgt aaacccattc gatgaacggg aagcttcctt 44700
ccgattgctc ttggcaggaa tattggccca tgcctgcttg cgctttgcaa atgctcttat 44760
cgcgttggta tcatatgcct tgtccgccag cagaaacgca ctctaagcga ttatttgtaa 44820
aaatgtttcg gtcatgcggc ggtcatgggc ttgacccgct gtcagcgcaa gacggatcgg 44880
tcaaccgtcg gcatcgacaa cagcgtgaat cttggtggtc aaaccgccac gggaacgtcc 44940
catacagcca tcgtcttgat cccgctgttt cccgtcgccg catgttggtg gacgcggaca 45000
caggaactgt caatcatgac gacattctat cgaaagcctt ggaaatcaca ctcagaatat 45060
gatcccagac gtctgcctca cgccatcgta caaagcgatt gtagcaggtt gtacaggaac 45120
cgtatcgatc aggaacgtct gcccagggcg ggcccgtccg gaagcgccac aagatgacat 45180
tgatcacccg cgtcaacgcg cggcacgcga cgcggcttat ttgggaacaa aggactgaac 45240
aacagtccat tcgaaatcgg tgacatcaaa gcggggacgg gttatcagtg gcctccaagt 45300
caagcctcaa tgaatcaaaa tcagaccgat ttgcaaacct gatttatgag tgtgcggcct 45360
aaatgatgaa atcgtccttc tagatcgcct ccgtggtgta gcaacacctc gcagtatcgc 45420
cgtgctgacc ttggccaggg aattgactgg caagggtgct ttcacatgac cgctcttttg 45480
gccgcgatag atgatttcgt tgctgctttg ggcacgtaga aggagagaag tcatatcgga 45540
gaaattcctc ctggcgcgag agcctgctct atcgcgacgg catcccactg tcgggaacag 45600
accggatcat tcacgaggcg aaagtcgtca acacatgcgt tataggcatc ttcccttgaa 45660
ggatgatctt gttgctgcca atctggaggt gcggcagccg caggcagatg cgatctcagc 45720
gcaacttgcg gcaaaacatc tcactcacct gaaaaccact agcgagtctc gcgatcagac 45780
gaaggccttt tacttaacga cacaatatcc gatgtctgca tcacaggcgt cgctatccca 45840
gtcaatacta aagcggtgca ggaactaaag attactgatg acttaggcgt gccacgaggc 45900
ctgagacgac gcgcgtagac agttttttga aatcattatc aaagtgatgg cctccgctga 45960
agcctatcac ctctgcgccg gtctgtcgga gagatgggca agcattatta cggtcttcgc 46020
gcccgtacat gcattggacg attgcagggt caatggatct gagatcatcc agaggattgc 46080
cgcccttacc ttccgtttcg agttggagcc agcccctaaa tgagacgaca tagtcgactt 46140
gatgtgacaa tgccaagaga gagatttgct taacccgatt tttttgctca agcgtaagcc 46200
tattgaagct tgccggcatg acgtccgcgc cgaaagaata tcctacaagt aaaacattct 46260
gcacaccgaa atgcttggtg tagacatcga ttatgtgacc aagatcctta gcagtttcgc 46320
ttggggaccg ctccgaccag aaataccgaa gtgaactgac gccaatgaca ggaatccctt 46380
ccgtctgcag ataggtacca tcgatagatc tgctgcctcg cgcgtttcgg tgatgacggt 46440
gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta agcggatgcc 46500
gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg gggcgcagcc 46560
atgacccagt cacgtagcga tagcggagtg tatactggct taactatgcg gcatcagagc 46620
agattgtact gagagtgcac catatgcggt gtgaaatacc gcacagatgc gtaaggagaa 46680
aataccgcat caggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc 46740
ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag 46800
gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa 46860
aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc 46920
gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc 46980
ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg 47040
cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg tatctcagtt 47100
cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc 47160
gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc 47220
cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag 47280
agttcttgaa gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg 47340
ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa 47400
ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag 47460
gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact 47520
cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa 47580
attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt 47640
accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag 47700
ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca 47760
gtgctgcaat gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc 47820
agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt 47880
ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg 47940
ttgttgccat tgctgcaggg gggggggggg ggggggactt ccattgttca ttccacggac 48000
aaaaacagag aaaggaaacg acagaggcca aaaagcctcg ctttcagcac ctgtcgtttc 48060
ctttcttttc agagggtatt ttaaataaaa acattaagtt atgacgaaga agaacggaaa 48120
cgccttaaac cggaaaattt tcataaatag cgaaaacccg cgaggtcgcc gccccgtagt 48180
cggatcaccg gaaaggaccc gtaaagtgat aatgattatc atctacatat cacaacgtgc 48240
gtggaggcca tcaaaccacg tcaaataatc aattatgacg caggtatcgt attaattgat 48300
ctgcatcaac ttaacgtaaa aacaacttca gacaatacaa atcagcgaca ctgaatacgg 48360
ggcaacctca tgtccccccc cccccccccc ctgcaggcat cgtggtgtca cgctcgtcgt 48420
ttggtatggc ttcattcagc tccggttccc aacgatcaag gcgagttaca tgatccccca 48480
tgttgtgcaa aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg 48540
ccgcagtgtt atcactcatg gttatggcag cactgcataa ttctcttact gtcatgccat 48600
ccgtaagatg cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta 48660
tgcggcgacc gagttgctct tgcccggcgt caacacggga taataccgcg ccacatagca 48720
gaactttaaa agtgctcatc attggaaaac gttcttcggg gcgaaaactc tcaaggatct 48780
taccgctgtt gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat 48840
cttttacttt caccagcgtt tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa 48900
agggaataag ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt 48960
gaagcattta tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa 49020
ataaacaaat aggggttccg cgcacatttc cccgaaaagt gccacctgac gtctaagaaa 49080
ccattattat catgacatta acctataaaa ataggcgtat cacgaggccc tttcgtcttc 49140
aagaattggt cgacgatctt gctgcgttcg gatattttcg tggagttccc gccacagacc 49200
cggattgaag gcgagatcca gcaactcgcg ccagatcatc ctgtgacgga actttggcgc 49260
gtgatgactg gccaggacgt cggccgaaag agcgacaagc agatcacgct tttcgacagc 49320
gtcggatttg cgatcgagga tttttcggcg ctgcgctacg tccgcgaccg cgttgaggga 49380
tcaagccaca gcagcccact cgaccttcta gccgacccag acgagccaag ggatcttttt 49440
ggaatgctgc tccgtcgtca ggctttccga cgtttgggtg gttgaacaga agtcattatc 49500
gtacggaatg ccaagcactc ccgaggggaa ccctgtggtt ggcatgcaca tacaaatgga 49560
cgaacggata aaccttttca cgccctttta aatatccgtt attctaataa acgctctttt 49620
ctcttaggtt tacccgccaa tatatcctgt caaacactga tagtttaaac tgaaggcggg 49680
aaacgacaat ctgatcatga gcggagaatt aagggagtca cgttatgacc cccgccgatg 49740
acgcgggaca agccgtttta cgtttggaac tgacagaacc gcaacgttga aggagccact 49800
cagcaagctg gtacgattgt aatacgactc actatagggc gaattgagcg ctgtttaaac 49860
gctcttcaac tggaagagcg gttacccgga ccgaagcttg catgcctgca g 49911
<210>7
<211>36909
<212>DNA
<213>人工序列
<220>
<223>载体
<400>7
tctagagctc gttcctcgag gcctcgaggc ctcgaggaac ggtacctgcg gggaagctta 60
caataatgtg tgttgttaag tcttgttgcc tgtcatcgtc tgactgactt tcgtcataaa 120
tcccggcctc cgtaacccag ctttgggcaa gctcacggat ttgatccggc ggaacgggaa 180
tatcgagatg ccgggctgaa cgctgcagtt ccagctttcc ctttcgggac aggtactcca 240
gctgattgat tatctgctga agggtcttgg ttccacctcc tggcacaatg cgaatgatta 300
cttgagcgcg atcgggcatc caattttctc ccgtcaggtg cgtggtcaag tgctacaagg 360
cacctttcag taacgagcga ccgtcgatcc gtcgccggga tacggacaaa atggagcgca 420
gtagtccatc gagggcggcg aaagcctcgc caaaagcaat acgttcatct cgcacagcct 480
ccagatccga tcgagggtct tcggcgtagg cagatagaag catggataca ttgcttgaga 540
gtattccgat ggactgaagt atggcttcca tcttttctcg tgtgtctgca tctatttcga 600
gaaagccccc gatgcggcgc accgcaacgc gaattgccat actatccgaa agtcccagca 660
ggcgcgcttg ataggaaaag gtttcatact cggccgatcg cagacgggca ctcacgacct 720
tgaacccttc aactttcagg gatcgatgct ggttgatggt agtctcactc gacgtggctc 780
tggtgtgttt tgacatagct tcctccaaag aaagcggaag gtctggatac tccagcacga 840
aatgtgcccg ggtagacgga tggaagtcta gccctgctca atatgaaatc aacagtacat 900
ttacagtcaa tactgaatat acttgctaca tttgcaattg tcttataacg aatgtgaaat 960
aaaaatagtg taacaacgct tttactcatc gataatcaca aaaacattta tacgaacaaa 1020
aatacaaatg cactccggtt tcacaggata ggcgggatca gaatatgcaa cttttgacgt 1080
tttgttcttt caaagggggt gctggcaaaa ccaccgcact catgggcctt tgcgctgctt 1140
tggcaaatga cggtaaacga gtggccctct ttgatgccga cgaaaaccgg cctctgacgc 1200
gatggagaga aaacgcctta caaagcagta ctgggatcct cgctgtgaag tctattccgc 1260
cgacgaaatg ccccttcttg aagcagccta tgaaaatgcc gagctcgaag gatttgatta 1320
tgcgttggcc gatacgcgtg gcggctcgag cgagctcaac aacacaatca tcgctagctc 1380
aaacctgctt ctgatcccca ccatgctaac gccgctcgac atcgatgagg cactatctac 1440
ctaccgctac gtcatcgagc tgctgttgag tgaaaatttg gcaattccta cagctgtttt 1500
gcgccaacgc gtcccggtcg gccgattgac aacatcgcaa cgcaggatgt cagagacgct 1560
agagagcctt ccagttgtac cgtctcccat gcatgaaaga gatgcatttg ccgcgatgaa 1620
agaacgcggc atgttgcatc ttacattact aaacacggga actgatccga cgatgcgcct 1680
catagagagg aatcttcgga ttgcgatgga ggaagtcgtg gtcatttcga aactgatcag 1740
caaaatcttg gaggcttgaa gatggcaatt cgcaagcccg cattgtcggt cggcgaagca 1800
cggcggcttg ctggtgctcg acccgagatc caccatccca acccgacact tgttccccag 1860
aagctggacc tccagcactt gcctgaaaaa gccgacgaga aagaccagca acgtgagcct 1920
ctcgtcgccg atcacattta cagtcccgat cgacaactta agctaactgt ggatgccctt 1980
agtccacctc cgtccccgaa aaagctccag gtttttcttt cagcgcgacc gcccgcgcct 2040
caagtgtcga aaacatatga caacctcgtt cggcaataca gtccctcgaa gtcgctacaa 2100
atgattttaa ggcgcgcgtt ggacgatttc gaaagcatgc tggcagatgg atcatttcgc 2160
gtggccccga aaagttatcc gatcccttca actacagaaa aatccgttct cgttcagacc 2220
tcacgcatgt tcccggttgc gttgctcgag gtcgctcgaa gtcattttga tccgttgggg 2280
ttggagaccg ctcgagcttt cggccacaag ctggctaccg ccgcgctcgc gtcattcttt 2340
gctggagaga agccatcgag caattggtga agagggacct atcggaaccc ctcaccaaat 2400
attgagtgta ggtttgaggc cgctggccgc gtcctcagtc accttttgag ccagataatt 2460
aagagccaaa tgcaattggc tcaggctgcc atcgtccccc cgtgcgaaac ctgcacgtcc 2520
gcgtcaaaga aataaccggc acctcttgct gtttttatca gttgagggct tgacggatcc 2580
gcctcaagtt tgcggcgcag ccgcaaaatg agaacatcta tactcctgtc gtaaacctcc 2640
tcgtcgcgta ctcgactggc aatgagaagt tgctcgcgcg atagaacgtc gcggggtttc 2700
tctaaaaacg cgaggagaag attgaactca cctgccgtaa gtttcacctc accgccagct 2760
tcggacatca agcgacgttg cctgagatta agtgtccagt cagtaaaaca aaaagaccgt 2820
cggtctttgg agcggacaac gttggggcgc acgcgcaagg caacccgaat gcgtgcaaga 2880
aactctctcg tactaaacgg cttagcgata aaatcacttg ctcctagctc gagtgcaaca 2940
actttatccg tctcctcaag gcggtcgcca ctgataatta tgattggaat atcagacttt 3000
gccgccagat ttcgaacgat ctcaagccca tcttcacgac ctaaatttag atcaacaacc 3060
acgacatcga ccgtcgcgga agagagtact ctagtgaact gggtgctgtc ggctaccgcg 3120
gtcactttga aggcgtggat cgtaaggtat tcgataataa gatgccgcat agcgacatcg 3180
tcatcgataa gaagaacgtg tttcaacggc tcacctttca atctaaaatc tgaacccttg 3240
ttcacagcgc ttgagaaatt ttcacgtgaa ggatgtacaa tcatctccag ctaaatgggc 3300
agttcgtcag aattgcggct gaccgcggat gacgaaaatg cgaaccaagt atttcaattt 3360
tatgacaaaa gttctcaatc gttgttacaa gtgaaacgct tcgaggttac agctactatt 3420
gattaaggag atcgcctatg gtctcgcccc ggcgtcgtgc gtccgccgcg agccagatct 3480
cgcctacttc ataaacgtcc tcataggcac ggaatggaat gatgacatcg atcgccgtag 3540
agagcatgtc aatcagtgtg cgatcttcca agctagcacc ttgggcgcta cttttgacaa 3600
gggaaaacag tttcttgaat ccttggattg gattcgcgcc gtgtattgtt gaaatcgatc 3660
ccggatgtcc cgagacgact tcactcagat aagcccatgc tgcatcgtcg cgcatctcgc 3720
caagcaatat ccggtccggc cgcatacgca gacttgcttg gagcaagtgc tcggcgctca 3780
cagcacccag cccagcaccg ttcttggagt agagtagtct aacatgatta tcgtgtggaa 3840
tgacgagttc gagcgtatct tctatggtga ttagcctttc ctgggggggg atggcgctga 3900
tcaaggtctt gctcattgtt gtcttgccgc ttccggtagg gccacatagc aacatcgtca 3960
gtcggctgac gacgcatgcg tgcagaaacg cttccaaatc cccgttgtca aaatgctgaa 4020
ggatagcttc atcatcctga ttttggcgtt tccttcgtgt ctgccactgg ttccacctcg 4080
aagcatcata acgggaggag acttctttaa gaccagaaac acgcgagctt ggccgtcgaa 4140
tggtcaagct gacggtgccc gagggaacgg tcggcggcag acagatttgt agtcgttcac 4200
caccaggaag ttcagtggcg cagagggggt tacgtggtcc gacatcctgc tttctcagcg 4260
cgcccgctaa aatagcgata tcttcaagat catcataaga gacgggcaaa ggcatcttgg 4320
taaaaatgcc ggcttggcgc acaaatgcct ctccaggtcg attgatcgca atttcttcag 4380
tcttcgggtc atcgagccat tccaaaatcg gcttcagaag aaagcgtagt tgcggatcca 4440
cttccattta caatgtatcc tatctctaag cggaaatttg aattcattaa gagcggcggt 4500
tcctcccccg cgtggcgccg ccagtcaggc ggagctggta aacaccaaag aaatcgaggt 4560
cccgtgctac gaaaatggaa acggtgtcac cctgattctt cttcagggtt ggcggtatgt 4620
tgatggttgc cttaagggct gtctcagttg tctgctcacc gttattttga aagctgttga 4680
agctcatccc gccacccgag ctgccggcgt aggtgctagc tgcctggaag gcgccttgaa 4740
caacactcaa gagcatagct ccgctaaaac gctgccagaa gtggctgtcg accgagcccg 4800
gcaatcctga gcgaccgagt tcgtccgcgc ttggcgatgt taacgagatc atcgcatggt 4860
caggtgtctc ggcgcgatcc cacaacacaa aaacgcgccc atctccctgt tgcaagccac 4920
gctgtatttc gccaacaacg gtggtgccac gatcaagaag cacgatattg ttcgttgttc 4980
cacgaatatc ctgaggcaag acacacttta catagcctgc caaatttgtg tcgattgcgg 5040
tttgcaagat gcacggaatt attgtccctt gcgttaccat aaaatcgggg tgcggcaaga 5100
gcgtggcgct gctgggctgc agctcggtgg gtttcatacg tatcgacaaa tcgttctcgc 5160
cggacacttc gccattcggc aaggagttgt cgtcacgctt gccttcttgt cttcggcccg 5220
tgtcgccctg aatggcgcgt ttgctgaccc cttgatcgcc gctgctatat gcaaaaatcg 5280
gtgtttcttc cggccgtggc tcatgccgct ccggttcgcc cctcggcggt agaggagcag 5340
caggctgaac agcctcttga accgctggag gatccggcgg cacctcaatc ggagctggat 5400
gaaatggctt ggtgtttgtt gcgatcaaag ttgacggcga tgcgttctca ttcaccttct 5460
tttggcgccc acctagccaa atgaggctta atgataacgc gagaacgaca cctccgacga 5520
tcaatttctg agaccccgaa agacgccggc gatgtttgtc ggagaccagg gatccagatg 5580
catcaacctc atgtgccgct tgctgactat cgttattcat cccttcgccc ccttcaggac 5640
gcgtttcaca tcgggcctca ccgtgcccgt ttgcggcctt tggccaacgg gatcgtaagc 5700
ggtgttccag atacatagta ctgtgtggcc atccctcaga cgccaacctc gggaaaccga 5760
agaaatctcg acatcgctcc ctttaactga atagttggca acagcttcct tgccatcagg 5820
attgatggtg tagatggagg gtatgcgtac attgcccgga aagtggaata ccgtcgtaaa 5880
tccattgtcg aagacttcga gtggcaacag cgaacgatcg ccttgggcga cgtagtgcca 5940
attactgtcc gccgcaccaa gggctgtgac aggctgatcc aataaattct cagctttccg 6000
ttgatattgt gcttccgcgt gtagtctgtc cacaacagcc ttctgttgtg cctcccttcg 6060
ccgagccgcc gcatcgtcgg cggggtaggc gaattggacg ctgtaataga gatcgggctg 6120
ctctttatcg aggtgggaca gagtcttgga acttatactg aaaacataac ggcgcatccc 6180
ggagtcgctt gcggttagca cgattactgg ctgaggcgtg aggacctggc ttgccttgaa 6240
aaatagataa tttccccgcg gtagggctgc tagatctttg ctatttgaaa cggcaaccgc 6300
tgtcaccgtt tcgttcgtgg cgaatgttac gaccaaagta gctccaaccg ccgtcgagag 6360
gcgcaccact tgatcgggat tgtaagccaa ataacgcatg cgcggatcta gcttgcccgc 6420
cattggagtg tcttcagcct ccgcaccagt cgcagcggca aataaacatg ctaaaatgaa 6480
aagtgctttt ctgatcatgg ttcgctgtgg cctacgtttg aaacggtatc ttccgatgtc 6540
tgataggagg tgacaaccag acctgccggg ttggttagtc tcaatctgcc gggcaagctg 6600
gtcacctttt cgtagcgaac tgtcgcggtc cacgtactca ccacaggcat tttgccgtca 6660
acgacgaggg tccttttata gcgaatttgc tgcgtgcttg gagttacatc atttgaagcg 6720
atgtgctcga cctccaccct gccgcgtttg ccaagaatga cttgaggcga actgggattg 6780
ggatagttga agaattgctg gtaatcctgg cgcactgttg gggcactgaa gttcgatacc 6840
aggtcgtagg cgtactgagc ggtgtcggca tcataactct cgcgcaggcg aacgtactcc 6900
cacaatgagg cgttaacgac ggcctcctct tgagttgcag gcaatcgcga gacagacacc 6960
tcgctgtcaa cggtgccgtc cggccgtatc catagatata cgggcacaag cctgctcaac 7020
ggcaccattg tggctatagc gaacgcttga gcaacatttc ccaaaatcgc gatagctgcg 7080
acagctgcaa tgagtttgga gagacgtcgc gccgatttcg ctcgcgcggt ttgaaaggct 7140
tctacttcct tatagtgctc ggcaaggctt tcgcgcgcca ctagcatggc atattcaggc 7200
cccgtcatag cgtccacccg aattgccgag ctgaagatct gacggagtag gctgccatcg 7260
ccccacattc agcgggaaga tcgggccttt gcagctcgct aatgtgtcgt ttgtctggca 7320
gccgctcaaa gcgacaacta ggcacagcag gcaatacttc atagaattct ccattgaggc 7380
gaatttttgc gcgacctagc ctcgctcaac ctgagcgaag cgacggtaca agctgctggc 7440
agattgggtt gcgccgctcc agtaactgcc tccaatgttg ccggcgatcg ccggcaaagc 7500
gacaatgagc gcatcccctg tcagaaaaaa catatcgagt tcgtaaagac caatgatctt 7560
ggccgcggtc gtaccggcga aggtgattac accaagcata agggtgagcg cagtcgcttc 7620
ggttaggatg acgatcgttg ccacgaggtt taagaggaga agcaagagac cgtaggtgat 7680
aagttgcccg atccacttag ctgcgatgtc ccgcgtgcga tcaaaaatat atccgacgag 7740
gatcagaggc ccgatcgcga gaagcacttt cgtgagaatt ccaacggcgt cgtaaactcc 7800
gaaggcagac cagagcgtgc cgtaaaggac ccactgtgcc ccttggaaag caaggatgtc 7860
ctggtcgttc atcggaccga tttcggatgc gattttctga aaaacggcct gggtcacggc 7920
gaacattgta tccaactgtg ccggaacagt ctgcagaggc aagccggtta cactaaactg 7980
ctgaacaaag tttgggaccg tcttttcgaa gatggaaacc acatagtctt ggtagttagc 8040
ctgcccaaca attagagcaa caacgatggt gaccgtgatc acccgagtga taccgctacg 8100
ggtatcgact tcgccgcgta tgactaaaat accctgaaca ataatccaaa gagtgacaca 8160
ggcgatcaat ggcgcactca ccgcctcctg gatagtctca agcatcgagt ccaagcctgt 8220
cgtgaaggct acatcgaaga tcgtatgaat ggccgtaaac ggcgccggaa tcgtgaaatt 8280
catcgattgg acctgaactt gactggtttg tcgcataatg ttggataaaa tgagctcgca 8340
ttcggcgagg atgcgggcgg atgaacaaat cgcccagcct taggggaggg caccaaagat 8400
gacagcggtc ttttgatgct ccttgcgttg agcggccgcc tcttccgcct cgtgaaggcc 8460
ggcctgcgcg gtagtcatcg ttaataggct tgtcgcctgt acattttgaa tcattgcgtc 8520
atggatctgc ttgagaagca aaccattggt cacggttgcc tgcatgatat tgcgagatcg 8580
ggaaagctga gcagacgtat cagcattcgc cgtcaagcgt ttgtccatcg tttccagatt 8640
gtcagccgca atgccagcgc tgtttgcgga accggtgatc tgcgatcgca acaggtccgc 8700
ttcagcatca ctacccacga ctgcacgatc tgtatcgctg gtgatcgcac gtgccgtggt 8760
cgacattggc attcgcggcg aaaacatttc attgtctagg tccttcgtcg aaggatactg 8820
atttttctgg ttgagcgaag tcagtagtcc agtaacgccg taggccgacg tcaacatcgt 8880
aaccatcgct atagtctgag tgagattctc cgcagtcgcg agcgcagtcg cgagcgtctc 8940
agcctccgtt gccgggtcgc taacaacaaa ctgcgcccgc gcgggctgaa tatatagaaa 9000
gctgcaggtc aaaactgttg caataagttg cgtcgtcttc atcgtttcct accttatcaa 9060
tcttctgcct cgtggtgacg ggccatgaat tcgctgagcc agccagatga gttgccttct 9120
tgtgcctcgc gtagtcgagt tgcaaagcgc accgtgttgg cacgccccga aagcacggcg 9180
acatattcac gcatatcccg cagatcaaat tcgcagatga cgcttccact ttctcgttta 9240
agaagaaact tacggctgcc gaccgtcatg tcttcacgga tcgcctgaaa ttccttttcg 9300
gtacatttca gtccatcgac ataagccgat cgatctgcgg ttggtgatgg atagaaaatc 9360
ttcgtcatac attgcgcaac caagctggct cctagcggcg attccagaac atgctctggt 9420
tgctgcgttg ccagtattag catcccgttg ttttttcgaa cggtcaggag gaatttgtcg 9480
acgacagtcg aaaatttagg gtttaacaaa taggcgcgaa actcatcgca gctcatcaca 9540
aaacggcggc cgtcgatcat ggctccaatc cgatgcagga gatatgctgc agcgggagcg 9600
catacttcct cgtattcgag aagatgcgtc atgtcgaagc cggtaatcga cggatctaac 9660
tttacttcgt caacttcgcc gtcaaatgcc cagccaagcg catggccccg gcaccagcgt 9720
tggagccgcg ctcctgcgcc ttcggcgggc ccatgcaaca aaaattcacg taaccccgcg 9780
attgaacgca tttgtggatc aaacgagagc tgacgatgga taccacggac cagacggcgg 9840
ttctcttccg gagaaatccc accccgacca tcactctcga tgagagccac gatccattcg 9900
cgcagaaaat cgtgtgaggc tgctgtgttt tctaggccac gcaacggcgc caacccgctg 9960
ggtgtgcctc tgtgaagtgc caaatatgtt cctcctgtgg cgcgaaccag caattcgcca 10020
ccccggtcct tgtcaaagaa cacgaccgta cctgcacggt cgaccatgct ctgttcgagc 10080
atggctagaa caaacatcat gagcgtcgtc ttacccctcc cgataggccc gaatattgcc 10140
gtcatgccaa catcgtgctc atgcgggata tagtcgaaag gcgttccgcc attggtacga 10200
aatcgggcaa tcgcgttgcc ccagtggcct gagctggcgc cctctggaaa gttttcgaaa 10260
gagacaaacc ctgcgaaatt gcgtgaagtg attgcgccag ggcgtgtgcg ccacttaaaa 10320
ttccccggca attgggacca ataggccgct tccataccaa taccttcttg gacaaccacg 10380
gcacctgcat ccgccattcg tgtccgagcc cgcgcgcccc tgtccccaag actattgaga 10440
tcgtctgcat agacgcaaag gctcaaatga tgtgagccca taacgaattc gttgctcgca 10500
agtgcgtcct cagcctcgga taatttgccg atttgagtca cggctttatc gccggaactc 10560
agcatctggc tcgatttgag gctaagtttc gcgtgcgctt gcgggcgagt caggaacgaa 10620
aaactctgcg tgagaacaag tggaaaatcg agggatagca gcgcgttgag catgcccggc 10680
cgtgtttttg cagggtattc gcgaaacgaa tagatggatc caacgtaact gtcttttggc 10740
gttctgatct cgagtcctcg cttgccgcaa atgactctgt cggtataaat cgaagcgccg 10800
agtgagccgc tgacgaccgg aaccggtgtg aaccgaccag tcatgatcaa ccgtagcgct 10860
tcgccaattt cggtgaagag cacaccctgc ttctcgcgga tgccaagacg atgcaggcca 10920
tacgctttaa gagagccagc gacaacatgc caaagatctt ccatgttcct gatctggccc 10980
gtgagatcgt tttccctttt tccgcttagc ttggtgaacc tcctctttac cttccctaaa 11040
gccgcctgtg ggtagacaat caacgtaagg aagtgttcat tgcggaggag ttggccggag 11100
agcacgcgct gttcaaaagc ttcgttcagg ctagcggcga aaacactacg gaagtgtcgc 11160
ggcgccgatg atggcacgtc ggcatgacgt acgaggtgag catatattga cacatgatca 11220
tcagcgatat tgcgcaacag cgtgttgaac gcacgacaac gcgcattgcg catttcagtt 11280
tcctcaagct cgaatgcaac gccatcaatt ctcgcaatgg tcatgatcga tccgtcttca 11340
agaaggacga tatggtcgct gaggtggcca atataaggga gatagatctc accggatctt 11400
tcggtcgttc cactcgcgcc gagcatcaca ccattcctct ccctcgtggg ggaaccctaa 11460
ttggatttgg gctaacagta gcgccccccc aaactgcact atcaatgctt cttcccgcgg 11520
tccgcaaaaa tagcaggacg acgctcgccg cattgtagtc tcgctccacg atgagccggg 11580
ctgcaaacca taacggcacg agaacgactt cgtagagcgg gttctgaacg ataacgatga 11640
caaagccggc gaacatcatg aataaccctg ccaatgtcag tggcacccca agaaacaatg 11700
cgggccgtgt ggctgcgagg taaagggtcg attcttccaa acgatcagcc atcaactacc 11760
gccagtgagc gtttggccga ggaagctcgc cccaaacatg ataacaatgc cgccgacgac 11820
gccggcaacc agcccaagcg aagcccgccc gaacatccag gagatcccga tagcgacaat 11880
gccgagaaca gcgagtgact ggccgaacgg accaaggata aacgtgcata tattgttaac 11940
cattgtggcg gggtcagtgc cgccacccgc agattgcgct gcggcgggtc cggatgagga 12000
aatgctccat gcaattgcac cgcacaagct tggggcgcag ctcgatatca cgcgcatcat 12060
cgcattcgag agcgagaggc gatttagatg taaacggtat ctctcaaagc atcgcatcaa 12120
tgcgcacctc cttagtataa gtcgaataag acttgattgt cgtctgcgga tttgccgttg 12180
tcctggtgtg gcggtggcgg agcgattaaa ccgccagcgc catcctcctg cgagcggcgc 12240
tgatatgacc cccaaacatc ccacgtctct tcggatttta gcgcctcgtg atcgtctttt 12300
ggaggctcga ttaacgcggg caccagcgat tgagcagctg tttcaacttt tcgcacgtag 12360
ccgtttgcaa aaccgccgat gaaattaccg gtgttgtaag cggagatcgc ccgacgaagc 12420
gcaaattgct tctcgtcaat cgtttcgccg cctgcataac gacttttcag catgtttgca 12480
gcggcagata atgatgtgca cgcctggagc gcaccgtcag gtgtcagacc gagcatagaa 12540
aaatttcgag agtttatttg catgaggcca acatccagcg aatgccgtgc atcgagacgg 12600
tgcctgacga cttgggttgc ttggctgtga tcttgccagt gaagcgtttc gccggtcgtg 12660
ttgtcatgaa tcgctaaagg atcaaagcga ctctccacct tagctatcgc cgcaagcgta 12720
gatgtcgcaa ctgatggggc acacttgcga gcaacatggt caaactcagc agatgagagt 12780
ggcgtggcaa ggctcgacga acagaaggag accatcaagg caagagaaag cgaccccgat 12840
ctcttaagca taccttatct ccttagctcg caactaacac cgcctctccc gttggaagaa 12900
gtgcgttgtt ttatgttgaa gattatcggg agggtcggtt actcgaaaat tttcaattgc 12960
ttctttatga tttcaattga agcgagaaac ctcgcccggc gtcttggaac gcaacatgga 13020
ccgagaaccg cgcatccatg actaagcaac cggatcgacc tattcaggcc gcagttggtc 13080
aggtcaggct cagaacgaaa atgctcggcg aggttacgct gtctgtaaac ccattcgatg 13140
aacgggaagc ttccttccga ttgctcttgg caggaatatt ggcccatgcc tgcttgcgct 13200
ttgcaaatgc tcttatcgcg ttggtatcat atgccttgtc cgccagcaga aacgcactct 13260
aagcgattat ttgtaaaaat gtttcggtca tgcggcggtc atgggcttga cccgctgtca 13320
gcgcaagacg gatcggtcaa ccgtcggcat cgacaacagc gtgaatcttg gtggtcaaac 13380
cgccacggga acgtcccata cagccatcgt cttgatcccg ctgtttcccg tcgccgcatg 13440
ttggtggacg cggacacagg aactgtcaat catgacgaca ttctatcgaa agccttggaa 13500
atcacactca gaatatgatc ccagacgtct gcctcacgcc atcgtacaaa gcgattgtag 13560
caggttgtac aggaaccgta tcgatcagga acgtctgccc agggcgggcc cgtccggaag 13620
cgccacaaga tgacattgat cacccgcgtc aacgcgcggc acgcgacgcg gcttatttgg 13680
gaacaaagga ctgaacaaca gtccattcga aatcggtgac atcaaagcgg ggacgggtta 13740
tcagtggcct ccaagtcaag cctcaatgaa tcaaaatcag accgatttgc aaacctgatt 13800
tatgagtgtg cggcctaaat gatgaaatcg tccttctaga tcgcctccgt ggtgtagcaa 13860
cacctcgcag tatcgccgtg ctgaccttgg ccagggaatt gactggcaag ggtgctttca 13920
catgaccgct cttttggccg cgatagatga tttcgttgct gctttgggca cgtagaagga 13980
gagaagtcat atcggagaaa ttcctcctgg cgcgagagcc tgctctatcg cgacggcatc 14040
ccactgtcgg gaacagaccg gatcattcac gaggcgaaag tcgtcaacac atgcgttata 14100
ggcatcttcc cttgaaggat gatcttgttg ctgccaatct ggaggtgcgg cagccgcagg 14160
cagatgcgat ctcagcgcaa cttgcggcaa aacatctcac tcacctgaaa accactagcg 14220
agtctcgcga tcagacgaag gccttttact taacgacaca atatccgatg tctgcatcac 14280
aggcgtcgct atcccagtca atactaaagc ggtgcaggaa ctaaagatta ctgatgactt 14340
aggcgtgcca cgaggcctga gacgacgcgc gtagacagtt ttttgaaatc attatcaaag 14400
tgatggcctc cgctgaagcc tatcacctct gcgccggtct gtcggagaga tgggcaagca 14460
ttattacggt cttcgcgccc gtacatgcat tggacgattg cagggtcaat ggatctgaga 14520
tcatccagag gattgccgcc cttaccttcc gtttcgagtt ggagccagcc cctaaatgag 14580
acgacatagt cgacttgatg tgacaatgcc aagagagaga tttgcttaac ccgatttttt 14640
tgctcaagcg taagcctatt gaagcttgcc ggcatgacgt ccgcgccgaa agaatatcct 14700
acaagtaaaa cattctgcac accgaaatgc ttggtgtaga catcgattat gtgaccaaga 14760
tccttagcag tttcgcttgg ggaccgctcc gaccagaaat accgaagtga actgacgcca 14820
atgacaggaa tcccttccgt ctgcagatag gtaccatcga tagatctgct gcctcgcgcg 14880
tttcggtgat gacggtgaaa acctctgaca catgcagctc ccggagacgg tcacagcttg 14940
tctgtaagcg gatgccggga gcagacaagc ccgtcagggc gcgtcagcgg gtgttggcgg 15000
gtgtcggggc gcagccatga cccagtcacg tagcgatagc ggagtgtata ctggcttaac 15060
tatgcggcat cagagcagat tgtactgaga gtgcaccata tgcggtgtga aataccgcac 15120
agatgcgtaa ggagaaaata ccgcatcagg cgctcttccg cttcctcgct cactgactcg 15180
ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg 15240
ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg ccagcaaaag 15300
gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg cccccctgac 15360
gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga 15420
taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt 15480
accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca tagctcacgc 15540
tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc 15600
cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc caacccggta 15660
agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag agcgaggtat 15720
gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac tagaaggaca 15780
gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt tggtagctct 15840
tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt 15900
acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg gtctgacgct 15960
cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc 16020
acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa 16080
acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta 16140
tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc 16200
ttaccatctg gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat 16260
ttatcagcaa taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta 16320
tccgcctcca tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt 16380
aatagtttgc gcaacgttgt tgccattgct gcaggggggg gggggggggg gttccattgt 16440
tcattccacg gacaaaaaca gagaaaggaa acgacagagg ccaaaaagct cgctttcagc 16500
acctgtcgtt tcctttcttt tcagagggta ttttaaataa aaacattaag ttatgacgaa 16560
gaagaacgga aacgccttaa accggaaaat tttcataaat agcgaaaacc cgcgaggtcg 16620
ccgccccgta acctgtcgga tcaccggaaa ggacccgtaa agtgataatg attatcatct 16680
acatatcaca acgtgcgtgg aggccatcaa accacgtcaa ataatcaatt atgacgcagg 16740
tatcgtatta attgatctgc atcaacttaa cgtaaaaaca acttcagaca atacaaatca 16800
gcgacactga atacggggca acctcatgtc cccccccccc ccccccctgc aggcatcgtg 16860
gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg gttcccaacg atcaaggcga 16920
gttacatgat cccccatgtt gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt 16980
gtcagaagta agttggccgc agtgttatca ctcatggtta tggcagcact gcataattct 17040
cttactgtca tgccatccgt aagatgcttt tctgtgactg gtgagtactc aaccaagtca 17100
ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc cggcgtcaac acgggataat 17160
accgcgccac atagcagaac tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga 17220
aaactctcaa ggatcttacc gctgttgaga tccagttcga tgtaacccac tcgtgcaccc 17280
aactgatctt cagcatcttt tactttcacc agcgtttctg ggtgagcaaa aacaggaagg 17340
caaaatgccg caaaaaaggg aataagggcg acacggaaat gttgaatact catactcttc 17400
ctttttcaat attattgaag catttatcag ggttattgtc tcatgagcgg atacatattt 17460
gaatgtattt agaaaaataa acaaataggg gttccgcgca catttccccg aaaagtgcca 17520
cctgacgtct aagaaaccat tattatcatg acattaacct ataaaaatag gcgtatcacg 17580
aggccctttc gtcttcaaga attcggagct tttgccattc tcaccggatt cagtcgtcac 17640
tcatggtgat ttctcacttg ataaccttat ttttgacgag gggaaattaa taggttgtat 17700
tgatgttgga cgagtcggaa tcgcagaccg ataccaggat cttgccatcc tatggaactg 17760
cctcggtgag ttttctcctt cattacagaa acggcttttt caaaaatatg gtattgataa 17820
tcctgatatg aataaattgc agtttcattt gatgctcgat gagtttttct aatcagaatt 17880
ggttaattgg ttgtaacact ggcagagcat tacgctgact tgacgggacg gcggctttgt 17940
tgaataaatc gaacttttgc tgagttgaag gatcagatca cgcatcttcc cgacaacgca 18000
gaccgttccg tggcaaagca aaagttcaaa atcaccaact ggtccaccta caacaaagct 18060
ctcatcaacc gtggctccct cactttctgg ctggatgatg gggcgattca ggcctggtat 18120
gagtcagcaa caccttcttc acgaggcaga cctcagcgcc agaaggccgc cagagaggcc 18180
gagcgcggcc gtgaggcttg gacgctaggg cagggcatga aaaagcccgt agcgggctgc 18240
tacgggcgtc tgacgcggtg gaaaggggga ggggatgttg tctacatggc tctgctgtag 18300
tgagtgggtt gcgctccggc agcggtcctg atcaatcgtc accctttctc ggtccttcaa 18360
cgttcctgac aacgagcctc cttttcgcca atccatcgac aatcaccgcg agtccctgct 18420
cgaacgctgc gtccggaccg gcttcgtcga aggcgtctat cgcggcccgc aacagcggcg 18480
agagcggagc ctgttcaacg gtgccgccgc gctcgccggc atcgctgtcg ccggcctgct 18540
cctcaagcac ggccccaaca gtgaagtagc tgattgtcat cagcgcattg acggcgtccc 18600
cggccgaaaa acccgcctcg cagaggaagc gaagctgcgc gtcggccgtt tccatctgcg 18660
gtgcgcccgg tcgcgtgccg gcatggatgc gcgcgccatc gcggtaggcg agcagcgcct 18720
gcctgaagct gcgggcattc ccgatcagaa atgagcgcca gtcgtcgtcg gctctcggca 18780
ccgaatgcgt atgattctcc gccagcatgg cttcggccag tgcgtcgagc agcgcccgct 18840
tgttcctgaa gtgccagtaa agcgccggct gctgaacccc caaccgttcc gccagtttgc 18900
gtgtcgtcag accgtctacg ccgacctcgt tcaacaggtc cagggcggca cggatcactg 18960
tattcggctg caactttgtc atgcttgaca ctttatcact gataaacata atatgtccac 19020
caacttatca gtgataaaga atccgcgcgt tcaatcggac cagcggaggc tggtccggag 19080
gccagacgtg aaacccaaca tacccctgat cgtaattctg agcactgtcg cgctcgacgc 19140
tgtcggcatc ggcctgatta tgccggtgct gccgggcctc ctgcgcgatc tggttcactc 19200
gaacgacgtc accgcccact atggcattct gctggcgctg tatgcgttgg tgcaatttgc 19260
ctgcgcacct gtgctgggcg cgctgtcgga tcgtttcggg cggcggccaa tcttgctcgt 19320
ctcgctggcc ggcgccactg tcgactacgc catcatggcg acagcgcctt tcctttgggt 19380
tctctatatc gggcggatcg tggccggcat caccggggcg actggggcgg tagccggcgc 19440
ttatattgcc gatatcactg atggcgatga gcgcgcgcgg cacttcggct tcatgagcgc 19500
ctgtttcggg ttcgggatgg tcgcgggacc tgtgctcggt gggctgatgg gcggtttctc 19560
cccccacgct ccgttcttcg ccgcggcagc cttgaacggc ctcaatttcc tgacgggctg 19620
tttccttttg ccggagtcgc acaaaggcga acgccggccg ttacgccggg aggctctcaa 19680
cccgctcgct tcgttccggt gggcccgggg catgaccgtc gtcgccgccc tgatggcggt 19740
cttcttcatc atgcaacttg tcggacaggt gccggccgcg ctttgggtca ttttcggcga 19800
ggatcgcttt cactgggacg cgaccacgat cggcatttcg cttgccgcat ttggcattct 19860
gcattcactc gcccaggcaa tgatcaccgg ccctgtagcc gcccggctcg gcgaaaggcg 19920
ggcactcatg ctcggaatga ttgccgacgg cacaggctac atcctgcttg ccttcgcgac 19980
acggggatgg atggcgttcc cgatcatggt cctgcttgct tcgggtggca tcggaatgcc 20040
ggcgctgcaa gcaatgttgt ccaggcaggt ggatgaggaa cgtcaggggc agctgcaagg 20100
ctcactggcg gcgctcacca gcctgacctc gatcgtcgga cccctcctct tcacggcgat 20160
ctatgcggct tctataacaa cgtggaacgg gtgggcatgg attgcaggcg ctgccctcta 20220
cttgctctgc ctgccggcgc tgcgtcgcgg gctttggagc ggcgcagggc aacgagccga 20280
tcgctgatcg tggaaacgat aggcctatgc catgcgggtc aaggcgactt ccggcaagct 20340
atacgcgccc taggagtgcg gttggaacgt tggcccagcc agatactccc gatcacgagc 20400
aggacgccga tgatttgaag cgcactcagc gtctgatcca agaacaacca tcctagcaac 20460
acggcggtcc ccgggctgag aaagcccagt aaggaaacaa ctgtaggttc gagtcgcgag 20520
atcccccgga accaaaggaa gtaggttaaa cccgctccga tcaggccgag ccacgccagg 20580
ccgagaacat tggttcctgt aggcatcggg attggcggat caaacactaa agctactgga 20640
acgagcagaa gtcctccggc cgccagttgc caggcggtaa aggtgagcag aggcacggga 20700
ggttgccact tgcgggtcag cacggttccg aacgccatgg aaaccgcccc cgccaggccc 20760
gctgcgacgc cgacaggatc tagcgctgcg tttggtgtca acaccaacag cgccacgccc 20820
gcagttccgc aaatagcccc caggaccgcc atcaatcgta tcgggctacc tagcagagcg 20880
gcagagatga acacgaccat cagcggctgc acagcgccta ccgtcgccgc gaccccgccc 20940
ggcaggcggt agaccgaaat aaacaacaag ctccagaata gcgaaatatt aagtgcgccg 21000
aggatgaaga tgcgcatcca ccagattccc gttggaatct gtcggacgat catcacgagc 21060
aataaacccg ccggcaacgc ccgcagcagc ataccggcga cccctcggcc tcgctgttcg 21120
ggctccacga aaacgccgga cagatgcgcc ttgtgagcgt ccttggggcc gtcctcctgt 21180
ttgaagaccg acagcccaat gatctcgccg tcgatgtagg cgccgaatgc cacggcatct 21240
cgcaaccgtt cagcgaacgc ctccatgggc tttttctcct cgtgctcgta aacggacccg 21300
aacatctctg gagctttctt cagggccgac aatcggatct cgcggaaatc ctgcacgtcg 21360
gccgctccaa gccgtcgaat ctgagcctta atcacaattg tcaattttaa tcctctgttt 21420
atcggcagtt cgtagagcgc gccgtgcgtc ccgagcgata ctgagcgaag caagtgcgtc 21480
gagcagtgcc cgcttgttcc tgaaatgcca gtaaagcgct ggctgctgaa cccccagccg 21540
gaactgaccc cacaaggccc tagcgtttgc aatgcaccag gtcatcattg acccaggcgt 21600
gttccaccag gccgctgcct cgcaactctt cgcaggcttc gccgacctgc tcgcgccact 21660
tcttcacgcg ggtggaatcc gatccgcaca tgaggcggaa ggtttccagc ttgagcgggt 21720
acggctcccg gtgcgagctg aaatagtcga acatccgtcg ggccgtcggc gacagcttgc 21780
ggtacttctc ccatatgaat ttcgtgtagt ggtcgccagc aaacagcacg acgatttcct 21840
cgtcgatcag gacctggcaa cgggacgttt tcttgccacg gtccaggacg cggaagcggt 21900
gcagcagcga caccgattcc aggtgcccaa cgcggtcgga cgtgaagccc atcgccgtcg 21960
cctgtaggcg cgacaggcat tcctcggcct tcgtgtaata ccggccattg atcgaccagc 22020
ccaggtcctg gcaaagctcg tagaacgtga aggtgatcgg ctcgccgata ggggtgcgct 22080
tcgcgtactc caacacctgc tgccacacca gttcgtcatc gtcggcccgc agctcgacgc 22140
cggtgtaggt gatcttcacg tccttgttga cgtggaaaat gaccttgttt tgcagcgcct 22200
cgcgcgggat tttcttgttg cgcgtggtga acagggcaga gcgggccgtg tcgtttggca 22260
tcgctcgcat cgtgtccggc cacggcgcaa tatcgaacaa ggaaagctgc atttccttga 22320
tctgctgctt cgtgtgtttc agcaacgcgg cctgcttggc ctcgctgacc tgttttgcca 22380
ggtcctcgcc ggcggttttt cgcttcttgg tcgtcatagt tcctcgcgtg tcgatggtca 22440
tcgacttcgc caaacctgcc gcctcctgtt cgagacgacg cgaacgctcc acggcggccg 22500
atggcgcggg cagggcaggg ggagccagtt gcacgctgtc gcgctcgatc ttggccgtag 22560
cttgctggac catcgagccg acggactgga aggtttcgcg gggcgcacgc atgacggtgc 22620
ggcttgcgat ggtttcggca tcctcggcgg aaaaccccgc gtcgatcagt tcttgcctgt 22680
atgccttccg gtcaaacgtc cgattcattc accctccttg cgggattgcc ccgactcacg 22740
ccggggcaat gtgcccttat tcctgatttg acccgcctgg tgccttggtg tccagataat 22800
ccaccttatc ggcaatgaag tcggtcccgt agaccgtctg gccgtccttc tcgtacttgg 22860
tattccgaat cttgccctgc acgaatacca gcgacccctt gcccaaatac ttgccgtggg 22920
cctcggcctg agagccaaaa cacttgatgc ggaagaagtc ggtgcgctcc tgcttgtcgc 22980
cggcatcgtt gcgccactct tcattaaccg ctatatcgaa aattgcttgc ggcttgttag 23040
aattgccatg acgtacctcg gtgtcacggg taagattacc gataaactgg aactgattat 23100
ggctcatatc gaaagtctcc ttgagaaagg agactctagt ttagctaaac attggttccg 23160
ctgtcaagaa ctttagcggc taaaattttg cgggccgcga ccaaaggtgc gaggggcggc 23220
ttccgctgtg tacaaccaga tatttttcac caacatcctt cgtctgctcg atgagcgggg 23280
catgacgaaa catgagctgt cggagagggc aggggtttca atttcgtttt tatcagactt 23340
aaccaacggt aaggccaacc cctcgttgaa ggtgatggag gccattgccg acgccctgga 23400
aactccccta cctcttctcc tggagtccac cgaccttgac cgcgaggcac tcgcggagat 23460
tgcgggtcat cctttcaaga gcagcgtgcc gcccggatac gaacgcatca gtgtggtttt 23520
gccgtcacat aaggcgttta tcgtaaagaa atggggcgac gacacccgaa aaaagctgcg 23580
tggaaggctc tgacgccaag ggttagggct tgcacttcct tctttagccg ctaaaacggc 23640
cccttctctg cgggccgtcg gctcgcgcat catatcgaca tcctcaacgg aagccgtgcc 23700
gcgaatggca tcgggcgggt gcgctttgac agttgttttc tatcagaacc cctacgtcgt 23760
gcggttcgat tagctgtttg tcttgcaggc taaacacttt cggtatatcg tttgcctgtg 23820
cgataatgtt gctaatgatt tgttgcgtag gggttactga aaagtgagcg ggaaagaaga 23880
gtttcagacc atcaaggagc gggccaagcg caagctggaa cgcgacatgg gtgcggacct 23940
gttggccgcg ctcaacgacc cgaaaaccgt tgaagtcatg ctcaacgcgg acggcaaggt 24000
gtggcacgaa cgccttggcg agccgatgcg gtacatctgc gacatgcggc ccagccagtc 24060
gcaggcgatt atagaaacgg tggccggatt ccacggcaaa gaggtcacgc ggcattcgcc 24120
catcctggaa ggcgagttcc ccttggatgg cagccgcttt gccggccaat tgccgccggt 24180
cgtggccgcg ccaacctttg cgatccgcaa gcgcgcggtc gccatcttca cgctggaaca 24240
gtacgtcgag gcgggcatca tgacccgcga gcaatacgag gtcattaaaa gcgccgtcgc 24300
ggcgcatcga aacatcctcg tcattggcgg tactggctcg ggcaagacca cgctcgtcaa 24360
cgcgatcatc aatgaaatgg tcgccttcaa cccgtctgag cgcgtcgtca tcatcgagga 24420
caccggcgaa atccagtgcg ccgcagagaa cgccgtccaa taccacacca gcatcgacgt 24480
ctcgatgacg ctgctgctca agacaacgct gcgtatgcgc cccgaccgca tcctggtcgg 24540
tgaggtacgt ggccccgaag cccttgatct gttgatggcc tggaacaccg ggcatgaagg 24600
aggtgccgcc accctgcacg caaacaaccc caaagcgggc ctgagccggc tcgccatgct 24660
tatcagcatg cacccggatt caccgaaacc cattgagccg ctgattggcg aggcggttca 24720
tgtggtcgtc catatcgcca ggacccctag cggccgtcga gtgcaagaaa ttctcgaagt 24780
tcttggttac gagaacggcc agtacatcac caaaaccctg taaggagtat ttccaatgac 24840
aacggctgtt ccgttccgtc tgaccatgaa tcgcggcatt ttgttctacc ttgccgtgtt 24900
cttcgttctc gctctcgcgt tatccgcgca tccggcgatg gcctcggaag gcaccggcgg 24960
cagcttgcca tatgagagct ggctgacgaa cctgcgcaac tccgtaaccg gcccggtggc 25020
cttcgcgctg tccatcatcg gcatcgtcgt cgccggcggc gtgctgatct tcggcggcga 25080
actcaacgcc ttcttccgaa ccctgatctt cctggttctg gtgatggcgc tgctggtcgg 25140
cgcgcagaac gtgatgagca ccttcttcgg tcgtggtgcc gaaatcgcgg ccctcggcaa 25200
cggggcgctg caccaggtgc aagtcgcggc ggcggatgcc gtgcgtgcgg tagcggctgg 25260
acggctcgcc taatcatggc tctgcgcacg atccccatcc gtcgcgcagg caaccgagaa 25320
aacctgttca tgggtggtga tcgtgaactg gtgatgttct cgggcctgat ggcgtttgcg 25380
ctgattttca gcgcccaaga gctgcgggcc accgtggtcg gtctgatcct gtggttcggg 25440
gcgctctatg cgttccgaat catggcgaag gccgatccga agatgcggtt cgtgtacctg 25500
cgtcaccgcc ggtacaagcc gtattacccg gcccgctcga ccccgttccg cgagaacacc 25560
aatagccaag ggaagcaata ccgatgatcc aagcaattgc gattgcaatc gcgggcctcg 25620
gcgcgcttct gttgttcatc ctctttgccc gcatccgcgc ggtcgatgcc gaactgaaac 25680
tgaaaaagca tcgttccaag gacgccggcc tggccgatct gctcaactac gccgctgtcg 25740
tcgatgacgg cgtaatcgtg ggcaagaacg gcagctttat ggctgcctgg ctgtacaagg 25800
gcgatgacaa cgcaagcagc accgaccagc agcgcgaagt agtgtccgcc cgcatcaacc 25860
aggccctcgc gggcctggga agtgggtgga tgatccatgt ggacgccgtg cggcgtcctg 25920
ctccgaacta cgcggagcgg ggcctgtcgg cgttccctga ccgtctgacg gcagcgattg 25980
aagaagagcg ctcggtcttg ccttgctcgt cggtgatgta cttcaccagc tccgcgaagt 26040
cgctcttctt gatggagcgc atggggacgt gcttggcaat cacgcgcacc ccccggccgt 26100
tttagcggct aaaaaagtca tggctctgcc ctcgggcgga ccacgcccat catgaccttg 26160
ccaagctcgt cctgcttctc ttcgatcttc gccagcaggg cgaggatcgt ggcatcaccg 26220
aaccgcgccg tgcgcgggtc gtcggtgagc cagagtttca gcaggccgcc caggcggccc 26280
aggtcgccat tgatgcgggc cagctcgcgg acgtgctcat agtccacgac gcccgtgatt 26340
ttgtagccct ggccgacggc cagcaggtag gccgacaggc tcatgccggc cgccgccgcc 26400
ttttcctcaa tcgctcttcg ttcgtctgga aggcagtaca ccttgatagg tgggctgccc 26460
ttcctggttg gcttggtttc atcagccatc cgcttgccct catctgttac gccggcggta 26520
gccggccagc ctcgcagagc aggattcccg ttgagcaccg ccaggtgcga ataagggaca 26580
gtgaagaagg aacacccgct cgcgggtggg cctacttcac ctatcctgcc cggctgacgc 26640
cgttggatac accaaggaaa gtctacacga accctttggc aaaatcctgt atatcgtgcg 26700
aaaaaggatg gatataccga aaaaatcgct ataatgaccc cgaagcaggg ttatgcagcg 26760
gaaaagcgct gcttccctgc tgttttgtgg aatatctacc gactggaaac aggcaaatgc 26820
aggaaattac tgaactgagg ggacaggcga gagacgatgc caaagagcta caccgacgag 26880
ctggccgagt gggttgaatc ccgcgcggcc aagaagcgcc ggcgtgatga ggctgcggtt 26940
gcgttcctgg cggtgagggc ggatgtcgag gcggcgttag cgtccggcta tgcgctcgtc 27000
accatttggg agcacatgcg ggaaacgggg aaggtcaagt tctcctacga gacgttccgc 27060
tcgcacgcca ggcggcacat caaggccaag cccgccgatg tgcccgcacc gcaggccaag 27120
gctgcggaac ccgcgccggc acccaagacg ccggagccac ggcggccgaa gcaggggggc 27180
aaggctgaaa agccggcccc cgctgcggcc ccgaccggct tcaccttcaa cccaacaccg 27240
gacaaaaagg atctactgta atggcgaaaa ttcacatggt tttgcagggc aagggcgggg 27300
tcggcaagtc ggccatcgcc gcgatcattg cgcagtacaa gatggacaag gggcagacac 27360
ccttgtgcat cgacaccgac ccggtgaacg cgacgttcga gggctacaag gccctgaacg 27420
tccgccggct gaacatcatg gccggcgacg aaattaactc gcgcaacttc gacaccctgg 27480
tcgagctgat tgcgccgacc aaggatgacg tggtgatcga caacggtgcc agctcgttcg 27540
tgcctctgtc gcattacctc atcagcaacc aggtgccggc tctgctgcaa gaaatggggc 27600
atgagctggt catccatacc gtcgtcaccg gcggccaggc tctcctggac acggtgagcg 27660
gcttcgccca gctcgccagc cagttcccgg ccgaagcgct tttcgtggtc tggctgaacc 27720
cgtattgggg gcctatcgag catgagggca agagctttga gcagatgaag gcgtacacgg 27780
ccaacaaggc ccgcgtgtcg tccatcatcc agattccggc cctcaaggaa gaaacctacg 27840
gccgcgattt cagcgacatg ctgcaagagc ggctgacgtt cgaccaggcg ctggccgatg 27900
aatcgctcac gatcatgacg cggcaacgcc tcaagatcgt gcggcgcggc ctgtttgaac 27960
agctcgacgc ggcggccgtg ctatgagcga ccagattgaa gagctgatcc gggagattgc 28020
ggccaagcac ggcatcgccg tcggccgcga cgacccggtg ctgatcctgc ataccatcaa 28080
cgcccggctc atggccgaca gtgcggccaa gcaagaggaa atccttgccg cgttcaagga 28140
agagctggaa gggatcgccc atcgttgggg cgaggacgcc aaggccaaag cggagcggat 28200
gctgaacgcg gccctggcgg ccagcaagga cgcaatggcg aaggtaatga aggacagcgc 28260
cgcgcaggcg gccgaagcga tccgcaggga aatcgacgac ggccttggcc gccagctcgc 28320
ggccaaggtc gcggacgcgc ggcgcgtggc gatgatgaac atgatcgccg gcggcatggt 28380
gttgttcgcg gccgccctgg tggtgtgggc ctcgttatga atcgcagagg cgcagatgaa 28440
aaagcccggc gttgccgggc tttgtttttg cgttagctgg gcttgtttga caggcccaag 28500
ctctgactgc gcccgcgctc gcgctcctgg gcctgtttct tctcctgctc ctgcttgcgc 28560
atcagggcct ggtgccgtcg ggctgcttca cgcatcgaat cccagtcgcc ggccagctcg 28620
ggatgctccg cgcgcatctt gcgcgtcgcc agttcctcga tcttgggcgc gtgaatgccc 28680
atgccttcct tgatttcgcg caccatgtcc agccgcgtgt gcagggtctg caagcgggct 28740
tgctgttggg cctgctgctg ctgccaggcg gcctttgtac gcggcaggga cagcaagccg 28800
ggggcattgg actgtagctg ctgcaaacgc gcctgctgac ggtctacgag ctgttctagg 28860
cggtcctcga tgcgctccac ctggtcatgc tttgcctgca cgtagagcgc aagggtctgc 28920
tggtaggtct gctcgatggg cgcggattct aagagggcct gctgttccgt ctcggcctcc 28980
tgggccgcct gtagcaaatc ctcgccgctg ttgccgctgg actgctttac tgccggggac 29040
tgctgttgcc ctgctcgcgc cgtcgtcgca gttcggcttg cccccactcg attgactgct 29100
tcatttcgag ccgcagcgat gcgatctcgg attgcgtcaa cggacggggc agcgcggagg 29160
tgtccggctt ctccttgggt gagtcggtcg atgccatagc caaaggtttc cttccaaaat 29220
gcgtccattg ctggaccgtg tttctcattg atgcccgcaa gcatcttcgg cttgaccgcc 29280
aggtcaagcg cgccttcatg ggcggtcatg acggacgccg ccatgacctt gccgccgttg 29340
ttctcgatgt agccgcgtaa tgaggcaatg gtgccgccca tcgtcagcgt gtcatcgaca 29400
acgatgtact tctggccggg gatcacctcc ccctcgaaag tcgggttgaa cgccaggcga 29460
tgatctgaac cggctccggt tcgggcgacc ttctcccgct gcacaatgtc cgtttcgacc 29520
tcaaggccaa ggcggtcggc cagaacgacc gccatcatgg ccggaatctt gttgttcccc 29580
gccgcctcga cggcgaggac tggaacgatg cggggcttgt cgtcgccgat cagcgtcttg 29640
agctgggcaa cagtgtcgtc cgaaatcagg cgctcgacca aattaagcgc cgcttccgcg 29700
tcgccctgct tcgcagcctg gtattcaggc tcgttggtca aagaaccaag gtcgccgttg 29760
cgaaccacct tcgggaagtc tccccacggt gcgcgctcgg ctctgctgta gctgctcaag 29820
acgcctccct ttttagccgc taaaactcta acgagtgcgc ccgcgactca acttgacgct 29880
ttcggcactt acctgtgcct tgccacttgc gtcataggtg atgcttttcg cactcccgat 29940
ttcaggtact ttatcgaaat ctgaccgggc gtgcattaca aagttcttcc ccacctgttg 30000
gtaaatgctg ccgctatctg cgtggacgat gctgccgtcg tggcgctgcg acttatcggc 30060
cttttgggcc atatagatgt tgtaaatgcc aggtttcagg gccccggctt tatctacctt 30120
ctggttcgtc catgcgcctt ggttctcggt ctggacaatt ctttgcccat tcatgaccag 30180
gaggcggtgt ttcattgggt gactcctgac ggttgcctct ggtgttaaac gtgtcctggt 30240
cgcttgccgg ctaaaaaaaa gccgacctcg gcagttcgag gccggctttc cctagagccg 30300
ggcgcgtcaa ggttgttcca tctattttag tgaactgcgt tcgatttatc agttactttc 30360
ctcccgcttt gtgtttcctc ccactcgttt ccgcgtctag ccgacccctc aacatagcgg 30420
cctcttcttg ggctgccttt gcctcttgcc gcgcttcgtc acgctcggct tgcaccgtcg 30480
taaagcgctc ggcctgcctg gccgcctctt gcgccgccaa cttcctttgc tcctggtggg 30540
cctcggcgtc ggcctgcgcc ttcgctttca ccgctgccaa ctccgtgcgc aaactctccg 30600
cttcgcgcct ggtggcgtcg cgctcgccgc gaagcgcctg catttcctgg ttggccgcgt 30660
ccagggtctt gcggctctct tctttgaatg cgcgggcgtc ctggtgagcg tagtccagct 30720
cggcgcgcag ctcctgcgct cgacgctcca cctcgtcggc ccgctgcgtc gccagcgcgg 30780
cccgctgctc ggctcctgcc agggcggtgc gtgcttcggc cagggcttgc cgctggcgtg 30840
cggccagctc ggccgcctcg gcggcctgct gctctagcaa tgtaacgcgc gcctgggctt 30900
cttccagctc gcgggcctgc gcctcgaagg cgtcggccag ctccccgcgc acggcttcca 30960
actcgttgcg ctcacgatcc cagccggctt gcgctgcctg caacgattca ttggcaaggg 31020
cctgggcggc ttgccagagg gcggccacgg cctggttgcc ggcctgctgc accgcgtccg 31080
gcacctggac tgccagcggg gcggcctgcg ccgtgcgctg gcgtcgccat tcgcgcatgc 31140
cggcgctggc gtcgttcatg ttgacgcggg cggccttacg cactgcatcc acggtcggga 31200
agttctcccg gtcgccttgc tcgaacagct cgtccgcagc cgcaaaaatg cggtcgcgcg 31260
tctctttgtt cagttccatg ttggctccgg taattggtaa gaataataat actcttacct 31320
accttatcag cgcaagagtt tagctgaaca gttctcgact taacggcagg ttttttagcg 31380
gctgaagggc aggcaaaaaa agccccgcac ggtcggcggg ggcaaagggt cagcgggaag 31440
gggattagcg ggcgtcgggc ttcttcatgc gtcggggccg cgcttcttgg gatggagcac 31500
gacgaagcgc gcacgcgcat cgtcctcggc cctatcggcc cgcgtcgcgg tcaggaactt 31560
gtcgcgcgct aggtcctccc tggtgggcac caggggcatg aactcggcct gctcgatgta 31620
ggtccactcc atgaccgcat cgcagtcgag gccgcgttcc ttcaccgtct cttgcaggtc 31680
gcggtacgcc cgctcgttga gcggctggta acgggccaat tggtcgtaaa tggctgtcgg 31740
ccatgagcgg cctttcctgt tgagccagca gccgacgacg aagccggcaa tgcaggcccc 31800
tggcacaacc aggccgacgc cgggggcagg ggatggcagc agctcgccaa ccaggaaccc 31860
cgccgcgatg atgccgatgc cggtcaacca gcccttgaaa ctatccggcc ccgaaacacc 31920
cctgcgcatt gcctggatgc tgcgccggat agcttgcaac atcaggagcc gtttcttttg 31980
ttcgtcagtc atggtccgcc ctcaccagtt gttcgtatcg gtgtcggacg aactgaaatc 32040
gcaagagctg ccggtatcgg tccagccgct gtccgtgtcg ctgctgccga agcacggcga 32100
ggggtccgcg aacgccgcag acggcgtatc cggccgcagc gcatcgccca gcatggcccc 32160
ggtcagcgag ccgccggcca ggtagcccag catggtgctg ttggtcgccc cggccaccag 32220
ggccgacgtg acgaaatcgc cgtcattccc tctggattgt tcgctgctcg gcggggcagt 32280
gcgccgcgcc ggcggcgtcg tggatggctc gggttggctg gcctgcgacg gccggcgaaa 32340
ggtgcgcagc agctcgttat cgaccggctg cggcgtcggg gccgccgcct tgcgctgcgg 32400
tcggtgttcc ttcttcggct cgcgcagctt gaacagcatg atcgcggaaa ccagcagcaa 32460
cgccgcgcct acgcctcccg cgatgtagaa cagcatcgga ttcattcttc ggtcctcctt 32520
gtagcggaac cgttgtctgt gcggcgcggg tggcccgcgc cgctgtcttt ggggatcagc 32580
cctcgatgag cgcgaccagt ttcacgtcgg caaggttcgc ctcgaactcc tggccgtcgt 32640
cctcgtactt caaccaggca tagccttccg ccggcggccg acggttgagg ataaggcggg 32700
cagggcgctc gtcgtgctcg acctggacga tggccttttt cagcttgtcc gggtccggct 32760
ccttcgcgcc cttttccttg gcgtccttac cgtcctggtc gccgtcctcg ccgtcctggc 32820
cgtcgccggc ctccgcgtca cgctcggcat cagtctggcc gttgaaggca tcgacggtgt 32880
tgggatcgcg gcccttctcg tccaggaact cgcgcagcag cttgaccgtg ccgcgcgtga 32940
tttcctgggt gtcgtcgtca agccacgcct cgacttcctc cgggcgcttc ttgaaggccg 33000
tcaccagctc gttcaccacg gtcacgtcgc gcacgcggcc ggtgttgaac gcatcggcga 33060
tcttctccgg caggtccagc agcgtgacgt gctgggtgat gaacgccggc gacttgccga 33120
tttccttggc gatatcgcct ttcttcttgc ccttcgccag ctcgcggcca atgaagtcgg 33180
caatttcgcg cggggtcagc tcgttgcgtt gcaggttctc gataacctgg tcggcttcgt 33240
tgtagtcgtt gtcgatgaac gccgggatgg acttcttgcc ggcccacttc gagccacggt 33300
agcggcgggc gccgtgattg atgatatagc ggcccggctg ctcctggttc tcgcgcaccg 33360
aaatgggtga cttcaccccg cgctctttga tcgtggcacc gatttccgcg atgctctccg 33420
gggaaaagcc ggggttgtcg gccgtccgcg gctgatgcgg atcttcgtcg atcaggtcca 33480
ggtccagctc gatagggccg gaaccgccct gagacgccgc aggagcgtcc aggaggctcg 33540
acaggtcgcc gatgctatcc aaccccaggc cggacggctg cgccgcgcct gcggcttcct 33600
gagcggccgc agcggtgttt ttcttggtgg tcttggcttg agccgcagtc attgggaaat 33660
ctccatcttc gtgaacacgt aatcagccag ggcgcgaacc tctttcgatg ccttgcgcgc 33720
ggccgttttc ttgatcttcc agaccggcac accggatgcg agggcatcgg cgatgctgct 33780
gcgcaggcca acggtggccg gaatcatcat cttggggtac gcggccagca gctcggcttg 33840
gtggcgcgcg tggcgcggat tccgcgcatc gaccttgctg ggcaccatgc caaggaattg 33900
cagcttggcg ttcttctggc gcacgttcgc aatggtcgtg accatcttct tgatgccctg 33960
gatgctgtac gcctcaagct cgatggggga cagcacatag tcggccgcga agagggcggc 34020
cgccaggccg acgccaaggg tcggggccgt gtcgatcagg cacacgtcga agccttggtt 34080
cgccagggcc ttgatgttcg ccccgaacag ctcgcgggcg tcgtccagcg acagccgttc 34140
ggcgttcgcc agtaccgggt tggactcgat gagggcgagg cgcgcggcct ggccgtcgcc 34200
ggctgcgggt gcggtttcgg tccagccgcc ggcagggaca gcgccgaaca gcttgcttgc 34260
atgcaggccg gtagcaaagt ccttgagcgt gtaggacgca ttgccctggg ggtccaggtc 34320
gatcacggca acccgcaagc cgcgctcgaa aaagtcgaag gcaagatgca caagggtcga 34380
agtcttgccg acgccgcctt tctggttggc cgtgaccaaa gttttcatcg tttggtttcc 34440
tgttttttct tggcgtccgc ttcccacttc cggacgatgt acgcctgatg ttccggcaga 34500
accgccgtta cccgcgcgta cccctcgggc aagttcttgt cctcgaacgc ggcccacacg 34560
cgatgcaccg cttgcgacac tgcgcccctg gtcagtccca gcgacgttgc gaacgtcgcc 34620
tgtggcttcc catcgactaa gacgccccgc gctatctcga tggtctgctg ccccacttcc 34680
agcccctgga tcgcctcctg gaactggctt tcggtaagcc gtttcttcat ggataacacc 34740
cataatttgc tccgcgcctt ggttgaacat agcggtgaca gccgccagca catgagagaa 34800
gtttagctaa acatttctcg cacgtcaaca cctttagccg ctaaaactcg tccttggcgt 34860
aacaaaacaa aagcccggaa accgggcttt cgtctcttgc cgcttatggc tctgcacccg 34920
gctccatcac caacaggtcg cgcacgcgct tcactcggtt gcggatcgac actgccagcc 34980
caacaaagcc ggttgccgcc gccgccagga tcgcgccgat gatgccggcc acaccggcca 35040
tcgcccacca ggtcgccgcc ttccggttcc attcctgctg gtactgcttc gcaatgctgg 35100
acctcggctc accataggct gaccgctcga tggcgtatgc cgcttctccc cttggcgtaa 35160
aacccagcgc cgcaggcggc attgccatgc tgcccgccgc tttcccgacc acgacgcgcg 35220
caccaggctt gcggtccaga ccttcggcca cggcgagctg cgcaaggaca taatcagccg 35280
ccgacttggc tccacgcgcc tcgatcagct cttgcactcg cgcgaaatcc ttggcctcca 35340
cggccgccat gaatcgcgca cgcggcgaag gctccgcagg gccggcgtcg tgatcgccgc 35400
cgagaatgcc cttcaccaag ttcgacgaca cgaaaatcat gctgacggct atcaccatca 35460
tgcagacgga tcgcacgaac ccgctgaatt gaacacgagc acggcacccg cgaccactat 35520
gccaagaatg cccaaggtaa aaattgccgg ccccgccatg aagtccgtga atgccccgac 35580
ggccgaagtg aagggcaggc cgccacccag gccgccgccc tcactgcccg gcacctggtc 35640
gctgaatgtc gatgccagca cctgcggcac gtcaatgctt ccgggcgtcg cgctcgggct 35700
gatcgcccat cccgttactg ccccgatccc ggcaatggca aggactgcca gcgctgccat 35760
ttttggggtg aggccgttcg cggccgaggg gcgcagcccc tggggggatg ggaggcccgc 35820
gttagcgggc cgggagggtt cgagaagggg gggcaccccc cttcggcgtg cgcggtcacg 35880
cgcacagggc gcagccctgg ttaaaaacaa ggtttataaa tattggttta aaagcaggtt 35940
aaaagacagg ttagcggtgg ccgaaaaacg ggcggaaacc cttgcaaatg ctggattttc 36000
tgcctgtgga cagcccctca aatgtcaata ggtgcgcccc tcatctgtca gcactctgcc 36060
cctcaagtgt caaggatcgc gcccctcatc tgtcagtagt cgcgcccctc aagtgtcaat 36120
accgcagggc acttatcccc aggcttgtcc acatcatctg tgggaaactc gcgtaaaatc 36180
aggcgttttc gccgatttgc gaggctggcc agctccacgt cgccggccga aatcgagcct 36240
gcccctcatc tgtcaacgcc gcgccgggtg agtcggcccc tcaagtgtca acgtccgccc 36300
ctcatctgtc agtgagggcc aagttttccg cgaggtatcc acaacgccgg cggccgcggt 36360
gtctcgcaca cggcttcgac ggcgtttctg gcgcgtttgc agggccatag acggccgcca 36420
gcccagcggc gagggcaacc agcccggtga gcgtcggaaa ggcgctggaa gccccgtagc 36480
gacgcggaga ggggcgagac aagccaaggg cgcaggctcg atgcgcagca cgacatagcc 36540
ggttctcgca aggacgagaa tttccctgcg gtgcccctca agtgtcaatg aaagtttcca 36600
acgcgagcca ttcgcgagag ccttgagtcc acgctagatg agagctttgt tgtaggtgga 36660
ccagttggtg attttgaact tttgctttgc cacggaacgg tctgcgttgt cgggaagatg 36720
cgtgatctga tccttcaact cagcaaaagt tcgatttatt caacaaagcc acgttgtgtc 36780
tcaaaatctc tgatgttaca ttgcacaaga taaaaatata tcatcatgaa caataaaact 36840
gtctgcttac ataaacagta atacaagggg tgttatgagc catattcaac gggaaacgtc 36900
ttgctcgac 36909
<210>8
<211>13019
<212>DNA
<213>人工序列
<220>
<223>载体
<400>8
gttacccgga ccgaagctta gcccgggcat gcctgcagtg cagcgtgacc cggtcgtgcc 60
cctctctaga gataatgagc attgcatgtc taagttataa aaaattacca catatttttt 120
ttgtcacact tgtttgaagt gcagtttatc tatctttata catatattta aactttactc 180
tacgaataat ataatctata gtactacaat aatatcagtg ttttagagaa tcatataaat 240
gaacagttag acatggtcta aaggacaatt gagtattttg acaacaggac tctacagttt 300
tatcttttta gtgtgcatgt gttctccttt ttttttgcaa atagcttcac ctatataata 360
cttcatccat tttattagta catccattta gggtttaggg ttaatggttt ttatagacta 420
atttttttag tacatctatt ttattctatt ttagcctcta aattaagaaa actaaaactc 480
tattttagtt tttttattta ataatttaga tataaaatag aataaaataa agtgactaaa 540
aattaaacaa atacccttta agaaattaaa aaaactaagg aaacattttt cttgtttcga 600
gtagataatg ccagcctgtt aaacgccgtc gacgagtcta acggacacca accagcgaac 660
cagcagcgtc gcgtcgggcc aagcgaagca gacggcacgg catctctgtc gctgcctctg 720
gacccctctc gagagttccg ctccaccgtt ggacttgctc cgctgtcggc atccagaaat 780
tgcgtggcgg agcggcagac gtgagccggc acggcaggcg gcctcctcct cctctcacgg 840
cacggcagct acgggggatt cctttcccac cgctccttcg ctttcccttc ctcgcccgcc 900
gtaataaata gacaccccct ccacaccctc tttccccaac ctcgtgttgt tcggagcgca 960
cacacacaca accagatctc ccccaaatcc acccgtcggc acctccgctt caaggtacgc 1020
cgctcgtcct cccccccccc ccctctctac cttctctaga tcggcgttcc ggtccatggt 1080
tagggcccgg tagttctact tctgttcatg tttgtgttag atccgtgttt gtgttagatc 1140
cgtgctgcta gcgttcgtac acggatgcga cctgtacgtc agacacgttc tgattgctaa 1200
cttgccagtg tttctctttg gggaatcctg ggatggctct agccgttccg cagacgggat 1260
cgatttcatg attttttttg tttcgttgca tagggtttgg tttgcccttt tcctttattt 1320
caatatatgc cgtgcacttg tttgtcgggt catcttttca tgcttttttt tgtcttggtt 1380
gtgatgatgt ggtctggttg ggcggtcgtt ctagatcgga gtagaattct gtttcaaact 1440
acctggtgga tttattaatt ttggatctgt atgtgtgtgc catacatatt catagttacg 1500
aattgaagat gatggatgga aatatcgatc taggataggt atacatgttg atgcgggttt 1560
tactgatgca tatacagaga tgctttttgt tcgcttggtt gtgatgatgt ggtgtggttg 1620
ggcggtcgtt cattcgttct agatcggagt agaatactgt ttcaaactac ctggtgtatt 1680
tattaatttt ggaactgtat gtgtgtgtca tacatcttca tagttacgag tttaagatgg 1740
atggaaatat cgatctagga taggtataca tgttgatgtg ggttttactg atgcatatac 1800
atgatggcat atgcagcatc tattcatatg ctctaacctt gagtacctat ctattataat 1860
aaacaagtat gttttataat tattttgatc ttgatatact tggatgatgg catatgcagc 1920
agctatatgt ggattttttt agccctgcct tcatacgcta tttatttgct tggtactgtt 1980
tcttttgtcg atgctcaccc tgttgtttgg tgttacttct gcaggtcgac tctagaggat 2040
ccacaagttt gtacaaaaaa gctgaacgag aaacgtaaaa tgatataaat atcaatatat 2100
taaattagat tttgcataaa aaacagacta cataatactg taaaacacaa catatccagt 2160
cactatggcg gccgcattag gcaccccagg ctttacactt tatgcttccg gctcgtataa 2220
tgtgtggatt ttgagttagg atttaaatac gcgttgatcc ggcttactaa aagccagata 2280
acagtatgcg tatttgcgcg ctgatttttg cggtataaga atatatactg atatgtatac 2340
ccgaagtatg tcaaaaagag gtatgctatg aagcagcgta ttacagtgac agttgacagc 2400
gacagctatc agttgctcaa ggcatatatg atgtcaatat ctccggtctg gtaagcacaa 2460
ccatgcagaa tgaagcccgt cgtctgcgtg ccgaacgctg gaaagcggaa aatcaggaag 2520
ggatggctga ggtcgcccgg tttattgaaa tgaacggctc ttttgctgac gagaacaggg 2580
gctggtgaaa tgcagtttaa ggtttacacc tataaaagag agagccgtta tcgtctgttt 2640
gtggatgtac agagtgatat cattgacacg cccggtcgac ggatggtgat ccccctggcc 2700
agtgcacgtc tgctgtcaga taaagtctcc cgtgaacttt acccggtggt gcatatcggg 2760
gatgaaagct ggcgcatgat gaccaccgat atggccagtg tgccggtctc cgttatcggg 2820
gaagaagtgg ctgatctcag ccaccgcgaa aatgacatca aaaacgccat taacctgatg 2880
ttctggggaa tataaatgtc aggctccctt atacacagcc agtctgcagg tcgaccatag 2940
tgactggata tgttgtgttt tacagtatta tgtagtctgt tttttatgca aaatctaatt 3000
taatatattg atatttatat cattttacgt ttctcgttca gctttcttgt acaaagtggt 3060
gttaacctag acttgtccat cttctggatt ggccaactta attaatgtat gaaataaaag 3120
gatgcacaca tagtgacatg ctaatcacta taatgtgggc atcaaagttg tgtgttatgt 3180
gtaattacta gttatctgaa taaaagagaa agagatcatc catatttctt atcctaaatg 3240
aatgtcacgt gtctttataa ttctttgatg aaccagatgc atttcattaa ccaaatccat 3300
atacatataa atattaatca tatataatta atatcaattg ggttagcaaa acaaatctag 3360
tctaggtgtg ttttgcgaat tgcggccgcc accgcggtgg agctcgaatt ccggtccggg 3420
tcacctttgt ccaccaagat ggaactgcgg ccgctcatta attaagtcag gcgcgcctct 3480
agttgaagac acgttcatgt cttcatcgta agaagacact cagtagtctt cggccagaat 3540
ggccatctgg attcagcagg cctagaaggc catttaaatc ctgaggatct ggtcttccta 3600
aggacccggg atatcggacc gattaaactt taattcggtc cgaagcttgc atgcctgcag 3660
tgcagcgtga cccggtcgtg cccctctcta gagataatga gcattgcatg tctaagttat 3720
aaaaaattac cacatatttt ttttgtcaca cttgtttgaa gtgcagttta tctatcttta 3780
tacatatatt taaactttac tctacgaata atataatcta tagtactaca ataatatcag 3840
tgttttagag aatcatataa atgaacagtt agacatggtc taaaggacaa ttgagtattt 3900
tgacaacagg actctacagt tttatctttt tagtgtgcat gtgttctcct ttttttttgc 3960
aaatagcttc acctatataa tacttcatcc attttattag tacatccatt tagggtttag 4020
ggttaatggt ttttatagac taattttttt agtacatcta ttttattcta ttttagcctc 4080
taaattaaga aaactaaaac tctattttag tttttttatt taataattta gatataaaat 4140
agaataaaat aaagtgacta aaaattaaac aaataccctt taagaaatta aaaaaactaa 4200
ggaaacattt ttcttgtttc gagtagataa tgccagcctg ttaaacgccg tcgacgagtc 4260
taacggacac caaccagcga accagcagcg tcgcgtcggg ccaagcgaag cagacggcac 4320
ggcatctctg tcgctgcctc tggacccctc tcgagagttc cgctccaccg ttggacttgc 4380
tccgctgtcg gcatccagaa attgcgtggc ggagcggcag acgtgagccg gcacggcagg 4440
cggcctcctc ctcctctcac ggcaccggca gctacggggg attcctttcc caccgctcct 4500
tcgctttccc ttcctcgccc gccgtaataa atagacaccc cctccacacc ctctttcccc 4560
aacctcgtgt tgttcggagc gcacacacac acaaccagat ctcccccaaa tccacccgtc 4620
ggcacctccg cttcaaggta cgccgctcgt cctccccccc ccccctctct accttctcta 4680
gatcggcgtt ccggtccatg catggttagg gcccggtagt tctacttctg ttcatgtttg 4740
tgttagatcc gtgtttgtgt tagatccgtg ctgctagcgt tcgtacacgg atgcgacctg 4800
tacgtcagac acgttctgat tgctaacttg ccagtgtttc tctttgggga atcctgggat 4860
ggctctagcc gttccgcaga cgggatcgat ttcatgattt tttttgtttc gttgcatagg 4920
gtttggtttg cccttttcct ttatttcaat atatgccgtg cacttgtttg tcgggtcatc 4980
ttttcatgct tttttttgtc ttggttgtga tgatgtggtc tggttgggcg gtcgttctag 5040
atcggagtag aattctgttt caaactacct ggtggattta ttaattttgg atctgtatgt 5100
gtgtgccata catattcata gttacgaatt gaagatgatg gatggaaata tcgatctagg 5160
ataggtatac atgttgatgc gggttttact gatgcatata cagagatgct ttttgttcgc 5220
ttggttgtga tgatgtggtg tggttgggcg gtcgttcatt cgttctagat cggagtagaa 5280
tactgtttca aactacctgg tgtatttatt aattttggaa ctgtatgtgt gtgtcataca 5340
tcttcatagt tacgagttta agatggatgg aaatatcgat ctaggatagg tatacatgtt 5400
gatgtgggtt ttactgatgc atatacatga tggcatatgc agcatctatt catatgctct 5460
aaccttgagt acctatctat tataataaac aagtatgttt tataattatt ttgatcttga 5520
tatacttgga tgatggcata tgcagcagct atatgtggat ttttttagcc ctgccttcat 5580
acgctattta tttgcttggt actgtttctt ttgtcgatgc tcaccctgtt gtttggtgtt 5640
acttctgcag gtcgacttta acttagccta ggatccacac gacaccatgt cccccgagcg 5700
ccgccccgtc gagatccgcc cggccaccgc cgccgacatg gccgccgtgt gcgacatcgt 5760
gaaccactac atcgagacct ccaccgtgaa cttccgcacc gagccgcaga ccccgcagga 5820
gtggatcgac gacctggagc gcctccagga ccgctacccg tggctcgtgg ccgaggtgga 5880
gggcgtggtg gccggcatcg cctacgccgg cccgtggaag gcccgcaacg cctacgactg 5940
gaccgtggag tccaccgtgt acgtgtccca ccgccaccag cgcctcggcc tcggctccac 6000
cctctacacc cacctcctca agagcatgga ggcccagggc ttcaagtccg tggtggccgt 6060
gatcggcctc ccgaacgacc cgtccgtgcg cctccacgag gccctcggct acaccgcccg 6120
cggcaccctc cgcgccgccg gctacaagca cggcggctgg cacgacgtcg gcttctggca 6180
gcgcgacttc gagctgccgg ccccgccgcg cccggtgcgc ccggtgacgc agatctgagt 6240
cgaaacctag acttgtccat cttctggatt ggccaactta attaatgtat gaaataaaag 6300
gatgcacaca tagtgacatg ctaatcacta taatgtgggc atcaaagttg tgtgttatgt 6360
gtaattacta gttatctgaa taaaagagaa agagatcatc catatttctt atcctaaatg 6420
aatgtcacgt gtctttataa ttctttgatg aaccagatgc atttcattaa ccaaatccat 6480
atacatataa atattaatca tatataatta atatcaattg ggttagcaaa acaaatctag 6540
tctaggtgtg ttttgcgaat tgcggccgcc accgcggtgg agctcgaatt cattccgatt 6600
aatcgtggcc tcttgctctt caggatgaag agctatgttt aaacgtgcaa gcgctactag 6660
acaattcagt acattaaaaa cgtccgcaat gtgttattaa gttgtctaag cgtcaatttg 6720
tttacaccac aatatatcct gccaccagcc agccaacagc tccccgaccg gcagctcggc 6780
acaaaatcac cactcgatac aggcagccca tcagtccggg acggcgtcag cgggagagcc 6840
gttgtaaggc ggcagacttt gctcatgtta ccgatgctat tcggaagaac ggcaactaag 6900
ctgccgggtt tgaaacacgg atgatctcgc ggagggtagc atgttgattg taacgatgac 6960
agagcgttgc tgcctgtgat caaatatcat ctccctcgca gagatccgaa ttatcagcct 7020
tcttattcat ttctcgctta accgtgacag gctgtcgatc ttgagaacta tgccgacata 7080
ataggaaatc gctggataaa gccgctgagg aagctgagtg gcgctatttc tttagaagtg 7140
aacgttgacg atcgtcgacc gtaccccgat gaattaattc ggacgtacgt tctgaacaca 7200
gctggatact tacttgggcg attgtcatac atgacatcaa caatgtaccc gtttgtgtaa 7260
ccgtctcttg gaggttcgta tgacactagt ggttcccctc agcttgcgac tagatgttga 7320
ggcctaacat tttattagag agcaggctag ttgcttagat acatgatctt caggccgtta 7380
tctgtcaggg caagcgaaaa ttggccattt atgacgacca atgccccgca gaagctccca 7440
tctttgccgc catagacgcc gcgcccccct tttggggtgt agaacatcct tttgccagat 7500
gtggaaaaga agttcgttgt cccattgttg gcaatgacgt agtagccggc gaaagtgcga 7560
gacccatttg cgctatatat aagcctacga tttccgttgc gactattgtc gtaattggat 7620
gaactattat cgtagttgct ctcagagttg tcgtaatttg atggactatt gtcgtaattg 7680
cttatggagt tgtcgtagtt gcttggagaa atgtcgtagt tggatgggga gtagtcatag 7740
ggaagacgag cttcatccac taaaacaatt ggcaggtcag caagtgcctg ccccgatgcc 7800
atcgcaagta cgaggcttag aaccaccttc aacagatcgc gcatagtctt ccccagctct 7860
ctaacgcttg agttaagccg cgccgcgaag cggcgtcggc ttgaacgaat tgttagacat 7920
tatttgccga ctaccttggt gatctcgcct ttcacgtagt gaacaaattc ttccaactga 7980
tctgcgcgcg aggccaagcg atcttcttgt ccaagataag cctgcctagc ttcaagtatg 8040
acgggctgat actgggccgg caggcgctcc attgcccagt cggcagcgac atccttcggc 8100
gcgattttgc cggttactgc gctgtaccaa atgcgggaca acgtaagcac tacatttcgc 8160
tcatcgccag cccagtcggg cggcgagttc catagcgtta aggtttcatt tagcgcctca 8220
aatagatcct gttcaggaac cggatcaaag agttcctccg ccgctggacc taccaaggca 8280
acgctatgtt ctcttgcttt tgtcagcaag atagccagat caatgtcgat cgtggctggc 8340
tcgaagatac ctgcaagaat gtcattgcgc tgccattctc caaattgcag ttcgcgctta 8400
gctggataac gccacggaat gatgtcgtcg tgcacaacaa tggtgacttc tacagcgcgg 8460
agaatctcgc tctctccagg ggaagccgaa gtttccaaaa ggtcgttgat caaagctcgc 8520
cgcgttgttt catcaagcct tacagtcacc gtaaccagca aatcaatatc actgtgtggc 8580
ttcaggccgc catccactgc ggagccgtac aaatgtacgg ccagcaacgt cggttcgaga 8640
tggcgctcga tgacgccaac tacctctgat agttgagtcg atacttcggc gatcaccgct 8700
tccctcatga tgtttaactc ctgaattaag ccgcgccgcg aagcggtgtc ggcttgaatg 8760
aattgttagg cgtcatcctg tgctcccgag aaccagtacc agtacatcgc tgtttcgttc 8820
gagacttgag gtctagtttt atacgtgaac aggtcaatgc cgccgagagt aaagccacat 8880
tttgcgtaca aattgcaggc aggtacattg ttcgtttgtg tctctaatcg tatgccaagg 8940
agctgtctgc ttagtgccca ctttttcgca aattcgatga gactgtgcgc gactcctttg 9000
cctcggtgcg tgtgcgacac aacaatgtgt tcgatagagg ctagatcgtt ccatgttgag 9060
ttgagttcaa tcttcccgac aagctcttgg tcgatgaatg cgccatagca agcagagtct 9120
tcatcagagt catcatccga gatgtaatcc ttccggtagg ggctcacact tctggtagat 9180
agttcaaagc cttggtcgga taggtgcaca tcgaacactt cacgaacaat gaaatggttc 9240
tcagcatcca atgtttccgc cacctgctca gggatcaccg aaatcttcat atgacgccta 9300
acgcctggca cagcggatcg caaacctggc gcggcttttg gcacaaaagg cgtgacaggt 9360
ttgcgaatcc gttgctgcca cttgttaacc cttttgccag atttggtaac tataatttat 9420
gttagaggcg aagtcttggg taaaaactgg cctaaaattg ctggggattt caggaaagta 9480
aacatcacct tccggctcga tgtctattgt agatatatgt agtgtatcta cttgatcggg 9540
ggatctgctg cctcgcgcgt ttcggtgatg acggtgaaaa cctctgacac atgcagctcc 9600
cggagacggt cacagcttgt ctgtaagcgg atgccgggag cagacaagcc cgtcagggcg 9660
cgtcagcggg tgttggcggg tgtcggggcg cagccatgac ccagtcacgt agcgatagcg 9720
gagtgtatac tggcttaact atgcggcatc agagcagatt gtactgagag tgcaccatat 9780
gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac cgcatcaggc gctcttccgc 9840
ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca 9900
ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg 9960
agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca 10020
taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa 10080
cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc 10140
tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc 10200
gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct 10260
gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg 10320
tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag 10380
gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta 10440
cggctacact agaaggacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg 10500
aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt 10560
tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt 10620
ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgag 10680
attatcaaaa aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat 10740
ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca gtgaggcacc 10800
tatctcagcg atctgtctat ttcgttcatc catagttgcc tgactccccg tcgtgtagat 10860
aactacgata cgggagggct taccatctgg ccccagtgct gcaatgatac cgcgagaccc 10920
acgctcaccg gctccagatt tatcagcaat aaaccagcca gccggaaggg ccgagcgcag 10980
aagtggtcct gcaactttat ccgcctccat ccagtctatt aattgttgcc gggaagctag 11040
agtaagtagt tcgccagtta atagtttgcg caacgttgtt gccattgctg cagggggggg 11100
gggggggggg gacttccatt gttcattcca cggacaaaaa cagagaaagg aaacgacaga 11160
ggccaaaaag cctcgctttc agcacctgtc gtttcctttc ttttcagagg gtattttaaa 11220
taaaaacatt aagttatgac gaagaagaac ggaaacgcct taaaccggaa aattttcata 11280
aatagcgaaa acccgcgagg tcgccgcccc gtaacctgtc ggatcaccgg aaaggacccg 11340
taaagtgata atgattatca tctacatatc acaacgtgcg tggaggccat caaaccacgt 11400
caaataatca attatgacgc aggtatcgta ttaattgatc tgcatcaact taacgtaaaa 11460
acaacttcag acaatacaaa tcagcgacac tgaatacggg gcaacctcat gtcccccccc 11520
cccccccccc tgcaggcatc gtggtgtcac gctcgtcgtt tggtatggct tcattcagct 11580
ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa aaagcggtta 11640
gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg 11700
ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc ttttctgtga 11760
ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg agttgctctt 11820
gcccggcgtc aacacgggat aataccgcgc cacatagcag aactttaaaa gtgctcatca 11880
ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg agatccagtt 11940
cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc accagcgttt 12000
ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga 12060
aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat cagggttatt 12120
gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata ggggttccgc 12180
gcacatttcc ccgaaaagtg ccacctgacg tctaagaaac cattattatc atgacattaa 12240
cctataaaaa taggcgtatc acgaggccct ttcgtcttca agaattggtc gacgatcttg 12300
ctgcgttcgg atattttcgt ggagttcccg ccacagaccc ggattgaagg cgagatccag 12360
caactcgcgc cagatcatcc tgtgacggaa ctttggcgcg tgatgactgg ccaggacgtc 12420
ggccgaaaga gcgacaagca gatcacgctt ttcgacagcg tcggatttgc gatcgaggat 12480
ttttcggcgc tgcgctacgt ccgcgaccgc gttgagggat caagccacag cagcccactc 12540
gaccttctag ccgacccaga cgagccaagg gatctttttg gaatgctgct ccgtcgtcag 12600
gctttccgac gtttgggtgg ttgaacagaa gtcattatcg tacggaatgc caagcactcc 12660
cgaggggaac cctgtggttg gcatgcacat acaaatggac gaacggataa accttttcac 12720
gcccttttaa atatccgtta ttctaataaa cgctcttttc tcttaggttt acccgccaat 12780
atatcctgtc aaacactgat agtttaaact gaaggcggga aacgacaatc tgatcatgag 12840
cggagaatta agggagtcac gttatgaccc ccgccgatga cgcgggacaa gccgttttac 12900
gtttggaact gacagaaccg caacgttgaa ggagccactc agcaagctgg tacgattgta 12960
atacgactca ctatagggcg aattgagcgc tgtttaaacg ctcttcaact ggaagagcg 13019
<210>9
<211>2991
<212>DNA
<213>人工序列
<220>
<223>载体
<400>9
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 60
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 120
gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca 180
cgacaggttt cccgactgga aagcgggcag tgagcgcaac gcaattaata cgcgtaccgc 240
tagccaggaa gagtttgtag aaacgcaaaa aggccatccg tcaggatggc cttctgctta 300
gtttgatgcc tggcagttta tggcgggcgt cctgcccgcc accctccggg ccgttgcttc 360
acaacgttca aatccgctcc cggcggattt gtcctactca ggagagcgtt caccgacaaa 420
caacagataa aacgaaaggc ccagtcttcc gactgagcct ttcgttttat ttgatgcctg 480
gcagttccct actctcgcgt taacgctagc atggatgttt tcccagtcac gacgttgtaa 540
aacgacggcc agtcttaagc tcgggccctg cagctctaga gctcgaattc tacaggtcac 600
taataccatc taagtagttg gttcatagtg actgcatatg ttgtgtttta cagtattatg 660
tagtctgttt tttatgcaaa atctaattta atatattgat atttatatca ttttacgttt 720
ctcgttcaac tttcttgtac aaagtggccg ttaacggatc cagacttgtc catcttctgg 780
attggccaac ttaattaatg tatgaaataa aaggatgcac acatagtgac atgctaatca 840
ctataatgtg ggcatcaaag ttgtgtgtta tgtgtaatta ctagttatct gaataaaaga 900
gaaagagatc atccatattt cttatcctaa atgaatgtca cgtgtcttta taattctttg 960
atgaaccaga tgcatttcat taaccaaatc catatacata taaatattaa tcatatataa 1020
ttaatatcaa ttgggttagc aaaacaaatc tagtctaggt gtgttttgcg aattgcggca 1080
agcttgcggc cgccccgggc aactttatta tacaaagttg gcattataaa aaagcattgc 1140
ttatcaattt gttgcaacga acaggtcact atcagtcaaa ataaaatcat tatttggagc 1200
tccatggtag cgttaacgcg gccgcgatat cccctatagt gagtcgtatt acatggtcat 1260
agctgtttcc tggcagctct ggcccgtgtc tcaaaatctc tgatgttaca ttgcacaaga 1320
taaaaatata tcatcatgaa caataaaact gtctgcttac ataaacagta atacaagggg 1380
tgttatgagc catattcaac gggaaacgtc gaggccgcga ttaaattcca acatggatgc 1440
tgatttatat gggtataaat gggctcgcga taatgtcggg caatcaggtg cgacaatcta 1500
tcgcttgtat gggaagcccg atgcgccaga gttgtttctg aaacatggca aaggtagcgt 1560
tgccaatgat gttacagatg agatggtcag actaaactgg ctgacggaat ttatgcctct 1620
tccgaccatc aagcatttta tccgtactcc tgatgatgca tggttactca ccactgcgat 1680
ccccggaaaa acagcattcc aggtattaga agaatatcct gattcaggtg aaaatattgt 1740
tgatgcgctg gcagtgttcc tgcgccggtt gcattcgatt cctgtttgta attgtccttt 1800
taacagcgat cgcgtatttc gtctcgctca ggcgcaatca cgaatgaata acggtttggt 1860
tgatgcgagt gattttgatg acgagcgtaa tggctggcct gttgaacaag tctggaaaga 1920
aatgcataaa cttttgccat tctcaccgga ttcagtcgtc actcatggtg atttctcact 1980
tgataacctt atttttgacg aggggaaatt aataggttgt attgatgttg gacgagtcgg 2040
aatcgcagac cgataccagg atcttgccat cctatggaac tgcctcggtg agttttctcc 2100
ttcattacag aaacggcttt ttcaaaaata tggtattgat aatcctgata tgaataaatt 2160
gcagtttcat ttgatgctcg atgagttttt ctaatcagaa ttggttaatt ggttgtaaca 2220
ctggcagagc attacgctga cttgacggga cggcgcaagc tcatgaccaa aatcccttaa 2280
cgtgagttac gcgtcgttcc actgagcgtc agaccccgta gaaaagatca aaggatcttc 2340
ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac caccgctacc 2400
agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg taactggctt 2460
cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag gccaccactt 2520
caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac cagtggctgc 2580
tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt taccggataa 2640
ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg agcgaacgac 2700
ctacaccgaa ctgagatacc tacagcgtga gcattgagaa agcgccacgc ttcccgaagg 2760
gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc gcacgaggga 2820
gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc acctctgact 2880
tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa acgccagcaa 2940
cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt t 2991
<210>10
<211>13807
<212>DNA
<213>人工序列
<220>
<223>载体
<400>10
aagctggtac gattgtaata cgactcacta tagggcgaat tgagcgctgt ttaaacgctc 60
ttcaactgga agagcggtta ccagagctgg tcacctttgt ccaccaagat ggaactgcgg 120
ccgctcatta attaagtcag gcgcgcctct agttgaagac acgttcatgt cttcatcgta 180
agaagacact cagtagtctt cggccagaat ggccgtaggt gaattaagag gagagaggag 240
gtaaacattt tcttctattt tttcatattt tcaggataaa ttattgtaaa agtttacaag 300
atttccattt gactagtgta aatgaggaat attctctagt aagatcatta tttcatctac 360
ttcttttatc ttctaccagt agaggaataa acaatattta gctcctttgt aaatacaaat 420
taattttcgt tcttgacatc attcaatttt aattttacgt ataaaataaa agatcatacc 480
tattagaacg attaaggaga aatacaattc gaatgagaag gatgtgccgt ttgttataat 540
aaacagccac acgacgtaaa cgtaaaatga ccacatgatg ggccaataga catggaccga 600
ctactaataa tagtaagtta cattttagga tggaataaat atcataccga catcagtttg 660
aaagaaaagg gaaaaaaaga aaaaataaat aaaagatata ctaccgacat gagttccaaa 720
aagcaaaaaa aaagatcaag ccgacacaga cacgcgtaga gagcaaaatg actttgacgt 780
cacaccacga aaacagacgc ttcatacgtg tccctttatc tctctcagtc tctctataaa 840
cttagtgaga ccctcctctg ttttactcag gatccccggg taccgagctc gaattcaccg 900
gtcgccacca tggcccacag caagcacggc ctgaaggagg agatgaccat gaagtaccac 960
atggagggct gcgtgaacgg ccacaagttc gtgatcaccg gcgagggcat cggctacccc 1020
ttcaagggca agcagaccat caacctgtgc gtgatcgagg gcggccccct gcccttcagc 1080
gaggacatcc tgagcgccgg cttcaagtac ggcgaccgga tcttcaccga gtacccccag 1140
gacatcgtgg actacttcaa gaacagctgc cccgccggct acacctgggg ccggagcttc 1200
ctgttcgagg acggcgccgt gtgcatctgt aacgtggaca tcaccgtgag cgtgaaggag 1260
aactgcatct accacaagag catcttcaac ggcgtgaact tccccgccga cggccccgtg 1320
atgaagaaga tgaccaccaa ctgggaggcc agctgcgaga agatcatgcc cgtgcctaag 1380
cagggcatcc tgaagggcga cgtgagcatg tacctgctgc tgaaggacgg cggccggtac 1440
cggtgccagt tcgacaccgt gtacaaggcc aagagcgtgc ccagcaagat gcccgagtgg 1500
cacttcatcc agcacaagct gctgcgggag gaccggagcg acgccaagaa ccagaagtgg 1560
cagctgaccg agcacgccat cgccttcccc agcgccctgg cctgaagcgg cccatggata 1620
ttcgaacgcg taggtaccac atggttaacc tagacttgtc catcttctgg attggccaac 1680
ttaattaatg tatgaaataa aaggatgcac acatagtgac atgctaatca ctataatgtg 1740
ggcatcaaag ttgtgtgtta tgtgtaatta ctagttatct gaataaaaga gaaagagatc 1800
atccatattt cttatcctaa atgaatgtca cgtgtcttta taattctttg atgaaccaga 1860
tgcatttcat taaccaaatc catatacata taaatattaa tcatatataa ttaatatcaa 1920
ttgggttagc aaaacaaatc tagtctaggt gtgttttgcg aatgcggcca ttggcctaga 1980
aggccattta aatcctgagg atctggtctt cctaaggacc cgggatatcg ctatcaactt 2040
tgtatagaaa agttgaacga gaaacgtaaa atgatataaa tatcaatata ttaaattaga 2100
ttttgcataa aaaacagact acataatact gtaaaacaca acatatccag tcactatggt 2160
cgacctgcag actggctgtg tataagggag cctgacattt atattcccca gaacatcagg 2220
ttaatggcgt ttttgatgtc attttcgcgg tggctgagat cagccacttc ttccccgata 2280
acggagaccg gcacactggc catatcggtg gtcatcatgc gccagctttc atccccgata 2340
tgcaccaccg ggtaaagttc acgggggact ttatctgaca gcagacgtgc actggccagg 2400
gggatcacca tccgtcgccc gggcgtgtca ataatatcac tctgtacatc cacaaacaga 2460
cgataacggc tctctctttt ataggtgtaa accttaaact gcatttcacc agcccctgtt 2520
ctcgtcggca aaagagccgt tcatttcaat aaaccgggcg acctcagcca tcccttcctg 2580
attttccgct ttccagcgtt cggcacgcag acgacgggct tcattctgca tggttgtgct 2640
taccgaaccg gagatattga catcatatat gccttgagca actgatagct gtcgctgtca 2700
actgtcactg taatacgctg cttcatagca tacctctttt tgacatactt cgggtataca 2760
tatcagtata tattcttata ccgcaaaaat cagcgcgcaa atacgcatac tgttatctgg 2820
cttttagtaa gccggatcct ctagattacg ccccgcctgc cactcatcgc agtactgttg 2880
taattcatta agcattctgc cgacatggaa gccatcacaa acggcatgat gaacctgaat 2940
cgccagcggc atcagcacct tgtcgccttg cgtataatat ttgcccatgg tgaaaacggg 3000
ggcgaagaag ttgtccatat tggccacgtt taaatcaaaa ctggtgaaac tcacccaggg 3060
attggctgag acgaaaaaca tattctcaat aaacccttta gggaaatagg ccaggttttc 3120
accgtaacac gccacatctt gcgaatatat gtgtagaaac tgccggaaat cgtcgtggta 3180
ttcactccag agcgatgaaa acgtttcagt ttgctcatgg aaaacggtgt aacaagggtg 3240
aacactatcc catatcacca gctcaccgtc tttcattgcc atacggaatt ccggatgagc 3300
attcatcagg cgggcaagaa tgtgaataaa ggccggataa aacttgtgct tatttttctt 3360
tacggtcttt aaaaaggccg taatatccag ctgaacggtc tggttatagg tacattgagc 3420
aactgactga aatgcctcaa aatgttcttt acgatgccat tgggatatat caacggtggt 3480
atatccagtg atttttttct ccattttagc ttccttagct cctgaaaatc tcgacggatc 3540
ctaactcaaa atccacacat tatacgagcc ggaagcataa agtgtaaagc ctggggtgcc 3600
ctaatgcggc cgccatagtg actggatatg ttgtgtttta cagtattatg tagtctgttt 3660
tttatgcaaa atctaattta atatattgat atttatatca ttttacgttt ctcgttcaac 3720
tttattatac aaagttgata gatatcggac cgattaaact ttaattcggt ccgaagcttg 3780
catgcctgca gtgcagcgtg acccggtcgt gcccctctct agagataatg agcattgcat 3840
gtctaagtta taaaaaatta ccacatattt tttttgtcac acttgtttga agtgcagttt 3900
atctatcttt atacatatat ttaaacttta ctctacgaat aatataatct atagtactac 3960
aataatatca gtgttttaga gaatcatata aatgaacagt tagacatggt ctaaaggaca 4020
attgagtatt ttgacaacag gactctacag ttttatcttt ttagtgtgca tgtgttctcc 4080
tttttttttg caaatagctt cacctatata atacttcatc cattttatta gtacatccat 4140
ttagggttta gggttaatgg tttttataga ctaatttttt tagtacatct attttattct 4200
attttagcct ctaaattaag aaaactaaaa ctctatttta gtttttttat ttaataattt 4260
agatataaaa tagaataaaa taaagtgact aaaaattaaa caaataccct ttaagaaatt 4320
aaaaaaacta aggaaacatt tttcttgttt cgagtagata atgccagcct gttaaacgcc 4380
gtcgacgagt ctaacggaca ccaaccagcg aaccagcagc gtcgcgtcgg gccaagcgaa 4440
gcagacggca cggcatctct gtcgctgcct ctggacccct ctcgagagtt ccgctccacc 4500
gttggacttg ctccgctgtc ggcatccaga aattgcgtgg cggagcggca gacgtgagcc 4560
ggcacggcag gcggcctcct cctcctctca cggcaccggc agctacgggg gattcctttc 4620
ccaccgctcc ttcgctttcc cttcctcgcc cgccgtaata aatagacacc ccctccacac 4680
cctctttccc caacctcgtg ttgttcggag cgcacacaca cacaaccaga tctcccccaa 4740
atccacccgt cggcacctcc gcttcaaggt acgccgctcg tcctcccccc cccccctctc 4800
taccttctct agatcggcgt tccggtccat gcatggttag ggcccggtag ttctacttct 4860
gttcatgttt gtgttagatc cgtgtttgtg ttagatccgt gctgctagcg ttcgtacacg 4920
gatgcgacct gtacgtcaga cacgttctga ttgctaactt gccagtgttt ctctttgggg 4980
aatcctggga tggctctagc cgttccgcag acgggatcga tttcatgatt ttttttgttt 5040
cgttgcatag ggtttggttt gcccttttcc tttatttcaa tatatgccgt gcacttgttt 5100
gtcgggtcat cttttcatgc ttttttttgt cttggttgtg atgatgtggt ctggttgggc 5160
ggtcgttcta gatcggagta gaattctgtt tcaaactacc tggtggattt attaattttg 5220
gatctgtatg tgtgtgccat acatattcat agttacgaat tgaagatgat ggatggaaat 5280
atcgatctag gataggtata catgttgatg cgggttttac tgatgcatat acagagatgc 5340
tttttgttcg cttggttgtg atgatgtggt gtggttgggc ggtcgttcat tcgttctaga 5400
tcggagtaga atactgtttc aaactacctg gtgtatttat taattttgga actgtatgtg 5460
tgtgtcatac atcttcatag ttacgagttt aagatggatg gaaatatcga tctaggatag 5520
gtatacatgt tgatgtgggt tttactgatg catatacatg atggcatatg cagcatctat 5580
tcatatgctc taaccttgag tacctatcta ttataataaa caagtatgtt ttataattat 5640
tttgatcttg atatacttgg atgatggcat atgcagcagc tatatgtgga tttttttagc 5700
cctgccttca tacgctattt atttgcttgg tactgtttct tttgtcgatg ctcaccctgt 5760
tgtttggtgt tacttctgca ggtcgacttt aacttagcct aggatccaca cgacaccatg 5820
tcccccgagc gccgccccgt cgagatccgc ccggccaccg ccgccgacat ggccgccgtg 5880
tgcgacatcg tgaaccacta catcgagacc tccaccgtga acttccgcac cgagccgcag 5940
accccgcagg agtggatcga cgacctggag cgcctccagg accgctaccc gtggctcgtg 6000
gccgaggtgg agggcgtggt ggccggcatc gcctacgccg gcccgtggaa ggcccgcaac 6060
gcctacgact ggaccgtgga gtccaccgtg tacgtgtccc accgccacca gcgcctcggc 6120
ctcggctcca ccctctacac ccacctcctc aagagcatgg aggcccaggg cttcaagtcc 6180
gtggtggccg tgatcggcct cccgaacgac ccgtccgtgc gcctccacga ggccctcggc 6240
tacaccgccc gcggcaccct ccgcgccgcc ggctacaagc acggcggctg gcacgacgtc 6300
ggcttctggc agcgcgactt cgagctgccg gccccgccgc gcccggtgcg cccggtgacg 6360
cagatctccg gtggaggcgg cagcggtggc ggaggctccg gaggcggtgg ctccatggcc 6420
tcctccgagg acgtcatcaa ggagttcatg cgcttcaagg tgcgcatgga gggctccgtg 6480
aacggccacg agttcgagat cgagggcgag ggcgagggcc gcccctacga gggcacccag 6540
accgccaagc tgaaggtgac caagggcggc cccctgccct tcgcctggga catcctgtcc 6600
ccccagttcc agtacggctc caaggtgtac gtgaagcacc ccgccgacat ccccgactac 6660
aagaagctgt ccttccccga gggcttcaag tgggagcgcg tgatgaactt cgaggacggc 6720
ggcgtggtga ccgtgaccca ggactcctcc ctgcaggacg gctccttcat ctacaaggtg 6780
aagttcatcg gcgtgaactt cccctccgac ggccccgtaa tgcagaagaa gactatgggc 6840
tgggaggcct ccaccgagcg cctgtacccc cgcgacggcg tgctgaaggg cgagatccac 6900
aaggccctga agctgaagga cggcggccac tacctggtgg agttcaagtc catctacatg 6960
gccaagaagc ccgtgcagct gcccggctac tactacgtgg actccaagct ggacatcacc 7020
tcccacaacg aggactacac catcgtggag cagtacgagc gcgccgaggg ccgccaccac 7080
ctgttcctgt agtcaggatc tgagtcgaaa cctagacttg tccatcttct ggattggcca 7140
acttaattaa tgtatgaaat aaaaggatgc acacatagtg acatgctaat cactataatg 7200
tgggcatcaa agttgtgtgt tatgtgtaat tactagttat ctgaataaaa gagaaagaga 7260
tcatccatat ttcttatcct aaatgaatgt cacgtgtctt tataattctt tgatgaacca 7320
gatgcatttc attaaccaaa tccatataca tataaatatt aatcatatat aattaatatc 7380
aattgggtta gcaaaacaaa tctagtctag gtgtgttttg cgaatgcggc cgccaccgcg 7440
gtggagctcg aattcattcc gattaatcgt ggcctcttgc tcttcaggat gaagagctat 7500
gtttaaacgt gcaagcgcta ctagacaatt cagtacatta aaaacgtccg caatgtgtta 7560
ttaagttgtc taagcgtcaa tttgtttaca ccacaatata tcctgccacc agccagccaa 7620
cagctccccg accggcagct cggcacaaaa tcaccactcg atacaggcag cccatcagtc 7680
cgggacggcg tcagcgggag agccgttgta aggcggcaga ctttgctcat gttaccgatg 7740
ctattcggaa gaacggcaac taagctgccg ggtttgaaac acggatgatc tcgcggaggg 7800
tagcatgttg attgtaacga tgacagagcg ttgctgcctg tgatcaaata tcatctccct 7860
cgcagagatc cgaattatca gccttcttat tcatttctcg cttaaccgtg acaggctgtc 7920
gatcttgaga actatgccga cataatagga aatcgctgga taaagccgct gaggaagctg 7980
agtggcgcta tttctttaga agtgaacgtt gacgatcgtc gaccgtaccc cgatgaatta 8040
attcggacgt acgttctgaa cacagctgga tacttacttg ggcgattgtc atacatgaca 8100
tcaacaatgt acccgtttgt gtaaccgtct cttggaggtt cgtatgacac tagtggttcc 8160
cctcagcttg cgactagatg ttgaggccta acattttatt agagagcagg ctagttgctt 8220
agatacatga tcttcaggcc gttatctgtc agggcaagcg aaaattggcc atttatgacg 8280
accaatgccc cgcagaagct cccatctttg ccgccataga cgccgcgccc cccttttggg 8340
gtgtagaaca tccttttgcc agatgtggaa aagaagttcg ttgtcccatt gttggcaatg 8400
acgtagtagc cggcgaaagt gcgagaccca tttgcgctat atataagcct acgatttccg 8460
ttgcgactat tgtcgtaatt ggatgaacta ttatcgtagt tgctctcaga gttgtcgtaa 8520
tttgatggac tattgtcgta attgcttatg gagttgtcgt agttgcttgg agaaatgtcg 8580
tagttggatg gggagtagtc atagggaaga cgagcttcat ccactaaaac aattggcagg 8640
tcagcaagtg cctgccccga tgccatcgca agtacgaggc ttagaaccac cttcaacaga 8700
tcgcgcatag tcttccccag ctctctaacg cttgagttaa gccgcgccgc gaagcggcgt 8760
cggcttgaac gaattgttag acattatttg ccgactacct tggtgatctc gcctttcacg 8820
tagtgaacaa attcttccaa ctgatctgcg cgcgaggcca agcgatcttc ttgtccaaga 8880
taagcctgcc tagcttcaag tatgacgggc tgatactggg ccggcaggcg ctccattgcc 8940
cagtcggcag cgacatcctt cggcgcgatt ttgccggtta ctgcgctgta ccaaatgcgg 9000
gacaacgtaa gcactacatt tcgctcatcg ccagcccagt cgggcggcga gttccatagc 9060
gttaaggttt catttagcgc ctcaaataga tcctgttcag gaaccggatc aaagagttcc 9120
tccgccgctg gacctaccaa ggcaacgcta tgttctcttg cttttgtcag caagatagcc 9180
agatcaatgt cgatcgtggc tggctcgaag atacctgcaa gaatgtcatt gcgctgccat 9240
tctccaaatt gcagttcgcg cttagctgga taacgccacg gaatgatgtc gtcgtgcaca 9300
acaatggtga cttctacagc gcggagaatc tcgctctctc caggggaagc cgaagtttcc 9360
aaaaggtcgt tgatcaaagc tcgccgcgtt gtttcatcaa gccttacagt caccgtaacc 9420
agcaaatcaa tatcactgtg tggcttcagg ccgccatcca ctgcggagcc gtacaaatgt 9480
acggccagca acgtcggttc gagatggcgc tcgatgacgc caactacctc tgatagttga 9540
gtcgatactt cggcgatcac cgcttccctc atgatgttta actcctgaat taagccgcgc 9600
cgcgaagcgg tgtcggcttg aatgaattgt taggcgtcat cctgtgctcc cgagaaccag 9660
taccagtaca tcgctgtttc gttcgagact tgaggtctag ttttatacgt gaacaggtca 9720
atgccgccga gagtaaagcc acattttgcg tacaaattgc aggcaggtac attgttcgtt 9780
tgtgtctcta atcgtatgcc aaggagctgt ctgcttagtg cccacttttt cgcaaattcg 9840
atgagactgt gcgcgactcc tttgcctcgg tgcgtgtgcg acacaacaat gtgttcgata 9900
gaggctagat cgttccatgt tgagttgagt tcaatcttcc cgacaagctc ttggtcgatg 9960
aatgcgccat agcaagcaga gtcttcatca gagtcatcat ccgagatgta atccttccgg 10020
taggggctca cacttctggt agatagttca aagccttggt cggataggtg cacatcgaac 10080
acttcacgaa caatgaaatg gttctcagca tccaatgttt ccgccacctg ctcagggatc 10140
accgaaatct tcatatgacg cctaacgcct ggcacagcgg atcgcaaacc tggcgcggct 10200
tttggcacaa aaggcgtgac aggtttgcga atccgttgct gccacttgtt aacccttttg 10260
ccagatttgg taactataat ttatgttaga ggcgaagtct tgggtaaaaa ctggcctaaa 10320
attgctgggg atttcaggaa agtaaacatc accttccggc tcgatgtcta ttgtagatat 10380
atgtagtgta tctacttgat cgggggatct gctgcctcgc gcgtttcggt gatgacggtg 10440
aaaacctctg acacatgcag ctcccggaga cggtcacagc ttgtctgtaa gcggatgccg 10500
ggagcagaca agcccgtcag ggcgcgtcag cgggtgttgg cgggtgtcgg ggcgcagcca 10560
tgacccagtc acgtagcgat agcggagtgt atactggctt aactatgcgg catcagagca 10620
gattgtactg agagtgcacc atatgcggtg tgaaataccg cacagatgcg taaggagaaa 10680
ataccgcatc aggcgctctt ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg 10740
gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg 10800
ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa 10860
ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg 10920
acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc 10980
tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc 11040
ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc 11100
ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg 11160
ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc 11220
actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga 11280
gttcttgaag tggtggccta actacggcta cactagaagg acagtatttg gtatctgcgc 11340
tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac 11400
caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg 11460
atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc 11520
acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa 11580
ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta 11640
ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt 11700
tgcctgactc cccgtcgtgt agataactac gatacgggag ggcttaccat ctggccccag 11760
tgctgcaatg ataccgcgag acccacgctc accggctcca gatttatcag caataaacca 11820
gccagccgga agggccgagc gcagaagtgg tcctgcaact ttatccgcct ccatccagtc 11880
tattaattgt tgccgggaag ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt 11940
tgttgccatt gctgcagggg gggggggggg gggggacttc cattgttcat tccacggaca 12000
aaaacagaga aaggaaacga cagaggccaa aaagcctcgc tttcagcacc tgtcgtttcc 12060
tttcttttca gagggtattt taaataaaaa cattaagtta tgacgaagaa gaacggaaac 12120
gccttaaacc ggaaaatttt cataaatagc gaaaacccgc gaggtcgccg ccccgtaacc 12180
tgtcggatca ccggaaagga cccgtaaagt gataatgatt atcatctaca tatcacaacg 12240
tgcgtggagg ccatcaaacc acgtcaaata atcaattatg acgcaggtat cgtattaatt 12300
gatctgcatc aacttaacgt aaaaacaact tcagacaata caaatcagcg acactgaata 12360
cggggcaacc tcatgtcccc cccccccccc cccctgcagg catcgtggtg tcacgctcgt 12420
cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc 12480
ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt 12540
tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc 12600
catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt 12660
gtatgcggcg accgagttgc tcttgcccgg cgtcaacacg ggataatacc gcgccacata 12720
gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga 12780
tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag 12840
catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa 12900
aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt 12960
attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga 13020
aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct gacgtctaag 13080
aaaccattat tatcatgaca ttaacctata aaaataggcg tatcacgagg ccctttcgtc 13140
ttcaagaatt ggtcgacgat cttgctgcgt tcggatattt tcgtggagtt cccgccacag 13200
acccggattg aaggcgagat ccagcaactc gcgccagatc atcctgtgac ggaactttgg 13260
cgcgtgatga ctggccagga cgtcggccga aagagcgaca agcagatcac gcttttcgac 13320
agcgtcggat ttgcgatcga ggatttttcg gcgctgcgct acgtccgcga ccgcgttgag 13380
ggatcaagcc acagcagccc actcgacctt ctagccgacc cagacgagcc aagggatctt 13440
tttggaatgc tgctccgtcg tcaggctttc cgacgtttgg gtggttgaac agaagtcatt 13500
atcgtacgga atgccaagca ctcccgaggg gaaccctgtg gttggcatgc acatacaaat 13560
ggacgaacgg ataaaccttt tcacgccctt ttaaatatcc gttattctaa taaacgctct 13620
tttctcttag gtttacccgc caatatatcc tgtcaaacac tgatagttta aactgaaggc 13680
gggaaacgac aatctgatca tgagcggaga attaagggag tcacgttatg acccccgccg 13740
atgacgcggg acaagccgtt ttacgtttgg aactgacaga accgcaacgt tgaaggagcc 13800
actcagc 13807
<210>11
<211>4678
<212>DNA
<213>人工序列
<220>
<223>载体
<400>11
gaaaggccca gtcttccgac tgagcctttc gttttatttg atgcctggca gttccctact 60
ctcgcgttaa cgctagcatg gatgttttcc cagtcacgac gttgtaaaac gacggccagt 120
cttaagctcg ggcccgcgtt aacgctacca tggagctcca aataatgatt ttattttgac 180
tgatagtgac ctgttcgttg caacaaattg ataagcaatg cttttttata atgccaactt 240
tgtatagaaa agttgggccg aattcgagct cggtacggcc agaatggccc ggaccgggtt 300
accgaattcg agctcggtac cctgggatcc ctggtaatta ttggctgtag gattctaaac 360
agagcctaaa tagctggaat agctctagcc ctcaatccaa actaatgata tctatactta 420
tgcaactcta aatttttatt ctaaaagtaa tatttcattt ttgtcaacga gattctctac 480
tctattccac aatcttttga agcaatattt accttaaatc tgtactctat accaataatc 540
atatattcta ttatttattt ttatctctct cctaaggagc atccccctat gtctgcatgg 600
cccccgcctc gggtcccaat ctcttgctct gctagtagca cagaagaaaa cactagaaat 660
gacttgcttg acttagagta tcagataaac atcatgttta cttaacttta atttgtatcg 720
gtttctacta tttttataat atttttgtct ctatagatac tacgtgcaac agtataatca 780
acctagttta atccagagcg aaggattttt tactaagtac gtgactccat atgcacagcg 840
ttccttttat ggttcctcac tgggcacagc ataaacgaac cctgtccaat gttttcagcg 900
cgaacaaaca gaaattccat cagcgaacaa acaacataca tgcgagatga aaataaataa 960
taaaaaaagc tccgtctcga taggccggca cgaatcgaga gcctccatag ccagtttttt 1020
ccatcggaac ggcggttcgc gcacctaatt atatgcacca cacgcctata aagccaacca 1080
acccgtcgga ggggcgcaag ccagacagaa gacagcccgt cagcccctct cgtttttcat 1140
ccgccttcgc ctccaaccgc gtgcgctcca cgcctcctcc aggaaagcga ggatctcccc 1200
caaatccacc cgtcggcacc tccgcttcaa ggtacgccgc tcgtcctccc cccccccccc 1260
tctctacctt ctctagatcg gcgttccggt ccatggttag ggcccggtag ttctacttct 1320
gttcatgttt gtgttagatc cgtgtttgtg ttagatccgt gctgctagcg ttcgtacacg 1380
gatgcgacct gtacgtcaga cacgttctga ttgctaactt gccagtgttt ctctttgggg 1440
aatcctggga tggctctagc cgttccgcag acgggatcga tttcatgatt ttttttgttt 1500
cgttgcatag ggtttggttt gcccttttcc tttatttcaa tatatgccgt gcacttgttt 1560
gtcgggtcat cttttcatgc ttttttttgt cttggttgtg atgatgtggt ctggttgggc 1620
ggtcgttcta gatcggagta gaattctgtt tcaaactacc tggtggattt attaattttg 1680
gatctgtatg tgtgtgccat acatattcat agttacgaat tgaagatgat ggatggaaat 1740
atcgatctag gataggtata catgttgatg cgggttttac tgatgcatat acagagatgc 1800
tttttgttcg cttggttgtg atgatgtggt gtggttgggc ggtcgttcat tcgttctaga 1860
tcggagtaga atactgtttc aaactacctg gtgtatttat taattttgga actgtatgtg 1920
tgtgtcatac atcttcatag ttacgagttt aagatggatg gaaatatcga tctaggatag 1980
gtatacatgt tgatgtgggt tttactgatg catatacatg atggcatatg cagcatctat 2040
tcatatgctc taaccttgag tacctatcta ttataataaa caagtatgtt ttataattat 2100
tttgatcttg atatacttgg atgatggcat atgcagcagc tatatgtgga tttttttagc 2160
cctgccttca tacgctattt atttgcttgg tactgtttct tttgtcgatg ctcaccctgt 2220
tgtttggtgt tacttctgca ggtcgactct agaagcttgg tcacccggtc cgggcctaga 2280
aggccagctt caagtttgta caaaaaagtt gaacgagaaa cgtaaaatga tataaatatc 2340
aatatattaa attagatttt gcataaaaaa cagactacat aatactgtaa aacacaacat 2400
atgcagtcac tatgaatcaa ctacttagat ggtattagtg acctgtagaa ttcgagctct 2460
agagctgcag ggcggccgcg atatccccta tagtgagtcg tattacatgg tcatagctgt 2520
ttcctggcag ctctggcccg tgtctcaaaa tctctgatgt tacattgcac aagataaaaa 2580
tatatcatca tgaacaataa aactgtctgc ttacataaac agtaatacaa ggggtgttat 2640
gagccatatt caacgggaaa cgtcgaggcc gcgattaaat tccaacatgg atgctgattt 2700
atatgggtat aaatgggctc gcgataatgt cgggcaatca ggtgcgacaa tctatcgctt 2760
gtatgggaag cccgatgcgc cagagttgtt tctgaaacat ggcaaaggta gcgttgccaa 2820
tgatgttaca gatgagatgg tcagactaaa ctggctgacg gaatttatgc ctcttccgac 2880
catcaagcat tttatccgta ctcctgatga tgcatggtta ctcaccactg cgatccccgg 2940
aaaaacagca ttccaggtat tagaagaata tcctgattca ggtgaaaata ttgttgatgc 3000
gctggcagtg ttcctgcgcc ggttgcattc gattcctgtt tgtaattgtc cttttaacag 3060
cgatcgcgta tttcgtctcg ctcaggcgca atcacgaatg aataacggtt tggttgatgc 3120
gagtgatttt gatgacgagc gtaatggctg gcctgttgaa caagtctgga aagaaatgca 3180
taaacttttg ccattctcac cggattcagt cgtcactcat ggtgatttct cacttgataa 3240
ccttattttt gacgagggga aattaatagg ttgtattgat gttggacgag tcggaatcgc 3300
agaccgatac caggatcttg ccatcctatg gaactgcctc ggtgagtttt ctccttcatt 3360
acagaaacgg ctttttcaaa aatatggtat tgataatcct gatatgaata aattgcagtt 3420
tcatttgatg ctcgatgagt ttttctaatc agaattggtt aattggttgt aacactggca 3480
gagcattacg ctgacttgac gggacggcgc aagctcatga ccaaaatccc ttaacgtgag 3540
ttacgcgtcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga 3600
tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt 3660
ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg gcttcagcag 3720
agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc acttcaagaa 3780
ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg ctgctgccag 3840
tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg ataaggcgca 3900
gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa cgacctacac 3960
cgaactgaga tacctacagc gtgagcattg agaaagcgcc acgcttcccg aagggagaaa 4020
ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga gggagcttcc 4080
agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct gacttgagcg 4140
tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca gcaacgcggc 4200
ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc 4260
ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag 4320
ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa 4380
accgcctctc cccgcgcgtt ggccgattca ttaatgcagc tggcacgaca ggtttcccga 4440
ctggaaagcg ggcagtgagc gcaacgcaat taatacgcgt accgctagcc aggaagagtt 4500
tgtagaaacg caaaaaggcc atccgtcagg atggccttct gcttagtttg atgcctggca 4560
gtttatggcg ggcgtcctgc ccgccaccct ccgggccgtt gcttcacaac gttcaaatcc 4620
gctcccggcg gatttgtcct actcaggaga gcgttcaccg acaaacaaca gataaaac 4678
<210>12
<211>3505
<212>DNA
<213>人工序列
<220>
<223>载体
<400>12
gatccccggg taccgagctc gaattcggcc caagtttgta caaaaaagtt gaacgagaaa 60
cgtaaaatga tataaatatc aatatattaa attagatttt gcataaaaaa cagactacat 120
aatactgtaa aacacaacat atgcagtcac tatgaatcaa ctacttagat ggtattagtg 180
acctgtagaa ttcgagctct agagctgcag ggcggccgcg atatccccta tagtgagtcg 240
tattacatgg tcatagctgt ttcctggcag ctctggcccg tgtctcaaaa tctctgatgt 300
tacattgcac aagataaaaa tatatcatca tgaacaataa aactgtctgc ttacataaac 360
agtaatacaa ggggtgttat gagccatatt caacgggaaa cgtcgaggcc gcgattaaat 420
tccaacatgg atgctgattt atatgggtat aaatgggctc gcgataatgt cgggcaatca 480
ggtgcgacaa tctatcgctt gtatgggaag cccgatgcgc cagagttgtt tctgaaacat 540
ggcaaaggta gcgttgccaa tgatgttaca gatgagatgg tcagactaaa ctggctgacg 600
gaatttatgc ctcttccgac catcaagcat tttatccgta ctcctgatga tgcatggtta 660
ctcaccactg cgatccccgg aaaaacagca ttccaggtat tagaagaata tcctgattca 720
ggtgaaaata ttgttgatgc gctggcagtg ttcctgcgcc ggttgcattc gattcctgtt 780
tgtaattgtc cttttaacag cgatcgcgta tttcgtctcg ctcaggcgca atcacgaatg 840
aataacggtt tggttgatgc gagtgatttt gatgacgagc gtaatggctg gcctgttgaa 900
caagtctgga aagaaatgca taaacttttg ccattctcac cggattcagt cgtcactcat 960
ggtgatttct cacttgataa ccttattttt gacgagggga aattaatagg ttgtattgat 1020
gttggacgag tcggaatcgc agaccgatac caggatcttg ccatcctatg gaactgcctc 1080
ggtgagtttt ctccttcatt acagaaacgg ctttttcaaa aatatggtat tgataatcct 1140
gatatgaata aattgcagtt tcatttgatg ctcgatgagt ttttctaatc agaattggtt 1200
aattggttgt aacactggca gagcattacg ctgacttgac gggacggcgc aagctcatga 1260
ccaaaatccc ttaacgtgag ttacgcgtcg ttccactgag cgtcagaccc cgtagaaaag 1320
atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa 1380
aaaccaccgc taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg 1440
aaggtaactg gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag 1500
ttaggccacc acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg 1560
ttaccagtgg ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga 1620
tagttaccgg ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc 1680
ttggagcgaa cgacctacac cgaactgaga tacctacagc gtgagcattg agaaagcgcc 1740
acgcttcccg aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga 1800
gagcgcacga gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt 1860
cgccacctct gacttgagcg tcgatttttg tgatgctcgt caggggggcg gagcctatgg 1920
aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac 1980
atgttctttc ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga 2040
gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg 2100
gaagagcgcc caatacgcaa accgcctctc cccgcgcgtt ggccgattca ttaatgcagc 2160
tggcacgaca ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat taatacgcgt 2220
accgctagcc aggaagagtt tgtagaaacg caaaaaggcc atccgtcagg atggccttct 2280
gcttagtttg atgcctggca gtttatggcg ggcgtcctgc ccgccaccct ccgggccgtt 2340
gcttcacaac gttcaaatcc gctcccggcg gatttgtcct actcaggaga gcgttcaccg 2400
acaaacaaca gataaaacga aaggcccagt cttccgactg agcctttcgt tttatttgat 2460
gcctggcagt tccctactct cgcgttaacg ctagcatgga tgttttccca gtcacgacgt 2520
tgtaaaacga cggccagtct taagctcggg cccgcgttaa cgctaccatg gagctccaaa 2580
taatgatttt attttgactg atagtgacct gttcgttgca acaaattgat aagcaatgct 2640
tttttataat gccaactttg tatagaaaag ttgaagctta aatccttaca gaattgctgt 2700
agtttcatag tgctagatgt ggacagcaaa gcgccgctgt atgcttctgc ttttcttttt 2760
tggtgtgtgt agccacatcc tttgttcctg cccggcgcca tcccacttgg ttgttttttt 2820
ttatgattga aagccttcat gcttcctcgg tcaatcaccg gtgcgcactg ggagcatcgc 2880
cggaaaaaaa attcttcggc taagagtaac ttctttctcc ttttcttctc tgatctcgcg 2940
agcagtgctg ataacgtgtt gtaatctact tagcggtaac gagattgaga gagacaaaat 3000
gacagaacta ttgtctttat tgcagagtgt catgtattta tacaggggat acaaagtctc 3060
ccaaggggtg tgtcccttgg gagtaactgc cagttgatca caggacaata ttttgtaaca 3120
aaacgtacac atcgtcaaaa tagcgaggca tgaaactggc cttggccatg gacgcgtgaa 3180
gcgcgccatg cgttggatat gtggtcaata agtatataca atacaatgtt taacagagct 3240
gatagtactg ctttggcaca tttttgtcca cgcttcatga gagataaaac acctgcacgt 3300
aaattcacat gctgcactga aggcccgatc actgaggagc gaactgccgt aactcccttc 3360
tatatatacc cccagtccct gtttcagttt tcgtcaagct agcagcacca agttgtcgat 3420
cacttgcctg ctcttgagct cgattaagct atcatcagct acagcatccg atcccaaact 3480
gcaactgtag cagcgacaac tgccg 3505
<210>13
<211>49765
<212>DNA
<213>人工序列
<220>
<223>载体
<400>13
gggggggggg ggggggggtt ccattgttca ttccacggac aaaaacagag aaaggaaacg 60
acagaggcca aaaagctcgc tttcagcacc tgtcgtttcc tttcttttca gagggtattt 120
taaataaaaa cattaagtta tgacgaagaa gaacggaaac gccttaaacc ggaaaatttt 180
cataaatagc gaaaacccgc gaggtcgccg ccccgtaacc tgtcggatca ccggaaagga 240
cccgtaaagt gataatgatt atcatctaca tatcacaacg tgcgtggagg ccatcaaacc 300
acgtcaaata atcaattatg acgcaggtat cgtattaatt gatctgcatc aacttaacgt 360
aaaaacaact tcagacaata caaatcagcg acactgaata cggggcaacc tcatgtcccc 420
cccccccccc cccctgcagg catcgtggtg tcacgctcgt cgtttggtat ggcttcattc 480
agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg caaaaaagcg 540
gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt gttatcactc 600
atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag atgcttttct 660
gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg accgagttgc 720
tcttgcccgg cgtcaacacg ggataatacc gcgccacata gcagaacttt aaaagtgctc 780
atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct gttgagatcc 840
agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac tttcaccagc 900
gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat aagggcgaca 960
cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat ttatcagggt 1020
tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca aataggggtt 1080
ccgcgcacat ttccccgaaa agtgccacct gacgtctaag aaaccattat tatcatgaca 1140
ttaacctata aaaataggcg tatcacgagg ccctttcgtc ttcaagaatt cggagctttt 1200
gccattctca ccggattcag tcgtcactca tggtgatttc tcacttgata accttatttt 1260
tgacgagggg aaattaatag gttgtattga tgttggacga gtcggaatcg cagaccgata 1320
ccaggatctt gccatcctat ggaactgcct cggtgagttt tctccttcat tacagaaacg 1380
gctttttcaa aaatatggta ttgataatcc tgatatgaat aaattgcagt ttcatttgat 1440
gctcgatgag tttttctaat cagaattggt taattggttg taacactggc agagcattac 1500
gctgacttga cgggacggcg gctttgttga ataaatcgaa cttttgctga gttgaaggat 1560
cagatcacgc atcttcccga caacgcagac cgttccgtgg caaagcaaaa gttcaaaatc 1620
accaactggt ccacctacaa caaagctctc atcaaccgtg gctccctcac tttctggctg 1680
gatgatgggg cgattcaggc ctggtatgag tcagcaacac cttcttcacg aggcagacct 1740
cagcgccaga aggccgccag agaggccgag cgcggccgtg aggcttggac gctagggcag 1800
ggcatgaaaa agcccgtagc gggctgctac gggcgtctga cgcggtggaa agggggaggg 1860
gatgttgtct acatggctct gctgtagtga gtgggttgcg ctccggcagc ggtcctgatc 1920
aatcgtcacc ctttctcggt ccttcaacgt tcctgacaac gagcctcctt ttcgccaatc 1980
catcgacaat caccgcgagt ccctgctcga acgctgcgtc cggaccggct tcgtcgaagg 2040
cgtctatcgc ggcccgcaac agcggcgaga gcggagcctg ttcaacggtg ccgccgcgct 2100
cgccggcatc gctgtcgccg gcctgctcct caagcacggc cccaacagtg aagtagctga 2160
ttgtcatcag cgcattgacg gcgtccccgg ccgaaaaacc cgcctcgcag aggaagcgaa 2220
gctgcgcgtc ggccgtttcc atctgcggtg cgcccggtcg cgtgccggca tggatgcgcg 2280
cgccatcgcg gtaggcgagc agcgcctgcc tgaagctgcg ggcattcccg atcagaaatg 2340
agcgccagtc gtcgtcggct ctcggcaccg aatgcgtatg attctccgcc agcatggctt 2400
cggccagtgc gtcgagcagc gcccgcttgt tcctgaagtg ccagtaaagc gccggctgct 2460
gaacccccaa ccgttccgcc agtttgcgtg tcgtcagacc gtctacgccg acctcgttca 2520
acaggtccag ggcggcacgg atcactgtat tcggctgcaa ctttgtcatg cttgacactt 2580
tatcactgat aaacataata tgtccaccaa cttatcagtg ataaagaatc cgcgcgttca 2640
atcggaccag cggaggctgg tccggaggcc agacgtgaaa cccaacatac ccctgatcgt 2700
aattctgagc actgtcgcgc tcgacgctgt cggcatcggc ctgattatgc cggtgctgcc 2760
gggcctcctg cgcgatctgg ttcactcgaa cgacgtcacc gcccactatg gcattctgct 2820
ggcgctgtat gcgttggtgc aatttgcctg cgcacctgtg ctgggcgcgc tgtcggatcg 2880
tttcgggcgg cggccaatct tgctcgtctc gctggccggc gccactgtcg actacgccat 2940
catggcgaca gcgcctttcc tttgggttct ctatatcggg cggatcgtgg ccggcatcac 3000
cggggcgact ggggcggtag ccggcgctta tattgccgat atcactgatg gcgatgagcg 3060
cgcgcggcac ttcggcttca tgagcgcctg tttcgggttc gggatggtcg cgggacctgt 3120
gctcggtggg ctgatgggcg gtttctcccc ccacgctccg ttcttcgccg cggcagcctt 3180
gaacggcctc aatttcctga cgggctgttt ccttttgccg gagtcgcaca aaggcgaacg 3240
ccggccgtta cgccgggagg ctctcaaccc gctcgcttcg ttccggtggg cccggggcat 3300
gaccgtcgtc gccgccctga tggcggtctt cttcatcatg caacttgtcg gacaggtgcc 3360
ggccgcgctt tgggtcattt tcggcgagga tcgctttcac tgggacgcga ccacgatcgg 3420
catttcgctt gccgcatttg gcattctgca ttcactcgcc caggcaatga tcaccggccc 3480
tgtagccgcc cggctcggcg aaaggcgggc actcatgctc ggaatgattg ccgacggcac 3540
aggctacatc ctgcttgcct tcgcgacacg gggatggatg gcgttcccga tcatggtcct 3600
gcttgcttcg ggtggcatcg gaatgccggc gctgcaagca atgttgtcca ggcaggtgga 3660
tgaggaacgt caggggcagc tgcaaggctc actggcggcg ctcaccagcc tgacctcgat 3720
cgtcggaccc ctcctcttca cggcgatcta tgcggcttct ataacaacgt ggaacgggtg 3780
ggcatggatt gcaggcgctg ccctctactt gctctgcctg ccggcgctgc gtcgcgggct 3840
ttggagcggc gcagggcaac gagccgatcg ctgatcgtgg aaacgatagg cctatgccat 3900
gcgggtcaag gcgacttccg gcaagctata cgcgccctag gagtgcggtt ggaacgttgg 3960
cccagccaga tactcccgat cacgagcagg acgccgatga tttgaagcgc actcagcgtc 4020
tgatccaaga acaaccatcc tagcaacacg gcggtccccg ggctgagaaa gcccagtaag 4080
gaaacaactg taggttcgag tcgcgagatc ccccggaacc aaaggaagta ggttaaaccc 4140
gctccgatca ggccgagcca cgccaggccg agaacattgg ttcctgtagg catcgggatt 4200
ggcggatcaa acactaaagc tactggaacg agcagaagtc ctccggccgc cagttgccag 4260
gcggtaaagg tgagcagagg cacgggaggt tgccacttgc gggtcagcac ggttccgaac 4320
gccatggaaa ccgcccccgc caggcccgct gcgacgccga caggatctag cgctgcgttt 4380
ggtgtcaaca ccaacagcgc cacgcccgca gttccgcaaa tagcccccag gaccgccatc 4440
aatcgtatcg ggctacctag cagagcggca gagatgaaca cgaccatcag cggctgcaca 4500
gcgcctaccg tcgccgcgac cccgcccggc aggcggtaga ccgaaataaa caacaagctc 4560
cagaatagcg aaatattaag tgcgccgagg atgaagatgc gcatccacca gattcccgtt 4620
ggaatctgtc ggacgatcat cacgagcaat aaacccgccg gcaacgcccg cagcagcata 4680
ccggcgaccc ctcggcctcg ctgttcgggc tccacgaaaa cgccggacag atgcgccttg 4740
tgagcgtcct tggggccgtc ctcctgtttg aagaccgaca gcccaatgat ctcgccgtcg 4800
atgtaggcgc cgaatgccac ggcatctcgc aaccgttcag cgaacgcctc catgggcttt 4860
ttctcctcgt gctcgtaaac ggacccgaac atctctggag ctttcttcag ggccgacaat 4920
cggatctcgc ggaaatcctg cacgtcggcc gctccaagcc gtcgaatctg agccttaatc 4980
acaattgtca attttaatcc tctgtttatc ggcagttcgt agagcgcgcc gtgcgtcccg 5040
agcgatactg agcgaagcaa gtgcgtcgag cagtgcccgc ttgttcctga aatgccagta 5100
aagcgctggc tgctgaaccc ccagccggaa ctgaccccac aaggccctag cgtttgcaat 5160
gcaccaggtc atcattgacc caggcgtgtt ccaccaggcc gctgcctcgc aactcttcgc 5220
aggcttcgcc gacctgctcg cgccacttct tcacgcgggt ggaatccgat ccgcacatga 5280
ggcggaaggt ttccagcttg agcgggtacg gctcccggtg cgagctgaaa tagtcgaaca 5340
tccgtcgggc cgtcggcgac agcttgcggt acttctccca tatgaatttc gtgtagtggt 5400
cgccagcaaa cagcacgacg atttcctcgt cgatcaggac ctggcaacgg gacgttttct 5460
tgccacggtc caggacgcgg aagcggtgca gcagcgacac cgattccagg tgcccaacgc 5520
ggtcggacgt gaagcccatc gccgtcgcct gtaggcgcga caggcattcc tcggccttcg 5580
tgtaataccg gccattgatc gaccagccca ggtcctggca aagctcgtag aacgtgaagg 5640
tgatcggctc gccgataggg gtgcgcttcg cgtactccaa cacctgctgc cacaccagtt 5700
cgtcatcgtc ggcccgcagc tcgacgccgg tgtaggtgat cttcacgtcc ttgttgacgt 5760
ggaaaatgac cttgttttgc agcgcctcgc gcgggatttt cttgttgcgc gtggtgaaca 5820
gggcagagcg ggccgtgtcg tttggcatcg ctcgcatcgt gtccggccac ggcgcaatat 5880
cgaacaagga aagctgcatt tccttgatct gctgcttcgt gtgtttcagc aacgcggcct 5940
gcttggcctc gctgacctgt tttgccaggt cctcgccggc ggtttttcgc ttcttggtcg 6000
tcatagttcc tcgcgtgtcg atggtcatcg acttcgccaa acctgccgcc tcctgttcga 6060
gacgacgcga acgctccacg gcggccgatg gcgcgggcag ggcaggggga gccagttgca 6120
cgctgtcgcg ctcgatcttg gccgtagctt gctggaccat cgagccgacg gactggaagg 6180
tttcgcgggg cgcacgcatg acggtgcggc ttgcgatggt ttcggcatcc tcggcggaaa 6240
accccgcgtc gatcagttct tgcctgtatg ccttccggtc aaacgtccga ttcattcacc 6300
ctccttgcgg gattgccccg actcacgccg gggcaatgtg cccttattcc tgatttgacc 6360
cgcctggtgc cttggtgtcc agataatcca ccttatcggc aatgaagtcg gtcccgtaga 6420
ccgtctggcc gtccttctcg tacttggtat tccgaatctt gccctgcacg aataccagcg 6480
accccttgcc caaatacttg ccgtgggcct cggcctgaga gccaaaacac ttgatgcgga 6540
agaagtcggt gcgctcctgc ttgtcgccgg catcgttgcg ccactcttca ttaaccgcta 6600
tatcgaaaat tgcttgcggc ttgttagaat tgccatgacg tacctcggtg tcacgggtaa 6660
gattaccgat aaactggaac tgattatggc tcatatcgaa agtctccttg agaaaggaga 6720
ctctagttta gctaaacatt ggttccgctg tcaagaactt tagcggctaa aattttgcgg 6780
gccgcgacca aaggtgcgag gggcggcttc cgctgtgtac aaccagatat ttttcaccaa 6840
catccttcgt ctgctcgatg agcggggcat gacgaaacat gagctgtcgg agagggcagg 6900
ggtttcaatt tcgtttttat cagacttaac caacggtaag gccaacccct cgttgaaggt 6960
gatggaggcc attgccgacg ccctggaaac tcccctacct cttctcctgg agtccaccga 7020
ccttgaccgc gaggcactcg cggagattgc gggtcatcct ttcaagagca gcgtgccgcc 7080
cggatacgaa cgcatcagtg tggttttgcc gtcacataag gcgtttatcg taaagaaatg 7140
gggcgacgac acccgaaaaa agctgcgtgg aaggctctga cgccaagggt tagggcttgc 7200
acttccttct ttagccgcta aaacggcccc ttctctgcgg gccgtcggct cgcgcatcat 7260
atcgacatcc tcaacggaag ccgtgccgcg aatggcatcg ggcgggtgcg ctttgacagt 7320
tgttttctat cagaacccct acgtcgtgcg gttcgattag ctgtttgtct tgcaggctaa 7380
acactttcgg tatatcgttt gcctgtgcga taatgttgct aatgatttgt tgcgtagggg 7440
ttactgaaaa gtgagcggga aagaagagtt tcagaccatc aaggagcggg ccaagcgcaa 7500
gctggaacgc gacatgggtg cggacctgtt ggccgcgctc aacgacccga aaaccgttga 7560
agtcatgctc aacgcggacg gcaaggtgtg gcacgaacgc cttggcgagc cgatgcggta 7620
catctgcgac atgcggccca gccagtcgca ggcgattata gaaacggtgg ccggattcca 7680
cggcaaagag gtcacgcggc attcgcccat cctggaaggc gagttcccct tggatggcag 7740
ccgctttgcc ggccaattgc cgccggtcgt ggccgcgcca acctttgcga tccgcaagcg 7800
cgcggtcgcc atcttcacgc tggaacagta cgtcgaggcg ggcatcatga cccgcgagca 7860
atacgaggtc attaaaagcg ccgtcgcggc gcatcgaaac atcctcgtca ttggcggtac 7920
tggctcgggc aagaccacgc tcgtcaacgc gatcatcaat gaaatggtcg ccttcaaccc 7980
gtctgagcgc gtcgtcatca tcgaggacac cggcgaaatc cagtgcgccg cagagaacgc 8040
cgtccaatac cacaccagca tcgacgtctc gatgacgctg ctgctcaaga caacgctgcg 8100
tatgcgcccc gaccgcatcc tggtcggtga ggtacgtggc cccgaagccc ttgatctgtt 8160
gatggcctgg aacaccgggc atgaaggagg tgccgccacc ctgcacgcaa acaaccccaa 8220
agcgggcctg agccggctcg ccatgcttat cagcatgcac ccggattcac cgaaacccat 8280
tgagccgctg attggcgagg cggttcatgt ggtcgtccat atcgccagga cccctagcgg 8340
ccgtcgagtg caagaaattc tcgaagttct tggttacgag aacggccagt acatcaccaa 8400
aaccctgtaa ggagtatttc caatgacaac ggctgttccg ttccgtctga ccatgaatcg 8460
cggcattttg ttctaccttg ccgtgttctt cgttctcgct ctcgcgttat ccgcgcatcc 8520
ggcgatggcc tcggaaggca ccggcggcag cttgccatat gagagctggc tgacgaacct 8580
gcgcaactcc gtaaccggcc cggtggcctt cgcgctgtcc atcatcggca tcgtcgtcgc 8640
cggcggcgtg ctgatcttcg gcggcgaact caacgccttc ttccgaaccc tgatcttcct 8700
ggttctggtg atggcgctgc tggtcggcgc gcagaacgtg atgagcacct tcttcggtcg 8760
tggtgccgaa atcgcggccc tcggcaacgg ggcgctgcac caggtgcaag tcgcggcggc 8820
ggatgccgtg cgtgcggtag cggctggacg gctcgcctaa tcatggctct gcgcacgatc 8880
cccatccgtc gcgcaggcaa ccgagaaaac ctgttcatgg gtggtgatcg tgaactggtg 8940
atgttctcgg gcctgatggc gtttgcgctg attttcagcg cccaagagct gcgggccacc 9000
gtggtcggtc tgatcctgtg gttcggggcg ctctatgcgt tccgaatcat ggcgaaggcc 9060
gatccgaaga tgcggttcgt gtacctgcgt caccgccggt acaagccgta ttacccggcc 9120
cgctcgaccc cgttccgcga gaacaccaat agccaaggga agcaataccg atgatccaag 9180
caattgcgat tgcaatcgcg ggcctcggcg cgcttctgtt gttcatcctc tttgcccgca 9240
tccgcgcggt cgatgccgaa ctgaaactga aaaagcatcg ttccaaggac gccggcctgg 9300
ccgatctgct caactacgcc gctgtcgtcg atgacggcgt aatcgtgggc aagaacggca 9360
gctttatggc tgcctggctg tacaagggcg atgacaacgc aagcagcacc gaccagcagc 9420
gcgaagtagt gtccgcccgc atcaaccagg ccctcgcggg cctgggaagt gggtggatga 9480
tccatgtgga cgccgtgcgg cgtcctgctc cgaactacgc ggagcggggc ctgtcggcgt 9540
tccctgaccg tctgacggca gcgattgaag aagagcgctc ggtcttgcct tgctcgtcgg 9600
tgatgtactt caccagctcc gcgaagtcgc tcttcttgat ggagcgcatg gggacgtgct 9660
tggcaatcac gcgcaccccc cggccgtttt agcggctaaa aaagtcatgg ctctgccctc 9720
gggcggacca cgcccatcat gaccttgcca agctcgtcct gcttctcttc gatcttcgcc 9780
agcagggcga ggatcgtggc atcaccgaac cgcgccgtgc gcgggtcgtc ggtgagccag 9840
agtttcagca ggccgcccag gcggcccagg tcgccattga tgcgggccag ctcgcggacg 9900
tgctcatagt ccacgacgcc cgtgattttg tagccctggc cgacggccag caggtaggcc 9960
gacaggctca tgccggccgc cgccgccttt tcctcaatcg ctcttcgttc gtctggaagg 10020
cagtacacct tgataggtgg gctgcccttc ctggttggct tggtttcatc agccatccgc 10080
ttgccctcat ctgttacgcc ggcggtagcc ggccagcctc gcagagcagg attcccgttg 10140
agcaccgcca ggtgcgaata agggacagtg aagaaggaac acccgctcgc gggtgggcct 10200
acttcaccta tcctgcccgg ctgacgccgt tggatacacc aaggaaagtc tacacgaacc 10260
ctttggcaaa atcctgtata tcgtgcgaaa aaggatggat ataccgaaaa aatcgctata 10320
atgaccccga agcagggtta tgcagcggaa aagcgctgct tccctgctgt tttgtggaat 10380
atctaccgac tggaaacagg caaatgcagg aaattactga actgagggga caggcgagag 10440
acgatgccaa agagctacac cgacgagctg gccgagtggg ttgaatcccg cgcggccaag 10500
aagcgccggc gtgatgaggc tgcggttgcg ttcctggcgg tgagggcgga tgtcgaggcg 10560
gcgttagcgt ccggctatgc gctcgtcacc atttgggagc acatgcggga aacggggaag 10620
gtcaagttct cctacgagac gttccgctcg cacgccaggc ggcacatcaa ggccaagccc 10680
gccgatgtgc ccgcaccgca ggccaaggct gcggaacccg cgccggcacc caagacgccg 10740
gagccacggc ggccgaagca ggggggcaag gctgaaaagc cggcccccgc tgcggccccg 10800
accggcttca ccttcaaccc aacaccggac aaaaaggatc tactgtaatg gcgaaaattc 10860
acatggtttt gcagggcaag ggcggggtcg gcaagtcggc catcgccgcg atcattgcgc 10920
agtacaagat ggacaagggg cagacaccct tgtgcatcga caccgacccg gtgaacgcga 10980
cgttcgaggg ctacaaggcc ctgaacgtcc gccggctgaa catcatggcc ggcgacgaaa 11040
ttaactcgcg caacttcgac accctggtcg agctgattgc gccgaccaag gatgacgtgg 11100
tgatcgacaa cggtgccagc tcgttcgtgc ctctgtcgca ttacctcatc agcaaccagg 11160
tgccggctct gctgcaagaa atggggcatg agctggtcat ccataccgtc gtcaccggcg 11220
gccaggctct cctggacacg gtgagcggct tcgcccagct cgccagccag ttcccggccg 11280
aagcgctttt cgtggtctgg ctgaacccgt attgggggcc tatcgagcat gagggcaaga 11340
gctttgagca gatgaaggcg tacacggcca acaaggcccg cgtgtcgtcc atcatccaga 11400
ttccggccct caaggaagaa acctacggcc gcgatttcag cgacatgctg caagagcggc 11460
tgacgttcga ccaggcgctg gccgatgaat cgctcacgat catgacgcgg caacgcctca 11520
agatcgtgcg gcgcggcctg tttgaacagc tcgacgcggc ggccgtgcta tgagcgacca 11580
gattgaagag ctgatccggg agattgcggc caagcacggc atcgccgtcg gccgcgacga 11640
cccggtgctg atcctgcata ccatcaacgc ccggctcatg gccgacagtg cggccaagca 11700
agaggaaatc cttgccgcgt tcaaggaaga gctggaaggg atcgcccatc gttggggcga 11760
ggacgccaag gccaaagcgg agcggatgct gaacgcggcc ctggcggcca gcaaggacgc 11820
aatggcgaag gtaatgaagg acagcgccgc gcaggcggcc gaagcgatcc gcagggaaat 11880
cgacgacggc cttggccgcc agctcgcggc caaggtcgcg gacgcgcggc gcgtggcgat 11940
gatgaacatg atcgccggcg gcatggtgtt gttcgcggcc gccctggtgg tgtgggcctc 12000
gttatgaatc gcagaggcgc agatgaaaaa gcccggcgtt gccgggcttt gtttttgcgt 12060
tagctgggct tgtttgacag gcccaagctc tgactgcgcc cgcgctcgcg ctcctgggcc 12120
tgtttcttct cctgctcctg cttgcgcatc agggcctggt gccgtcgggc tgcttcacgc 12180
atcgaatccc agtcgccggc cagctcggga tgctccgcgc gcatcttgcg cgtcgccagt 12240
tcctcgatct tgggcgcgtg aatgcccatg ccttccttga tttcgcgcac catgtccagc 12300
cgcgtgtgca gggtctgcaa gcgggcttgc tgttgggcct gctgctgctg ccaggcggcc 12360
tttgtacgcg gcagggacag caagccgggg gcattggact gtagctgctg caaacgcgcc 12420
tgctgacggt ctacgagctg ttctaggcgg tcctcgatgc gctccacctg gtcatgcttt 12480
gcctgcacgt agagcgcaag ggtctgctgg taggtctgct cgatgggcgc ggattctaag 12540
agggcctgct gttccgtctc ggcctcctgg gccgcctgta gcaaatcctc gccgctgttg 12600
ccgctggact gctttactgc cggggactgc tgttgccctg ctcgcgccgt cgtcgcagtt 12660
cggcttgccc ccactcgatt gactgcttca tttcgagccg cagcgatgcg atctcggatt 12720
gcgtcaacgg acggggcagc gcggaggtgt ccggcttctc cttgggtgag tcggtcgatg 12780
ccatagccaa aggtttcctt ccaaaatgcg tccattgctg gaccgtgttt ctcattgatg 12840
cccgcaagca tcttcggctt gaccgccagg tcaagcgcgc cttcatgggc ggtcatgacg 12900
gacgccgcca tgaccttgcc gccgttgttc tcgatgtagc cgcgtaatga ggcaatggtg 12960
ccgcccatcg tcagcgtgtc atcgacaacg atgtacttct ggccggggat cacctccccc 13020
tcgaaagtcg ggttgaacgc caggcgatga tctgaaccgg ctccggttcg ggcgaccttc 13080
tcccgctgca caatgtccgt ttcgacctca aggccaaggc ggtcggccag aacgaccgcc 13140
atcatggccg gaatcttgtt gttccccgcc gcctcgacgg cgaggactgg aacgatgcgg 13200
ggcttgtcgt cgccgatcag cgtcttgagc tgggcaacag tgtcgtccga aatcaggcgc 13260
tcgaccaaat taagcgccgc ttccgcgtcg ccctgcttcg cagcctggta ttcaggctcg 13320
ttggtcaaag aaccaaggtc gccgttgcga accaccttcg ggaagtctcc ccacggtgcg 13380
cgctcggctc tgctgtagct gctcaagacg cctccctttt tagccgctaa aactctaacg 13440
agtgcgcccg cgactcaact tgacgctttc ggcacttacc tgtgccttgc cacttgcgtc 13500
ataggtgatg cttttcgcac tcccgatttc aggtacttta tcgaaatctg accgggcgtg 13560
cattacaaag ttcttcccca cctgttggta aatgctgccg ctatctgcgt ggacgatgct 13620
gccgtcgtgg cgctgcgact tatcggcctt ttgggccata tagatgttgt aaatgccagg 13680
tttcagggcc ccggctttat ctaccttctg gttcgtccat gcgccttggt tctcggtctg 13740
gacaattctt tgcccattca tgaccaggag gcggtgtttc attgggtgac tcctgacggt 13800
tgcctctggt gttaaacgtg tcctggtcgc ttgccggcta aaaaaaagcc gacctcggca 13860
gttcgaggcc ggctttccct agagccgggc gcgtcaaggt tgttccatct attttagtga 13920
actgcgttcg atttatcagt tactttcctc ccgctttgtg tttcctccca ctcgtttccg 13980
cgtctagccg acccctcaac atagcggcct cttcttgggc tgcctttgcc tcttgccgcg 14040
cttcgtcacg ctcggcttgc accgtcgtaa agcgctcggc ctgcctggcc gcctcttgcg 14100
ccgccaactt cctttgctcc tggtgggcct cggcgtcggc ctgcgccttc gctttcaccg 14160
ctgccaactc cgtgcgcaaa ctctccgctt cgcgcctggt ggcgtcgcgc tcgccgcgaa 14220
gcgcctgcat ttcctggttg gccgcgtcca gggtcttgcg gctctcttct ttgaatgcgc 14280
gggcgtcctg gtgagcgtag tccagctcgg cgcgcagctc ctgcgctcga cgctccacct 14340
cgtcggcccg ctgcgtcgcc agcgcggccc gctgctcggc tcctgccagg gcggtgcgtg 14400
cttcggccag ggcttgccgc tggcgtgcgg ccagctcggc cgcctcggcg gcctgctgct 14460
ctagcaatgt aacgcgcgcc tgggcttctt ccagctcgcg ggcctgcgcc tcgaaggcgt 14520
cggccagctc cccgcgcacg gcttccaact cgttgcgctc acgatcccag ccggcttgcg 14580
ctgcctgcaa cgattcattg gcaagggcct gggcggcttg ccagagggcg gccacggcct 14640
ggttgccggc ctgctgcacc gcgtccggca cctggactgc cagcggggcg gcctgcgccg 14700
tgcgctggcg tcgccattcg cgcatgccgg cgctggcgtc gttcatgttg acgcgggcgg 14760
ccttacgcac tgcatccacg gtcgggaagt tctcccggtc gccttgctcg aacagctcgt 14820
ccgcagccgc aaaaatgcgg tcgcgcgtct ctttgttcag ttccatgttg gctccggtaa 14880
ttggtaagaa taataatact cttacctacc ttatcagcgc aagagtttag ctgaacagtt 14940
ctcgacttaa cggcaggttt tttagcggct gaagggcagg caaaaaaagc cccgcacggt 15000
cggcgggggc aaagggtcag cgggaagggg attagcgggc gtcgggcttc ttcatgcgtc 15060
ggggccgcgc ttcttgggat ggagcacgac gaagcgcgca cgcgcatcgt cctcggccct 15120
atcggcccgc gtcgcggtca ggaacttgtc gcgcgctagg tcctccctgg tgggcaccag 15180
gggcatgaac tcggcctgct cgatgtaggt ccactccatg accgcatcgc agtcgaggcc 15240
gcgttccttc accgtctctt gcaggtcgcg gtacgcccgc tcgttgagcg gctggtaacg 15300
ggccaattgg tcgtaaatgg ctgtcggcca tgagcggcct ttcctgttga gccagcagcc 15360
gacgacgaag ccggcaatgc aggcccctgg cacaaccagg ccgacgccgg gggcagggga 15420
tggcagcagc tcgccaacca ggaaccccgc cgcgatgatg ccgatgccgg tcaaccagcc 15480
cttgaaacta tccggccccg aaacacccct gcgcattgcc tggatgctgc gccggatagc 15540
ttgcaacatc aggagccgtt tcttttgttc gtcagtcatg gtccgccctc accagttgtt 15600
cgtatcggtg tcggacgaac tgaaatcgca agagctgccg gtatcggtcc agccgctgtc 15660
cgtgtcgctg ctgccgaagc acggcgaggg gtccgcgaac gccgcagacg gcgtatccgg 15720
ccgcagcgca tcgcccagca tggccccggt cagcgagccg ccggccaggt agcccagcat 15780
ggtgctgttg gtcgccccgg ccaccagggc cgacgtgacg aaatcgccgt cattccctct 15840
ggattgttcg ctgctcggcg gggcagtgcg ccgcgccggc ggcgtcgtgg atggctcggg 15900
ttggctggcc tgcgacggcc ggcgaaaggt gcgcagcagc tcgttatcga ccggctgcgg 15960
cgtcggggcc gccgccttgc gctgcggtcg gtgttccttc ttcggctcgc gcagcttgaa 16020
cagcatgatc gcggaaacca gcagcaacgc cgcgcctacg cctcccgcga tgtagaacag 16080
catcggattc attcttcggt cctccttgta gcggaaccgt tgtctgtgcg gcgcgggtgg 16140
cccgcgccgc tgtctttggg gatcagccct cgatgagcgc gaccagtttc acgtcggcaa 16200
ggttcgcctc gaactcctgg ccgtcgtcct cgtacttcaa ccaggcatag ccttccgccg 16260
gcggccgacg gttgaggata aggcgggcag ggcgctcgtc gtgctcgacc tggacgatgg 16320
cctttttcag cttgtccggg tccggctcct tcgcgccctt ttccttggcg tccttaccgt 16380
cctggtcgcc gtcctcgccg tcctggccgt cgccggcctc cgcgtcacgc tcggcatcag 16440
tctggccgtt gaaggcatcg acggtgttgg gatcgcggcc cttctcgtcc aggaactcgc 16500
gcagcagctt gaccgtgccg cgcgtgattt cctgggtgtc gtcgtcaagc cacgcctcga 16560
cttcctccgg gcgcttcttg aaggccgtca ccagctcgtt caccacggtc acgtcgcgca 16620
cgcggccggt gttgaacgca tcggcgatct tctccggcag gtccagcagc gtgacgtgct 16680
gggtgatgaa cgccggcgac ttgccgattt ccttggcgat atcgcctttc ttcttgccct 16740
tcgccagctc gcggccaatg aagtcggcaa tttcgcgcgg ggtcagctcg ttgcgttgca 16800
ggttctcgat aacctggtcg gcttcgttgt agtcgttgtc gatgaacgcc gggatggact 16860
tcttgccggc ccacttcgag ccacggtagc ggcgggcgcc gtgattgatg atatagcggc 16920
ccggctgctc ctggttctcg cgcaccgaaa tgggtgactt caccccgcgc tctttgatcg 16980
tggcaccgat ttccgcgatg ctctccgggg aaaagccggg gttgtcggcc gtccgcggct 17040
gatgcggatc ttcgtcgatc aggtccaggt ccagctcgat agggccggaa ccgccctgag 17100
acgccgcagg agcgtccagg aggctcgaca ggtcgccgat gctatccaac cccaggccgg 17160
acggctgcgc cgcgcctgcg gcttcctgag cggccgcagc ggtgtttttc ttggtggtct 17220
tggcttgagc cgcagtcatt gggaaatctc catcttcgtg aacacgtaat cagccagggc 17280
gcgaacctct ttcgatgcct tgcgcgcggc cgttttcttg atcttccaga ccggcacacc 17340
ggatgcgagg gcatcggcga tgctgctgcg caggccaacg gtggccggaa tcatcatctt 17400
ggggtacgcg gccagcagct cggcttggtg gcgcgcgtgg cgcggattcc gcgcatcgac 17460
cttgctgggc accatgccaa ggaattgcag cttggcgttc ttctggcgca cgttcgcaat 17520
ggtcgtgacc atcttcttga tgccctggat gctgtacgcc tcaagctcga tgggggacag 17580
cacatagtcg gccgcgaaga gggcggccgc caggccgacg ccaagggtcg gggccgtgtc 17640
gatcaggcac acgtcgaagc cttggttcgc cagggccttg atgttcgccc cgaacagctc 17700
gcgggcgtcg tccagcgaca gccgttcggc gttcgccagt accgggttgg actcgatgag 17760
ggcgaggcgc gcggcctggc cgtcgccggc tgcgggtgcg gtttcggtcc agccgccggc 17820
agggacagcg ccgaacagct tgcttgcatg caggccggta gcaaagtcct tgagcgtgta 17880
ggacgcattg ccctgggggt ccaggtcgat cacggcaacc cgcaagccgc gctcgaaaaa 17940
gtcgaaggca agatgcacaa gggtcgaagt cttgccgacg ccgcctttct ggttggccgt 18000
gaccaaagtt ttcatcgttt ggtttcctgt tttttcttgg cgtccgcttc ccacttccgg 18060
acgatgtacg cctgatgttc cggcagaacc gccgttaccc gcgcgtaccc ctcgggcaag 18120
ttcttgtcct cgaacgcggc ccacacgcga tgcaccgctt gcgacactgc gcccctggtc 18180
agtcccagcg acgttgcgaa cgtcgcctgt ggcttcccat cgactaagac gccccgcgct 18240
atctcgatgg tctgctgccc cacttccagc ccctggatcg cctcctggaa ctggctttcg 18300
gtaagccgtt tcttcatgga taacacccat aatttgctcc gcgccttggt tgaacatagc 18360
ggtgacagcc gccagcacat gagagaagtt tagctaaaca tttctcgcac gtcaacacct 18420
ttagccgcta aaactcgtcc ttggcgtaac aaaacaaaag cccggaaacc gggctttcgt 18480
ctcttgccgc ttatggctct gcacccggct ccatcaccaa caggtcgcgc acgcgcttca 18540
ctcggttgcg gatcgacact gccagcccaa caaagccggt tgccgccgcc gccaggatcg 18600
cgccgatgat gccggccaca ccggccatcg cccaccaggt cgccgccttc cggttccatt 18660
cctgctggta ctgcttcgca atgctggacc tcggctcacc ataggctgac cgctcgatgg 18720
cgtatgccgc ttctcccctt ggcgtaaaac ccagcgccgc aggcggcatt gccatgctgc 18780
ccgccgcttt cccgaccacg acgcgcgcac caggcttgcg gtccagacct tcggccacgg 18840
cgagctgcgc aaggacataa tcagccgccg acttggctcc acgcgcctcg atcagctctt 18900
gcactcgcgc gaaatccttg gcctccacgg ccgccatgaa tcgcgcacgc ggcgaaggct 18960
ccgcagggcc ggcgtcgtga tcgccgccga gaatgccctt caccaagttc gacgacacga 19020
aaatcatgct gacggctatc accatcatgc agacggatcg cacgaacccg ctgaattgaa 19080
cacgagcacg gcacccgcga ccactatgcc aagaatgccc aaggtaaaaa ttgccggccc 19140
cgccatgaag tccgtgaatg ccccgacggc cgaagtgaag ggcaggccgc cacccaggcc 19200
gccgccctca ctgcccggca cctggtcgct gaatgtcgat gccagcacct gcggcacgtc 19260
aatgcttccg ggcgtcgcgc tcgggctgat cgcccatccc gttactgccc cgatcccggc 19320
aatggcaagg actgccagcg ctgccatttt tggggtgagg ccgttcgcgg ccgaggggcg 19380
cagcccctgg ggggatggga ggcccgcgtt agcgggccgg gagggttcga gaaggggggg 19440
cacccccctt cggcgtgcgc ggtcacgcgc acagggcgca gccctggtta aaaacaaggt 19500
ttataaatat tggtttaaaa gcaggttaaa agacaggtta gcggtggccg aaaaacgggc 19560
ggaaaccctt gcaaatgctg gattttctgc ctgtggacag cccctcaaat gtcaataggt 19620
gcgcccctca tctgtcagca ctctgcccct caagtgtcaa ggatcgcgcc cctcatctgt 19680
cagtagtcgc gcccctcaag tgtcaatacc gcagggcact tatccccagg cttgtccaca 19740
tcatctgtgg gaaactcgcg taaaatcagg cgttttcgcc gatttgcgag gctggccagc 19800
tccacgtcgc cggccgaaat cgagcctgcc cctcatctgt caacgccgcg ccgggtgagt 19860
cggcccctca agtgtcaacg tccgcccctc atctgtcagt gagggccaag ttttccgcga 19920
ggtatccaca acgccggcgg ccgcggtgtc tcgcacacgg cttcgacggc gtttctggcg 19980
cgtttgcagg gccatagacg gccgccagcc cagcggcgag ggcaaccagc ccggtgagcg 20040
tcggaaaggc gctggaagcc ccgtagcgac gcggagaggg gcgagacaag ccaagggcgc 20100
aggctcgatg cgcagcacga catagccggt tctcgcaagg acgagaattt ccctgcggtg 20160
cccctcaagt gtcaatgaaa gtttccaacg cgagccattc gcgagagcct tgagtccacg 20220
ctagatgaga gctttgttgt aggtggacca gttggtgatt ttgaactttt gctttgccac 20280
ggaacggtct gcgttgtcgg gaagatgcgt gatctgatcc ttcaactcag caaaagttcg 20340
atttattcaa caaagccacg ttgtgtctca aaatctctga tgttacattg cacaagataa 20400
aaatatatca tcatgaacaa taaaactgtc tgcttacata aacagtaata caaggggtgt 20460
tatgagccat attcaacggg aaacgtcttg ctcgactcta gagctcgttc ctcgaggcct 20520
cgaggcctcg aggaacggta cctgcgggga agcttacaat aatgtgtgtt gttaagtctt 20580
gttgcctgtc atcgtctgac tgactttcgt cataaatccc ggcctccgta acccagcttt 20640
gggcaagctc acggatttga tccggcggaa cgggaatatc gagatgccgg gctgaacgct 20700
gcagttccag ctttcccttt cgggacaggt actccagctg attgattatc tgctgaaggg 20760
tcttggttcc acctcctggc acaatgcgaa tgattacttg agcgcgatcg ggcatccaat 20820
tttctcccgt caggtgcgtg gtcaagtgct acaaggcacc tttcagtaac gagcgaccgt 20880
cgatccgtcg ccgggatacg gacaaaatgg agcgcagtag tccatcgagg gcggcgaaag 20940
cctcgccaaa agcaatacgt tcatctcgca cagcctccag atccgatcga gggtcttcgg 21000
cgtaggcaga tagaagcatg gatacattgc ttgagagtat tccgatggac tgaagtatgg 21060
cttccatctt ttctcgtgtg tctgcatcta tttcgagaaa gcccccgatg cggcgcaccg 21120
caacgcgaat tgccatacta tccgaaagtc ccagcaggcg cgcttgatag gaaaaggttt 21180
catactcggc cgatcgcaga cgggcactca cgaccttgaa cccttcaact ttcagggatc 21240
gatgctggtt gatggtagtc tcactcgacg tggctctggt gtgttttgac atagcttcct 21300
ccaaagaaag cggaaggtct ggatactcca gcacgaaatg tgcccgggta gacggatgga 21360
agtctagccc tgctcaatat gaaatcaaca gtacatttac agtcaatact gaatatactt 21420
gctacatttg caattgtctt ataacgaatg tgaaataaaa atagtgtaac aacgctttta 21480
ctcatcgata atcacaaaaa catttatacg aacaaaaata caaatgcact ccggtttcac 21540
aggataggcg ggatcagaat atgcaacttt tgacgttttg ttctttcaaa gggggtgctg 21600
gcaaaaccac cgcactcatg ggcctttgcg ctgctttggc aaatgacggt aaacgagtgg 21660
ccctctttga tgccgacgaa aaccggcctc tgacgcgatg gagagaaaac gccttacaaa 21720
gcagtactgg gatcctcgct gtgaagtcta ttccgccgac gaaatgcccc ttcttgaagc 21780
agcctatgaa aatgccgagc tcgaaggatt tgattatgcg ttggccgata cgcgtggcgg 21840
ctcgagcgag ctcaacaaca caatcatcgc tagctcaaac ctgcttctga tccccaccat 21900
gctaacgccg ctcgacatcg atgaggcact atctacctac cgctacgtca tcgagctgct 21960
gttgagtgaa aatttggcaa ttcctacagc tgttttgcgc caacgcgtcc cggtcggccg 22020
attgacaaca tcgcaacgca ggatgtcaga gacgctagag agccttccag ttgtaccgtc 22080
tcccatgcat gaaagagatg catttgccgc gatgaaagaa cgcggcatgt tgcatcttac 22140
attactaaac acgggaactg atccgacgat gcgcctcata gagaggaatc ttcggattgc 22200
gatggaggaa gtcgtggtca tttcgaaact gatcagcaaa atcttggagg cttgaagatg 22260
gcaattcgca agcccgcatt gtcggtcggc gaagcacggc ggcttgctgg tgctcgaccc 22320
gagatccacc atcccaaccc gacacttgtt ccccagaagc tggacctcca gcacttgcct 22380
gaaaaagccg acgagaaaga ccagcaacgt gagcctctcg tcgccgatca catttacagt 22440
cccgatcgac aacttaagct aactgtggat gcccttagtc cacctccgtc cccgaaaaag 22500
ctccaggttt ttctttcagc gcgaccgccc gcgcctcaag tgtcgaaaac atatgacaac 22560
ctcgttcggc aatacagtcc ctcgaagtcg ctacaaatga ttttaaggcg cgcgttggac 22620
gatttcgaaa gcatgctggc agatggatca tttcgcgtgg ccccgaaaag ttatccgatc 22680
ccttcaacta cagaaaaatc cgttctcgtt cagacctcac gcatgttccc ggttgcgttg 22740
ctcgaggtcg ctcgaagtca ttttgatccg ttggggttgg agaccgctcg agctttcggc 22800
cacaagctgg ctaccgccgc gctcgcgtca ttctttgctg gagagaagcc atcgagcaat 22860
tggtgaagag ggacctatcg gaacccctca ccaaatattg agtgtaggtt tgaggccgct 22920
ggccgcgtcc tcagtcacct tttgagccag ataattaaga gccaaatgca attggctcag 22980
gctgccatcg tccccccgtg cgaaacctgc acgtccgcgt caaagaaata accggcacct 23040
cttgctgttt ttatcagttg agggcttgac ggatccgcct caagtttgcg gcgcagccgc 23100
aaaatgagaa catctatact cctgtcgtaa acctcctcgt cgcgtactcg actggcaatg 23160
agaagttgct cgcgcgatag aacgtcgcgg ggtttctcta aaaacgcgag gagaagattg 23220
aactcacctg ccgtaagttt cacctcaccg ccagcttcgg acatcaagcg acgttgcctg 23280
agattaagtg tccagtcagt aaaacaaaaa gaccgtcggt ctttggagcg gacaacgttg 23340
gggcgcacgc gcaaggcaac ccgaatgcgt gcaagaaact ctctcgtact aaacggctta 23400
gcgataaaat cacttgctcc tagctcgagt gcaacaactt tatccgtctc ctcaaggcgg 23460
tcgccactga taattatgat tggaatatca gactttgccg ccagatttcg aacgatctca 23520
agcccatctt cacgacctaa atttagatca acaaccacga catcgaccgt cgcggaagag 23580
agtactctag tgaactgggt gctgtcggct accgcggtca ctttgaaggc gtggatcgta 23640
aggtattcga taataagatg ccgcatagcg acatcgtcat cgataagaag aacgtgtttc 23700
aacggctcac ctttcaatct aaaatctgaa cccttgttca cagcgcttga gaaattttca 23760
cgtgaaggat gtacaatcat ctccagctaa atgggcagtt cgtcagaatt gcggctgacc 23820
gcggatgacg aaaatgcgaa ccaagtattt caattttatg acaaaagttc tcaatcgttg 23880
ttacaagtga aacgcttcga ggttacagct actattgatt aaggagatcg cctatggtct 23940
cgccccggcg tcgtgcgtcc gccgcgagcc agatctcgcc tacttcataa acgtcctcat 24000
aggcacggaa tggaatgatg acatcgatcg ccgtagagag catgtcaatc agtgtgcgat 24060
cttccaagct agcaccttgg gcgctacttt tgacaaggga aaacagtttc ttgaatcctt 24120
ggattggatt cgcgccgtgt attgttgaaa tcgatcccgg atgtcccgag acgacttcac 24180
tcagataagc ccatgctgca tcgtcgcgca tctcgccaag caatatccgg tccggccgca 24240
tacgcagact tgcttggagc aagtgctcgg cgctcacagc acccagccca gcaccgttct 24300
tggagtagag tagtctaaca tgattatcgt gtggaatgac gagttcgagc gtatcttcta 24360
tggtgattag cctttcctgg ggggggatgg cgctgatcaa ggtcttgctc attgttgtct 24420
tgccgcttcc ggtagggcca catagcaaca tcgtcagtcg gctgacgacg catgcgtgca 24480
gaaacgcttc caaatccccg ttgtcaaaat gctgaaggat agcttcatca tcctgatttt 24540
ggcgtttcct tcgtgtctgc cactggttcc acctcgaagc atcataacgg gaggagactt 24600
ctttaagacc agaaacacgc gagcttggcc gtcgaatggt caagctgacg gtgcccgagg 24660
gaacggtcgg cggcagacag atttgtagtc gttcaccacc aggaagttca gtggcgcaga 24720
gggggttacg tggtccgaca tcctgctttc tcagcgcgcc cgctaaaata gcgatatctt 24780
caagatcatc ataagagacg ggcaaaggca tcttggtaaa aatgccggct tggcgcacaa 24840
atgcctctcc aggtcgattg atcgcaattt cttcagtctt cgggtcatcg agccattcca 24900
aaatcggctt cagaagaaag cgtagttgcg gatccacttc catttacaat gtatcctatc 24960
tctaagcgga aatttgaatt cattaagagc ggcggttcct cccccgcgtg gcgccgccag 25020
tcaggcggag ctggtaaaca ccaaagaaat cgaggtcccg tgctacgaaa atggaaacgg 25080
tgtcaccctg attcttcttc agggttggcg gtatgttgat ggttgcctta agggctgtct 25140
cagttgtctg ctcaccgtta ttttgaaagc tgttgaagct catcccgcca cccgagctgc 25200
cggcgtaggt gctagctgcc tggaaggcgc cttgaacaac actcaagagc atagctccgc 25260
taaaacgctg ccagaagtgg ctgtcgaccg agcccggcaa tcctgagcga ccgagttcgt 25320
ccgcgcttgg cgatgttaac gagatcatcg catggtcagg tgtctcggcg cgatcccaca 25380
acacaaaaac gcgcccatct ccctgttgca agccacgctg tatttcgcca acaacggtgg 25440
tgccacgatc aagaagcacg atattgttcg ttgttccacg aatatcctga ggcaagacac 25500
actttacata gcctgccaaa tttgtgtcga ttgcggtttg caagatgcac ggaattattg 25560
tcccttgcgt taccataaaa tcggggtgcg gcaagagcgt ggcgctgctg ggctgcagct 25620
cggtgggttt catacgtatc gacaaatcgt tctcgccgga cacttcgcca ttcggcaagg 25680
agttgtcgtc acgcttgcct tcttgtcttc ggcccgtgtc gccctgaatg gcgcgtttgc 25740
tgaccccttg atcgccgctg ctatatgcaa aaatcggtgt ttcttccggc cgtggctcat 25800
gccgctccgg ttcgcccctc ggcggtagag gagcagcagg ctgaacagcc tcttgaaccg 25860
ctggaggatc cggcggcacc tcaatcggag ctggatgaaa tggcttggtg tttgttgcga 25920
tcaaagttga cggcgatgcg ttctcattca ccttcttttg gcgcccacct agccaaatga 25980
ggcttaatga taacgcgaga acgacacctc cgacgatcaa tttctgagac cccgaaagac 26040
gccggcgatg tttgtcggag accagggatc cagatgcatc aacctcatgt gccgcttgct 26100
gactatcgtt attcatccct tcgccccctt caggacgcgt ttcacatcgg gcctcaccgt 26160
gcccgtttgc ggcctttggc caacgggatc gtaagcggtg ttccagatac atagtactgt 26220
gtggccatcc ctcagacgcc aacctcggga aaccgaagaa atctcgacat cgctcccttt 26280
aactgaatag ttggcaacag cttccttgcc atcaggattg atggtgtaga tggagggtat 26340
gcgtacattg cccggaaagt ggaataccgt cgtaaatcca ttgtcgaaga cttcgagtgg 26400
caacagcgaa cgatcgcctt gggcgacgta gtgccaatta ctgtccgccg caccaagggc 26460
tgtgacaggc tgatccaata aattctcagc tttccgttga tattgtgctt ccgcgtgtag 26520
tctgtccaca acagccttct gttgtgcctc ccttcgccga gccgccgcat cgtcggcggg 26580
gtaggcgaat tggacgctgt aatagagatc gggctgctct ttatcgaggt gggacagagt 26640
cttggaactt atactgaaaa cataacggcg catcccggag tcgcttgcgg ttagcacgat 26700
tactggctga ggcgtgagga cctggcttgc cttgaaaaat agataatttc cccgcggtag 26760
ggctgctaga tctttgctat ttgaaacggc aaccgctgtc accgtttcgt tcgtggcgaa 26820
tgttacgacc aaagtagctc caaccgccgt cgagaggcgc accacttgat cgggattgta 26880
agccaaataa cgcatgcgcg gatctagctt gcccgccatt ggagtgtctt cagcctccgc 26940
accagtcgca gcggcaaata aacatgctaa aatgaaaagt gcttttctga tcatggttcg 27000
ctgtggccta cgtttgaaac ggtatcttcc gatgtctgat aggaggtgac aaccagacct 27060
gccgggttgg ttagtctcaa tctgccgggc aagctggtca ccttttcgta gcgaactgtc 27120
gcggtccacg tactcaccac aggcattttg ccgtcaacga cgagggtcct tttatagcga 27180
atttgctgcg tgcttggagt tacatcattt gaagcgatgt gctcgacctc caccctgccg 27240
cgtttgccaa gaatgacttg aggcgaactg ggattgggat agttgaagaa ttgctggtaa 27300
tcctggcgca ctgttggggc actgaagttc gataccaggt cgtaggcgta ctgagcggtg 27360
tcggcatcat aactctcgcg caggcgaacg tactcccaca atgaggcgtt aacgacggcc 27420
tcctcttgag ttgcaggcaa tcgcgagaca gacacctcgc tgtcaacggt gccgtccggc 27480
cgtatccata gatatacggg cacaagcctg ctcaacggca ccattgtggc tatagcgaac 27540
gcttgagcaa catttcccaa aatcgcgata gctgcgacag ctgcaatgag tttggagaga 27600
cgtcgcgccg atttcgctcg cgcggtttga aaggcttcta cttccttata gtgctcggca 27660
aggctttcgc gcgccactag catggcatat tcaggccccg tcatagcgtc cacccgaatt 27720
gccgagctga agatctgacg gagtaggctg ccatcgcccc acattcagcg ggaagatcgg 27780
gcctttgcag ctcgctaatg tgtcgtttgt ctggcagccg ctcaaagcga caactaggca 27840
cagcaggcaa tacttcatag aattctccat tgaggcgaat ttttgcgcga cctagcctcg 27900
ctcaacctga gcgaagcgac ggtacaagct gctggcagat tgggttgcgc cgctccagta 27960
actgcctcca atgttgccgg cgatcgccgg caaagcgaca atgagcgcat cccctgtcag 28020
aaaaaacata tcgagttcgt aaagaccaat gatcttggcc gcggtcgtac cggcgaaggt 28080
gattacacca agcataaggg tgagcgcagt cgcttcggtt aggatgacga tcgttgccac 28140
gaggtttaag aggagaagca agagaccgta ggtgataagt tgcccgatcc acttagctgc 28200
gatgtcccgc gtgcgatcaa aaatatatcc gacgaggatc agaggcccga tcgcgagaag 28260
cactttcgtg agaattccaa cggcgtcgta aactccgaag gcagaccaga gcgtgccgta 28320
aaggacccac tgtgcccctt ggaaagcaag gatgtcctgg tcgttcatcg gaccgatttc 28380
ggatgcgatt ttctgaaaaa cggcctgggt cacggcgaac attgtatcca actgtgccgg 28440
aacagtctgc agaggcaagc cggttacact aaactgctga acaaagtttg ggaccgtctt 28500
ttcgaagatg gaaaccacat agtcttggta gttagcctgc ccaacaatta gagcaacaac 28560
gatggtgacc gtgatcaccc gagtgatacc gctacgggta tcgacttcgc cgcgtatgac 28620
taaaataccc tgaacaataa tccaaagagt gacacaggcg atcaatggcg cactcaccgc 28680
ctcctggata gtctcaagca tcgagtccaa gcctgtcgtg aaggctacat cgaagatcgt 28740
atgaatggcc gtaaacggcg ccggaatcgt gaaattcatc gattggacct gaacttgact 28800
ggtttgtcgc ataatgttgg ataaaatgag ctcgcattcg gcgaggatgc gggcggatga 28860
acaaatcgcc cagccttagg ggagggcacc aaagatgaca gcggtctttt gatgctcctt 28920
gcgttgagcg gccgcctctt ccgcctcgtg aaggccggcc tgcgcggtag tcatcgttaa 28980
taggcttgtc gcctgtacat tttgaatcat tgcgtcatgg atctgcttga gaagcaaacc 29040
attggtcacg gttgcctgca tgatattgcg agatcgggaa agctgagcag acgtatcagc 29100
attcgccgtc aagcgtttgt ccatcgtttc cagattgtca gccgcaatgc cagcgctgtt 29160
tgcggaaccg gtgatctgcg atcgcaacag gtccgcttca gcatcactac ccacgactgc 29220
acgatctgta tcgctggtga tcgcacgtgc cgtggtcgac attggcattc gcggcgaaaa 29280
catttcattg tctaggtcct tcgtcgaagg atactgattt ttctggttga gcgaagtcag 29340
tagtccagta acgccgtagg ccgacgtcaa catcgtaacc atcgctatag tctgagtgag 29400
attctccgca gtcgcgagcg cagtcgcgag cgtctcagcc tccgttgccg ggtcgctaac 29460
aacaaactgc gcccgcgcgg gctgaatata tagaaagctg caggtcaaaa ctgttgcaat 29520
aagttgcgtc gtcttcatcg tttcctacct tatcaatctt ctgcctcgtg gtgacgggcc 29580
atgaattcgc tgagccagcc agatgagttg ccttcttgtg cctcgcgtag tcgagttgca 29640
aagcgcaccg tgttggcacg ccccgaaagc acggcgacat attcacgcat atcccgcaga 29700
tcaaattcgc agatgacgct tccactttct cgtttaagaa gaaacttacg gctgccgacc 29760
gtcatgtctt cacggatcgc ctgaaattcc ttttcggtac atttcagtcc atcgacataa 29820
gccgatcgat ctgcggttgg tgatggatag aaaatcttcg tcatacattg cgcaaccaag 29880
ctggctccta gcggcgattc cagaacatgc tctggttgct gcgttgccag tattagcatc 29940
ccgttgtttt ttcgaacggt caggaggaat ttgtcgacga cagtcgaaaa tttagggttt 30000
aacaaatagg cgcgaaactc atcgcagctc atcacaaaac ggcggccgtc gatcatggct 30060
ccaatccgat gcaggagata tgctgcagcg ggagcgcata cttcctcgta ttcgagaaga 30120
tgcgtcatgt cgaagccggt aatcgacgga tctaacttta cttcgtcaac ttcgccgtca 30180
aatgcccagc caagcgcatg gccccggcac cagcgttgga gccgcgctcc tgcgccttcg 30240
gcgggcccat gcaacaaaaa ttcacgtaac cccgcgattg aacgcatttg tggatcaaac 30300
gagagctgac gatggatacc acggaccaga cggcggttct cttccggaga aatcccaccc 30360
cgaccatcac tctcgatgag agccacgatc cattcgcgca gaaaatcgtg tgaggctgct 30420
gtgttttcta ggccacgcaa cggcgccaac ccgctgggtg tgcctctgtg aagtgccaaa 30480
tatgttcctc ctgtggcgcg aaccagcaat tcgccacccc ggtccttgtc aaagaacacg 30540
accgtacctg cacggtcgac catgctctgt tcgagcatgg ctagaacaaa catcatgagc 30600
gtcgtcttac ccctcccgat aggcccgaat attgccgtca tgccaacatc gtgctcatgc 30660
gggatatagt cgaaaggcgt tccgccattg gtacgaaatc gggcaatcgc gttgccccag 30720
tggcctgagc tggcgccctc tggaaagttt tcgaaagaga caaaccctgc gaaattgcgt 30780
gaagtgattg cgccagggcg tgtgcgccac ttaaaattcc ccggcaattg ggaccaatag 30840
gccgcttcca taccaatacc ttcttggaca accacggcac ctgcatccgc cattcgtgtc 30900
cgagcccgcg cgcccctgtc cccaagacta ttgagatcgt ctgcatagac gcaaaggctc 30960
aaatgatgtg agcccataac gaattcgttg ctcgcaagtg cgtcctcagc ctcggataat 31020
ttgccgattt gagtcacggc tttatcgccg gaactcagca tctggctcga tttgaggcta 31080
agtttcgcgt gcgcttgcgg gcgagtcagg aacgaaaaac tctgcgtgag aacaagtgga 31140
aaatcgaggg atagcagcgc gttgagcatg cccggccgtg tttttgcagg gtattcgcga 31200
aacgaataga tggatccaac gtaactgtct tttggcgttc tgatctcgag tcctcgcttg 31260
ccgcaaatga ctctgtcggt ataaatcgaa gcgccgagtg agccgctgac gaccggaacc 31320
ggtgtgaacc gaccagtcat gatcaaccgt agcgcttcgc caatttcggt gaagagcaca 31380
ccctgcttct cgcggatgcc aagacgatgc aggccatacg ctttaagaga gccagcgaca 31440
acatgccaaa gatcttccat gttcctgatc tggcccgtga gatcgttttc cctttttccg 31500
cttagcttgg tgaacctcct ctttaccttc cctaaagccg cctgtgggta gacaatcaac 31560
gtaaggaagt gttcattgcg gaggagttgg ccggagagca cgcgctgttc aaaagcttcg 31620
ttcaggctag cggcgaaaac actacggaag tgtcgcggcg ccgatgatgg cacgtcggca 31680
tgacgtacga ggtgagcata tattgacaca tgatcatcag cgatattgcg caacagcgtg 31740
ttgaacgcac gacaacgcgc attgcgcatt tcagtttcct caagctcgaa tgcaacgcca 31800
tcaattctcg caatggtcat gatcgatccg tcttcaagaa ggacgatatg gtcgctgagg 31860
tggccaatat aagggagata gatctcaccg gatctttcgg tcgttccact cgcgccgagc 31920
atcacaccat tcctctccct cgtgggggaa ccctaattgg atttgggcta acagtagcgc 31980
ccccccaaac tgcactatca atgcttcttc ccgcggtccg caaaaatagc aggacgacgc 32040
tcgccgcatt gtagtctcgc tccacgatga gccgggctgc aaaccataac ggcacgagaa 32100
cgacttcgta gagcgggttc tgaacgataa cgatgacaaa gccggcgaac atcatgaata 32160
accctgccaa tgtcagtggc accccaagaa acaatgcggg ccgtgtggct gcgaggtaaa 32220
gggtcgattc ttccaaacga tcagccatca actaccgcca gtgagcgttt ggccgaggaa 32280
gctcgcccca aacatgataa caatgccgcc gacgacgccg gcaaccagcc caagcgaagc 32340
ccgcccgaac atccaggaga tcccgatagc gacaatgccg agaacagcga gtgactggcc 32400
gaacggacca aggataaacg tgcatatatt gttaaccatt gtggcggggt cagtgccgcc 32460
acccgcagat tgcgctgcgg cgggtccgga tgaggaaatg ctccatgcaa ttgcaccgca 32520
caagcttggg gcgcagctcg atatcacgcg catcatcgca ttcgagagcg agaggcgatt 32580
tagatgtaaa cggtatctct caaagcatcg catcaatgcg cacctcctta gtataagtcg 32640
aataagactt gattgtcgtc tgcggatttg ccgttgtcct ggtgtggcgg tggcggagcg 32700
attaaaccgc cagcgccatc ctcctgcgag cggcgctgat atgaccccca aacatcccac 32760
gtctcttcgg attttagcgc ctcgtgatcg tcttttggag gctcgattaa cgcgggcacc 32820
agcgattgag cagctgtttc aacttttcgc acgtagccgt ttgcaaaacc gccgatgaaa 32880
ttaccggtgt tgtaagcgga gatcgcccga cgaagcgcaa attgcttctc gtcaatcgtt 32940
tcgccgcctg cataacgact tttcagcatg tttgcagcgg cagataatga tgtgcacgcc 33000
tggagcgcac cgtcaggtgt cagaccgagc atagaaaaat ttcgagagtt tatttgcatg 33060
aggccaacat ccagcgaatg ccgtgcatcg agacggtgcc tgacgacttg ggttgcttgg 33120
ctgtgatctt gccagtgaag cgtttcgccg gtcgtgttgt catgaatcgc taaaggatca 33180
aagcgactct ccaccttagc tatcgccgca agcgtagatg tcgcaactga tggggcacac 33240
ttgcgagcaa catggtcaaa ctcagcagat gagagtggcg tggcaaggct cgacgaacag 33300
aaggagacca tcaaggcaag agaaagcgac cccgatctct taagcatacc ttatctcctt 33360
agctcgcaac taacaccgcc tctcccgttg gaagaagtgc gttgttttat gttgaagatt 33420
atcgggaggg tcggttactc gaaaattttc aattgcttct ttatgatttc aattgaagcg 33480
agaaacctcg cccggcgtct tggaacgcaa catggaccga gaaccgcgca tccatgacta 33540
agcaaccgga tcgacctatt caggccgcag ttggtcaggt caggctcaga acgaaaatgc 33600
tcggcgaggt tacgctgtct gtaaacccat tcgatgaacg ggaagcttcc ttccgattgc 33660
tcttggcagg aatattggcc catgcctgct tgcgctttgc aaatgctctt atcgcgttgg 33720
tatcatatgc cttgtccgcc agcagaaacg cactctaagc gattatttgt aaaaatgttt 33780
cggtcatgcg gcggtcatgg gcttgacccg ctgtcagcgc aagacggatc ggtcaaccgt 33840
cggcatcgac aacagcgtga atcttggtgg tcaaaccgcc acgggaacgt cccatacagc 33900
catcgtcttg atcccgctgt ttcccgtcgc cgcatgttgg tggacgcgga cacaggaact 33960
gtcaatcatg acgacattct atcgaaagcc ttggaaatca cactcagaat atgatcccag 34020
acgtctgcct cacgccatcg tacaaagcga ttgtagcagg ttgtacagga accgtatcga 34080
tcaggaacgt ctgcccaggg cgggcccgtc cggaagcgcc acaagatgac attgatcacc 34140
cgcgtcaacg cgcggcacgc gacgcggctt atttgggaac aaaggactga acaacagtcc 34200
attcgaaatc ggtgacatca aagcggggac gggttatcag tggcctccaa gtcaagcctc 34260
aatgaatcaa aatcagaccg atttgcaaac ctgatttatg agtgtgcggc ctaaatgatg 34320
aaatcgtcct tctagatcgc ctccgtggtg tagcaacacc tcgcagtatc gccgtgctga 34380
ccttggccag ggaattgact ggcaagggtg ctttcacatg accgctcttt tggccgcgat 34440
agatgatttc gttgctgctt tgggcacgta gaaggagaga agtcatatcg gagaaattcc 34500
tcctggcgcg agagcctgct ctatcgcgac ggcatcccac tgtcgggaac agaccggatc 34560
attcacgagg cgaaagtcgt caacacatgc gttataggca tcttcccttg aaggatgatc 34620
ttgttgctgc caatctggag gtgcggcagc cgcaggcaga tgcgatctca gcgcaacttg 34680
cggcaaaaca tctcactcac ctgaaaacca ctagcgagtc tcgcgatcag acgaaggcct 34740
tttacttaac gacacaatat ccgatgtctg catcacaggc gtcgctatcc cagtcaatac 34800
taaagcggtg caggaactaa agattactga tgacttaggc gtgccacgag gcctgagacg 34860
acgcgcgtag acagtttttt gaaatcatta tcaaagtgat ggcctccgct gaagcctatc 34920
acctctgcgc cggtctgtcg gagagatggg caagcattat tacggtcttc gcgcccgtac 34980
atgcattgga cgattgcagg gtcaatggat ctgagatcat ccagaggatt gccgccctta 35040
ccttccgttt cgagttggag ccagccccta aatgagacga catagtcgac ttgatgtgac 35100
aatgccaaga gagagatttg cttaacccga tttttttgct caagcgtaag cctattgaag 35160
cttgccggca tgacgtccgc gccgaaagaa tatcctacaa gtaaaacatt ctgcacaccg 35220
aaatgcttgg tgtagacatc gattatgtga ccaagatcct tagcagtttc gcttggggac 35280
cgctccgacc agaaataccg aagtgaactg acgccaatga caggaatccc ttccgtctgc 35340
agataggtac catcgataga tctgctgcct cgcgcgtttc ggtgatgacg gtgaaaacct 35400
ctgacacatg cagctcccgg agacggtcac agcttgtctg taagcggatg ccgggagcag 35460
acaagcccgt cagggcgcgt cagcgggtgt tggcgggtgt cggggcgcag ccatgaccca 35520
gtcacgtagc gatagcggag tgtatactgg cttaactatg cggcatcaga gcagattgta 35580
ctgagagtgc accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc 35640
atcaggcgct cttccgcttc ctcgctcact gactcgctgc gctcggtcgt tcggctgcgg 35700
cgagcggtat cagctcactc aaaggcggta atacggttat ccacagaatc aggggataac 35760
gcaggaaaga acatgtgagc aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg 35820
ttgctggcgt ttttccatag gctccgcccc cctgacgagc atcacaaaaa tcgacgctca 35880
agtcagaggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc ccctggaagc 35940
tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg gatacctgtc cgcctttctc 36000
ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta ggtatctcag ttcggtgtag 36060
gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc 36120
ttatccggta actatcgtct tgagtccaac ccggtaagac acgacttatc gccactggca 36180
gcagccactg gtaacaggat tagcagagcg aggtatgtag gcggtgctac agagttcttg 36240
aagtggtggc ctaactacgg ctacactaga aggacagtat ttggtatctg cgctctgctg 36300
aagccagtta ccttcggaaa aagagttggt agctcttgat ccggcaaaca aaccaccgct 36360
ggtagcggtg gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa 36420
gaagatcctt tgatcttttc tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa 36480
gggattttgg tcatgagatt atcaaaaagg atcttcacct agatcctttt aaattaaaaa 36540
tgaagtttta aatcaatcta aagtatatat gagtaaactt ggtctgacag ttaccaatgc 36600
ttaatcagtg aggcacctat ctcagcgatc tgtctatttc gttcatccat agttgcctga 36660
ctccccgtcg tgtagataac tacgatacgg gagggcttac catctggccc cagtgctgca 36720
atgataccgc gagacccacg ctcaccggct ccagatttat cagcaataaa ccagccagcc 36780
ggaagggccg agcgcagaag tggtcctgca actttatccg cctccatcca gtctattaat 36840
tgttgccggg aagctagagt aagtagttcg ccagttaata gtttgcgcaa cgttgttgcc 36900
attgctgcag gggggggggg ggggggggac ttccattgtt cattccacgg acaaaaacag 36960
agaaaggaaa cgacagaggc caaaaagcct cgctttcagc acctgtcgtt tcctttcttt 37020
tcagagggta ttttaaataa aaacattaag ttatgacgaa gaagaacgga aacgccttaa 37080
accggaaaat tttcataaat agcgaaaacc cgcgaggtcg ccgccccgta acctgtcgga 37140
tcaccggaaa ggacccgtaa agtgataatg attatcatct acatatcaca acgtgcgtgg 37200
aggccatcaa accacgtcaa ataatcaatt atgacgcagg tatcgtatta attgatctgc 37260
atcaacttaa cgtaaaaaca acttcagaca atacaaatca gcgacactga atacggggca 37320
acctcatgtc cccccccccc ccccccctgc aggcatcgtg gtgtcacgct cgtcgtttgg 37380
tatggcttca ttcagctccg gttcccaacg atcaaggcga gttacatgat cccccatgtt 37440
gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt gtcagaagta agttggccgc 37500
agtgttatca ctcatggtta tggcagcact gcataattct cttactgtca tgccatccgt 37560
aagatgcttt tctgtgactg gtgagtactc aaccaagtca ttctgagaat agtgtatgcg 37620
gcgaccgagt tgctcttgcc cggcgtcaac acgggataat accgcgccac atagcagaac 37680
tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga aaactctcaa ggatcttacc 37740
gctgttgaga tccagttcga tgtaacccac tcgtgcaccc aactgatctt cagcatcttt 37800
tactttcacc agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg 37860
aataagggcg acacggaaat gttgaatact catactcttc ctttttcaat attattgaag 37920
catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa 37980
acaaataggg gttccgcgca catttccccg aaaagtgcca cctgacgtct aagaaaccat 38040
tattatcatg acattaacct ataaaaatag gcgtatcacg aggccctttc gtcttcaaga 38100
attggtcgac gatcttgctg cgttcggata ttttcgtgga gttcccgcca cagacccgga 38160
ttgaaggcga gatccagcaa ctcgcgccag atcatcctgt gacggaactt tggcgcgtga 38220
tgactggcca ggacgtcggc cgaaagagcg acaagcagat cacgcttttc gacagcgtcg 38280
gatttgcgat cgaggatttt tcggcgctgc gctacgtccg cgaccgcgtt gagggatcaa 38340
gccacagcag cccactcgac cttctagccg acccagacga gccaagggat ctttttggaa 38400
tgctgctccg tcgtcaggct ttccgacgtt tgggtggttg aacagaagtc attatcgtac 38460
ggaatgccaa gcactcccga ggggaaccct gtggttggca tgcacataca aatggacgaa 38520
cggataaacc ttttcacgcc cttttaaata tccgttattc taataaacgc tcttttctct 38580
taggtttacc cgccaatata tcctgtcaaa cactgatagt ttaaactgaa ggcgggaaac 38640
gacaatctga tcatgagcgg agaattaagg gagtcacgtt atgacccccg ccgatgacgc 38700
gggacaagcc gttttacgtt tggaactgac agaaccgcaa cgttgaagga gccactcagc 38760
aagctggtac gattgtaata cgactcacta tagggcgaat tgagcgctgt ttaaacgctc 38820
ttcaactgga agagcggtta cccggaccga agcttgaagt tcctattccg aagttcctat 38880
tctctagaaa gtataggaac ttcagatctc gatgctcacc ctgttgtttg gtgttacttc 38940
tgcaggtcga ctctagagga tccaccatga gcccagaacg acgcccggcc gacatccgcc 39000
gtgccaccga ggcggacatg ccggcggtct gcaccatcgt caaccactac atcgagacaa 39060
gcacggtcaa cttccgtacc gagccgcagg aaccgcagga ctggacggac gacctcgtcc 39120
gtctgcggga gcgctatccc tggctcgtcg ccgaggtgga cggcgaggtc gccggcatcg 39180
cctacgcggg cccctggaag gcacgcaacg cctacgactg gacggccgag tcgaccgtgt 39240
acgtctcccc ccgccaccag cggacgggac tgggctccac gctctacacc cacctgctga 39300
agtccctgga ggcacagggc ttcaagagcg tggtcgctgt catcgggctg cccaacgacc 39360
cgagcgtgcg catgcacgag gcgctcggat atgccccccg cggcatgctg cgggcggccg 39420
gcttcaagca cgggaactgg catgacgtgg gtttctggca gctggacttc agcctgccgg 39480
taccgccccg tccggtcctg cccgtcaccg agatctgatc cgtcgaccaa cctagacttg 39540
tccatcttct ggattggcca acttaattaa tgtatgaaat aaaaggatgc acacatagtg 39600
acatgctaat cactataatg tgggcatcaa agttgtgtgt tatgtgtaat tactagttat 39660
ctgaataaaa gagaaagaga tcatccatat ttcttatcct aaatgaatgt cacgtgtctt 39720
tataattctt tgatgaacca gatgcatttc attaaccaaa tccatataca tataaatatt 39780
aatcatatat aattaatatc aattgggtta gcaaaacaaa tctagtctag gtgtgttttg 39840
cgaattgcgg ccgcgatctg gggaattccc atggacaccg gtaattccca tgatcttctc 39900
tccttcatca atggatgcca tgtttcataa caataacacc aaatgtttga tgagctacca 39960
acaattgcgc aaagactatg gctaagctcg agctcgctcg ctacaagttg ttgactttca 40020
aatacaagtt tgtttttgga acaccaaata ttctacatga tctttcacta agttgcgcac 40080
cactatcaaa agattatcta ggccattatt caagtaaaga gtgaacacgt ctaagaccca 40140
caaccacacc aaatagaata cgcatacatg caacatattg tgcaagaagt atccaactgg 40200
actcccatgt attctaaaac tattttcgta gagttaaagt tatgacaaac ttatcaaata 40260
aaaatttgaa cgctggacca aaactttcat ctttcaaatc caccatcgtc tatcctcata 40320
aattgttttg attataacac atctacgtaa atcatttgtt ttgaacaata ctaatttaat 40380
tttattaagt caaataacct gcttagaaaa taatccctcc acctcattta acaatttctt 40440
gtcaaacaca caccaagaaa aaaattaatg aaagagaaaa gaaatgaaaa ggacatggag 40500
ttgaatacta gcaaaattga ttgaaggaag attcacaatt gaaattgaaa ccatttaatt 40560
tattttcggg tccataataa taaattggta agaataaaaa cccgatcaag tccggtacag 40620
tacaattcca ctccaccaac tccttactta aacccctatt tatacccact ctcatcctca 40680
ctcttccttc acctctcaca ctctcttctc tctctcaaaa ccctcacaca aacgctgcgt 40740
ttagtgtaag aaattcaatc cggcgccttg gcgcgccgat catccacaag tttgtacaaa 40800
aaagctgaac gagaaacgta aaatgatata aatatcaata tattaaatta gattttgcat 40860
aaaaaacaga ctacataata ctgtaaaaca caacatatcc agtcactatg gcggccgcat 40920
taggcacccc aggctttaca ctttatgctt ccggctcgta taatgtgtgg attttgagtt 40980
aggatttaaa tacgcgttga tccggcttac taaaagccag ataacagtat gcgtatttgc 41040
gcgctgattt ttgcggtata agaatatata ctgatatgta tacccgaagt atgtcaaaaa 41100
gaggtatgct atgaagcagc gtattacagt gacagttgac agcgacagct atcagttgct 41160
caaggcatat atgatgtcaa tatctccggt ctggtaagca caaccatgca gaatgaagcc 41220
cgtcgtctgc gtgccgaacg ctggaaagcg gaaaatcagg aagggatggc tgaggtcgcc 41280
cggtttattg aaatgaacgg ctcttttgct gacgagaaca ggggctggtg aaatgcagtt 41340
taaggtttac acctataaaa gagagagccg ttatcgtctg tttgtggatg tacagagtga 41400
tatcattgac acgcccggtc gacggatggt gatccccctg gccagtgcac gtctgctgtc 41460
agataaagtc tcccgtgaac tttacccggt ggtgcatatc ggggatgaaa gctggcgcat 41520
gatgaccacc gatatggcca gtgtgccggt ctccgttatc ggggaagaag tggctgatct 41580
cagccaccgc gaaaatgaca tcaaaaacgc cattaacctg atgttctggg gaatataaat 41640
gtcaggctcc cttatacaca gccagtctgc aggtcgacca tagtgactgg atatgttgtg 41700
ttttacagta ttatgtagtc tgttttttat gcaaaatcta atttaatata ttgatattta 41760
tatcatttta cgtttctcgt tcagctttct tgtacaaagt ggtgttaacc tagacttgtc 41820
catcttctgg attggccaac ttaattaatg tatgaaataa aaggatgcac acatagtgac 41880
atgctaatca ctataatgtg ggcatcaaag ttgtgtgtta tgtgtaatta ctagttatct 41940
gaataaaaga gaaagagatc atccatattt cttatcctaa atgaatgtca cgtgtcttta 42000
taattctttg atgaaccaga tgcatttcat taaccaaatc catatacata taaatattaa 42060
tcatatataa ttaatatcaa ttgggttagc aaaacaaatc tagtctaggt gtgttttgcg 42120
aattgcggcc gccaccgcgg tggagctcga attccggtcc gggtcacctt tgtccaccaa 42180
gatggaactg cggccgctca ttaattaagt caggcgcgcc tctagttgaa gacacgttca 42240
tgtcttcatc gtaagaagac actcagtagt cttcggccag aatggccatc tggattcagc 42300
aggcctagaa ggccatttaa atcctgagga tctggtcttc ctaaggaccc gggatatcgg 42360
accgattaaa ctttaattcg gtccgaagct tgaagttcct attccgaagt tcctattctc 42420
cagaaagtat aggaacttcg catgcctgca gtgcagcgtg acccggtcgt gcccctctct 42480
agagataatg agcattgcat gtctaagtta taaaaaatta ccacatattt tttttgtcac 42540
acttgtttga agtgcagttt atctatcttt atacatatat ttaaacttta ctctacgaat 42600
aatataatct atagtactac aataatatca gtgttttaga gaatcatata aatgaacagt 42660
tagacatggt ctaaaggaca attgagtatt ttgacaacag gactctacag ttttatcttt 42720
ttagtgtgca tgtgttctcc tttttttttg caaatagctt cacctatata atacttcatc 42780
cattttatta gtacatccat ttagggttta gggttaatgg tttttataga ctaatttttt 42840
tagtacatct attttattct attttagcct ctaaattaag aaaactaaaa ctctatttta 42900
gtttttttat ttaataattt agatataaaa tagaataaaa taaagtgact aaaaattaaa 42960
caaataccct ttaagaaatt aaaaaaacta aggaaacatt tttcttgttt cgagtagata 43020
atgccagcct gttaaacgcc gtcgacgagt ctaacggaca ccaaccagcg aaccagcagc 43080
gtcgcgtcgg gccaagcgaa gcagacggca cggcatctct gtcgctgcct ctggacccct 43140
ctcgagagtt ccgctccacc gttggacttg ctccgctgtc ggcatccaga aattgcgtgg 43200
cggagcggca gacgtgagcc ggcacggcag gcggcctcct cctcctctca cggcaccggc 43260
agctacgggg gattcctttc ccaccgctcc ttcgctttcc cttcctcgcc cgccgtaata 43320
aatagacacc ccctccacac cctctttccc caacctcgtg ttgttcggag cgcacacaca 43380
cacaaccaga tctcccccaa atccacccgt cggcacctcc gcttcaaggt acgccgctcg 43440
tcctcccccc cccccctctc taccttctct agatcggcgt tccggtccat gcatggttag 43500
ggcccggtag ttctacttct gttcatgttt gtgttagatc cgtgtttgtg ttagatccgt 43560
gctgctagcg ttcgtacacg gatgcgacct gtacgtcaga cacgttctga ttgctaactt 43620
gccagtgttt ctctttgggg aatcctggga tggctctagc cgttccgcag acgggatcga 43680
tttcatgatt ttttttgttt cgttgcatag ggtttggttt gcccttttcc tttatttcaa 43740
tatatgccgt gcacttgttt gtcgggtcat cttttcatgc ttttttttgt cttggttgtg 43800
atgatgtggt ctggttgggc ggtcgttcta gatcggagta gaattctgtt tcaaactacc 43860
tggtggattt attaattttg gatctgtatg tgtgtgccat acatattcat agttacgaat 43920
tgaagatgat ggatggaaat atcgatctag gataggtata catgttgatg cgggttttac 43980
tgatgcatat acagagatgc tttttgttcg cttggttgtg atgatgtggt gtggttgggc 44040
ggtcgttcat tcgttctaga tcggagtaga atactgtttc aaactacctg gtgtatttat 44100
taattttgga actgtatgtg tgtgtcatac atcttcatag ttacgagttt aagatggatg 44160
gaaatatcga tctaggatag gtatacatgt tgatgtgggt tttactgatg catatacatg 44220
atggcatatg cagcatctat tcatatgctc taaccttgag tacctatcta ttataataaa 44280
caagtatgtt ttataattat tttgatcttg atatacttgg atgatggcat atgcagcagc 44340
tatatgtgga tttttttagc cctgccttca tacgctattt atttgcttgg tactgtttct 44400
tttgtcgatg ctcaccctgt tgtttggtgt tacttctgca ggtcgacttt aacttagcct 44460
aggatccaca cgacaccatg atagaggtga aaccgattaa cgcagaggat acctatgaac 44520
taaggcatag aatactcaga ccaaaccagc cgatagaagc gtgtatgttt gaaagcgatt 44580
tacttcgtgg tgcatttcac ttaggcggct attacggggg caaactgatt tccatagctt 44640
cattccacca ggccgagcac tcagaactcc aaggccagaa acagtaccag ctccgaggta 44700
tggctacctt ggaaggttat cgtgagcaga aggcgggatc gagtctaatt aaacacgctg 44760
aagaaattct tcgtaagagg ggggcggact tgctttggtg taatgcgcgg acatccgcct 44820
caggctacta caaaaagtta ggcttcagcg agcagggaga ggtattcgac acgccgccag 44880
taggacctca catcctgatg tataaaagga tcacataact agctagtcag ttaacctaga 44940
cttgtccatc ttctggattg gccaacttaa ttaatgtatg aaataaaagg atgcacacat 45000
agtgacatgc taatcactat aatgtgggca tcaaagttgt gtgttatgtg taattactag 45060
ttatctgaat aaaagagaaa gagatcatcc atatttctta tcctaaatga atgtcacgtg 45120
tctttataat tctttgatga accagatgca tttcattaac caaatccata tacatataaa 45180
tattaatcat atataattaa tatcaattgg gttagcaaaa caaatctagt ctaggtgtgt 45240
tttgcgaatt cagagctcga attcattccg attaatcgtg gcctcttgct cttcaggatg 45300
aagagctatg tttaaacgtg caagcgctac tagacaattc agtacattaa aaacgtccgc 45360
aatgtgttat taagttgtct aagcgtcaat ttgtttacac cacaatatat cctgccacca 45420
gccagccaac agctccccga ccggcagctc ggcacaaaat caccactcga tacaggcagc 45480
ccatcagtcc gggacggcgt cagcgggaga gccgttgtaa ggcggcagac tttgctcatg 45540
ttaccgatgc tattcggaag aacggcaact aagctgccgg gtttgaaaca cggatgatct 45600
cgcggagggt agcatgttga ttgtaacgat gacagagcgt tgctgcctgt gatcaaatat 45660
catctccctc gcagagatcc gaattatcag ccttcttatt catttctcgc ttaaccgtga 45720
caggctgtcg atcttgagaa ctatgccgac ataataggaa atcgctggat aaagccgctg 45780
aggaagctga gtggcgctat ttctttagaa gtgaacgttg acgatcgtcg accgtacccc 45840
gatgaattaa ttcggacgta cgttctgaac acagctggat acttacttgg gcgattgtca 45900
tacatgacat caacaatgta cccgtttgtg taaccgtctc ttggaggttc gtatgacact 45960
agtggttccc ctcagcttgc gactagatgt tgaggcctaa cattttatta gagagcaggc 46020
tagttgctta gatacatgat cttcaggccg ttatctgtca gggcaagcga aaattggcca 46080
tttatgacga ccaatgcccc gcagaagctc ccatctttgc cgccatagac gccgcgcccc 46140
ccttttgggg tgtagaacat ccttttgcca gatgtggaaa agaagttcgt tgtcccattg 46200
ttggcaatga cgtagtagcc ggcgaaagtg cgagacccat ttgcgctata tataagccta 46260
cgatttccgt tgcgactatt gtcgtaattg gatgaactat tatcgtagtt gctctcagag 46320
ttgtcgtaat ttgatggact attgtcgtaa ttgcttatgg agttgtcgta gttgcttgga 46380
gaaatgtcgt agttggatgg ggagtagtca tagggaagac gagcttcatc cactaaaaca 46440
attggcaggt cagcaagtgc ctgccccgat gccatcgcaa gtacgaggct tagaaccacc 46500
ttcaacagat cgcgcatagt cttccccagc tctctaacgc ttgagttaag ccgcgccgcg 46560
aagcggcgtc ggcttgaacg aattgttaga cattatttgc cgactacctt ggtgatctcg 46620
cctttcacgt agtgaacaaa ttcttccaac tgatctgcgc gcgaggccaa gcgatcttct 46680
tgtccaagat aagcctgcct agcttcaagt atgacgggct gatactgggc cggcaggcgc 46740
tccattgccc agtcggcagc gacatccttc ggcgcgattt tgccggttac tgcgctgtac 46800
caaatgcggg acaacgtaag cactacattt cgctcatcgc cagcccagtc gggcggcgag 46860
ttccatagcg ttaaggtttc atttagcgcc tcaaatagat cctgttcagg aaccggatca 46920
aagagttcct ccgccgctgg acctaccaag gcaacgctat gttctcttgc ttttgtcagc 46980
aagatagcca gatcaatgtc gatcgtggct ggctcgaaga tacctgcaag aatgtcattg 47040
cgctgccatt ctccaaattg cagttcgcgc ttagctggat aacgccacgg aatgatgtcg 47100
tcgtgcacaa caatggtgac ttctacagcg cggagaatct cgctctctcc aggggaagcc 47160
gaagtttcca aaaggtcgtt gatcaaagct cgccgcgttg tttcatcaag ccttacagtc 47220
accgtaacca gcaaatcaat atcactgtgt ggcttcaggc cgccatccac tgcggagccg 47280
tacaaatgta cggccagcaa cgtcggttcg agatggcgct cgatgacgcc aactacctct 47340
gatagttgag tcgatacttc ggcgatcacc gcttccctca tgatgtttaa ctcctgaatt 47400
aagccgcgcc gcgaagcggt gtcggcttga atgaattgtt aggcgtcatc ctgtgctccc 47460
gagaaccagt accagtacat cgctgtttcg ttcgagactt gaggtctagt tttatacgtg 47520
aacaggtcaa tgccgccgag agtaaagcca cattttgcgt acaaattgca ggcaggtaca 47580
ttgttcgttt gtgtctctaa tcgtatgcca aggagctgtc tgcttagtgc ccactttttc 47640
gcaaattcga tgagactgtg cgcgactcct ttgcctcggt gcgtgtgcga cacaacaatg 47700
tgttcgatag aggctagatc gttccatgtt gagttgagtt caatcttccc gacaagctct 47760
tggtcgatga atgcgccata gcaagcagag tcttcatcag agtcatcatc cgagatgtaa 47820
tccttccggt aggggctcac acttctggta gatagttcaa agccttggtc ggataggtgc 47880
acatcgaaca cttcacgaac aatgaaatgg ttctcagcat ccaatgtttc cgccacctgc 47940
tcagggatca ccgaaatctt catatgacgc ctaacgcctg gcacagcgga tcgcaaacct 48000
ggcgcggctt ttggcacaaa aggcgtgaca ggtttgcgaa tccgttgctg ccacttgtta 48060
acccttttgc cagatttggt aactataatt tatgttagag gcgaagtctt gggtaaaaac 48120
tggcctaaaa ttgctgggga tttcaggaaa gtaaacatca ccttccggct cgatgtctat 48180
tgtagatata tgtagtgtat ctacttgatc gggggatctg ctgcctcgcg cgtttcggtg 48240
atgacggtga aaacctctga cacatgcagc tcccggagac ggtcacagct tgtctgtaag 48300
cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc gggtgttggc gggtgtcggg 48360
gcgcagccat gacccagtca cgtagcgata gcggagtgta tactggctta actatgcggc 48420
atcagagcag attgtactga gagtgcacca tatgcggtgt gaaataccgc acagatgcgt 48480
aaggagaaaa taccgcatca ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 48540
ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac 48600
agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 48660
ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 48720
caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 48780
gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 48840
cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta 48900
tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 48960
gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 49020
cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 49080
tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg 49140
tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 49200
caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 49260
aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 49320
cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 49380
ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 49440
tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 49500
atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc 49560
tggccccagt gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc 49620
aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc 49680
catccagtct attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt 49740
gcgcaacgtt gttgccattg ctgca 49765
<210>14
<211>4222
<212>DNA
<213>玉米
<400>14
ggcaccaacc aagagaaaag tagcaactca cacacagtag gagaggagag gaaagaaccc 60
agcgatagat cagagtcgga gagaaaattt tgcctcctcc catcggaatc tagctaatta 120
ttttccagtc cttctccccc atgtaagtct ccctctccca tccccgtcgc ctgcacctac 180
cggaagcgaa gccttctaca gttcaggtcc atatccgggt cgccgccgtc ggctagctct 240
aggcggacat ggtggccgct tgccggagca cggcctccac ctacgcgttt ctgctgctct 300
tgctgtttgc cgctgcggtc gccgtctctt cttcctccgg gttgcgccgt gtccaggacc 360
aggacagatc ggcgctgctc cagatcaaga acgccttccc tgccgtagac ctgctccagc 420
agtggtcccc ggactctagc ggccgtaacc actgctcctg gctgggggta acatgcgact 480
caagctcccg ggtcgtcgct ctggaggtgc tctccccttc acggcgtttc ggacccggcc 540
aggagcttgc cggcgagctt cctgcggcgg ttgggctcct caccgagttg aaagtggtct 600
cttttccgct ccacggcctc cggggcgaga tccccggtga gatctggcgg ttggaagatc 660
tcgaggtggt caacctcgct gggaactctc tccggggcgc ccttcctgcc gccttcccgc 720
cgcggctgag ggtgctctcg ctcgcttcca acctgcttca cggtgagatc ccgtcttccc 780
tttccacctg caaagacttg gagaggttgg atctttcagg caaccggttc actggatcgg 840
tgccgagagc acttggcagc ctgcctaatc tgaagtggct tgacttgtct gcaaaccttc 900
ttgtcggtgg catcccctct ggtctgggga attgcaggca gctccgctcg ctgcggttgt 960
tctccaattc attacacggt tcaattccag cagggattgg aaagctgagg aaactacagg 1020
tgttggacgt atcaagaaac agattaagtg gcctggtgcc accggagctc ggaaattgct 1080
cagatttgtc agtgctcgta ttgtctagcc agtccaattc agtaaagtca catgagttca 1140
atctgtttaa aggaggaatt ccagagagtg tgacagcttt gccaagactc cgggtgctat 1200
gggtgccaag ggcgggcctg gaaggaactc tcccgagcaa ctggggaagg tgttctaatt 1260
tagaaatggt taatctgggg ggaaatttac tttctggagc aatcccaagg gatctaggac 1320
agtgcagtaa cctcaagttt ctcaacctca gctcaaatag attatctggt ttactggata 1380
aggatctatg tccgcattgt atgactgtat tcgatgtcag cggaaatgaa atttcaggat 1440
caattccagc atgtgtgaat aaagtctgta cttctcatct agtgctggat aaaatgtcat 1500
ccagttattc ttcattcctc atgtccaaaa ctctacaaga actgccatca gttttttgca 1560
actccgggga gtgttctgtt gtgtatcata attttgcaaa gaacaacctt gaaggtcacc 1620
ttacatcctt gccatttagt gctgacaggt ttggaaacaa gacgtcatat gtgtttgtcg 1680
ttgatcacaa taatttcagt ggatcgttgg attcaattct gttggaacag tgtagcaact 1740
tgaaggggtt ggttgtaagc ttccgagaca acaagatatc tggtcagatc acagcagagt 1800
ttagcagaaa atgcagtgct atcagagctt tggatttagc tggcaatcaa atatcaggaa 1860
tgatgcctga taatgttggt ttgttaggag cccttgtcaa gatggacatg agcagaaatt 1920
ttttggaggg tcaaatacct gccagtttca aagatttcaa gagcttgaag tttctctcat 1980
tagctgggaa caacattagt ggcagaatac catcctgttt gggtcagttg agatcactga 2040
gggttttaga tctctcatct aattctcttg ctggtgagat cccaaataac cttgtgacac 2100
taggagacat aactgtgctt ctactcaaca ataataggct ctctggaaac attccaaatt 2160
ttgcctcttc accatcgctg tccatattca atgtttcatt caatgatttg tctgggccac 2220
tgccttcaaa aattcactca ctgacatgca acagtattcg tggaaatcca tcccttcaac 2280
cttgtggact gtcaacgctc tccagtcctt tagtgaatgc tcgagcacta agtgaggcag 2340
acaataatcc accagctgat aatacagccc ctgatgacaa tggtaacggt ggcggattca 2400
gcaaaataga gattgcctcc ataacttcag catcagcaat tgttgcagtt ctcttggctc 2460
tggtcatcct ttatatttac acacgaaaat gtgcatcaag accatcaagg cgttctctaa 2520
gaagggaagt aactgttttt gttgatattg gtgctccctt gacatatgag gctgttttac 2580
gtgccagtgg aagcttcaat gcaagtaatt gcatcggaag tggtggcttc ggagcaacgt 2640
acaaagctga ggtagcacca ggaaaattgg tggcaataaa gagacttgct attggaaggt 2700
tccaaggcat tcagcaattc caagcagagg tcaaaacact tgggaggtgt cgccattcca 2760
atcttgtaac actcatagga taccatctca gcgattcaga gatgtttcta atatataatt 2820
ttctgcctgg tggcaattta gaaagattca tacaagaaag gactaagaga ccaattgatt 2880
ggagaatgct tcataaaatt gctctagatg ttgcgcgtgc acttgcatat cttcatgaca 2940
attgtgtccc ccgcatctta cacagggatg tcaaaccaag taacatcttg cttgataatg 3000
attacactgc ctacctttct gattttggat tagcaaggct gcttggaaat tcagaaacac 3060
atgcaaccac tggtgtggca ggtacttttg gatatgtagc tccagagtat gcgatgacat 3120
gccgcgtttc tgataaggcg gatgtgtata gctatggagt tgtacttctt gaactcattt 3180
cagacaaaaa ggcactggat ccttctttct ctccttacgg gaatggattc aacatagttg 3240
cctgggcttg catgcttcta caaaagggtc gagctcgtga attctttata gaagggctgt 3300
gggatgtggc cccacatgac gaccttgttg agattctaca cctgggcatc aagtgtacgg 3360
ttgattctct ttcttctagg cccacaatga agcaagttgt tcggcgactt aaggaactca 3420
gaccaccgtc ttactaggaa tagcaagcag agactggctt gcttctgcat aatgagacca 3480
agactgctgg ctgtttgggt gactgcaaca gccagtaatg ttccaaataa actggtggca 3540
actggcaagc aattcaaatg tcatttctca agggatcctc aagcaggaca ttcatggtat 3600
ttaatcattt ggtcgtcatg gtcactgtat ggaacgccga tatcgcttca atgccactgc 3660
aagcaagcat caaggatgcc tgaggaagtt cttctatgcc tgccaaatct gatgactgga 3720
ttaggctctg ttcgtttagt ccagattgga ggccggaacc attccaactc atcaagctaa 3780
tacaaatttg cgaactattc caatccggaa ccaatccagt aaagcattca ggaagaaccg 3840
aacgggcact taggcattac cgacccagtt aacaggatgc attatatacc ttgctaagca 3900
attattacgg cgtaacggag atcgaccacc ttgactcatt ggaacgtagc catatatctg 3960
gcagttctag catatatctg atcaaagcaa agccatatat ctggcagttc tggggtgatc 4020
cgtccgtgga aaaggcttgg ccactgacct gcgcgaggac aaagcagtag aaacatatca 4080
gccaatcaaa gcaaagccat atgaattcat gttaaacatg tccatacata tagcagtgaa 4140
aagtatcaaa tcactgattt cctttcttgg cctttgtaac tgtatatact gctttcgtgg 4200
cctttgtaaa aaaaaaaaaa aa 4222
<210>15
<211>1062
<212>PRT
<213>玉米
<400>15
Met Val Ala Ala Cys Arg Ser Thr Ala Ser Thr Tyr Ala Phe Leu Leu
1 5 10 15
Leu Leu Leu Phe Ala Ala Ala Val Ala Val Ser Ser Ser Ser Gly Leu
20 25 30
Arg Arg Val Gln Asp Gln Asp Arg Ser Ala Leu Leu Gln Ile Lys Asn
35 40 45
Ala Phe Pro Ala Val Asp Leu Leu Gln Gln Trp Ser Pro Asp Ser Ser
50 55 60
Gly Arg Asn His Cys Ser Trp Leu Gly Val Thr Cys Asp Ser Ser Ser
65 70 75 80
Arg Val Val Ala Leu Glu Val Leu Ser Pro Ser Arg Arg Phe Gly Pro
85 90 95
Gly Gln Glu Leu Ala Gly Glu Leu Pro Ala Ala Val Gly Leu Leu Thr
100 105 110
Glu Leu Lys Val Val Ser Phe Pro Leu His Gly Leu Arg Gly Glu Ile
115 120 125
Pro Gly Glu Ile Trp Arg Leu Glu Asp Leu Glu Val Val Asn Leu Ala
130 135 140
Gly Asn Ser Leu Arg Gly Ala Leu Pro Ala Ala Phe Pro Pro Arg Leu
145 150 155 160
Arg Val Leu Ser Leu Ala Ser Asn Leu Leu His Gly Glu Ile Pro Ser
165 170 175
Ser Leu Ser Thr Cys Lys Asp Leu Glu Arg Leu Asp Leu Ser Gly Asn
180 185 190
Arg Phe Thr Gly Ser Val Pro Arg Ala Leu Gly Ser Leu Pro Asn Leu
195 200 205
Lys Trp Leu Asp Leu Ser Ala Asn Leu Leu Val Gly Gly Ile Pro Ser
210 215 220
Gly Leu Gly Asn Cys Arg Gln Leu Arg Ser Leu Arg Leu Phe Ser Asn
225 230 235 240
Ser Leu His Gly Ser Ile Pro Ala Gly Ile Gly Lys Leu Arg Lys Leu
245 250 255
Gln Val Leu Asp Val Ser Arg Asn Arg Leu Ser Gly Leu Val Pro Pro
260 265 270
Glu Leu Gly Asn Cys Ser Asp Leu Ser Val Leu Val Leu Ser Ser Gln
275 280 285
Ser Asn Ser Val Lys Ser His Glu Phe Asn Leu Phe Lys Gly Gly Ile
290 295 300
Pro Glu Ser Val Thr Ala Leu Pro Arg Leu Arg Val Leu Trp Val Pro
305 310 315 320
Arg Ala Gly Leu Glu Gly Thr Leu Pro Ser Asn Trp Gly Arg Cys Ser
325 330 335
Asn Leu Glu Met Val Asn Leu Gly Gly Asn Leu Leu Ser Gly Ala Ile
340 345 350
Pro Arg Asp Leu Gly Gln Cys Ser Asn Leu Lys Phe Leu Asn Leu Ser
355 360 365
Ser Asn Arg Leu Ser Gly Leu Leu Asp Lys Asp Leu Cys Pro His Cys
370 375 380
Met Thr Val Phe Asp Val Ser Gly Asn Glu Ile Ser Gly Ser Ile Pro
385 390 395 400
Ala Cys Val Asn Lys Val Cys Thr Ser His Leu Val Leu Asp Lys Met
405 410 415
Ser Ser Ser Tyr Ser Ser Phe Leu Met Ser Lys Thr Leu Gln Glu Leu
420 425 430
Pro Ser Val Phe Cys Asn Ser Gly Glu Cys Ser Val Val Tyr His Asn
435 440 445
Phe Ala Lys Asn Asn Leu Glu Gly His Leu Thr Ser Leu Pro Phe Ser
450 455 460
Ala Asp Arg Phe Gly Asn Lys Thr Ser Tyr Val Phe Val Val Asp His
465 470 475 480
Asn Asn Phe Ser Gly Ser Leu Asp Ser Ile Leu Leu Glu Gln Cys Ser
485 490 495
Asn Leu Lys Gly Leu Val Val Ser Phe Arg Asp Asn Lys Ile Ser Gly
500 505 510
Gln Ile Thr Ala Glu Phe Ser Arg Lys Cys Ser Ala Ile Arg Ala Leu
515 520 525
Asp Leu Ala Gly Asn Gln Ile Ser Gly Met Met Pro Asp Asn Val Gly
530 535 540
Leu Leu Gly Ala Leu Val Lys Met Asp Met Ser Arg Asn Phe Leu Glu
545 550 555 560
Gly Gln Ile Pro Ala Ser Phe Lys Asp Phe Lys Ser Leu Lys Phe Leu
565 570 575
Ser Leu Ala Gly Asn Asn Ile Ser Gly Arg Ile Pro Ser Cys Leu Gly
580 585 590
Gln Leu Arg Ser Leu Arg Val Leu Asp Leu Ser Ser Asn Ser Leu Ala
595 600 605
Gly Glu Ile Pro Asn Asn Leu Val Thr Leu Gly Asp Ile Thr Val Leu
610 615 620
Leu Leu Asn Asn Asn Arg Leu Ser Gly Asn Ile Pro Asn Phe Ala Ser
625 630 635 640
Ser Pro Ser Leu Ser Ile Phe Asn Val Ser Phe Asn Asp Leu Ser Gly
645 650 655
Pro Leu Pro Ser Lys Ile His Ser Leu Thr Cys Asn Ser Ile Arg Gly
660 665 670
Asn Pro Ser Leu Gln Pro Cys Gly Leu Ser Thr Leu Ser Ser Pro Leu
675 680 685
Val Asn Ala Arg Ala Leu Ser Glu Ala Asp Asn Asn Pro Pro Ala Asp
690 695 700
Asn Thr Ala Pro Asp Asp Asn Gly Asn Gly Gly Gly Phe Ser Lys Ile
705 710 715 720
Glu Ile Ala Ser Ile Thr Ser Ala Ser Ala Ile Val Ala Val Leu Leu
725 730 735
Ala Leu Val Ile Leu Tyr Ile Tyr Thr Arg Lys Cys Ala Ser Arg Pro
740 745 750
Ser Arg Arg Ser Leu Arg Arg Glu Val Thr Val Phe Val Asp Ile Gly
755 760 765
Ala Pro Leu Thr Tyr Glu Ala Val Leu Arg Ala Ser Gly Ser Phe Asn
770 775 780
Ala Ser Asn Cys Ile Gly Ser Gly Gly Phe Gly Ala Thr Tyr Lys Ala
785 790 795 800
Glu Val Ala Pro Gly Lys Leu Val Ala Ile Lys Arg Leu Ala Ile Gly
805 810 815
Arg Phe Gln Gly Ile Gln Gln Phe Gln Ala Glu Val Lys Thr Leu Gly
820 825 830
Arg Cys Arg His Ser Asn Leu Val Thr Leu Ile Gly Tyr His Leu Ser
835 840 845
Asp Ser Glu Met Phe Leu Ile Tyr Asn Phe Leu Pro Gly Gly Asn Leu
850 855 860
Glu Arg Phe Ile Gln Glu Arg Thr Lys Arg Pro Ile Asp Trp Arg Met
865 870 875 880
Leu His Lys Ile Ala Leu Asp Val Ala Arg Ala Leu Ala Tyr Leu His
885 890 895
Asp Asn Cys Val Pro Arg Ile Leu His Arg Asp Val Lys Pro Ser Asn
900 905 910
Ile Leu Leu Asp Asn Asp Tyr Thr Ala Tyr Leu Ser Asp Phe Gly Leu
915 920 925
Ala Arg Leu Leu Gly Asn Ser Glu Thr His Ala Thr Thr Gly Val Ala
930 935 940
Gly Thr Phe Gly Tyr Val Ala Pro Glu Tyr Ala Met Thr Cys Arg Val
945 950 955 960
Ser Asp Lys Ala Asp Val Tyr Ser Tyr Gly Val Val Leu Leu Glu Leu
965 970 975
Ile Ser Asp Lys Lys Ala Leu Asp Pro Ser Phe Ser Pro Tyr Gly Asn
980 985 990
Gly Phe Asn Ile Val Ala Trp Ala Cys Met Leu Leu Gln Lys Gly Arg
995 1000 1005
Ala Arg Glu Phe Phc Ile Glu Gly Leu Trp Asp Val Ala Pro His
1010 1015 1020
Asp Asp Leu Val Glu Ile Leu His Leu Gly Ile Lys Cys Thr Val
1025 1030 1035
Asp Ser Leu Ser Ser Arg Pro Thr Met Lys Gln Val Val Arg Arg
1040 1045 1050
Leu Lys Glu Leu Arg Pro Pro Ser Tyr
1055 1060
<210>16
<211>3534
<212>DNA
<213>玉米
<400>16
ccctttcctc tctacaccga cgagacccta acaaggcccc cgccgccgcc gccgcctccg 60
cacatggccg ctcgaacctt gctgctcgtg ctgctcttgc ttctgctcgg tgacgccacc 120
tcggcttccg tcagcggcca gcgcgaagcg ttgatgaagt tcaaagcggc cgtgacggcc 180
gacccgggcg gcctgctgcg cggctggtcc ccggcgtccg gggaccactg ccgctggccg 240
ggcgtgtcgt gcggcgcttc cggggaggtg gtcgcgctca acgtcacctc ctcgcccggg 300
cgcgcgctcg cgggcgcgct gtctccggcc gtcgcggcgc tgcgggagct ccgcgtgctc 360
gctctcccgt cccacgcgct ctcgggcccg ctcccgcccg cgatctggac gctgcgccgc 420
ctccgcgtcc tcgacctctc cggcaatcgc ctccagggcg ggatccccgc agtgctcgtt 480
tgcgtctccc tccagacgct ggaccttgcc tacaaccagc tcaacggctc cgtgcccgcg 540
gccctcggtg cgctccctgt gctacggcgg ctgtcacttg cctgcaaccg cttcggtggc 600
gccatccctg acgagctcgg tggcgccggt tgccgcaacc tgcagttcct cgatgtctct 660
gggaacatgc tcgtcggtgg cataccccgg agcctgggga attgtactga gctgcaggcg 720
ctgttattgt cctcgaacaa tctggatgac atcatcccgc cggagattgg tcggctcaag 780
aacctgcgtg cattagatgt gtccaggaac agtttgagtg ggcctgtgcc agcggagctt 840
ggtggttgca ttcagctgtc agtgcttgtt ctgtccaatc cttatgcacc cactgctggt 900
tctgattctt ctgattatgg tgagctcgat gacttcaact acttccaagg agggattccg 960
gatactatcg ctacactgcc aaagctgagg atgttgtggg cgccaagagc cacattggag 1020
ggagagttgc cgggtaattg gagttcttgc cagagcttgg agatgataaa tttgggggag 1080
aatctgttct ctggtgggat tcctaagggg ctggtggaat gtgagaattt gaagttcctg 1140
aatttgagca tgaacaagtt tacaggttca gttgattctt cgctgccagt accttgcatg 1200
gacgtgtttg atgtcagtgg aaatcagctg tctggctcgt tgccagtatt tatgtcaaaa 1260
aagaattgtc tttcatccca ggccccacgg gatgacttgg tgtcggagta ttcttccttc 1320
ttcacatatc aagcgcttgc tggctttatg tcatccccat cacccttgga tgcacatttg 1380
acaagctatc atagtttttc caggaacaat tttactggtc cagttacatc cttacctctt 1440
gctactgaaa agttggggat gcaggggtcc tatgcattct tggctgatgg gaatcatctt 1500
ggtggtcagc ttcaacctag tctatttgac aagtgcaaca gctcaagggg cctcgtcgtg 1560
gaaatcagta acaacttgat aagtggagct attcccacag acattggctc gctttgcagt 1620
tctcttcttg ttcttggtgt tgctggtaat cagctttcag gtatgatacc atcaagtatc 1680
ggggagttaa gttaccttat cagcttggat ttgagtagga atcgccttgg tggtgtaatt 1740
cctacttctg tgaagaactt gctgcattta cagcgcctct ctttggctca gaaccttctg 1800
aatggcacaa ttccacctga tattaatcag ttgcatgctc tcaaggtttt ggacctgtca 1860
tcaaacctcc tcatggggat gatccctgat gcccttgctg acttgagaaa tctcactgct 1920
ctcctccttg ataacaataa acttactggg aagattcctt caggatttgc taactcggca 1980
tcccttacca cgtttaatgt gtcattcaac aatttgtctg gtccagtgcc aacgaatggt 2040
aatacggtta gatgtgacag tgttattggg aatcctttgc tgcaatcttg tcatgtgtac 2100
actctggctg tgccatcagc tgctcagcag ggtcgaggtt tgaattcgaa tgacagcaat 2160
gatacaacac cttcaaactc acaaaacgaa ggagcgaaca attcatttaa tgcaattgaa 2220
attgcttcaa taacttctgc aacagccatc gtttccatcc tccttgcact gattgcactc 2280
ttcatataca caaggaagtg cgcgccccgg atgtcagctc ggtcttctgg aaggagagaa 2340
gttacactct tccaagatat tggtgtccca atcacttatg agactgttgt tcgagccact 2400
ggaagtttca atgcaagcaa ttgcatcgga agtggaggct ttggagccac ttacaaggct 2460
gaaattgcac ctggagtgtt ggtagctatt aagagactct ctgtcgggag atttcaggga 2520
gcccaacagt ttgacgctga gataaaaact ctagggagat taagacatcc aaatcttgtt 2580
accttggtag gttaccatct aggcgagtct gaaatgtttc tcatatataa ctacttgtct 2640
ggaggaaacc ttgagaggtt tatacaggag agatcgaaga gaccggtaga ctggaaaatg 2700
ctgcacaaga ttgcattgga cgttgccaaa gcacttgctt atctgcatga tacctgtgtt 2760
cctcgcatcc ttcaccggga cgtgaagccg agcaatattt tgttggacac caactatact 2820
gcttacctct cggactttgg actggcgaga ctcttgggaa attcagaaac acatgcaacc 2880
actggtgtag ctggaacatt tggatatgtt gctccagaat atgctatgac ttgtcgtgtt 2940
tcagataaag ctgatgtgta tagctatggt gttgtcctga tggagctaat ctcagacaag 3000
aaggctttgg acccgtcctt ttccccatat ggtaatgggt tcaacatagt tgcttgggct 3060
tgcatgctgc ttcgtcaagg ccgtgctcgg gaattcttta ttgatggtct gtgggatgtt 3120
ggcccgcatg atgacttggt agaaacattg catttagcag tgatatgcac tgctgattca 3180
ctctctatac ggccaactat gaagcaggtc gttcagagat taaagcagct acagcctcca 3240
attcgtgaac atcgatagag atgaataggt ttacaatttc cacatgtaga aggtaaagat 3300
agctgtgacc tgtgctcttg atgatttttt tttcctaatc atctgtttga ctaggtttta 3360
tcttgtagat ggcagctttg tgtaatatag ttttctcctg aatctctcac tagtcactaa 3420
tgtatctgct gataagaagt gcaggtgtaa attaacagtt taatgccaaa gatttcagtt 3480
aagat taaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaag 3534
<210>17
<211>1064
<212>PRT
<213>玉米
<400>17
Met Ala Ala Arg Thr Leu Leu Leu Val Leu Leu Leu Leu Leu Leu Gly
1 5 10 15
Asp Ala Thr Ser Ala Ser Val Ser Gly Gln Arg Glu Ala Leu Met Lys
20 25 30
Phe Lys Ala Ala Val Thr Ala Asp Pro Gly Gly Leu Leu Arg Gly Trp
35 40 45
Ser Pro Ala Ser Gly Asp His Cys Arg Trp Pro Gly Val Ser Cys Gly
50 55 60
Ala Ser Gly Glu Val Val Ala Leu Asn Val Thr Ser Ser Pro Gly Arg
65 70 75 80
Ala Leu Ala Gly Ala Leu Ser Pro Ala Val Ala Ala Leu Arg Glu Leu
85 90 95
Arg Val Leu Ala Leu Pro Ser His Ala Leu Ser Gly Pro Leu Pro Pro
100 105 110
Ala Ile Trp Thr Leu Arg Arg Leu Arg Val Leu Asp Leu Ser Gly Asn
115 120 125
Arg Leu Gln Gly Gly Ile Pro Ala Val Leu Val Cys Val Ser Leu Gln
130 135 140
Thr Leu Asp Leu Ala Tyr Asn Gln Leu Asn Gly Ser Val Pro Ala Ala
145 150 155 160
Leu Gly Ala Leu Pro Val Leu Arg Arg Leu Ser Leu Ala Cys Asn Arg
165 170 175
Phe Gly Gly Ala Ile Pro Asp Glu Leu Gly Gly Ala Gly Cys Arg Asn
180 185 190
Leu Gln Phe Leu Asp Val Ser Gly Asn Met Leu Val Gly Gly Ile Pro
195 200 205
Arg Ser Leu Gly Asn Cys Thr Glu Leu Gln Ala Leu Leu Leu Ser Ser
210 215 220
Asn Asn Leu Asp Asp Ile Ile Pro Pro Glu Ile Gly Arg Leu Lys Asn
225 230 235 240
Leu Arg Ala Leu Asp Val Ser Arg Asn Ser Leu Ser Gly Pro Val Pro
245 250 255
Ala Glu Leu Gly Gly Cys Ile Gln Leu Ser Val Leu Val Leu Ser Asn
260 265 270
Pro Tyr Ala Pro Thr Ala Gly Ser Asp Ser Ser Asp Tyr Gly Glu Leu
275 280 285
Asp Asp Phe Asn Tyr Phe Gln Gly Gly Ile Pro Asp Thr Ile Ala Thr
290 295 300
Leu Pro Lys Leu Arg Met Leu Trp Ala Pro Arg Ala Thr Leu Glu Gly
305 310 315 320
Glu Leu Pro Gly Asn Trp Ser Ser Cys Gln Ser Leu Glu Met Ile Asn
325 330 335
Leu Gly Glu Asn Leu Phe Ser Gly Gly Ile Pro Lys Gly Leu Val Glu
340 345 350
Cys Glu Asn Leu Lys Phe Leu Asn Leu Ser Met Asn Lys Phe Thr Gly
355 360 365
Ser Val Asp Ser Ser Leu Pro Val Pro Cys Met Asp Val Phe Asp Val
370 375 380
Ser Gly Asn Gln Leu Ser Gly Ser Leu Pro Val Phe Met Ser Lys Lys
385 390 395 400
Asn Cys Leu Ser Ser Gln Ala Pro Arg Asp Asp Leu Val Ser Glu Tyr
405 410 415
Ser Ser Phe Phe Thr Tyr Gln Ala Leu Ala Gly Phe Met Ser Ser Pro
420 425 430
Ser Pro Leu Asp Ala His Leu Thr Ser Tyr His Ser Phe Ser Arg Asn
435 440 445
Asn Phe Thr Gly Pro Val Thr Ser Leu Pro Leu Ala Thr Glu Lys Leu
450 455 460
Gly Met Gln Gly Ser Tyr Ala Phe Leu Ala Asp Gly Asn His Leu Gly
465 470 475 480
Gly Gln Leu Gln Pro Ser Leu Phe Asp Lys Cys Asn Ser Ser Arg Gly
485 490 495
Leu Val Val Glu Ile Ser Asn Asn Leu Ile Ser Gly Ala Ile Pro Thr
500 505 510
Asp Ile Gly Ser Leu Cys Ser Ser Leu Leu Val Leu Gly Val Ala Gly
515 520 525
Asn Gln Leu Ser Gly Met Ile Pro Ser Ser Ile Gly Glu Leu Ser Tyr
530 535 540
Leu Ile Ser Leu Asp Leu Ser Arg Asn Arg Leu Gly Gly Val Ile Pro
545 550 555 560
Thr Ser Val Lys Asn Leu Leu His Leu Gln Arg Leu Ser Leu Ala Gln
565 570 575
Asn Leu Leu Asn Gly Thr Ile Pro Pro Asp Ile Asn Gln Leu His Ala
580 585 590
Leu Lys Val Leu Asp Leu Ser Ser Asn Leu Leu Met Gly Met Ile Pro
595 600 605
Asp Ala Leu Ala Asp Leu Arg Asn Leu Thr Ala Leu Leu Leu Asp Asn
610 615 620
Asn Lys Leu Thr Gly Lys Ile Pro Ser Gly Phe Ala Asn Ser Ala Ser
625 630 635 640
Leu Thr Thr Phe Asn Val Ser Phe Asn Asn Leu Ser Gly Pro Val Pro
645 650 655
Thr Asn Gly Asn Thr Val Arg Cys Asp Ser Val Ile Gly Asn Pro Leu
660 665 670
Leu Gln Ser Cys His Val Tyr Thr Leu Ala Val Pro Ser Ala Ala Gln
675 680 685
Gln Gly Arg Gly Leu Asn Ser Asn Asp Ser Asn Asp Thr Thr Pro Ser
690 695 700
Asn Ser Gln Asn Glu Gly Ala Asn Asn Ser Phe Asn Ala Ile Glu Ile
705 710 715 720
Ala Ser Ile Thr Ser Ala Thr Ala Ile Val Ser Ile Leu Leu Ala Leu
725 730 735
Ile Ala Leu Phe Ile Tyr Thr Arg Lys Cys Ala Pro Arg Met Ser Ala
740 745 750
Arg Ser Ser Gly Arg Arg Glu Val Thr Leu Phe Gln Asp Ile Gly Val
755 760 765
Pro Ile Thr Tyr Glu Thr Val Val Arg Ala Thr Gly Ser Phe Asn Ala
770 775 780
Ser Asn Cys Ile Gly Ser Gly Gly Phe Gly Ala Thr Tyr Lys Ala Glu
785 790 795 800
Ile Ala Pro Gly Val Leu Val Ala Ile Lys Arg Leu Ser Val Gly Arg
805 810 815
Phe Gln Gly Ala Gln Gln Phe Asp Ala Glu Ile Lys Thr Leu Gly Arg
820 825 830
Leu Arg His Pro Asn Leu Val Thr Leu Val Gly Tyr His Leu Gly Glu
835 840 845
Ser Glu Met Phe Leu Ile Tyr Asn Tyr Leu Ser Gly Gly Asn Leu Glu
850 855 860
Arg Phe Ile Gln Glu Arg Ser Lys Arg Pro Val Asp Trp Lys Met Leu
865 870 875 880
His Lys Ile Ala Leu Asp Val Ala Lys Ala Leu Ala Tyr Leu His Asp
885 890 895
Thr Cys Val Pro Arg Ile Leu His Arg Asp Val Lys Pro Ser Asn Ile
900 905 910
Leu Leu Asp Thr Asn Tyr Thr Ala Tyr Leu Ser Asp Phe Gly Leu Ala
915 920 925
Arg Leu Leu Gly Asn Ser Glu Thr His Ala Thr Thr Gly Val Ala Gly
930 935 940
Thr Phe Gly Tyr Val Ala Pro Glu Tyr Ala Met Thr Cys Arg Val Ser
945 950 955 960
Asp Lys Ala Asp Val Tyr Ser Tyr Gly Val Val Leu Met Glu Leu Ile
965 970 975
Ser Asp Lys Lys Ala Leu Asp Pro Ser Phe Ser Pro Tyr Gly Asn Gly
980 985 990
Phe Asn Ile Val Ala Trp Ala Cys Met Leu Leu Arg Gln Gly Arg Ala
995 1000 1005
Arg Glu Phe Phe Ile Asp Gly Leu Trp Asp Val Gly Pro His Asp
1010 1015 1020
Asp Leu Val Glu Thr Leu His Leu Ala Val Ile Cys Thr Ala Asp
1025 1030 1035
Ser Leu Ser Ile Arg Pro Thr Met Lys Gln Val Val Gln Arg Leu
1040 1045 1050
Lys Gln Leu Gln Pro Pro Ile Arg Glu His Arg
1055 1060
<210>18
<211>1797
<212>DNA
<213>玉米
<400>18
atgggcttcc tccgccgctc cgccaccacc gccgtcctct tcgctctcca cctcctcctc 60
ctcgccaccg cctccgtcgc gtcctcgccg tcatcctccc tgccggagca gcaggacgac 120
gacgtgcagg cgctgctcca cctcaagcgc ggcctcgcct ccggcgccga cgctcctctc 180
cggcagtggt cgctggagtc cggcgcccac cactgctcct ggccgggggt cacctgcgac 240
gcgcggtccg gccgcgtcgt ggccctcgcc ctcgcgctcg gcggtcgcct cggcggggag 300
ctgtccccgg cggtcgctcg cctcaccgag ctcagagcgc tgtccttccc ctcggccggg 360
ctcggcggcg agattccccc gcagctctgg cggctggggc gcctgcaggc gctcaacctc 420
gccggcaact ctctccgcgg ccgcctgccg gccaccttcc cggaggggct caaaagcttg 480
gacctttccg ggaaccggct gagcggggga atcccgccgg gtctggggag ctgcgcgacg 540
ctccggcgcc tgcgcctgtc ctccaactgg ttggctggaa ccatcccgcc gcggatcggg 600
gagctcgcgc ggctccgcgt tctggacctg tccgggaacc ggctcaccgg cggcgtgccg 660
ccggagctcc tccattgcag gggcctcgtc aggatggatc tcagcaggaa cttgctccac 720
ggccggctgc cctcgggcct cgcgcagctc aagaacctga ggtttctctc cctctccggc 780
aacaatttca gcggcgagat accttccgtt gcactcaccc tgtgcatctg cacaaggaaa 840
tggcgtctga aaccatcaga gcgctccttt gcgagcaaag aagtcaaggt tttcgctgat 900
gttgatattg gagcccccct gacatatgag accgtcgttc gcgccactgg gaatttcaac 960
gctagcaact gcattggtaa cggcggcttt ggcgcgacat acagagccga ggtggcacct 1020
ggagttcttg tggcaataaa gaggcttgct attgggaagc agcacggcga caaggagttc 1080
caagcagaag tgaggatcct cgggcaatgc cgtcatcctc atcttgtcac tctgttgggg 1140
taccacatca acgagtccga gatgttcctg atatacaact acctgccagg tggcaatctg 1200
gaaaggttca tacaagagag gggcaggagg ccgatcagct ggagaaggct tcacaagatc 1260
gccctggatg tcgcccgcgc tctcgcttac atgcacgacg aatgcgtgcc ccgcgtcctg 1320
caccgggatg tcaagccaaa caacatactg ctcgacaacg agtgcaacgc ctacctttcc 1380
gatttcggat tggcgaggct cctccggaac tcggaaacgc atgcgacgac ggacgttgcc 1440
ggtactttcg gttatgtggc tcctgagtac gcaatgacgt gccgcgtgtc ggacaaggca 1500
gacgtgtaca gctatggagt ggtgctgctg gagctgattt cggacaagaa ggcgctggac 1560
ccgtcgtttt ctccatacgg aaacggcttc aacatcgtca gctgggctgt gaggcttatc 1620
cagagaggca gggtccgcga gttcttcgtc gagggcttgt gggagaaggc cccgcatgat 1680
gatctggtcg agttcctgaa cctggcggtg cggtgcacgc aggagtcgct cgcttctagg 1740
cccacaatga agcatgtcct tcgatgcttg agggaactcc gtccaccttc gtactag 1797
<210>19
<211>598
<212>PRT
<213>玉米
<400>19
Met Gly Phe Leu Arg Arg Ser Ala Thr Thr Ala Val Leu Phe Ala Leu
1 5 10 15
His Leu Leu Leu Leu Ala Thr Ala Ser Val Ala Ser Ser Pro Ser Ser
20 25 30
Ser Leu Pro Glu Gln Gln Asp Asp Asp Val Gln Ala Leu Leu His Leu
35 40 45
Lys Arg Gly Leu Ala Ser Gly Ala Asp Ala Pro Leu Arg Gln Trp Ser
50 55 60
Leu Glu Ser Gly Ala His His Cys Ser Trp Pro Gly Val Thr Cys Asp
65 70 75 80
Ala Arg Ser Gly Arg Val Val Ala Leu Ala Leu Ala Leu Gly Gly Arg
85 90 95
Leu Gly Gly Glu Leu Ser Pro Ala Val Ala Arg Leu Thr Glu Leu Arg
100 105 110
Ala Leu Ser Phe Pro Ser Ala Gly Leu Gly Gly Glu Ile Pro Pro Gln
115 120 125
Leu Trp Arg Leu Gly Arg Leu Gln Ala Leu Asn Leu Ala Gly Asn Ser
130 135 140
Leu Arg Gly Arg Leu Pro Ala Thr Phe Pro Glu Gly Leu Lys Ser Leu
145 150 155 160
Asp Leu Ser Gly Asn Arg Leu Ser Gly Gly Ile Pro Pro Gly Leu Gly
165 170 175
Ser Cys Ala Thr Leu Arg Arg Leu Arg Leu Ser Ser Asn Trp Leu Ala
180 185 190
Gly Thr Ile Pro Pro Arg Ile Gly Glu Leu Ala Arg Leu Arg Val Leu
195 200 205
Asp Leu Ser Gly Asn Arg Leu Thr Gly Gly Val Pro Pro Glu Leu Leu
210 215 220
His Cys Arg Gly Leu Val Arg Met Asp Leu Ser Arg Asn Leu Leu His
225 230 235 240
Gly Arg Leu Pro Ser Gly Leu Ala Gln Leu Lys Asn Leu Arg Phe Leu
245 250 255
Ser Leu Ser Gly Asn Asn Phe Ser Gly Glu Ile Pro Ser Val Ala Leu
260 265 270
Thr Leu Cys Ile Cys Thr Arg Lys Trp Arg Leu Lys Pro Ser Glu Arg
275 280 285
Ser Phe Ala Ser Lys Glu Val Lys Val Phe Ala Asp Val Asp Ile Gly
290 295 300
Ala Pro Leu Thr Tyr Glu Thr Val Val Arg Ala Thr Gly Asn Phe Asn
305 310 315 320
Ala Ser Asn Cys Ile Gly Asn Gly Gly Phe Gly Ala Thr Tyr Arg Ala
325 330 335
Glu Val Ala Pro Gly Val Leu Val Ala Ile Lys Arg Leu Ala Ile Gly
340 345 350
Lys Gln His Gly Asp Lys Glu Phe Gln Ala Glu Val Arg Ile Leu Gly
355 360 365
Gln Cys Arg His Pro His Leu Val Thr Leu Leu Gly Tyr His Ile Asn
370 375 380
Glu Ser Glu Met Phe Leu Ile Tyr Asn Tyr Leu Pro Gly Gly Asn Leu
385 390 395 400
Glu Arg Phe Ile Gln Glu Arg Gly Arg Arg Pro Ile Ser Trp Arg Arg
405 410 415
Leu His Lys Ile Ala Leu Asp Val Ala Arg Ala Leu Ala Tyr Met His
420 425 430
Asp Glu Cys Val Pro Arg Val Leu His Arg Asp Val Lys Pro Asn Asn
435 440 445
Ile Leu Leu Asp Asn Glu Cys Asn Ala Tyr Leu Ser Asp Phe Gly Leu
450 455 460
Ala Arg Leu Leu Arg Asn Ser Glu Thr His Ala Thr Thr Asp Val Ala
465 470 475 480
Gly Thr Phe Gly Tyr Val Ala Pro Glu Tyr Ala Met Thr Cys Arg Val
485 490 495
Ser Asp Lys Ala Asp Val Tyr Ser Tyr Gly Val Val Leu Leu Glu Leu
500 505 510
Ile Ser Asp Lys Lys Ala Leu Asp Pro Ser Phe Ser Pro Tyr Gly Asn
515 520 525
Gly Phe Asn Ile Val Ser Trp Ala Val Arg Leu Ile Gln Arg Gly Arg
530 535 540
Val Arg Glu Phe Phe Val Glu Gly Leu Trp Glu Lys Ala Pro His Asp
545 550 555 560
Asp Leu Val Glu Phe Leu Asn Leu Ala Val Arg Cys Thr Gln Glu Ser
565 570 575
Leu Ala Ser Arg Pro Thr Met Lys His Val Leu Arg Cys Leu Arg Glu
580 585 590
Leu Arg Pro Pro Ser Tyr
595
<210>20
<211>2380
<212>DNA
<213>拟南芥属
<400>20
aatgtcattt tctttttgag taaagaaact ttcttctttt gcaacttgaa aattttcgat 60
ttcgagagcg aaggaagaga gagagagagg gagagatcaa aaaaatttgt gaaaaaagag 120
atttatttta gggttttgtt gattttcttt gttcattttg ccgtcgtctc tttctctttc 180
tcttgtgaag aaaaagaaga aaaatgaaac ttctgggttt ggtcttcttg ctgtttaatc 240
tgtttatgtt ttcgttttct cggaaacttt taaccgagag tggtggtggt ctccacgacg 300
aagctgctct gttgaagttg aagagtagtt tcttggatcc aaatggagtt ctgtctagct 360
gggtttccga cagctcatca aaccattgtt cttggtacgg tgtctcttgc aactctgatt 420
cgagagtcgt ttctttaatt ctcagaggct gtgatgagct tgaaggtagt ggtgttcttc 480
atctcccaga tttgtcttct tgttcttcaa gtaaaagaag acttggtggt gtaatctctc 540
ctgttgttgg agatttgtct gaaattaggg ttttatctct gtcttttaat gatcttagag 600
gtgagattcc aaaggagatt tgggggttgg agaaattaga gattcttgat cttaaaggta 660
acaactttat cggtggaatt agggttgttg ataatgttgt tcttaggaaa cttatgagtt 720
ttgaggatga agatgaaatt ggtccaagct cagctgatga tgattctcct ggtaaatctg 780
gattgtatcc gattgagata gcttctattg tgtcagcttc agtgattgtc tttgtgcttc 840
ttgttcttgt gatcttgttc atctacacga ggaaatggaa acgaaactct caggttcagg 900
tagatgagat caaggagatt aaagtgtttg ttgacattgg tattcctctg acgtatgaga 960
tcattgttag agctactggt tactttagta actctaactg tatcggtcac ggcggtttcg 1020
gttcgactta taaagcagag gtgtctccaa cgaatgtgtt tgctgttaaa agactctctg 1080
ttgggaggtt tcaaggtgat caacagtttc atgctgagat ctctgctctt gagatggtta 1140
ggcatccgaa tcttgttatg ttaatcggtt accatgcgag cgaaaccgag atgtttctta 1200
tttacaacta cttatctgga ggaaaccttc aagatttcat taaagagaga tcgaaagctg 1260
ctattgagtg gaaggtactt cacaagattg ctcttgatgt agcgcgtgct ctctcctatc 1320
ttcacgaaca gtgttctcct aaagtcttgc atagagatat caaaccgagt aacatactct 1380
tggacaacaa ctacaacgct tatctatctg atttcggact ctcgaaactc ttgggaactt 1440
cgcagtctca tgtcacaact ggtgtggctg gaacattcgg atatgtagct ccagagtacg 1500
caatgacatg ccgagtctcg gagaaagcag atgtttacag ctatgggata gtcctcttgg 1560
agctgatatc ggacaaacga gctctggatc cttctttctc atcgcatgag aacgggttca 1620
acattgtctc gtgggctcat atgatgttga gtcaaggcaa agcgaaagaa gtcttcacca 1680
caggactatg ggagactggt cctccagacg atttggtcga ggttctgcat ttggcactca 1740
aatgcaccgt cgatagcctc tcgataaggc cgacaatgaa acaagctgta agactgctga 1800
aacgaatcca gccttctaga ttgtgataca aagctagaag aaaggtgttt ttagaggatt 1860
tgtacagcag caaacaagtg cctagccatt agaatctttc tcaagtctgt tttttatatt 1920
ttcagtgcta gggaggtttt tcatatgcat catcaaaatg ttattctcag tctactctct 1980
tgaaaactga tataaatgat gtttctttct ccttaattct ctctgtataa agcaggtaaa 2040
aaatacatat atgacttaaa aattgagaaa tgaccagaga ctcttaaaaa catcttctct 2100
aaacaagggt tggttagagg gtttttggtc tattataaga gattagaagg agaagaagtg 2160
ttgacaacaa tgtgtctaag aggatgatct acatgaggac caccggcatg tttaacgaaa 2220
tccgccggag acaagaaatc gccatggcaa acacacatga tcctaacttc ttctccactg 2280
ccgtatctat aaagaatccc atcaactctc ttcccatttg gtccatctcc tttggtaaac 2340
acacatggca tctcactttt tcctttccct tgtggctcca 2380
<210>21
<211>1623
<212>DNA
<213>拟南芥属
<400>21
atgaaacttc tgggtttggt cttcttgctg tttaatctgt ttatgttttc gttttctcgg 60
aaacttttaa ccgagagtgg tggtggtctc cacgacgaag ctgctctgtt gaagttgaag 120
agtagtttct tggatccaaa tggagttctg tctagctggg tttccgacag ctcatcaaac 180
cattgttctt ggtacggtgt ctcttgcaac tctgattcga gagtcgtttc tttaattctc 240
agaggctgtg atgagcttga aggtagtggt gttcttcatc tcccagattt gtcttcttgt 300
tcttcaagta aaagaagact tggtggtgta atctctcctg ttgttggaga tttgtctgaa 360
attagggttt tatctctgtc ttttaatgat cttagaggtg agattccaaa ggagatttgg 420
gggttggaga aattagagat tcttgatctt aaaggtaaca actttatcgg tggaattagg 480
gttgttgata atgttgttct taggaaactt atgagttttg aggatgaaga tgaaattggt 540
ccaagctcag ctgatgatga ttctcctggt aaatctggat tgtatccgat tgagatagct 600
tctattgtgt cagcttcagt gattgtcttt gtgcttcttg ttcttgtgat cttgttcatc 660
tacacgagga aatggaaacg aaactctcag gttcaggtag atgagatcaa ggagattaaa 720
gtgtttgttg acattggtat tcctctgacg tatgagatca ttgttagagc tactggttac 780
tttagtaact ctaactgtat cggtcacggc ggtttcggtt cgacttataa agcagaggtg 840
tctccaacga atgtgtttgc tgttaaaaga ctctctgttg ggaggtttca aggtgatcaa 900
cagtttcatg ctgagatctc tgctcttgag atggttaggc atccgaatct tgttatgtta 960
atcggttacc atgcgagcga aaccgagatg tttcttattt acaactactt atctggagga 1020
aaccttcaag atttcattaa agagagatcg aaagctgcta ttgagtggaa ggtacttcac 1080
aagattgctc ttgatgtagc gcgtgctctc tcctatcttc acgaacagtg ttctcctaaa 1140
gtcttgcata gagatatcaa accgagtaac atactcttgg acaacaacta caacgcttat 1200
ctatctgatt tcggactctc gaaactcttg ggaacttcgc agtctcatgt cacaactggt 1260
gtggctggaa cattcggata tgtagctcca gagtacgcaa tgacatgccg agtctcggag 1320
aaagcagatg tttacagcta tgggatagtc ctcttggagc tgatatcgga caaacgagct 1380
ctggatcctt ctttctcatc gcatgagaac gggttcaaca ttgtctcgtg ggctcatatg 1440
atgttgagtc aaggcaaagc gaaagaagtc ttcaccacag gactatggga gactggtcct 1500
ccagacgatt tggtcgaggt tctgcatttg gcactcaaat gcaccgtcga tagcctctcg 1560
ataaggccga caatgaaaca agctgtaaga ctgctgaaac gaatccagcc ttctagattg 1620
tga 1623
<210>22
<211>540
<212>PRT
<213>拟南芥属
<400>22
Met Lys Leu Leu Gly Leu Val Phe Leu Leu Phe Asn Leu Phe Met Phe
1 5 10 15
Ser Phe Ser Arg Lys Leu Leu Thr Glu Ser Gly Gly Gly Leu His Asp
20 25 30
Glu Ala Ala Leu Leu Lys Leu Lys Ser Ser Phe Leu Asp Pro Asn Gly
35 40 45
Val Leu Ser Ser Trp Val Ser Asp Ser Ser Ser Asn His Cys Ser Trp
50 55 60
Tyr Gly Val Ser Cys Asn Ser Asp Ser Arg Val Val Ser Leu Ile Leu
65 70 75 80
Arg Gly Cys Asp Glu Leu Glu Gly Ser Gly Val Leu His Leu Pro Asp
85 90 95
Leu Ser Ser Cys Ser Ser Ser Lys Arg Arg Leu Gly Gly Val Ile Ser
100 105 110
Pro Val Val Gly Asp Leu Ser Glu Ile Arg Val Leu Ser Leu Ser Phe
115 120 125
Asn Asp Leu Arg Gly Glu Ile Pro Lys Glu Ile Trp Gly Leu Glu Lys
130 135 140
Leu Glu Ile Leu Asp Leu Lys Gly Asn Asn Phe Ile Gly Gly Ile Arg
145 150 155 160
Val Val Asp Asn Val Val Leu Arg Lys Leu Met Ser Phe Glu Asp Glu
165 170 175
Asp Glu Ile Gly Pro Ser Ser Ala Asp Asp Asp Ser Pro Gly Lys Ser
180 185 190
Gly Leu Tyr Pro Ile Glu Ile Ala Ser Ile Val Ser Ala Ser Val Ile
195 200 205
Val Phe Val Leu Leu Val Leu Val Ile Leu Phe Ile Tyr Thr Arg Lys
210 215 220
Trp Lys Arg Asn Ser Gln Val Gln Val Asp Glu Ile Lys Glu Ile Lys
225 230 235 240
Val Phe Val Asp Ile Gly Ile Pro Leu Thr Tyr Glu Ile Ile Val Arg
245 250 255
Ala Thr Gly Tyr Phe Ser Asn Ser Asn Cys Ile Gly His Gly Gly Phe
260 265 270
Gly Ser Thr Tyr Lys Ala Glu Val Ser Pro Thr Asn Val Phe Ala Val
275 280 285
Lys Arg Leu Ser Val Gly Arg Phe Gln Gly Asp Gln Gln Phe His Ala
290 295 300
Glu Ile Ser Ala Leu Glu Met Val Arg His Pro Asn Leu Val Met Leu
305 310 315 320
Ile Gly Tyr His Ala Ser Glu Thr Glu Met Phe Leu Ile Tyr Asn Tyr
325 330 335
Leu Ser Gly Gly Asn Leu Gln Asp Phe Ile Lys Glu Arg Ser Lys Ala
340 345 350
Ala Ile Glu Trp Lys Val Leu His Lys Ile Ala Leu Asp Val Ala Arg
355 360 365
Ala Leu Ser Tyr Leu His Glu Gln Cys Ser Pro Lys Val Leu His Arg
370 375 380
Asp Ile Lys Pro Ser Asn Ile Leu Leu Asp Asn Asn Tyr Asn Ala Tyr
385 390 395 400
Leu Ser Asp Phe Gly Leu Ser Lys Leu Leu Gly Thr Ser Gln Ser His
405 410 415
Val Thr Thr Gly Val Ala Gly Thr Phe Gly Tyr Val Ala Pro Glu Tyr
420 425 430
Ala Met Thr Cys Arg Val Ser Glu Lys Ala Asp Val Tyr Ser Tyr Gly
435 440 445
Ile Val Leu Leu Glu Leu Ile Ser Asp Lys Arg Ala Leu Asp Pro Ser
450 455 460
Phe Ser Ser His Glu Asn Gly Phe Asn Ile Val Ser Trp Ala His Met
465 470 475 480
Met Leu Ser Gln Gly Lys Ala Lys Glu Val Phe Thr Thr Gly Leu Trp
485 490 495
Glu Thr Gly Pro Pro Asp Asp Leu Val Glu Val Leu His Leu Ala Leu
500 505 510
Lys Cys Thr Val Asp Ser Leu Ser Ile Arg Pro Thr Met Lys Gln Ala
515 520 525
Val Arg Leu Leu Lys Arg Ile Gln Pro Ser Arg Leu
530 535 540
<210>23
<211>1049
<212>PRT
<213>水稻
<400>23
Met Val Ala Ala Arg Arg Ser Ala Ala Ser Ser Ser Phe Phe Leu Leu
1 5 10 15
Leu Leu Val Leu Ala Val Arg Val Arg Ala Ser Ser Asp Arg Val Gln
20 25 30
Glu Arg Asp Arg Ser Ala Leu Leu Glu Leu Arg Gly Ala Ala Gly Leu
35 40 45
Leu Gly Arg Trp Pro Thr Gly Ser Ala Val Ala Asp His Cys Ser Trp
50 55 60
Pro Gly Val Thr Cys Asp Ala Ser Arg Arg Val Val Ala Val Ala Val
65 70 75 80
Ala Ala Pro Pro Ala Ser Gly Ser Ser Glu Leu Ala Gly Glu Leu Ser
85 90 95
Pro Ala Val Gly Leu Leu Thr Glu Leu Arg Glu Leu Ser Leu Pro Ser
100 105 110
Arg Gly Leu Arg Gly Glu Ile Pro Ala Glu Ile Trp Arg Leu Glu Lys
115 120 125
Leu Glu Val Val Asn Leu Ala Gly Asn Ser Leu His Gly Ala Leu Pro
130 135 140
Leu Ala Phe Pro Pro Arg Met Arg Val Leu Asp Leu Ala Ser Asn Arg
145 150 155 160
Leu His Gly Glu Ile Gln Gly Thr Leu Ser Asp Cys Lys Ser Leu Met
165 170 175
Arg Leu Asn Leu Ser Gly Asn Arg Leu Thr Gly Ser Val Pro Gly Val
180 185 190
Leu Gly Ser Leu Pro Lys Leu Lys Leu Leu Asp Leu Ser Arg Asn Leu
195 200 205
Leu Thr Gly Arg Ile Pro Ser Glu Leu Gly Asp Cys Arg Glu Leu Arg
210 215 220
Ser Leu Gln Leu Phe Ser Asn Leu Leu Glu Gly Ser Ile Pro Pro Glu
225 230 235 240
Ile Gly Arg Leu Arg Arg Leu Gln Val Leu Asp Ile Ser Ser Asn Arg
245 250 255
Leu Asn Gly Pro Val Pro Met Glu Leu Gly Asn Cys Met Asp Leu Ser
260 265 270
Val Leu Val Leu Thr Ser Gln Phe Asp Ala Val Asn Leu Ser Glu Phe
275 280 285
Asn Met Phe Ile Gly Gly Ile Pro Glu Ser Val Thr Ala Leu Pro Lys
290 295 300
Leu Arg Met Leu Trp Ala Pro Arg Ala Gly Phe Glu Gly Asn Ile Pro
305 310 315 320
Ser Asn Trp Gly Arg Cys His Ser Leu Glu Met Val Asn Leu Ala Glu
325 330 335
Asn Leu Leu Ser Gly Val Ile Pro Arg Glu Leu Gly Gln Cys Ser Asn
340 345 350
Leu Lys Phe Leu Asn Leu Ser Ser Asn Lys Leu Ser Gly Ser Ile Asp
355 360 365
Asn Gly Leu Cys Pro His Cys Ile Ala Val Phe Asp Val Ser Arg Asn
370 375 380
Glu Leu Ser Gly Thr Ile Pro Ala Cys Ala Asn Lys Gly Cys Thr Pro
385 390 395 400
Gln Leu Leu Asp Asp Met Pro Ser Arg Tyr Pro Ser Phe Phe Met Ser
405 410 415
Lys Ala Leu Ala Gln Pro Ser Ser Gly Tyr Cys Lys Ser Gly Asn Cys
420 425 430
Ser Val Val Tyr His Asn Phe Ala Asn Asn Asn Leu Gly Gly His Leu
435 440 445
Thr Ser Leu Pro Phe Ser Ala Asp Arg Phe Gly Asn Lys Ile Leu Tyr
450 455 460
Ala Phe His Val Asp Tyr Asn Asn Phe Thr Gly Ser Leu His Glu Ile
465 470 475 480
Leu Leu Ala Gln Cys Asn Asn Val Glu Gly Leu Ile Val Ser Phe Arg
485 490 495
Asp Asn Lys Ile Ser Gly Gly Leu Thr Glu Glu Met Ser Thr Lys Cys
500 505 510
Ser Ala Ile Arg Ala Leu Asp Leu Ala Gly Asn Arg Ile Thr Gly Val
515 520 525
Met Pro Gly Asn Ile Gly Leu Leu Ser Ala Leu Val Lys Met Asp Ile
530 535 540
Ser Arg Asn Leu Leu Glu Gly Gln Ile Pro Ser Ser Phe Lys Glu Leu
545 550 555 560
Lys Ser Leu Lys Phe Leu Ser Leu Ala Glu Asn Asn Leu Ser Gly Thr
565 570 575
Ile Pro Ser Cys Leu Gly Lys Leu Arg Ser Leu Glu Val Leu Asp Leu
580 585 590
Ser Ser Asn Ser Leu Ser Gly Lys Ile Pro Arg Asn Leu Val Thr Leu
595 600 605
Thr Tyr Leu Thr Ser Leu Leu Leu Asn Asn Asn Lys Leu Ser Gly Asn
610 615 620
Ile Pro Asp Ile Ala Pro Ser Ala Ser Leu Ser Ile Phe Asn Ile Ser
625 630 635 640
Phe Asn Asn Leu Ser Gly Pro Leu Pro Leu Asn Met His Ser Leu Ala
645 650 655
Cys Asn Ser Ile Gln Gly Asn Pro Ser Leu Gln Pro Cys Gly Leu Ser
660 665 670
Thr Leu Ala Asn Thr Val Met Lys Ala Arg Ser Leu Ala Glu Gly Asp
675 680 685
Val Pro Pro Ser Asp Ser Ala Thr Val Asp Ser Gly Gly Gly Phe Ser
690 695 700
Lys Ile Glu Ile Ala Ser Ile Thr Ser Ala Ser Ala Ile Val Ala Val
705 710 715 720
Leu Leu Ala Leu Ile Ile Leu Tyr Ile Tyr Thr Arg Lys Cys Ala Ser
725 730 735
Arg Gln Ser Arg Arg Ser Ile Arg Arg Arg Glu Val Thr Val Phe Val
740 745 750
Asp Ile Gly Ala Pro Leu Thr Tyr Glu Thr Val Val Arg Ala Thr Gly
755 760 765
Ser Phe Asn Ala Ser Asn Cys Ile Gly Ser Gly Gly Phe Gly Ala Thr
770 775 780
Tyr Lys Ala Glu Ile Ala Pro Gly Val Leu Val Ala Ile Lys Arg Leu
785 790 795 800
Ala Ile Gly Arg Phe Gln Gly Ile Gln Gln Phe Gln Ala Glu Val Lys
805 810 815
Thr Leu Gly Arg Cys Arg His Pro Asn Leu Val Thr Leu Ile Gly Tyr
820 825 830
His Leu Ser Asp Ser Glu Met Phe Leu Ile Tyr Asn Phe Leu Pro Gly
835 840 845
Gly Asn Leu Glu Arg Phe Ile Gln Glu Arg Ala Lys Arg Pro Ile Asp
850 855 860
Trp Arg Met Leu His Lys Ile Ala Leu Asp Ile Ala Arg Ala Leu Gly
865 870 875 880
Phe Leu His Asp Ser Cys Val Pro Arg Ile Leu His Arg Asp Val Lys
885 890 895
Pro Ser Asn Ile Leu Leu Asp Asn Glu Tyr Asn Ala Tyr Leu Ser Asp
900 905 910
Phe Gly Leu Ala Arg Leu Leu Gly Asn Ser Glu Thr His Ala Thr Thr
915 920 925
Gly Val Ala Gly Thr Phe Gly Tyr Val Ala Pro Glu Tyr Ala Met Thr
930 935 940
Cys Arg Val Ser Asp Lys Ala Asp Val Tyr Ser Tyr Gly Val Val Leu
945 950 955 960
Leu Glu Leu Ile Ser Asp Lys Lys Ala Leu Asp Pro Ser Phe Ser Pro
965 970 975
Tyr Gly Asn Gly Phe Asn Ile Val Ala Trp Ala Cys Met Leu Leu Gln
980 985 990
Lys Gly Arg Ala Arg Glu Phe Phe Ile Glu Gly Leu Trp Asp Val Ala
995 1000 1005
Pro His Asp Asp Leu Val Glu Ile Leu His Leu Gly Ile Lys Cys
1010 1015 1020
Thr Val Asp Ser Leu Ser Ser Arg Pro Thr Met Lys Gln Val Val
1025 1030 1035
Arg Arg Leu Lys Glu Leu Arg Pro Pro Ser Tyr
1040 1045
<210>24
<211>1070
<212>PRT
<213>水稻
<400>24
Met Ala Thr Leu Asn Thr Gln Thr Leu Leu Thr Leu Phe Leu Leu Leu
1 5 10 15
Leu Leu Ala Ala Ala Ala Ala Ala Ala Asp Ala Gly Gly Gly Gly Glu
20 25 30
Arg Glu Ala Leu Leu Arg Phe Lys Ala Gly Val Ala Ser Asp Pro Gly
35 40 45
Gly Leu Leu Arg Gly Trp Thr Thr Ala Ala Ser Pro Asp His Cys Ala
50 55 60
Trp Pro Gly Val Ser Cys Gly Gly Asn Gly Glu Val Val Ala Leu Asn
65 70 75 80
Val Ser Ser Ser Pro Gly Arg Arg Leu Ala Gly Ala Leu Ser Pro Ala
85 90 95
Val Ala Ala Leu Arg Gly Leu Arg Val Leu Ala Leu Pro Ser His Ala
100 105 110
Leu Ser Gly Gln Leu Pro Ala Ala Ile Trp Ser Leu Arg Arg Leu Leu
115 120 125
Val Leu Asp Leu Ser Gly Asn Arg Leu Gln Gly Glu Ile Pro Pro Ala
130 135 140
Leu Ala Cys Ala Gly Leu Gln Thr Leu Asp Leu Ser Tyr Asn Gln Leu
145 150 155 160
Asn Gly Ser Val Pro Ala Ser Leu Gly Ala Leu Pro Gly Leu Arg Arg
165 170 175
Leu Ser Leu Ala Ser Asn Arg Leu Gly Gly Ala Ile Pro Asp Glu Leu
180 185 190
Gly Gly Ala Gly Cys Arg Ser Leu Gln Tyr Leu Asp Leu Ser Gly Asn
195 200 205
Leu Leu Val Gly Gly Ile Pro Arg Ser Leu Gly Asn Cys Ser Lys Leu
210 215 220
Glu Ala Leu Leu Leu Ser Ser Asn Leu Leu Asp Asp Val Ile Pro Pro
225 230 235 240
Glu Ile Gly Arg Leu Arg Asn Leu Arg Ala Leu Asp Val Ser Arg Asn
245 250 255
Ser Leu Ser Gly Ser Val Pro Ala Glu Leu Gly Gly Cys Val Glu Leu
260 265 270
Ser Val Leu Val Leu Ser Asn Pro Tyr Thr Pro Ile Gly Gly Ser Asn
275 280 285
Ser Ser Asp Tyr Gly Asp Val Asp Asp Phe Asn Tyr Phe Gln Gly Gly
290 295 300
Ile Pro Asp Ala Val Val Ala Leu Pro Lys Leu Arg Val Leu Trp Ala
305 310 315 320
Pro Arg Ala Thr Leu Glu Gly Glu Leu Pro Arg Asn Trp Ser Ala Cys
325 330 335
Gln Ser Leu Glu Met Ile Asn Leu Gly Glu Asn Leu Phe Ser Gly Gly
340 345 350
Ile Pro Asn Gly Leu Val Glu Cys Ser His Leu Lys Phe Leu Asn Leu
355 360 365
Ser Ser Asn Lys Leu Thr Gly Ala Ile Asp Pro Ser Leu Thr Val Pro
370 375 380
Cys Met Asp Val Phe Asp Val Ser Gly Asn Arg Phe Ser Gly Ala Met
385 390 395 400
Pro Val Phe Glu Gln Lys Gly Cys Pro Ser Ser Gln Leu Pro Phe Asp
405 410 415
Asp Leu Val Ser Glu Tyr Ser Ser Phe Phe Ser Tyr Gln Ala Leu Ala
420 425 430
Gly Phe Arg Ser Ser Ser Phe Val Leu Gly Thr Asp Leu Thr Ser Tyr
435 440 445
His Ser Phe Ala Gln Asn Asn Phe Thr Gly Pro Val Lys Ser Leu Pro
450 455 460
Leu Ala Ala Asp Lys Leu Gly Met Gln Gly Ser Tyr Ala Phe Leu Ala
465 470 475 480
Asp Gly Asn Asn Ile Ala Gly Gln Leu Gln Pro Asp Leu Phe Ser Lys
485 490 495
Cys Asn Ser Ser Arg Gly Phe Ile Val Asp Val Ser Asn Asn Leu Ile
500 505 510
Thr Gly Gly Ile Pro Val Glu Ile Gly Ser Leu Cys Ser Ser Leu Val
515 520 525
Val Leu Gly Val Ala Gly Asn Gln Leu Ser Gly Leu Ile Pro Thr Ser
530 535 540
Ile Gly Gln Leu Asn Tyr Leu Ile Ser Leu Asp Leu Ser Arg Asn His
545 550 555 560
Leu Gly Gly Glu Ile Pro Thr Ser Val Lys Asn Leu Pro Asn Leu Glu
565 570 575
Arg Leu Ser Leu Gly His Asn Phe Leu Asn Gly Thr Ile Pro Thr Glu
580 585 590
Ile Asn Gln Leu Tyr Ser Leu Lys Val Leu Asp Leu Ser Ser Asn Leu
595 600 605
Leu Thr Gly Glu Ile Pro Gly Ala Leu Ala Asp Leu Arg Asn Leu Thr
610 615 620
Ala Leu Leu Leu Asp Asn Asn Lys Leu Thr Gly Lys Ile Pro Ser Ala
625 630 635 640
Phe Ala Lys Ser Met Ser Leu Thr Met Phe Asn Leu Ser Phe Asn Asn
645 650 655
Leu Ser Gly Pro Val Pro Ala Asn Ser Asn Thr Val Arg Cys Asp Ser
660 665 670
Val Ile Gly Asn Pro Leu Leu Gln Ser Cys His Met Tyr Thr Leu Ala
675 680 685
Val Pro Ser Ala Ala Gln Gln Gly Arg Gly Leu Asn Ser Asn Asp Tyr
690 695 700
Asn Asp Thr Ser Ser Ala Asp Ser Gln Asn Gln Gly Gly Ser Asn Ser
705 710 715 720
Phe Asn Ala Ile Glu Ile Ala Ser Ile Thr Ser Ala Thr Ala Ile Val
725 730 735
Ser Val Leu Leu Ala Leu Ile Val Leu Phe Ile Tyr Thr Arg Lys Cys
740 745 750
Ala Pro Arg Met Ser Ser Arg Ser Ser Arg Arg Arg Glu Val Ile Thr
755 760 765
Phe Gln Asp Ile Gly Val Pro Ile Thr Tyr Glu Thr Val Val Arg Ala
770 775 780
Thr Gly Ser Phe Asn Ala Ser Asn Cys Ile Gly Ser Gly Gly Phe Gly
785 790 795 800
Ala Thr Tyr Lys Ala Glu Ile Ser Pro Gly Val Leu Val Ala Ile Lys
805 810 815
Arg Leu Ser Val Gly Arg Phe Gln Gly Val Gln Gln Phe His Ala Glu
820 825 830
Ile Lys Thr Leu Gly Arg Leu Arg His Pro Asn Leu Val Thr Leu Val
835 840 845
Gly Tyr His Leu Gly Glu Ser Glu Met Phe Leu Ile Tyr Asn Tyr Leu
850 855 860
Pro Gly Gly Asn Leu Glu Arg Phe Ile Gln Glu Arg Ser Lys Arg Pro
865 870 875 880
Val Asp Trp Lys Met Leu His Lys Ile Ala Leu Asp Ile Ala Lys Ala
885 890 895
Leu Ala Tyr Leu His Asp Thr Cys Val Pro Arg Ile Leu His Arg Asp
900 905 910
Val Lys Pro Ser Asn Ile Leu Leu Asp Thr Glu Tyr Asn Ala Tyr Leu
915 920 925
Ser Asp Phe Gly Leu Ala Arg Leu Leu Gly Asn Ser Glu Thr His Ala
930 935 940
Thr Thr Gly Val Ala Gly Thr Phe Gly Tyr Val Ala Pro Glu Tyr Ala
945 950 955 960
Met Thr Cys Arg Val Ser Asp Lys Ala Asp Val Tyr Ser Tyr Gly Val
965 970 975
Val Leu Met Glu Leu Ile Ser Asp Lys Lys Ala Leu Asp Pro Ser Phe
980 985 990
Ser Pro Tyr Gly Asn Gly Phe Asn Ile Val Ala Trp Ala Cys Met Leu
995 1000 1005
Leu Arg Gln Gly Arg Ala Arg Glu Phe Phe Ile Asp Gly Leu Trp
1010 1015 1020
Asp Val Gly Pro His Asp Asp Leu Val Glu Thr Leu His Leu Ala
1025 1030 1035
Val Met Cys Thr Val Asp Ser Leu Ser Val Arg Pro Thr Met Lys
1040 1045 1050
Gln Val Val Gln Arg Leu Lys Gln Leu Gln Pro Pro Ile Arg Glu
1055 1060 1065
His Arg
1070
<210>25
<211>29
<212>DNA
<213>人工序列s
<220>
<223>引物
<400>25
ggggacaagt ttgtacaaaa aagcaggct 29
<210>26
<211>29
<212>DNA
<213>人工序列
<220>
<223>引物
<400>26
ggggaccact ttgtacaaga aagctgggt 29
<210>27
<211>54
<212>DNA
<213>人工序列
<220>
<223>引物
<400>27
ttaaacaagt ttgtacaaaa aagcaggctg caattaaccc tcactaaagg gaac 54
<210>28
<211>53
<212>DNA
<213>人工序列
<220>
<223>引物
<400>28
ttaaaccact ttgtacaaga aagctgggtg cgtaatacga ctcactatag ggc 53
<210>29
<211>12856
<212>DNA
<213>人工序列
<220>
<223>载体
<400>29
cgccttggcg cgccgatcat ccacaagttt gtacaaaaaa gctgaacgag aaacgtaaaa 60
tgatataaat atcaatatat taaattagat tttgcataaa aaacagacta cataatactg 120
taaaacacaa catatccagt cactatggcg gccgcattag gcaccccagg ctttacactt 180
tatgcttccg gctcgtataa tgtgtggatt ttgagttagg atttaaatac gcgttgatcc 240
ggcttactaa aagccagata acagtatgcg tatttgcgcg ctgatttttg cggtataaga 300
atatatactg atatgtatac ccgaagtatg tcaaaaagag gtatgctatg aagcagcgta 360
ttacagtgac agttgacagc gacagctatc agttgctcaa ggcatatatg atgtcaatat 420
ctccggtctg gtaagcacaa ccatgcagaa tgaagcccgt cgtctgcgtg ccgaacgctg 480
gaaagcggaa aatcaggaag ggatggctga ggtcgcccgg tttattgaaa tgaacggctc 540
ttttgctgac gagaacaggg gctggtgaaa tgcagtttaa ggtttacacc tataaaagag 600
agagccgtta tcgtctgttt gtggatgtac agagtgatat cattgacacg cccggtcgac 660
ggatggtgat ccccctggcc agtgcacgtc tgctgtcaga taaagtctcc cgtgaacttt 720
acccggtggt gcatatcggg gatgaaagct ggcgcatgat gaccaccgat atggccagtg 780
tgccggtctc cgttatcggg gaagaagtgg ctgatctcag ccaccgcgaa aatgacatca 840
aaaacgccat taacctgatg ttctggggaa tataaatgtc aggctccctt atacacagcc 900
agtctgcagg tcgaccatag tgactggata tgttgtgttt tacagtatta tgtagtctgt 960
tttttatgca aaatctaatt taatatattg atatttatat cattttacgt ttctcgttca 1020
gctttcttgt acaaagtggt gttaacctag acttgtccat cttctggatt ggccaactta 1080
attaatgtat gaaataaaag gatgcacaca tagtgacatg ctaatcacta taatgtgggc 1140
atcaaagttg tgtgttatgt gtaattacta gttatctgaa taaaagagaa agagatcatc 1200
catatttctt atcctaaatg aatgtcacgt gtctttataa ttctttgatg aaccagatgc 1260
atttcattaa ccaaatccat atacatataa atattaatca tatataatta atatcaattg 1320
ggttagcaaa acaaatctag tctaggtgtg ttttgcgaat tgcggccgcc accgcggtgg 1380
agctcgaatt ccggtccggg tcacctttgt ccaccaagat ggaactgcgg ccgctcatta 1440
attaagtcag gcgcgcctct agttgaagac acgttcatgt cttcatcgta agaagacact 1500
cagtagtctt cggccagaat ggccatctgg attcagcagg cctagaaggc catttaaatc 1560
ctgaggatct ggtcttccta aggacccggg atatcggacc gattaaactt taattcggtc 1620
cgaagcttga agttcctatt ccgaagttcc tattctccag aaagtatagg aacttcgcat 1680
gcctgcagtg cagcgtgacc cggtcgtgcc cctctctaga gataatgagc attgcatgtc 1740
taagttataa aaaattacca catatttttt ttgtcacact tgtttgaagt gcagtttatc 1800
tatctttata catatattta aactttactc tacgaataat ataatctata gtactacaat 1860
aatatcagtg ttttagagaa tcatataaat gaacagttag acatggtcta aaggacaatt 1920
gagtattttg acaacaggac tctacagttt tatcttttta gtgtgcatgt gttctccttt 1980
ttttttgcaa atagcttcac ctatataata cttcatccat tttattagta catccattta 2040
gggtttaggg ttaatggttt ttatagacta atttttttag tacatctatt ttattctatt 2100
ttagcctcta aattaagaaa actaaaactc tattttagtt tttttattta ataatttaga 2160
tataaaatag aataaaataa agtgactaaa aattaaacaa atacccttta agaaattaaa 2220
aaaactaagg aaacattttt cttgtttcga gtagataatg ccagcctgtt aaacgccgtc 2280
gacgagtcta acggacacca accagcgaac cagcagcgtc gcgtcgggcc aagcgaagca 2340
gacggcacgg catctctgtc gctgcctctg gacccctctc gagagttccg ctccaccgtt 2400
ggacttgctc cgctgtcggc atccagaaat tgcgtggcgg agcggcagac gtgagccggc 2460
acggcaggcg gcctcctcct cctctcacgg caccggcagc tacgggggat tcctttccca 2520
ccgctccttc gctttccctt cctcgcccgc cgtaataaat agacaccccc tccacaccct 2580
ctttccccaa cctcgtgttg ttcggagcgc acacacacac aaccagatct cccccaaatc 2640
cacccgtcgg cacctccgct tcaaggtacg ccgctcgtcc tccccccccc ccctctctac 2700
cttctctaga tcggcgttcc ggtccatgca tggttagggc ccggtagttc tacttctgtt 2760
catgtttgtg ttagatccgt gtttgtgtta gatccgtgct gctagcgttc gtacacggat 2820
gcgacctgta cgtcagacac gttctgattg ctaacttgcc agtgtttctc tttggggaat 2880
cctgggatgg ctctagccgt tccgcagacg ggatcgattt catgattttt tttgtttcgt 2940
tgcatagggt ttggtttgcc cttttccttt atttcaatat atgccgtgca cttgtttgtc 3000
gggtcatctt ttcatgcttt tttttgtctt ggttgtgatg atgtggtctg gttgggcggt 3060
cgttctagat cggagtagaa ttctgtttca aactacctgg tggatttatt aattttggat 3120
ctgtatgtgt gtgccataca tattcatagt tacgaattga agatgatgga tggaaatatc 3180
gatctaggat aggtatacat gttgatgcgg gttttactga tgcatataca gagatgcttt 3240
ttgttcgctt ggttgtgatg atgtggtgtg gttgggcggt cgttcattcg ttctagatcg 3300
gagtagaata ctgtttcaaa ctacctggtg tatttattaa ttttggaact gtatgtgtgt 3360
gtcatacatc ttcatagtta cgagtttaag atggatggaa atatcgatct aggataggta 3420
tacatgttga tgtgggtttt actgatgcat atacatgatg gcatatgcag catctattca 3480
tatgctctaa ccttgagtac ctatctatta taataaacaa gtatgtttta taattatttt 3540
gatcttgata tacttggatg atggcatatg cagcagctat atgtggattt ttttagccct 3600
gccttcatac gctatttatt tgcttggtac tgtttctttt gtcgatgctc accctgttgt 3660
ttggtgttac ttctgcaggt cgactttaac ttagcctagg atccacacga caccatgata 3720
gaggtgaaac cgattaacgc agaggatacc tatgaactaa ggcatagaat actcagacca 3780
aaccagccga tagaagcgtg tatgtttgaa agcgatttac ttcgtggtgc atttcactta 3840
ggcggctatt acgggggcaa actgatttcc atagcttcat tccaccaggc cgagcactca 3900
gaactccaag gccagaaaca gtaccagctc cgaggtatgg ctaccttgga aggttatcgt 3960
gagcagaagg cgggatcgag tctaattaaa cacgctgaag aaattcttcg taagaggggg 4020
gcggacttgc tttggtgtaa tgcgcggaca tccgcctcag gctactacaa aaagttaggc 4080
ttcagcgagc agggagaggt attcgacacg ccgccagtag gacctcacat cctgatgtat 4140
aaaaggatca cataactagc tagtcagtta acctagactt gtccatcttc tggattggcc 4200
aacttaatta atgtatgaaa taaaaggatg cacacatagt gacatgctaa tcactataat 4260
gtgggcatca aagttgtgtg ttatgtgtaa ttactagtta tctgaataaa agagaaagag 4320
atcatccata tttcttatcc taaatgaatg tcacgtgtct ttataattct ttgatgaacc 4380
agatgcattt cattaaccaa atccatatac atataaatat taatcatata taattaatat 4440
caattgggtt agcaaaacaa atctagtcta ggtgtgtttt gcgaattcag agctcgaatt 4500
cattccgatt aatcgtggcc tcttgctctt caggatgaag agctatgttt aaacgtgcaa 4560
gcgctactag acaattcagt acattaaaaa cgtccgcaat gtgttattaa gttgtctaag 4620
cgtcaatttg tttacaccac aatatatcct gccaccagcc agccaacagc tccccgaccg 4680
gcagctcggc acaaaatcac cactcgatac aggcagccca tcagtccggg acggcgtcag 4740
cgggagagcc gttgtaaggc ggcagacttt gctcatgtta ccgatgctat tcggaagaac 4800
ggcaactaag ctgccgggtt tgaaacacgg atgatctcgc ggagggtagc atgttgattg 4860
taacgatgac agagcgttgc tgcctgtgat caaatatcat ctccctcgca gagatccgaa 4920
ttatcagcct tcttattcat ttctcgctta accgtgacag gctgtcgatc ttgagaacta 4980
tgccgacata ataggaaatc gctggataaa gccgctgagg aagctgagtg gcgctatttc 5040
tttagaagtg aacgttgacg atcgtcgacc gtaccccgat gaattaattc ggacgtacgt 5100
tctgaacaca gctggatact tacttgggcg attgtcatac atgacatcaa caatgtaccc 5160
gtttgtgtaa ccgtctcttg gaggttcgta tgacactagt ggttcccctc agcttgcgac 5220
tagatgttga ggcctaacat tttattagag agcaggctag ttgcttagat acatgatctt 5280
caggccgtta tctgtcaggg caagcgaaaa ttggccattt atgacgacca atgccccgca 5340
gaagctccca tctttgccgc catagacgcc gcgcccccct tttggggtgt agaacatcct 5400
tttgccagat gtggaaaaga agttcgttgt cccattgttg gcaatgacgt agtagccggc 5460
gaaagtgcga gacccatttg cgctatatat aagcctacga tttccgttgc gactattgtc 5520
gtaattggat gaactattat cgtagttgct ctcagagttg tcgtaatttg atggactatt 5580
gtcgtaattg cttatggagt tgtcgtagtt gcttggagaa atgtcgtagt tggatgggga 5640
gtagtcatag ggaagacgag cttcatccac taaaacaatt ggcaggtcag caagtgcctg 5700
ccccgatgcc atcgcaagta cgaggcttag aaccaccttc aacagatcgc gcatagtctt 5760
ccccagctct ctaacgcttg agttaagccg cgccgcgaag cggcgtcggc ttgaacgaat 5820
tgttagacat tatttgccga ctaccttggt gatctcgcct ttcacgtagt gaacaaattc 5880
ttccaactga tctgcgcgcg aggccaagcg atcttcttgt ccaagataag cctgcctagc 5940
ttcaagtatg acgggctgat actgggccgg caggcgctcc attgcccagt cggcagcgac 6000
atccttcggc gcgattttgc cggttactgc gctgtaccaa atgcgggaca acgtaagcac 6060
tacatttcgc tcatcgccag cccagtcggg cggcgagttc catagcgtta aggtttcatt 6120
tagcgcctca aatagatcct gttcaggaac cggatcaaag agttcctccg ccgctggacc 6180
taccaaggca acgctatgtt ctcttgcttt tgtcagcaag atagccagat caatgtcgat 6240
cgtggctggc tcgaagatac ctgcaagaat gtcattgcgc tgccattctc caaattgcag 6300
ttcgcgctta gctggataac gccacggaat gatgtcgtcg tgcacaacaa tggtgacttc 6360
tacagcgcgg agaatctcgc tctctccagg ggaagccgaa gtttccaaaa ggtcgttgat 6420
caaagctcgc cgcgttgttt catcaagcct tacagtcacc gtaaccagca aatcaatatc 6480
actgtgtggc ttcaggccgc catccactgc ggagccgtac aaatgtacgg ccagcaacgt 6540
cggttcgaga tggcgctcga tgacgccaac tacctctgat agttgagtcg atacttcggc 6600
gatcaccgct tccctcatga tgtttaactc ctgaattaag ccgcgccgcg aagcggtgtc 6660
ggcttgaatg aattgttagg cgtcatcctg tgctcccgag aaccagtacc agtacatcgc 6720
tgtttcgttc gagacttgag gtctagtttt atacgtgaac aggtcaatgc cgccgagagt 6780
aaagccacat tttgcgtaca aattgcaggc aggtacattg ttcgtttgtg tctctaatcg 6840
tatgccaagg agctgtctgc ttagtgccca ctttttcgca aattcgatga gactgtgcgc 6900
gactcctttg cctcggtgcg tgtgcgacac aacaatgtgt tcgatagagg ctagatcgtt 6960
ccatgttgag ttgagttcaa tcttcccgac aagctcttgg tcgatgaatg cgccatagca 7020
agcagagtct tcatcagagt catcatccga gatgtaatcc ttccggtagg ggctcacact 7080
tctggtagat agttcaaagc cttggtcgga taggtgcaca tcgaacactt cacgaacaat 7140
gaaatggttc tcagcatcca atgtttccgc cacctgctca gggatcaccg aaatcttcat 7200
atgacgccta acgcctggca cagcggatcg caaacctggc gcggcttttg gcacaaaagg 7260
cgtgacaggt ttgcgaatcc gttgctgcca cttgttaacc cttttgccag atttggtaac 7320
tataatttat gttagaggcg aagtcttggg taaaaactgg cctaaaattg ctggggattt 7380
caggaaagta aacatcacct tccggctcga tgtctattgt agatatatgt agtgtatcta 7440
cttgatcggg ggatctgctg cctcgcgcgt ttcggtgatg acggtgaaaa cctctgacac 7500
atgcagctcc cggagacggt cacagcttgt ctgtaagcgg atgccgggag cagacaagcc 7560
cgtcagggcg cgtcagcggg tgttggcggg tgtcggggcg cagccatgac ccagtcacgt 7620
agcgatagcg gagtgtatac tggcttaact atgcggcatc agagcagatt gtactgagag 7680
tgcaccatat gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac cgcatcaggc 7740
gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg 7800
tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa 7860
agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg 7920
cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga 7980
ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg 8040
tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg 8100
gaagcgtggc gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc 8160
gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg 8220
gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca 8280
ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt 8340
ggcctaacta cggctacact agaaggacag tatttggtat ctgcgctctg ctgaagccag 8400
ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg 8460
gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc 8520
ctttgatctt ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt 8580
tggtcatgag attatcaaaa aggatcttca cctagatcct tttaaattaa aaatgaagtt 8640
ttaaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca 8700
gtgaggcacc tatctcagcg atctgtctat ttcgttcatc catagttgcc tgactccccg 8760
tcgtgtagat aactacgata cgggagggct taccatctgg ccccagtgct gcaatgatac 8820
cgcgagaccc acgctcaccg gctccagatt tatcagcaat aaaccagcca gccggaaggg 8880
ccgagcgcag aagtggtcct gcaactttat ccgcctccat ccagtctatt aattgttgcc 8940
gggaagctag agtaagtagt tcgccagtta atagtttgcg caacgttgtt gccattgctg 9000
cagggggggg gggggggggg gacttccatt gttcattcca cggacaaaaa cagagaaagg 9060
aaacgacaga ggccaaaaag cctcgctttc agcacctgtc gtttcctttc ttttcagagg 9120
gtattttaaa taaaaacatt aagttatgac gaagaagaac ggaaacgcct taaaccggaa 9180
aattttcata aatagcgaaa acccgcgagg tcgccgcccc gtaacctgtc ggatcaccgg 9240
aaaggacccg taaagtgata atgattatca tctacatatc acaacgtgcg tggaggccat 9300
caaaccacgt caaataatca attatgacgc aggtatcgta ttaattgatc tgcatcaact 9360
taacgtaaaa acaacttcag acaatacaaa tcagcgacac tgaatacggg gcaacctcat 9420
gtcccccccc cccccccccc tgcaggcatc gtggtgtcac gctcgtcgtt tggtatggct 9480
tcattcagct ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa 9540
aaagcggtta gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta 9600
tcactcatgg ttatggcagc actgcataat tctcttactg tcatgccatc cgtaagatgc 9660
ttttctgtga ctggtgagta ctcaaccaag tcattctgag aatagtgtat gcggcgaccg 9720
agttgctctt gcccggcgtc aacacgggat aataccgcgc cacatagcag aactttaaaa 9780
gtgctcatca ttggaaaacg ttcttcgggg cgaaaactct caaggatctt accgctgttg 9840
agatccagtt cgatgtaacc cactcgtgca cccaactgat cttcagcatc ttttactttc 9900
accagcgttt ctgggtgagc aaaaacagga aggcaaaatg ccgcaaaaaa gggaataagg 9960
gcgacacgga aatgttgaat actcatactc ttcctttttc aatattattg aagcatttat 10020
cagggttatt gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata 10080
ggggttccgc gcacatttcc ccgaaaagtg ccacctgacg tctaagaaac cattattatc 10140
atgacattaa cctataaaaa taggcgtatc acgaggccct ttcgtcttca agaattggtc 10200
gacgatcttg ctgcgttcgg atattttcgt ggagttcccg ccacagaccc ggattgaagg 10260
cgagatccag caactcgcgc cagatcatcc tgtgacggaa ctttggcgcg tgatgactgg 10320
ccaggacgtc ggccgaaaga gcgacaagca gatcacgctt ttcgacagcg tcggatttgc 10380
gatcgaggat ttttcggcgc tgcgctacgt ccgcgaccgc gttgagggat caagccacag 10440
cagcccactc gaccttctag ccgacccaga cgagccaagg gatctttttg gaatgctgct 10500
ccgtcgtcag gctttccgac gtttgggtgg ttgaacagaa gtcattatcg tacggaatgc 10560
caagcactcc cgaggggaac cctgtggttg gcatgcacat acaaatggac gaacggataa 10620
accttttcac gcccttttaa atatccgtta ttctaataaa cgctcttttc tcttaggttt 10680
acccgccaat atatcctgtc aaacactgat agtttaaact gaaggcggga aacgacaatc 10740
tgatcatgag cggagaatta agggagtcac gttatgaccc ccgccgatga cgcgggacaa 10800
gccgttttac gtttggaact gacagaaccg caacgttgaa ggagccactc agcaagctgg 10860
tacgattgta atacgactca ctatagggcg aattgagcgc tgtttaaacg ctcttcaact 10920
ggaagagcgg ttacccggac cgaagcttga agttcctatt ccgaagttcc tattctctag 10980
aaagtatagg aacttcagat ctcgatgctc accctgttgt ttggtgttac ttctgcaggt 11040
cgactctaga ggatccacca tgagcccaga acgacgcccg gccgacatcc gccgtgccac 11100
cgaggcggac atgccggcgg tctgcaccat cgtcaaccac tacatcgaga caagcacggt 11160
caacttccgt accgagccgc aggaaccgca ggactggacg gacgacctcg tccgtctgcg 11220
ggagcgctat ccctggctcg tcgccgaggt ggacggcgag gtcgccggca tcgcctacgc 11280
gggcccctgg aaggcacgca acgcctacga ctggacggcc gagtcgaccg tgtacgtctc 11340
cccccgccac cagcggacgg gactgggctc cacgctctac acccacctgc tgaagtccct 11400
ggaggcacag ggcttcaaga gcgtggtcgc tgtcatcggg ctgcccaacg acccgagcgt 11460
gcgcatgcac gaggcgctcg gatatgcccc ccgcggcatg ctgcgggcgg ccggcttcaa 11520
gcacgggaac tggcatgacg tgggtttctg gcagctggac ttcagcctgc cggtaccgcc 11580
ccgtccggtc ctgcccgtca ccgagatctg atccgtcgac caacctagac ttgtccatct 11640
tctggattgg ccaacttaat taatgtatga aataaaagga tgcacacata gtgacatgct 11700
aatcactata atgtgggcat caaagttgtg tgttatgtgt aattactagt tatctgaata 11760
aaagagaaag agatcatcca tatttcttat cctaaatgaa tgtcacgtgt ctttataatt 11820
ctttgatgaa ccagatgcat ttcattaacc aaatccatat acatataaat attaatcata 11880
tataattaat atcaattggg ttagcaaaac aaatctagtc taggtgtgtt ttgcgaattg 11940
cggccgcgat ctggggaatt cccatggaca ccggtaattc ccatgatctt ctctccttca 12000
tcaatggatg ccatgtttca taacaataac accaaatgtt tgatgagcta ccaacaattg 12060
cgcaaagact atggctaagc tcgagctcgc tcgctacaag ttgttgactt tcaaatacaa 12120
gtttgttttt ggaacaccaa atattctaca tgatctttca ctaagttgcg caccactatc 12180
aaaagattat ctaggccatt attcaagtaa agagtgaaca cgtctaagac ccacaaccac 12240
accaaataga atacgcatac atgcaacata ttgtgcaaga agtatccaac tggactccca 12300
tgtattctaa aactattttc gtagagttaa agttatgaca aacttatcaa ataaaaattt 12360
gaacgctgga ccaaaacttt catctttcaa atccaccatc gtctatcctc ataaattgtt 12420
ttgattataa cacatctacg taaatcattt gttttgaaca atactaattt aattttatta 12480
agtcaaataa cctgcttaga aaataatccc tccacctcat ttaacaattt cttgtcaaac 12540
acacaccaag aaaaaaatta atgaaagaga aaagaaatga aaaggacatg gagttgaata 12600
ctagcaaaat tgattgaagg aagattcaca attgaaattg aaaccattta atttattttc 12660
gggtccataa taataaattg gtaagaataa aaacccgatc aagtccggta cagtacaatt 12720
ccactccacc aactccttac ttaaacccct atttataccc actctcatcc tcactcttcc 12780
ttcacctctc acactctctt ctctctctca aaaccctcac acaaacgctg cgtttagtgt 12840
aagaaattca atccgg 12856
<210>30
<211>825
<212>DNA
<213>玉米
<400>30
aaatccttac agaattgctg tagtttcata gtgctagatg tggacagcaa agcgccgctg 60
tatgcttctg cttttctttt ttggtgtgtg tagccacatc ctttgttcct gcccggcgcc 120
atcccacttg gttgtttttt tttatgattg aaagccttca tgcttcctcg gtcaatcacc 180
ggtgcgcact gggagcatcg ccggaaaaaa aattcttcgg ctaagagtaa cttctttctc 240
cttttcttct ctgatctcgc gagcagtgct gataacgtgt tgtaatctac ttagcggtaa 300
cgagattgag agagacaaaa tgacagaact attgtcttta ttgcagagtg tcatgtattt 360
atacagggga tacaaagtct cccaaggggt gtgtcccttg ggagtaactg ccagttgatc 420
acaggacaat attttgtaac aaaacgtaca catcgtcaaa atagcgaggc atgaaactgg 480
ccttggccat ggacgcgtga agcgcgccat gcgttggata tgtggtcaat aagtatatac 540
aatacaatgt ttaacagagc tgatagtact gctttggcac atttttgtcc acgcttcatg 600
agagataaaa cacctgcacg taaattcaca tgctgcactg aaggcccgat cactgaggag 660
cgaactgccg taactccctt ctatatatac ccccagtccc tgtttcagtt ttcgtcaagc 720
tagcagcacc aagttgtcga tcacttgcct gctcttgagc tcgattaagc tatcatcagc 780
tacagcatcc gatcccaaac tgcaactgta gcagcgacaa ctgcc 825
<210>31
<211>860
<212>DNA
<213>玉米
<400>31
ctggtaatta ttggctgtag gattctaaac agagcctaaa tagctggaat agctctagcc 60
ctcaatccaa actaatgata tctatactta tgcaactcta aatttttatt ctaaaagtaa 120
tatttcattt ttgtcaacga gattctctac tctattccac aatcttttga agcaatattt 180
accttaaatc tgtactctat accaataatc atatattcta ttatttattt ttatctctct 240
cctaaggagc atccccctat gtctgcatgg cccccgcctc gggtcccaat ctcttgctct 300
gctagtagca cagaagaaaa cactagaaat gacttgcttg acttagagta tcagataaac 360
atcatgttta cttaacttta atttgtatcg gtttctacta tttttataat atttttgtct 420
ctatagatac tacgtgcaac agtataatca acctagttta atccagagcg aaggattttt 480
tactaagtac gtgactccat atgcacagcg ttccttttat ggttcctcac tgggcacagc 540
ataaacgaac cctgtccaat gttttcagcg cgaacaaaca gaaattccat cagcgaacaa 600
acaacataca tgcgagatga aaataaataa taaaaaaagc tccgtctcga taggccggca 660
cgaatcgaga gcctccatag ccagtttttt ccatcggaac ggcggttcgc gcacctaatt 720
atatgcacca cacgcctata aagccaacca acccgtcgga ggggcgcaag ccagacagaa 780
gacagcccgt cagcccctct cgtttttcat ccgccttcgc ctccaaccgc gtgcgctcca 840
cgcctcctcc aggaaagcga 860
<210>32
<211>899
<212>DNA
<213>玉米
<400>32
gtgcagcgtg acccggtcgt gcccctctct agagataatg agcattgcat gtctaagtta 60
taaaaaatta ccacatattt tttttgtcac acttgtttga agtgcagttt atctatcttt 120
atacatatat ttaaacttta ctctacgaat aatataatct atagtactac aataatatca 180
gtgttttaga gaatcatata aatgaacagt tagacatggt ctaaaggaca attgagtatt 240
ttgacaacag gactctacag ttttatcttt ttagtgtgca tgtgttctcc tttttttttg 300
caaatagctt cacctatata atacttcatc cattttatta gtacatccat ttagggttta 360
gggttaatgg tttttataga ctaatttttt tagtacatct attttattct attttagcct 420
ctaaattaag aaaactaaaa ctctatttta gtttttttat ttaataattt agatataaaa 480
tagaataaaa taaagtgact aaaaattaaa caaataccct ttaagaaatt aaaaaaacta 540
aggaaacatt tttcttgttt cgagtagata atgccagcct gttaaacgcc gtcgacgagt 600
ctaacggaca ccaaccagcg aaccagcagc gtcgcgtcgg gccaagcgaa gcagacggca 660
cggcatctct gtcgctgcct ctggacccct ctcgagagtt ccgctccacc gttggacttg 720
ctccgctgtc ggcatccaga aattgcgtgg cggagcggca gacgtgagcc ggcacggcag 780
gcggcctcct cctcctctca cggcacggca gctacggggg attcctttcc caccgctcct 840
tcgctttccc ttcctcgccc gccgtaataa atagacaccc cctccacacc ctctttccc 899
<210>33
<211>879
<212>DNA
<213>紫花苜蓿
<400>33
aattcccatg atcttctctc cttcatcaat ggatgccatg tttcataaca ataacaccaa 60
atgtttgatg agctaccaac aattgcgcaa agactatggc taagctcgag ctcgctcgct 120
acaagttgtt gactttcaaa tacaagtttg tttttggaac accaaatatt ctacatgatc 180
tttcactaag ttgcgcacca ctatcaaaag attatctagg ccattattca agtaaagagt 240
gaacacgtct aagacccaca accacaccaa atagaatacg catacatgca acatattgtg 300
caagaagtat ccaactggac tcccatgtat tctaaaacta ttttcgtaga gttaaagtta 360
tgacaaactt atcaaataaa aatttgaacg ctggaccaaa actttcatct ttcaaatcca 420
ccatcgtcta tcctcataaa ttgttttgat tataacacat ctacgtaaat catttgtttt 480
gaacaatact aatttaattt tattaagtca aataacctgc ttagaaaata atccctccac 540
ctcatttaac aatttcttgt caaacacaca ccaagaaaaa aattaatgaa agagaaaaga 600
aatgaaaagg acatggagtt gaatactagc aaaattgatt gaaggaagat tcacaattga 660
aattgaaacc atttaattta ttttcgggtc cataataata aattggtaag aataaaaacc 720
cgatcaagtc cggtacagta caattccact ccaccaactc cttacttaaa cccctattta 780
tacccactct catcctcact cttccttcac ctctcacact ctcttctctc tctcaaaacc 840
ctcacacaaa cgctgcgttt agtgtaagaa attcaatcc 879
<210>34
<211>318
<212>DNA
<213>马铃薯
<400>34
agacttgtcc atcttctgga ttggccaact taattaatgt atgaaataaa aggatgcaca 60
catagtgaca tgctaatcac tataatgtgg gcatcaaagt tgtgtgttat gtgtaattac 120
tagttatctg aataaaagag aaagagatca tccatatttc ttatcctaaa tgaatgtcac 180
gtgtctttat aattctttga tgaaccagat gcatttcatt aaccaaatcc atatacatat 240
aaatattaat catatataat taatatcaat tgggttagca aaacaaatct agtctaggtg 300
tgttttgcga attgcggc 318
<210>35
<211>2228
<212>DNA
<213>玉米
<400>35
gtttctcttt tgtttcgcac ccctccatcc ttcactcctc gctggtccag ccattaatgc 60
caccacctgc tatacatcac taccttcctc ttccctcccg cagtctccga cctctctctc 120
tcctctctct ttctttttct ctctctccct cgctttttct gctgtctatc cccgcgccgc 180
gcgaggttgc tgccttggcc ttgctggttc tttgcttgtg ggatgagtgg aaggaaggcc 240
ggtcctcgct gcagaaagag ggagaagact tcatctgcac gtgctgctga tgtgcattag 300
aggatccgtg ctgataatat acagctgaca cccacaagat cgactggata agttcgagcc 360
tgacggcatc atgggttccg gcgatttgag ctcggagatg gagaggacgg ttcttggcct 420
gacgctgtgg gtctggattg ccatcggcgt ggtcgcgctt ctcgtggcaa tcctgctgat 480
gatctgcatt tgggtggcgt cccgcctgag gaccaagaga accatggaca gcctgaggca 540
gacgcagatc cccatctact ccaaggagat ccccatcgac cgggtcggcg ggcgcagcct 600
ggcacagaca atgcatgagc gcgagcagcc gagcttgccg ccgccggaca agtacgcgaa 660
ccgggagccg gccggggcgg ctgtggggca cctggcactc agcaagtcct cggatcatga 720
caacatgagc caggggagct cggtttgcaa tgctgaccgt gctggcactg gcagcatgca 780
ctccggtgag gatgggagct caggacctcg gaggaaaccc aactctcctg cggcgttcgt 840
gtcagcttcg cccttggttg gccttccgga gttctcgcat cttggttggg gtcactggtt 900
tactcagcgt gaccttgaac ttgctaccaa ccgcttctct aaggagaatg tgctcgggga 960
gggtggttat ggagttgtgt accgtggacg tctggtgaat ggaactgaag tcgcaataaa 1020
aaagatcttt aacaacatgg ggcaggcaga aaaagaattc agagtggaag ttgaggctat 1080
tggccatgtc cgacataaga atttggttcg gctgttggga tattgtgttg agggcgtaaa 1140
caggatgcta gtatatgagt tcgtgaacaa tggtaattta gagcagtggc ttcatggagc 1200
tatgcatcag cgtggtgttt ttagctggga aaatcgcatg aaggttgtaa caggcactgc 1260
aaaagcgctt gcttaccttc atgaagctat tgagccaaaa gttgtacatc gagatataaa 1320
atcaagcaac atattgattg atgatgaatt taatggcaaa gtatctgact ttggattggc 1380
taaacttttg ggatcagaca aaagccacat tactactaga gtgatgggaa catttggata 1440
tgttgcacct gaatatgcta atactggaat gttaaatgaa aagagtgacg tttacagttt 1500
tggcgttctc ttgttagaaa ctgtgacagg aaggaatcct gttgactaca gccgatcttc 1560
caatgaggtc aatcttgttg aatggcttaa aacaatggtg gccaatcgga gggcagagga 1620
agtggctgat ccaagcttag aggctagacc cagtatccgg gctctcaagc gggctctttt 1680
ggtggcactg aggtgtgttg atcctgactc tgaaaagaga cctaagatgg gccaggttgt 1740
taggatgctt gagtcagaag aagtaccata ccgggaggat cggagaaatc gtagaagtcg 1800
cactggaagc atggatattg agtccattgc agagggttcc aactctgcgg agtttggaaa 1860
gaaggtagaa aggactggaa gctctatatc tgacagatct caaccatgag gcctggaaat 1920
ctaagctata tcaggaactg tctcagaaac gatgttaatc accaaggtga ttgctgtcct 1980
gataggggat gctggtaggt gacagaggca aggcaggccc aggtattact gcctaagtat 2040
taggctttag ttttcaggtg cagtagaagc attcttggat gaggaacctc ctggtctcct 2100
cttatttgta taggtatatg ttggtgcctg aggaattcct cttggtttgg gtgtttaaca 2160
tttgttcggt tcttttattt cttgcattgt tgccttctta aaaaaaaaaa aaaaaaaaaa 2220
aaaaaaag 2228
<210>36
<211>512
<212>PRT
<213>玉米
<400>36
Met Gly Ser Gly Asp Leu Ser Ser Glu Met Glu Arg Thr Val Leu Gly
1 5 10 15
Leu Thr Leu Trp Val Trp Ile Ala Ile Gly Val Val Ala Leu Leu Val
20 25 30
Ala Ile Leu Leu Met Ile Cys Ile Trp Val Ala Ser Arg Leu Arg Thr
35 40 45
Lys Arg Thr Met Asp Ser Leu Arg Gln Thr Gln Ile Pro Ile Tyr Ser
50 55 60
Lys Glu Ile Pro Ile Asp Arg Val Gly Gly Arg Ser Leu Ala Gln Thr
65 70 75 80
Met His Glu Arg Glu Gln Pro Ser Leu Pro Pro Pro Asp Lys Tyr Ala
85 90 95
Asn Arg Glu Pro Ala Gly Ala Ala Val Gly His Leu Ala Leu Ser Lys
100 105 110
Ser Ser Asp His Asp Asn Met Ser Gln Gly Ser Ser Val Cys Asn Ala
115 120 125
Asp Arg Ala Gly Thr Gly Ser Met His Ser Gly Glu Asp Gly Ser Ser
130 135 140
Gly Pro Arg Arg Lys Pro Asn Ser Pro Ala Ala Phe Val Ser Ala Ser
145 150 155 160
Pro Leu Val Gly Leu Pro Glu Phe Ser His Leu Gly Trp Gly His Trp
165 170 175
Phe Thr Gln Arg Asp Leu Glu Leu Ala Thr Asn Arg Phe Ser Lys Glu
180 185 190
Asn Val Leu Gly Glu Gly Gly Tyr Gly Val Val Tyr Arg Gly Arg Leu
195 200 205
Val Asn Gly Thr Glu Val Ala Ile Lys Lys Ile Phe Asn Asn Met Gly
210 215 220
Gln Ala Glu Lys Glu Phe Arg Val Glu Val Glu Ala Ile Gly His Val
225 230 235 240
Arg His Lys Asn Leu Val Arg Leu Leu Gly Tyr Cys Val Glu Gly Val
245 250 255
Asn Arg Met Leu Val Tyr Glu Phe Val Asn Asn Gly Asn Leu Glu Gln
260 265 270
Trp Leu His Gly Ala Met His Gln Arg Gly Val Phe Ser Trp Glu Asn
275 280 285
Arg Met Lys Val Val Thr Gly Thr Ala Lys Ala Leu Ala Tyr Leu His
290 295 300
Glu Ala Ile Glu Pro Lys Val Val His Arg Asp Ile Lys Ser Ser Asn
305 310 315 320
Ile Leu Ile Asp Asp Glu Phe Asn Gly Lys Val Ser Asp Phe Gly Leu
325 330 335
Ala Lys Leu Leu Gly Ser Asp Lys Ser His Ile Thr Thr Arg Val Met
340 345 350
Gly Thr Phe Gly Tyr Val Ala Pro Glu Tyr Ala Asn Thr Gly Met Leu
355 360 365
Asn Glu Lys Ser Asp Val Tyr Ser Phe Gly Val Leu Leu Leu Glu Thr
370 375 380
Val Thr Gly Arg Asn Pro Val Asp Tyr Ser Arg Ser Ser Asn Glu Val
385 390 395 400
Asn Leu Val Glu Trp Leu Lys Thr Met Val Ala Asn Arg Arg Ala Glu
405 410 415
Glu Val Ala Asp Pro Ser Leu Glu Ala Arg Pro Ser Ile Arg Ala Leu
420 425 430
Lys Arg Ala Leu Leu Val Ala Leu Arg Cys Val Asp Pro Asp Ser Glu
435 440 445
Lys Arg Pro Lys Met Gly Gln Val Val Arg Met Leu Glu Ser Glu Glu
450 455 460
Val Pro Tyr Arg Glu Asp Arg Arg Asn Arg Arg Ser Arg Thr Gly Ser
465 470 475 480
Met Asp Ile Glu Ser Ile Ala Glu Gly Ser Asn Ser Ala Glu Phe Gly
485 490 495
Lys Lys Val Glu Arg Thr Gly Ser Ser Ile Ser Asp Arg Ser Gln Pro
500 505 510
<210>37
<211>509
<212>PRT
<213>水稻
<400>37
Met Gly Ser Ser Asp Leu Ser Ser Glu Met Arg Arg Thr Val Leu Gly
1 5 10 15
Leu Thr Leu Trp Val Trp Ile Ala Ile Gly Val Val Ala Leu Leu Val
20 25 30
Ala Ile Leu Leu Met Ile Cys Ile Trp Met Ala Ser Arg Arg Lys Thr
35 40 45
Lys Arg Thr Met Asp Asn Leu Arg Gln Thr Gln Ile Pro Ile Phe Ser
50 55 60
Lys Glu Ile Pro Val Asp Arg Val Gly Gly Arg Ser Leu Ala Gln Thr
65 70 75 80
Met His Glu Arg Glu Gln Pro Ser Phe Pro Pro Gln Asp Lys His Thr
85 90 95
Asn Arg Glu Pro Gly Lys Thr Leu Gly His Met Ala Leu Ser Lys Ser
100 105 110
Ser Glu Pro Asp Asn Met Ser Gln Gly Ser Ser Val Cys Asn Val Asp
115 120 125
Arg Ala Gly Ser Val His Ser Gly Glu Asp Gly Ser Thr Gly His Gly
130 135 140
Arg Lys Pro Tyr Ser Pro Ala Ala Phe Val Ser Ala Ser Pro Leu Val
145 150 155 160
Gly Leu Pro Glu Phe Ser His Leu Gly Trp Gly His Trp Phe Thr Leu
165 170 175
Arg Asp Leu Glu Leu Ala Thr Asn Arg Phe Ser Arg Glu Asn Val Leu
180 185 190
Gly Glu Gly Gly Tyr Gly Val Val Tyr Arg Gly Arg Leu Val Asn Gly
195 200 205
Thr Glu Val Ala Ile Lys Lys Ile Phe Asn Asn Met Gly Gln Ala Glu
210 215 220
Lys Glu Phe Arg Val Glu Val Glu Ala Ile Gly His Val Arg His Lys
225 230 235 240
Asn Leu Val Arg Leu Leu Gly Tyr Cys Val Glu Gly Val Asn Arg Met
245 250 255
Leu Val Tyr Glu Phe Val Asn Asn Gly Asn Leu Glu Gln Trp Leu His
260 265 270
Gly Ala Met Arg Gln His Gly Val Phe Ser Trp Glu Asn Arg Met Lys
275 280 285
Val Val Ile Gly Thr Ala Lys Ala Leu Ala Tyr Leu His Glu Ala Ile
290 295 300
Glu Pro Lys Val Val His Arg Asp Ile Lys Ser Ser Asn Ile Leu Ile
305 310 315 320
Asp Glu Glu Phe Asn Gly Lys Val Ser Asp Phe Gly Leu Ala Lys Leu
325 330 335
Leu Gly Ser Asp Lys Ser His Ile Thr Thr Arg Val Met Gly Thr Phe
340 345 350
Gly Tyr Val Ala Pro Glu Tyr Ala Asn Thr Gly Met Leu Asn Glu Lys
355 360 365
Ser Asp Val Tyr Ser Phe Gly Val Leu Leu Leu Glu Thr Val Thr Gly
370 375 380
Arg Glu Pro Val Asp Tyr Ser Arg Ser Gly Asn Glu Val Asn Leu Val
385 390 395 400
Glu Trp Leu Lys Ile Met Val Ala Asn Arg Arg Ala Glu Glu Val Val
405 410 415
Asp Pro Ile Leu Glu Val Arg Pro Thr Val Arg Ala Ile Lys Arg Ala
420 425 430
Leu Leu Val Ala Leu Arg Cys Val Asp Pro Asp Ser Glu Lys Arg Pro
435 440 445
Lys Met Gly Gln Val Val Arg Met Leu Glu Ser Glu Glu Val Pro Tyr
450 455 460
Arg Glu Asp Arg Arg Asn Arg Arg Ser Arg Thr Gly Ser Met Asp Ile
465 470 475 480
Glu Ser Ile Ala Glu Gly Ser Asn Ser Thr Glu Phe Ala Asn Lys Val
485 490 495
Glu Arg Thr Gly Ser Ser Thr Ser Asp Arg Ser Gln Ser
500 505
Claims (20)
1.改变植物根构造的方法,所述方法包括:
(a)将重组DNA构建体引入到可再生的植物细胞中,所述重组DNA构建体包含可操作地连接至少一种调控序列的多核苷酸,其中所述多核苷酸编码多肽,所述多肽如SEQ ID NO:15的氨基酸序列所示;以及
(b)在步骤(a)之后,从所述可再生的植物细胞再生出转基因植物,其中所述转基因植物在其基因组中包含所述重组DNA构建体,并且在与未包含所述重组DNA构建体的对照植物比较时表现出改变的根构造。
2.权利要求1的方法,所述方法还包括:
(c)获得源自所述转基因植物的子代植物,其中所述子代植物在其基因组中包含所述重组DNA构建体,并且在与未包含所述重组DNA构建体的对照植物比较时表现出改变的根构造。
3.评价植物根构造的方法,所述方法包括:
(a)将重组DNA构建体引入到可再生的植物细胞中,所述重组DNA构建体包含可操作地连接至少一种调控序列的多核苷酸,其中所述多核苷酸编码多肽,所述多肽如SEQ ID NO:15的氨基酸序列所示;
(b)在步骤(a)之后,从所述可再生的植物细胞再生出转基因植物,其中所述转基因植物在其基因组中包含所述重组DNA构建体;以及
(c)评价与未包含所述重组DNA构建体的对照植物比较时所述转基因植物的根构造。
4.权利要求3的方法,所述方法还包括:
(d)获得源自所述转基因植物的子代植物,其中所述子代植物在其基因组中包含所述重组DNA构建体;以及
(e)评价与未包含所述重组DNA构建体的对照植物比较时所述子代植物的根构造。
5.评价植物根构造的方法,所述方法包括:
(a)将重组DNA构建体引入到可再生的植物细胞中,所述重组DNA构建体包含可操作地连接至少一种调控序列的多核苷酸,其中所述多核苷酸编码多肽,所述多肽如SEQ ID NO:15的氨基酸序列所示;
(b)在步骤(a)之后,从所述可再生的植物细胞再生出转基因植物,其中所述转基因植物在其基因组中包含所述重组DNA构建体;
(c)获得源自所述转基因植物的子代植物,其中所述子代植物在其基因组中包含所述重组DNA构建体;以及
(d)评价与未包含所述重组DNA构建体的对照植物比较时所述子代植物的根构造。
6.测定植物农学特性改变的方法,所述方法包括:
(a)将重组DNA构建体引入到可再生的植物细胞中,所述重组DNA构建体包含可操作地连接至少一种调控序列的多核苷酸,其中所述多核苷酸编码多肽,所述多肽如SEQ ID NO:15的氨基酸序列所示;
(b)在步骤(a)之后,从所述可再生的植物细胞再生出转基因植物,其中所述转基因植物在其基因组中包含所述重组DNA构建体;以及
(c)测定所述转基因植物在与未包含所述重组DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变。
7.权利要求6的方法,所述方法还包括:
(d)获得源自所述转基因植物的子代植物,其中所述子代植物在其基因组中包含所述重组DNA构建体;以及
(e)测定所述子代植物在与未包含所述重组DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变。
8.权利要求7的方法,其中所述测定步骤包括:测定所述转基因植物在不同的环境条件下与未包含所述重组DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变。
9.权利要求8的方法,其中所述测定步骤(e)包括:测定所述子代植物在不同的环境条件下与未包含所述重组DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变。
10.测定植物农学特性改变的方法,所述方法包括:
(a)将重组DNA构建体引入到可再生的植物细胞中,所述重组DNA构建体包含可操作地连接至少一种调控序列的多核苷酸,其中所述多核苷酸编码多肽,所述多肽如SEQ ID NO:15的氨基酸序列所示;
(b)在步骤(a)之后,从所述可再生的植物细胞再生出转基因植物,其中所述转基因植物在其基因组中包含所述重组DNA构建体;
(c)获得源自所述转基因植物的子代植物,其中所述子代植物在其基因组中包含所述重组DNA构建体;以及
(d)测定所述子代植物在与未包含所述重组DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变。
11.权利要求10的方法,其中所述测定步骤包括:测定所述转基因植物在不同的环境条件下与未包含所述重组DNA构建体的对照植物比较时是否表现出至少一种农学特性的改变。
12.分离的多核苷酸,所述多核苷酸包含编码LLRK或LLRK样多肽的核酸序列或所述核酸序列的全长互补序列,所述多肽如SEQ IDNO:15的氨基酸序列所示。
13.权利要求12的多核苷酸,其中所述核酸序列是SEQ ID NO:14。
14.包含权利要求12的多核苷酸的载体。
15.重组DNA构建体,所述重组DNA构建体包含与至少一种调控序列可操作地连接的权利要求12的多核苷酸。
16.用于转化细胞的方法,所述方法包括用权利要求12的多核苷酸来转化细胞。
17.细胞,所述细胞包含权利要求16的重组DNA构建体,其中所述细胞不是种子细胞。
18.用于生产植物的方法,所述方法包括使用权利要求12的多核苷酸来转化植物细胞以及从转化过的植物细胞再生植物。
19.对基因变异进行作图的方法,所述基因变异表现出植物根构造的改变和/或植物至少一种农学特性的改变,所述方法包括:
(a)使两种植物品种杂交;以及
(b)在得自步骤(a)的杂交的子代植物中针对以下序列来评价基因变异:
(i)SEQ ID NO:14的核酸序列;或
(ii)编码SEQ ID NO:15的多肽的核酸序列;
所述评价是使用选自下组的方法进行的:RFLP分析、SNP分析、和基于PCR的分析。
20.用于改变植物根构造和/或改变植物至少一种农学特性的分子育种方法,所述方法包括:
(a)使两种植物品种杂交;以及
(b)在得自步骤(a)的杂交的子代植物中针对以下序列来评价基因变异:
(i)SEQ ID NO:14的核酸序列;或
(ii)编码SEQ ID NO:15的多肽的核酸序列;
所述评价是使用选自下组的方法进行的:RFLP分析、SNP分析、和基于PCR的分析。
Applications Claiming Priority (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US98916107P | 2007-11-20 | 2007-11-20 | |
US60/989161 | 2007-11-20 | ||
US60/989,161 | 2007-11-20 | ||
US4128108P | 2008-04-01 | 2008-04-01 | |
US61/041,281 | 2008-04-01 | ||
US61/041281 | 2008-04-01 | ||
PCT/US2008/084103 WO2009067569A2 (en) | 2007-11-20 | 2008-11-20 | Plants with altered root architecture, related constructs and methods involving genes encoding leucine rich repeat kinase (llrk) polypeptides and homologs thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101868545A CN101868545A (zh) | 2010-10-20 |
CN101868545B true CN101868545B (zh) | 2013-08-21 |
Family
ID=40326361
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2008801169274A Expired - Fee Related CN101868545B (zh) | 2007-11-20 | 2008-11-20 | 具有改变的根构造的植物、涉及编码富含亮氨酸重复序列激酶(llrk)多肽及其同源物的基因的相关构建体和方法 |
Country Status (8)
Country | Link |
---|---|
US (3) | US20090130685A1 (zh) |
EP (2) | EP2225379A2 (zh) |
CN (1) | CN101868545B (zh) |
BR (1) | BRPI0818971A2 (zh) |
CA (1) | CA2703903A1 (zh) |
MX (1) | MX2010005578A (zh) |
WO (1) | WO2009067569A2 (zh) |
ZA (1) | ZA201002765B (zh) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2011119403A1 (en) * | 2010-03-26 | 2011-09-29 | Monsanto Technology Llc | Automated system for germination testing using optical imaging |
BR112016015339A2 (pt) | 2013-12-30 | 2017-10-31 | E I Du Pont De Nemours And Company Us | método para aumentar pelo menos um fenótipo, planta, semente da planta, método para aumentar a tolerância ao estresse, método para selecionar tolerância ao estresse, método para selecionar uma alteração, polinucleotídeo isolado, método para produção de uma planta, método para produção de uma semente, método para produção de óleo |
US20180066026A1 (en) | 2014-12-17 | 2018-03-08 | Ei Du Pont De Nemours And Company | Modulation of yep6 gene expression to increase yield and other related traits in plants |
WO2017062618A1 (en) * | 2015-10-06 | 2017-04-13 | Iowa State University Research Foundation, Inc. | Plants with improved agronomic characteristics |
CN112080508B (zh) * | 2020-09-16 | 2021-09-14 | 山东省农业科学院作物研究所 | 甘薯根系发育基因IbLRR1及其应用 |
CN114807181A (zh) * | 2022-04-30 | 2022-07-29 | 浙江师范大学 | 水稻OsCKX3基因在调控水稻叶夹角中的应用 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1999009151A2 (en) * | 1997-08-13 | 1999-02-25 | The Regents Of The University Of California | Procedures and materials for conferring disease resistance in plants |
CN1318974A (zh) * | 1998-07-21 | 2001-10-24 | 索尔克生物学研究院 | 受体样蛋白激酶rkn以及用其增加植物生长和产量的方法 |
WO2003008540A3 (en) * | 2001-06-22 | 2003-12-04 | Syngenta Participations Ag | Abiotic stress responsive polynucleotides and polypeptides |
US20060141495A1 (en) * | 2004-09-01 | 2006-06-29 | Kunsheng Wu | Polymorphic markers and methods of genotyping corn |
Family Cites Families (55)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
IL71064A (en) | 1983-02-28 | 1989-10-31 | Lifecodes Corp | Paternity and forensic test involving analysis of dna polymorphic genetic regions |
US4945050A (en) | 1984-11-13 | 1990-07-31 | Cornell Research Foundation, Inc. | Method for transporting substances into living cells and tissues and apparatus therefor |
US5569597A (en) | 1985-05-13 | 1996-10-29 | Ciba Geigy Corp. | Methods of inserting viral DNA into plant material |
ES2018274T5 (es) | 1986-03-11 | 1996-12-16 | Plant Genetic Systems Nv | Celulas vegetales resistentes a los inhibidores de glutamina sintetasa, preparadas por ingenieria genetica. |
US5107065A (en) | 1986-03-28 | 1992-04-21 | Calgene, Inc. | Anti-sense regulation of gene expression in plant cells |
US5188958A (en) | 1986-05-29 | 1993-02-23 | Calgene, Inc. | Transformation and foreign gene expression in brassica species |
US5013659A (en) | 1987-07-27 | 1991-05-07 | E. I. Du Pont De Nemours And Company | Nucleic acid fragment encoding herbicide resistant plant acetolactate synthase |
US5268463A (en) | 1986-11-11 | 1993-12-07 | Jefferson Richard A | Plant promoter α-glucuronidase gene construct |
US5608142A (en) | 1986-12-03 | 1997-03-04 | Agracetus, Inc. | Insecticidal cotton plants |
US5004863B2 (en) | 1986-12-03 | 2000-10-17 | Agracetus | Genetic engineering of cotton plants and lines |
US5416011A (en) | 1988-07-22 | 1995-05-16 | Monsanto Company | Method for soybean transformation and regeneration |
KR920700360A (ko) | 1989-03-22 | 1992-02-19 | 하리크 프리드리히 | 미끄럼 베어링 |
WO1990013668A1 (en) | 1989-05-05 | 1990-11-15 | Lifecodes Corporation | Method for genetic analysis of a nucleic acid sample |
ATE225853T1 (de) | 1990-04-12 | 2002-10-15 | Syngenta Participations Ag | Gewebe-spezifische promotoren |
US5498830A (en) | 1990-06-18 | 1996-03-12 | Monsanto Company | Decreased oil content in plant seeds |
US5399680A (en) | 1991-05-22 | 1995-03-21 | The Salk Institute For Biological Studies | Rice chitinase promoter |
EP0600993B1 (en) | 1991-08-27 | 1999-11-10 | Novartis AG | Proteins with insecticidal properties against homopteran insects and their use in plant protection |
US5518908A (en) | 1991-09-23 | 1996-05-21 | Monsanto Company | Method of controlling insects |
AU694187B2 (en) | 1994-02-07 | 1998-07-16 | Beckman Coulter, Inc. | Ligase/polymerase-mediated genetic bit analysis TM of single nucleotide polymorphisms and its use in genetic analysis |
US5608144A (en) | 1994-08-12 | 1997-03-04 | Dna Plant Technology Corp. | Plant group 2 promoters and uses thereof |
US5631152A (en) | 1994-10-26 | 1997-05-20 | Monsanto Company | Rapid and efficient regeneration of transgenic plants |
US5659026A (en) | 1995-03-24 | 1997-08-19 | Pioneer Hi-Bred International | ALS3 promoter |
US6072050A (en) | 1996-06-11 | 2000-06-06 | Pioneer Hi-Bred International, Inc. | Synthetic promoters |
US5981840A (en) | 1997-01-24 | 1999-11-09 | Pioneer Hi-Bred International, Inc. | Methods for agrobacterium-mediated transformation |
GB9703146D0 (en) | 1997-02-14 | 1997-04-02 | Innes John Centre Innov Ltd | Methods and means for gene silencing in transgenic plants |
SG60056A1 (en) | 1997-04-17 | 1999-02-22 | Inst Of Molecular Agrobilogy | Alteration of plant morphology by control of profilin expression |
CA2315546C (en) | 1998-02-26 | 2008-04-29 | Pioneer Hi-Bred International, Inc. | Constitutive maize promoters |
EP1068311B2 (en) | 1998-04-08 | 2020-12-09 | Commonwealth Scientific and Industrial Research Organisation | Methods and means for obtaining modified phenotypes |
EP0959133A1 (en) | 1998-05-22 | 1999-11-24 | Centrum Voor Plantenveredelings- En Reproduktieonderzoek (Cpro-Dlo) | A process for inhibiting expression of genes |
US6504083B1 (en) | 1998-10-06 | 2003-01-07 | Pioneer Hi-Bred International, Inc. | Maize Gos-2 promoters |
US7217858B2 (en) | 1998-12-21 | 2007-05-15 | E. I. Du Pont De Nemours And Company | S-adenosyl-L-methionine synthetase promoter and its use in expression of transgenic genes in plants |
EP1033405A3 (en) * | 1999-02-25 | 2001-08-01 | Ceres Incorporated | Sequence-determined DNA fragments and corresponding polypeptides encoded thereby |
US20070011783A1 (en) * | 1999-05-06 | 2007-01-11 | Jingdong Liu | Nucleic acid molecules and other molecules associated with plants and uses thereof for plant improvement |
US20100293669A2 (en) * | 1999-05-06 | 2010-11-18 | Jingdong Liu | Nucleic Acid Molecules and Other Molecules Associated with Plants and Uses Thereof for Plant Improvement |
ATE414779T1 (de) | 1999-09-30 | 2008-12-15 | Japan Tobacco Inc | Vektoren zum transformieren von pflanzen |
EP1094113A1 (en) * | 1999-10-22 | 2001-04-25 | Genetwister Technologies B.V. | Regeneration |
DE60144517D1 (de) | 2000-06-23 | 2011-06-09 | Pioneer Hi Bred Int | Rekombinante konstrukte und ihre verwendung bei der reduzierung der genexpression |
WO2002000894A2 (en) | 2000-06-30 | 2002-01-03 | Cropdesign N.V. | Gene silencing vector |
US7619146B2 (en) | 2001-06-18 | 2009-11-17 | Frankard Valerie | Method for modifying plant morphology, biochemistry and physiology |
WO2002010210A2 (en) * | 2001-08-28 | 2002-02-07 | Bayer Cropscience Ag | Polypeptides for identifying herbicidally active compounds |
CA2459756C (en) | 2001-09-14 | 2013-07-02 | Cropdesign N.V. | A method to modify cell number, architecture and yield of plants by overexpressing the e2f transcription factor |
US7928287B2 (en) | 2002-02-19 | 2011-04-19 | Pioneer Hi-Bred International, Inc. | Methods for large scale functional evaluation of nucleotide sequences in plants |
JP2005185101A (ja) * | 2002-05-30 | 2005-07-14 | National Institute Of Agrobiological Sciences | 植物の全長cDNAおよびその利用 |
JP4443864B2 (ja) * | 2002-07-12 | 2010-03-31 | 株式会社ルネサステクノロジ | レジストまたはエッチング残さ物除去用洗浄液および半導体装置の製造方法 |
US7403855B2 (en) | 2002-12-19 | 2008-07-22 | Pioneer Hi-Bred International, Inc. | Method and apparatus for tracking individual plants while growing and/or after harvest |
WO2004106531A1 (en) | 2003-05-22 | 2004-12-09 | E.I. Dupont De Nemours And Company | Method for manipulating growth, yield, and architecture in plants |
TWI285870B (en) * | 2003-08-27 | 2007-08-21 | Chi Mei Optoelectronics Corp | Liquid crystal display and driving method |
US20060150283A1 (en) * | 2004-02-13 | 2006-07-06 | Nickolai Alexandrov | Sequence-determined DNA fragments and corresponding polypeptides encoded thereby |
US7411112B2 (en) | 2003-10-09 | 2008-08-12 | Pioneer Hi-Bred International, Inc. | Maize promoter named CRWAQ81 |
DE602004023229D1 (de) | 2003-12-22 | 2009-10-29 | Du Pont | Mais metallothionen promoter 2 und methoden zur verwendung |
WO2006055487A2 (en) | 2004-11-16 | 2006-05-26 | Pioneer Hi-Bred International, Inc. | Maize cr1bio gene promoter and its use to direct root-preferred transgene expression in plants |
CA2594728C (en) | 2005-01-13 | 2013-03-19 | Pioneer Hi-Bred International, Inc. | Maize cyclo1 gene and promoter |
CN101189342B (zh) * | 2005-06-08 | 2013-04-03 | 克罗普迪塞恩股份有限公司 | 具有改良生长特性的植物及其制备方法 |
AR060523A1 (es) * | 2006-04-19 | 2008-06-25 | Pioneer Hi Bred Int | Moleculas de polinucleotidos aislados que corresponden a alelos mutantes y tipo salvaje del gen de maiz d9 y metodos para usarlas |
US20130305398A1 (en) * | 2012-02-16 | 2013-11-14 | Marie Coffin | Genes and uses for plant enhacement |
-
2008
- 2008-11-20 EP EP08852070A patent/EP2225379A2/en not_active Withdrawn
- 2008-11-20 BR BRPI0818971A patent/BRPI0818971A2/pt not_active IP Right Cessation
- 2008-11-20 CA CA2703903A patent/CA2703903A1/en not_active Abandoned
- 2008-11-20 MX MX2010005578A patent/MX2010005578A/es unknown
- 2008-11-20 EP EP13157704.1A patent/EP2617831A3/en not_active Withdrawn
- 2008-11-20 CN CN2008801169274A patent/CN101868545B/zh not_active Expired - Fee Related
- 2008-11-20 WO PCT/US2008/084103 patent/WO2009067569A2/en active Application Filing
- 2008-11-20 US US12/274,412 patent/US20090130685A1/en not_active Abandoned
-
2010
- 2010-04-20 ZA ZA2010/02765A patent/ZA201002765B/en unknown
-
2012
- 2012-01-11 US US13/347,850 patent/US9090901B2/en not_active Expired - Fee Related
-
2015
- 2015-07-01 US US14/788,821 patent/US20150299723A1/en not_active Abandoned
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1999009151A2 (en) * | 1997-08-13 | 1999-02-25 | The Regents Of The University Of California | Procedures and materials for conferring disease resistance in plants |
CN1318974A (zh) * | 1998-07-21 | 2001-10-24 | 索尔克生物学研究院 | 受体样蛋白激酶rkn以及用其增加植物生长和产量的方法 |
WO2003008540A3 (en) * | 2001-06-22 | 2003-12-04 | Syngenta Participations Ag | Abiotic stress responsive polynucleotides and polypeptides |
US20060141495A1 (en) * | 2004-09-01 | 2006-06-29 | Kunsheng Wu | Polymorphic markers and methods of genotyping corn |
Also Published As
Publication number | Publication date |
---|---|
US20120110700A1 (en) | 2012-05-03 |
CA2703903A1 (en) | 2009-05-28 |
MX2010005578A (es) | 2010-06-02 |
BRPI0818971A2 (pt) | 2017-05-16 |
US20150299723A1 (en) | 2015-10-22 |
US20090130685A1 (en) | 2009-05-21 |
EP2617831A2 (en) | 2013-07-24 |
US9090901B2 (en) | 2015-07-28 |
EP2617831A3 (en) | 2013-08-07 |
WO2009067569A2 (en) | 2009-05-28 |
CN101868545A (zh) | 2010-10-20 |
ZA201002765B (en) | 2011-10-26 |
WO2009067569A3 (en) | 2009-07-30 |
EP2225379A2 (en) | 2010-09-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101815432A (zh) | 涉及编码核苷二磷酸激酶(ndk)多肽及其同源物的基因的用于修改植物根构造的方法 | |
KR102147005B1 (ko) | Fad2 성능 유전자좌 및 표적화 파단을 유도할 수 있는 상응하는 표적 부위 특이적 결합 단백질 | |
CN101827938A (zh) | 涉及rt1基因、相关的构建体和方法的具有改变的根构造的植物 | |
CN101939434B (zh) | 用于在大豆中提高种子贮藏油脂的生成和改变脂肪酸谱的来自解脂耶氏酵母的dgat基因 | |
CN101365788B (zh) | Δ-9延伸酶及其在制备多不饱和脂肪酸中的用途 | |
CA2683497C (en) | .delta.8 desaturases and their use in making polyunsaturated fatty acids | |
CN108368517B (zh) | 用于快速植物转化的方法和组合物 | |
DK2087105T3 (da) | Delta 17-desaturase og anvendelse heraf ved fremstilling af flerumættede fedtsyrer | |
KR101447300B1 (ko) | 안트라닐레이트 신타제의 엽록체를 표적으로 하는 발현에 의한 고-트립토판 옥수수의 생산 | |
DK2623594T3 (da) | Antistof mod human prostaglandin-E2-receptor EP4 | |
CN110551713B (zh) | 用于修饰梭状芽孢杆菌属细菌的优化的遗传工具 | |
DK2324120T3 (en) | Manipulating SNF1 protein kinase OF REVISION OF OIL CONTENT IN OLEAGINOUS ORGANISMS | |
CN101646766B (zh) | △17去饱和酶及其用于制备多不饱和脂肪酸的用途 | |
BRPI0806354A2 (pt) | plantas oleaginosas transgências, sementes, óleos, produtos alimentìcios ou análogos a alimento, produtos alimentìcios medicinais ou análogos alimentìcios medicinais, produtos farmacêuticos, bebidas fórmulas para bebês, suplementos nutricionais, rações para animais domésticos, alimentos para aquacultura, rações animais, produtos de sementes inteiras, produtos de óleos misturados, produtos, subprodutos e subprodutos parcialmente processados | |
KR20130132405A (ko) | 형질전환 빈도를 증가시키기 위해 변형된 아그로박테리움 균주 | |
CN112204147A (zh) | 基于Cpf1的植物转录调控系统 | |
KR20070085669A (ko) | 고농도의 아라키돈산을 생성하는 야로위아 리폴리티카 균주 | |
CN101918560B (zh) | 在氮限制条件下具有改变的农学特性的植物以及涉及编码lnt2多肽及其同源物的基因的相关构建体和方法 | |
CN113621642A (zh) | 一种用于农作物杂交育种制种的遗传智能化育制种系统及其应用 | |
CN101868545B (zh) | 具有改变的根构造的植物、涉及编码富含亮氨酸重复序列激酶(llrk)多肽及其同源物的基因的相关构建体和方法 | |
PT1984512T (pt) | Sistema de expressão génica utilizando excisão-união em insetos | |
KR20150093721A (ko) | 원핵생물 경로를 모방함에 의한 식물 자기 질소 고정 | |
CN101001951B (zh) | 分离转录终止序列的方法 | |
CN101883843A (zh) | 破坏过氧化物酶体生物合成因子蛋白(pex)以改变含油真核生物中多不饱和脂肪酸和总脂质含量 | |
KR20180137558A (ko) | 유전자내 식물 형질전환을 위한 구조체 및 벡터 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C17 | Cessation of patent right | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20130821 Termination date: 20131120 |