CN113088523B - A kind of transposon and its application - Google Patents
A kind of transposon and its application Download PDFInfo
- Publication number
- CN113088523B CN113088523B CN202110355321.3A CN202110355321A CN113088523B CN 113088523 B CN113088523 B CN 113088523B CN 202110355321 A CN202110355321 A CN 202110355321A CN 113088523 B CN113088523 B CN 113088523B
- Authority
- CN
- China
- Prior art keywords
- zmcct
- allele
- sequence
- tested
- transposon
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 240000008042 Zea mays Species 0.000 claims abstract description 123
- 235000002017 Zea mays subsp mays Nutrition 0.000 claims abstract description 120
- 108700028369 Alleles Proteins 0.000 claims abstract description 103
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 claims abstract description 88
- 235000009973 maize Nutrition 0.000 claims abstract description 88
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 84
- 241000196324 Embryophyta Species 0.000 claims abstract description 64
- 108020004414 DNA Proteins 0.000 claims abstract description 42
- 230000017260 vegetative to reproductive phase transition of meristem Effects 0.000 claims abstract description 34
- 239000002773 nucleotide Substances 0.000 claims abstract description 22
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 22
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 claims description 32
- 235000005822 corn Nutrition 0.000 claims description 32
- 102000053602 DNA Human genes 0.000 claims description 17
- 238000000034 method Methods 0.000 claims description 17
- 108020004682 Single-Stranded DNA Proteins 0.000 claims description 16
- 239000000463 material Substances 0.000 claims description 11
- 239000000126 substance Substances 0.000 claims description 5
- 108091028043 Nucleic acid sequence Proteins 0.000 abstract description 6
- 125000000539 amino acid group Chemical group 0.000 abstract description 4
- 238000003780 insertion Methods 0.000 description 59
- 230000037431 insertion Effects 0.000 description 58
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 15
- 230000014509 gene expression Effects 0.000 description 13
- 210000000349 chromosome Anatomy 0.000 description 11
- 238000012408 PCR amplification Methods 0.000 description 8
- 239000012634 fragment Substances 0.000 description 8
- 238000001514 detection method Methods 0.000 description 6
- 230000003321 amplification Effects 0.000 description 5
- 238000013507 mapping Methods 0.000 description 5
- 238000003199 nucleic acid amplification method Methods 0.000 description 5
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 4
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 4
- 238000012217 deletion Methods 0.000 description 4
- 230000037430 deletion Effects 0.000 description 4
- 108010028295 histidylhistidine Proteins 0.000 description 4
- 108010053725 prolylvaline Proteins 0.000 description 4
- 238000012163 sequencing technique Methods 0.000 description 4
- 150000001413 amino acids Chemical group 0.000 description 3
- 230000002035 prolonged effect Effects 0.000 description 3
- 102000004169 proteins and genes Human genes 0.000 description 3
- 210000001519 tissue Anatomy 0.000 description 3
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 2
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 2
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 2
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 2
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 2
- KJGNDQCYBNBXDA-GUBZILKMSA-N Arg-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N KJGNDQCYBNBXDA-GUBZILKMSA-N 0.000 description 2
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 2
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 2
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 2
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 2
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 2
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 2
- NMTANZXPDAHUKU-ULQDDVLXSA-N Arg-Tyr-Lys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 NMTANZXPDAHUKU-ULQDDVLXSA-N 0.000 description 2
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 2
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 2
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 2
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 2
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 2
- ZCKYZTGLXIEOKS-CIUDSAMLSA-N Asp-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N ZCKYZTGLXIEOKS-CIUDSAMLSA-N 0.000 description 2
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 2
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 2
- UBPMOJLRVMGTOQ-GARJFASQSA-N Asp-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)C(=O)O UBPMOJLRVMGTOQ-GARJFASQSA-N 0.000 description 2
- UBHPUQAWSSNQLQ-DCAQKATOSA-N Cys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O UBHPUQAWSSNQLQ-DCAQKATOSA-N 0.000 description 2
- IVCOYUURLWQDJQ-LPEHRKFASA-N Gln-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O IVCOYUURLWQDJQ-LPEHRKFASA-N 0.000 description 2
- TWTWUBHEWQPMQW-ZPFDUUQYSA-N Gln-Ile-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWTWUBHEWQPMQW-ZPFDUUQYSA-N 0.000 description 2
- LVRKAFPPFJRIOF-GARJFASQSA-N Gln-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N LVRKAFPPFJRIOF-GARJFASQSA-N 0.000 description 2
- FTTHLXOMDMLKKW-FHWLQOOXSA-N Gln-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTTHLXOMDMLKKW-FHWLQOOXSA-N 0.000 description 2
- BJVBMSTUUWGZKX-JYJNAYRXSA-N Gln-Tyr-His Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BJVBMSTUUWGZKX-JYJNAYRXSA-N 0.000 description 2
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 2
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 2
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 2
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- ZKLYPEGLWFVRGF-IUCAKERBSA-N Gly-His-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZKLYPEGLWFVRGF-IUCAKERBSA-N 0.000 description 2
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 2
- WEIYKCOEVBUJQC-JYJNAYRXSA-N His-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WEIYKCOEVBUJQC-JYJNAYRXSA-N 0.000 description 2
- KNNSUUOHFVVJOP-GUBZILKMSA-N His-Glu-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N KNNSUUOHFVVJOP-GUBZILKMSA-N 0.000 description 2
- CSTNMMIHMYJGFR-IHRRRGAJSA-N His-His-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 CSTNMMIHMYJGFR-IHRRRGAJSA-N 0.000 description 2
- IDQNVIWPPWAFSY-AVGNSLFASA-N His-His-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O IDQNVIWPPWAFSY-AVGNSLFASA-N 0.000 description 2
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 2
- PBVQWNDMFFCPIZ-ULQDDVLXSA-N His-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 PBVQWNDMFFCPIZ-ULQDDVLXSA-N 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 2
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 2
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 2
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 2
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 2
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 2
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 2
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 2
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 2
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 2
- DRVIASBABBMZTF-GUBZILKMSA-N Pro-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1 DRVIASBABBMZTF-GUBZILKMSA-N 0.000 description 2
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 2
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 2
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 2
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 2
- SVWQEIRZHHNBIO-WHFBIAKZSA-N Ser-Gly-Cys Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CS)C(O)=O SVWQEIRZHHNBIO-WHFBIAKZSA-N 0.000 description 2
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 2
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 2
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 2
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 2
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 2
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 2
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 2
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 2
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 2
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 2
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 2
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 2
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 2
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- 108010011559 alanylphenylalanine Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 2
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 230000033764 rhythmic process Effects 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- 244000025254 Cannabis sativa Species 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- 108010006785 Taq Polymerase Proteins 0.000 description 1
- 102100025568 Voltage-dependent L-type calcium channel subunit beta-1 Human genes 0.000 description 1
- 101710176690 Voltage-dependent L-type calcium channel subunit beta-1 Proteins 0.000 description 1
- 241000607479 Yersinia pestis Species 0.000 description 1
- OMOVVBIIQSXZSZ-UHFFFAOYSA-N [6-(4-acetyloxy-5,9a-dimethyl-2,7-dioxo-4,5a,6,9-tetrahydro-3h-pyrano[3,4-b]oxepin-5-yl)-5-formyloxy-3-(furan-3-yl)-3a-methyl-7-methylidene-1a,2,3,4,5,6-hexahydroindeno[1,7a-b]oxiren-4-yl] 2-hydroxy-3-methylpentanoate Chemical compound CC12C(OC(=O)C(O)C(C)CC)C(OC=O)C(C3(C)C(CC(=O)OC4(C)COC(=O)CC43)OC(C)=O)C(=C)C32OC3CC1C=1C=COC=1 OMOVVBIIQSXZSZ-UHFFFAOYSA-N 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 238000005265 energy consumption Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 230000010152 pollination Effects 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 230000001020 rhythmical effect Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000009105 vegetative growth Effects 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6888—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
- C12Q1/6895—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for plants, fungi or algae
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/13—Plant traits
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/156—Polymorphic or mutational markers
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Health & Medical Sciences (AREA)
- Analytical Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Biophysics (AREA)
- Botany (AREA)
- Zoology (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- Genetics & Genomics (AREA)
- Mycology (AREA)
- Physics & Mathematics (AREA)
- Immunology (AREA)
- Microbiology (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
本发明公开了一种转座子及其应用。本发明提供了的LINE/L1转座子,其为如下任一种:1)核苷酸序列为序列表中序列5;2)在严格条件下与1)限定的DNA序列杂交且编码具有相同功能氨基酸残基的DNA分子;3)与1)限定的DNA序列至少具有70%、至少具有75%、至少具有80%、至少具有85%、至少具有90%、至少具有95%、至少具有96%、至少具有97%、至少具有98%或至少具有99%同源性且编码具有相同功能氨基酸残基的DNA分子。本发明首次在ZmCCT基因中发现了LINE/L1转座子,且发现含有该转座子的等位基因ZmCCT‑FO,可用于鉴定待测玉米的开花时间早晚,也可以用于鉴定待测玉米的株高和穗位高。The invention discloses a transposon and its application. The LINE/L1 transposon provided by the present invention is any of the following: 1) the nucleotide sequence is sequence 5 in the sequence table; 2) hybridizes with the DNA sequence defined in 1) under stringent conditions and has the same encoding DNA molecules with functional amino acid residues; 3) and 1) defined DNA sequences with at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96% %, at least 97%, at least 98%, or at least 99% homologous and encoding DNA molecules with the same functional amino acid residues. The present invention finds the LINE/L1 transposon in the ZmCCT gene for the first time, and finds that the allele ZmCCT-FO containing the transposon can be used to identify the flowering time of the maize to be tested, and can also be used to identify the maize to be tested. plant height and ear height.
Description
技术领域technical field
本发明属于生物技术领域,尤其涉及一种转座子及其应用。The invention belongs to the field of biotechnology, and in particular relates to a transposon and its application.
背景技术Background technique
玉米是我国最重要的粮食作物之一,2019年,玉米产量接近我国粮食总产量的40%。玉米成熟期是影响玉米产量的关键性状之一,玉米成熟期延长,即玉米开花期变晚,会导致更高比例的能量消耗在营养生长阶段,增加田间遭遇病虫害的可能性。在华北地区,玉米成熟期延长,还会影响下一茬作物的轮作,在东北地区,玉米成熟期延长,还会增加冻害风险,造成减产。此外,玉米由热带植物大刍草驯化而来,大刍草对光周期敏感,在高纬度长日照条件下不能正常开花,许多热带玉米自交系仍然光周期敏感,种植在高纬度温带地区开花期特晚,甚至不开花,直接限制了热带玉米种质资源中优良等位基因不能被挖掘利用。Corn is one of the most important food crops in my country. In 2019, corn production was close to 40% of my country's total grain output. Maize maturity is one of the key traits affecting maize yield. Prolonged maize maturity, that is, later flowering of maize, will lead to a higher proportion of energy consumption in the vegetative growth stage and increase the possibility of field encounters with pests and diseases. In North China, prolonged corn maturity will also affect the rotation of the next crop. In Northeast China, prolonged corn maturity will increase the risk of frost damage and reduce yields. In addition, maize is domesticated from the tropical plant ruminant grass, which is sensitive to photoperiod and cannot bloom normally under high-latitude long-day conditions. Many tropical maize inbred lines are still sensitive to photoperiod and bloom in high-latitude temperate regions. The period is very late, or even does not bloom, which directly restricts the excellent alleles in tropical maize germplasm resources from being excavated and utilized.
发明内容SUMMARY OF THE INVENTION
本发明的一个目的是提供一种LINE/L1转座子。An object of the present invention is to provide a LINE/L1 transposon.
本发明提供的LINE/L1转座子,其为如下任一种:The LINE/L1 transposon provided by the invention is any of the following:
1)核苷酸序列为序列表中序列5;1) The nucleotide sequence is sequence 5 in the sequence listing;
2)在严格条件下与1)限定的DNA序列杂交的DNA分子;2) DNA molecules that hybridize under stringent conditions to the DNA sequences defined in 1);
3)与1)限定的DNA序列至少具有70%、至少具有75%、至少具有80%、至少具有85%、至少具有90%、至少具有95%、至少具有96%、至少具有97%、至少具有98%或至少具有99%同源性的DNA分子。3) The DNA sequences defined in 1) have at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least DNA molecules with 98% or at least 99% homology.
含有上述的LINE/L1转座子的ZmCCT等位基因也是本发明保护的范围。The ZmCCT allele containing the above-mentioned LINE/L1 transposon is also within the scope of the present invention.
上述ZmCCT等位基因为ZmCCT-FO,其为如下任一种:The above-mentioned ZmCCT allele is ZmCCT-FO, which is any of the following:
1)核苷酸序列为序列表中序列14;1) The nucleotide sequence is sequence 14 in the sequence listing;
2)在严格条件下与1)限定的DNA序列杂交且编码具有相同功能氨基酸残基的DNA分子;2) Hybridize with the DNA sequence defined in 1) under stringent conditions and encode a DNA molecule with the same functional amino acid residue;
3)与1)限定的DNA序列至少具有70%、至少具有75%、至少具有80%、至少具有85%、至少具有90%、至少具有95%、至少具有96%、至少具有97%、至少具有98%或至少具有99%同源性且编码具有相同功能氨基酸残基的DNA分子。3) The DNA sequences defined in 1) have at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least DNA molecules having 98% or at least 99% homology and encoding amino acid residues with the same function.
如下A或B的物质在鉴定如下1)-3)中至少一种中的应用也是本发明保护的范围:The application of the following substances of A or B in identifying at least one of the following 1)-3) is also the scope of protection of the present invention:
A、所示的物质为检测待测玉米基因组中的ZmCCT基因是否含有所述LINE/L1转座子的物质;A. The material shown is a material for detecting whether the ZmCCT gene in the maize genome to be tested contains the LINE/L1 transposon;
B、所示的物质为检测待测玉米基因组中的ZmCCT基因为等位基因ZmCCT-FO(序列15)、等位基因ZmCCT-F7还是等位基因ZmCCT-OGD的物质;B, the material shown is to detect that the ZmCCT gene in the corn genome to be tested is the material of allele ZmCCT-FO (sequence 15), allele ZmCCT-F7 or allele ZmCCT-OGD;
1)鉴定待测玉米的开花时间早晚;1) Identify the flowering time of the corn to be tested sooner or later;
2)鉴定待测玉米的株高;2) Identify the plant height of the corn to be tested;
3)鉴定待测玉米的穗位高。3) Identify the ear height of the corn to be tested.
上述应用中,所述检测待测玉米基因组中的ZmCCT基因是否含有LINE/L1转座子的物质为扩增LINE/L1转座子的引物;所述引物具体由IIA引物和IIB引物组成;In the above application, the material for detecting whether the ZmCCT gene in the maize genome to be tested contains LINE/L1 transposon is a primer for amplifying the LINE/L1 transposon; the primer is specifically composed of an IIA primer and an IIB primer;
所述IIA引物由序列表中序列6所示的单链DNA分子和序列表中序列7所述的单链DNA分子组成;The IIA primer is composed of the single-stranded DNA molecule shown in
或,所述IIA引物由序列表中序列10所示的单链DNA分子和序列表中序列11所述的单链DNA分子组成;Or, the IIA primer is composed of the single-stranded DNA molecule shown in
所述IIB引物由序列表中序列8所示的单链DNA分子和序列表中序列9所述的单链DNA分子组成;The IIB primer is composed of the single-stranded DNA molecule shown in
或,所述IIB引物由序列表中序列12所示的单链DNA分子和序列表中序列13所述的单链DNA分子组成。Or, the IIB primer consists of the single-stranded DNA molecule shown in SEQ ID NO: 12 in the Sequence Listing and the single-stranded DNA molecule described in SEQ ID NO: 13 in the Sequence Listing.
上述应用中的所述A物质或者B物质也是本发明保护的范围。The substance A or substance B in the above application is also within the protection scope of the present invention.
所述检测待测玉米基因组中的ZmCCT基因为等位基因ZmCCT-FO(序列14)、等位基因ZmCCT-F7(核苷酸序列为序列1)还是等位基因ZmCCT-OGD(核苷酸序列为序列2)的物质为如下:Described to detect whether the ZmCCT gene in the maize genome to be tested is allele ZmCCT-FO (sequence 14), allele ZmCCT-F7 (nucleotide sequence is sequence 1) or allele ZmCCT-OGD (nucleotide sequence 1) The substances of sequence 2) are as follows:
鉴定5.1-kb的转座子纯合插入采用引物为表1中的M7-1和M7-2,引物M7-1扩增无条带,且M7-2扩增有条带的单株为5.1-kb的转座子纯合插入单株;鉴定4.2-kb的LINE/L1转座子纯合插入采用表2中的IIA和IIB,引物IIA扩增有条带,且IIB扩增无条带的单株为LINE/L1转座子纯合插入单株。The homozygous insertion of the 5.1-kb transposon was identified using primers M7-1 and M7-2 in Table 1. The primer M7-1 amplified no band, and the individual plant with a band amplified by M7-2 was 5.1 -Kb transposon homozygous insertion into a single plant; identify 4.2-kb LINE/L1 transposon homozygous insertion using IIA and IIB in Table 2, primer IIA amplified with bands, and IIB amplified without bands The individual plant is a LINE/L1 transposon homozygous insertion plant.
ZmCCT基因启动子区域有5.1-kb的转座子纯合插入且内含子区域有4.2-kb的LINE/L1转座子纯合插入的单株,其具有的ZmCCT等位基因ZmCCT-FO。A single plant with a 5.1-kb transposon homozygous insertion in the promoter region of the ZmCCT gene and a 4.2-kb LINE/L1 transposon homozygous insertion in the intron region has the ZmCCT allele ZmCCT-FO.
ZmCCT基因启动子区域有5.1-kb的转座子纯合插入且内含子区域没有4.2-kb的LINE/L1转座子插入的单株,其具有ZmCCT等位基因ZmCCT-OGD。A single plant with a 5.1-kb transposon homozygous insertion in the promoter region of the ZmCCT gene and no 4.2-kb LINE/L1 transposon insertion in the intron region has the ZmCCT allele ZmCCT-OGD.
ZmCCT基因启动子区域没有5.1-kb的转座子插入且内含子区域有4.2-kb的LINE/L1转座子纯合插入的单株,其具有ZmCCT等位基因ZmCCT-F7。A single plant without a 5.1-kb transposon insertion in the promoter region of the ZmCCT gene and a homozygous insertion of a 4.2-kb LINE/L1 transposon in the intron region has the ZmCCT allele ZmCCT-F7.
本发明还有一个目的是提供如下方法。Still another object of the present invention is to provide the following method.
本发明提供的一种鉴定待测玉米的开花时间早晚的方法,为检测待测玉米基因组中的ZmCCT基因是否含有上述的LINE/L1转座子,基因组中的ZmCCT基因含有LINE/L1转座子的待测玉米开花时间早于或候选早于基因组中的ZmCCT基因不含有LINE/L1转座子的待测玉米;The present invention provides a method for identifying the flowering time of corn to be tested. In order to detect whether the ZmCCT gene in the genome of the corn to be tested contains the above-mentioned LINE/L1 transposon, the ZmCCT gene in the genome contains the LINE/L1 transposon. The flowering time of the tested maize is earlier than or the candidate is earlier than the tested maize that does not contain the LINE/L1 transposon in the ZmCCT gene in the genome;
或,本发明提供的一种鉴定待测玉米的开花时间早晚的方法,为检测待测玉米基因组中的ZmCCT基因为等位基因ZmCCT-FO、等位基因ZmCCT-F7还是等位基因ZmCCT-OGD,具有等位基因ZmCCT-FO的待测玉米开花时间早于或候选早于具有等位基因ZmCCT-F7或具有等位基因ZmCCT-OGD的待测玉米。Or, a method for identifying the flowering time of corn to be tested sooner or later provided by the present invention is to detect whether the ZmCCT gene in the genome of corn to be tested is allele ZmCCT-FO, allele ZmCCT-F7 or allele ZmCCT-OGD , the flowering time of the tested maize with allele ZmCCT-FO is earlier or candidate earlier than the tested maize with allele ZmCCT-F7 or with allele ZmCCT-OGD.
或本发明提供的一种鉴定待测玉米的株高的方法,为检测待测玉米基因组中的ZmCCT基因是否含有上述的LINE/L1转座子,基因组中的ZmCCT基因含有LINE/L1转座子的待测玉米株高低于或候选低于基因组中的ZmCCT基因不含有LINE/L1转座子的待测玉米;Or a method for identifying the plant height of maize to be tested provided by the present invention is to detect whether the ZmCCT gene in the maize genome to be tested contains the above-mentioned LINE/L1 transposon, and the ZmCCT gene in the genome contains the LINE/L1 transposon The tested maize plant height is lower than or the candidate is lower than the ZmCCT gene in the genome to be tested without the LINE/L1 transposon;
或,本发明提供的一种鉴定待测玉米的株高的方法,为检测待测玉米基因组中的ZmCCT基因为等位基因ZmCCT-FO、等位基因ZmCCT-F7还是等位基因ZmCCT-OGD,具有等位基因ZmCCT-FO的待测玉米株高低于或候选低于具有等位基因ZmCCT-F7或具有等位基因ZmCCT-OGD的待测玉米。Or, a method for identifying the plant height of maize to be tested provided by the present invention is to detect whether the ZmCCT gene in the maize genome to be tested is allele ZmCCT-FO, allele ZmCCT-F7 or allele ZmCCT-OGD, The plant height of the tested maize with the allele ZmCCT-FO is lower or candidate lower than that of the tested maize with the allele ZmCCT-F7 or with the allele ZmCCT-OGD.
或,本发明提供的一种鉴定待测玉米的穗位高的方法,为检测待测玉米基因组中的ZmCCT基因是否含有上述的LINE/L1转座子,基因组中的ZmCCT基因含有LINE/L1转座子的待测玉米穗位高低于或候选低于基因组中的ZmCCT基因不含有LINE/L1转座子的待测玉米;Or, a method for identifying the ear height of maize to be tested provided by the present invention is to detect whether the ZmCCT gene in the maize genome to be tested contains the above-mentioned LINE/L1 transposon, and the ZmCCT gene in the genome contains the LINE/L1 transposon. The corn ear height of the transposon to be tested is lower than or the candidate is lower than the ZmCCT gene in the genome of the corn to be tested that does not contain the LINE/L1 transposon;
或,本发明提供的一种鉴定待测玉米的穗位高的方法,为检测待测玉米基因组中的ZmCCT基因为等位基因ZmCCT-FO、等位基因ZmCCT-F7还是等位基因ZmCCT-OGD,具有等位基因ZmCCT-FO的待测玉米穗位高低于或候选低于具有等位基因ZmCCT-F7或具有等位基因ZmCCT-OGD的待测玉米。Or, a method for identifying the ear height of maize to be tested provided by the present invention is to detect whether the ZmCCT gene in the maize genome to be tested is allele ZmCCT-FO, allele ZmCCT-F7 or allele ZmCCT-OGD , the ear height of the tested maize with the allele ZmCCT-FO is lower or the candidate is lower than the tested maize with the allele ZmCCT-F7 or with the allele ZmCCT-OGD.
上述检测待测玉米基因组中的ZmCCT基因是否含有上述的LINE/L1转座子的方法如下:Whether the above-mentioned ZmCCT gene in the maize genome to be tested contains the above-mentioned LINE/L1 transposon is as follows:
用引物IIA和IIB分别对待测玉米基因组进行扩增,按照如下方法判断:Use primers IIA and IIB to amplify the maize genome to be tested, respectively, and judge according to the following methods:
如果引物IIA扩增有条带,且IIB扩增无条带,则说明待测玉米基因组中2条染色体的ZmCCT基因均含有LINE/L1转座子,为LINE/L1转座子纯合型;If primer IIA has a band in amplification, and IIB has no band in amplification, it means that the ZmCCT genes of the two chromosomes in the maize genome to be tested both contain LINE/L1 transposon and are homozygous for LINE/L1 transposon;
如果引物IIB扩增有条带,且IIA扩增无条带,则说明待测玉米基因组中2条染色体的ZmCCT基因均不含有LINE/L1转座子,为缺失LINE/L1转座子纯合型;If the primer IIB has a band, and the IIA has no band, it means that the ZmCCT gene of the two chromosomes in the maize genome to be tested does not contain the LINE/L1 transposon, and it is homozygous for the deletion of the LINE/L1 transposon. type;
如果引物IIA和引物IIB二者都有条带,说明待测玉米基因组中一条染色体的ZmCCT基因有LINE/L1转座子插入,而另一条染色体的没有LINE/L1转座子插入,为LINE/L1转座子杂合型。If both primer IIA and primer IIB have bands, it means that the ZmCCT gene of one chromosome in the tested maize genome has LINE/L1 transposon insertion, while the other chromosome does not have LINE/L1 transposon insertion, which is LINE/ L1 transposon heterozygous.
上述引物IIA为引物IIA1或引物IIA2,上述引物IIB为引物IIB1或引物IIB2。The aforementioned primer IIA is primer IIA1 or primer IIA2, and the aforementioned primer IIB is primer IIB1 or primer IIB2.
IIA1引物的目的扩增片段大小为493bp,IIB1引物的目的扩增片段大小为1007bp。The size of the target amplified fragment of the IIA1 primer was 493 bp, and the size of the target amplified fragment of the IIB1 primer was 1007 bp.
IIA2引物的目的扩增片段大小为591bp,IIB2引物的目的扩增片段大小为1588bp。The size of the target amplified fragment of the IIA2 primer is 591 bp, and the size of the target amplified fragment of the IIB2 primer is 1588 bp.
上述检测待测玉米基因组中的ZmCCT基因为等位基因ZmCCT-FO、等位基因ZmCCT-F7还是等位基因ZmCCT-OGD的方法如下:The above-mentioned method for detecting whether the ZmCCT gene in the maize genome to be tested is the allele ZmCCT-FO, the allele ZmCCT-F7 or the allele ZmCCT-OGD is as follows:
可以通过测序检测,也可以通过如下方法检测:It can be detected by sequencing or by the following methods:
ZmCCT基因启动子区域有5.1-kb的转座子纯合插入且内含子区域有4.2-kb的LINE/L1转座子纯合插入的单株,其具有的ZmCCT等位基因ZmCCT-FO。A single plant with a 5.1-kb transposon homozygous insertion in the promoter region of the ZmCCT gene and a 4.2-kb LINE/L1 transposon homozygous insertion in the intron region has the ZmCCT allele ZmCCT-FO.
ZmCCT基因启动子区域有5.1-kb的转座子纯合插入且内含子区域没有4.2-kb的LINE/L1转座子插入的单株,其具有ZmCCT等位基因ZmCCT-OGD。A single plant with a 5.1-kb transposon homozygous insertion in the promoter region of the ZmCCT gene and no 4.2-kb LINE/L1 transposon insertion in the intron region has the ZmCCT allele ZmCCT-OGD.
ZmCCT基因启动子区域没有5.1-kb的转座子插入且内含子区域有4.2-kb的LINE/L1转座子纯合插入的单株,其具有ZmCCT等位基因ZmCCT-F7。A single plant without a 5.1-kb transposon insertion in the promoter region of the ZmCCT gene and a homozygous insertion of a 4.2-kb LINE/L1 transposon in the intron region has the ZmCCT allele ZmCCT-F7.
上述鉴定5.1-kb的转座子插入采用引物为表1中的M7-1和M7-2,引物M7-1扩增无条带,且M7-2扩增有条带的单株为5.1-kb的转座子纯合插入单株。The above-identified 5.1-kb transposon insertion used primers M7-1 and M7-2 in Table 1, primer M7-1 amplified no band, and M7-2 amplified a single plant with a band of 5.1- The kb transposon was homozygous for insertion into the individual plant.
上述鉴定4.2-kb的LINE/L1转座子插入采用表2中的IIA和IIB,引物IIA扩增有条带,且IIB扩增无条带的单株为LINE/L1转座子纯合插入单株。The above-identified LINE/L1 transposon insertion of 4.2-kb adopts IIA and IIB in Table 2, the primer IIA amplified with a band, and the individual plant with no band amplified by IIB was a LINE/L1 transposon homozygous insertion single plant.
本发明首次在ZmCCT基因中发现了LINE/L1转座子,且发现含有该转座子的等位基因ZmCCT-FO,该转座子和等位基因可用于鉴定待测玉米的开花时间早晚,也可以用于鉴定待测玉米的株高和穗位高。The present invention finds the LINE/L1 transposon in the ZmCCT gene for the first time, and finds the allele ZmCCT-FO containing the transposon, and the transposon and the allele can be used to identify the flowering time of the maize to be tested sooner or later, It can also be used to identify the plant height and ear height of the corn to be tested.
附图说明Description of drawings
图1为玉米10号染色体的开花期QTL定位结果。Figure 1 shows the results of QTL mapping at the flowering stage of
图2为10号开花期QTL的精细定位结果。Figure 2 shows the results of fine mapping of QTLs at the flowering stage of No. 10.
图3为520-kb区间内全部7个基因的相对表达量。Figure 3 shows the relative expression levels of all 7 genes in the 520-kb interval.
图4为双亲的ZmCCT基因内含子的PCR扩增。Figure 4 is a PCR amplification of the ZmCCT gene introns of the parents.
图5为ZmCCT基因在双亲间的变异位点分析。Figure 5 is an analysis of the variation sites of ZmCCT gene between parents.
图6为NILs-F7和NILs-OGD的散粉期、株高、穗位高和穗上叶片数。Fig. 6 shows the pollination stage, plant height, ear height and number of leaves on ear of NILs-F7 and NILs-OGD.
图7为NILs-F7和NILs-OGD的ZmCCT基因表达节律和表达积累量。Figure 7 shows the rhythm and accumulation of ZmCCT gene expression in NILs-F7 and NILs-OGD.
图8为双亲的ZmCCT转录本的PCR扩增。Figure 8 is a PCR amplification of parental ZmCCT transcripts.
图9为ZmCCT-FO、ZmCCT-OGD和ZmCCT-F7的ZmCCT表达量和开花期表型。Figure 9 shows the ZmCCT expression levels and flowering phenotypes of ZmCCT-FO, ZmCCT-OGD and ZmCCT-F7.
具体实施方式Detailed ways
下述实施例中所使用的实验方法如无特殊说明,均为常规方法。The experimental methods used in the following examples are conventional methods unless otherwise specified.
下述实施例中所用的材料、试剂等,如无特殊说明,均可从商业途径得到。The materials, reagents, etc. used in the following examples can be obtained from commercial sources unless otherwise specified.
极早熟玉米自交系F7由Germplasm Resource Information Network(GRIN)提供,家系编号是PI 257507。The very early maize inbred line F7 was provided by the Germplasm Resource Information Network (GRIN) with the family number PI 257507.
玉米地方种OGD是商业品种,全名Oaxacan Green Dent,购自网上(https://www.anniesheirloomseeds.com/)。Corn landrace OGD is a commercial variety, full name Oaxacan Green Dent, purchased online (https://www.anniesheirloomseeds.com/).
F7代的剩余杂合家系(HIF)为F7和OGD杂交获得F1代,F1代自交6代得到在ZmCCT基因保持杂合(采用表1中M2进行鉴定,M2位于ZmCCT基因上,M2扩增获得2条杂合条带就代表ZmCCT基因保持杂合)的F7代单株,F7代单株自交获得的全部后代所组成的家系。The remaining heterozygous family (HIF) of the F 7 generation is the F 1 generation obtained from the cross between F7 and OGD, and the F 1 generation is obtained by self-crossing for 6 generations. The ZmCCT gene remains heterozygous (M2 in Table 1 is used for identification, M2 is located on the ZmCCT gene, M2 amplification to obtain 2 heterozygous bands represents the F 7 generation individual plant with the ZmCCT gene remaining heterozygous), and the family composed of all the progeny obtained from the self-crossing of the F 7 generation individual plant.
实施例1、转座子的发现及在ZmCCT近等基因系中的应用
一、LINE/L1转座子的发现1. Discovery of LINE/L1 transposon
1、通过杂交,组配一个来自欧洲的极早熟玉米自交系F7和一个来自墨西哥的正常熟期的玉米地方种OGD(OAXACAN GREEN DENT)的F2群体,双亲的散粉期差异达到一周。在低纬度的海南对F2群体进行QTL定位,在10号染色体94-Mb附近定位到一个影响玉米开花期的主效QTL,可以解释15.3%的表型变异,加性效应有1.6天(图1)。1. By crossing, a very early-maturing maize inbred line F7 from Europe and an F 2 population of OGD (OAXACAN GREEN DENT), a corn landrace of normal maturity from Mexico, were assembled. QTL mapping of the F 2 population in the low-latitude Hainan, a major QTL affecting the flowering period of maize was located near the 94-Mb chromosome of
2、利用来自19个剩余杂合家系(HIFs)的5800个单株进行精细定位,通过筛选新的交换单株和分子标记,将这个QTL缩小到520-kb的区间内,位于标记M4和M7之间(图2、表1)。2. Using 5800 individuals from 19 remaining heterozygous families (HIFs) for fine mapping, this QTL was narrowed down to a 520-kb interval by screening for new crossover individuals and molecular markers, located at markers M4 and M7 between (Figure 2, Table 1).
表1为引物序列Table 1 is the primer sequence
这一区间内包含7个基因,通过检测包含两个近等基因系(NILs)中这7个基因在V3时期叶片中的表达量,发现ZmCCT是其中唯一具有显著表达量差异(P<0.01)的基因(图3、表1)。This interval contains 7 genes. By detecting the expression levels of these 7 genes in leaves at V3 stage in two near-isogenic lines (NILs), it is found that ZmCCT is the only one with significant difference in expression (P<0.01). genes (Figure 3, Table 1).
ZmCCT已经被报道调控玉米开花期,并且亲本型NILs间的ZmCCT表达量差异符合开花期表型的早晚,所以将ZmCCT确定为候选基因(图6)。ZmCCT has been reported to regulate the flowering stage of maize, and the difference in ZmCCT expression between parental NILs is consistent with the early and late flowering phenotypes, so ZmCCT was identified as a candidate gene (Figure 6).
3、根据B73参考基因组,ZmCCT包含两个外显子和一个内含子,编码一个240-aa长度的蛋白,包含一个CO CO-LIKE TIMING OF CAB1(CCT)结构域,结构域从196-aa到237-aa(XP_008662996,提交日为2020年1月10日)。3. According to the B73 reference genome, ZmCCT contains two exons and one intron, encoding a 240-aa protein, containing a CO CO-LIKE TIMING OF CAB1 (CCT) domain, the domain from 196-aa to 237-aa (XP_008662996, commit date January 10, 2020).
4、对双亲材料进行DNA测序,发现亲本OGD的ZmCCT基因(核苷酸序列为序列2,等位基因命名为ZmCCT-OGD,编码的蛋白的氨基酸序列为序列4)长度是2561-bp,在启动子区域存在已知的5.1-kb转座子插入,亲本F7的ZmCCT基因(核苷酸序列为序列1,等位基因命名为ZmCCT-F7,编码的蛋白的氨基酸序列为序列3)长度是6730-bp,在第2292-6461位,距离第2个外显子仅26-bp的第1个内含子中,存在一个未被报道的LINE/L1转座子插入,转座子长度4170-bp(图4)。4. DNA sequencing was performed on the parental material, and it was found that the ZmCCT gene of the parent OGD (the nucleotide sequence was
此外,在编码区内,ZmCCT-F7和ZmCCT-OGD之间存在5个SNPs,其中3个会导致氨基酸替换(序列3、序列4)。在NAM(Nested Association Mapping)群体中,共有亲本B73的ZmCCT序列与F7的一致,亲本B97、M37W和MS71的ZmCCT序列与OGD的一致,但是B73与这三个亲本分别组配的亚群体在ZmCCT附近并没有检测到超过阈值的开花期QTL(图5),这说明这5个SNPs并不是功能位点。In addition, within the coding region, there are 5 SNPs between ZmCCT-F7 and ZmCCT-OGD, 3 of which lead to amino acid substitutions (SEQ ID NO: 3, SEQ ID NO: 4). In the NAM (Nested Association Mapping) population, the ZmCCT sequence of the common parent B73 is consistent with that of F7, and the ZmCCT sequences of the parents B97, M37W and MS71 are consistent with those of OGD, but the subpopulations in which B73 and these three parents were assembled respectively are in ZmCCT No flowering QTLs above the threshold were detected nearby (Fig. 5), indicating that these five SNPs are not functional loci.
综上,推测LINE/L1转座子插入是唯一的功能位点。In summary, it is speculated that the LINE/L1 transposon insertion is the only functional site.
经过测序,LINE/L1转座子(又名4.2-kb的LINE/L1转座子)的核苷酸序列为序列5。After sequencing, the nucleotide sequence of the LINE/L1 transposon (also known as the 4.2-kb LINE/L1 transposon) was sequence 5.
二、LINE/L1转座子与开花时间的关系2. Relationship between LINE/L1 transposon and flowering time
1、通过在一个F7代的杂合HIF系内自交,获得两个F8代NILs,NILs-F7在低纬度海南地区的散粉期比NILs-OGD的提早6.0天(P<0.01)(图6)。进一步研究这两个NILs中ZmCCT在短日照条件下的表达模式,发现ZmCCT-F7存在明显的节律表达,而ZmCCT-OGD不存在表达节律,并且在各个时间点的表达量都超过ZmCCT-F7的,从而在一天内的ZmCCT表达积累量上也显著高于ZmCCT-F7的(P<0.01)(图7)。1. By selfing in a heterozygous HIF line of F 7 generation, two NILs of F 8 generation were obtained. The dispersing stage of NILs-F7 in low-latitude Hainan was 6.0 days earlier than that of NILs-OGD (P<0.01) ( Image 6). Further study of the expression pattern of ZmCCT in these two NILs under short-day conditions found that ZmCCT-F7 had a clear rhythmic expression, while ZmCCT-OGD had no expression rhythm, and the expression level at each time point exceeded that of ZmCCT-F7. , so that the accumulation of ZmCCT expression in one day was also significantly higher than that of ZmCCT-F7 (P<0.01) (Fig. 7).
另外,对两个NILs的cDNA进行PCR检测和测序,发现这个转座子插入并没有产生新的转录本(图8、表1)。据此,推断亲本F7内含子区域的LINE/L1转座子插入,抑制了ZmCCT的表达量,进而提前了开花期,使包含ZmCCT-F7等位基因的NILs表现出早熟表型。此外,NILs-F7与NILs-OGD在多个株型性状上具有显著性差异,相对于NILs-OGD,NILs-F7的株高和穗位高分别降低22和18厘米,穗上叶片数减少1.1片(P<0.01)(图6)。In addition, PCR detection and sequencing of the cDNAs of the two NILs revealed that this transposon insertion did not generate new transcripts (Fig. 8, Table 1). Based on this, it is inferred that the insertion of the LINE/L1 transposon in the intron region of the parental F7 inhibited the expression of ZmCCT, thereby advancing the flowering period, and making the NILs containing the ZmCCT-F7 allele show a precocious phenotype. In addition, NILs-F7 and NILs-OGD had significant differences in several plant morphological traits. Compared with NILs-OGD, the plant height and ear height of NILs-F7 decreased by 22 and 18 cm, respectively, and the number of leaves on the ear decreased by 1.1 slices (P<0.01) (Fig. 6).
因此,可以通过检测待测玉米基因组中的ZmCCT基因是否有LINE/L1转座子判断待测玉米的开花时间早晚,含有LINE/L1转座子的玉米的开花时间早于不含有LINE/L1转座子的玉米;Therefore, it is possible to judge whether the ZmCCT gene in the genome of the maize to be tested has LINE/L1 transposon to determine whether the flowering time of the maize to be tested is sooner or later, and the flowering time of the maize containing the LINE/L1 transposon is earlier than that without the LINE/L1 transposon. the corn of the seat;
或,通过检测待测玉米基因组中的ZmCCT基因是否有LINE/L1转座子判断待测玉米的株高,含有LINE/L1转座子的玉米的株高低于不含有LINE/L1转座子的玉米;Or, by detecting whether the ZmCCT gene in the genome of the maize to be tested has a LINE/L1 transposon, the plant height of the maize to be tested is judged, and the plant height of the maize containing the LINE/L1 transposon is lower than that of the maize that does not contain the LINE/L1 transposon. corn;
或,通过检测待测玉米基因组中的ZmCCT基因是否有LINE/L1转座子判断待测玉米的穗位高,含有LINE/L1转座子的玉米的穗位高低于不含有LINE/L1转座子的玉米。Or, by detecting whether the ZmCCT gene in the maize genome to be tested has a LINE/L1 transposon, the ear height of the maize to be tested is judged, and the ear height of the maize containing the LINE/L1 transposon is lower than that without the LINE/L1 transposon. of corn.
三、ZmCCT-FO基因的获得及应用3. Acquisition and application of ZmCCT-FO gene
将上述一中的F4到F7代单株进行5.1-kb的转座子插入检测和4.2-kb的LINE/L1转座子插入检测;Perform 5.1-kb transposon insertion detection and 4.2-kb LINE/L1 transposon insertion detection on the F 4 to F 7 generation individual plants in the above-mentioned one;
5.1-kb的转座子插入检测采用引物为表1中的M7-1和M7-2,引物M7-1扩增无条带,且M7-2扩增有条带的单株为5.1-kb的转座子纯合插入单株。5.1-kb transposon insertion detection using primers M7-1 and M7-2 in Table 1, primer M7-1 amplified no band, and M7-2 amplified a single plant with a band of 5.1-kb homozygous insertion of the transposon into a single plant.
4.2-kb的LINE/L1转座子采用表2中的IIA和IIB,引物IIA扩增有条带,且IIB扩增无条带的单株为LINE/L1转座子纯合插入单株。4. The 2-kb LINE/L1 transposon uses IIA and IIB in Table 2, the primer IIA amplified with a band, and the IIB amplified the individual plant without the band is the LINE/L1 transposon homozygous insertion individual plant.
选取ZmCCT基因启动子区域有5.1-kb的转座子纯合插入且内含子区域有4.2-kb的LINE/L1转座子纯合插入的单株,其具有的ZmCCT等位基因,命名为ZmCCT-FO,ZmCCT-FO等位基因的核苷酸序列为序列14(序列14第2292-6461位为LINE/L1转座子)。A single plant with a 5.1-kb transposon homozygous insertion in the promoter region of the ZmCCT gene and a 4.2-kb LINE/L1 transposon homozygous insertion in the intron region was selected. The ZmCCT allele was named as ZmCCT-FO, the nucleotide sequence of the ZmCCT-FO allele is sequence 14 (position 2292-6461 of sequence 14 is the LINE/L1 transposon).
选取ZmCCT基因启动子区域有5.1-kb的转座子纯合插入且内含子区域没有4.2-kb的LINE/L1转座子插入的单株,其具有的ZmCCT等位基因为ZmCCT-OGD(序列2)。A single plant with a 5.1-kb transposon homozygous insertion in the ZmCCT gene promoter region and no 4.2-kb LINE/L1 transposon insertion in the intron region was selected, and the ZmCCT allele it had was ZmCCT-OGD ( sequence 2).
选取ZmCCT基因启动子区域没有5.1-kb的转座子插入且内含子区域有4.2-kb的LINE/L1转座子纯合插入的单株,其具有的ZmCCT等位基因为ZmCCT-F7(序列1,其中,第2292-6461位为LINE/L1转座子)。A single plant with no 5.1-kb transposon insertion in the promoter region of the ZmCCT gene and a 4.2-kb LINE/L1 transposon homozygous insertion in the intron region was selected, and the ZmCCT allele it had was ZmCCT-F7 (
将在高纬度长日照的北京地区的具有ZmCCT-FO等位基因的单株、具有ZmCCT-OGD等位基因的单株和具有ZmCCT-F7等位基因的单株进行开花期检测和实时荧光定量PCR检测(引物为表1中的ZmCCT)。The individual plants with ZmCCT-FO alleles, the individual plants with ZmCCT-OGD alleles, and the individual plants with ZmCCT-F7 alleles in the high-latitude long-day Beijing area were subjected to flowering stage detection and real-time fluorescence quantification PCR detection (primers are ZmCCT in Table 1).
结果如图9所示,可以看出,相对于另外两个等位基因ZmCCT-F7和ZmCCT-OGD,ZmCCT-FO在高纬度长日照的北京地区,ZmCCT的表达量最低,开花期最早,成熟期最短(图9)。The results are shown in Figure 9. It can be seen that compared with the other two alleles, ZmCCT-F7 and ZmCCT-OGD, ZmCCT-FO has the lowest expression level of ZmCCT in the high-latitude and long-day Beijing area, the earliest flowering period, and mature the shortest period (Figure 9).
因此,也可以采用检测基因组中的ZmCCT基因是否为ZmCCT-FO等位基因、ZmCCT-F7等位基因或ZmCCT-OGD等位基因鉴定待测玉米的开花时间:含有等位基因ZmCCT-FO的待测玉米开花时间早于或候选早于含有等位基因ZmCCT-F0和ZmCCT-OGD的待测玉米。Therefore, it is also possible to detect whether the ZmCCT gene in the genome is the ZmCCT-FO allele, the ZmCCT-F7 allele or the ZmCCT-OGD allele to identify the flowering time of the maize to be tested: the waiting time containing the allele ZmCCT-FO The flowering time of the tested maize was earlier or candidate earlier than that of the tested maize containing alleles ZmCCT-F0 and ZmCCT-OGD.
上述符合如下判断标准:基因组中的ZmCCT基因含有LINE/L1转座子的待测玉米开花时间早于或候选早于基因组中的ZmCCT基因不含有LINE/L1转座子的待测玉米。The above meets the following judgment criteria: the flowering time of the tested maize whose ZmCCT gene in the genome contains the LINE/L1 transposon is earlier or the candidate is earlier than the tested maize whose ZmCCT gene in the genome does not contain the LINE/L1 transposon.
四、LINE/L1转座子或ZmCCT-FO等位基因、ZmCCT-F7等位基因或ZmCCT-OGD等位基因在鉴定待测玉米的开花时间早晚、株高或穗位高方法的建立4. Establishment of a method for identifying the flowering time, plant height or ear height of LINE/L1 transposon or ZmCCT-FO allele, ZmCCT-F7 allele or ZmCCT-OGD allele
A、LINE/L1转座子的应用A. Application of LINE/L1 transposon
1、提取待测玉米的基因组DNA1. Extract the genomic DNA of the corn to be tested
提取待测玉米叶片组织的基因组DNA。Extract genomic DNA from the maize leaf tissue to be tested.
2、PCR鉴定转座子2. PCR identification of transposons
根据LINE/L1转座子上下游序列,设计如下表2所示的引物IIA和IIB,其中IIA为IIA1或IIA2,IIB为IIB1或IIB2;According to the upstream and downstream sequences of the LINE/L1 transposon, design primers IIA and IIB as shown in Table 2 below, where IIA is IIA1 or IIA2, and IIB is IIB1 or IIB2;
表2为鉴定转座子所用的引物IIA和IIB的序列Table 2 shows the sequences of primers IIA and IIB used to identify transposons
以上述1得到的基因组DNA为模板,分别用表2所示的引物对IIA1/IIB1进行PCR扩增,得到PCR扩增产物。Using the genomic DNA obtained in the above 1 as a template, PCR amplification was performed on IIA1/IIB1 with the primer pairs shown in Table 2, respectively, to obtain PCR amplification products.
上述PCR扩增的反应体系如下表3所示:The reaction system of above-mentioned PCR amplification is as shown in Table 3 below:
表3为PCR扩增的反应体系Table 3 is the reaction system of PCR amplification
上述PCR扩增反应程序如下表4所示:The above-mentioned PCR amplification reaction program is shown in Table 4 below:
表4为PCR扩增的反应程序Table 4 is the reaction program of PCR amplification
*注:引物IIA使用30s延伸时间,IIB使用1min延伸时间*Note: Primer IIA uses 30s extension time, IIB uses 1min extension time
将上述扩增产物用琼脂糖凝胶电泳检测扩增产物大小。The size of the amplified product was detected by agarose gel electrophoresis.
如果引物IIA扩增有条带,且IIB扩增无条带,则说明待测玉米基因组中2条染色体的ZmCCT基因均含有LINE/L1转座子,为LINE/L1转座子纯合型;If primer IIA has a band in amplification, and IIB has no band in amplification, it means that the ZmCCT genes of the two chromosomes in the maize genome to be tested both contain LINE/L1 transposon and are homozygous for LINE/L1 transposon;
如果引物IIB扩增有条带,且IIA扩增无条带,则说明待测玉米基因组中2条染色体的ZmCCT基因均不含有LINE/L1转座子,为缺失LINE/L1转座子纯合型;If the primer IIB has a band, and the IIA has no band, it means that the ZmCCT gene of the two chromosomes in the maize genome to be tested does not contain the LINE/L1 transposon, and it is homozygous for the deletion of the LINE/L1 transposon. type;
如果引物IIA和引物IIB二者都有条带,说明待测玉米基因组中一条染色体的ZmCCT基因有LINE/L1转座子插入,而另一条染色体的没有LINE/L1转座子插入,为LINE/L1转座子杂合型。If both primer IIA and primer IIB have bands, it means that the ZmCCT gene of one chromosome in the tested maize genome has LINE/L1 transposon insertion, while the other chromosome has no LINE/L1 transposon insertion, which is LINE/ L1 transposon heterozygous.
上述引物IIA为引物IIA1或引物IIA2,上述引物IIB为引物IIB1或引物IIB2。The aforementioned primer IIA is primer IIA1 or primer IIA2, and the aforementioned primer IIB is primer IIB1 or primer IIB2.
IIA1引物的目的扩增片段大小为493bp,IIB1引物的目的扩增片段大小为1007bp。The size of the target amplified fragment of the IIA1 primer was 493 bp, and the size of the target amplified fragment of the IIB1 primer was 1007 bp.
IIA2引物的目的扩增片段大小为591bp,IIB2引物的目的扩增片段大小为1588bp。The size of the target amplified fragment of the IIA2 primer is 591 bp, and the size of the target amplified fragment of the IIB2 primer is 1588 bp.
基因组中的ZmCCT基因含有LINE/L1转座子的待测玉米开花时间早于或候选早于基因组中的ZmCCT基因不含有LINE/L1转座子的待测玉米;The flowering time of the tested maize whose ZmCCT gene in the genome contains LINE/L1 transposon is earlier or the candidate is earlier than the tested maize whose ZmCCT gene in the genome does not contain LINE/L1 transposon;
或,基因组中的ZmCCT基因含有LINE/L1转座子的待测玉米株高低于或候选低于基因组中的ZmCCT基因不含有LINE/L1转座子的待测玉米;Or, the ZmCCT gene in the genome contains LINE/L1 transposon to be tested maize plant height is lower than or candidate is lower than the ZmCCT gene in the genome does not contain LINE/L1 transposon to be tested maize;
或,基因组中的ZmCCT基因含有LINE/L1转座子的待测玉米穗位高低于或候选低于基因组中的ZmCCT基因不含有LINE/L1转座子的待测玉米。Or, the maize to be tested whose ZmCCT gene in the genome contains LINE/L1 transposon is lower than or candidate is lower than the maize to be tested whose ZmCCT gene in the genome does not contain LINE/L1 transposon.
B、检测B. to detect
1、提取待测玉米的基因组DNA1. Extract the genomic DNA of the corn to be tested
提取待测玉米叶片组织的基因组DNA。Extract genomic DNA from the maize leaf tissue to be tested.
2、检测待测玉米中的ZmCCT基因为ZmCCT-FO等位基因、ZmCCT-F7等位基因还是ZmCCT-OGD等位基因2. Detect whether the ZmCCT gene in the corn to be tested is the ZmCCT-FO allele, the ZmCCT-F7 allele or the ZmCCT-OGD allele
可以通过测序检测,也可以通过如下方法检测:It can be detected by sequencing or by the following methods:
ZmCCT基因启动子区域有5.1-kb的转座子纯合插入且内含子区域有4.2-kb的LINE/L1转座子纯合插入的单株,其具有的ZmCCT等位基因ZmCCT-FO。A single plant with a 5.1-kb transposon homozygous insertion in the promoter region of the ZmCCT gene and a 4.2-kb LINE/L1 transposon homozygous insertion in the intron region has the ZmCCT allele ZmCCT-FO.
ZmCCT基因启动子区域有5.1-kb的转座子纯合插入且内含子区域没有4.2-kb的LINE/L1转座子插入的单株,其具有ZmCCT等位基因ZmCCT-OGD。A single plant with a 5.1-kb transposon homozygous insertion in the promoter region of the ZmCCT gene and no 4.2-kb LINE/L1 transposon insertion in the intron region has the ZmCCT allele ZmCCT-OGD.
ZmCCT基因启动子区域没有5.1-kb的转座子插入且内含子区域有4.2-kb的LINE/L1转座子纯合插入的单株,其具有ZmCCT等位基因ZmCCT-F7。A single plant without a 5.1-kb transposon insertion in the promoter region of the ZmCCT gene and a homozygous insertion of a 4.2-kb LINE/L1 transposon in the intron region has the ZmCCT allele ZmCCT-F7.
上述鉴定5.1-kb的转座子插入采用引物为表1中的M7-1和M7-2,引物M7-1扩增无条带,且M7-2扩增有条带的单株为5.1-kb的转座子纯合插入单株。The above-identified 5.1-kb transposon insertion used primers M7-1 and M7-2 in Table 1, primer M7-1 amplified no band, and M7-2 amplified a single plant with a band of 5.1- The kb transposon was homozygous for insertion into the individual plant.
上述鉴定4.2-kb的LINE/L1转座子插入采用表2中的IIA和IIB,引物IIA扩增有条带,且IIB扩增无条带的单株为LINE/L1转座子纯合插入单株。The above-identified LINE/L1 transposon insertion of 4.2-kb adopts IIA and IIB in Table 2, the primer IIA amplified with a band, and the individual plant with no band amplified by IIB was a LINE/L1 transposon homozygous insertion single plant.
含有等位基因ZmCCT-FO的待测玉米开花时间早于或候选早于含有等位基因ZmCCT-F0或ZmCCT-OGD的待测玉米。The flowering time of the tested maize containing the allele ZmCCT-FO was earlier or candidate earlier than that of the tested maize containing the allele ZmCCT-F0 or ZmCCT-OGD.
或,含有等位基因ZmCCT-FO的待测玉米株高低于或候选低于含有等位基因ZmCCT-F0的待测玉米或等位基因ZmCCT-OGD的待测玉米。Or, the plant height of the maize to be tested containing the allele ZmCCT-FO is lower than or candidate is lower than the maize to be tested containing the allele ZmCCT-F0 or the maize to be tested containing the allele ZmCCT-OGD.
或,含有等位基因ZmCCT-FO的待测玉米穗位高高低于或候选低于含有等位基因ZmCCT-F0的待测玉米或含有等位基因ZmCCT-OGD的待测玉米。Or, the height of the corn to be tested containing the allele ZmCCT-FO is lower than or candidate is lower than the corn to be tested containing the allele ZmCCT-F0 or the corn to be tested containing the allele ZmCCT-OGD.
实施例2、LINE/L1转座子在鉴定待测玉米的开花时间早晚、株高和穗位高中的应用
1、提取待测玉米的基因组DNA1. Extract the genomic DNA of the corn to be tested
提取待测玉米叶片组织的基因组DNA。Extract genomic DNA from the maize leaf tissue to be tested.
待测玉米为表5中的各个家系,每个家系分别有22、32、28、36、44、32、72、63、69和147株:The maize to be tested is each family in Table 5, and each family has 22, 32, 28, 36, 44, 32, 72, 63, 69 and 147 strains respectively:
1)ft1-1*、ft1-2*,ft2-1*、ft2-2*,ft3-1*、ft3-2*按照如下方法制备:1) ft1-1*, ft1-2*, ft2-1*, ft2-2*, ft3-1*, ft3-2* are prepared as follows:
亲本是F7和OGD,两个亲本杂交获得F1代,F1自交7代,得到F7;从F7中选取ZmCCT基因保持杂合的F7代单株(采用表1中M2进行鉴定,M2位于ZmCCT基因上,M2扩增获得2条杂合条带就代表ZmCCT基因保持杂合),将选取的这些F7代单株自交获得的全部后代所组成的家系,ft1-1*和ft1-2*,ft2-1*和ft2-2*,ft3-1*和ft3-2*分别来自3个不同的F7代单株。The parents are F7 and OGD, the two parents are crossed to obtain F 1 generation, F 1 is self-crossed for 7 generations, and F 7 is obtained; from F 7 , select the F 7 generation individual plant with the ZmCCT gene remaining heterozygous (using M2 in Table 1 to identify , M2 is located on the ZmCCT gene, and M2 amplifies and obtains 2 heterozygous bands on behalf of the ZmCCT gene to maintain heterozygosity), the family composed of all progeny obtained by selfing of these F 7 generations of individual plants, ft1-1* and ft1-2*, ft2-1* and ft2-2*, ft3-1* and ft3-2* from 3 different F 7 generation individual plants, respectively.
2)ft4-1**和ft 4-2**按照如下方法制备:2) ft4-1** and ft 4-2** were prepared as follows:
亲本是F7和OGD,两个亲本杂交获得F1代,F1自交4代,得到F5代;从F5代中选取ZmCCT基因的启动子区域具有已知的5.1-kb转座子纯合插入(引物为表1中的M7-1和M7-2,引物M7-1扩增无条带,且M7-2扩增有条带的单株为具有已知的5.1-kb转座子纯合插入的单株),且LINE/L1转座子插入保持杂合的单株(引物为表2中的IIA和IIB,引物IIA扩增有条带,且IIB扩增有条带的单株为4.2-kb LINE/L1转座子插入保持杂合的单株),将选取的这些F5代单株自交获得的全部后代所组成的家系,ft4-1**和ft 4-2**来自同1个F5代单株;The parents are F7 and OGD. The two parents were crossed to obtain the F 1 generation, and the F 1 self-crossed for 4 generations to obtain the F 5 generation; from the F 5 generation, the promoter region of the ZmCCT gene was selected to have a known pure 5.1-kb transposon. Co-insertion (primers are M7-1 and M7-2 in Table 1, primer M7-1 amplifies no band, and M7-2 amplifies a single strain with a known 5.1-kb transposon Homozygous inserted individual), and the LINE/L1 transposon insertion remained heterozygous individual (primers were IIA and IIB in Table 2, primer IIA amplified a band, and IIB amplified a band with a single clone). The 4.2-kb LINE/L1 transposon was inserted into the individual plant to maintain heterozygosity), the family composed of all the progeny obtained by selfing of these F 5 generation individual plants, ft4-1** and ft4-2 **From the same F 5th generation single plant;
3)ft5-1***和ft5-2***按照如下方法制备:3) ft5-1*** and ft5-2*** are prepared as follows:
亲本是F7和OGD,两个亲本杂交获得F1代,F1自交6代,得到F7代;从F7代中选取ZmCCT基因的启动子区域具有已知的5.1-kb转座子纯合插入的株系(引物为表1中的M7-1和M7-2,引物M7-1扩增无条带,且M7-2扩增有条带的单株为具有已知的5.1-kb转座子纯合插入的单株)。The parents are F7 and OGD, the two parents were crossed to obtain F 1 generation, F 1 was selfed for 6 generations, and F 7 generation was obtained; the promoter region of ZmCCT gene was selected from the F 7 generation with a known pure 5.1-kb transposon. The inserted strains (primers are M7-1 and M7-2 in Table 1, primer M7-1 amplifies no band, and M7-2 amplifies a single strain with a known 5.1-kb homozygous insertion of the transposon).
2、PCR鉴定转座子2. PCR identification of transposons
方法与实施例1的四的A相同。The method is the same as A of Example 1.
基因组中的ZmCCT基因含有LINE/L1转座子的待测玉米开花时间早于或候选早于基因组中的ZmCCT基因不含有LINE/L1转座子的待测玉米。The flowering time of the tested maize whose ZmCCT gene in the genome contains the LINE/L1 transposon is earlier or the candidate is earlier than the tested maize whose ZmCCT gene in the genome does not contain the LINE/L1 transposon.
基因组中的ZmCCT基因含有LINE/L1转座子的待测玉米株高低于或候选低于基因组中的ZmCCT基因不含有LINE/L1转座子的待测玉米。The ZmCCT gene in the genome contains the LINE/L1 transposon to be tested, the plant height is lower or the candidate is lower than the tested maize that the ZmCCT gene in the genome does not contain the LINE/L1 transposon.
基因组中的ZmCCT基因含有LINE/L1转座子的待测玉米穗位高低于或候选低于基因组中的ZmCCT基因不含有LINE/L1转座子的待测玉米。The ZmCCT gene in the genome contains the LINE/L1 transposon to be tested, and the ear height of the tested maize is lower or the candidate is lower than the ZmCCT gene in the genome does not contain the LINE/L1 transposon to be tested.
结果如表5所示,其中,-1株系均为缺失LINE/L1转座子插入的全部单株,-2株系均为具有LINE/L1转座子纯合插入的全部单株;可以看出,LINE/L1转座子纯合插入的全部单株的开花时间早于缺失LINE/L1转座子插入的全部单株,LINE/L1转座子纯合插入的全部单株的株高、穗位高均低于缺失LINE/L1转座子插入的全部单株,-1株系和对应的-2株系具有显著性差异。The results are shown in Table 5. Among them, the -1 lines are all the individual plants that lack the LINE/L1 transposon insertion, and the -2 lines are all the individual plants that have the LINE/L1 transposon homozygous insertion; It can be seen that the flowering time of all the individual plants with homozygous insertion of the LINE/L1 transposon is earlier than that of all the individual plants with the deletion of the LINE/L1 transposon insertion, and the plant height of all the individual plants with the homozygous insertion of the LINE/L1 transposon The height of ear position and ear position were lower than all the individual plants with the deletion of LINE/L1 transposon insertion, and the -1 line and the corresponding -2 line were significantly different.
表5为鉴定结果Table 5 shows the identification results
SEQUENCE LISTING SEQUENCE LISTING
<110>中国农业大学<110> China Agricultural University
<120> 一种转座子及其应用<120> A kind of transposon and its application
<160> 14<160> 14
<170> PatentIn version 3.5<170> PatentIn version 3.5
<210> 1<210> 1
<211> 6730<211> 6730
<212> DNA<212> DNA
<213> Artificial sequence<213> Artificial sequence
<400> 1<400> 1
atgtcgtcgg ggccagcagc atgcggtgtg tgcggcgcgg ccgcctgctg cccgcacctc 60atgtcgtcgg ggccagcagc atgcggtgtg tgcggcgcgg ccgcctgctg cccgcacctc 60
ttgcacaccg gtgacggcaa cgacgacgac ctcatcagcc gggccttctt ctccgtcttc 120ttgcacaccg gtgacggcaa cgacgacgac ctcatcagcc gggccttctt ctccgtcttc 120
cctgtcgtcg gtcatcaccg tcgtcatgag tccaccagca gccccgccat gcagcagcca 180cctgtcgtcg gtcatcaccg tcgtcatgag tccaccagca gccccgccat gcagcagcca 180
tcggggtgcc tgcacgagtt ccagttcttt ggccatcagg acgaccacca ccaccaagaa 240tcggggtgcc tgcacgagtt ccagttcttt ggccatcagg acgaccacca ccaccaagaa 240
accatcgcct ggctcttgga ccacccaccg ccacctgcgc ccgagcttgg cggcgacgac 300accatcgcct ggctcttgga ccacccaccg ccacctgcgc ccgagcttgg cggcgacgac 300
ggcccgtccc cagctggtga tgagaacgac gaccagcctg cgtttcaccc gtttgggaca 360ggcccgtccc cagctggtga tgagaacgac gaccagcctg cgtttcaccc gtttgggaca 360
ccacagtacc accaccccgg aaaagggaac gggaacgggc tcacctttga gctggacgcc 420ccacagtacc accaccccgg aaaagggaac gggaacgggc tcacctttga gctggacgcc 420
acgctgggcc tcggcaccgc gcggcaaacc actgagacag cagaagcaag cgccaccatc 480acgctgggcc tcggcaccgc gcggcaaacc actgagacag cagaagcaag cgccaccatc 480
gtaagtattg ctcccgaatt atcttaagta agttcagata attcacatgc atggtttcta 540gtaagtattg ctcccgaatt atcttaagta agttcagata attcacatgc atggtttcta 540
attggaattt ggtcccaagc tggacaccct ttttttatct tccgttttct caactctctt 600attggaattt ggtcccaagc tggacaccct ttttttatct tccgttttct caactctctt 600
atcgatcacc tgcataaagg acctttgtat caagtaccaa gagatcttgc catgagttgc 660atcgatcacc tgcataaagg acctttgtat caagtaccaa gagatcttgc catgagttgc 660
actttacgca catttttttt tctttttttt tttcaggaac gtactactct tcctatatat 720actttacgca cattttttttt tctttttttt tttcaggaac gtactactct tcctatatat 720
caatatatgt aaacaagatt aacatgcatg tttctaacct ttctcaaaga caaaagacac 780caatatatgt aaacaagatt aacatgcatg tttctaacct ttctcaaaga caaaagacac 780
tctggtgcac gaaatggatg gaagaaacca gatcattaat atatgcctca caacctcttc 840tctggtgcac gaaatggatg gaagaaacca gatcattaat atatgcctca caacctcttc 840
atgaatttaa tttgatgtgg aaagaataaa aacgacggtt ccggttgtta acccaatatt 900atgaatttaa tttgatgtgg aaagaataaa aacgacggtt ccggttgtta acccaatatt 900
caatgatatc ctgaacaaaa ctagctatag atctcaatca tagcatcagg catcagcgct 960caatgatatc ctgaacaaaa ctagctatag atctcaatca tagcatcagg catcagcgct 960
tccaaagttc tcacctgact ttttttttac tcaatctcca gattatattt ccttcctaca 1020tccaaagttc tcacctgact ttttttttac tcaatctcca gattatattt ccttcctaca 1020
aagagtcgga gagaacatag ccatgagtta aatcactgat gttgtaaata cagaccagta 1080aagagtcgga gagaacatag ccatgagtta aatcactgat gttgtaaata cagaccagta 1080
gtcaaaagca ttgactatac actaaaacta ttgttcaagg tcactatttc acaaaaaaat 1140gtcaaaagca ttgactatac actaaaacta ttgttcaagg tcactatttc acaaaaaaat 1140
tcattgccta tttgatagtt tgattgagag gtagcaatat tgccaaattt atttttcacg 1200tcattgccta tttgatagtt tgattgagag gtagcaatat tgccaaattt atttttcacg 1200
tacctagaca aaagtcggta gcaatattgc caattttatt gctccatcgt catatgcatc 1260tacctagaca aaagtcggta gcaatattgc caattttatt gctccatcgt catatgcatc 1260
ccgaagtcta ttattgctgt aatgacaaga tacagatctt ttatattgtg atatacttac 1320ccgaagtcta ttattgctgt aatgacaaga tacagatctt ttatattgtg atatacttac 1320
ttaagtttta tattgaagat aaaagggaga aagcagcttg cctccctttc tttttcttca 1380ttaagtttta tattgaagat aaaagggaga aagcagcttg cctccctttc tttttcttca 1380
ccactatata tattggattg tttcttcacc actatatata caagaaaata ttaatatctg 1440ccactatata tattggattg tttcttcacc actatatata caagaaaata ttaatatctg 1440
cagtacatat ttagtgtcat taaatatgtc ttttgaaact attttcataa taaacatatt 1500cagtacatat ttagtgtcat taaatatgtc ttttgaaact attttcataa taaacatatt 1500
tgaagataca tgtattgcaa atatttttta cgaatctaat caaatatgag aaattttgac 1560tgaagataca tgtattgcaa atatttttta cgaatctaat caaatatgag aaattttgac 1560
tgacatgtat gaccatacta tcaattattt taggacagag gcatggagtg cgcatttgta 1620tgacatgtat gaccatacta tcaattattt taggacagag gcatggagtg cgcatttgta 1620
tggtcgaaat cgatcaattg taaccatata tgcatgtacg tttggtacgc ccactgatgt 1680tggtcgaaat cgatcaattg taaccatata tgcatgtacg tttggtacgc ccactgatgt 1680
atctacctgg ttaattaatt agatgaccta gcttgtcgtc tgattgttat gattaaagaa 1740atctacctgg ttaattaatt agatgaccta gcttgtcgtc tgattgttat gattaaagaa 1740
ccaaaaagtc tactcagctc aaaacccaaa tatatatgtg tcaaacaact cccatgcaca 1800ccaaaaagtc tactcagctc aaaacccaaa tatatatgtg tcaaacaact cccatgcaca 1800
tgtccagctg tgtctaaatc tatcccgaag gattgtccat gccaaagttt gatgaaatag 1860tgtccagctg tgtctaaatc tatcccgaag gattgtccat gccaaagttt gatgaaatag 1860
ataataagtt gtctcatttt atgtggttcg tttttgcaga tttgctgctt actctttcgt 1920ataataagtt gtctcatttt atgtggttcg tttttgcaga tttgctgctt actctttcgt 1920
atacttggat tttataggga actaatatat acatatgatt ataattaatg cactttattc 1980atacttggat tttataggga actaatatat acatatgatt ataattaatg cactttattc 1980
cgtgccacat gtagatgaat aacgcaatca catggcttaa gatctaatat tctaccccaa 2040cgtgccacat gtagatgaat aacgcaatca catggcttaa gatctaatat tctaccccaa 2040
aacaaatcga gctaccaagg cgatatctga tgttcatcag gcatgcatgt aggcccattc 2100aacaaatcga gctaccaagg cgatatctga tgttcatcag gcatgcatgt aggcccattc 2100
agcatatcaa gcaaagtaca gattcttatc caaaccatgc atatacatat gaccaaagta 2160agcatatcaa gcaaagtaca gattcttatc caaaccatgc atatacatat gaccaaagta 2160
ctaattaatt agttgcctgc agttattagc tgtccaaaat ttgctttgat catcatgcaa 2220ctaattaatt agttgcctgc agttattagc tgtccaaaat ttgctttgat catcatgcaa 2220
taatatacac atgcagaaac taaaatgaat aacatatata aatccatgca tgcacatgca 2280taatatacac atgcagaaac taaaatgaat aacatatata aatccatgca tgcacatgca 2280
gcatactttt tttttgaaaa ctggcaagag aattgcctgt tatattaaaa agaaggtgag 2340gcatactttt tttttgaaaa ctggcaagag aattgcctgt tatattaaaa agaaggtgag 2340
agccagcgag ggctaaaata tacaagtaca tcacactagg gaatgagaca ttaagtatgg 2400agccagcgag ggctaaaata tacaagtaca tcacactagg gaatgagaca ttaagtatgg 2400
tgacacaaat gttccaaaac caacccctct ccttccccaa aacctacacc agccttgcta 2460tgacacaaat gttccaaaac caacccctct ccttccccaa aacctacacc agccttgcta 2460
agtgttttgc tcctcctaga atccacaggc gaatttgctc cttaatcatc gctatagtct 2520agtgttttgc tcctcctaga atccacaggc gaatttgctc cttaatcatc gctatagtct 2520
gtgtcacggt cttctctttg tgctcaaaga ttcgcgagtt tctctcacac caaattgtcc 2580gtgtcacggt cttctctttg tgctcaaaga ttcgcgagtt tctctcacac caaattgtcc 2580
aagacacgag gatgaggagg gttcgcaatc ccttcacgct tctcactgcg gatgttgcaa 2640aagacacgag gatgaggagg gttcgcaatc ccttcacgct tctcactgcg gatgttgcaa 2640
gattcgacca ccattggtgc accgaatcta caggttgcca agaagaatgg tggatctccg 2700gattcgacca ccattggtgc accgaatcta caggttgcca agaagaatgg tggatctccg 2700
taattgcagc ccaaatactc aaatcagccc aaatatgtct ggtgaagcaa cattcagcaa 2760taattgcagc ccaaatactc aaatcagccc aaatatgtct ggtgaagcaa cattcagcaa 2760
aaagatgtag tccagattct gggcatgagg cgcataggac acagcgcgga ttatgaggcc 2820aaagatgtag tccagattct gggcatgagg cgcataggac acagcgcgga ttatgaggcc 2820
atccccttct tgccaacctg tctgctgtcc aaactctatt ttgaacagca agccaactga 2880atccccttct tgccaacctg tctgctgtcc aaactctatt ttgaacagca agccaactga 2880
agaatttgca ctttggtggc gcccaagctt tccaaattat actattaaaa ttcctgcatg 2940agaatttgca ctttggtggc gcccaagctt tccaaattat actattaaaa ttcctgcatg 2940
tggagccata aaattgcgct tggtaggctg actgagttgt gtactggcca ctcatagtaa 3000tggagccata aaattgcgct tggtaggctg actgagttgt gtactggcca ctcatagtaa 3000
acttccactt gataaggtcg ggggtattgt gcattaagcg aagctgattt acctcgcacc 3060acttccactt gataaggtcg ggggtattgt gcattaagcg aagctgattt acctcgcacc 3060
ataattgaca atattcatgt atgtgcgcgg cagaaaaatc attgtgatta aaatgtaggt 3120ataattgaca atattcatgt atgtgcgcgg cagaaaaatc attgtgatta aaatgtaggt 3120
cctgaatcca tcggttgttt gagagagcct catggatgga tctatttttg ccgttagaaa 3180cctgaatcca tcggttgttt gagagagcct catggatgga tctatttttg ccgttagaaa 3180
tggagtaggt caaagggaag gtgtctctag gtgtcgtacc ccttagccag gcactttccc 3240tggagtaggt caaagggaag gtgtctctag gtgtcgtacc ccttagccag gcactttccc 3240
agaacaaaga agacttacca tcccccaccc tgactgaggt tgctgctgca aacagtctcc 3300agaacaaaga agacttacca tcccccaccc tgactgaggt tgctgctgca aacagtctcc 3300
tgtccaacat attgcaaggt atatctttcg ttatcctctg agtatttaac agtttggcct 3360tgtccaacat attgcaaggt atatctttcg ttatcctctg agtatttaac agtttggcct 3360
cctgccacaa ccatctgagg cgtaaggccc gcgagaaaaa accaaggtgc agaatgccta 3420cctgccacaa ccatctgagg cgtaaggccc gcgagaaaaa accaaggtgc agaatgccta 3420
aaccaccgta ttgttttggt cttgctgcac agacccagtt gactttgcat ttaccgccgg 3480aaccaccgta ttgttttggt cttgctgcac agacccagtt gactttgcat ttaccgccgg 3480
ttaatctctc tgaaccagcc caaagaaatt gcttacgttt tgagtcaatg aaatccaaca 3540ttaatctctc tgaaccagcc caaagaaatt gcttacgttt tgagtcaatg aaatccaaca 3540
ccctttgggg cggttttagg gctaggagca gataggtgac ctgcgaggtt agcactgcat 3600ccctttgggg cggttttagg gctaggagca gataggtgac ctgcgaggtt agcactgcat 3600
tgactagcgt gaggcggcca gccgccgaaa ggtttttccc gctccatgta ctcagctttg 3660tgactagcgt gaggcggcca gccgccgaaa ggttttttccc gctccatgta ctcagctttg 3660
cagagacttt gtcaatccat ggctggaagt ccacccgttt gagtctgttc atggacaaag 3720cagagacttt gtcaatccat ggctggaagt ccacccgttt gagtctgttc atggacaaag 3720
gtagtccaag gtatttaagg gggaaggaag tcctcgaggc cggtagacca gatagaacgt 3780gtagtccaag gtatttaagg gggaaggaag tcctcgaggc cggtagacca gatagaacgt 3780
cagagaggtt gatgttgtta cactggattg gcactactgt ggacttgtgg aaatttgttt 3840cagagaggtt gatgttgtta cactggattg gcactactgt ggacttgtgg aaatttgttt 3840
ttaggcctgt ggtttcacca aaaagttcca gaatccttgc aagcatagat acctcacctt 3900ttaggcctgt ggtttcacca aaaagttcca gaatccttgc aagcatagat acctcacctt 3900
ttgtgggcgt gacaaatatg accgcatcat ccgcaaacat cgatatgcgc aggtcgggcg 3960ttgtgggcgt gacaaatatg accgcatcat ccgcaaacat cgatatgcgc aggtcgggcg 3960
agcgaccatg gagcttggtg agcattccga attcagttgc tacttcgagg agtctttgca 4020agcgaccatg gagcttggtg agcattccga attcagttgc tacttcgagg agtctttgca 4020
ggggatctat tgcaatgaca aaaagaagag gagatagggg gtcgccttgc cttaggcctc 4080ggggatctat tgcaatgaca aaaagaagag gagataggggg gtcgccttgc cttaggcctc 4080
gcccgtgcca gatagggggg tttggtacgc cgttaaggat gactcttgag gttgaggtgg 4140gcccgtgcca gatagggggg tttggtacgc cgttaaggat gactcttgag gttgaggtgg 4140
agagaattgc cgcaatccat tcccgccacc ttatagggaa gcctaggtgc tcaaggaggg 4200agagaattgc cgcaatccat tcccgccacc ttatagggaa gcctaggtgc tcaaggaggg 4200
tgaggatata ctcccaacgt atcgaatcaa aggcttttgc tatgtccaat ttgaacaaga 4260tgaggatata ctcccaacgt atcgaatcaa aggcttttgc tatgtccaat ttgaacaaga 4260
gtgttggggt cttgtttgta tgaaaacgac gtgccgcggt acgcactgcc aagaagttat 4320gtgttggggt cttgtttgta tgaaaacgac gtgccgcggt acgcactgcc aagaagttat 4320
catgtatgct cctgttcttg atgaaggcgc tttgacaagt ggagacgatc gcgttcatat 4380catgtatgct cctgttcttg atgaaggcgc tttgacaagt ggagacgatc gcgttcatat 4380
ggggctgaag tcgtaatgcc agtatcttgg aaataagctt tatgaaggaa tgtattaagc 4440ggggctgaag tcgtaatgcc agtatcttgg aaataagctt tatgaaggaa tgtattaagc 4440
ttatcggtct aaaatcaccc acttcttcgg ccccttcttt ttttgggatt aggattacat 4500ttatcggtct aaaatcaccc acttcttcgg ccccttcttt ttttgggatt aggattacat 4500
tagcggtgtt gatgagtgat aagctgccac aacgtagagc atggaaagcg ttggccgccc 4560tagcggtgtt gatgagtgat aagctgccac aacgtagagc atggaaagcg ttggccgccc 4560
ttatgacatc cccttttatt atgctccagc acgtcttaaa aaataaccca gtgaaaccat 4620ttatgacatc cccttttatt atgctccagc acgtcttaaa aaataaccca gtgaaaccat 4620
caggtcccgg cgccttgtca atgggcaaca gatcaatggc tcttttgatc tcctcctctg 4680caggtcccgg cgccttgtca atgggcaaca gatcaatggc tcttttgatc tcctcctctg 4680
agaatggagc agctaaagaa gagagatcat ggtgtcgtaa tccaagcgtc gcccagttga 4740agaatggagc agctaaagaa gagagatcat ggtgtcgtaa tccaagcgtc gcccagttga 4740
agtctattct cggagcgggt ggacgactca acatattttc aaagtgagat tgtattttcg 4800agtctattct cggagcgggt ggacgactca acatattttc aaagtgagat tgtattttcg 4800
cggccttgca ttcatgtgtt gttgtcgagc cattttggtc tttgaggcaa tgaatgaaat 4860cggccttgca ttcatgtgtt gttgtcgagc cattttggtc tttgaggcaa tgaatgaaat 4860
tttttcgacg cctagaagtg atccttcgat gaaaaaatct agtgttagca tccccaaatt 4920tttttcgacg cctagaagtg atccttcgat gaaaaaatct agtgttagca tccccaaatt 4920
ttatcaaatt tagacgtgca gcctgttttt tccgcgcacg ttcgatgacc gcaaggccca 4980ttatcaaatt tagacgtgca gcctgttttt tccgcgcacg ttcgatgacc gcaaggccca 4980
gaattctttt cttaagtctg catcttagga gctcctctcc aggagagaga gctctcagct 5040gaattctttt cttaagtctg catcttagga gctcctctcc aggagagaga gctctcagct 5040
cctgtgctat gtcaaatcta tggattatct ccaaagccat atgaagttgc agcttagcat 5100cctgtgctat gtcaaatcta tggattatct ccaaagccat atgaagttgc agcttagcat 5100
ccgagatagt gttgaagctc cactgtctga gggcccttgc tgtagcctgc agtttgtgat 5160ccgagatagt gttgaagctc cactgtctga gggcccttgc tgtagcctgc agtttgtgat 5160
agagtctgtg gaaaggctcc tggtgtgtgc agtgagcgca ccaggacctc gacaccactt 5220agagtctgtg gaaaggctcc tggtgtgtgc agtgagcgca ccaggacctc gacaccactt 5220
ccatgaatcc tgggagcatg gcccaaaaat tctcaaattt aaaagagcgc gggcgtcggg 5280ccatgaatcc tgggagcatg gcccaaaaat tctcaaattt aaaagagcgc gggcgtcggg 5280
ggccagtttg gttagagagc agcagtggac agtgatccga gagcgaagaa gataggccgt 5340ggccagtttg gttagagagc agcagtggac agtgatccga gagcgaagaa gataggccgt 5340
gcagcacgtg gctatgaaaa gcttggtccc attcagcatt tgcaaagacc ctgtcaagct 5400gcagcacgtg gctatgaaaa gcttggtccc attcagcatt tgcaaagacc ctgtcaagct 5400
taataagggt tgggtttgta cgctcgttgc tccaagtgaa tcgtctattt tgcaggttaa 5460taataagggt tgggtttgta cgctcgttgc tccaagtgaa tcgtctattt tgcaggttaa 5460
tttccttcag gtcacaacag tctagcatat cactgaaacg gctcattagg ctaaggttca 5520tttccttcag gtcacaacag tctagcatat cactgaaacg gctcattagg ctaaggttca 5520
gacgcctctt attcttgtca ctagctttgt aaattagatt aaaatctccc aaaagcagcc 5580gacgcctctt attcttgtca ctagctttgt aaattagatt aaaatctccc aaaagcagcc 5580
acttaattcc ggattgcggt ttcagatctt gaatttcttg gagaaaggct gtcttcatgc 5640acttaattcc ggattgcggt ttcagatctt gaatttcttg gagaaaggct gtcttcatgc 5640
tattgcttgt aggcccataa accacagtta ataggaaggc agtatgggac gctgttagct 5700tattgcttgt aggcccataa accacagtta ataggaaggc agtatgggac gctgttagct 5700
tggcctttcc agaaatatgg aaatcaccaa ctgtaaaatc agtcagctcc acatggttgg 5760tggcctttcc agaaatatgg aaatcaccaa ctgtaaaatc agtcagctcc acatggttgg 5760
tatcccatag caatgcaatc cctccccgtg tgccgctggg gcctccagcc ggtttgcaga 5820tatcccatag caatgcaatc cctccccgtg tgccgctggg gcctccagcc ggtttgcaga 5820
agaatttgtc taggtggtgg cctccaaggt gccaggctgt tgtttggtcg aaggaagaca 5880agaatttgtc taggtggtgg cctccaaggt gccaggctgt tgtttggtcg aaggaagaca 5880
atttagtttc ttgcaaacaa gctaagtgac acctcgatga agtaattgtc tctctgaccg 5940atttagtttc ttgcaaacaa gctaagtgac acctcgatga agtaattgtc tctctgaccg 5940
tgtccttccg agcctgggaa ttgagacccc tcacattcca gcaaaaaacc ttaaggtcta 6000tgtccttccg agcctgggaa ttgagacccc tcacattcca gcaaaaaacc ttaaggtcta 6000
agtctgtcat tgggaaaaaa aggaggtacc ccttgaatca tagtttcctg ctctctcagc 6060agtctgtcat tgggaaaaaa aggaggtacc ccttgaatca tagtttcctg ctctctcagc 6060
acagtgacaa gacagtgtgc atggaaacca catatctggt gaatgaggtg cgtcgagaca 6120acagtgacaa gacagtgtgc atggaaacca catatctggt gaatgaggtg cgtcgagaca 6120
ttgatggtgc agggaatgca aaaggaccat acaagaggtg ttcacgccat acatcaaaca 6180ttgatggtgc agggaatgca aaaggaccat acaagaggtg ttcacgccat acatcaaaca 6180
catgcagtct gaagcctccg cacacaagcg aggcaggaat tcaaagccta aagttaacat 6240catgcagtct gaagcctccg cacacaagcg aggcaggaat tcaaagccta aagttaacat 6240
gatagaagca aaagaccctt aagcacagat cactggctca tgctgctgac tgcagctcct 6300gatagaagca aaagaccctt aagcacagat cactggctca tgctgctgac tgcagctcct 6300
caacgcgcag actcctgatg ccatcttgca agtctgcaac tccgtccccc atcatcgcaa 6360caacgcgcag actcctgatg ccatcttgca agtctgcaac tccgtccccc atcatcgcaa 6360
tcatagcatc ctcgaagctc ttctctgtcg acccttcgag gttgaaggcg gacgttaggg 6420tcatagcatc ctcgaagctc ttctctgtcg acccttcgag gttgaaggcg gacgttaggg 6420
ctgcaagtgt ctcctgtggc agagtccccg tggtcgtgta ctaattattg ctattaatta 6480ctgcaagtgt ctcctgtggc agagtccccg tggtcgtgta ctaattattg ctattaatta 6480
attgcagatg tcattctctg ggagcacatt cacggacgct gcaagcaagg agccagcact 6540attgcagatg tcattctctg ggagcacatt cacggacgct gcaagcaagg agccagcact 6540
gatcgacgac ggcaatgagc tgcaaatgcc ggtagatcag tcgtcgacgg agagggaggt 6600gatcgacgac ggcaatgagc tgcaaatgcc ggtagatcag tcgtcgacgg agagggaggt 6600
taagttgatg aggtacaagg agaagaggat gaggaggtgc tttgagaagc agataagata 6660taagttgatg aggtacaagg agaagaggat gaggaggtgc tttgagaagc agataagata 6660
tgcatccagg aaagcctatg cgcaggtgag acccagggtg aaaggccgct ttgccaaggt 6720tgcatccagg aaagcctatg cgcaggtgag acccagggtg aaaggccgct ttgccaaggt 6720
aaccgaatga 6730aaccgaatga 6730
<210> 2<210> 2
<211> 2561<211> 2561
<212> DNA<212> DNA
<213> Artificial sequence<213> Artificial sequence
<400> 2<400> 2
atgtcgtcgg ggccagcagc atgcggtgtg tgcggcgcgg ccgcctgctg cccgcacctc 60atgtcgtcgg ggccagcagc atgcggtgtg tgcggcgcgg ccgcctgctg cccgcacctc 60
ttgcacaccg gtgacggcaa cgacgacgac ctcatcagcc gggccttctt ctccgtcttc 120ttgcacaccg gtgacggcaa cgacgacgac ctcatcagcc gggccttctt ctccgtcttc 120
cctgtcgtcg gtcatcaccg tcgtcatgag tccaccagca gccccgccat gcagcagcca 180cctgtcgtcg gtcatcaccg tcgtcatgag tccaccagca gccccgccat gcagcagcca 180
tcggggtgcc tgcacgagtt ccagttcttt ggccaccagg acgaccacca ccaccaagaa 240tcggggtgcc tgcacgagtt ccagttcttt ggccaccagg acgaccacca ccaccaagaa 240
accatcgcct ggctcttgga ccacccaccg ccacctgcgc ccgagcttgg cggcgacgac 300accatcgcct ggctcttgga ccacccaccg ccacctgcgc ccgagcttgg cggcgacgac 300
ggcccgtccc tagctggtga tgagaacgac gaccagcctg cgtttcaccc gtttgggaca 360ggcccgtccc tagctggtga tgagaacgac gaccagcctg cgtttcaccc gtttgggaca 360
ccacagtacc accaccccgg aaaagggaac gggaacgggc tcacctttga gctggacgcc 420ccacagtacc accaccccgg aaaagggaac gggaacgggc tcacctttga gctggacgcc 420
acgctgggcc tcggcaccgc gcggcaaacc actgagacag cagaagcaag cgccaccatc 480acgctgggcc tcggcaccgc gcggcaaacc actgagacag cagaagcaag cgccaccatc 480
gtaagtattg ctcccgaatt atcttaagta agttcagata attcacatgc atggtttcta 540gtaagtattg ctcccgaatt atcttaagta agttcagata attcacatgc atggtttcta 540
attggaattt ggtcccaagc tggacaccct ttttttatct tccgttttct caactctctt 600attggaattt ggtcccaagc tggacaccct ttttttatct tccgttttct caactctctt 600
atcgatcacc tgcataaagg acctttgtat caagtaccaa gagatcttgc catgagttgc 660atcgatcacc tgcataaagg acctttgtat caagtaccaa gagatcttgc catgagttgc 660
actttacgca catttttttt ttcttttttt tttcaggaac gtactactct tcctatatat 720actttacgca cattttttttt ttcttttttt tttcaggaac gtactactct tcctatatat 720
caatatatgt aaacaagatt aacatgcatg tttctaacct ttctcaaaga caaaagacac 780caatatatgt aaacaagatt aacatgcatg tttctaacct ttctcaaaga caaaagacac 780
tctggtgcac gaaatggatg gaagaaacca gatcattaat atatgcctca caacctcttc 840tctggtgcac gaaatggatg gaagaaacca gatcattaat atatgcctca caacctcttc 840
atgaatttaa tttgatgtgg aaagaataaa aacgacggtt ccggttatta acccaatatt 900atgaatttaa tttgatgtgg aaagaataaa aacgacggtt ccggttatta acccaatatt 900
caatgatatc ctgaacaaaa ctagctatag atctcaatca tagcatcagg catcagcgct 960caatgatatc ctgaacaaaa ctagctatag atctcaatca tagcatcagg catcagcgct 960
tccaaagttc tcacctgact ttttttttac tcaatctcca gattatattt ccttcctaca 1020tccaaagttc tcacctgact ttttttttac tcaatctcca gattatattt ccttcctaca 1020
aagagtcgga gagaacatag ccatgagtta aatcactgat gttgtaaata cagaccagta 1080aagagtcgga gagaacatag ccatgagtta aatcactgat gttgtaaata cagaccagta 1080
gtcaaaagca ttgactatac actaaaacta ttgttcaagg tcactatttc acaaaaaaat 1140gtcaaaagca ttgactatac actaaaacta ttgttcaagg tcactatttc acaaaaaaat 1140
tcattgccta tttgatagtt tgattgagag gtagcaatat tgccaaattt atttttcacg 1200tcattgccta tttgatagtt tgattgagag gtagcaatat tgccaaattt atttttcacg 1200
tatctagaca aaatcggtag caatattgcc aattttattg ctccatcgtc atatccagcc 1260tatctagaca aaatcggtag caatattgcc aattttattg ctccatcgtc atatccagcc 1260
cgaagtctat tgctgtaatg acaagataca tatatatctt ttatattgtg atatacttac 1320cgaagtctat tgctgtaatg acaagataca tatatatctt ttatattgtg atatacttac 1320
ttaagttata tattgacgat aaaagggaga aagcagcttg cctccctttc tttcttcacc 1380ttaagttata tattgacgat aaaagggaga aagcagcttg cctccctttc tttcttcacc 1380
actatacata ttggattgtt tcttcaccac aatatacaag aaaatattaa tatctgcagt 1440actatacata ttggattgtt tcttcaccac aatatacaag aaaatattaa tatctgcagt 1440
acatatttgg tgtcattaaa tatgtctttt gaaactattt tcataataaa catatttgaa 1500acatatttgg tgtcattaaa tatgtctttt gaaactattt tcataataaa catatttgaa 1500
gatacagata tttcaaatat tttttacgaa tctaatcaaa tatgagaaat tttgactgac 1560gatacagata tttcaaatat tttttacgaa tctaatcaaa tatgagaaat tttgactgac 1560
atgtatgacc atactatcaa ttattttagg acagaggcag ggagtgcgca tttgtatggt 1620atgtatgacc atactatcaa ttattttagg acagaggcag ggagtgcgca tttgtatggt 1620
cgaaatcgat caattgtaac catatgcgtg tacgttggta cgcccactga tgtatctacc 1680cgaaatcgat caattgtaac catatgcgtg tacgttggta cgcccactga tgtatctacc 1680
tggttaatta attagatata tgacctagct tgtcgtctga ttgttatgat taaagaacca 1740tggttaatta attagatata tgacctagct tgtcgtctga ttgttatgat taaagaacca 1740
aaaagtctac tcagctcaaa acccaaatat atatgtgtca aacaactccc atgcacatgt 1800aaaagtctac tcagctcaaa acccaaatat atatgtgtca aacaactccc atgcacatgt 1800
ccagctgtga ctaaatctat cccgaaggat tgtccatgcc aaagtttgat gaaataggta 1860ccagctgtga ctaaatctat cccgaaggat tgtccatgcc aaagtttgat gaaataggta 1860
ataagttgtc tcattttatg tggttcattt gcagatttgc tgcttactct ttggtatact 1920ataagttgtc tcattttatg tggttcattt gcagatttgc tgcttactct ttggtatact 1920
tggattttat agggaactaa tatatacata tgattataat taatgcactt tattccgtgc 1980tggattttat agggaactaa tatatacata tgattataat taatgcactt tattccgtgc 1980
cacatgtaga cgatgaacaa ctcaatcaca tggcttaaga tctaactaat attctaccag 2040cacatgtaga cgatgaacaa ctcaatcaca tggcttaaga tctaactaat attctaccag 2040
cccaaacaaa tggtgctacc aaggcaatat ctgatgttca tcaggcatgc atgtagctag 2100cccaaacaaa tggtgctacc aaggcaatat ctgatgttca tcaggcatgc atgtagctag 2100
gcccattcag catatcaagc aaagtacaga ttcttatcca acaaacaata tatatgatca 2160gcccattcag catatcaagc aaagtacaga ttcttatcca acaaacaata tatatgatca 2160
aagtactaat tgattagttg cctgcagtta gctgtccaaa atttgctttg atcatcatgc 2220aagtactaat tgattagttg cctgcagtta gctgtccaaa atttgctttg atcatcatgc 2220
aataatatac acatgcagaa actaaaatgc aataacatat ataaatccat gcatgcatgc 2280aataatatac acatgcagaa actaaaatgc aataacatat ataaatccat gcatgcatgc 2280
agcatatata catactaatt gctattaatt aattgcagat gtcattctgt gggagcacat 2340agcatatata catactaatt gctattaatt aattgcagat gtcattctgt gggagcacat 2340
tcactgacgc tgcaagcaag gagccagcac tgatcgacga cggcaatgag ctgcaaatgc 2400tcactgacgc tgcaagcaag gagccagcac tgatcgacga cggcaatgag ctgcaaatgc 2400
cggtagatca gtcgtcgtcg gagagggagg ttaagttgat gaggtacaag gagaagagga 2460cggtagatca gtcgtcgtcg gagagggagg ttaagttgat gaggtacaag gagaagagga 2460
tgaggaggtg ctttgagaag cagataagat atgcatccag gaaagcctat gcgcaggtga 2520tgaggaggtg ctttgagaag cagataagat atgcatccag gaaagcctat gcgcaggtga 2520
gacccagggt gaaaggccgc tttgccaagg taaccgaatg a 2561gacccagggt gaaaggccgc tttgccaagg taaccgaatg a 2561
<210> 3<210> 3
<211> 240<211> 240
<212> PRT<212> PRT
<213> Artificial sequence<213> Artificial sequence
<400> 3<400> 3
Met Ser Ser Gly Pro Ala Ala Cys Gly Val Cys Gly Ala Ala Ala CysMet Ser Ser Gly Pro Ala Ala Cys Gly Val Cys Gly Ala Ala Ala Cys
1 5 10 151 5 10 15
Cys Pro His Leu Leu His Thr Gly Asp Gly Asn Asp Asp Asp Leu IleCys Pro His Leu Leu His Thr Gly Asp Gly Asn Asp Asp Asp Leu Ile
20 25 30 20 25 30
Ser Arg Ala Phe Phe Ser Val Phe Pro Val Val Gly His His Arg ArgSer Arg Ala Phe Phe Ser Val Phe Pro Val Val Gly His His Arg Arg
35 40 45 35 40 45
His Glu Ser Thr Ser Ser Pro Ala Met Gln Gln Pro Ser Gly Cys LeuHis Glu Ser Thr Ser Ser Pro Ala Met Gln Gln Pro Ser Gly Cys Leu
50 55 60 50 55 60
His Glu Phe Gln Phe Phe Gly His Gln Asp Asp His His His Gln GluHis Glu Phe Gln Phe Phe Gly His Gln Asp Asp His His His Gln Glu
65 70 75 8065 70 75 80
Thr Ile Ala Trp Leu Leu Asp His Pro Pro Pro Pro Ala Pro Glu LeuThr Ile Ala Trp Leu Leu Asp His Pro Pro Pro Pro Ala Pro Glu Leu
85 90 95 85 90 95
Gly Gly Asp Asp Gly Pro Ser Pro Ala Gly Asp Glu Asn Asp Asp GlnGly Gly Asp Asp Gly Pro Ser Pro Ala Gly Asp Glu Asn Asp Asp Gln
100 105 110 100 105 110
Pro Ala Phe His Pro Phe Gly Thr Pro Gln Tyr His His Pro Gly LysPro Ala Phe His Pro Phe Gly Thr Pro Gln Tyr His His Pro Gly Lys
115 120 125 115 120 125
Gly Asn Gly Asn Gly Leu Thr Phe Glu Leu Asp Ala Thr Leu Gly LeuGly Asn Gly Asn Gly Leu Thr Phe Glu Leu Asp Ala Thr Leu Gly Leu
130 135 140 130 135 140
Gly Thr Ala Arg Gln Thr Thr Glu Thr Ala Glu Ala Ser Ala Thr IleGly Thr Ala Arg Gln Thr Thr Glu Thr Ala Glu Ala Ser Ala Thr Ile
145 150 155 160145 150 155 160
Met Ser Phe Ser Gly Ser Thr Phe Thr Asp Ala Ala Ser Lys Glu ProMet Ser Phe Ser Gly Ser Thr Phe Thr Asp Ala Ala Ser Lys Glu Pro
165 170 175 165 170 175
Ala Leu Ile Asp Asp Gly Asn Glu Leu Gln Met Pro Val Asp Gln SerAla Leu Ile Asp Asp Gly Asn Glu Leu Gln Met Pro Val Asp Gln Ser
180 185 190 180 185 190
Ser Thr Glu Arg Glu Val Lys Leu Met Arg Tyr Lys Glu Lys Arg MetSer Thr Glu Arg Glu Val Lys Leu Met Arg Tyr Lys Glu Lys Arg Met
195 200 205 195 200 205
Arg Arg Cys Phe Glu Lys Gln Ile Arg Tyr Ala Ser Arg Lys Ala TyrArg Arg Cys Phe Glu Lys Gln Ile Arg Tyr Ala Ser Arg Lys Ala Tyr
210 215 220 210 215 220
Ala Gln Val Arg Pro Arg Val Lys Gly Arg Phe Ala Lys Val Thr GluAla Gln Val Arg Pro Arg Val Lys Gly Arg Phe Ala Lys Val Thr Glu
225 230 235 240225 230 235 240
<210> 4<210> 4
<211> 240<211> 240
<212> PRT<212> PRT
<213> Artificial sequence<213> Artificial sequence
<400> 4<400> 4
Met Ser Ser Gly Pro Ala Ala Cys Gly Val Cys Gly Ala Ala Ala CysMet Ser Ser Gly Pro Ala Ala Cys Gly Val Cys Gly Ala Ala Ala Cys
1 5 10 151 5 10 15
Cys Pro His Leu Leu His Thr Gly Asp Gly Asn Asp Asp Asp Leu IleCys Pro His Leu Leu His Thr Gly Asp Gly Asn Asp Asp Asp Leu Ile
20 25 30 20 25 30
Ser Arg Ala Phe Phe Ser Val Phe Pro Val Val Gly His His Arg ArgSer Arg Ala Phe Phe Ser Val Phe Pro Val Val Gly His His Arg Arg
35 40 45 35 40 45
His Glu Ser Thr Ser Ser Pro Ala Met Gln Gln Pro Ser Gly Cys LeuHis Glu Ser Thr Ser Ser Pro Ala Met Gln Gln Pro Ser Gly Cys Leu
50 55 60 50 55 60
His Glu Phe Gln Phe Phe Gly His Gln Asp Asp His His His Gln GluHis Glu Phe Gln Phe Phe Gly His Gln Asp Asp His His His Gln Glu
65 70 75 8065 70 75 80
Thr Ile Ala Trp Leu Leu Asp His Pro Pro Pro Pro Ala Pro Glu LeuThr Ile Ala Trp Leu Leu Asp His Pro Pro Pro Pro Ala Pro Glu Leu
85 90 95 85 90 95
Gly Gly Asp Asp Gly Pro Ser Leu Ala Gly Asp Glu Asn Asp Asp GlnGly Gly Asp Asp Gly Pro Ser Leu Ala Gly Asp Glu Asn Asp Asp Gln
100 105 110 100 105 110
Pro Ala Phe His Pro Phe Gly Thr Pro Gln Tyr His His Pro Gly LysPro Ala Phe His Pro Phe Gly Thr Pro Gln Tyr His His Pro Gly Lys
115 120 125 115 120 125
Gly Asn Gly Asn Gly Leu Thr Phe Glu Leu Asp Ala Thr Leu Gly LeuGly Asn Gly Asn Gly Leu Thr Phe Glu Leu Asp Ala Thr Leu Gly Leu
130 135 140 130 135 140
Gly Thr Ala Arg Gln Thr Thr Glu Thr Ala Glu Ala Ser Ala Thr IleGly Thr Ala Arg Gln Thr Thr Glu Thr Ala Glu Ala Ser Ala Thr Ile
145 150 155 160145 150 155 160
Met Ser Phe Cys Gly Ser Thr Phe Thr Asp Ala Ala Ser Lys Glu ProMet Ser Phe Cys Gly Ser Thr Phe Thr Asp Ala Ala Ser Lys Glu Pro
165 170 175 165 170 175
Ala Leu Ile Asp Asp Gly Asn Glu Leu Gln Met Pro Val Asp Gln SerAla Leu Ile Asp Asp Gly Asn Glu Leu Gln Met Pro Val Asp Gln Ser
180 185 190 180 185 190
Ser Ser Glu Arg Glu Val Lys Leu Met Arg Tyr Lys Glu Lys Arg MetSer Ser Glu Arg Glu Val Lys Leu Met Arg Tyr Lys Glu Lys Arg Met
195 200 205 195 200 205
Arg Arg Cys Phe Glu Lys Gln Ile Arg Tyr Ala Ser Arg Lys Ala TyrArg Arg Cys Phe Glu Lys Gln Ile Arg Tyr Ala Ser Arg Lys Ala Tyr
210 215 220 210 215 220
Ala Gln Val Arg Pro Arg Val Lys Gly Arg Phe Ala Lys Val Thr GluAla Gln Val Arg Pro Arg Val Lys Gly Arg Phe Ala Lys Val Thr Glu
225 230 235 240225 230 235 240
<210> 5<210> 5
<211> 4170<211> 4170
<212> DNA<212> DNA
<213> Artificial sequence<213> Artificial sequence
<400> 5<400> 5
ttttgaaaac tggcaagaga attgcctgtt atattaaaaa gaaggtgaga gccagcgagg 60ttttgaaaac tggcaagaga attgcctgtt atattaaaaa gaaggtgaga gccagcgagg 60
gctaaaatat acaagtacat cacactaggg aatgagacat taagtatggt gacacaaatg 120gctaaaatat acaagtacat cacactaggg aatgagacat taagtatggt gacacaaatg 120
ttccaaaacc aacccctctc cttccccaaa acctacacca gccttgctaa gtgttttgct 180ttccaaaacc aacccctctc cttccccaaa acctacacca gccttgctaa gtgttttgct 180
cctcctagaa tccacaggcg aatttgctcc ttaatcatcg ctatagtctg tgtcacggtc 240cctcctagaa tccacaggcg aatttgctcc ttaatcatcg ctatagtctg tgtcacggtc 240
ttctctttgt gctcaaagat tcgcgagttt ctctcacacc aaattgtcca agacacgagg 300ttctctttgt gctcaaagat tcgcgagttt ctctcacacc aaattgtcca agacacgagg 300
atgaggaggg ttcgcaatcc cttcacgctt ctcactgcgg atgttgcaag attcgaccac 360atgaggaggg ttcgcaatcc cttcacgctt ctcactgcgg atgttgcaag attcgaccac 360
cattggtgca ccgaatctac aggttgccaa gaagaatggt ggatctccgt aattgcagcc 420cattggtgca ccgaatctac aggttgccaa gaagaatggt ggatctccgt aattgcagcc 420
caaatactca aatcagccca aatatgtctg gtgaagcaac attcagcaaa aagatgtagt 480caaatactca aatcagccca aatatgtctg gtgaagcaac attcagcaaa aagatgtagt 480
ccagattctg ggcatgaggc gcataggaca cagcgcggat tatgaggcca tccccttctt 540ccagattctg ggcatgaggc gcataggaca cagcgcggat tatgaggcca tccccttctt 540
gccaacctgt ctgctgtcca aactctattt tgaacagcaa gccaactgaa gaatttgcac 600gccaacctgt ctgctgtcca aactctattt tgaacagcaa gccaactgaa gaatttgcac 600
tttggtggcg cccaagcttt ccaaattata ctattaaaat tcctgcatgt ggagccataa 660tttggtggcg cccaagcttt ccaaattata ctattaaaat tcctgcatgt ggagccataa 660
aattgcgctt ggtaggctga ctgagttgtg tactggccac tcatagtaaa cttccacttg 720aattgcgctt ggtaggctga ctgagttgtg tactggccac tcatagtaaa cttccacttg 720
ataaggtcgg gggtattgtg cattaagcga agctgattta cctcgcacca taattgacaa 780ataaggtcgg gggtattgtg cattaagcga agctgattta cctcgcacca taattgacaa 780
tattcatgta tgtgcgcggc agaaaaatca ttgtgattaa aatgtaggtc ctgaatccat 840tattcatgta tgtgcgcggc agaaaaatca ttgtgattaa aatgtaggtc ctgaatccat 840
cggttgtttg agagagcctc atggatggat ctatttttgc cgttagaaat ggagtaggtc 900cggttgtttg agagagcctc atggatggat ctatttttgc cgttagaaat ggagtaggtc 900
aaagggaagg tgtctctagg tgtcgtaccc cttagccagg cactttccca gaacaaagaa 960aaagggaagg tgtctctagg tgtcgtaccc cttagccagg cactttccca gaacaaagaa 960
gacttaccat cccccaccct gactgaggtt gctgctgcaa acagtctcct gtccaacata 1020gacttaccat cccccaccct gactgaggtt gctgctgcaa acagtctcct gtccaacata 1020
ttgcaaggta tatctttcgt tatcctctga gtatttaaca gtttggcctc ctgccacaac 1080ttgcaaggta tatctttcgt tatcctctga gtatttaaca gtttggcctc ctgccacaac 1080
catctgaggc gtaaggcccg cgagaaaaaa ccaaggtgca gaatgcctaa accaccgtat 1140catctgaggc gtaaggcccg cgagaaaaaa ccaaggtgca gaatgcctaa accaccgtat 1140
tgttttggtc ttgctgcaca gacccagttg actttgcatt taccgccggt taatctctct 1200tgttttggtc ttgctgcaca gacccagttg actttgcatt taccgccggt taatctctct 1200
gaaccagccc aaagaaattg cttacgtttt gagtcaatga aatccaacac cctttggggc 1260gaaccagccc aaagaaattg cttacgtttt gagtcaatga aatccaacac cctttggggc 1260
ggttttaggg ctaggagcag ataggtgacc tgcgaggtta gcactgcatt gactagcgtg 1320ggttttaggg ctaggagcag ataggtgacc tgcgaggtta gcactgcatt gactagcgtg 1320
aggcggccag ccgccgaaag gtttttcccg ctccatgtac tcagctttgc agagactttg 1380aggcggccag ccgccgaaag gtttttcccg ctccatgtac tcagctttgc agagactttg 1380
tcaatccatg gctggaagtc cacccgtttg agtctgttca tggacaaagg tagtccaagg 1440tcaatccatg gctggaagtc cacccgtttg agtctgttca tggacaaagg tagtccaagg 1440
tatttaaggg ggaaggaagt cctcgaggcc ggtagaccag atagaacgtc agagaggttg 1500tatttaaggg ggaaggaagt cctcgaggcc ggtagaccag atagaacgtc agagaggttg 1500
atgttgttac actggattgg cactactgtg gacttgtgga aatttgtttt taggcctgtg 1560atgttgttac actggattgg cactactgtg gacttgtgga aatttgtttt taggcctgtg 1560
gtttcaccaa aaagttccag aatccttgca agcatagata cctcaccttt tgtgggcgtg 1620gtttcaccaa aaagttccag aatccttgca agcatagata cctcaccttt tgtgggcgtg 1620
acaaatatga ccgcatcatc cgcaaacatc gatatgcgca ggtcgggcga gcgaccatgg 1680acaaatatga ccgcatcatc cgcaaacatc gatatgcgca ggtcgggcga gcgaccatgg 1680
agcttggtga gcattccgaa ttcagttgct acttcgagga gtctttgcag gggatctatt 1740agcttggtga gcattccgaa ttcagttgct acttcgagga gtctttgcag gggatctatt 1740
gcaatgacaa aaagaagagg agataggggg tcgccttgcc ttaggcctcg cccgtgccag 1800gcaatgacaa aaagaagagg agataggggg tcgccttgcc ttaggcctcg cccgtgccag 1800
ataggggggt ttggtacgcc gttaaggatg actcttgagg ttgaggtgga gagaattgcc 1860ataggggggt ttggtacgcc gttaaggatg actcttgagg ttgaggtgga gagaattgcc 1860
gcaatccatt cccgccacct tatagggaag cctaggtgct caaggagggt gaggatatac 1920gcaatccatt cccgccacct tatagggaag cctaggtgct caaggagggt gaggatatac 1920
tcccaacgta tcgaatcaaa ggcttttgct atgtccaatt tgaacaagag tgttggggtc 1980tcccaacgta tcgaatcaaa ggcttttgct atgtccaatt tgaacaagag tgttggggtc 1980
ttgtttgtat gaaaacgacg tgccgcggta cgcactgcca agaagttatc atgtatgctc 2040ttgtttgtat gaaaacgacg tgccgcggta cgcactgcca agaagttatc atgtatgctc 2040
ctgttcttga tgaaggcgct ttgacaagtg gagacgatcg cgttcatatg gggctgaagt 2100ctgttcttga tgaaggcgct ttgacaagtg gagacgatcg cgttcatatg gggctgaagt 2100
cgtaatgcca gtatcttgga aataagcttt atgaaggaat gtattaagct tatcggtcta 2160cgtaatgcca gtatcttgga aataagcttt atgaaggaat gtattaagct tatcggtcta 2160
aaatcaccca cttcttcggc cccttctttt tttgggatta ggattacatt agcggtgttg 2220aaatcaccca cttcttcggc cccttctttt tttgggatta ggattacatt agcggtgttg 2220
atgagtgata agctgccaca acgtagagca tggaaagcgt tggccgccct tatgacatcc 2280atgagtgata agctgccaca acgtagagca tggaaagcgt tggccgccct tatgacatcc 2280
ccttttatta tgctccagca cgtcttaaaa aataacccag tgaaaccatc aggtcccggc 2340ccttttatta tgctccagca cgtcttaaaa aataacccag tgaaaccatc aggtcccggc 2340
gccttgtcaa tgggcaacag atcaatggct cttttgatct cctcctctga gaatggagca 2400gccttgtcaa tgggcaacag atcaatggct cttttgatct cctcctctga gaatggagca 2400
gctaaagaag agagatcatg gtgtcgtaat ccaagcgtcg cccagttgaa gtctattctc 2460gctaaagaag agagatcatg gtgtcgtaat ccaagcgtcg cccagttgaa gtctattctc 2460
ggagcgggtg gacgactcaa catattttca aagtgagatt gtattttcgc ggccttgcat 2520ggagcgggtg gacgactcaa catattttca aagtgagatt gtattttcgc ggccttgcat 2520
tcatgtgttg ttgtcgagcc attttggtct ttgaggcaat gaatgaaatt ttttcgacgc 2580tcatgtgttg ttgtcgagcc attttggtct ttgaggcaat gaatgaaatt ttttcgacgc 2580
ctagaagtga tccttcgatg aaaaaatcta gtgttagcat ccccaaattt tatcaaattt 2640ctagaagtga tccttcgatg aaaaaatcta gtgttagcat ccccaaattt tatcaaattt 2640
agacgtgcag cctgtttttt ccgcgcacgt tcgatgaccg caaggcccag aattcttttc 2700agacgtgcag cctgttttttt ccgcgcacgt tcgatgaccg caaggcccag aattcttttc 2700
ttaagtctgc atcttaggag ctcctctcca ggagagagag ctctcagctc ctgtgctatg 2760ttaagtctgc atcttaggag ctcctctcca ggagagagag ctctcagctc ctgtgctatg 2760
tcaaatctat ggattatctc caaagccata tgaagttgca gcttagcatc cgagatagtg 2820tcaaatctat ggattatctc caaagccata tgaagttgca gcttagcatc cgagatagtg 2820
ttgaagctcc actgtctgag ggcccttgct gtagcctgca gtttgtgata gagtctgtgg 2880ttgaagctcc actgtctgag ggcccttgct gtagcctgca gtttgtgata gagtctgtgg 2880
aaaggctcct ggtgtgtgca gtgagcgcac caggacctcg acaccacttc catgaatcct 2940aaaggctcct ggtgtgtgca gtgagcgcac caggacctcg acaccacttc catgaatcct 2940
gggagcatgg cccaaaaatt ctcaaattta aaagagcgcg ggcgtcgggg gccagtttgg 3000gggagcatgg cccaaaaatt ctcaaattta aaagagcgcg ggcgtcgggg gccagttttgg 3000
ttagagagca gcagtggaca gtgatccgag agcgaagaag ataggccgtg cagcacgtgg 3060ttagagagca gcagtggaca gtgatccgag agcgaagaag ataggccgtg cagcacgtgg 3060
ctatgaaaag cttggtccca ttcagcattt gcaaagaccc tgtcaagctt aataagggtt 3120ctatgaaaag cttggtccca ttcagcattt gcaaagaccc tgtcaagctt aataagggtt 3120
gggtttgtac gctcgttgct ccaagtgaat cgtctatttt gcaggttaat ttccttcagg 3180gggtttgtac gctcgttgct ccaagtgaat cgtctatttt gcaggttaat ttccttcagg 3180
tcacaacagt ctagcatatc actgaaacgg ctcattaggc taaggttcag acgcctctta 3240tcacaacagt ctagcatatc actgaaacgg ctcattaggc taaggttcag acgcctctta 3240
ttcttgtcac tagctttgta aattagatta aaatctccca aaagcagcca cttaattccg 3300ttcttgtcac tagctttgta aattagatta aaatctccca aaagcagcca cttaattccg 3300
gattgcggtt tcagatcttg aatttcttgg agaaaggctg tcttcatgct attgcttgta 3360gattgcggtt tcagatcttg aatttcttgg agaaaggctg tcttcatgct attgcttgta 3360
ggcccataaa ccacagttaa taggaaggca gtatgggacg ctgttagctt ggcctttcca 3420ggcccataaa ccacagttaa taggaaggca gtatgggacg ctgttagctt ggcctttcca 3420
gaaatatgga aatcaccaac tgtaaaatca gtcagctcca catggttggt atcccatagc 3480gaaatatgga aatcaccaac tgtaaaatca gtcagctcca catggttggt atcccatagc 3480
aatgcaatcc ctccccgtgt gccgctgggg cctccagccg gtttgcagaa gaatttgtct 3540aatgcaatcc ctccccgtgt gccgctgggg cctccagccg gtttgcagaa gaatttgtct 3540
aggtggtggc ctccaaggtg ccaggctgtt gtttggtcga aggaagacaa tttagtttct 3600aggtggtggc ctccaaggtg ccaggctgtt gtttggtcga aggaagacaa tttagtttct 3600
tgcaaacaag ctaagtgaca cctcgatgaa gtaattgtct ctctgaccgt gtccttccga 3660tgcaaacaag ctaagtgaca cctcgatgaa gtaattgtct ctctgaccgt gtccttccga 3660
gcctgggaat tgagacccct cacattccag caaaaaacct taaggtctaa gtctgtcatt 3720gcctgggaat tgagacccct cacattccag caaaaaacct taaggtctaa gtctgtcatt 3720
gggaaaaaaa ggaggtaccc cttgaatcat agtttcctgc tctctcagca cagtgacaag 3780gggaaaaaaa ggaggtaccc cttgaatcat agtttcctgc tctctcagca cagtgacaag 3780
acagtgtgca tggaaaccac atatctggtg aatgaggtgc gtcgagacat tgatggtgca 3840acagtgtgca tggaaaccac atatctggtg aatgaggtgc gtcgagacat tgatggtgca 3840
gggaatgcaa aaggaccata caagaggtgt tcacgccata catcaaacac atgcagtctg 3900gggaatgcaa aaggaccata caagaggtgt tcacgccata catcaaacac atgcagtctg 3900
aagcctccgc acacaagcga ggcaggaatt caaagcctaa agttaacatg atagaagcaa 3960aagcctccgc acacaagcga ggcaggaatt caaagcctaa agttaacatg atagaagcaa 3960
aagaccctta agcacagatc actggctcat gctgctgact gcagctcctc aacgcgcaga 4020aagaccctta agcacagatc actggctcat gctgctgact gcagctcctc aacgcgcaga 4020
ctcctgatgc catcttgcaa gtctgcaact ccgtccccca tcatcgcaat catagcatcc 4080ctcctgatgc catcttgcaa gtctgcaact ccgtccccca tcatcgcaat catagcatcc 4080
tcgaagctct tctctgtcga cccttcgagg ttgaaggcgg acgttagggc tgcaagtgtc 4140tcgaagctct tctctgtcga cccttcgagg ttgaaggcgg acgttagggc tgcaagtgtc 4140
tcctgtggca gagtccccgt ggtcgtgtac 4170tcctgtggca gagtccccgt ggtcgtgtac 4170
<210> 6<210> 6
<211> 24<211> 24
<212> DNA<212> DNA
<213> Artificial sequence<213> Artificial sequence
<400> 6<400> 6
cccttgaatc atagtttcct gctc 24cccttgaatc atagtttcct gctc 24
<210> 7<210> 7
<211> 23<211> 23
<212> DNA<212> DNA
<213> Artificial sequence<213> Artificial sequence
<400> 7<400> 7
tgaatgtgct cccagagaat gac 23tgaatgtgct cccagagaat gac 23
<210> 8<210> 8
<211> 25<211> 25
<212> DNA<212> DNA
<213> Artificial sequence<213> Artificial sequence
<400> 8<400> 8
attgaagata aaagggagaa agcag 25attgaagata aaagggagaa agcag 25
<210> 9<210> 9
<211> 22<211> 22
<212> DNA<212> DNA
<213> Artificial sequence<213> Artificial sequence
<400> 9<400> 9
gtgaatgtgc tcccagagaa tg 22gtgaatgtgc tcccagagaa tg 22
<210> 10<210> 10
<211> 24<211> 24
<212> DNA<212> DNA
<213> Artificial sequence<213> Artificial sequence
<400> 10<400> 10
atacaagagg tgttcacgcc atac 24atacaagagg tgttcacgcc atac 24
<210> 11<210> 11
<211> 21<211> 21
<212> DNA<212> DNA
<213> Artificial sequence<213> Artificial sequence
<400> 11<400> 11
caggcttcgt cattcggtta c 21caggcttcgt cattcggtta
<210> 12<210> 12
<211> 24<211> 24
<212> DNA<212> DNA
<213> Artificial sequence<213> Artificial sequence
<400> 12<400> 12
gatgtggaaa gaataaaaac gacg 24gatgtggaaa gaataaaaac gacg 24
<210> 13<210> 13
<211> 25<211> 25
<212> DNA<212> DNA
<213> Artificial sequence<213> Artificial sequence
<400> 13<400> 13
gtacctcatc aacttaacct ccctc 25gtacctcatc aacttaacct ccctc 25
<210> 14<210> 14
<211> 6730<211> 6730
<212> DNA<212> DNA
<213> Artificial sequence<213> Artificial sequence
<400> 14<400> 14
atgtcgtcgg ggccagcagc atgcggtgtg tgcggcgcgg ccgcctgctg cccgcacctc 60atgtcgtcgg ggccagcagc atgcggtgtg tgcggcgcgg ccgcctgctg cccgcacctc 60
ttgcacaccg gtgacggcaa cgacgacgac ctcatcagcc gggccttctt ctccgtcttc 120ttgcacaccg gtgacggcaa cgacgacgac ctcatcagcc gggccttctt ctccgtcttc 120
cctgtcgtcg gtcatcaccg tcgtcatgag tccaccagca gccccgccat gcagcagcca 180cctgtcgtcg gtcatcaccg tcgtcatgag tccaccagca gccccgccat gcagcagcca 180
tcggggtgcc tgcacgagtt ccagttcttt ggccaccagg acgaccacca ccaccaagaa 240tcggggtgcc tgcacgagtt ccagttcttt ggccaccagg acgaccacca ccaccaagaa 240
accatcgcct ggctcttgga ccacccaccg ccacctgcgc ccgagcttgg cggcgacgac 300accatcgcct ggctcttgga ccacccaccg ccacctgcgc ccgagcttgg cggcgacgac 300
ggcccgtccc tagctggtga tgagaacgac gaccagcctg cgtttcaccc gtttgggaca 360ggcccgtccc tagctggtga tgagaacgac gaccagcctg cgtttcaccc gtttgggaca 360
ccacagtacc accaccccgg aaaagggaac gggaacgggc tcacctttga gctggacgcc 420ccacagtacc accaccccgg aaaagggaac gggaacgggc tcacctttga gctggacgcc 420
acgctgggcc tcggcaccgc gcggcaaacc actgagacag cagaagcaag cgccaccatc 480acgctgggcc tcggcaccgc gcggcaaacc actgagacag cagaagcaag cgccaccatc 480
gtaagtattg ctcccgaatt atcttaagta agttcagata attcacatgc atggtttcta 540gtaagtattg ctcccgaatt atcttaagta agttcagata attcacatgc atggtttcta 540
attggaattt ggtcccaagc tggacaccct ttttttatct tccgttttct caactctctt 600attggaattt ggtcccaagc tggacaccct ttttttatct tccgttttct caactctctt 600
atcgatcacc tgcataaagg acctttgtat caagtaccaa gagatcttgc catgagttgc 660atcgatcacc tgcataaagg acctttgtat caagtaccaa gagatcttgc catgagttgc 660
actttacgca catttttttt ttcttttttt tttcaggaac gtactactct tcctatatat 720actttacgca cattttttttt ttcttttttt tttcaggaac gtactactct tcctatatat 720
caatatatgt aaacaagatt aacatgcatg tttctaacct ttctcaaaga caaaagacac 780caatatatgt aaacaagatt aacatgcatg tttctaacct ttctcaaaga caaaagacac 780
tctggtgcac gaaatggatg gaagaaacca gatcattaat atatgcctca caacctcttc 840tctggtgcac gaaatggatg gaagaaacca gatcattaat atatgcctca caacctcttc 840
atgaatttaa tttgatgtgg aaagaataaa aacgacggtt ccggttatta acccaatatt 900atgaatttaa tttgatgtgg aaagaataaa aacgacggtt ccggttatta acccaatatt 900
caatgatatc ctgaacaaaa ctagctatag atctcaatca tagcatcagg catcagcgct 960caatgatatc ctgaacaaaa ctagctatag atctcaatca tagcatcagg catcagcgct 960
tccaaagttc tcacctgact ttttttttac tcaatctcca gattatattt ccttcctaca 1020tccaaagttc tcacctgact ttttttttac tcaatctcca gattatattt ccttcctaca 1020
aagagtcgga gagaacatag ccatgagtta aatcactgat gttgtaaata cagaccagta 1080aagagtcgga gagaacatag ccatgagtta aatcactgat gttgtaaata cagaccagta 1080
gtcaaaagca ttgactatac actaaaacta ttgttcaagg tcactatttc acaaaaaaat 1140gtcaaaagca ttgactatac actaaaacta ttgttcaagg tcactatttc acaaaaaaat 1140
tcattgccta tttgatagtt tgattgagag gtagcaatat tgccaaattt atttttcacg 1200tcattgccta tttgatagtt tgattgagag gtagcaatat tgccaaattt atttttcacg 1200
tacctagaca aaagtcggta gcaatattgc caattttatt gctccatcgt catatgcatc 1260tacctagaca aaagtcggta gcaatattgc caattttatt gctccatcgt catatgcatc 1260
ccgaagtcta ttattgctgt aatgacaaga tacagatctt ttatattgtg atatacttac 1320ccgaagtcta ttattgctgt aatgacaaga tacagatctt ttatattgtg atatacttac 1320
ttaagtttta tattgaagat aaaagggaga aagcagcttg cctccctttc tttttcttca 1380ttaagtttta tattgaagat aaaagggaga aagcagcttg cctccctttc tttttcttca 1380
ccactatata tattggattg tttcttcacc actatatata caagaaaata ttaatatctg 1440ccactatata tattggattg tttcttcacc actatatata caagaaaata ttaatatctg 1440
cagtacatat ttagtgtcat taaatatgtc ttttgaaact attttcataa taaacatatt 1500cagtacatat ttagtgtcat taaatatgtc ttttgaaact attttcataa taaacatatt 1500
tgaagataca tgtattgcaa atatttttta cgaatctaat caaatatgag aaattttgac 1560tgaagataca tgtattgcaa atatttttta cgaatctaat caaatatgag aaattttgac 1560
tgacatgtat gaccatacta tcaattattt taggacagag gcatggagtg cgcatttgta 1620tgacatgtat gaccatacta tcaattattt taggacagag gcatggagtg cgcatttgta 1620
tggtcgaaat cgatcaattg taaccatata tgcatgtacg tttggtacgc ccactgatgt 1680tggtcgaaat cgatcaattg taaccatata tgcatgtacg tttggtacgc ccactgatgt 1680
atctacctgg ttaattaatt agatgaccta gcttgtcgtc tgattgttat gattaaagaa 1740atctacctgg ttaattaatt agatgaccta gcttgtcgtc tgattgttat gattaaagaa 1740
ccaaaaagtc tactcagctc aaaacccaaa tatatatgtg tcaaacaact cccatgcaca 1800ccaaaaagtc tactcagctc aaaacccaaa tatatatgtg tcaaacaact cccatgcaca 1800
tgtccagctg tgtctaaatc tatcccgaag gattgtccat gccaaagttt gatgaaatag 1860tgtccagctg tgtctaaatc tatcccgaag gattgtccat gccaaagttt gatgaaatag 1860
ataataagtt gtctcatttt atgtggttcg tttttgcaga tttgctgctt actctttcgt 1920ataataagtt gtctcatttt atgtggttcg tttttgcaga tttgctgctt actctttcgt 1920
atacttggat tttataggga actaatatat acatatgatt ataattaatg cactttattc 1980atacttggat tttataggga actaatatat acatatgatt ataattaatg cactttattc 1980
cgtgccacat gtagatgaat aacgcaatca catggcttaa gatctaatat tctaccccaa 2040cgtgccacat gtagatgaat aacgcaatca catggcttaa gatctaatat tctaccccaa 2040
aacaaatcga gctaccaagg cgatatctga tgttcatcag gcatgcatgt aggcccattc 2100aacaaatcga gctaccaagg cgatatctga tgttcatcag gcatgcatgt aggcccattc 2100
agcatatcaa gcaaagtaca gattcttatc caaaccatgc atatacatat gaccaaagta 2160agcatatcaa gcaaagtaca gattcttatc caaaccatgc atatacatat gaccaaagta 2160
ctaattaatt agttgcctgc agttattagc tgtccaaaat ttgctttgat catcatgcaa 2220ctaattaatt agttgcctgc agttattagc tgtccaaaat ttgctttgat catcatgcaa 2220
taatatacac atgcagaaac taaaatgaat aacatatata aatccatgca tgcacatgca 2280taatatacac atgcagaaac taaaatgaat aacatatata aatccatgca tgcacatgca 2280
gcatactttt tttttgaaaa ctggcaagag aattgcctgt tatattaaaa agaaggtgag 2340gcatactttt tttttgaaaa ctggcaagag aattgcctgt tatattaaaa agaaggtgag 2340
agccagcgag ggctaaaata tacaagtaca tcacactagg gaatgagaca ttaagtatgg 2400agccagcgag ggctaaaata tacaagtaca tcacactagg gaatgagaca ttaagtatgg 2400
tgacacaaat gttccaaaac caacccctct ccttccccaa aacctacacc agccttgcta 2460tgacacaaat gttccaaaac caacccctct ccttccccaa aacctacacc agccttgcta 2460
agtgttttgc tcctcctaga atccacaggc gaatttgctc cttaatcatc gctatagtct 2520agtgttttgc tcctcctaga atccacaggc gaatttgctc cttaatcatc gctatagtct 2520
gtgtcacggt cttctctttg tgctcaaaga ttcgcgagtt tctctcacac caaattgtcc 2580gtgtcacggt cttctctttg tgctcaaaga ttcgcgagtt tctctcacac caaattgtcc 2580
aagacacgag gatgaggagg gttcgcaatc ccttcacgct tctcactgcg gatgttgcaa 2640aagacacgag gatgaggagg gttcgcaatc ccttcacgct tctcactgcg gatgttgcaa 2640
gattcgacca ccattggtgc accgaatcta caggttgcca agaagaatgg tggatctccg 2700gattcgacca ccattggtgc accgaatcta caggttgcca agaagaatgg tggatctccg 2700
taattgcagc ccaaatactc aaatcagccc aaatatgtct ggtgaagcaa cattcagcaa 2760taattgcagc ccaaatactc aaatcagccc aaatatgtct ggtgaagcaa cattcagcaa 2760
aaagatgtag tccagattct gggcatgagg cgcataggac acagcgcgga ttatgaggcc 2820aaagatgtag tccagattct gggcatgagg cgcataggac acagcgcgga ttatgaggcc 2820
atccccttct tgccaacctg tctgctgtcc aaactctatt ttgaacagca agccaactga 2880atccccttct tgccaacctg tctgctgtcc aaactctatt ttgaacagca agccaactga 2880
agaatttgca ctttggtggc gcccaagctt tccaaattat actattaaaa ttcctgcatg 2940agaatttgca ctttggtggc gcccaagctt tccaaattat actattaaaa ttcctgcatg 2940
tggagccata aaattgcgct tggtaggctg actgagttgt gtactggcca ctcatagtaa 3000tggagccata aaattgcgct tggtaggctg actgagttgt gtactggcca ctcatagtaa 3000
acttccactt gataaggtcg ggggtattgt gcattaagcg aagctgattt acctcgcacc 3060acttccactt gataaggtcg ggggtattgt gcattaagcg aagctgattt acctcgcacc 3060
ataattgaca atattcatgt atgtgcgcgg cagaaaaatc attgtgatta aaatgtaggt 3120ataattgaca atattcatgt atgtgcgcgg cagaaaaatc attgtgatta aaatgtaggt 3120
cctgaatcca tcggttgttt gagagagcct catggatgga tctatttttg ccgttagaaa 3180cctgaatcca tcggttgttt gagagagcct catggatgga tctatttttg ccgttagaaa 3180
tggagtaggt caaagggaag gtgtctctag gtgtcgtacc ccttagccag gcactttccc 3240tggagtaggt caaagggaag gtgtctctag gtgtcgtacc ccttagccag gcactttccc 3240
agaacaaaga agacttacca tcccccaccc tgactgaggt tgctgctgca aacagtctcc 3300agaacaaaga agacttacca tcccccaccc tgactgaggt tgctgctgca aacagtctcc 3300
tgtccaacat attgcaaggt atatctttcg ttatcctctg agtatttaac agtttggcct 3360tgtccaacat attgcaaggt atatctttcg ttatcctctg agtatttaac agtttggcct 3360
cctgccacaa ccatctgagg cgtaaggccc gcgagaaaaa accaaggtgc agaatgccta 3420cctgccacaa ccatctgagg cgtaaggccc gcgagaaaaa accaaggtgc agaatgccta 3420
aaccaccgta ttgttttggt cttgctgcac agacccagtt gactttgcat ttaccgccgg 3480aaccaccgta ttgttttggt cttgctgcac agacccagtt gactttgcat ttaccgccgg 3480
ttaatctctc tgaaccagcc caaagaaatt gcttacgttt tgagtcaatg aaatccaaca 3540ttaatctctc tgaaccagcc caaagaaatt gcttacgttt tgagtcaatg aaatccaaca 3540
ccctttgggg cggttttagg gctaggagca gataggtgac ctgcgaggtt agcactgcat 3600ccctttgggg cggttttagg gctaggagca gataggtgac ctgcgaggtt agcactgcat 3600
tgactagcgt gaggcggcca gccgccgaaa ggtttttccc gctccatgta ctcagctttg 3660tgactagcgt gaggcggcca gccgccgaaa ggttttttccc gctccatgta ctcagctttg 3660
cagagacttt gtcaatccat ggctggaagt ccacccgttt gagtctgttc atggacaaag 3720cagagacttt gtcaatccat ggctggaagt ccacccgttt gagtctgttc atggacaaag 3720
gtagtccaag gtatttaagg gggaaggaag tcctcgaggc cggtagacca gatagaacgt 3780gtagtccaag gtatttaagg gggaaggaag tcctcgaggc cggtagacca gatagaacgt 3780
cagagaggtt gatgttgtta cactggattg gcactactgt ggacttgtgg aaatttgttt 3840cagagaggtt gatgttgtta cactggattg gcactactgt ggacttgtgg aaatttgttt 3840
ttaggcctgt ggtttcacca aaaagttcca gaatccttgc aagcatagat acctcacctt 3900ttaggcctgt ggtttcacca aaaagttcca gaatccttgc aagcatagat acctcacctt 3900
ttgtgggcgt gacaaatatg accgcatcat ccgcaaacat cgatatgcgc aggtcgggcg 3960ttgtgggcgt gacaaatatg accgcatcat ccgcaaacat cgatatgcgc aggtcgggcg 3960
agcgaccatg gagcttggtg agcattccga attcagttgc tacttcgagg agtctttgca 4020agcgaccatg gagcttggtg agcattccga attcagttgc tacttcgagg agtctttgca 4020
ggggatctat tgcaatgaca aaaagaagag gagatagggg gtcgccttgc cttaggcctc 4080ggggatctat tgcaatgaca aaaagaagag gagataggggg gtcgccttgc cttaggcctc 4080
gcccgtgcca gatagggggg tttggtacgc cgttaaggat gactcttgag gttgaggtgg 4140gcccgtgcca gatagggggg tttggtacgc cgttaaggat gactcttgag gttgaggtgg 4140
agagaattgc cgcaatccat tcccgccacc ttatagggaa gcctaggtgc tcaaggaggg 4200agagaattgc cgcaatccat tcccgccacc ttatagggaa gcctaggtgc tcaaggaggg 4200
tgaggatata ctcccaacgt atcgaatcaa aggcttttgc tatgtccaat ttgaacaaga 4260tgaggatata ctcccaacgt atcgaatcaa aggcttttgc tatgtccaat ttgaacaaga 4260
gtgttggggt cttgtttgta tgaaaacgac gtgccgcggt acgcactgcc aagaagttat 4320gtgttggggt cttgtttgta tgaaaacgac gtgccgcggt acgcactgcc aagaagttat 4320
catgtatgct cctgttcttg atgaaggcgc tttgacaagt ggagacgatc gcgttcatat 4380catgtatgct cctgttcttg atgaaggcgc tttgacaagt ggagacgatc gcgttcatat 4380
ggggctgaag tcgtaatgcc agtatcttgg aaataagctt tatgaaggaa tgtattaagc 4440ggggctgaag tcgtaatgcc agtatcttgg aaataagctt tatgaaggaa tgtattaagc 4440
ttatcggtct aaaatcaccc acttcttcgg ccccttcttt ttttgggatt aggattacat 4500ttatcggtct aaaatcaccc acttcttcgg ccccttcttt ttttgggatt aggattacat 4500
tagcggtgtt gatgagtgat aagctgccac aacgtagagc atggaaagcg ttggccgccc 4560tagcggtgtt gatgagtgat aagctgccac aacgtagagc atggaaagcg ttggccgccc 4560
ttatgacatc cccttttatt atgctccagc acgtcttaaa aaataaccca gtgaaaccat 4620ttatgacatc cccttttatt atgctccagc acgtcttaaa aaataaccca gtgaaaccat 4620
caggtcccgg cgccttgtca atgggcaaca gatcaatggc tcttttgatc tcctcctctg 4680caggtcccgg cgccttgtca atgggcaaca gatcaatggc tcttttgatc tcctcctctg 4680
agaatggagc agctaaagaa gagagatcat ggtgtcgtaa tccaagcgtc gcccagttga 4740agaatggagc agctaaagaa gagagatcat ggtgtcgtaa tccaagcgtc gcccagttga 4740
agtctattct cggagcgggt ggacgactca acatattttc aaagtgagat tgtattttcg 4800agtctattct cggagcgggt ggacgactca acatattttc aaagtgagat tgtattttcg 4800
cggccttgca ttcatgtgtt gttgtcgagc cattttggtc tttgaggcaa tgaatgaaat 4860cggccttgca ttcatgtgtt gttgtcgagc cattttggtc tttgaggcaa tgaatgaaat 4860
tttttcgacg cctagaagtg atccttcgat gaaaaaatct agtgttagca tccccaaatt 4920tttttcgacg cctagaagtg atccttcgat gaaaaaatct agtgttagca tccccaaatt 4920
ttatcaaatt tagacgtgca gcctgttttt tccgcgcacg ttcgatgacc gcaaggccca 4980ttatcaaatt tagacgtgca gcctgttttt tccgcgcacg ttcgatgacc gcaaggccca 4980
gaattctttt cttaagtctg catcttagga gctcctctcc aggagagaga gctctcagct 5040gaattctttt cttaagtctg catcttagga gctcctctcc aggagagaga gctctcagct 5040
cctgtgctat gtcaaatcta tggattatct ccaaagccat atgaagttgc agcttagcat 5100cctgtgctat gtcaaatcta tggattatct ccaaagccat atgaagttgc agcttagcat 5100
ccgagatagt gttgaagctc cactgtctga gggcccttgc tgtagcctgc agtttgtgat 5160ccgagatagt gttgaagctc cactgtctga gggcccttgc tgtagcctgc agtttgtgat 5160
agagtctgtg gaaaggctcc tggtgtgtgc agtgagcgca ccaggacctc gacaccactt 5220agagtctgtg gaaaggctcc tggtgtgtgc agtgagcgca ccaggacctc gacaccactt 5220
ccatgaatcc tgggagcatg gcccaaaaat tctcaaattt aaaagagcgc gggcgtcggg 5280ccatgaatcc tgggagcatg gcccaaaaat tctcaaattt aaaagagcgc gggcgtcggg 5280
ggccagtttg gttagagagc agcagtggac agtgatccga gagcgaagaa gataggccgt 5340ggccagtttg gttagagagc agcagtggac agtgatccga gagcgaagaa gataggccgt 5340
gcagcacgtg gctatgaaaa gcttggtccc attcagcatt tgcaaagacc ctgtcaagct 5400gcagcacgtg gctatgaaaa gcttggtccc attcagcatt tgcaaagacc ctgtcaagct 5400
taataagggt tgggtttgta cgctcgttgc tccaagtgaa tcgtctattt tgcaggttaa 5460taataagggt tgggtttgta cgctcgttgc tccaagtgaa tcgtctattt tgcaggttaa 5460
tttccttcag gtcacaacag tctagcatat cactgaaacg gctcattagg ctaaggttca 5520tttccttcag gtcacaacag tctagcatat cactgaaacg gctcattagg ctaaggttca 5520
gacgcctctt attcttgtca ctagctttgt aaattagatt aaaatctccc aaaagcagcc 5580gacgcctctt attcttgtca ctagctttgt aaattagatt aaaatctccc aaaagcagcc 5580
acttaattcc ggattgcggt ttcagatctt gaatttcttg gagaaaggct gtcttcatgc 5640acttaattcc ggattgcggt ttcagatctt gaatttcttg gagaaaggct gtcttcatgc 5640
tattgcttgt aggcccataa accacagtta ataggaaggc agtatgggac gctgttagct 5700tattgcttgt aggcccataa accacagtta ataggaaggc agtatgggac gctgttagct 5700
tggcctttcc agaaatatgg aaatcaccaa ctgtaaaatc agtcagctcc acatggttgg 5760tggcctttcc agaaatatgg aaatcaccaa ctgtaaaatc agtcagctcc acatggttgg 5760
tatcccatag caatgcaatc cctccccgtg tgccgctggg gcctccagcc ggtttgcaga 5820tatcccatag caatgcaatc cctccccgtg tgccgctggg gcctccagcc ggtttgcaga 5820
agaatttgtc taggtggtgg cctccaaggt gccaggctgt tgtttggtcg aaggaagaca 5880agaatttgtc taggtggtgg cctccaaggt gccaggctgt tgtttggtcg aaggaagaca 5880
atttagtttc ttgcaaacaa gctaagtgac acctcgatga agtaattgtc tctctgaccg 5940atttagtttc ttgcaaacaa gctaagtgac acctcgatga agtaattgtc tctctgaccg 5940
tgtccttccg agcctgggaa ttgagacccc tcacattcca gcaaaaaacc ttaaggtcta 6000tgtccttccg agcctgggaa ttgagacccc tcacattcca gcaaaaaacc ttaaggtcta 6000
agtctgtcat tgggaaaaaa aggaggtacc ccttgaatca tagtttcctg ctctctcagc 6060agtctgtcat tgggaaaaaa aggaggtacc ccttgaatca tagtttcctg ctctctcagc 6060
acagtgacaa gacagtgtgc atggaaacca catatctggt gaatgaggtg cgtcgagaca 6120acagtgacaa gacagtgtgc atggaaacca catatctggt gaatgaggtg cgtcgagaca 6120
ttgatggtgc agggaatgca aaaggaccat acaagaggtg ttcacgccat acatcaaaca 6180ttgatggtgc agggaatgca aaaggaccat acaagaggtg ttcacgccat acatcaaaca 6180
catgcagtct gaagcctccg cacacaagcg aggcaggaat tcaaagccta aagttaacat 6240catgcagtct gaagcctccg cacacaagcg aggcaggaat tcaaagccta aagttaacat 6240
gatagaagca aaagaccctt aagcacagat cactggctca tgctgctgac tgcagctcct 6300gatagaagca aaagaccctt aagcacagat cactggctca tgctgctgac tgcagctcct 6300
caacgcgcag actcctgatg ccatcttgca agtctgcaac tccgtccccc atcatcgcaa 6360caacgcgcag actcctgatg ccatcttgca agtctgcaac tccgtccccc atcatcgcaa 6360
tcatagcatc ctcgaagctc ttctctgtcg acccttcgag gttgaaggcg gacgttaggg 6420tcatagcatc ctcgaagctc ttctctgtcg acccttcgag gttgaaggcg gacgttaggg 6420
ctgcaagtgt ctcctgtggc agagtccccg tggtcgtgta ctaattattg ctattaatta 6480ctgcaagtgt ctcctgtggc agagtccccg tggtcgtgta ctaattattg ctattaatta 6480
attgcagatg tcattctctg ggagcacatt cacggacgct gcaagcaagg agccagcact 6540attgcagatg tcattctctg ggagcacatt cacggacgct gcaagcaagg agccagcact 6540
gatcgacgac ggcaatgagc tgcaaatgcc ggtagatcag tcgtcgacgg agagggaggt 6600gatcgacgac ggcaatgagc tgcaaatgcc ggtagatcag tcgtcgacgg agagggaggt 6600
taagttgatg aggtacaagg agaagaggat gaggaggtgc tttgagaagc agataagata 6660taagttgatg aggtacaagg agaagaggat gaggaggtgc tttgagaagc agataagata 6660
tgcatccagg aaagcctatg cgcaggtgag acccagggtg aaaggccgct ttgccaaggt 6720tgcatccagg aaagcctatg cgcaggtgag acccagggtg aaaggccgct ttgccaaggt 6720
aaccgaatga 6730aaccgaatga 6730
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110355321.3A CN113088523B (en) | 2021-04-01 | 2021-04-01 | A kind of transposon and its application |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110355321.3A CN113088523B (en) | 2021-04-01 | 2021-04-01 | A kind of transposon and its application |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113088523A CN113088523A (en) | 2021-07-09 |
CN113088523B true CN113088523B (en) | 2022-04-26 |
Family
ID=76672557
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110355321.3A Active CN113088523B (en) | 2021-04-01 | 2021-04-01 | A kind of transposon and its application |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113088523B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115992289B (en) * | 2022-12-05 | 2025-07-04 | 北大荒垦丰种业股份有限公司 | Development and application of KASP markers related to maize stalk rot resistance genes |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1510574A2 (en) * | 2003-08-29 | 2005-03-02 | Tottori University | Transposon-like element in rye |
WO2015084969A1 (en) * | 2013-12-03 | 2015-06-11 | Iowa State University Research Foundation, Inc. | Plants with improved drought tolerance |
CN110669860A (en) * | 2019-10-24 | 2020-01-10 | 中国农业大学 | A method for detecting maize stalk strength and its specific transposon |
CN111320679A (en) * | 2020-02-14 | 2020-06-23 | 华南农业大学 | ZmPHYCs mutant protein related to maize flowering stage, its encoding gene, recombinant vector and application |
-
2021
- 2021-04-01 CN CN202110355321.3A patent/CN113088523B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1510574A2 (en) * | 2003-08-29 | 2005-03-02 | Tottori University | Transposon-like element in rye |
WO2015084969A1 (en) * | 2013-12-03 | 2015-06-11 | Iowa State University Research Foundation, Inc. | Plants with improved drought tolerance |
CN110669860A (en) * | 2019-10-24 | 2020-01-10 | 中国农业大学 | A method for detecting maize stalk strength and its specific transposon |
CN111320679A (en) * | 2020-02-14 | 2020-06-23 | 华南农业大学 | ZmPHYCs mutant protein related to maize flowering stage, its encoding gene, recombinant vector and application |
Non-Patent Citations (3)
Title |
---|
Opposite response of maize ZmCCT to photoperiod due to transposon jumping;Shuyang Zhong等;《Theoretical and Applied Genetics》;20210520;第134卷;2841-2855 * |
Useful parasites: the evolutionary biology and biotechnology applications of transposable elements;GEORGI N. BONCHEV;《Journal of Genetics》;20161115;第95卷;1039-1052 * |
玉米ZmCOL3_(pro217)启动子的克隆及功能分析;果天宇等;《玉米科学》;20200415(第02期);58-64 * |
Also Published As
Publication number | Publication date |
---|---|
CN113088523A (en) | 2021-07-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR101388281B1 (en) | Disease resistent cucumber plants | |
US20150089685A1 (en) | Novel maize plant | |
CN109234431B (en) | Molecular marker of corn stalk rot resistance QTL and application thereof | |
CN105624154A (en) | Molecular marker of corn northern leaf blight resistant QTL and application thereof | |
US12022788B2 (en) | Prolific flowering watermelon | |
CN113088523B (en) | A kind of transposon and its application | |
RU2560599C2 (en) | Corn plants characterised by quantitative trait loci qtl | |
CN110527741A (en) | A kind of molecular labeling, primer and application with american pumpkin mildew-resistance biological strain 2F gene close linkage | |
CN108018290B (en) | Plant Anthocyanin Synthesis Control Gene and Its Application | |
CN104928299A (en) | Corynespora cassiicola anti-disease gene Cca as well as encoding protein and application thereof | |
CA2674243A1 (en) | Genetic markers for orobanche resistance in sunflower | |
CN118785808A (en) | Markers associated with spontaneous chromosome doubling | |
US9702015B2 (en) | Molecular markers associated with Mal de Rio Cuarto Virus in maize | |
CN115820895B (en) | Molecular markers tightly linked to chlorophyll content in maize and their application | |
CN109797234A (en) | With the molecular labeling R060939-2 of resistance gene of rice blast Pi2 close linkage | |
Adhikari et al. | Marker assisted sex determination in dioecious crops: An advancement in molecular biology | |
US20240389531A1 (en) | Peronospora resistant spinach | |
CN119563040A (en) | Methods and compositions for selecting soybean plants having favorable allele combinations for shoot apex and maturity | |
JP2005229847A (en) | Genetic markers linked to loci involved in panicle length and their use | |
CN115896323A (en) | Molecular marker closely linked with germination capacity of corn seeds and application thereof | |
CN117385074A (en) | SNP molecular marker for identifying resistance of cucumber to bacterial stem soft rot and application thereof | |
CN119053242A (en) | Novel genetic loci associated with disease resistance in soybean | |
CN108866235A (en) | A kind of InDel molecular labeling and its application for identifying or assisting to identify Chinese cabbage crossing compatibility | |
JP2005229845A (en) | Genetic markers linked to genetic loci involved in guards and their use |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |