CN112079903B - Mutant of mismatching binding protein and coding gene thereof - Google Patents
Mutant of mismatching binding protein and coding gene thereof Download PDFInfo
- Publication number
- CN112079903B CN112079903B CN201910518166.5A CN201910518166A CN112079903B CN 112079903 B CN112079903 B CN 112079903B CN 201910518166 A CN201910518166 A CN 201910518166A CN 112079903 B CN112079903 B CN 112079903B
- Authority
- CN
- China
- Prior art keywords
- leu
- ala
- glu
- arg
- gly
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 43
- 102000014914 Carrier Proteins Human genes 0.000 title 1
- 108091008324 binding proteins Proteins 0.000 title 1
- 230000035772 mutation Effects 0.000 claims abstract description 65
- 101710099953 DNA mismatch repair protein msh3 Proteins 0.000 claims abstract description 62
- 235000018417 cysteine Nutrition 0.000 claims abstract description 31
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 claims abstract description 31
- 238000000034 method Methods 0.000 claims abstract description 20
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 claims abstract description 18
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 claims abstract description 14
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims abstract description 13
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 claims abstract description 13
- 235000013922 glutamic acid Nutrition 0.000 claims abstract description 12
- 239000004220 glutamic acid Substances 0.000 claims abstract description 12
- 125000001909 leucine group Chemical group [H]N(*)C(C(*)=O)C([H])([H])C(C([H])([H])[H])C([H])([H])[H] 0.000 claims abstract description 12
- 229930182817 methionine Natural products 0.000 claims abstract description 12
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 claims abstract description 11
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 claims abstract description 10
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Natural products NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 claims abstract description 8
- 239000004471 Glycine Substances 0.000 claims abstract description 7
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 claims abstract description 6
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 claims abstract 2
- 108020004414 DNA Proteins 0.000 claims description 77
- 239000007788 liquid Substances 0.000 claims description 12
- 239000013598 vector Substances 0.000 claims description 8
- 230000027455 binding Effects 0.000 claims description 7
- 108091026890 Coding region Proteins 0.000 claims description 6
- 238000012258 culturing Methods 0.000 claims description 4
- 230000001131 transforming effect Effects 0.000 claims description 3
- 239000001963 growth medium Substances 0.000 claims description 2
- 150000001413 amino acids Chemical group 0.000 claims 12
- 235000004400 serine Nutrition 0.000 claims 4
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 claims 2
- 108010048367 enhanced green fluorescent protein Proteins 0.000 claims 1
- 238000004519 manufacturing process Methods 0.000 claims 1
- 125000003275 alpha amino acid group Chemical group 0.000 abstract description 13
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 abstract description 5
- 239000004473 Threonine Substances 0.000 abstract description 5
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 abstract description 5
- 239000004474 valine Substances 0.000 abstract description 5
- 125000002987 valine group Chemical group [H]N([H])C([H])(C(*)=O)C([H])(C([H])([H])[H])C([H])([H])[H] 0.000 abstract description 5
- 239000013612 plasmid Substances 0.000 description 221
- 239000000047 product Substances 0.000 description 56
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 29
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 24
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 24
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 24
- 241000588724 Escherichia coli Species 0.000 description 23
- 239000002609 medium Substances 0.000 description 23
- 102000004169 proteins and genes Human genes 0.000 description 18
- 235000018102 proteins Nutrition 0.000 description 17
- 238000003786 synthesis reaction Methods 0.000 description 16
- 238000012937 correction Methods 0.000 description 15
- 230000000694 effects Effects 0.000 description 15
- 230000015572 biosynthetic process Effects 0.000 description 14
- 229940041514 candida albicans extract Drugs 0.000 description 14
- 239000011780 sodium chloride Substances 0.000 description 14
- 239000012137 tryptone Substances 0.000 description 14
- 239000012138 yeast extract Substances 0.000 description 14
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 13
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 12
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 12
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 12
- 241000880493 Leptailurus serval Species 0.000 description 12
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 12
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 12
- 108010013835 arginine glutamate Proteins 0.000 description 12
- 108010047857 aspartylglycine Proteins 0.000 description 12
- 108010034529 leucyl-lysine Proteins 0.000 description 12
- 108010000761 leucylarginine Proteins 0.000 description 12
- 108010057821 leucylproline Proteins 0.000 description 12
- 241000894006 Bacteria Species 0.000 description 11
- 238000012408 PCR amplification Methods 0.000 description 11
- 230000029087 digestion Effects 0.000 description 11
- 238000001976 enzyme digestion Methods 0.000 description 11
- 108010050848 glycylleucine Proteins 0.000 description 10
- 239000000203 mixture Substances 0.000 description 10
- 239000000243 solution Substances 0.000 description 10
- 238000001179 sorption measurement Methods 0.000 description 10
- 102000004190 Enzymes Human genes 0.000 description 9
- 108090000790 Enzymes Proteins 0.000 description 9
- 108091034117 Oligonucleotide Proteins 0.000 description 9
- 229940088598 enzyme Drugs 0.000 description 9
- 239000012634 fragment Substances 0.000 description 9
- 239000006228 supernatant Substances 0.000 description 7
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 6
- VWWKKDNCCLAGRM-GVXVVHGQSA-N (2s)-2-[[2-[[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]propanoyl]amino]acetyl]amino]-3-methylbutanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VWWKKDNCCLAGRM-GVXVVHGQSA-N 0.000 description 6
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 6
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 6
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 6
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 6
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 6
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 6
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 6
- FOHXUHGZZKETFI-JBDRJPRFSA-N Ala-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N FOHXUHGZZKETFI-JBDRJPRFSA-N 0.000 description 6
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 6
- QJABSQFUHKHTNP-SYWGBEHUSA-N Ala-Ile-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QJABSQFUHKHTNP-SYWGBEHUSA-N 0.000 description 6
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 6
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 6
- OPZJWMJPCNNZNT-DCAQKATOSA-N Ala-Leu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N OPZJWMJPCNNZNT-DCAQKATOSA-N 0.000 description 6
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 6
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 6
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 6
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 6
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 6
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 6
- ZVWXMTTZJKBJCI-BHDSKKPTSA-N Ala-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 ZVWXMTTZJKBJCI-BHDSKKPTSA-N 0.000 description 6
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 6
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 6
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 6
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 6
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 6
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 6
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 6
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 6
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 6
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 6
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 6
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 6
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 6
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 6
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 6
- SLNCSSWAIDUUGF-LSJOCFKGSA-N Arg-His-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O SLNCSSWAIDUUGF-LSJOCFKGSA-N 0.000 description 6
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 6
- INXWADWANGLMPJ-JYJNAYRXSA-N Arg-Phe-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CC1=CC=CC=C1 INXWADWANGLMPJ-JYJNAYRXSA-N 0.000 description 6
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 6
- QJWLLRZTJFPCHA-STECZYCISA-N Arg-Tyr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QJWLLRZTJFPCHA-STECZYCISA-N 0.000 description 6
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 6
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 6
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 6
- BKDDABUWNKGZCK-XHNCKOQMSA-N Asn-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O BKDDABUWNKGZCK-XHNCKOQMSA-N 0.000 description 6
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 6
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 6
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 6
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 6
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 6
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 6
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 6
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 6
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 6
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 6
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 6
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 6
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 6
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 6
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 6
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 6
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 6
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 6
- GRNOCLDFUNCIDW-ACZMJKKPSA-N Cys-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N GRNOCLDFUNCIDW-ACZMJKKPSA-N 0.000 description 6
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 6
- QYTKAVBFRUGYAU-ACZMJKKPSA-N Gln-Asp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QYTKAVBFRUGYAU-ACZMJKKPSA-N 0.000 description 6
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 6
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 6
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 6
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 6
- KVQOVQVGVKDZNW-GUBZILKMSA-N Gln-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KVQOVQVGVKDZNW-GUBZILKMSA-N 0.000 description 6
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 6
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 6
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 6
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 6
- ZJICFHQSPWFBKP-AVGNSLFASA-N Glu-Asn-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZJICFHQSPWFBKP-AVGNSLFASA-N 0.000 description 6
- CLROYXHHUZELFX-FXQIFTODSA-N Glu-Gln-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CLROYXHHUZELFX-FXQIFTODSA-N 0.000 description 6
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 6
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 6
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 6
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 6
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 6
- ZGEJRLJEAMPEDV-SRVKXCTJSA-N Glu-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N ZGEJRLJEAMPEDV-SRVKXCTJSA-N 0.000 description 6
- YHOJJFFTSMWVGR-HJGDQZAQSA-N Glu-Met-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YHOJJFFTSMWVGR-HJGDQZAQSA-N 0.000 description 6
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 6
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 6
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 6
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 6
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 6
- JPXNYFOHTHSREU-UWVGGRQHSA-N Gly-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN JPXNYFOHTHSREU-UWVGGRQHSA-N 0.000 description 6
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 6
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 6
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 6
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 6
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 6
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 6
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 6
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 6
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 6
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 6
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 6
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 6
- HXKZJLWGSWQKEA-LSJOCFKGSA-N His-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 HXKZJLWGSWQKEA-LSJOCFKGSA-N 0.000 description 6
- MAABHGXCIBEYQR-XVYDVKMFSA-N His-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MAABHGXCIBEYQR-XVYDVKMFSA-N 0.000 description 6
- FZKFYOXDVWDELO-KBPBESRZSA-N His-Gly-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FZKFYOXDVWDELO-KBPBESRZSA-N 0.000 description 6
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 6
- UPJODPVSKKWGDQ-KLHWPWHYSA-N His-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O UPJODPVSKKWGDQ-KLHWPWHYSA-N 0.000 description 6
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 6
- LEDRIAHEWDJRMF-CFMVVWHZSA-N Ile-Asn-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LEDRIAHEWDJRMF-CFMVVWHZSA-N 0.000 description 6
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 6
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 6
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 6
- VEPIBPGLTLPBDW-URLPEUOOSA-N Ile-Phe-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VEPIBPGLTLPBDW-URLPEUOOSA-N 0.000 description 6
- XOZOSAUOGRPCES-STECZYCISA-N Ile-Pro-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XOZOSAUOGRPCES-STECZYCISA-N 0.000 description 6
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 6
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 6
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 6
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 6
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 6
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 6
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 6
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 6
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 6
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 6
- IIKJNQWOQIWWMR-CIUDSAMLSA-N Leu-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N IIKJNQWOQIWWMR-CIUDSAMLSA-N 0.000 description 6
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 6
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 6
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 6
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 6
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 6
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 6
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 6
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 6
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 6
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 6
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 6
- MPSBSKHOWJQHBS-IHRRRGAJSA-N Leu-His-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N MPSBSKHOWJQHBS-IHRRRGAJSA-N 0.000 description 6
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 6
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 6
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 6
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 6
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 6
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 6
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 6
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 6
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 6
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 6
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 6
- WGAZVKFCPHXZLO-SZMVWBNQSA-N Leu-Trp-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N WGAZVKFCPHXZLO-SZMVWBNQSA-N 0.000 description 6
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 6
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 6
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 6
- NLOZZWJNIKKYSC-WDSOQIARSA-N Lys-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 NLOZZWJNIKKYSC-WDSOQIARSA-N 0.000 description 6
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 6
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 6
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 6
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 6
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 6
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 6
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 6
- KQBJYJXPZBNEIK-DCAQKATOSA-N Met-Glu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQBJYJXPZBNEIK-DCAQKATOSA-N 0.000 description 6
- DGNZGCQSVGGYJS-BQBZGAKWSA-N Met-Gly-Asp Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O DGNZGCQSVGGYJS-BQBZGAKWSA-N 0.000 description 6
- YCUSPBPZVJDMII-YUMQZZPRSA-N Met-Gly-Glu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O YCUSPBPZVJDMII-YUMQZZPRSA-N 0.000 description 6
- BKIFWLQFOOKUCA-DCAQKATOSA-N Met-His-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N BKIFWLQFOOKUCA-DCAQKATOSA-N 0.000 description 6
- WXUUEPIDLLQBLJ-DCAQKATOSA-N Met-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N WXUUEPIDLLQBLJ-DCAQKATOSA-N 0.000 description 6
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 6
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 6
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 6
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 6
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 6
- IILUKIJNFMUBNF-IHRRRGAJSA-N Phe-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O IILUKIJNFMUBNF-IHRRRGAJSA-N 0.000 description 6
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 6
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 6
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 6
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 6
- OKQQWSNUSQURLI-JYJNAYRXSA-N Phe-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N OKQQWSNUSQURLI-JYJNAYRXSA-N 0.000 description 6
- AGTHXWTYCLLYMC-FHWLQOOXSA-N Phe-Tyr-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 AGTHXWTYCLLYMC-FHWLQOOXSA-N 0.000 description 6
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 6
- HPXVFFIIGOAQRV-DCAQKATOSA-N Pro-Arg-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O HPXVFFIIGOAQRV-DCAQKATOSA-N 0.000 description 6
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 6
- KQCCDMFIALWGTL-GUBZILKMSA-N Pro-Asn-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 KQCCDMFIALWGTL-GUBZILKMSA-N 0.000 description 6
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 6
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 6
- BFXZQMWKTYWGCF-PYJNHQTQSA-N Pro-His-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BFXZQMWKTYWGCF-PYJNHQTQSA-N 0.000 description 6
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 6
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 6
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 6
- WWXNZNWZNZPDIF-SRVKXCTJSA-N Pro-Val-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 WWXNZNWZNZPDIF-SRVKXCTJSA-N 0.000 description 6
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 6
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 6
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 6
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 6
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 6
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 6
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 6
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 6
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 6
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 6
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 6
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 6
- KWQBJOUOSNJDRR-XAVMHZPKSA-N Thr-Cys-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N)O KWQBJOUOSNJDRR-XAVMHZPKSA-N 0.000 description 6
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 6
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 6
- LKEKWDJCJSPXNI-IRIUXVKKSA-N Thr-Glu-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LKEKWDJCJSPXNI-IRIUXVKKSA-N 0.000 description 6
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 6
- YDWLCDQXLCILCZ-BWAGICSOSA-N Thr-His-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YDWLCDQXLCILCZ-BWAGICSOSA-N 0.000 description 6
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 6
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 6
- HSQXHRIRJSFDOH-URLPEUOOSA-N Thr-Phe-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HSQXHRIRJSFDOH-URLPEUOOSA-N 0.000 description 6
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 6
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 6
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 6
- WNZRNOGHEONFMS-PXDAIIFMSA-N Trp-Ile-Tyr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WNZRNOGHEONFMS-PXDAIIFMSA-N 0.000 description 6
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 6
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 6
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 6
- AXWBYOVVDRBOGU-SIUGBPQLSA-N Tyr-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AXWBYOVVDRBOGU-SIUGBPQLSA-N 0.000 description 6
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 6
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 6
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 6
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 6
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 6
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 6
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 6
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 6
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 6
- BZOSBRIDWSSTFN-AVGNSLFASA-N Val-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N BZOSBRIDWSSTFN-AVGNSLFASA-N 0.000 description 6
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 6
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 6
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 6
- 108010041407 alanylaspartic acid Proteins 0.000 description 6
- 108010005233 alanylglutamic acid Proteins 0.000 description 6
- 108010070944 alanylhistidine Proteins 0.000 description 6
- 108010087924 alanylproline Proteins 0.000 description 6
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 6
- 108010008355 arginyl-glutamine Proteins 0.000 description 6
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 6
- 108010068380 arginylarginine Proteins 0.000 description 6
- 108010062796 arginyllysine Proteins 0.000 description 6
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 6
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 6
- 108010038633 aspartylglutamate Proteins 0.000 description 6
- 238000002474 experimental method Methods 0.000 description 6
- 108010078144 glutaminyl-glycine Proteins 0.000 description 6
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 6
- 108010089804 glycyl-threonine Proteins 0.000 description 6
- 108010037850 glycylvaline Proteins 0.000 description 6
- 108010025306 histidylleucine Proteins 0.000 description 6
- 108010085325 histidylproline Proteins 0.000 description 6
- 108010009932 leucyl-alanyl-glycyl-valine Proteins 0.000 description 6
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 6
- 108010056582 methionylglutamic acid Proteins 0.000 description 6
- 108010005942 methionylglycine Proteins 0.000 description 6
- 108010077112 prolyl-proline Proteins 0.000 description 6
- 108010070643 prolylglutamic acid Proteins 0.000 description 6
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 6
- 108010080629 tryptophan-leucine Proteins 0.000 description 6
- 108010003137 tyrosyltyrosine Proteins 0.000 description 6
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 6
- 125000000539 amino acid group Chemical group 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 5
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 4
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 4
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 4
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 4
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 4
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 4
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 4
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 4
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- 238000012217 deletion Methods 0.000 description 4
- 230000037430 deletion Effects 0.000 description 4
- 239000000284 extract Substances 0.000 description 4
- 125000000341 threoninyl group Chemical group [H]OC([H])(C([H])([H])[H])C([H])(N([H])[H])C(*)=O 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- 108091005804 Peptidases Proteins 0.000 description 3
- 239000004365 Protease Substances 0.000 description 3
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 3
- 238000007792 addition Methods 0.000 description 3
- 239000003242 anti bacterial agent Substances 0.000 description 3
- 229940088710 antibiotic agent Drugs 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 230000033607 mismatch repair Effects 0.000 description 3
- 239000002244 precipitate Substances 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 239000002699 waste material Substances 0.000 description 3
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 2
- NIUDXSFNLBIWOB-DCAQKATOSA-N Arg-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NIUDXSFNLBIWOB-DCAQKATOSA-N 0.000 description 2
- 108091033409 CRISPR Proteins 0.000 description 2
- PQHYZJPCYRDYNE-QWRGUYRKSA-N Cys-Gly-Phe Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PQHYZJPCYRDYNE-QWRGUYRKSA-N 0.000 description 2
- VPQZSNQICFCCSO-BJDJZHNGSA-N Cys-Leu-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VPQZSNQICFCCSO-BJDJZHNGSA-N 0.000 description 2
- LKHMGNHQULEPFY-ACZMJKKPSA-N Cys-Ser-Glu Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O LKHMGNHQULEPFY-ACZMJKKPSA-N 0.000 description 2
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 2
- 241000588722 Escherichia Species 0.000 description 2
- 241000672609 Escherichia coli BL21 Species 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- CTHAJJYOHOBUDY-GHCJXIJMSA-N Ile-Cys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N CTHAJJYOHOBUDY-GHCJXIJMSA-N 0.000 description 2
- OLHDPZMYUSBGDE-GUBZILKMSA-N Pro-Arg-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O OLHDPZMYUSBGDE-GUBZILKMSA-N 0.000 description 2
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 2
- 241000607720 Serratia Species 0.000 description 2
- 108700005078 Synthetic Genes Proteins 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 229920002678 cellulose Polymers 0.000 description 2
- 239000001913 cellulose Substances 0.000 description 2
- 238000004140 cleaning Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 239000012149 elution buffer Substances 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 239000005090 green fluorescent protein Substances 0.000 description 2
- 238000003306 harvesting Methods 0.000 description 2
- 238000011081 inoculation Methods 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 238000006386 neutralization reaction Methods 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 239000013049 sediment Substances 0.000 description 2
- 230000035939 shock Effects 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 239000007222 ypd medium Substances 0.000 description 2
- 244000153158 Ammi visnaga Species 0.000 description 1
- 235000010585 Ammi visnaga Nutrition 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000186312 Brevibacterium sp. Species 0.000 description 1
- LZZYPRNAOMGNLH-UHFFFAOYSA-M Cetrimonium bromide Chemical compound [Br-].CCCCCCCCCCCCCCCC[N+](C)(C)C LZZYPRNAOMGNLH-UHFFFAOYSA-M 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 229920000742 Cotton Polymers 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- 108020002230 Pancreatic Ribonuclease Proteins 0.000 description 1
- 102000005891 Pancreatic ribonuclease Human genes 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 102000018120 Recombinases Human genes 0.000 description 1
- 108010091086 Recombinases Proteins 0.000 description 1
- 241000589500 Thermus aquaticus Species 0.000 description 1
- 239000007984 Tris EDTA buffer Substances 0.000 description 1
- 241000607598 Vibrio Species 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 238000000429 assembly Methods 0.000 description 1
- 230000000712 assembly Effects 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 238000009835 boiling Methods 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 239000000710 homodimer Substances 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 230000037353 metabolic pathway Effects 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 150000007523 nucleic acids Chemical class 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 238000013492 plasmid preparation Methods 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 102220059964 rs786201995 Human genes 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 239000000741 silica gel Substances 0.000 description 1
- 229910002027 silica gel Inorganic materials 0.000 description 1
- 239000000377 silicon dioxide Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- Zoology (AREA)
- Molecular Biology (AREA)
- Wood Science & Technology (AREA)
- Biophysics (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Gastroenterology & Hepatology (AREA)
- Microbiology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Medicinal Chemistry (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
Description
技术领域technical field
本发明涉及生物技术领域。具体地说,本发明涉及错配结合蛋白(简称eMutS)的突变体及其应用。The present invention relates to the field of biotechnology. Specifically, the present invention relates to a mutant of mismatch binding protein (abbreviated as eMutS) and its application.
背景技术Background technique
随着合成生物学的兴起,我们进入了创造具有新基因功能甚至全基因组的基因的时代。DNA合成具有广泛的应用,包括基因合成,代谢通路合成,甚至整个基因组的合成(Gibson et al.,2008;Gibson et al.,2010)。从头基因合成,作为合成生物学的重要技术手段,主要涉及通过基于PCR或DNA连接酶的连接反应将短链寡核苷酸(oligouncletoides,oligos)组装成长链DNA(Jingdong et al.,2004),在合成生物学和生物技术领域发挥着越来越重要的作用(Bayer and Smolke,2005;Carr and Church,2009;Gibson et al.,2008;Wang et al.,2009)。目前,在多孔二氧化硅珠粒上合成的寡核苷酸通常用作合成反应的起始寡核苷酸,其相对昂贵且具有低的合成通量。与传统方法相比,利用微流体合成技术在固相平面(微流控芯片)上平行合成寡核苷酸,不仅降低了寡核苷酸的成本,而且大大提高了合成的通量(Gao et al.,2001;Quan et al.,2011;Richmond et al.,2004;Sriram etal.,2010;Xiaochuan et al.,2004)。然而,微芯片合成反应的每个步骤的效率低于常规方法的效率,并且具有相对较高的错误率,导致微芯片合成具有较低的合成质量和较短的寡核苷酸长度(Tian et al.,2009)。合成基因中的错误来源于寡核苷酸化学合成过程中引入的错误碱基,聚合酶的延伸以及寡核苷酸之间的错误组装(Cline et al.,1996)。由于合成错误的存在,使用克隆测序方法从多个克隆中选择具有正确序列的克隆通常是不可避免的,并且该部分的成本通常占据基因合成总成本的较大部分。因此,提高合成DNA的保真度是大规模应用基因从头合成技术的迫切要求。酶纠错方法是改善错误的经济有效的方法。通常,这些酶特异性识别并结合含有错误配对序列的DNA。因此,DNA双链通过变性退火将错误的单链和正确的单链随机组合,使得错误合成的碱基形成错配,然后使用错配结合蛋白(MutS)或错配切除酶(例如,resolvase)删除合成错误。Lubock等人系统地比较了六种不同的纠错酶,并得出结论:MutS最适合增加完美组装的数量(最多25.2倍)(Lubock et al.,2017)。With the rise of synthetic biology, we have entered the era of creating genes with new gene functions and even whole genomes. DNA synthesis has a wide range of applications, including gene synthesis, metabolic pathway synthesis, and even whole genome synthesis (Gibson et al., 2008; Gibson et al., 2010). De novo gene synthesis, as an important technique in synthetic biology, mainly involves the assembly of short-chain oligonucleotides (oligouncletoides, oligos) into long-chain DNA by ligation reactions based on PCR or DNA ligase (Jingdong et al., 2004), Playing an increasingly important role in synthetic biology and biotechnology (Bayer and Smolke, 2005; Carr and Church, 2009; Gibson et al., 2008; Wang et al., 2009). Currently, oligonucleotides synthesized on porous silica beads are generally used as starting oligonucleotides for synthesis reactions, which are relatively expensive and have low synthesis throughput. Compared with traditional methods, the use of microfluidic synthesis technology to synthesize oligonucleotides in parallel on a solid phase plane (microfluidic chip) not only reduces the cost of oligonucleotides, but also greatly improves the throughput of synthesis (Gao et al. al., 2001; Quan et al., 2011; Richmond et al., 2004; Sriram et al., 2010; Xiaochuan et al., 2004). However, the efficiency of each step of the microchip synthesis reaction is lower than that of conventional methods, and has a relatively high error rate, resulting in microchip synthesis with lower synthesis quality and shorter oligonucleotide length (Tian et al. al., 2009). Errors in synthetic genes originate from erroneous bases introduced during chemical oligonucleotide synthesis, polymerase extension, and misassembly between oligonucleotides (Cline et al., 1996). Due to the existence of synthesis errors, it is usually unavoidable to select a clone with the correct sequence from multiple clones using clone sequencing methods, and the cost of this part usually occupies a larger part of the total cost of gene synthesis. Therefore, improving the fidelity of synthetic DNA is an urgent requirement for large-scale application of de novo gene synthesis technology. Enzymatic error correction methods are a cost-effective way to improve errors. Typically, these enzymes specifically recognize and bind DNA that contains mismatched sequences. Therefore, the DNA duplex is randomly combined with the wrong single strand and the correct single strand through denaturing annealing, so that the wrongly synthesized bases form a mismatch, and then use a mismatch binding protein (MutS) or a mismatch excision enzyme (for example, resolvase) Removed synthesis errors. Lubock et al. systematically compared six different error-correcting enzymes and concluded that MutS was the most suitable for increasing the number of perfect assemblies (up to 25.2-fold) (Lubock et al., 2017).
真核生物和原核生物中普遍存在的错配修复系统(MMR)系统确保了遗传物质的精确和稳定复制。错配结合蛋白MutS是MMR的重要组分,其识别并结合许多不同的DNA错配(And and Jinksrobertson,2000;Iyer et al.,2006;Modrich,1991)。尽管蛋白质结合不同错配的能力不同,但它可以结合几乎所有单碱基错配以及1-4碱基插入或缺失突变(Whitehouse et al.,1997)。源自大肠杆菌(EcoMutS)的MutS为约97kDa,并且与G:T,A:C,A:A,G:G的结合效率高于其他错配(Brown et al.,2001)。MutS以同源二聚体的形式不对称地结合错配的DNA。The Mismatch Repair (MMR) system ubiquitous in eukaryotes and prokaryotes ensures precise and stable replication of genetic material. The mismatch binding protein MutS is an important component of MMR, which recognizes and binds many different DNA mismatches (And and Jinksrobertson, 2000; Iyer et al., 2006; Modrich, 1991). Although the ability of the protein to bind different mismatches varies, it can bind almost all single-base mismatches as well as 1-4 base insertion or deletion mutations (Whitehouse et al., 1997). MutS derived from Escherichia coli (EcoMutS) is about 97 kDa and binds G:T, A:C, A:A, G:G more efficiently than other mismatches (Brown et al., 2001). MutS asymmetrically binds mismatched DNA as a homodimer.
据报道,MutS用于校正寡核苷酸组装的基因片段(Binkowski et al.,2005;Carret al.,2004)。来自水生栖热菌(TaqMutS)的MutS通常用于错误校正,因为它比EcoMutS更具热稳定性并且对完全互补的DNA具有弱的结合能力。然而,TaqMutS与错配碱基的结合能力远弱于EcoMutS,后者直接降低了错误修正效率(Brown et al.,2001;Cho et al.,2007)。最近,建立了用于校正芯片合成的寡核苷酸文库及其组装基因中的合成错误的高通量错误校正方法(Wan et al.,2017;Wen et al.,2014)。这种使用方法,可以将具有错配结合能力的蛋白EcoMutS和TaqMutS固定化在纤维素柱上,这种方案低成本,高通量。然而,EcoMutS的热稳定性较差限制了该错误校正系统在基因合成中的大规模应用。MutS has been reported to be used to correct oligonucleotide-assembled gene fragments (Binkowski et al., 2005; Carret al., 2004). MutS from Thermus aquaticus (TaqMutS) is commonly used for error correction because it is more thermostable than EcoMutS and has weak binding capacity for fully complementary DNA. However, the binding ability of TaqMutS to mismatched bases is much weaker than that of EcoMutS, which directly reduces the efficiency of error correction (Brown et al., 2001; Cho et al., 2007). Recently, high-throughput error correction methods for correcting synthetic errors in ChIP-synthesized oligonucleotide libraries and their assembled genes were established (Wan et al., 2017; Wen et al., 2014). Using this method, the proteins EcoMutS and TaqMutS with mismatch binding ability can be immobilized on the cellulose column, which is low cost and high throughput. However, the poor thermal stability of EcoMutS limits the large-scale application of this error correction system in gene synthesis.
因此,综上所述,本领域急需具有较高热稳定性的错配结合蛋白突变体。Therefore, in summary, there is an urgent need in the art for mismatch binding protein mutants with higher thermal stability.
发明内容Contents of the invention
本发明的目的在于提供一种错配结合蛋白的突变体及此类突变体的应用。The purpose of the present invention is to provide a mutant of mismatch binding protein and the application of the mutant.
在第一方面,本发明提供一种错配结合蛋白,所述错配结合蛋白:In a first aspect, the invention provides a mismatch binding protein that:
a.包含如SEQ ID NO:52所示氨基酸序列且具有选自以下至少一组的突变:a. comprising an amino acid sequence as shown in SEQ ID NO:52 and having a mutation selected from at least one of the following groups:
1)第120位和第151位的丝氨酸均突变为半胱氨酸,1) Both serine at position 120 and position 151 are mutated to cysteine,
2)第157位的亮氨酸和第233位的甘氨酸均突变为半胱氨酸,2) Leucine at position 157 and glycine at position 233 are both mutated to cysteine,
3)第451位的谷氨酸和第465位的缬氨酸均突变为半胱氨酸,3) Both glutamic acid at position 451 and valine at position 465 are mutated into cysteine,
4)第609位的蛋氨酸和第723位的苏氨酸均突变为半胱氨酸,或4) both methionine at position 609 and threonine at position 723 are mutated to cysteine, or
5)第120位的丝氨酸、第157位的亮氨酸、第451位的谷氨酸和第609位的蛋氨酸均突变为半胱氨酸;或5) Serine at position 120, leucine at position 157, glutamic acid at position 451 and methionine at position 609 are all mutated to cysteine; or
b.具有a)所限定的序列经过1-20个氨基酸残基的取代、缺失或添加而形成的序列,且基本具有a)所限定的错配结合蛋白功能的由a)衍生的错配结合蛋白。在优选的实施方式中,所述错配结合蛋白来源于埃希氏菌属细菌,优选地,来源于大肠杆菌。b. The sequence defined in a) is formed by substitution, deletion or addition of 1-20 amino acid residues, and the mismatch binding protein derived from a) basically has the function of a mismatch binding protein as defined in a) protein. In a preferred embodiment, the mismatch binding protein is derived from bacteria of the genus Escherichia, preferably from Escherichia coli.
在优选的实施方式中,所述错配结合蛋白:In a preferred embodiment, the mismatch binding protein:
a.具有如SEQ ID NO:52所示氨基酸序列且具有选自以下至少一组的突变:a. have an amino acid sequence as shown in SEQ ID NO:52 and have a mutation selected from at least one group of the following:
1)第120位和第151位的丝氨酸均突变为半胱氨酸,1) Both serine at position 120 and position 151 are mutated to cysteine,
2)第157位的亮氨酸和第233位的甘氨酸均突变为半胱氨酸,2) Leucine at position 157 and glycine at position 233 are both mutated to cysteine,
3)第451位的谷氨酸和第465位的缬氨酸均突变为半胱氨酸,3) Both glutamic acid at position 451 and valine at position 465 are mutated into cysteine,
4)第609位的蛋氨酸和第723位的苏氨酸突变均为半胱氨酸,或4) both the methionine at position 609 and the threonine at position 723 mutations are cysteine, or
5)第120位的丝氨酸、第157位的亮氨酸、第451位的谷氨酸和第609位的蛋氨酸均突变为半胱氨酸;或5) Serine at position 120, leucine at position 157, glutamic acid at position 451 and methionine at position 609 are all mutated to cysteine; or
b.具有a)所限定的序列经过一个或几个氨基酸残基,优选1-20个、更优选1-15个、更优选1-10个、更优选1-3个、最优选1个氨基酸残基的取代、缺失或添加而形成的序列,且基本具有a)所限定的错配结合蛋白功能的由a)衍生的错配结合蛋白。b. have the sequence defined in a) through one or several amino acid residues, preferably 1-20, more preferably 1-15, more preferably 1-10, more preferably 1-3, most preferably 1 amino acid residues A mismatch binding protein derived from a) that is a sequence formed by substitution, deletion or addition of residues, and basically has the function of the mismatch binding protein defined in a).
在优选的实施方式中,所述错配结合蛋白:In a preferred embodiment, the mismatch binding protein:
a.其氨基酸序列如SEQ ID NO:54、56、58、60或62所示;或a. its amino acid sequence is shown in SEQ ID NO: 54, 56, 58, 60 or 62; or
b.包含a)所限定的序列经过一个或几个氨基酸残基,优选1-20个、更优选1-15个、更优选1-10个、更优选1-3个、最优选1个氨基酸残基的取代、缺失或添加而形成的序列,且基本具有a)所限定的错配结合蛋白功能的由a)衍生的错配结合蛋白。b. comprising the sequence defined in a) through one or several amino acid residues, preferably 1-20, more preferably 1-15, more preferably 1-10, more preferably 1-3, most preferably 1 amino acid residues A mismatch binding protein derived from a) that is a sequence formed by substitution, deletion or addition of residues, and basically has the function of the mismatch binding protein defined in a).
在优选的实施方式中,所述错配结合蛋白的氨基酸序列如SEQ ID NO:54、56、58、60或62所示。In a preferred embodiment, the amino acid sequence of the mismatch binding protein is shown in SEQ ID NO: 54, 56, 58, 60 or 62.
在一优选例中,所述的错配结合蛋白在4℃条件下,保存90天至少保留50%以上的活性;优选地,60%-70%以上的活性;更优选地,70%-80%以上的活性,最优选,90%以上的活性。In a preferred example, the mismatch binding protein retains at least 50% of its activity when stored at 4°C for 90 days; preferably, 60%-70% of its activity; more preferably, 70%-80% % or more activity, most preferably, more than 90% activity.
在一优选例中,所述的错配结合蛋白利用固定化柱子纯化方式(MICC错配纠正系统)(Wen et al.,2014),将合成基因的错误率降低到2/kb,优选地,降低到1/kb,最优选地,降低到0.56/kb。In a preferred example, the mismatch binding protein is purified using an immobilized column (MICC mismatch correction system) (Wen et al., 2014), which reduces the error rate of the synthetic gene to 2/kb, preferably, Reduced to 1/kb, most preferably, to 0.56/kb.
本文所用的术语“对应于”具有本领域普通技术人员通常理解的意义。具体地说,“对应于”表示两条序列经同源性或序列相同性比对后,一条序列与另一条序列中的指定位置相对应的位置。The term "corresponding to" used herein has a meaning commonly understood by those of ordinary skill in the art. Specifically, "corresponding to" means that after two sequences are aligned for homology or sequence identity, one sequence corresponds to the specified position in the other sequence.
在第二方面,本发明提供一种编码本发明第一方面所述蛋白的基因。In a second aspect, the present invention provides a gene encoding the protein described in the first aspect of the present invention.
在一优选例中,所述基因的核苷酸序列如SEQ ID NO:53、55、57、59或61所示。In a preferred example, the nucleotide sequence of the gene is shown in SEQ ID NO: 53, 55, 57, 59 or 61.
在第三方面,本发明提供包含本发明第二方面所述编码基因的载体。In a third aspect, the present invention provides a vector comprising the coding gene of the second aspect of the present invention.
在第四方面,本发明提供一种宿主细胞,所述宿主细胞含有本发明第二方面所述编码基因。In the fourth aspect, the present invention provides a host cell containing the coding gene of the second aspect of the present invention.
在优选的实施方式中,所述宿主细胞来自埃希氏菌属(Escherichia)、棒状杆菌属(Corynebacterium)、短杆菌属(Brevibacterium sp.)、芽孢杆菌属(Bacillus)、沙雷氏菌属(Serratia)或弧菌属(Vibrio)。In a preferred embodiment, the host cell is from the genus Escherichia, Corynebacterium, Brevibacterium sp., Bacillus, Serratia ( Serratia) or Vibrio.
在优选的实施方式中,所述宿主细胞是大肠杆菌(E.Coli)。In a preferred embodiment, the host cell is Escherichia coli (E. Coli).
在一优选例中,所述宿主细胞染色体上整合有本发明第二方面所述编码基因或含有本发明第三方面所述载体。In a preferred example, the host cell is chromosomally integrated with the coding gene of the second aspect of the present invention or contains the vector of the third aspect of the present invention.
在优选的实施方式中,所述宿主细胞表达所述错配结合蛋白。In a preferred embodiment, said host cell expresses said mismatch binding protein.
在第五方面,本发明提供一种制备错配结合蛋白的方法,所述方法包括以下步骤:In a fifth aspect, the present invention provides a method of preparing a mismatch binding protein, said method comprising the steps of:
a.培养本发明第四方面所述的宿主细胞,使之产生所述错配结合蛋白;和a. cultivating the host cell according to the fourth aspect of the present invention to produce the mismatch binding protein; and
b.从培养液中分离所述错配结合蛋白。b. Isolating the mismatch binding protein from the culture medium.
在优选的实施方式中,所述方法培养温度在16-45℃,更优选地,培养温度在16℃实施。In a preferred embodiment, the cultivation temperature of the method is 16-45°C, more preferably, the cultivation temperature is implemented at 16°C.
在第六方面,本发明提供本发明第一方面所述错配结合蛋白在识别并结合错配DNA的应用。In the sixth aspect, the present invention provides the use of the mismatch binding protein described in the first aspect of the present invention in recognizing and binding to mismatched DNA.
在一优选例中,本发明第一方面所述错配结合蛋白以自由液态用于识别并结合错配DNA或者与EGFP(Enhance green fluorescent protein,绿色荧光蛋白)、CBM(Cellulosebinding module,纤维素结构域)联合使用从而固定化错配结合蛋白于柱子上用于识别并结合错配DNA。优选地,本发明第一方面所述错配结合蛋白与EGFP、CBM联合使用从而固定化错配结合蛋白于柱子上热稳定性更高。In a preferred example, the mismatch binding protein described in the first aspect of the present invention is used in a free liquid state to recognize and bind to mismatched DNA or to combine with EGFP (Enhance green fluorescent protein, green fluorescent protein), CBM (Cellulosebinding module, cellulose structure) domain) to immobilize the mismatch binding protein on the column for recognition and binding to mismatched DNA. Preferably, the mismatch binding protein described in the first aspect of the present invention is used in combination with EGFP and CBM so that the immobilized mismatch binding protein has higher thermal stability on the column.
在第七方面,本发明提供制备本发明第一方面所述错配结合蛋白的方法,所述方法包括以下步骤:In the seventh aspect, the present invention provides a method for preparing the mismatch binding protein according to the first aspect of the present invention, the method comprising the following steps:
a.改造SEQ ID NO:52所示氨基酸序列的编码序列,使得编码的氨基酸序列中对应于SEQ ID NO:52所示氨基酸序列具有选自以下至少一组突变:a. Transform the coding sequence of the amino acid sequence shown in SEQ ID NO:52, so that the amino acid sequence corresponding to SEQ ID NO:52 in the encoded amino acid sequence has at least one group of mutations selected from the following:
1)第120位和第151位的丝氨酸均突变为半胱氨酸,1) Both serine at position 120 and position 151 are mutated to cysteine,
2)第157位的亮氨酸和第233位的甘氨酸均突变为半胱氨酸,2) Leucine at position 157 and glycine at position 233 are both mutated to cysteine,
3)第451位的谷氨酸和第465位的缬氨酸均突变为半胱氨酸,3) Both glutamic acid at position 451 and valine at position 465 are mutated into cysteine,
4)第609位的蛋氨酸和第723位的苏氨酸突变均为半胱氨酸,或4) both the methionine at position 609 and the threonine at position 723 mutations are cysteine, or
5)第120位的丝氨酸、第157位的亮氨酸、第451位的谷氨酸和第609位的蛋氨酸均突变为半胱氨酸;5) Serine at position 120, leucine at position 157, glutamic acid at position 451 and methionine at position 609 are all mutated to cysteine;
b.将a)得到的编码序列直接转染合适的宿主细胞或经载体引入合适的宿主细胞;b. directly transfecting the coding sequence obtained in a) into a suitable host cell or introducing a vector into a suitable host cell;
c.培养b)得到的宿主细胞;c. cultivating the host cell obtained in b);
d.从步骤c)得到的培养体系中分离所述宿主细胞产生的错配结合蛋白;和d. isolating the mismatch binding protein produced by the host cell from the culture system obtained in step c); and
e.测定所述错配结合蛋白的酶活以及热稳定性。e. Determining the enzyme activity and thermal stability of the mismatch binding protein.
在第八方面,本发明提供改造野生型错配结合蛋白使之提高热稳定性的方法,所述方法包括以下步骤:In an eighth aspect, the present invention provides a method for modifying a wild-type mismatch binding protein to improve thermal stability, the method comprising the following steps:
a.将野生型错配结合蛋白的氨基酸序列与SEQ ID NO:52所示氨基酸序列作比对;和a. aligning the amino acid sequence of the wild-type mismatch binding protein with the amino acid sequence shown in SEQ ID NO:52; and
b.改造所述野生型错配结合蛋白的编码序列,使得编码的氨基酸序列中对应于SEQ ID NO:52所示氨基酸序列具有选自以下至少一组的突变:b. transforming the coding sequence of the wild-type mismatch binding protein, so that the amino acid sequence corresponding to SEQ ID NO:52 has a mutation selected from at least one of the following groups in the encoded amino acid sequence:
1)第120位和第151位的丝氨酸均突变为半胱氨酸,1) Both serine at position 120 and position 151 are mutated to cysteine,
2)第157位的亮氨酸和第233位的甘氨酸均突变为半胱氨酸,2) Leucine at position 157 and glycine at position 233 are both mutated to cysteine,
3)第451位的谷氨酸和第465位的缬氨酸均突变为半胱氨酸,3) Both glutamic acid at position 451 and valine at position 465 are mutated into cysteine,
4)第609位的蛋氨酸和第723位的苏氨酸突变均为半胱氨酸,或4) both the methionine at position 609 and the threonine at position 723 mutations are cysteine, or
5)第120位的丝氨酸、第157位的亮氨酸、第451位的谷氨酸和第609位的蛋氨酸均突变为半胱氨酸;5) Serine at position 120, leucine at position 157, glutamic acid at position 451 and methionine at position 609 are all mutated to cysteine;
c.将b得到的编码序列直接转染合适的宿主细胞或经载体引入合适的宿主细胞;c. Directly transfecting the coding sequence obtained in b into a suitable host cell or introducing a vector into a suitable host cell;
d.培养c得到的宿主细胞;d. culturing the host cells obtained in c;
e.从步骤d得到的培养体系中分离所述宿主细胞产生的错配结合蛋白;和e. isolating the mismatch binding protein produced by the host cell from the culture system obtained in step d; and
f.测定所述错配结合蛋白突变体的酶活以及热稳定性。f. Determining the enzyme activity and thermal stability of the mismatch binding protein mutant.
应理解,在本发明范围内中,本发明的上述各技术特征和在下文(如实施例)中具体描述的各技术特征之间都可以互相组合,从而构成新的或优选的技术方案。限于篇幅,在此不再一一累述。It should be understood that within the scope of the present invention, the above-mentioned technical features of the present invention and the technical features specifically described in the following (such as embodiments) can be combined with each other to form new or preferred technical solutions. Due to space limitations, we will not repeat them here.
附图说明Description of drawings
图1展示了本发明中质粒的构建图。Figure 1 shows the construction diagram of the plasmid in the present invention.
图2比较了本发明的eMutS突变体和野生型eMutS在4℃下的热稳定性。Figure 2 compares the thermostability at 4°C between the eMutS mutant of the present invention and the wild-type eMutS.
A.利用软件“ImageJ”处理的结果。A. Results processed with the software "ImageJ".
根据DNA琼脂糖凝胶上DNA的亮度结果,利用“ImageJ”软件对所有样品给出一个RawIntDen值,RawIntDen比例是指样品的RawIntDen值/对照eMutS的RawIntDen值,如果RawIntDen比例低于0.3,则表示蛋白有活性;如果RawIntDen比例高于0.3,则表示蛋白失去活性。According to the DNA brightness results on the DNA agarose gel, use the "ImageJ" software to give a RawIntDen value for all samples, and the RawIntDen ratio refers to the RawIntDen value of the sample/the RawIntDen value of the control eMutS, if the RawIntDen ratio is lower than 0.3, it means The protein is active; if the RawIntDen ratio is higher than 0.3, it means that the protein is inactive.
B.不同的eMutS蛋白突变体的热稳定性。B. Thermostability of different eMutS protein mutants.
eMutS是野生型对照,eMutS-E38C-G70C、eMutS-S120C-S151C、eMutS-L157C-G233C、eMutS-E177C-G195C、eMutS-Y474C-R500C、eMutS-E451C-V465C、eMutS-T581C-K644C、eMutS-H585C-Q626C、eMutS-I597C-H760C、eMutS-M609C-T723C和eMutS-S120C-L157C-E451C-M609C都是不同的eMutS蛋白突变体样品。eMutS is the wild-type control, eMutS-E38C-G70C, eMutS-S120C-S151C, eMutS-L157C-G233C, eMutS-E177C-G195C, eMutS-Y474C-R500C, eMutS-E451C-V465C, eMutS-T581C-MutS-K644C, H585C-Q626C, eMutS-I597C-H760C, eMutS-M609C-T723C and eMutS-S120C-L157C-E451C-M609C are all different eMutS protein mutant samples.
具体实施方式Detailed ways
发明人经过广泛而深入的研究,出乎意料地发现对大肠杆菌来源的错配结合蛋白进行定点突变,增加二硫键,获得的错配结合蛋白突变体在4℃下保存90天仍能保持大于90%的活性。在此基础上完成了本发明。After extensive and in-depth research, the inventor unexpectedly found that the mismatch binding protein derived from Escherichia coli was subjected to site-directed mutation to increase the disulfide bond, and the obtained mismatch binding protein mutant could still maintain the protein after being stored at 4°C for 90 days. Greater than 90% activity. The present invention has been accomplished on this basis.
本发明的应用与优点:Application and advantage of the present invention:
1.本发明提供的各种错配结合蛋白高效识别并结合错配的DNA;1. Various mismatch binding proteins provided by the invention efficiently recognize and bind mismatched DNA;
2.本发明提供的各种错配结合蛋白比未经改造的野生型错配结合蛋白具有较高的热稳定性,在工业上的应用前景广阔。2. The various mismatch binding proteins provided by the present invention have higher thermal stability than unmodified wild-type mismatch binding proteins, and have broad prospects for industrial application.
下面结合具体实施例,进一步阐述本发明。应理解,这些实施例仅用于说明本发明而不用于限制本发明的范围。下列实施例中未注明具体条件的实验方法,通常按照常规条件如Sambrook等人,分子克隆:实验室手册(New York:Cold Spring Harbor LaboratoryPress,1989)中所述的条件,或按照制造厂商所建议的条件。Below in conjunction with specific embodiment, further illustrate the present invention. It should be understood that these examples are only used to illustrate the present invention and are not intended to limit the scope of the present invention. The experimental methods not indicating specific conditions in the following examples are usually according to conventional conditions such as Sambrook et al., molecular cloning: the conditions described in the laboratory manual (New York: Cold Spring Harbor Laboratory Press, 1989), or according to the manufacturer's instructions suggested conditions.
试剂和菌株:本发明中的所有试剂均是市场购买的试剂级以上的试剂。其中,Tryptone,Yeast extract,NaCl,质粒提取试剂盒,胶回收试剂盒以及所有的限制性内切酶均来源于上海生工生物工程公司。PrimeSTAR HS DNA聚合酶,Solution I连接酶以及pMD18-T载体购自于大连宝生物公司。大肠杆菌Escherichia coli XL10-gold菌株作为DNA操作时使用的宿主菌(美国加利福利亚Stratagene公司),包含100μg/ml氨苄青霉素的Luria-Bertani(LB)培养基用作培养E.coli。含有SEQ ID NO:51所示序列的野生型eMutS基因的质粒pEcoMutS-CBM3-EGFP(Wen et al.,2014)由中国科学技术大学教授惠赠,目前保存在本发明人所在的实验室。LB培养基(5g/l酵母提取物,10g/l胰蛋白胨,5g/l Nacl)用于诱导表达培养基。Reagents and bacterial strains: All reagents in the present invention are commercially purchased reagent grade reagents. Among them, Tryptone, Yeast extract, NaCl, plasmid extraction kit, gel recovery kit and all restriction endonucleases are from Shanghai Sangon Bioengineering Company. PrimeSTAR HS DNA polymerase, Solution I ligase and pMD18-T vector were purchased from Dalian Bao Biological Company. Escherichia coli XL10-gold strain was used as the host bacteria (Stratagene Company, California, USA) used for DNA manipulation, and Luria-Bertani (LB) medium containing 100 μg/ml ampicillin was used for culturing E. coli. The plasmid pEcoMutS-CBM3-EGFP (Wen et al., 2014) containing the wild-type eMutS gene sequence shown in SEQ ID NO: 51 was donated by a professor at the University of Science and Technology of China, and is currently kept in the laboratory of the inventor. LB medium (5g/l yeast extract, 10g/l tryptone, 5g/l Nacl) was used to induce expression medium.
实施例1菌株的制备:The preparation of
1.构建本发明中的各种质粒载体:1. Construct various plasmid vectors among the present invention:
1).提取pEcoMutS-CBM3-EGFP(中国科学技术大学教授惠赠)的质粒,并将提取的pEcoMutS-CBM3-EGFP质粒作为模板,使用KOD Plus Neo DNA聚合酶(TOYOBO),引物E38C-G70C-F1(SEQ ID No.01)和E38C-G70C-R1(SEQ ID No.02)引物进行PCR,扩增产物利用DpnI酶切处理后获得一个位点突变的质粒pZJQIBEBT001a。然后以质粒pZJQIBEBT001a为模版,用使用KOD Plus Neo DNA聚合酶(TOYOBO),引物E38C-G70C-F2(SEQ ID No.03)和E38C-G70C-R2(SEQ ID No.04)进行PCR扩增出pEcoMutS-E38C-G70C-CBM3-EGFP全质粒,用Dpn I酶切处理后获得二个位点突变的质粒,命名为pZJQIBEBT001。1). Extract the plasmid of pEcoMutS-CBM3-EGFP (a gift from the professor of University of Science and Technology of China), and use the extracted pEcoMutS-CBM3-EGFP plasmid as a template, using KOD Plus Neo DNA polymerase (TOYOBO), primer E38C-G70C-F1 (SEQ ID No.01) and E38C-G70C-R1 (SEQ ID No.02) primers were used for PCR, and the amplified product was digested with DpnI to obtain a plasmid pZJQIBEBT001a with a site mutation. Then using the plasmid pZJQIBEBT001a as a template, use KOD Plus Neo DNA polymerase (TOYOBO), primers E38C-G70C-F2 (SEQ ID No.03) and E38C-G70C-R2 (SEQ ID No.04) to carry out PCR amplification The pEcoMutS-E38C-G70C-CBM3-EGFP complete plasmid was digested with Dpn I to obtain a plasmid with two mutations, which was named pZJQIBEBT001.
具体操作步骤为:The specific operation steps are:
首先,提取质粒,其提取步骤为:First, extract the plasmid, the extraction steps are:
a.将2ml含相应抗生素的LB培养基加入到容量为15ml的试管中,接种一单菌落,用棉塞封口,在37℃剧烈震荡(250rpm)培养过夜。a. Add 2ml of LB medium containing corresponding antibiotics into a test tube with a capacity of 15ml, inoculate a single colony, seal it with a cotton plug, and culture overnight at 37°C with vigorous shaking (250rpm).
b.将1.5ml培养物转入离心管中,用台式离心机在4℃条件下离心30秒(12,000×g)。b. Transfer 1.5ml of the culture into a centrifuge tube and centrifuge at 4°C for 30 seconds (12,000×g) in a tabletop centrifuge.
c.弃上清液,用滤纸吸干离心管中的液体培养基,将细菌沉淀物悬浮于200m lSTET缓冲液,用涡旋混合器充分混匀。c. Discard the supernatant, blot the liquid medium in the centrifuge tube with filter paper, suspend the bacterial sediment in 200 ml STET buffer, and mix well with a vortex mixer.
d.加入4m l新鲜配制的溶菌酶液,混匀,置室温下5min。d. Add 4ml of freshly prepared lysozyme solution, mix well, and place at room temperature for 5 minutes.
e.用漂子架住离心管,置于沸水浴中,精确记时45秒,取出后立刻离心10min。e. Hold the centrifuge tube with a float, place it in a boiling water bath, accurately time it for 45 seconds, and centrifuge it for 10 minutes immediately after taking it out.
f.用无菌牙签挑取离心管中的沉淀物弃去,离心管的上清液中加入8m l 5%CTAB,用混合器混匀后,高速离心5min,弃上清液,用滤纸吸干离心管中的液体。f. Use a sterile toothpick to pick up the sediment in the centrifuge tube and discard it. Add 8ml 5% CTAB to the supernatant of the centrifuge tube. After mixing with a mixer, centrifuge at high speed for 5min. Discard the supernatant and absorb it with filter paper. Dry the liquid in the centrifuge tube.
g.加入1.2M氯化钠300m l,充分溶解沉淀物,再加入750m l的冷乙醇,充分混匀后,离心15min,弃上清液。g. Add 300ml of 1.2M sodium chloride to fully dissolve the precipitate, then add 750ml of cold ethanol, mix thoroughly, centrifuge for 15min, and discard the supernatant.
h.取1ml 70%冷乙醇,缓慢淋洗离心管内壁,离心5min。弃上清液,用滤纸吸干管壁上的液体,置室温中使核酸沉淀自然干燥5-10min。h. Take 1ml of 70% cold ethanol, slowly rinse the inner wall of the centrifuge tube, and centrifuge for 5min. Discard the supernatant, blot the liquid on the tube wall with filter paper, and let the nucleic acid precipitate dry naturally at room temperature for 5-10 minutes.
i.沉淀物溶于50m l TE缓冲液中,混合器混匀。即成质粒制备液,制备的质粒供下一步实验使用。i. Dissolve the precipitate in 50ml TE buffer solution and mix with a mixer. The plasmid preparation solution is ready, and the prepared plasmid is used for the next experiment.
突变体质粒获取具体操作步骤为:The specific steps for obtaining mutant plasmids are as follows:
(1)以pEcoMutS-CBM3-EGFP质粒为模板扩增pEcoMutS-E38C-CBM3-EGFP全质粒的PCR体系:(1) PCR system for amplifying pEcoMutS-E38C-CBM3-EGFP full plasmid using pEcoMutS-CBM3-EGFP plasmid as a template:
PCR程序PCR program
(2)将扩增产物pEcoMutS-E38C-CBM3-EGFP全质粒利用Dpn I进行酶切,回收产物即可获得第一个点突变的质粒pZJQIBEBT001a。(2) The entire plasmid of the amplified product pEcoMutS-E38C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the first point-mutated plasmid pZJQIBEBT001a.
①pEcoMutS-E38C-CBM3-EGFP全质粒的酶切体系:①Enzyme digestion system of pEcoMutS-E38C-CBM3-EGFP whole plasmid:
②将获得的进行了定点突变的质粒,命名为pZJQIBEBT001a(图1)。② The obtained plasmid subjected to site-directed mutation was named pZJQIBEBT001a (Fig. 1).
(3)以pZJQIBEBT001a质粒为模板扩增pEcoMutS-E38C-G70C-CBM3-EGFP全质粒的PCR体系:(3) A PCR system for amplifying the pEcoMutS-E38C-G70C-CBM3-EGFP full plasmid using the pZJQIBEBT001a plasmid as a template:
PCR程序PCR program
(4)将扩增产物pEcoMutS-E38C-G70C-CBM3-EGFP全质粒利用Dpn I进行酶切,回收产物即可获得第二个点突变的质粒pZJQIBEBT001。(4) The entire plasmid of the amplified product pEcoMutS-E38C-G70C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the second point-mutated plasmid pZJQIBEBT001.
①pEcoMutS-E38C-G70C-CBM3-EGFP全质粒的酶切体系:① Enzyme digestion system of pEcoMutS-E38C-G70C-CBM3-EGFP whole plasmid:
②将获得的进行了第二个位点定点突变的质粒,命名为pZJQIBEBT001(图1)。② The obtained plasmid subjected to site-directed mutation at the second site was named pZJQIBEBT001 ( FIG. 1 ).
2).以提取的pEcoMutS-CBM3-EGFP质粒作为模板,使用KOD Plus Neo DNA聚合酶(TOYOBO),引物S120C-S151C-F1(SEQ ID No.05)和S120C-S151C-R1(SEQ ID No.06)进行PCR,扩增产物利用Dpn I酶切处理后获得一个位点突变的质粒pZJQIBEBT002a。然后以质粒pZJQIBEBT002a为模版,用使用KOD Plus Neo DNA聚合酶(TOYOBO),引物S120C-S151C-F2(SEQ ID No.07)和S120C-S151C-R2(SEQ ID No.08)进行PCR扩增出pEcoMutS-S120C-S151C-CBM3-EGFP全质粒,用Dpn I酶切处理后获得二个位点突变的质粒,命名为pZJQIBEBT002。2). Taking the extracted pEcoMutS-CBM3-EGFP plasmid as a template, using KOD Plus Neo DNA polymerase (TOYOBO), primers S120C-S151C-F1 (SEQ ID No.05) and S120C-S151C-R1 (SEQ ID No. 06) PCR was performed, and the amplified product was digested with Dpn I to obtain a site-mutated plasmid pZJQIBEBT002a. Then using the plasmid pZJQIBEBT002a as a template, use KOD Plus Neo DNA polymerase (TOYOBO), primers S120C-S151C-F2 (SEQ ID No.07) and S120C-S151C-R2 (SEQ ID No.08) to carry out PCR amplification The pEcoMutS-S120C-S151C-CBM3-EGFP complete plasmid was digested with Dpn I to obtain a plasmid with two mutations, which was named pZJQIBEBT002.
突变体质粒获取具体操作步骤为:The specific steps for obtaining mutant plasmids are as follows:
(1)以pEcoMutS-CBM3-EGFP质粒为模板扩增pEcoMutS-S120C-CBM3-EGFP全质粒的PCR体系:(1) PCR system for amplifying pEcoMutS-S120C-CBM3-EGFP full plasmid using pEcoMutS-CBM3-EGFP plasmid as a template:
PCR程序PCR program
(2)将扩增产物pEcoMutS-S120C-CBM3-EGFP全质粒利用Dpn I进行酶切,回收产物即可获得第一个点突变的质粒pZJQIBEBT002a。(2) The entire plasmid of the amplified product pEcoMutS-S120C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the first point-mutated plasmid pZJQIBEBT002a.
①pEcoMutS-S120C-CBM3-EGFP全质粒的酶切体系:①Enzyme digestion system of pEcoMutS-S120C-CBM3-EGFP whole plasmid:
②将获得的进行了定点突变的质粒,命名为pZJQIBEBT002a(图1)。② The obtained plasmid subjected to site-directed mutation was named pZJQIBEBT002a ( FIG. 1 ).
(3)以pZJQIBEBT002a质粒为模板扩增pEcoMutS-S120C-S151C-CBM3-EGFP全质粒的PCR体系:(3) A PCR system for amplifying the pEcoMutS-S120C-S151C-CBM3-EGFP full plasmid using the pZJQIBEBT002a plasmid as a template:
PCR程序PCR program
(4)将扩增产物pEcoMutS-S120C-S151C-CBM3-EGFP全质粒利用Dpn I进行酶切,回收产物即可获得第二个点突变的质粒pZJQIBEBT002。(4) Digest the entire plasmid of the amplified product pEcoMutS-S120C-S151C-CBM3-EGFP with Dpn I, and recover the product to obtain the second point-mutated plasmid pZJQIBEBT002.
①pEcoMutS-S120C-S151C-CBM3-EGFP全质粒的酶切体系:① Enzyme digestion system of pEcoMutS-S120C-S151C-CBM3-EGFP whole plasmid:
②将获得的进行了第二个位点定点突变的质粒,命名为pZJQIBEBT002(图1)。② The obtained plasmid subjected to the second site-directed mutation was named pZJQIBEBT002 ( FIG. 1 ).
3).以提取的pEcoMutS-CBM3-EGFP质粒作为模板,使用KOD Plus Neo DNA聚合酶(TOYOBO),引物L157C-G233C-F1(SEQ ID No.09)和L157C-G233C-R1(SEQ ID No.10)进行PCR,扩增产物利用Dpn I酶切处理后获得一个位点突变的质粒pZJQIBEBT003a。然后以质粒pZJQIBEBT003a为模版,用使用KOD Plus Neo DNA聚合酶(TOYOBO),引物L157C-G233C-F2(SEQ ID No.11)和L157C-G233C-R2(SEQ ID No.12)进行PCR扩增出pEcoMutS-L157C-G233C-CBM3-EGFP全质粒,用Dpn I酶切处理后获得二个位点突变的质粒,命名为pZJQIBEBT003。3). Taking the extracted pEcoMutS-CBM3-EGFP plasmid as a template, using KOD Plus Neo DNA polymerase (TOYOBO), primers L157C-G233C-F1 (SEQ ID No.09) and L157C-G233C-R1 (SEQ ID No. 10) PCR was performed, and the amplified product was digested with Dpn I to obtain a site-mutated plasmid pZJQIBEBT003a. Then using the plasmid pZJQIBEBT003a as a template, use KOD Plus Neo DNA polymerase (TOYOBO), primers L157C-G233C-F2 (SEQ ID No.11) and L157C-G233C-R2 (SEQ ID No.12) to carry out PCR amplification The pEcoMutS-L157C-G233C-CBM3-EGFP complete plasmid was digested with Dpn I to obtain a plasmid with two mutations, which was named pZJQIBEBT003.
突变体质粒获取具体操作步骤为:The specific steps for obtaining mutant plasmids are as follows:
(1)以pEcoMutS-CBM3-EGFP质粒为模板扩增pEcoMutS-L157C-CBM3-EGFP全质粒的PCR体系:(1) The PCR system for amplifying the pEcoMutS-L157C-CBM3-EGFP full plasmid using the pEcoMutS-CBM3-EGFP plasmid as a template:
PCR程序PCR program
(2)将扩增产物pEcoMutS-L157C-CBM3-EGFP全质粒利用Dpn I进行酶切,回收产物即可获得第一个点突变的质粒pZJQIBEBT003a。(2) The entire plasmid of the amplified product pEcoMutS-L157C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the first point-mutated plasmid pZJQIBEBT003a.
①pEcoMutS-L157C-CBM3-EGFP全质粒的酶切体系:①Enzyme digestion system of pEcoMutS-L157C-CBM3-EGFP whole plasmid:
②将获得的进行了定点突变的质粒,命名为pZJQIBEBT003a(图1)。② The obtained plasmid subjected to site-directed mutation was named pZJQIBEBT003a (Fig. 1).
(3)以pZJQIBEBT003a质粒为模板扩增全质粒pEcoMutS-L157C-G233C-CBM3-EGFP的PCR体系:(3) The PCR system for amplifying the whole plasmid pEcoMutS-L157C-G233C-CBM3-EGFP using the pZJQIBEBT003a plasmid as a template:
PCR程序PCR program
(4)将扩增产物pEcoMutS-L157C-G233C-CBM3-EGFP全质粒利用Dpn I进行酶切,回收产物即可获得第二个点突变的质粒pZJQIBEBT003。(4) The entire plasmid of the amplified product pEcoMutS-L157C-G233C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the second point-mutated plasmid pZJQIBEBT003.
①pEcoMutS-L157C-G233C-CBM3-EGFP全质粒的酶切体系:① Enzyme digestion system of pEcoMutS-L157C-G233C-CBM3-EGFP whole plasmid:
②将获得的进行了第二个位点定点突变的质粒,命名为pZJQIBEBT003(图1)。② The obtained plasmid subjected to site-directed mutation at the second site was named pZJQIBEBT003 ( FIG. 1 ).
4).以提取的pEcoMutS-CBM3-EGFP质粒作为模板,使用KOD Plus Neo DNA聚合酶(TOYOBO),引物E177C-G195C-F1(SEQ ID No.13)和E177C-G195C-R1(SEQ ID No.14)进行PCR,扩增产物利用Dpn I酶切处理后获得一个位点突变的质粒pZJQIBEBT004a。然后以质粒pZJQIBEBT004a为模版,用使用KOD Plus Neo DNA聚合酶(TOYOBO),引物E177C-G195C-F2(SEQ ID No.15)和E177C-G195C-R2(SEQ ID No.16)进行PCR扩增出pEcoMutS-E177C-G195C-CBM3-EGFP全质粒,用Dpn I酶切处理后获得二个位点突变的质粒,命名为pZJQIBEBT004。4). Taking the extracted pEcoMutS-CBM3-EGFP plasmid as a template, using KOD Plus Neo DNA polymerase (TOYOBO), primers E177C-G195C-F1 (SEQ ID No.13) and E177C-G195C-R1 (SEQ ID No. 14) PCR was performed, and the amplified product was digested with Dpn I to obtain a plasmid pZJQIBEBT004a with a site mutation. Then using the plasmid pZJQIBEBT004a as a template, use KOD Plus Neo DNA polymerase (TOYOBO), primers E177C-G195C-F2 (SEQ ID No.15) and E177C-G195C-R2 (SEQ ID No.16) to carry out PCR amplification The pEcoMutS-E177C-G195C-CBM3-EGFP complete plasmid was digested with Dpn I to obtain a plasmid with two mutations, which was named pZJQIBEBT004.
突变体质粒获取具体操作步骤为:The specific steps for obtaining mutant plasmids are as follows:
(1)以pEcoMutS-CBM3-EGFP质粒为模板扩增全质粒pEcoMutS-E177C-CBM3-EGFP的PCR体系:(1) The PCR system for amplifying the whole plasmid pEcoMutS-E177C-CBM3-EGFP using the pEcoMutS-CBM3-EGFP plasmid as a template:
PCR程序PCR program
(2)将扩增产物pEcoMutS-E177C-CBM3-EGFP全质粒利用Dpn I进行酶切,回收产物即可获得第一个点突变的质粒pZJQIBEBT004a。(2) The entire plasmid of the amplified product pEcoMutS-E177C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the first point-mutated plasmid pZJQIBEBT004a.
①pEcoMutS-E177C-CBM3-EGFP全质粒的酶切体系:①Enzyme digestion system of pEcoMutS-E177C-CBM3-EGFP whole plasmid:
②将获得的进行了定点突变的质粒,命名为pZJQIBEBT004a(图1)。② The obtained plasmid subjected to site-directed mutation was named pZJQIBEBT004a (Fig. 1).
(3)以pZJQIBEBT004a质粒为模板扩增全质粒pEcoMutS-E177C-G195C-CBM3-EGFP的PCR体系:(3) A PCR system for amplifying the entire plasmid pEcoMutS-E177C-G195C-CBM3-EGFP using the pZJQIBEBT004a plasmid as a template:
PCR程序PCR program
(4)将扩增产物pEcoMutS-E177C-G195C-CBM3-EGFP全质粒利用Dpn I进行酶切,回收产物即可获得第二个点突变的质粒pZJQIBEBT004。(4) The entire plasmid of the amplified product pEcoMutS-E177C-G195C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the second point-mutated plasmid pZJQIBEBT004.
①pEcoMutS-E177C-G195C-CBM3-EGFP全质粒的酶切体系:① Enzyme digestion system of pEcoMutS-E177C-G195C-CBM3-EGFP whole plasmid:
②将获得的进行了第二个位点定点突变的质粒,命名为pZJQIBEBT004(图1)。② The obtained plasmid subjected to site-directed mutation at the second site was named pZJQIBEBT004 ( FIG. 1 ).
5).以提取的pEcoMutS-CBM3-EGFP质粒作为模板,使用KOD Plus Neo DNA聚合酶(TOYOBO),引物E451C-V465C-F1(SEQ ID No.17)和E451C-V465C-R1(SEQ ID No.18)进行PCR,扩增产物利用Dpn I酶切处理后获得一个位点突变的质粒pZJQIBEBT005a。然后以质粒pZJQIBEBT005a为模版,用使用KOD Plus Neo DNA聚合酶(TOYOBO),引物E451C-V465C-F2(SEQ ID No.19)和E451C-V465C-R2(SEQ ID No.20)进行PCR扩增出pEcoMutS-E451C-V465C-CBM3-EGFP全质粒,用Dpn I酶切处理后获得二个位点突变的质粒,命名为pZJQIBEBT005。5). Taking the extracted pEcoMutS-CBM3-EGFP plasmid as a template, using KOD Plus Neo DNA polymerase (TOYOBO), primers E451C-V465C-F1 (SEQ ID No.17) and E451C-V465C-R1 (SEQ ID No. 18) PCR was performed, and the amplified product was digested with Dpn I to obtain a plasmid pZJQIBEBT005a with a site mutation. Then using the plasmid pZJQIBEBT005a as a template, use KOD Plus Neo DNA polymerase (TOYOBO), primers E451C-V465C-F2 (SEQ ID No.19) and E451C-V465C-R2 (SEQ ID No.20) to carry out PCR amplification The whole pEcoMutS-E451C-V465C-CBM3-EGFP plasmid was digested with Dpn I to obtain a plasmid with two mutations, which was named pZJQIBEBT005.
突变体质粒获取具体操作步骤为:The specific steps for obtaining mutant plasmids are as follows:
(1)以pEcoMutS-CBM3-EGFP质粒为模板扩增全质粒pEcoMutS-E451C-CBM3-EGFP的PCR体系:(1) The PCR system for amplifying the whole plasmid pEcoMutS-E451C-CBM3-EGFP using the pEcoMutS-CBM3-EGFP plasmid as a template:
PCR程序PCR program
(2)将扩增产物pEcoMutS-E451C-CBM3-EGFP全质粒利用Dpn I进行酶切,回收产物即可获得第一个点突变的质粒pZJQIBEBT005a。(2) The entire plasmid of the amplified product pEcoMutS-E451C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the first point-mutated plasmid pZJQIBEBT005a.
①pEcoMutS-E451C-CBM3-EGFP全质粒的酶切体系:①Enzyme digestion system of pEcoMutS-E451C-CBM3-EGFP whole plasmid:
②将获得的进行了定点突变的质粒,命名为pZJQIBEBT005a(图1)。② The obtained plasmid subjected to site-directed mutation was named pZJQIBEBT005a (Fig. 1).
(3)以pZJQIBEBT005a质粒为模板扩增全质粒pEcoMutS-E451C-V465C-CBM3-EGFP的PCR体系:(3) The PCR system for amplifying the whole plasmid pEcoMutS-E451C-V465C-CBM3-EGFP using the pZJQIBEBT005a plasmid as a template:
PCR程序PCR program
(4)将扩增产物pEcoMutS-E451C-V465C-CBM3-EGFP全质粒利用Dpn I进行酶切,回收产物即可获得第二个点突变的质粒pZJQIBEBT005。(4) Digest the entire plasmid of the amplified product pEcoMutS-E451C-V465C-CBM3-EGFP with Dpn I, and recover the product to obtain the second point-mutated plasmid pZJQIBEBT005.
①pEcoMutS-E451C-V465C-CBM3-EGFP全质粒的酶切体系:① Enzyme digestion system of pEcoMutS-E451C-V465C-CBM3-EGFP whole plasmid:
②将获得的进行了第二个位点定点突变的质粒,命名为pZJQIBEBT005(图1)。② The obtained plasmid subjected to site-directed mutation at the second site was named pZJQIBEBT005 ( FIG. 1 ).
6).以提取的pEcoMutS-CBM3-EGFP质粒作为模板,使用KOD Plus Neo DNA聚合酶(TOYOBO),引物Y474C-R500C-F1(SEQ ID No.21)和Y474C-R500C-R1(SEQ ID No.22)进行PCR,扩增产物利用Dpn I酶切处理后获得一个位点突变的质粒pZJQIBEBT006a。然后以质粒pZJQIBEBT006a为模版,用使用KOD Plus Neo DNA聚合酶(TOYOBO),引物Y474C-R500C-F2(SEQ ID No.23)和Y474C-R500C-R2(SEQ ID No.24)进行PCR扩增出pEcoMutS-Y474C-R500C-CBM3-EGFP全质粒,用Dpn I酶切处理后获得二个位点突变的质粒,命名为pZJQIBEBT006。6). Taking the extracted pEcoMutS-CBM3-EGFP plasmid as a template, using KOD Plus Neo DNA polymerase (TOYOBO), primers Y474C-R500C-F1 (SEQ ID No.21) and Y474C-R500C-R1 (SEQ ID No. 22) PCR was performed, and the amplified product was digested with Dpn I to obtain a plasmid pZJQIBEBT006a with a site mutation. Then using the plasmid pZJQIBEBT006a as a template, use KOD Plus Neo DNA polymerase (TOYOBO), primers Y474C-R500C-F2 (SEQ ID No.23) and Y474C-R500C-R2 (SEQ ID No.24) to carry out PCR amplification The pEcoMutS-Y474C-R500C-CBM3-EGFP complete plasmid was digested with Dpn I to obtain a plasmid with two mutations, which was named pZJQIBEBT006.
突变体质粒获取具体操作步骤为:The specific steps for obtaining mutant plasmids are as follows:
(1)以pEcoMutS-CBM3-EGFP质粒为模板扩增全质粒pEcoMutS-Y474C-CBM3-EGFP的PCR体系:(1) PCR system for amplifying the whole plasmid pEcoMutS-Y474C-CBM3-EGFP using the pEcoMutS-CBM3-EGFP plasmid as a template:
PCR程序PCR program
(2)将扩增产物pEcoMutS-Y474C-CBM3-EGFP全质粒利用Dpn I进行酶切,回收产物即可获得第一个点突变的质粒pZJQIBEBT006a。(2) The entire plasmid of the amplified product pEcoMutS-Y474C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the first point-mutated plasmid pZJQIBEBT006a.
①pEcoMutS-Y474C-CBM3-EGFP全质粒的酶切体系:①Enzyme digestion system of pEcoMutS-Y474C-CBM3-EGFP whole plasmid:
②将获得的进行了定点突变的质粒,命名为pZJQIBEBT006a(图1)。② The obtained plasmid subjected to site-directed mutation was named pZJQIBEBT006a ( FIG. 1 ).
(3)以pZJQIBEBT006a质粒为模板扩增全质粒pEcoMutS-Y474C-R500C-CBM3-EGFP的PCR体系:(3) PCR system for amplifying the whole plasmid pEcoMutS-Y474C-R500C-CBM3-EGFP using the pZJQIBEBT006a plasmid as a template:
PCR程序PCR program
(4)将扩增产物pEcoMutS-Y474C-R500C-CBM3-EGFP全质粒利用Dpn I进行酶切,回收产物即可获得第二个点突变的质粒pZJQIBEBT006。(4) Digest the entire plasmid of the amplified product pEcoMutS-Y474C-R500C-CBM3-EGFP with Dpn I, and recover the product to obtain the second point-mutated plasmid pZJQIBEBT006.
①pEcoMutS-Y474C-R500C-CBM3-EGFP全质粒的酶切体系:① Enzyme digestion system of pEcoMutS-Y474C-R500C-CBM3-EGFP whole plasmid:
②将获得的进行了第二个位点定点突变的质粒,命名为pZJQIBEBT006(图1)。② The obtained plasmid subjected to the second site-directed mutation was named pZJQIBEBT006 ( FIG. 1 ).
7).以提取的pEcoMutS-CBM3-EGFP质粒作为模板,使用KOD Plus Neo DNA聚合酶(TOYOBO),引物T581C-K644C-F1(SEQ ID No.25)和T581C-K644C-R1(SEQ ID No.26)进行PCR,扩增产物利用Dpn I酶切处理后获得一个位点突变的质粒pZJQIBEBT007a。然后以质粒pZJQIBEBT007a为模版,用使用KOD Plus Neo DNA聚合酶(TOYOBO),引物T581C-K644C-F2(SEQ ID No.27)和T581C-K644C-R2(SEQ ID No.28)进行PCR扩增出pEcoMutS-T581C-K644C-CBM3-EGFP全质粒,用Dpn I酶切处理后获得二个位点突变的质粒,命名为pZJQIBEBT007。7). Taking the extracted pEcoMutS-CBM3-EGFP plasmid as a template, using KOD Plus Neo DNA polymerase (TOYOBO), primers T581C-K644C-F1 (SEQ ID No.25) and T581C-K644C-R1 (SEQ ID No. 26) PCR was performed, and the amplified product was digested with Dpn I to obtain a plasmid pZJQIBEBT007a with a site mutation. Then using the plasmid pZJQIBEBT007a as a template, use KOD Plus Neo DNA polymerase (TOYOBO), primers T581C-K644C-F2 (SEQ ID No.27) and T581C-K644C-R2 (SEQ ID No.28) to carry out PCR amplification The whole pEcoMutS-T581C-K644C-CBM3-EGFP plasmid was digested with Dpn I to obtain a plasmid with two mutations, which was named pZJQIBEBT007.
突变体质粒获取具体操作步骤为:The specific steps for obtaining mutant plasmids are as follows:
(1)以pEcoMutS-CBM3-EGFP质粒为模板扩增全质粒pEcoMutS-T581C-CBM3-EGFP的PCR体系:(1) The PCR system for amplifying the whole plasmid pEcoMutS-T581C-CBM3-EGFP using the pEcoMutS-CBM3-EGFP plasmid as a template:
PCR程序PCR program
(2)将扩增产物pEcoMutS-T581C-CBM3-EGFP全质粒利用Dpn I进行酶切,回收产物即可获得第一个点突变的质粒pZJQIBEBT007a。(2) Digest the entire plasmid of the amplified product pEcoMutS-T581C-CBM3-EGFP with Dpn I, and recover the product to obtain the first point-mutated plasmid pZJQIBEBT007a.
①pEcoMutS-T581C-CBM3-EGFP全质粒的酶切体系:①Enzyme digestion system of pEcoMutS-T581C-CBM3-EGFP whole plasmid:
②将获得的进行了定点突变的质粒,命名为pZJQIBEBT007a(图1)。② The obtained plasmid subjected to site-directed mutation was named pZJQIBEBT007a (Fig. 1).
(3)以pZJQIBEBT007a质粒为模板扩增全质粒pEcoMutS-T581C-K644C-CBM3-EGFP的PCR体系:(3) PCR system for amplifying the whole plasmid pEcoMutS-T581C-K644C-CBM3-EGFP using the pZJQIBEBT007a plasmid as a template:
PCR程序PCR program
(4)将扩增产物pEcoMutS-T581C-K644C-CBM3-EGFP全质粒利用Dpn I进行酶切,回收产物即可获得第二个点突变的质粒pZJQIBEBT007。(4) The entire plasmid of the amplified product pEcoMutS-T581C-K644C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the second point-mutated plasmid pZJQIBEBT007.
①pEcoMutS-T581C-K644C-CBM3-EGFP全质粒的酶切体系:① Enzyme digestion system of pEcoMutS-T581C-K644C-CBM3-EGFP whole plasmid:
②将获得的进行了第二个位点定点突变的质粒,命名为pZJQIBEBT007(图1)。② The obtained plasmid subjected to site-directed mutation at the second site was named pZJQIBEBT007 ( FIG. 1 ).
8).以提取的pEcoMutS-CBM3-EGFP质粒作为模板,使用KOD Plus Neo DNA聚合酶(TOYOBO),引物H585C-Q626C-F1(SEQ ID No.29)和H585C-Q626C-R1(SEQ ID No.30)进行PCR,扩增产物利用Dpn I酶切处理后获得一个位点突变的质粒pZJQIBEBT008a。然后以质粒pZJQIBEBT008a为模版,用使用KOD Plus Neo DNA聚合酶(TOYOBO),引物H585C-Q626C-F2(SEQ ID No.31)和H585C-Q626C-R2(SEQ ID No.32)进行PCR扩增出pEcoMutS-H585C-Q626C-CBM3-EGFP全质粒,用Dpn I酶切处理后获得二个位点突变的质粒,命名为pZJQIBEBT008。8). Taking the extracted pEcoMutS-CBM3-EGFP plasmid as a template, using KOD Plus Neo DNA polymerase (TOYOBO), primers H585C-Q626C-F1 (SEQ ID No.29) and H585C-Q626C-R1 (SEQ ID No. 30) PCR was performed, and the amplified product was digested with Dpn I to obtain a plasmid pZJQIBEBT008a with a site mutation. Then using the plasmid pZJQIBEBT008a as a template, use KOD Plus Neo DNA polymerase (TOYOBO), primers H585C-Q626C-F2 (SEQ ID No.31) and H585C-Q626C-R2 (SEQ ID No.32) to carry out PCR amplification The pEcoMutS-H585C-Q626C-CBM3-EGFP complete plasmid was digested with Dpn I to obtain a plasmid with two mutations, which was named pZJQIBEBT008.
突变体质粒获取具体操作步骤为:The specific steps for obtaining mutant plasmids are as follows:
(1)以pEcoMutS-CBM3-EGFP质粒为模板扩增全质粒pEcoMutS-H585C-CBM3-EGFP的PCR体系:(1) PCR system for amplifying the whole plasmid pEcoMutS-H585C-CBM3-EGFP using the pEcoMutS-CBM3-EGFP plasmid as a template:
PCR程序PCR program
(2)将扩增产物pEcoMutS-H585C-CBM3-EGFP全质粒利用Dpn I进行酶切,回收产物即可获得第一个点突变的质粒pZJQIBEBT008a。(2) The entire plasmid of the amplified product pEcoMutS-H585C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the first point-mutated plasmid pZJQIBEBT008a.
①pEcoMutS-H585C-CBM3-EGFP全质粒的酶切体系:①Enzyme digestion system of pEcoMutS-H585C-CBM3-EGFP whole plasmid:
②将获得的进行了定点突变的质粒,命名为pZJQIBEBT008a(图1)。② The obtained plasmid subjected to site-directed mutation was named pZJQIBEBT008a (Fig. 1).
(3)以pZJQIBEBT008a质粒为模板扩增全质粒pEcoMutS-H585C-Q626C-CBM3-EGFP的PCR体系:(3) The PCR system for amplifying the whole plasmid pEcoMutS-H585C-Q626C-CBM3-EGFP using the pZJQIBEBT008a plasmid as a template:
PCR程序PCR program
(4)将扩增产物pEcoMutS-H585C-Q626C-CBM3-EGFP全质粒利用Dpn I进行酶切,回收产物即可获得第二个点突变的质粒pZJQIBEBT008。(4) The entire plasmid of the amplified product pEcoMutS-H585C-Q626C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the second point-mutated plasmid pZJQIBEBT008.
①pEcoMutS-H585C-Q626C-CBM3-EGFP全质粒的酶切体系:① Enzyme digestion system of pEcoMutS-H585C-Q626C-CBM3-EGFP whole plasmid:
②将获得的进行了第二个位点定点突变的质粒,命名为pZJQIBEBT008(图1)。② The obtained plasmid subjected to the second site-directed mutation was named pZJQIBEBT008 ( FIG. 1 ).
9).以提取的pEcoMutS-CBM3-EGFP质粒作为模板,使用KOD Plus Neo DNA聚合酶(TOYOBO),引物I597C-H760C-F1(SEQ ID No.33)和I597C-H760C-R1(SEQ ID No.34)进行PCR,扩增产物利用Dpn I酶切处理后获得一个位点突变的质粒pZJQIBEBT009a。然后以质粒pZJQIBEBT009a为模版,用使用KOD Plus Neo DNA聚合酶(TOYOBO),引物I597C-H760C-F2(SEQ ID No.35)和I597C-H760C-R2(SEQ ID No.36)进行PCR扩增出pEcoMutS-I597C-H760C-CBM3-EGFP全质粒,用Dpn I酶切处理后获得二个位点突变的质粒,命名为pZJQIBEBT009。9). Taking the extracted pEcoMutS-CBM3-EGFP plasmid as a template, using KOD Plus Neo DNA polymerase (TOYOBO), primers I597C-H760C-F1 (SEQ ID No.33) and I597C-H760C-R1 (SEQ ID No. 34) PCR was performed, and the amplified product was digested with Dpn I to obtain a plasmid pZJQIBEBT009a with a site mutation. Then using the plasmid pZJQIBEBT009a as a template, use KOD Plus Neo DNA polymerase (TOYOBO), primers I597C-H760C-F2 (SEQ ID No.35) and I597C-H760C-R2 (SEQ ID No.36) to carry out PCR amplification The pEcoMutS-I597C-H760C-CBM3-EGFP complete plasmid was digested with Dpn I to obtain a plasmid with two mutations, named pZJQIBEBT009.
突变体质粒获取具体操作步骤为:The specific steps for obtaining mutant plasmids are as follows:
(1)以pEcoMutS-CBM3-EGFP质粒为模板扩增全质粒pEcoMutS-I597C-CBM3-EGFP的PCR体系:(1) The PCR system for amplifying the whole plasmid pEcoMutS-I597C-CBM3-EGFP using the pEcoMutS-CBM3-EGFP plasmid as a template:
PCR程序PCR program
(2)将扩增产物pEcoMutS-I597C-CBM3-EGFP全质粒利用Dpn I进行酶切,回收产物即可获得第一个点突变的质粒pZJQIBEBT009a。(2) The entire plasmid of the amplified product pEcoMutS-I597C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the first point-mutated plasmid pZJQIBEBT009a.
①pEcoMutS-I597C-CBM3-EGFP全质粒的酶切体系:①Enzyme digestion system of pEcoMutS-I597C-CBM3-EGFP whole plasmid:
②将获得的进行了定点突变的质粒,命名为pZJQIBEBT009a(图1)。② The obtained plasmid subjected to site-directed mutation was named pZJQIBEBT009a ( FIG. 1 ).
(3)以pZJQIBEBT009a质粒为模板扩增全质粒pEcoMutS-I597C-H760C-CBM3-EGFP的PCR体系:(3) The PCR system for amplifying the whole plasmid pEcoMutS-I597C-H760C-CBM3-EGFP using the pZJQIBEBT009a plasmid as a template:
PCR程序PCR program
(4)将扩增产物pEcoMutS-I597C-H760C-CBM3-EGFP全质粒利用Dpn I进行酶切,回收产物即可获得第二个点突变的质粒pZJQIBEBT009。(4) The entire plasmid of the amplified product pEcoMutS-I597C-H760C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the second point-mutated plasmid pZJQIBEBT009.
①pEcoMutS-I597C-H760C-CBM3-EGFP全质粒的酶切体系:①Enzyme digestion system of pEcoMutS-I597C-H760C-CBM3-EGFP whole plasmid:
②将获得的进行了第二个位点定点突变的质粒,命名为pZJQIBEBT009(图1)。② The obtained plasmid subjected to the second site-directed mutation was named pZJQIBEBT009 ( FIG. 1 ).
10).以提取的pEcoMutS-CBM3-EGFP质粒作为模板,使用KOD Plus Neo DNA聚合酶(TOYOBO),引物M609C-T723C-F1(SEQ ID No.37)和M609C-T723C-R1(SEQ ID No.38)进行PCR,扩增产物利用Dpn I酶切处理后获得一个位点突变的质粒pZJQIBEBT010a。然后以质粒pZJQIBEBT010a为模版,用使用KOD Plus Neo DNA聚合酶(TOYOBO),引物M609C-T723C-F2(SEQ ID No.39)和M609C-T723C-R2(SEQ ID No.40)进行PCR扩增出pEcoMutS-M609C-T723C-CBM3-EGFP全质粒,用Dpn I酶切处理后获得二个位点突变的质粒,命名为pZJQIBEBT010。10). Taking the extracted pEcoMutS-CBM3-EGFP plasmid as a template, using KOD Plus Neo DNA polymerase (TOYOBO), primers M609C-T723C-F1 (SEQ ID No.37) and M609C-T723C-R1 (SEQ ID No. 38) PCR was performed, and the amplified product was digested with Dpn I to obtain a plasmid pZJQIBEBT010a with a site mutation. Then, using the plasmid pZJQIBEBT010a as a template, use KOD Plus Neo DNA polymerase (TOYOBO), primers M609C-T723C-F2 (SEQ ID No.39) and M609C-T723C-R2 (SEQ ID No.40) to carry out PCR amplification The pEcoMutS-M609C-T723C-CBM3-EGFP complete plasmid was digested with Dpn I to obtain a plasmid with two mutations, named pZJQIBEBT010.
突变体质粒获取具体操作步骤为:The specific steps for obtaining mutant plasmids are as follows:
(1)以pEcoMutS-CBM3-EGFP质粒为模板扩增全质粒pEcoMutS-M609C-CBM3-EGFP的PCR体系:(1) The PCR system for amplifying the whole plasmid pEcoMutS-M609C-CBM3-EGFP using the pEcoMutS-CBM3-EGFP plasmid as a template:
PCR程序PCR program
(2)将扩增产物pEcoMutS-M609C-CBM3-EGFP全质粒利用Dpn I进行酶切,回收产物即可获得第一个点突变的质粒pZJQIBEBT010a。(2) The entire plasmid of the amplified product pEcoMutS-M609C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the first point-mutated plasmid pZJQIBEBT010a.
①pEcoMutS-M609C-CBM3-EGFP全质粒的酶切体系:①Enzyme digestion system of pEcoMutS-M609C-CBM3-EGFP whole plasmid:
②将获得的进行了定点突变的质粒,命名为pZJQIBEBT010a(图1)。② The obtained plasmid subjected to site-directed mutation was named pZJQIBEBT010a ( FIG. 1 ).
(3)以pZJQIBEBT010a质粒为模板扩增全质粒pEcoMutS-M609C-T723C-CBM3-EGFP的PCR体系:(3) The PCR system for amplifying the whole plasmid pEcoMutS-M609C-T723C-CBM3-EGFP using the pZJQIBEBT010a plasmid as a template:
PCR程序PCR program
(4)将扩增产物pEcoMutS-M609C-T723C-CBM3-EGFP全质粒利用Dpn I进行酶切,回收产物即可获得第二个点突变的质粒pZJQIBEBT010。(4) The entire plasmid of the amplified product pEcoMutS-M609C-T723C-CBM3-EGFP was digested with Dpn I, and the product was recovered to obtain the second point-mutated plasmid pZJQIBEBT010.
①pEcoMutS-M609C-T723C-CBM3-EGFP全质粒的酶切体系:① Enzyme digestion system of pEcoMutS-M609C-T723C-CBM3-EGFP whole plasmid:
②将获得的进行了第二个位点定点突变的质粒,命名为pZJQIBEBT010(图1)。② The obtained plasmid subjected to the second site-directed mutation was named pZJQIBEBT010 ( FIG. 1 ).
10).分别以提取的pEcoMutS-CBM3-EGFP质粒,pZJQIBEBT002质粒,pZJQIBEBT003质粒,pZJQIBEBT006质粒,pZJQIBEBT010质粒作为模板,使用KOD Plus Neo DNA聚合酶(TOYOBO),引物eMutS-NheI-F(SEQ ID No.41)和eMutS-340-R(SEQ ID No.42),eMutS-310-F(SEQ ID No.43)和eMutS-S120C-460-R(SEQ ID No.44),eMutS-430-F(SEQ ID No.45)和eMutS-800-R(SEQ ID No.46),eMutS-770-F(SEQ ID No.47)和eMutS-1500-R(SEQ IDNo.48),eMutS-1470-F(SEQ ID No.49)和eMutS-SacI-R(SEQ ID No.50)进行PCR,获得产物eMutS-1-340,S120C-310-460,L157C-430-800,E451C-770-1500,M609C-1470-末尾。然后将扩增产物利用KOD Plus Neo DNA聚合酶(TOYOBO),引物eMutS-NheI-F(SEQ ID No.41)和eMutS-SacI-R(SEQ ID No.50)进行融合PCR,获得产物eMutS-S120C-L157C-E451C-M609C。利用NheI和SacI分别酶切片段eMutS-S120C-L157C-E451C-M609C和质粒pEcoMutS-CBM3-EGFP,连接,转化后获得八个位点突变的质粒,命名为pZJQIBEBT011(图1)。10). Taking the extracted pEcoMutS-CBM3-EGFP plasmid, pZJQIBEBT002 plasmid, pZJQIBEBT003 plasmid, pZJQIBEBT006 plasmid, and pZJQIBEBT010 plasmid as templates respectively, using KOD Plus Neo DNA polymerase (TOYOBO), primer eMutS-NheI-F (SEQ ID No. 41) and eMutS-340-R (SEQ ID No.42), eMutS-310-F (SEQ ID No.43) and eMutS-S120C-460-R (SEQ ID No.44), eMutS-430-F ( SEQ ID No.45) and eMutS-800-R (SEQ ID No.46), eMutS-770-F (SEQ ID No.47) and eMutS-1500-R (SEQ ID No.48), eMutS-1470-F (SEQ ID No.49) and eMutS-SacI-R (SEQ ID No.50) carry out PCR, obtain product eMutS-1-340, S120C-310-460, L157C-430-800, E451C-770-1500, M609C -1470- end. The amplified product is then utilized for fusion PCR with KOD Plus Neo DNA polymerase (TOYOBO), primers eMutS-NheI-F (SEQ ID No.41) and eMutS-SacI-R (SEQ ID No.50), to obtain the product eMutS- S120C-L157C-E451C-M609C. The fragment eMutS-S120C-L157C-E451C-M609C and the plasmid pEcoMutS-CBM3-EGFP were respectively digested with NheI and SacI, ligated, and a plasmid with eight mutations was obtained after transformation, which was named pZJQIBEBT011 (Figure 1).
突变体质粒获取具体操作步骤为:The specific steps for obtaining mutant plasmids are as follows:
(1)以pEcoMutS-CBM3-EGFP质粒为模板,eMutS-NheI-F(SEQ ID No.41)和eMutS-340-R(SEQ ID No.42)为引物,扩增eMutS-1-340片段的PCR体系:(1) Using the pEcoMutS-CBM3-EGFP plasmid as a template, eMutS-NheI-F (SEQ ID No.41) and eMutS-340-R (SEQ ID No.42) as primers, amplify the eMutS-1-340 fragment PCR system:
PCR程序PCR program
(2)以pZJQIBEBT002质粒为模板,eMutS-310-F(SEQ ID No.43)和eMutS-S120C-460-R(SEQ ID No.44)为引物,扩增eMutS-1-340片段的PCR体系:(2) Using the pZJQIBEBT002 plasmid as a template, eMutS-310-F (SEQ ID No.43) and eMutS-S120C-460-R (SEQ ID No.44) as primers, a PCR system for amplifying the eMutS-1-340 fragment :
PCR程序PCR program
(3)以pZJQIBEBT003质粒为模板,eMutS-430-F(SEQ ID No.45)和eMutS-800-R(SEQ ID No.46)为引物,扩增L157C-430-800片段的PCR体系:(3) Using the pZJQIBEBT003 plasmid as a template, eMutS-430-F (SEQ ID No.45) and eMutS-800-R (SEQ ID No.46) as primers, a PCR system for amplifying the L157C-430-800 fragment:
PCR程序PCR program
(4)以pZJQIBEBT006质粒为模板,eMutS-770-F(SEQ ID No.47)和eMutS-1500-R(SEQ ID No.48)为引物,扩增E451C-770-1500片段的PCR体系:(4) Using the pZJQIBEBT006 plasmid as a template, eMutS-770-F (SEQ ID No.47) and eMutS-1500-R (SEQ ID No.48) as primers, a PCR system for amplifying the E451C-770-1500 fragment:
PCR程序PCR program
(5)以pZJQIBEBT010质粒为模板,eMutS-1470-F(SEQ ID No.49)和eMutS-SacI-R(SEQ ID No.50)为引物,扩增M609C-1470-末尾片段的PCR体系:(5) Using the pZJQIBEBT010 plasmid as a template, eMutS-1470-F (SEQ ID No.49) and eMutS-SacI-R (SEQ ID No.50) as primers, the PCR system for amplifying the M609C-1470-end fragment:
PCR程序PCR program
(6)然后将扩增产物利用KOD Plus Neo DNA聚合酶(TOYOBO),引物eMutS-NheI-F(SEQ ID No.41)和eMutS-SacI-R(SEQ ID No.50)进行融合PCR,获得产物eMutS-S120C-L157C-E451C-M609C的PCR体系:(6) Then the amplified product was carried out fusion PCR using KOD Plus Neo DNA polymerase (TOYOBO), primers eMutS-NheI-F (SEQ ID No.41) and eMutS-SacI-R (SEQ ID No.50), to obtain The PCR system of the product eMutS-S120C-L157C-E451C-M609C:
PCR程序PCR program
(7)利用NheI和SacI分别酶切片段eMutS-S120C-L157C-E451C-M609C和质粒pEcoMutS-CBM3-EGFP,连接,转化后获得八个位点突变的质粒,命名为pZJQIBEBT011(图1)。(7) The fragment eMutS-S120C-L157C-E451C-M609C and the plasmid pEcoMutS-CBM3-EGFP were respectively digested with NheI and SacI, ligated, and a plasmid with eight mutations was obtained after transformation, which was named pZJQIBEBT011 (Figure 1).
①片段eMutS-S120C-L157C-E451C-M609C的酶切体系:① Enzyme digestion system for fragment eMutS-S120C-L157C-E451C-M609C:
②质粒pEcoMutS-CBM3-EGFP的酶切体系:② Enzyme digestion system of plasmid pEcoMutS-CBM3-EGFP:
③将酶切产物利用Solution I(TAKATA)进行连接,将获得的含有8个位点定点突变的质粒,命名为pZJQIBEBT011(图1)。③ The digested products were ligated using Solution I (TAKATA), and the obtained plasmid containing 8 site-directed mutations was named pZJQIBEBT011 (Figure 1).
2.将构建的载体转入表达载体大肠杆菌BL21中:2. Transform the constructed vector into the expression vector Escherichia coli BL21:
分别将进行定点突变了的10个质粒转化到宿主菌株大肠杆菌BL21的制备The preparation of the host strain Escherichia coli BL21 by transforming the 10 plasmids that have undergone site-directed mutation
1)大肠杆菌化学转化步骤:1) Escherichia coli chemical transformation steps:
①.从-70℃超低温冰柜中取出一管(100ul)感受态菌,立即用手指加温融化后插入冰上,冰浴5~10min。。①. Take out a tube (100ul) of competent bacteria from the ultra-low temperature freezer at -70°C, heat it with your fingers immediately to melt it, insert it on ice, and bathe in ice for 5-10 minutes. .
②.加入5ul连接好的质粒混合液(DNA含量不超过100ng),轻轻震荡后放置冰上20min。②. Add 5ul of the ligated plasmid mixture (DNA content does not exceed 100ng), shake gently and place on ice for 20min.
③.轻轻摇匀后插入42℃水浴中45s进行热休克,然后迅速放回冰中,静置3~5min。③. Gently shake well and insert into a 42°C water bath for 45 seconds for heat shock, then quickly put it back into the ice, and let it stand for 3-5 minutes.
④.在超净工作台中向上述各管中分别加入500ul LB培养基(不含抗菌素)轻轻混匀,然后固定到摇床的弹簧架上37℃震荡1h。④. Add 500ul LB medium (without antibiotics) to each of the above-mentioned tubes in the ultra-clean workbench and mix gently, then fix it on the spring stand of the shaker at 37°C and shake for 1h.
⑤.在超净工作台中取上述转化混合液100-300ul,分别滴到含合适抗菌素的固体LB平板培养皿中,用酒精灯烧过的玻璃涂布棒涂布均匀。⑤. Take 100-300 ul of the above-mentioned transformation mixture in the ultra-clean workbench, drop them into solid LB plate culture dishes containing appropriate antibiotics, and spread evenly with a glass coating rod burned by an alcohol lamp.
⑥.在涂好的培养皿上做上标记,先放置在37℃恒温培养箱中30~60min直到表面的液体都渗透到培养基里后,再倒置过来放入37℃恒温培养箱过夜。⑥. Mark the coated culture dish, place it in a 37°C constant temperature incubator for 30-60 minutes until the liquid on the surface penetrates into the medium, then turn it upside down and put it in a 37°C constant temperature incubator overnight.
⑦.挑取板上克隆在液体LB中培养,提取质粒,并通过PCR鉴定转化结果。⑦. Pick the clones on the plate and culture them in liquid LB, extract the plasmids, and identify the transformation results by PCR.
2)构建各种eMutS蛋白突变体表达菌株的具体过程:2) The specific process of constructing various eMutS protein mutant expression strains:
将含有E38C位和G70C位突变的质粒pZJQIBEBT001,转化进BL21大肠杆菌中。将使菌株恢复具有Amp抗性的功能,并新增eMutS的功能。在含有Amp抗性的培养基(配方:胰蛋白胨10g/l,酵母提取物5g/l,NaCl 10g/l)上筛选阳性克隆,所得到的转化菌株命名为:EZJQIBEBT001;The plasmid pZJQIBEBT001 containing mutations at E38C and G70C was transformed into BL21 Escherichia coli. The strain will be restored to function with Amp resistance and added to the function of eMutS. Positive clones were screened on a medium containing Amp resistance (recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l), and the resulting transformed strain was named: EZJQIBEBT001;
将含有S120C位和S151C位突变的质粒pZJQIBEBT002,转化进BL21大肠杆菌中。将使菌株恢复具有Amp抗性的功能,并新增eMutS的功能。在含有Amp抗性的培养基(配方:胰蛋白胨10g/l,酵母提取物5g/l,NaCl 10g/l)上筛选阳性克隆,所得到的转化菌株命名为:EZJQIBEBT002;The plasmid pZJQIBEBT002 containing the S120C and S151C mutations was transformed into BL21 Escherichia coli. The strain will be restored to function with Amp resistance and added to the function of eMutS. Positive clones were screened on a medium containing Amp resistance (recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l), and the resulting transformed strain was named: EZJQIBEBT002;
将含有L157C位和G233C位突变的质粒pZJQIBEBT003,转化进BL21大肠杆菌中。将使菌株恢复具有Amp抗性的功能,并新增eMutS的功能。在含有Amp抗性的培养基(配方:胰蛋白胨10g/l,酵母提取物5g/l,NaCl 10g/l)上筛选阳性克隆,所得到的转化菌株命名为:EZJQIBEBT003;The plasmid pZJQIBEBT003 containing the L157C and G233C mutations was transformed into BL21 Escherichia coli. The strain will be restored to function with Amp resistance and added to the function of eMutS. Positive clones were screened on a medium containing Amp resistance (recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l), and the resulting transformed strain was named: EZJQIBEBT003;
将含有E177C位和G195C位突变的质粒pZJQIBEBT004,转化进BL21大肠杆菌中。将使菌株恢复具有Amp抗性的功能,并新增eMutS的功能。在含有Amp抗性的培养基(配方:胰蛋白胨10g/l,酵母提取物5g/l,NaCl 10g/l)上筛选阳性克隆,所得到的转化菌株命名为:EZJQIBEBT004;The plasmid pZJQIBEBT004 containing the E177C and G195C mutations was transformed into BL21 Escherichia coli. The strain will be restored to function with Amp resistance and added to the function of eMutS. Positive clones were screened on a medium containing Amp resistance (recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l), and the resulting transformed strain was named: EZJQIBEBT004;
将含有E451C位和V465C位突变的质粒pZJQIBEBT005,转化进BL21大肠杆菌中。将使菌株恢复具有Amp抗性的功能,并新增eMutS的功能。在含有Amp抗性的培养基(配方:胰蛋白胨10g/l,酵母提取物5g/l,NaCl 10g/l)上筛选阳性克隆,所得到的转化菌株命名为:EZJQIBEBT005;The plasmid pZJQIBEBT005 containing mutations at E451C and V465C was transformed into BL21 Escherichia coli. The strain will be restored to function with Amp resistance and added to the function of eMutS. Positive clones were screened on a medium containing Amp resistance (recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l), and the resulting transformed strain was named: EZJQIBEBT005;
将含有Y474C位和R500C位突变的质粒pZJQIBEBT006,转化进BL21大肠杆菌中。将使菌株恢复具有Amp抗性的功能,并新增eMutS的功能。在含有Amp抗性的培养基(配方:胰蛋白胨10g/l,酵母提取物5g/l,NaCl 10g/l)上筛选阳性克隆,所得到的转化菌株命名为:EZJQIBEBT006;The plasmid pZJQIBEBT006 containing Y474C and R500C mutations was transformed into BL21 Escherichia coli. The strain will be restored to function with Amp resistance and added to the function of eMutS. Positive clones were screened on a medium containing Amp resistance (recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l), and the resulting transformed strain was named: EZJQIBEBT006;
将含有T581C位和K644C位突变的质粒pZJQIBEBT007,转化进BL21大肠杆菌中。将使菌株恢复具有Amp抗性的功能,并新增eMutS的功能。在含有Amp抗性的培养基(配方:胰蛋白胨10g/l,酵母提取物5g/l,NaCl 10g/l)上筛选阳性克隆,所得到的转化菌株命名为:EZJQIBEBT007;The plasmid pZJQIBEBT007 containing T581C and K644C mutations was transformed into BL21 Escherichia coli. The strain will be restored to function with Amp resistance and added to the function of eMutS. Positive clones were screened on a medium containing Amp resistance (recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l), and the resulting transformed strain was named: EZJQIBEBT007;
将含有H585C位和Q626C位突变的质粒pZJQIBEBT008,转化进BL21大肠杆菌中。将使菌株恢复具有Amp抗性的功能,并新增eMutS的功能。在含有Amp抗性的培养基(配方:胰蛋白胨10g/l,酵母提取物5g/l,NaCl 10g/l)上筛选阳性克隆,所得到的转化菌株命名为:EZJQIBEBT008;The plasmid pZJQIBEBT008 containing the H585C and Q626C mutations was transformed into BL21 Escherichia coli. The strain will be restored to function with Amp resistance and added to the function of eMutS. Positive clones were screened on a medium containing Amp resistance (recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l), and the resulting transformed strain was named: EZJQIBEBT008;
将含有I597C位和H760C位突变的质粒pZJQIBEBT009,转化进BL21大肠杆菌中。将使菌株恢复具有Amp抗性的功能,并新增eMutS的功能。在含有Amp抗性的培养基(配方:胰蛋白胨10g/l,酵母提取物5g/l,NaCl 10g/l)上筛选阳性克隆,所得到的转化菌株命名为:EZJQIBEBT009;The plasmid pZJQIBEBT009 containing the I597C and H760C mutations was transformed into BL21 Escherichia coli. The strain will be restored to function with Amp resistance and added to the function of eMutS. Positive clones were screened on a medium containing Amp resistance (recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l), and the resulting transformed strain was named: EZJQIBEBT009;
将含有M609C位和T723C位突变的质粒pZJQIBEBT010,转化进BL21大肠杆菌中。将使菌株恢复具有Amp抗性的功能,并新增eMutS的功能。在含有Amp抗性的培养基(配方:胰蛋白胨10g/l,酵母提取物5g/l,NaCl 10g/l)上筛选阳性克隆,所得到的转化菌株命名为:EZJQIBEBT010;The plasmid pZJQIBEBT010 containing the M609C and T723C mutations was transformed into BL21 Escherichia coli. The strain will be restored to function with Amp resistance and added to the function of eMutS. Positive clones were screened on a medium containing Amp resistance (recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l), and the resulting transformed strain was named: EZJQIBEBT010;
3)提取质粒,通过PCR鉴定大肠杆菌转化的阳性菌株。3) extract the plasmid, and identify the positive strain transformed by Escherichia coli by PCR.
⑴大肠杆菌质粒提取步骤:(1) Escherichia coli plasmid extraction steps:
①.收菌:将过夜培养(37℃,12-16小时)的菌液于室温≧10,000g离心1-2分钟,彻底弃除上清。①. Bacteria collection: Centrifuge the overnight culture (37°C, 12-16 hours) at room temperature≧10,000g for 1-2 minutes, and discard the supernatant completely.
②.重悬:加入250μl含RNase A的细胞悬浮液(S1),充分混悬震荡或用枪头反复抽打使细菌彻底分散悬浮。②. Resuspension: Add 250 μl of RNase A-containing cell suspension (S1), fully suspend and shake or beat repeatedly with a pipette tip to completely disperse and suspend the bacteria.
③.裂解:加入250μl细胞裂解液(S2),轻轻上下颠倒混合5次,室温静置1-5分钟,待细菌充分裂解,溶液变半透明。③. Lysis: Add 250 μl of cell lysate (S2), mix gently up and down 5 times, and let stand at room temperature for 1-5 minutes. After the bacteria are fully lysed, the solution becomes translucent.
④.中和:加入350μl中和缓冲液(S3),轻轻上下颠倒混合5次,充分混匀,避免剧烈震荡。室温下≧12,000g离心10分钟。④. Neutralization: Add 350 μl neutralization buffer (S3), mix gently up and down 5 times, mix well, avoid violent shock. Centrifuge at ≧12,000g for 10 minutes at room temperature.
⑤.DNA结合:小心吸取上清,转移到插入收集管的离心吸附柱内,室温下≧12,000g离心1分钟,弃除收集管中的废液,将离心吸附柱重新插回收集管中。⑤. DNA binding: Carefully aspirate the supernatant, transfer it to a centrifugal adsorption column inserted into a collection tube, centrifuge at ≧12,000g for 1 minute at room temperature, discard the waste liquid in the collection tube, and reinsert the centrifugal adsorption column into the collection tube.
⑥.清洗:加入500μl漂洗液(WB,请确认已加入乙醇!)于离心吸附柱中,室温下≧12,000g离心30秒,弃除收集管中的废液,将离心吸附柱重新插回收集管中。⑥. Cleaning: Add 500μl of washing solution (WB, please confirm that ethanol has been added!) to the centrifugal adsorption column, centrifuge at ≧12,000g for 30 seconds at room temperature, discard the waste liquid in the collection tube, and insert the centrifugal adsorption column back to collect tube.
⑦.清洗:加入500μl漂洗液(WB,请确认已加入乙醇!)于离心吸附柱中,室温下≧12,000g离心30秒,弃除收集管中的废液,将离心吸附柱重新插回收集管中。将离心吸附柱开盖再次离心2分钟,彻底除去残余漂洗液。。⑦. Cleaning: Add 500 μl of washing solution (WB, please confirm that ethanol has been added!) to the centrifugal adsorption column, centrifuge at ≧12,000g for 30 seconds at room temperature, discard the waste liquid in the collection tube, and insert the centrifugal adsorption column back to collect tube. Uncover the centrifugal adsorption column and centrifuge again for 2 minutes to completely remove the residual rinse solution. .
⑧.洗脱:小心取出离心吸附柱,将其套入一个新的1.5ml灭菌离心管中。向硅胶吸附膜的中央加入100μl洗脱缓冲液(EB),室温放置1分钟后,≧12,000g离心1分钟收集质粒DNA。⑧. Elution: Carefully take out the centrifugal adsorption column and insert it into a new 1.5ml sterilized centrifuge tube. Add 100 μl of elution buffer (EB) to the center of the silica gel adsorption membrane, and after standing at room temperature for 1 minute, centrifuge at ≧12,000 g for 1 minute to collect plasmid DNA.
⑨.储存:弃除离心吸附柱,纯化的质粒可直接用于后续反应或于-20℃长期保存。⑨. Storage: Discard the centrifugal adsorption column, and the purified plasmid can be used directly for subsequent reactions or stored at -20°C for a long time.
⑵鉴定大肠杆菌转化的阳性菌株的PCR体系:(2) PCR system for identifying positive strains transformed by E. coli:
①包含eMutS的质粒的PCR体系:①Plasmid PCR system containing eMutS:
PCR程序PCR program
以通用引物M13作为引物,以质粒为模板,PCR扩增后,能特异性的扩增出基因条带的菌株,即为阳性菌株,并进行下一步实验。其中基因的大小为2586bp。The general primer M13 is used as a primer, and the plasmid is used as a template. After PCR amplification, the strain that can specifically amplify the gene band is a positive strain, and the next step of the experiment is carried out. The size of the gene is 2586bp.
实施例2eMutS蛋白突变体的热稳定性The thermostability of embodiment 2eMutS protein mutant
该实施例用于比较不同eMutS突变体的菌株表达的蛋白酶活稳定性的高低。结果表明EZJQIBEBT011菌株表达的蛋白eMutS-S120C-L157C-E451C-M609C活性稳定性最高。This example is used to compare the stability of protease activity expressed by strains of different eMutS mutants. The results showed that the protein eMutS-S120C-L157C-E451C-M609C expressed by EZJQIBEBT011 strain had the highest activity and stability.
1.在LB培养基平板上复苏菌株。对照株:eMutS。实验株:eMutS-E38C-G70C、eMutS-Y474C-R500C、eMutS-T581C-K644C、eMutS-I597C-H760C、eMutS-S120C-S151C、eMutS-L157C-G233C、eMutS-E451C-V465C、eMutS-M609C-T723C、eMutS-E177C-G195C、eMutS-H585C-Q626C、eMutS-S120C-L157C-E451C-M609C。37℃培养1天。1. Recover strains on LB medium plates. Control strain: eMutS. Experimental strains: eMutS-E38C-G70C, eMutS-Y474C-R500C, eMutS-T581C-K644C, eMutS-I597C-H760C, eMutS-S120C-S151C, eMutS-L157C-G233C, eMutS-E451C-V465C, eMutS-T79CutS-M70 , eMutS-E177C-G195C, eMutS-H585C-Q626C, eMutS-S120C-L157C-E451C-M609C. Incubate at 37°C for 1 day.
2.分别挑取单克隆,接于5ml液体LB培养基。37℃,250rpm,过夜。2. Pick out single clones respectively, and connect them to 5ml liquid LB medium. 37°C, 250rpm, overnight.
3.配制36瓶100mL的YPD培养基分装于1L锥形三角瓶。配方:胰蛋白胨10g/l,酵母提取物5g/l,NaCl 10g/l。灭菌待用。3. Prepare 36 bottles of 100mL YPD medium and distribute them in 1L Erlenmeyer flasks. Recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l. Sterilized and ready to use.
4.取适量过夜培养物接入200ml LB培养基中,接种比例为1:1000,37℃,250rpm培养。4. Take an appropriate amount of overnight culture and inoculate it into 200ml LB medium at an inoculation ratio of 1:1000, culture at 37°C and 250rpm.
5.OD600=0.8左右时,收菌,破碎菌体,纯化蛋白,测酶活。5. When OD600=0.8 or so, harvest the bacteria, crush the bacteria, purify the protein, and measure the enzyme activity.
6.从图2可知,突变体eMutS-E38C-G70C和eMutS-Y474C-R500C失去酶活。突变体eMutS-T581C-K644C和eMutS-I597C-H760C酶活可以持续21天,与对照组相似。突变体eMutS-S120C-S151C、eMutS-L157C-G233C、eMutS-E451C-V465C和eMutS-M609C-T723C,eMutS-S120C-L157C-E451C-M609C比对照组酶活持续时间要长,而且,突变体eMutS-S120C-L157C-E451C-M609C的酶活可以持续长达90天。6. It can be seen from Figure 2 that the mutants eMutS-E38C-G70C and eMutS-Y474C-R500C lost enzyme activity. The enzyme activity of mutants eMutS-T581C-K644C and eMutS-I597C-H760C can last for 21 days, which is similar to that of the control group. Mutants eMutS-S120C-S151C, eMutS-L157C-G233C, eMutS-E451C-V465C and eMutS-M609C-T723C, eMutS-S120C-L157C-E451C-M609C were longer than the control group, and the mutant eMutS -S120C-L157C-E451C-M609C can last for up to 90 days.
实施例3eMutS蛋白突变体结合错配DNA的纠错能力Example 3 Error Correction Ability of eMutS Protein Mutants in Binding to Mismatched DNA
该实施例用于验证两个酶活稳定性最强的eMutS突变体的菌株表达的蛋白酶的纠错能力。结果表明稳定性提高的蛋白酶的纠错能力仍然保持。其中,eMutS-S120C-L157C-E451C-M609C纠错效果最好,甚至优于突变前菌株。This example is used to verify the error correction ability of the protease expressed by the strains of the two eMutS mutants with the strongest enzyme activity stability. The results showed that the error-correcting ability of proteases with improved stability was still maintained. Among them, eMutS-S120C-L157C-E451C-M609C has the best error correction effect, even better than the strain before mutation.
1.在LB培养基平板上复苏菌株。对照株:eMutS。实验株:eMutS-S120C-S151C,eMutS-L157C-G233C,eMutS-S120C-L157C-E451C-M609C。37℃培养1天。1. Recover strains on LB medium plates. Control strain: eMutS. Experimental strains: eMutS-S120C-S151C, eMutS-L157C-G233C, eMutS-S120C-L157C-E451C-M609C. Incubate at 37°C for 1 day.
2.分别挑取单克隆,接于5ml液体LB培养基。37℃,250rpm,过夜。2. Pick out single clones respectively, and connect them to 5ml liquid LB medium. 37°C, 250rpm, overnight.
3.配制12瓶100mL的YPD培养基分装于1L锥形三角瓶。配方:胰蛋白胨10g/l,酵母提取物5g/l,NaCl 10g/l。灭菌待用。3. Prepare 12 bottles of 100mL YPD medium and distribute them in 1L Erlenmeyer flasks. Recipe: tryptone 10g/l, yeast extract 5g/l, NaCl 10g/l. Sterilized and ready to use.
4.取适量过夜培养物接入200ml LB培养基中,接种比例为1:1000,37℃,250rpm培养。4. Take an appropriate amount of overnight culture and inoculate it into 200ml LB medium at an inoculation ratio of 1:1000, culture at 37°C and 250rpm.
5.OD600=0.8左右时,收菌,破碎菌体,纯化蛋白,用于DNA纠错实验。5. When OD600=0.8 or so, harvest the bacteria, crush the bacteria, purify the protein, and use it for the DNA error correction experiment.
6.在应用实验中,所有eMutS蛋白都与MICC系统一起使用,这些在过去的研究中有报道。利用稳定性提高后的纠错系统来纠错用MCp-oligos合成的XR和Cas9同源基因。如表1所示,寡聚物的错误率约为1.2%。使用MICC去除错误显着降低了合成XR和Cas9同源基因的错误率。实验表明,二硫键构建使eMutS蛋白稳定性提高,同时保持原有纠错能力。其中,eMutS-S120C-L157C-E451C-M609C不仅热稳定性大幅度提高,而且纠错效果最好,甚至优于突变前菌株。6. In applied experiments, all eMutS proteins were used with the MICC system, which were reported in past studies. Error correction of XR and Cas9 homologs synthesized with MCp-oligos using the error correction system with improved stability. As shown in Table 1, the oligomer error rate is about 1.2%. Removing errors using MICC significantly reduced the error rate for synthesizing XR and Cas9 homologs. Experiments show that the construction of disulfide bonds improves the stability of eMutS protein while maintaining the original error correction ability. Among them, eMutS-S120C-L157C-E451C-M609C not only greatly improved thermal stability, but also had the best error correction effect, even better than the strain before mutation.
表1. eMutS突变体进行DNA纠错的结果Table 1. Results of DNA error correction in eMutS mutants
应该理解,尽管参考其示例性的实施方案,已经对本发明进行具体地显示和描述,但是本领域的普通技术人员应该理解,在不背离由权利要求所定义的本发明的精神和范围的条件下,可以在其中进行各种形式和细节的变化,可以进行各种实施方案的任意组合。It should be understood that while the invention has been particularly shown and described with reference to exemplary embodiments thereof, those skilled in the art will appreciate that, without departing from the spirit and scope of the invention as defined by the appended claims, , various changes in form and details can be made therein, and any combination of various embodiments can be made.
序列表sequence listing
<110> 中国科学院青岛生物能源与过程研究所<110> Qingdao Institute of Bioenergy and Process Technology, Chinese Academy of Sciences
<120> 一种错配结合蛋白的突变体及其编码基因<120> A mutant of a mismatch binding protein and its coding gene
<130> P2019-0992<130> P2019-0992
<160> 62<160> 62
<170> SIPOSequenceListing 1.0<170> SIPOSequenceListing 1.0
<210> 1<210> 1
<211> 36<211> 36
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 1<400> 1
ggatgggtga tttttattgc ctgttttatg acgacg 36ggatgggtga tttttattgc ctgttttatg acgacg 36
<210> 2<210> 2
<211> 36<211> 36
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 2<400> 2
cgtcgtcata aaacaggcaa taaaaatcac ccatcc 36cgtcgtcata aaacaggcaa taaaaatcac ccatcc 36
<210> 3<210> 3
<211> 35<211> 35
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 3<400> 3
gccgatcccg atggcgtgca ttccctacca tgcgg 35gccgatcccg atggcgtgca ttccctacca tgcgg 35
<210> 4<210> 4
<211> 35<211> 35
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 4<400> 4
ccgcatggta gggaatgcac gccatcggga tcggc 35ccgcatggta gggaatgcac gccatcggga tcggc 35
<210> 5<210> 5
<211> 35<211> 35
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 5<400> 5
tacgccaggc accatctgcg atgaagccct gttgc 35tacgccaggc accatctgcg atgaagccct gttgc 35
<210> 6<210> 6
<211> 35<211> 35
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 6<400> 6
gcaacagggc ttcatcgcag atggtgcctg gcgta 35gcaacagggc ttcatcgcag atggtgcctg gcgta 35
<210> 7<210> 7
<211> 33<211> 33
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 7<400> 7
gcgacgctgg atatctgctc cgggcgtttt cgc 33gcgacgctgg atatctgctc cgggcgtttt cgc 33
<210> 8<210> 8
<211> 33<211> 33
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 8<400> 8
gcgaaaacgc ccggagcaga tatccagcgt cgc 33gcgaaaacgc ccggagcaga tatccagcgt cgc 33
<210> 9<210> 9
<211> 35<211> 35
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 9<400> 9
ttccgggcgt tttcgctgca gcgaaccggc tgacc 35ttccgggcgt tttcgctgca gcgaaccggc tgacc 35
<210> 10<210> 10
<211> 35<211> 35
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 10<400> 10
ggtcagccgg ttcgctgcag cgaaaacgcc cggaa 35ggtcagccgg ttcgctgcag cgaaaacgcc cggaa 35
<210> 11<210> 11
<211> 36<211> 36
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 11<400> 11
cgagaacgcg ccgcgctgcc tttgtgctgc cggttg 36cgagaacgcg ccgcgctgcc tttgtgctgc cggttg 36
<210> 12<210> 12
<211> 36<211> 36
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 12<400> 12
caaccggcag cacaaaggca gcgcggcgcg ttctcg 36caaccggcag cacaaaggca gcgcggcgcg ttctcg 36
<210> 13<210> 13
<211> 35<211> 35
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 13<400> 13
acgcactaat cctgcgtgcc tgctgtatgc agaag 35acgcactaat cctgcgtgcc tgctgtatgc agaag 35
<210> 14<210> 14
<211> 35<211> 35
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 14<400> 14
cttctgcata cagcaggcac gcaggattag tgcgt 35cttctgcata cagcaggcac gcaggattag tgcgt 35
<210> 15<210> 15
<211> 33<211> 33
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 15<400> 15
attgaaggcc gtcgctgcct gcgccgtcgc ccg 33attgaaggcc gtcgctgcct gcgccgtcgc ccg 33
<210> 16<210> 16
<211> 33<211> 33
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 16<400> 16
cgggcgacgg cgcaggcagc gacggccttc aat 33cgggcgacgg cgcaggcagc gacggccttc aat 33
<210> 17<210> 17
<211> 34<211> 34
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 17<400> 17
tatctggagc gtctgtgcgt ccgcgagcgt gaac 34tatctggagc gtctgtgcgt ccgcgagcgt gaac 34
<210> 18<210> 18
<211> 34<211> 34
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 18<400> 18
gttcacgctc gcggacgcac agacgctcca gata 34gttcacgctc gcggacgcac agacgctcca gata 34
<210> 19<210> 19
<211> 33<211> 33
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 19<400> 19
ctggacacgc tgaaatgcgg ctttaatgcg gtg 33ctggacacgc tgaaatgcgg ctttaatgcg gtg 33
<210> 20<210> 20
<211> 33<211> 33
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 20<400> 20
caccgcatta aagccgcatt tcagcgtgtc cag 33caccgcatta aagccgcatt tcagcgtgtc cag 33
<210> 21<210> 21
<211> 33<211> 33
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 21<400> 21
gcggtgcacg gctactgcat tcaaatcagc cgt 33gcggtgcacg gctactgcat tcaaatcagc cgt 33
<210> 22<210> 22
<211> 33<211> 33
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 22<400> 22
acggctgatt tgaatgcagt agccgtgcac cgc 33acggctgatt tgaatgcagt agccgtgcac cgc 33
<210> 23<210> 23
<211> 33<211> 33
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 23<400> 23
ctgaaaaacg ccgagtgcta catcattcca gag 33ctgaaaaacg ccgagtgcta catcattcca gag 33
<210> 24<210> 24
<211> 33<211> 33
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 24<400> 24
ctctggaatg atgtagcact cggcgttttt cag 33ctctggaatg atgtagcact cggcgttttt cag 33
<210> 25<210> 25
<211> 33<211> 33
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 25<400> 25
ccgggcattc gcatttgcga aggtcgccat ccg 33ccgggcattc gcatttgcga aggtcgccat ccg 33
<210> 26<210> 26
<211> 33<211> 33
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 26<400> 26
cggatggcga ccttcgcaaa tgcgaatgcc cgg 33cggatggcga ccttcgcaaa tgcgaatgcc cgg 33
<210> 27<210> 27
<211> 33<211> 33
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 27<400> 27
tatgtaccgg cacaatgcgt cgagattgga cct 33tatgtaccgg cacaatgcgt cgagattgga cct 33
<210> 28<210> 28
<211> 33<211> 33
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 28<400> 28
aggtccaatc tcgacgcatt gtgccggtac ata 33aggtccaatc tcgacgcatt gtgccggtac ata 33
<210> 29<210> 29
<211> 33<211> 33
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 29<400> 29
attaccgaag gtcgctgccc ggtagttgaa caa 33attaccgaag gtcgctgccc ggtagttgaa caa 33
<210> 30<210> 30
<211> 33<211> 33
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 30<400> 30
ttgttcaact accgggcagc gaccttcggt aat 33ttgttcaact accgggcagc gaccttcggt aat 33
<210> 31<210> 31
<211> 33<211> 33
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 31<400> 31
agtacctata tgcgctgcac cgcactgatt gcg 33agtacctata tgcgctgcac cgcactgatt gcg 33
<210> 32<210> 32
<211> 33<211> 33
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 32<400> 32
cgcaatcagt gcggtgcagc gcatataggt act 33cgcaatcagt gcggtgcagc gcatataggt act 33
<210> 33<210> 33
<211> 33<211> 33
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 33<400> 33
ctgaatgagc cattttgcgc caacccgctg aat 33ctgaatgagc cattttgcgc caacccgctg aat 33
<210> 34<210> 34
<211> 33<211> 33
<212> DNA<212> DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 34<400> 34
attcagcggg ttggcgcaaa atggctcatt cag 33attcagcggg ttggcgcaaa atggctcatt cag 33
<210> 35<210> 35
<211> 33<211> 33
<212> DNA<212> DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 35<400> 35
accattgcct ttatgtgcag cgtgcaggat ggc 33accattgcct ttatgtgcag cgtgcaggat ggc 33
<210> 36<210> 36
<211> 33<211> 33
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 36<400> 36
gccatcctgc acgctgcaca taaaggcaat ggt 33gccatcctgc acgctgcaca taaaggcaat ggt 33
<210> 37<210> 37
<211> 34<211> 34
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 37<400> 37
tcgccgcagc gccgctgctt gatcatcacc ggtc 34tcgccgcagc gccgctgctt gatcatcacc ggtc 34
<210> 38<210> 38
<211> 34<211> 34
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 38<400> 38
gaccggtgat gatcaagcag cggcgctgcg gcga 34gaccggtgat gatcaagcag cggcgctgcg gcga 34
<210> 39<210> 39
<211> 36<211> 36
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 39<400> 39
taagattaag gcattgtgct tatttgctac ccacta 36taagattaag gcattgtgct tattgctac cacta 36
<210> 40<210> 40
<211> 36<211> 36
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 40<400> 40
tagtgggtag caaataagca caatgcctta atctta 36tagtgggtag caaataagca caatgcctta atctta 36
<210> 41<210> 41
<211> 31<211> 31
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 41<400> 41
ctagctagca tgagtgcaat agaaaatttc g 31ctagctagca tgagtgcaat agaaaatttc g 31
<210> 42<210> 42
<211> 20<211> 20
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 42<400> 42
cgatacgcac aactttgcgc 20
<210> 43<210> 43
<211> 20<211> 20
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 43<400> 43
gtccggttga gcgcaaagtt 20
<210> 44<210> 44
<211> 20<211> 20
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 44<400> 44
gcccggagca gatatccagc 20
<210> 45<210> 45
<211> 20<211> 20
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 45<400> 45
gctacgcgac gctggatatc 20gctacgcgac gctggatatc 20
<210> 46<210> 46
<211> 20<211> 20
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 46<400> 46
atgctgtcct gctcacgttc 20atgctgtcct gctcacgttc 20
<210> 47<210> 47
<211> 20<211> 20
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 47<400> 47
catcaccatg gaacgtgagc 20
<210> 48<210> 48
<211> 20<211> 20
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 48<400> 48
gcgctcggcg tttttcagcg 20
<210> 49<210> 49
<211> 22<211> 22
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 49<400> 49
cgtcgccaga cgctgaaaaa cg 22cgtcgccaga cgctgaaaaa cg 22
<210> 50<210> 50
<211> 25<211> 25
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 50<400> 50
cgagctccac caggctcttc aagcg 25cgagctccac caggctcttc aagcg 25
<210> 51<210> 51
<211> 2562<211> 2562
<212> DNA<212>DNA
<213> 大肠杆菌(Escherichia coli)<213> Escherichia coli
<400> 51<400> 51
atgagtgcaa tagaaaattt cgacgcccat acgcccatga tgcagcagta tctcaggctg 60atgagtgcaa tagaaaattt cgacgcccat acgcccatga tgcagcagta tctcaggctg 60
aaagcccagc atcccgagat cctgctgttt taccggatgg gtgattttta tgaactgttt 120aaagcccagc atcccgagat cctgctgttt taccggatgg gtgattttta tgaactgttt 120
tatgacgacg caaaacgcgc gtcgcaactg ctggatattt cactgaccaa acgcggtgct 180tatgacgacg caaaacgcgc gtcgcaactg ctggatattt cactgaccaa acgcggtgct 180
tcggcgggag agccgatccc gatggcgggg attccctacc atgcggtgga aaactatctc 240tcggcggggag agccgatccc gatggcgggg attccctacc atgcggtgga aaactatctc 240
gccaaactgg tgaatcaggg agagtccgtt gccatctgcg aacaaattgg cgatccggcg 300gccaaactgg tgaatcaggg agagtccgtt gccatctgcg aacaaattgg cgatccggcg 300
accagcaaag gtccggttga gcgcaaagtt gtgcgtatcg ttacgccagg caccatcagc 360accagcaaag gtccggttga gcgcaaagtt gtgcgtatcg ttacgccagg caccatcagc 360
gatgaagccc tgttgcagga gcgtcaggac aacctgctgg cggctatctg gcaggacagc 420gatgaagccc tgttgcagga gcgtcaggac aacctgctgg cggctatctg gcaggacagc 420
aaaggtttcg gctacgcgac gctggatatc agttccgggc gttttcgcct gagcgaaccg 480aaaggtttcg gctacgcgac gctggatatc agttccgggc gttttcgcct gagcgaaccg 480
gctgaccgcg aaacgatggc ggcagaactg caacgcacta atcctgcgga actgctgtat 540gctgaccgcg aaacgatggc ggcagaactg caacgcacta atcctgcgga actgctgtat 540
gcagaagatt ttgctgaaat gtcgttaatt gaaggccgtc gcggcctgcg ccgtcgcccg 600gcagaagatt ttgctgaaat gtcgttaatt gaaggccgtc gcggcctgcg ccgtcgcccg 600
ctgtgggagt ttgaaatcga caccgcgcgc cagcagttga atctgcaatt tgggacccgc 660ctgtgggagt ttgaaatcga caccgcgcgc cagcagttga atctgcaatt tgggacccgc 660
gatctggtcg gttttggcgt cgagaacgcg ccgcgcggac tttgtgctgc cggttgtctg 720gatctggtcg gttttggcgt cgagaacgcg ccgcgcggac tttgtgctgc cggttgtctg 720
ttgcagtatg cgaaagatac ccaacgtacg actctgccgc atattcgttc catcaccatg 780ttgcagtatg cgaaagatac ccaacgtacg actctgccgc atattcgttc catcaccatg 780
gaacgtgagc aggacagcat cattatggat gccgcgacgc gtcgtaatct ggaaatcacc 840gaacgtgagc aggacagcat cattatggat gccgcgacgc gtcgtaatct ggaaatcacc 840
cagaacctgg cgggtggtgc ggaaaatacg ctggcttctg tgctcgactg caccgtcacg 900cagaacctgg cgggtggtgc ggaaaatacg ctggcttctg tgctcgactg caccgtcacg 900
ccgatgggca gccgtatgct gaaacgctgg ctgcatatgc cagtgcgcga tacccgcgtg 960ccgatgggca gccgtatgct gaaacgctgg ctgcatatgc cagtgcgcga tacccgcgtg 960
ttgcttgagc gccagcaaac tattggcgca ttgcaggatt tcaccgccgg gctacagccg 1020ttgcttgagc gccagcaaac tattggcgca ttgcaggatt tcaccgccgg gctacagccg 1020
gtactgcgtc aggtcggcga cctggaacgt attctggcac gtctggcttt acgaactgct 1080gtactgcgtc aggtcggcga cctggaacgt attctggcac gtctggcttt acgaactgct 1080
cgcccacgcg atctggcccg tatgcgccac gctttccagc aactgccgga gctgcgtgcg 1140cgcccacgcg atctggcccg tatgcgccac gctttccagc aactgccgga gctgcgtgcg 1140
cagttagaaa ctgtcgatag tgcaccggta caggcgctac gtgagaagat gggcgagttt 1200cagttagaaa ctgtcgatag tgcaccggta caggcgctac gtgagaagat gggcgagttt 1200
gccgagctgc gcgatctgct ggagcgagca atcatcgaca caccgccggt gctggtacgc 1260gccgagctgc gcgatctgct ggagcgagca atcatcgaca caccgccggt gctggtacgc 1260
gacggtggtg ttatcgcatc gggctataac gaagagctgg atgagtggcg cgcgctggct 1320gacggtggtg ttatcgcatc gggctataac gaagagctgg atgagtggcg cgcgctggct 1320
gacggcgcga ccgattatct ggagcgtctg gaagtccgcg agcgtgaacg taccggcctg 1380gacggcgcga ccgattatct ggagcgtctg gaagtccgcg agcgtgaacg taccggcctg 1380
gacacgctga aagttggctt taatgcggtg cacggctact acattcaaat cagccgtggg 1440gacacgctga aagttggctt taatgcggtg cacggctact aattcaaat cagccgtggg 1440
caaagccatc tggcacccat caactacatg cgtcgccaga cgctgaaaaa cgccgagcgc 1500caaagccatc tggcacccat caactacatg cgtcgccaga cgctgaaaaa cgccgagcgc 1500
tacatcattc cagagctaaa agagtacgaa gataaagttc tcacctcaaa aggcaaagca 1560tacatcattc cagagctaaa agagtacgaa gataaagttc tcacctcaaa aggcaaagca 1560
ctggcactgg aaaaacagct ttatgaagag ctgttcgacc tgctgttgcc gcatctggaa 1620ctggcactgg aaaaacagct ttatgaagag ctgttcgacc tgctgttgcc gcatctggaa 1620
gcgttgcaac agagcgcgag cgcgctggcg gaactcgacg tgctggttaa cctggcggaa 1680gcgttgcaac agagcgcgag cgcgctggcg gaactcgacg tgctggttaa cctggcggaa 1680
cgggcctata ccctgaacta cacctgcccg accttcattg ataaaccggg cattcgcatt 1740cgggcctata ccctgaacta cacctgcccg accttcattg ataaaccggg cattcgcatt 1740
accgaaggtc gccatccggt agttgaacaa gtactgaatg agccatttat cgccaacccg 1800accgaaggtc gccatccggt agttgaacaa gtactgaatg agccattat cgccaacccg 1800
ctgaatctgt cgccgcagcg ccgcatgttg atcatcaccg gtccgaacat gggcggtaaa 1860ctgaatctgt cgccgcagcg ccgcatgttg atcatcaccg gtccgaacat gggcggtaaa 1860
agtacctata tgcgccagac cgcactgatt gcgctgatgg cctacatcgg cagctatgta 1920agtacctata tgcgccagac cgcactgatt gcgctgatgg cctacatcgg cagctatgta 1920
ccggcacaaa aagtcgagat tggacctatc gatcgcatct ttacccgcgt aggcgcggca 1980ccggcacaaa aagtcgagat tggacctatc gatcgcatct ttacccgcgt aggcgcggca 1980
gatgacctgg cgtccgggcg ctcaaccttt atggtggaga tgactgaaac cgccaatatt 2040gatgacctgg cgtccgggcg ctcaaccttt atggtggaga tgactgaaac cgccaatatt 2040
ttacataacg ccaccgaata cagtctggtg ttaatggatg agatcgggcg tggaacgtcc 2100ttacataacg ccaccgaata cagtctggtg ttaatggatg agatcgggcg tggaacgtcc 2100
acctacgatg gtctgtcgct ggcgtgggcg tgcgcggaaa atctggcgaa taagattaag 2160acctacgatg gtctgtcgct ggcgtgggcg tgcgcggaaa atctggcgaa taagattaag 2160
gcattgacgt tatttgctac ccactatttc gagctgaccc agttaccgga gaaaatggaa 2220gcattgacgt tatttgctac ccactatttc gagctgaccc agttaccgga gaaaatggaa 2220
ggcgtcgcta acgtgcatct cgatgcactg gagcacggcg acaccattgc ctttatgcac 2280ggcgtcgcta acgtgcatct cgatgcactg gagcacggcg acaccattgc ctttatgcac 2280
agcgtgcagg atggcgcggc gagcaaaagc tacggcctgg cggttgcagc tctggcaggc 2340agcgtgcagg atggcgcggc gagcaaaagc tacggcctgg cggttgcagc tctggcaggc 2340
gtgccaaaag aggttattaa gcgcgcacgg caaaagctgc gtgagctgga aagcatttcg 2400gtgccaaaag aggttattaa gcgcgcacgg caaaagctgc gtgagctgga aagcatttcg 2400
ccgaacgccg ccgctacgca agtggatggt acgcaaatgt ctttgctgtc agtaccagaa 2460ccgaacgccg ccgctacgca agtggatggt acgcaaatgt ctttgctgtc agtaccagaa 2460
gaaacttcgc ctgcggtcga agctctggaa aatcttgatc cggattcact caccccgcgt 2520gaaacttcgc ctgcggtcga agctctggaa aatcttgatc cggattcact caccccgcgt 2520
caggcgctgg agtggattta tcgcttgaag agcctggtgt aa 2562caggcgctgg agtggatta tcgcttgaag agcctggtgt aa 2562
<210> 52<210> 52
<211> 853<211> 853
<212> PRT<212> PRT
<213> 大肠杆菌(Escherichia coli)<213> Escherichia coli
<400> 52<400> 52
Met Ser Ala Ile Glu Asn Phe Asp Ala His Thr Pro Met Met Gln GlnMet Ser Ala Ile Glu Asn Phe Asp Ala His Thr Pro Met Met Gln Gln
1 5 10 151 5 10 15
Tyr Leu Arg Leu Lys Ala Gln His Pro Glu Ile Leu Leu Phe Tyr ArgTyr Leu Arg Leu Lys Ala Gln His Pro Glu Ile Leu Leu Phe Tyr Arg
20 25 30 20 25 30
Met Gly Asp Phe Tyr Glu Leu Phe Tyr Asp Asp Ala Lys Arg Ala SerMet Gly Asp Phe Tyr Glu Leu Phe Tyr Asp Asp Ala Lys Arg Ala Ser
35 40 45 35 40 45
Gln Leu Leu Asp Ile Ser Leu Thr Lys Arg Gly Ala Ser Ala Gly GluGln Leu Leu Asp Ile Ser Leu Thr Lys Arg Gly Ala Ser Ala Gly Glu
50 55 60 50 55 60
Pro Ile Pro Met Ala Gly Ile Pro Tyr His Ala Val Glu Asn Tyr LeuPro Ile Pro Met Ala Gly Ile Pro Tyr His Ala Val Glu Asn Tyr Leu
65 70 75 8065 70 75 80
Ala Lys Leu Val Asn Gln Gly Glu Ser Val Ala Ile Cys Glu Gln IleAla Lys Leu Val Asn Gln Gly Glu Ser Val Ala Ile Cys Glu Gln Ile
85 90 95 85 90 95
Gly Asp Pro Ala Thr Ser Lys Gly Pro Val Glu Arg Lys Val Val ArgGly Asp Pro Ala Thr Ser Lys Gly Pro Val Glu Arg Lys Val Val Arg
100 105 110 100 105 110
Ile Val Thr Pro Gly Thr Ile Ser Asp Glu Ala Leu Leu Gln Glu ArgIle Val Thr Pro Gly Thr Ile Ser Asp Glu Ala Leu Leu Gln Glu Arg
115 120 125 115 120 125
Gln Asp Asn Leu Leu Ala Ala Ile Trp Gln Asp Ser Lys Gly Phe GlyGln Asp Asn Leu Leu Ala Ala Ile Trp Gln Asp Ser Lys Gly Phe Gly
130 135 140 130 135 140
Tyr Ala Thr Leu Asp Ile Ser Ser Gly Arg Phe Arg Leu Ser Glu ProTyr Ala Thr Leu Asp Ile Ser Ser Gly Arg Phe Arg Leu Ser Glu Pro
145 150 155 160145 150 155 160
Ala Asp Arg Glu Thr Met Ala Ala Glu Leu Gln Arg Thr Asn Pro AlaAla Asp Arg Glu Thr Met Ala Ala Glu Leu Gln Arg Thr Asn Pro Ala
165 170 175 165 170 175
Glu Leu Leu Tyr Ala Glu Asp Phe Ala Glu Met Ser Leu Ile Glu GlyGlu Leu Leu Tyr Ala Glu Asp Phe Ala Glu Met Ser Leu Ile Glu Gly
180 185 190 180 185 190
Arg Arg Gly Leu Arg Arg Arg Pro Leu Trp Glu Phe Glu Ile Asp ThrArg Arg Gly Leu Arg Arg Arg Pro Leu Trp Glu Phe Glu Ile Asp Thr
195 200 205 195 200 205
Ala Arg Gln Gln Leu Asn Leu Gln Phe Gly Thr Arg Asp Leu Val GlyAla Arg Gln Gln Leu Asn Leu Gln Phe Gly Thr Arg Asp Leu Val Gly
210 215 220 210 215 220
Phe Gly Val Glu Asn Ala Pro Arg Gly Leu Cys Ala Ala Gly Cys LeuPhe Gly Val Glu Asn Ala Pro Arg Gly Leu Cys Ala Ala Gly Cys Leu
225 230 235 240225 230 235 240
Leu Gln Tyr Ala Lys Asp Thr Gln Arg Thr Thr Leu Pro His Ile ArgLeu Gln Tyr Ala Lys Asp Thr Gln Arg Thr Thr Leu Pro His Ile Arg
245 250 255 245 250 255
Ser Ile Thr Met Glu Arg Glu Gln Asp Ser Ile Ile Met Asp Ala AlaSer Ile Thr Met Glu Arg Glu Gln Asp Ser Ile Ile Met Asp Ala Ala
260 265 270 260 265 270
Thr Arg Arg Asn Leu Glu Ile Thr Gln Asn Leu Ala Gly Gly Ala GluThr Arg Arg Asn Leu Glu Ile Thr Gln Asn Leu Ala Gly Gly Ala Glu
275 280 285 275 280 285
Asn Thr Leu Ala Ser Val Leu Asp Cys Thr Val Thr Pro Met Gly SerAsn Thr Leu Ala Ser Val Leu Asp Cys Thr Val Thr Pro Met Gly Ser
290 295 300 290 295 300
Arg Met Leu Lys Arg Trp Leu His Met Pro Val Arg Asp Thr Arg ValArg Met Leu Lys Arg Trp Leu His Met Pro Val Arg Asp Thr Arg Val
305 310 315 320305 310 315 320
Leu Leu Glu Arg Gln Gln Thr Ile Gly Ala Leu Gln Asp Phe Thr AlaLeu Leu Glu Arg Gln Gln Thr Ile Gly Ala Leu Gln Asp Phe Thr Ala
325 330 335 325 330 335
Gly Leu Gln Pro Val Leu Arg Gln Val Gly Asp Leu Glu Arg Ile LeuGly Leu Gln Pro Val Leu Arg Gln Val Gly Asp Leu Glu Arg Ile Leu
340 345 350 340 345 350
Ala Arg Leu Ala Leu Arg Thr Ala Arg Pro Arg Asp Leu Ala Arg MetAla Arg Leu Ala Leu Arg Thr Ala Arg Pro Arg Asp Leu Ala Arg Met
355 360 365 355 360 365
Arg His Ala Phe Gln Gln Leu Pro Glu Leu Arg Ala Gln Leu Glu ThrArg His Ala Phe Gln Gln Leu Pro Glu Leu Arg Ala Gln Leu Glu Thr
370 375 380 370 375 380
Val Asp Ser Ala Pro Val Gln Ala Leu Arg Glu Lys Met Gly Glu PheVal Asp Ser Ala Pro Val Gln Ala Leu Arg Glu Lys Met Gly Glu Phe
385 390 395 400385 390 395 400
Ala Glu Leu Arg Asp Leu Leu Glu Arg Ala Ile Ile Asp Thr Pro ProAla Glu Leu Arg Asp Leu Leu Glu Arg Ala Ile Ile Asp Thr Pro Pro
405 410 415 405 410 415
Val Leu Val Arg Asp Gly Gly Val Ile Ala Ser Gly Tyr Asn Glu GluVal Leu Val Arg Asp Gly Gly Val Ile Ala Ser Gly Tyr Asn Glu Glu
420 425 430 420 425 430
Leu Asp Glu Trp Arg Ala Leu Ala Asp Gly Ala Thr Asp Tyr Leu GluLeu Asp Glu Trp Arg Ala Leu Ala Asp Gly Ala Thr Asp Tyr Leu Glu
435 440 445 435 440 445
Arg Leu Glu Val Arg Glu Arg Glu Arg Thr Gly Leu Asp Thr Leu LysArg Leu Glu Val Arg Glu Arg Glu Arg Thr Gly Leu Asp Thr Leu Lys
450 455 460 450 455 460
Val Gly Phe Asn Ala Val His Gly Tyr Tyr Ile Gln Ile Ser Arg GlyVal Gly Phe Asn Ala Val His Gly Tyr Tyr Ile Gln Ile Ser Arg Gly
465 470 475 480465 470 475 480
Gln Ser His Leu Ala Pro Ile Asn Tyr Met Arg Arg Gln Thr Leu LysGln Ser His Leu Ala Pro Ile Asn Tyr Met Arg Arg Gln Thr Leu Lys
485 490 495 485 490 495
Asn Ala Glu Arg Tyr Ile Ile Pro Glu Leu Lys Glu Tyr Glu Asp LysAsn Ala Glu Arg Tyr Ile Ile Pro Glu Leu Lys Glu Tyr Glu Asp Lys
500 505 510 500 505 510
Val Leu Thr Ser Lys Gly Lys Ala Leu Ala Leu Glu Lys Gln Leu TyrVal Leu Thr Ser Lys Gly Lys Ala Leu Ala Leu Glu Lys Gln Leu Tyr
515 520 525 515 520 525
Glu Glu Leu Phe Asp Leu Leu Leu Pro His Leu Glu Ala Leu Gln GlnGlu Glu Leu Phe Asp Leu Leu Leu Pro His Leu Glu Ala Leu Gln Gln
530 535 540 530 535 540
Ser Ala Ser Ala Leu Ala Glu Leu Asp Val Leu Val Asn Leu Ala GluSer Ala Ser Ala Leu Ala Glu Leu Asp Val Leu Val Asn Leu Ala Glu
545 550 555 560545 550 555 560
Arg Ala Tyr Thr Leu Asn Tyr Thr Cys Pro Thr Phe Ile Asp Lys ProArg Ala Tyr Thr Leu Asn Tyr Thr Cys Pro Thr Phe Ile Asp Lys Pro
565 570 575 565 570 575
Gly Ile Arg Ile Thr Glu Gly Arg His Pro Val Val Glu Gln Val LeuGly Ile Arg Ile Thr Glu Gly Arg His Pro Val Val Glu Gln Val Leu
580 585 590 580 585 590
Asn Glu Pro Phe Ile Ala Asn Pro Leu Asn Leu Ser Pro Gln Arg ArgAsn Glu Pro Phe Ile Ala Asn Pro Leu Asn Leu Ser Pro Gln Arg Arg
595 600 605 595 600 605
Met Leu Ile Ile Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Tyr MetMet Leu Ile Ile Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Tyr Met
610 615 620 610 615 620
Arg Gln Thr Ala Leu Ile Ala Leu Met Ala Tyr Ile Gly Ser Tyr ValArg Gln Thr Ala Leu Ile Ala Leu Met Ala Tyr Ile Gly Ser Tyr Val
625 630 635 640625 630 635 640
Pro Ala Gln Lys Val Glu Ile Gly Pro Ile Asp Arg Ile Phe Thr ArgPro Ala Gln Lys Val Glu Ile Gly Pro Ile Asp Arg Ile Phe Thr Arg
645 650 655 645 650 655
Val Gly Ala Ala Asp Asp Leu Ala Ser Gly Arg Ser Thr Phe Met ValVal Gly Ala Ala Asp Asp Leu Ala Ser Gly Arg Ser Thr Phe Met Val
660 665 670 660 665 670
Glu Met Thr Glu Thr Ala Asn Ile Leu His Asn Ala Thr Glu Tyr SerGlu Met Thr Glu Thr Ala Asn Ile Leu His Asn Ala Thr Glu Tyr Ser
675 680 685 675 680 685
Leu Val Leu Met Asp Glu Ile Gly Arg Gly Thr Ser Thr Tyr Asp GlyLeu Val Leu Met Asp Glu Ile Gly Arg Gly Thr Ser Thr Tyr Asp Gly
690 695 700 690 695 700
Leu Ser Leu Ala Trp Ala Cys Ala Glu Asn Leu Ala Asn Lys Ile LysLeu Ser Leu Ala Trp Ala Cys Ala Glu Asn Leu Ala Asn Lys Ile Lys
705 710 715 720705 710 715 720
Ala Leu Thr Leu Phe Ala Thr His Tyr Phe Glu Leu Thr Gln Leu ProAla Leu Thr Leu Phe Ala Thr His Tyr Phe Glu Leu Thr Gln Leu Pro
725 730 735 725 730 735
Glu Lys Met Glu Gly Val Ala Asn Val His Leu Asp Ala Leu Glu HisGlu Lys Met Glu Gly Val Ala Asn Val His Leu Asp Ala Leu Glu His
740 745 750 740 745 750
Gly Asp Thr Ile Ala Phe Met His Ser Val Gln Asp Gly Ala Ala SerGly Asp Thr Ile Ala Phe Met His Ser Val Gln Asp Gly Ala Ala Ser
755 760 765 755 760 765
Lys Ser Tyr Gly Leu Ala Val Ala Ala Leu Ala Gly Val Pro Lys GluLys Ser Tyr Gly Leu Ala Val Ala Ala Leu Ala Gly Val Pro Lys Glu
770 775 780 770 775 780
Val Ile Lys Arg Ala Arg Gln Lys Leu Arg Glu Leu Glu Ser Ile SerVal Ile Lys Arg Ala Arg Gln Lys Leu Arg Glu Leu Glu Ser Ile Ser
785 790 795 800785 790 795 800
Pro Asn Ala Ala Ala Thr Gln Val Asp Gly Thr Gln Met Ser Leu LeuPro Asn Ala Ala Ala Thr Gln Val Asp Gly Thr Gln Met Ser Leu Leu
805 810 815 805 810 815
Ser Val Pro Glu Glu Thr Ser Pro Ala Val Glu Ala Leu Glu Asn LeuSer Val Pro Glu Glu Thr Ser Pro Ala Val Glu Ala Leu Glu Asn Leu
820 825 830 820 825 830
Asp Pro Asp Ser Leu Thr Pro Arg Gln Ala Leu Glu Trp Ile Tyr ArgAsp Pro Asp Ser Leu Thr Pro Arg Gln Ala Leu Glu Trp Ile Tyr Arg
835 840 845 835 840 845
Leu Lys Ser Leu ValLeu Lys Ser Leu Val
850 850
<210> 53<210> 53
<211> 2562<211> 2562
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 53<400> 53
atgagtgcaa tagaaaattt cgacgcccat acgcccatga tgcagcagta tctcaggctg 60atgagtgcaa tagaaaattt cgacgcccat acgcccatga tgcagcagta tctcaggctg 60
aaagcccagc atcccgagat cctgctgttt taccggatgg gtgattttta tgaactgttt 120aaagcccagc atcccgagat cctgctgttt taccggatgg gtgattttta tgaactgttt 120
tatgacgacg caaaacgcgc gtcgcaactg ctggatattt cactgaccaa acgcggtgct 180tatgacgacg caaaacgcgc gtcgcaactg ctggatattt cactgaccaa acgcggtgct 180
tcggcgggag agccgatccc gatggcgggg attccctacc atgcggtgga aaactatctc 240tcggcggggag agccgatccc gatggcgggg attccctacc atgcggtgga aaactatctc 240
gccaaactgg tgaatcaggg agagtccgtt gccatctgcg aacaaattgg cgatccggcg 300gccaaactgg tgaatcaggg agagtccgtt gccatctgcg aacaaattgg cgatccggcg 300
accagcaaag gtccggttga gcgcaaagtt gtgcgtatcg ttacgccagg caccatctgc 360accagcaaag gtccggttga gcgcaaagtt gtgcgtatcg ttacgccagg caccatctgc 360
gatgaagccc tgttgcagga gcgtcaggac aacctgctgg cggctatctg gcaggacagc 420gatgaagccc tgttgcagga gcgtcaggac aacctgctgg cggctatctg gcaggacagc 420
aaaggtttcg gctacgcgac gctggatatc tgctccgggc gttttcgcct gagcgaaccg 480aaaggtttcg gctacgcgac gctggatatc tgctccgggc gttttcgcct gagcgaaccg 480
gctgaccgcg aaacgatggc ggcagaactg caacgcacta atcctgcgga actgctgtat 540gctgaccgcg aaacgatggc ggcagaactg caacgcacta atcctgcgga actgctgtat 540
gcagaagatt ttgctgaaat gtcgttaatt gaaggccgtc gcggcctgcg ccgtcgcccg 600gcagaagatt ttgctgaaat gtcgttaatt gaaggccgtc gcggcctgcg ccgtcgcccg 600
ctgtgggagt ttgaaatcga caccgcgcgc cagcagttga atctgcaatt tgggacccgc 660ctgtgggagt ttgaaatcga caccgcgcgc cagcagttga atctgcaatt tgggacccgc 660
gatctggtcg gttttggcgt cgagaacgcg ccgcgcggac tttgtgctgc cggttgtctg 720gatctggtcg gttttggcgt cgagaacgcg ccgcgcggac tttgtgctgc cggttgtctg 720
ttgcagtatg cgaaagatac ccaacgtacg actctgccgc atattcgttc catcaccatg 780ttgcagtatg cgaaagatac ccaacgtacg actctgccgc atattcgttc catcaccatg 780
gaacgtgagc aggacagcat cattatggat gccgcgacgc gtcgtaatct ggaaatcacc 840gaacgtgagc aggacagcat cattatggat gccgcgacgc gtcgtaatct ggaaatcacc 840
cagaacctgg cgggtggtgc ggaaaatacg ctggcttctg tgctcgactg caccgtcacg 900cagaacctgg cgggtggtgc ggaaaatacg ctggcttctg tgctcgactg caccgtcacg 900
ccgatgggca gccgtatgct gaaacgctgg ctgcatatgc cagtgcgcga tacccgcgtg 960ccgatgggca gccgtatgct gaaacgctgg ctgcatatgc cagtgcgcga tacccgcgtg 960
ttgcttgagc gccagcaaac tattggcgca ttgcaggatt tcaccgccgg gctacagccg 1020ttgcttgagc gccagcaaac tattggcgca ttgcaggatt tcaccgccgg gctacagccg 1020
gtactgcgtc aggtcggcga cctggaacgt attctggcac gtctggcttt acgaactgct 1080gtactgcgtc aggtcggcga cctggaacgt attctggcac gtctggcttt acgaactgct 1080
cgcccacgcg atctggcccg tatgcgccac gctttccagc aactgccgga gctgcgtgcg 1140cgcccacgcg atctggcccg tatgcgccac gctttccagc aactgccgga gctgcgtgcg 1140
cagttagaaa ctgtcgatag tgcaccggta caggcgctac gtgagaagat gggcgagttt 1200cagttagaaa ctgtcgatag tgcaccggta caggcgctac gtgagaagat gggcgagttt 1200
gccgagctgc gcgatctgct ggagcgagca atcatcgaca caccgccggt gctggtacgc 1260gccgagctgc gcgatctgct ggagcgagca atcatcgaca caccgccggt gctggtacgc 1260
gacggtggtg ttatcgcatc gggctataac gaagagctgg atgagtggcg cgcgctggct 1320gacggtggtg ttatcgcatc gggctataac gaagagctgg atgagtggcg cgcgctggct 1320
gacggcgcga ccgattatct ggagcgtctg gaagtccgcg agcgtgaacg taccggcctg 1380gacggcgcga ccgattatct ggagcgtctg gaagtccgcg agcgtgaacg taccggcctg 1380
gacacgctga aagttggctt taatgcggtg cacggctact acattcaaat cagccgtggg 1440gacacgctga aagttggctt taatgcggtg cacggctact aattcaaat cagccgtggg 1440
caaagccatc tggcacccat caactacatg cgtcgccaga cgctgaaaaa cgccgagcgc 1500caaagccatc tggcacccat caactacatg cgtcgccaga cgctgaaaaa cgccgagcgc 1500
tacatcattc cagagctaaa agagtacgaa gataaagttc tcacctcaaa aggcaaagca 1560tacatcattc cagagctaaa agagtacgaa gataaagttc tcacctcaaa aggcaaagca 1560
ctggcactgg aaaaacagct ttatgaagag ctgttcgacc tgctgttgcc gcatctggaa 1620ctggcactgg aaaaacagct ttatgaagag ctgttcgacc tgctgttgcc gcatctggaa 1620
gcgttgcaac agagcgcgag cgcgctggcg gaactcgacg tgctggttaa cctggcggaa 1680gcgttgcaac agagcgcgag cgcgctggcg gaactcgacg tgctggttaa cctggcggaa 1680
cgggcctata ccctgaacta cacctgcccg accttcattg ataaaccggg cattcgcatt 1740cgggcctata ccctgaacta cacctgcccg accttcattg ataaaccggg cattcgcatt 1740
accgaaggtc gccatccggt agttgaacaa gtactgaatg agccatttat cgccaacccg 1800accgaaggtc gccatccggt agttgaacaa gtactgaatg agccattat cgccaacccg 1800
ctgaatctgt cgccgcagcg ccgcatgttg atcatcaccg gtccgaacat gggcggtaaa 1860ctgaatctgt cgccgcagcg ccgcatgttg atcatcaccg gtccgaacat gggcggtaaa 1860
agtacctata tgcgccagac cgcactgatt gcgctgatgg cctacatcgg cagctatgta 1920agtacctata tgcgccagac cgcactgatt gcgctgatgg cctacatcgg cagctatgta 1920
ccggcacaaa aagtcgagat tggacctatc gatcgcatct ttacccgcgt aggcgcggca 1980ccggcacaaa aagtcgagat tggacctatc gatcgcatct ttacccgcgt aggcgcggca 1980
gatgacctgg cgtccgggcg ctcaaccttt atggtggaga tgactgaaac cgccaatatt 2040gatgacctgg cgtccgggcg ctcaaccttt atggtggaga tgactgaaac cgccaatatt 2040
ttacataacg ccaccgaata cagtctggtg ttaatggatg agatcgggcg tggaacgtcc 2100ttacataacg ccaccgaata cagtctggtg ttaatggatg agatcgggcg tggaacgtcc 2100
acctacgatg gtctgtcgct ggcgtgggcg tgcgcggaaa atctggcgaa taagattaag 2160acctacgatg gtctgtcgct ggcgtgggcg tgcgcggaaa atctggcgaa taagattaag 2160
gcattgacgt tatttgctac ccactatttc gagctgaccc agttaccgga gaaaatggaa 2220gcattgacgt tatttgctac ccactatttc gagctgaccc agttaccgga gaaaatggaa 2220
ggcgtcgcta acgtgcatct cgatgcactg gagcacggcg acaccattgc ctttatgcac 2280ggcgtcgcta acgtgcatct cgatgcactg gagcacggcg acaccattgc ctttatgcac 2280
agcgtgcagg atggcgcggc gagcaaaagc tacggcctgg cggttgcagc tctggcaggc 2340agcgtgcagg atggcgcggc gagcaaaagc tacggcctgg cggttgcagc tctggcaggc 2340
gtgccaaaag aggttattaa gcgcgcacgg caaaagctgc gtgagctgga aagcatttcg 2400gtgccaaaag aggttattaa gcgcgcacgg caaaagctgc gtgagctgga aagcatttcg 2400
ccgaacgccg ccgctacgca agtggatggt acgcaaatgt ctttgctgtc agtaccagaa 2460ccgaacgccg ccgctacgca agtggatggt acgcaaatgt ctttgctgtc agtaccagaa 2460
gaaacttcgc ctgcggtcga agctctggaa aatcttgatc cggattcact caccccgcgt 2520gaaacttcgc ctgcggtcga agctctggaa aatcttgatc cggattcact caccccgcgt 2520
caggcgctgg agtggattta tcgcttgaag agcctggtgt aa 2562caggcgctgg agtggatta tcgcttgaag agcctggtgt aa 2562
<210> 54<210> 54
<211> 853<211> 853
<212> PRT<212> PRT
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 54<400> 54
Met Ser Ala Ile Glu Asn Phe Asp Ala His Thr Pro Met Met Gln GlnMet Ser Ala Ile Glu Asn Phe Asp Ala His Thr Pro Met Met Gln Gln
1 5 10 151 5 10 15
Tyr Leu Arg Leu Lys Ala Gln His Pro Glu Ile Leu Leu Phe Tyr ArgTyr Leu Arg Leu Lys Ala Gln His Pro Glu Ile Leu Leu Phe Tyr Arg
20 25 30 20 25 30
Met Gly Asp Phe Tyr Glu Leu Phe Tyr Asp Asp Ala Lys Arg Ala SerMet Gly Asp Phe Tyr Glu Leu Phe Tyr Asp Asp Ala Lys Arg Ala Ser
35 40 45 35 40 45
Gln Leu Leu Asp Ile Ser Leu Thr Lys Arg Gly Ala Ser Ala Gly GluGln Leu Leu Asp Ile Ser Leu Thr Lys Arg Gly Ala Ser Ala Gly Glu
50 55 60 50 55 60
Pro Ile Pro Met Ala Gly Ile Pro Tyr His Ala Val Glu Asn Tyr LeuPro Ile Pro Met Ala Gly Ile Pro Tyr His Ala Val Glu Asn Tyr Leu
65 70 75 8065 70 75 80
Ala Lys Leu Val Asn Gln Gly Glu Ser Val Ala Ile Cys Glu Gln IleAla Lys Leu Val Asn Gln Gly Glu Ser Val Ala Ile Cys Glu Gln Ile
85 90 95 85 90 95
Gly Asp Pro Ala Thr Ser Lys Gly Pro Val Glu Arg Lys Val Val ArgGly Asp Pro Ala Thr Ser Lys Gly Pro Val Glu Arg Lys Val Val Arg
100 105 110 100 105 110
Ile Val Thr Pro Gly Thr Ile Cys Asp Glu Ala Leu Leu Gln Glu ArgIle Val Thr Pro Gly Thr Ile Cys Asp Glu Ala Leu Leu Gln Glu Arg
115 120 125 115 120 125
Gln Asp Asn Leu Leu Ala Ala Ile Trp Gln Asp Ser Lys Gly Phe GlyGln Asp Asn Leu Leu Ala Ala Ile Trp Gln Asp Ser Lys Gly Phe Gly
130 135 140 130 135 140
Tyr Ala Thr Leu Asp Ile Cys Ser Gly Arg Phe Arg Leu Ser Glu ProTyr Ala Thr Leu Asp Ile Cys Ser Gly Arg Phe Arg Leu Ser Glu Pro
145 150 155 160145 150 155 160
Ala Asp Arg Glu Thr Met Ala Ala Glu Leu Gln Arg Thr Asn Pro AlaAla Asp Arg Glu Thr Met Ala Ala Glu Leu Gln Arg Thr Asn Pro Ala
165 170 175 165 170 175
Glu Leu Leu Tyr Ala Glu Asp Phe Ala Glu Met Ser Leu Ile Glu GlyGlu Leu Leu Tyr Ala Glu Asp Phe Ala Glu Met Ser Leu Ile Glu Gly
180 185 190 180 185 190
Arg Arg Gly Leu Arg Arg Arg Pro Leu Trp Glu Phe Glu Ile Asp ThrArg Arg Gly Leu Arg Arg Arg Pro Leu Trp Glu Phe Glu Ile Asp Thr
195 200 205 195 200 205
Ala Arg Gln Gln Leu Asn Leu Gln Phe Gly Thr Arg Asp Leu Val GlyAla Arg Gln Gln Leu Asn Leu Gln Phe Gly Thr Arg Asp Leu Val Gly
210 215 220 210 215 220
Phe Gly Val Glu Asn Ala Pro Arg Gly Leu Cys Ala Ala Gly Cys LeuPhe Gly Val Glu Asn Ala Pro Arg Gly Leu Cys Ala Ala Gly Cys Leu
225 230 235 240225 230 235 240
Leu Gln Tyr Ala Lys Asp Thr Gln Arg Thr Thr Leu Pro His Ile ArgLeu Gln Tyr Ala Lys Asp Thr Gln Arg Thr Thr Leu Pro His Ile Arg
245 250 255 245 250 255
Ser Ile Thr Met Glu Arg Glu Gln Asp Ser Ile Ile Met Asp Ala AlaSer Ile Thr Met Glu Arg Glu Gln Asp Ser Ile Ile Met Asp Ala Ala
260 265 270 260 265 270
Thr Arg Arg Asn Leu Glu Ile Thr Gln Asn Leu Ala Gly Gly Ala GluThr Arg Arg Asn Leu Glu Ile Thr Gln Asn Leu Ala Gly Gly Ala Glu
275 280 285 275 280 285
Asn Thr Leu Ala Ser Val Leu Asp Cys Thr Val Thr Pro Met Gly SerAsn Thr Leu Ala Ser Val Leu Asp Cys Thr Val Thr Pro Met Gly Ser
290 295 300 290 295 300
Arg Met Leu Lys Arg Trp Leu His Met Pro Val Arg Asp Thr Arg ValArg Met Leu Lys Arg Trp Leu His Met Pro Val Arg Asp Thr Arg Val
305 310 315 320305 310 315 320
Leu Leu Glu Arg Gln Gln Thr Ile Gly Ala Leu Gln Asp Phe Thr AlaLeu Leu Glu Arg Gln Gln Thr Ile Gly Ala Leu Gln Asp Phe Thr Ala
325 330 335 325 330 335
Gly Leu Gln Pro Val Leu Arg Gln Val Gly Asp Leu Glu Arg Ile LeuGly Leu Gln Pro Val Leu Arg Gln Val Gly Asp Leu Glu Arg Ile Leu
340 345 350 340 345 350
Ala Arg Leu Ala Leu Arg Thr Ala Arg Pro Arg Asp Leu Ala Arg MetAla Arg Leu Ala Leu Arg Thr Ala Arg Pro Arg Asp Leu Ala Arg Met
355 360 365 355 360 365
Arg His Ala Phe Gln Gln Leu Pro Glu Leu Arg Ala Gln Leu Glu ThrArg His Ala Phe Gln Gln Leu Pro Glu Leu Arg Ala Gln Leu Glu Thr
370 375 380 370 375 380
Val Asp Ser Ala Pro Val Gln Ala Leu Arg Glu Lys Met Gly Glu PheVal Asp Ser Ala Pro Val Gln Ala Leu Arg Glu Lys Met Gly Glu Phe
385 390 395 400385 390 395 400
Ala Glu Leu Arg Asp Leu Leu Glu Arg Ala Ile Ile Asp Thr Pro ProAla Glu Leu Arg Asp Leu Leu Glu Arg Ala Ile Ile Asp Thr Pro Pro
405 410 415 405 410 415
Val Leu Val Arg Asp Gly Gly Val Ile Ala Ser Gly Tyr Asn Glu GluVal Leu Val Arg Asp Gly Gly Val Ile Ala Ser Gly Tyr Asn Glu Glu
420 425 430 420 425 430
Leu Asp Glu Trp Arg Ala Leu Ala Asp Gly Ala Thr Asp Tyr Leu GluLeu Asp Glu Trp Arg Ala Leu Ala Asp Gly Ala Thr Asp Tyr Leu Glu
435 440 445 435 440 445
Arg Leu Glu Val Arg Glu Arg Glu Arg Thr Gly Leu Asp Thr Leu LysArg Leu Glu Val Arg Glu Arg Glu Arg Thr Gly Leu Asp Thr Leu Lys
450 455 460 450 455 460
Val Gly Phe Asn Ala Val His Gly Tyr Tyr Ile Gln Ile Ser Arg GlyVal Gly Phe Asn Ala Val His Gly Tyr Tyr Ile Gln Ile Ser Arg Gly
465 470 475 480465 470 475 480
Gln Ser His Leu Ala Pro Ile Asn Tyr Met Arg Arg Gln Thr Leu LysGln Ser His Leu Ala Pro Ile Asn Tyr Met Arg Arg Gln Thr Leu Lys
485 490 495 485 490 495
Asn Ala Glu Arg Tyr Ile Ile Pro Glu Leu Lys Glu Tyr Glu Asp LysAsn Ala Glu Arg Tyr Ile Ile Pro Glu Leu Lys Glu Tyr Glu Asp Lys
500 505 510 500 505 510
Val Leu Thr Ser Lys Gly Lys Ala Leu Ala Leu Glu Lys Gln Leu TyrVal Leu Thr Ser Lys Gly Lys Ala Leu Ala Leu Glu Lys Gln Leu Tyr
515 520 525 515 520 525
Glu Glu Leu Phe Asp Leu Leu Leu Pro His Leu Glu Ala Leu Gln GlnGlu Glu Leu Phe Asp Leu Leu Leu Pro His Leu Glu Ala Leu Gln Gln
530 535 540 530 535 540
Ser Ala Ser Ala Leu Ala Glu Leu Asp Val Leu Val Asn Leu Ala GluSer Ala Ser Ala Leu Ala Glu Leu Asp Val Leu Val Asn Leu Ala Glu
545 550 555 560545 550 555 560
Arg Ala Tyr Thr Leu Asn Tyr Thr Cys Pro Thr Phe Ile Asp Lys ProArg Ala Tyr Thr Leu Asn Tyr Thr Cys Pro Thr Phe Ile Asp Lys Pro
565 570 575 565 570 575
Gly Ile Arg Ile Thr Glu Gly Arg His Pro Val Val Glu Gln Val LeuGly Ile Arg Ile Thr Glu Gly Arg His Pro Val Val Glu Gln Val Leu
580 585 590 580 585 590
Asn Glu Pro Phe Ile Ala Asn Pro Leu Asn Leu Ser Pro Gln Arg ArgAsn Glu Pro Phe Ile Ala Asn Pro Leu Asn Leu Ser Pro Gln Arg Arg
595 600 605 595 600 605
Met Leu Ile Ile Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Tyr MetMet Leu Ile Ile Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Tyr Met
610 615 620 610 615 620
Arg Gln Thr Ala Leu Ile Ala Leu Met Ala Tyr Ile Gly Ser Tyr ValArg Gln Thr Ala Leu Ile Ala Leu Met Ala Tyr Ile Gly Ser Tyr Val
625 630 635 640625 630 635 640
Pro Ala Gln Lys Val Glu Ile Gly Pro Ile Asp Arg Ile Phe Thr ArgPro Ala Gln Lys Val Glu Ile Gly Pro Ile Asp Arg Ile Phe Thr Arg
645 650 655 645 650 655
Val Gly Ala Ala Asp Asp Leu Ala Ser Gly Arg Ser Thr Phe Met ValVal Gly Ala Ala Asp Asp Leu Ala Ser Gly Arg Ser Thr Phe Met Val
660 665 670 660 665 670
Glu Met Thr Glu Thr Ala Asn Ile Leu His Asn Ala Thr Glu Tyr SerGlu Met Thr Glu Thr Ala Asn Ile Leu His Asn Ala Thr Glu Tyr Ser
675 680 685 675 680 685
Leu Val Leu Met Asp Glu Ile Gly Arg Gly Thr Ser Thr Tyr Asp GlyLeu Val Leu Met Asp Glu Ile Gly Arg Gly Thr Ser Thr Tyr Asp Gly
690 695 700 690 695 700
Leu Ser Leu Ala Trp Ala Cys Ala Glu Asn Leu Ala Asn Lys Ile LysLeu Ser Leu Ala Trp Ala Cys Ala Glu Asn Leu Ala Asn Lys Ile Lys
705 710 715 720705 710 715 720
Ala Leu Thr Leu Phe Ala Thr His Tyr Phe Glu Leu Thr Gln Leu ProAla Leu Thr Leu Phe Ala Thr His Tyr Phe Glu Leu Thr Gln Leu Pro
725 730 735 725 730 735
Glu Lys Met Glu Gly Val Ala Asn Val His Leu Asp Ala Leu Glu HisGlu Lys Met Glu Gly Val Ala Asn Val His Leu Asp Ala Leu Glu His
740 745 750 740 745 750
Gly Asp Thr Ile Ala Phe Met His Ser Val Gln Asp Gly Ala Ala SerGly Asp Thr Ile Ala Phe Met His Ser Val Gln Asp Gly Ala Ala Ser
755 760 765 755 760 765
Lys Ser Tyr Gly Leu Ala Val Ala Ala Leu Ala Gly Val Pro Lys GluLys Ser Tyr Gly Leu Ala Val Ala Ala Leu Ala Gly Val Pro Lys Glu
770 775 780 770 775 780
Val Ile Lys Arg Ala Arg Gln Lys Leu Arg Glu Leu Glu Ser Ile SerVal Ile Lys Arg Ala Arg Gln Lys Leu Arg Glu Leu Glu Ser Ile Ser
785 790 795 800785 790 795 800
Pro Asn Ala Ala Ala Thr Gln Val Asp Gly Thr Gln Met Ser Leu LeuPro Asn Ala Ala Ala Thr Gln Val Asp Gly Thr Gln Met Ser Leu Leu
805 810 815 805 810 815
Ser Val Pro Glu Glu Thr Ser Pro Ala Val Glu Ala Leu Glu Asn LeuSer Val Pro Glu Glu Thr Ser Pro Ala Val Glu Ala Leu Glu Asn Leu
820 825 830 820 825 830
Asp Pro Asp Ser Leu Thr Pro Arg Gln Ala Leu Glu Trp Ile Tyr ArgAsp Pro Asp Ser Leu Thr Pro Arg Gln Ala Leu Glu Trp Ile Tyr Arg
835 840 845 835 840 845
Leu Lys Ser Leu ValLeu Lys Ser Leu Val
850 850
<210> 55<210> 55
<211> 2562<211> 2562
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 55<400> 55
atgagtgcaa tagaaaattt cgacgcccat acgcccatga tgcagcagta tctcaggctg 60atgagtgcaa tagaaaattt cgacgcccat acgcccatga tgcagcagta tctcaggctg 60
aaagcccagc atcccgagat cctgctgttt taccggatgg gtgattttta tgaactgttt 120aaagcccagc atcccgagat cctgctgttt taccggatgg gtgattttta tgaactgttt 120
tatgacgacg caaaacgcgc gtcgcaactg ctggatattt cactgaccaa acgcggtgct 180tatgacgacg caaaacgcgc gtcgcaactg ctggatattt cactgaccaa acgcggtgct 180
tcggcgggag agccgatccc gatggcgggg attccctacc atgcggtgga aaactatctc 240tcggcggggag agccgatccc gatggcgggg attccctacc atgcggtgga aaactatctc 240
gccaaactgg tgaatcaggg agagtccgtt gccatctgcg aacaaattgg cgatccggcg 300gccaaactgg tgaatcaggg agagtccgtt gccatctgcg aacaaattgg cgatccggcg 300
accagcaaag gtccggttga gcgcaaagtt gtgcgtatcg ttacgccagg caccatctgc 360accagcaaag gtccggttga gcgcaaagtt gtgcgtatcg ttacgccagg caccatctgc 360
gatgaagccc tgttgcagga gcgtcaggac aacctgctgg cggctatctg gcaggacagc 420gatgaagccc tgttgcagga gcgtcaggac aacctgctgg cggctatctg gcaggacagc 420
aaaggtttcg gctacgcgac gctggatatc tgctccgggc gttttcgctg cagcgaaccg 480aaaggtttcg gctacgcgac gctggatatc tgctccgggc gttttcgctg cagcgaaccg 480
gctgaccgcg aaacgatggc ggcagaactg caacgcacta atcctgcgga actgctgtat 540gctgaccgcg aaacgatggc ggcagaactg caacgcacta atcctgcgga actgctgtat 540
gcagaagatt ttgctgaaat gtcgttaatt gaaggccgtc gcggcctgcg ccgtcgcccg 600gcagaagatt ttgctgaaat gtcgttaatt gaaggccgtc gcggcctgcg ccgtcgcccg 600
ctgtgggagt ttgaaatcga caccgcgcgc cagcagttga atctgcaatt tgggacccgc 660ctgtgggagt ttgaaatcga caccgcgcgc cagcagttga atctgcaatt tgggacccgc 660
gatctggtcg gttttggcgt cgagaacgcg ccgcgctgcc tttgtgctgc cggttgtctg 720gatctggtcg gttttggcgt cgagaacgcg ccgcgctgcc tttgtgctgc cggttgtctg 720
ttgcagtatg cgaaagatac ccaacgtacg actctgccgc atattcgttc catcaccatg 780ttgcagtatg cgaaagatac ccaacgtacg actctgccgc atattcgttc catcaccatg 780
gaacgtgagc aggacagcat cattatggat gccgcgacgc gtcgtaatct ggaaatcacc 840gaacgtgagc aggacagcat cattatggat gccgcgacgc gtcgtaatct ggaaatcacc 840
cagaacctgg cgggtggtgc ggaaaatacg ctggcttctg tgctcgactg caccgtcacg 900cagaacctgg cgggtggtgc ggaaaatacg ctggcttctg tgctcgactg caccgtcacg 900
ccgatgggca gccgtatgct gaaacgctgg ctgcatatgc cagtgcgcga tacccgcgtg 960ccgatgggca gccgtatgct gaaacgctgg ctgcatatgc cagtgcgcga tacccgcgtg 960
ttgcttgagc gccagcaaac tattggcgca ttgcaggatt tcaccgccgg gctacagccg 1020ttgcttgagc gccagcaaac tattggcgca ttgcaggatt tcaccgccgg gctacagccg 1020
gtactgcgtc aggtcggcga cctggaacgt attctggcac gtctggcttt acgaactgct 1080gtactgcgtc aggtcggcga cctggaacgt attctggcac gtctggcttt acgaactgct 1080
cgcccacgcg atctggcccg tatgcgccac gctttccagc aactgccgga gctgcgtgcg 1140cgcccacgcg atctggcccg tatgcgccac gctttccagc aactgccgga gctgcgtgcg 1140
cagttagaaa ctgtcgatag tgcaccggta caggcgctac gtgagaagat gggcgagttt 1200cagttagaaa ctgtcgatag tgcaccggta caggcgctac gtgagaagat gggcgagttt 1200
gccgagctgc gcgatctgct ggagcgagca atcatcgaca caccgccggt gctggtacgc 1260gccgagctgc gcgatctgct ggagcgagca atcatcgaca caccgccggt gctggtacgc 1260
gacggtggtg ttatcgcatc gggctataac gaagagctgg atgagtggcg cgcgctggct 1320gacggtggtg ttatcgcatc gggctataac gaagagctgg atgagtggcg cgcgctggct 1320
gacggcgcga ccgattatct ggagcgtctg gaagtccgcg agcgtgaacg taccggcctg 1380gacggcgcga ccgattatct ggagcgtctg gaagtccgcg agcgtgaacg taccggcctg 1380
gacacgctga aagttggctt taatgcggtg cacggctact acattcaaat cagccgtggg 1440gacacgctga aagttggctt taatgcggtg cacggctact aattcaaat cagccgtggg 1440
caaagccatc tggcacccat caactacatg cgtcgccaga cgctgaaaaa cgccgagcgc 1500caaagccatc tggcacccat caactacatg cgtcgccaga cgctgaaaaa cgccgagcgc 1500
tacatcattc cagagctaaa agagtacgaa gataaagttc tcacctcaaa aggcaaagca 1560tacatcattc cagagctaaa agagtacgaa gataaagttc tcacctcaaa aggcaaagca 1560
ctggcactgg aaaaacagct ttatgaagag ctgttcgacc tgctgttgcc gcatctggaa 1620ctggcactgg aaaaacagct ttatgaagag ctgttcgacc tgctgttgcc gcatctggaa 1620
gcgttgcaac agagcgcgag cgcgctggcg gaactcgacg tgctggttaa cctggcggaa 1680gcgttgcaac agagcgcgag cgcgctggcg gaactcgacg tgctggttaa cctggcggaa 1680
cgggcctata ccctgaacta cacctgcccg accttcattg ataaaccggg cattcgcatt 1740cgggcctata ccctgaacta cacctgcccg accttcattg ataaaccggg cattcgcatt 1740
accgaaggtc gccatccggt agttgaacaa gtactgaatg agccatttat cgccaacccg 1800accgaaggtc gccatccggt agttgaacaa gtactgaatg agccattat cgccaacccg 1800
ctgaatctgt cgccgcagcg ccgcatgttg atcatcaccg gtccgaacat gggcggtaaa 1860ctgaatctgt cgccgcagcg ccgcatgttg atcatcaccg gtccgaacat gggcggtaaa 1860
agtacctata tgcgccagac cgcactgatt gcgctgatgg cctacatcgg cagctatgta 1920agtacctata tgcgccagac cgcactgatt gcgctgatgg cctacatcgg cagctatgta 1920
ccggcacaaa aagtcgagat tggacctatc gatcgcatct ttacccgcgt aggcgcggca 1980ccggcacaaa aagtcgagat tggacctatc gatcgcatct ttacccgcgt aggcgcggca 1980
gatgacctgg cgtccgggcg ctcaaccttt atggtggaga tgactgaaac cgccaatatt 2040gatgacctgg cgtccgggcg ctcaaccttt atggtggaga tgactgaaac cgccaatatt 2040
ttacataacg ccaccgaata cagtctggtg ttaatggatg agatcgggcg tggaacgtcc 2100ttacataacg ccaccgaata cagtctggtg ttaatggatg agatcgggcg tggaacgtcc 2100
acctacgatg gtctgtcgct ggcgtgggcg tgcgcggaaa atctggcgaa taagattaag 2160acctacgatg gtctgtcgct ggcgtgggcg tgcgcggaaa atctggcgaa taagattaag 2160
gcattgacgt tatttgctac ccactatttc gagctgaccc agttaccgga gaaaatggaa 2220gcattgacgt tatttgctac ccactatttc gagctgaccc agttaccgga gaaaatggaa 2220
ggcgtcgcta acgtgcatct cgatgcactg gagcacggcg acaccattgc ctttatgcac 2280ggcgtcgcta acgtgcatct cgatgcactg gagcacggcg acaccattgc ctttatgcac 2280
agcgtgcagg atggcgcggc gagcaaaagc tacggcctgg cggttgcagc tctggcaggc 2340agcgtgcagg atggcgcggc gagcaaaagc tacggcctgg cggttgcagc tctggcaggc 2340
gtgccaaaag aggttattaa gcgcgcacgg caaaagctgc gtgagctgga aagcatttcg 2400gtgccaaaag aggttattaa gcgcgcacgg caaaagctgc gtgagctgga aagcatttcg 2400
ccgaacgccg ccgctacgca agtggatggt acgcaaatgt ctttgctgtc agtaccagaa 2460ccgaacgccg ccgctacgca agtggatggt acgcaaatgt ctttgctgtc agtaccagaa 2460
gaaacttcgc ctgcggtcga agctctggaa aatcttgatc cggattcact caccccgcgt 2520gaaacttcgc ctgcggtcga agctctggaa aatcttgatc cggattcact caccccgcgt 2520
caggcgctgg agtggattta tcgcttgaag agcctggtgt aa 2562caggcgctgg agtggatta tcgcttgaag agcctggtgt aa 2562
<210> 56<210> 56
<211> 853<211> 853
<212> PRT<212> PRT
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 56<400> 56
Met Ser Ala Ile Glu Asn Phe Asp Ala His Thr Pro Met Met Gln GlnMet Ser Ala Ile Glu Asn Phe Asp Ala His Thr Pro Met Met Gln Gln
1 5 10 151 5 10 15
Tyr Leu Arg Leu Lys Ala Gln His Pro Glu Ile Leu Leu Phe Tyr ArgTyr Leu Arg Leu Lys Ala Gln His Pro Glu Ile Leu Leu Phe Tyr Arg
20 25 30 20 25 30
Met Gly Asp Phe Tyr Glu Leu Phe Tyr Asp Asp Ala Lys Arg Ala SerMet Gly Asp Phe Tyr Glu Leu Phe Tyr Asp Asp Ala Lys Arg Ala Ser
35 40 45 35 40 45
Gln Leu Leu Asp Ile Ser Leu Thr Lys Arg Gly Ala Ser Ala Gly GluGln Leu Leu Asp Ile Ser Leu Thr Lys Arg Gly Ala Ser Ala Gly Glu
50 55 60 50 55 60
Pro Ile Pro Met Ala Gly Ile Pro Tyr His Ala Val Glu Asn Tyr LeuPro Ile Pro Met Ala Gly Ile Pro Tyr His Ala Val Glu Asn Tyr Leu
65 70 75 8065 70 75 80
Ala Lys Leu Val Asn Gln Gly Glu Ser Val Ala Ile Cys Glu Gln IleAla Lys Leu Val Asn Gln Gly Glu Ser Val Ala Ile Cys Glu Gln Ile
85 90 95 85 90 95
Gly Asp Pro Ala Thr Ser Lys Gly Pro Val Glu Arg Lys Val Val ArgGly Asp Pro Ala Thr Ser Lys Gly Pro Val Glu Arg Lys Val Val Arg
100 105 110 100 105 110
Ile Val Thr Pro Gly Thr Ile Ser Asp Glu Ala Leu Leu Gln Glu ArgIle Val Thr Pro Gly Thr Ile Ser Asp Glu Ala Leu Leu Gln Glu Arg
115 120 125 115 120 125
Gln Asp Asn Leu Leu Ala Ala Ile Trp Gln Asp Ser Lys Gly Phe GlyGln Asp Asn Leu Leu Ala Ala Ile Trp Gln Asp Ser Lys Gly Phe Gly
130 135 140 130 135 140
Tyr Ala Thr Leu Asp Ile Ser Ser Gly Arg Phe Arg Cys Ser Glu ProTyr Ala Thr Leu Asp Ile Ser Ser Gly Arg Phe Arg Cys Ser Glu Pro
145 150 155 160145 150 155 160
Ala Asp Arg Glu Thr Met Ala Ala Glu Leu Gln Arg Thr Asn Pro AlaAla Asp Arg Glu Thr Met Ala Ala Glu Leu Gln Arg Thr Asn Pro Ala
165 170 175 165 170 175
Glu Leu Leu Tyr Ala Glu Asp Phe Ala Glu Met Ser Leu Ile Glu GlyGlu Leu Leu Tyr Ala Glu Asp Phe Ala Glu Met Ser Leu Ile Glu Gly
180 185 190 180 185 190
Arg Arg Gly Leu Arg Arg Arg Pro Leu Trp Glu Phe Glu Ile Asp ThrArg Arg Gly Leu Arg Arg Arg Pro Leu Trp Glu Phe Glu Ile Asp Thr
195 200 205 195 200 205
Ala Arg Gln Gln Leu Asn Leu Gln Phe Gly Thr Arg Asp Leu Val GlyAla Arg Gln Gln Leu Asn Leu Gln Phe Gly Thr Arg Asp Leu Val Gly
210 215 220 210 215 220
Phe Gly Val Glu Asn Ala Pro Arg Cys Leu Cys Ala Ala Gly Cys LeuPhe Gly Val Glu Asn Ala Pro Arg Cys Leu Cys Ala Ala Gly Cys Leu
225 230 235 240225 230 235 240
Leu Gln Tyr Ala Lys Asp Thr Gln Arg Thr Thr Leu Pro His Ile ArgLeu Gln Tyr Ala Lys Asp Thr Gln Arg Thr Thr Leu Pro His Ile Arg
245 250 255 245 250 255
Ser Ile Thr Met Glu Arg Glu Gln Asp Ser Ile Ile Met Asp Ala AlaSer Ile Thr Met Glu Arg Glu Gln Asp Ser Ile Ile Met Asp Ala Ala
260 265 270 260 265 270
Thr Arg Arg Asn Leu Glu Ile Thr Gln Asn Leu Ala Gly Gly Ala GluThr Arg Arg Asn Leu Glu Ile Thr Gln Asn Leu Ala Gly Gly Ala Glu
275 280 285 275 280 285
Asn Thr Leu Ala Ser Val Leu Asp Cys Thr Val Thr Pro Met Gly SerAsn Thr Leu Ala Ser Val Leu Asp Cys Thr Val Thr Pro Met Gly Ser
290 295 300 290 295 300
Arg Met Leu Lys Arg Trp Leu His Met Pro Val Arg Asp Thr Arg ValArg Met Leu Lys Arg Trp Leu His Met Pro Val Arg Asp Thr Arg Val
305 310 315 320305 310 315 320
Leu Leu Glu Arg Gln Gln Thr Ile Gly Ala Leu Gln Asp Phe Thr AlaLeu Leu Glu Arg Gln Gln Thr Ile Gly Ala Leu Gln Asp Phe Thr Ala
325 330 335 325 330 335
Gly Leu Gln Pro Val Leu Arg Gln Val Gly Asp Leu Glu Arg Ile LeuGly Leu Gln Pro Val Leu Arg Gln Val Gly Asp Leu Glu Arg Ile Leu
340 345 350 340 345 350
Ala Arg Leu Ala Leu Arg Thr Ala Arg Pro Arg Asp Leu Ala Arg MetAla Arg Leu Ala Leu Arg Thr Ala Arg Pro Arg Asp Leu Ala Arg Met
355 360 365 355 360 365
Arg His Ala Phe Gln Gln Leu Pro Glu Leu Arg Ala Gln Leu Glu ThrArg His Ala Phe Gln Gln Leu Pro Glu Leu Arg Ala Gln Leu Glu Thr
370 375 380 370 375 380
Val Asp Ser Ala Pro Val Gln Ala Leu Arg Glu Lys Met Gly Glu PheVal Asp Ser Ala Pro Val Gln Ala Leu Arg Glu Lys Met Gly Glu Phe
385 390 395 400385 390 395 400
Ala Glu Leu Arg Asp Leu Leu Glu Arg Ala Ile Ile Asp Thr Pro ProAla Glu Leu Arg Asp Leu Leu Glu Arg Ala Ile Ile Asp Thr Pro Pro
405 410 415 405 410 415
Val Leu Val Arg Asp Gly Gly Val Ile Ala Ser Gly Tyr Asn Glu GluVal Leu Val Arg Asp Gly Gly Val Ile Ala Ser Gly Tyr Asn Glu Glu
420 425 430 420 425 430
Leu Asp Glu Trp Arg Ala Leu Ala Asp Gly Ala Thr Asp Tyr Leu GluLeu Asp Glu Trp Arg Ala Leu Ala Asp Gly Ala Thr Asp Tyr Leu Glu
435 440 445 435 440 445
Arg Leu Glu Val Arg Glu Arg Glu Arg Thr Gly Leu Asp Thr Leu LysArg Leu Glu Val Arg Glu Arg Glu Arg Thr Gly Leu Asp Thr Leu Lys
450 455 460 450 455 460
Val Gly Phe Asn Ala Val His Gly Tyr Tyr Ile Gln Ile Ser Arg GlyVal Gly Phe Asn Ala Val His Gly Tyr Tyr Ile Gln Ile Ser Arg Gly
465 470 475 480465 470 475 480
Gln Ser His Leu Ala Pro Ile Asn Tyr Met Arg Arg Gln Thr Leu LysGln Ser His Leu Ala Pro Ile Asn Tyr Met Arg Arg Gln Thr Leu Lys
485 490 495 485 490 495
Asn Ala Glu Arg Tyr Ile Ile Pro Glu Leu Lys Glu Tyr Glu Asp LysAsn Ala Glu Arg Tyr Ile Ile Pro Glu Leu Lys Glu Tyr Glu Asp Lys
500 505 510 500 505 510
Val Leu Thr Ser Lys Gly Lys Ala Leu Ala Leu Glu Lys Gln Leu TyrVal Leu Thr Ser Lys Gly Lys Ala Leu Ala Leu Glu Lys Gln Leu Tyr
515 520 525 515 520 525
Glu Glu Leu Phe Asp Leu Leu Leu Pro His Leu Glu Ala Leu Gln GlnGlu Glu Leu Phe Asp Leu Leu Leu Pro His Leu Glu Ala Leu Gln Gln
530 535 540 530 535 540
Ser Ala Ser Ala Leu Ala Glu Leu Asp Val Leu Val Asn Leu Ala GluSer Ala Ser Ala Leu Ala Glu Leu Asp Val Leu Val Asn Leu Ala Glu
545 550 555 560545 550 555 560
Arg Ala Tyr Thr Leu Asn Tyr Thr Cys Pro Thr Phe Ile Asp Lys ProArg Ala Tyr Thr Leu Asn Tyr Thr Cys Pro Thr Phe Ile Asp Lys Pro
565 570 575 565 570 575
Gly Ile Arg Ile Thr Glu Gly Arg His Pro Val Val Glu Gln Val LeuGly Ile Arg Ile Thr Glu Gly Arg His Pro Val Val Glu Gln Val Leu
580 585 590 580 585 590
Asn Glu Pro Phe Ile Ala Asn Pro Leu Asn Leu Ser Pro Gln Arg ArgAsn Glu Pro Phe Ile Ala Asn Pro Leu Asn Leu Ser Pro Gln Arg Arg
595 600 605 595 600 605
Met Leu Ile Ile Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Tyr MetMet Leu Ile Ile Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Tyr Met
610 615 620 610 615 620
Arg Gln Thr Ala Leu Ile Ala Leu Met Ala Tyr Ile Gly Ser Tyr ValArg Gln Thr Ala Leu Ile Ala Leu Met Ala Tyr Ile Gly Ser Tyr Val
625 630 635 640625 630 635 640
Pro Ala Gln Lys Val Glu Ile Gly Pro Ile Asp Arg Ile Phe Thr ArgPro Ala Gln Lys Val Glu Ile Gly Pro Ile Asp Arg Ile Phe Thr Arg
645 650 655 645 650 655
Val Gly Ala Ala Asp Asp Leu Ala Ser Gly Arg Ser Thr Phe Met ValVal Gly Ala Ala Asp Asp Leu Ala Ser Gly Arg Ser Thr Phe Met Val
660 665 670 660 665 670
Glu Met Thr Glu Thr Ala Asn Ile Leu His Asn Ala Thr Glu Tyr SerGlu Met Thr Glu Thr Ala Asn Ile Leu His Asn Ala Thr Glu Tyr Ser
675 680 685 675 680 685
Leu Val Leu Met Asp Glu Ile Gly Arg Gly Thr Ser Thr Tyr Asp GlyLeu Val Leu Met Asp Glu Ile Gly Arg Gly Thr Ser Thr Tyr Asp Gly
690 695 700 690 695 700
Leu Ser Leu Ala Trp Ala Cys Ala Glu Asn Leu Ala Asn Lys Ile LysLeu Ser Leu Ala Trp Ala Cys Ala Glu Asn Leu Ala Asn Lys Ile Lys
705 710 715 720705 710 715 720
Ala Leu Thr Leu Phe Ala Thr His Tyr Phe Glu Leu Thr Gln Leu ProAla Leu Thr Leu Phe Ala Thr His Tyr Phe Glu Leu Thr Gln Leu Pro
725 730 735 725 730 735
Glu Lys Met Glu Gly Val Ala Asn Val His Leu Asp Ala Leu Glu HisGlu Lys Met Glu Gly Val Ala Asn Val His Leu Asp Ala Leu Glu His
740 745 750 740 745 750
Gly Asp Thr Ile Ala Phe Met His Ser Val Gln Asp Gly Ala Ala SerGly Asp Thr Ile Ala Phe Met His Ser Val Gln Asp Gly Ala Ala Ser
755 760 765 755 760 765
Lys Ser Tyr Gly Leu Ala Val Ala Ala Leu Ala Gly Val Pro Lys GluLys Ser Tyr Gly Leu Ala Val Ala Ala Leu Ala Gly Val Pro Lys Glu
770 775 780 770 775 780
Val Ile Lys Arg Ala Arg Gln Lys Leu Arg Glu Leu Glu Ser Ile SerVal Ile Lys Arg Ala Arg Gln Lys Leu Arg Glu Leu Glu Ser Ile Ser
785 790 795 800785 790 795 800
Pro Asn Ala Ala Ala Thr Gln Val Asp Gly Thr Gln Met Ser Leu LeuPro Asn Ala Ala Ala Thr Gln Val Asp Gly Thr Gln Met Ser Leu Leu
805 810 815 805 810 815
Ser Val Pro Glu Glu Thr Ser Pro Ala Val Glu Ala Leu Glu Asn LeuSer Val Pro Glu Glu Thr Ser Pro Ala Val Glu Ala Leu Glu Asn Leu
820 825 830 820 825 830
Asp Pro Asp Ser Leu Thr Pro Arg Gln Ala Leu Glu Trp Ile Tyr ArgAsp Pro Asp Ser Leu Thr Pro Arg Gln Ala Leu Glu Trp Ile Tyr Arg
835 840 845 835 840 845
Leu Lys Ser Leu ValLeu Lys Ser Leu Val
850 850
<210> 57<210> 57
<211> 2562<211> 2562
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 57<400> 57
atgagtgcaa tagaaaattt cgacgcccat acgcccatga tgcagcagta tctcaggctg 60atgagtgcaa tagaaaattt cgacgcccat acgcccatga tgcagcagta tctcaggctg 60
aaagcccagc atcccgagat cctgctgttt taccggatgg gtgattttta tgaactgttt 120aaagcccagc atcccgagat cctgctgttt taccggatgg gtgattttta tgaactgttt 120
tatgacgacg caaaacgcgc gtcgcaactg ctggatattt cactgaccaa acgcggtgct 180tatgacgacg caaaacgcgc gtcgcaactg ctggatattt cactgaccaa acgcggtgct 180
tcggcgggag agccgatccc gatggcgggg attccctacc atgcggtgga aaactatctc 240tcggcggggag agccgatccc gatggcgggg attccctacc atgcggtgga aaactatctc 240
gccaaactgg tgaatcaggg agagtccgtt gccatctgcg aacaaattgg cgatccggcg 300gccaaactgg tgaatcaggg agagtccgtt gccatctgcg aacaaattgg cgatccggcg 300
accagcaaag gtccggttga gcgcaaagtt gtgcgtatcg ttacgccagg caccatctgc 360accagcaaag gtccggttga gcgcaaagtt gtgcgtatcg ttacgccagg caccatctgc 360
gatgaagccc tgttgcagga gcgtcaggac aacctgctgg cggctatctg gcaggacagc 420gatgaagccc tgttgcagga gcgtcaggac aacctgctgg cggctatctg gcaggacagc 420
aaaggtttcg gctacgcgac gctggatatc tgctccgggc gttttcgcct gagcgaaccg 480aaaggtttcg gctacgcgac gctggatatc tgctccgggc gttttcgcct gagcgaaccg 480
gctgaccgcg aaacgatggc ggcagaactg caacgcacta atcctgcgga actgctgtat 540gctgaccgcg aaacgatggc ggcagaactg caacgcacta atcctgcgga actgctgtat 540
gcagaagatt ttgctgaaat gtcgttaatt gaaggccgtc gcggcctgcg ccgtcgcccg 600gcagaagatt ttgctgaaat gtcgttaatt gaaggccgtc gcggcctgcg ccgtcgcccg 600
ctgtgggagt ttgaaatcga caccgcgcgc cagcagttga atctgcaatt tgggacccgc 660ctgtgggagt ttgaaatcga caccgcgcgc cagcagttga atctgcaatt tgggacccgc 660
gatctggtcg gttttggcgt cgagaacgcg ccgcgcggac tttgtgctgc cggttgtctg 720gatctggtcg gttttggcgt cgagaacgcg ccgcgcggac tttgtgctgc cggttgtctg 720
ttgcagtatg cgaaagatac ccaacgtacg actctgccgc atattcgttc catcaccatg 780ttgcagtatg cgaaagatac ccaacgtacg actctgccgc atattcgttc catcaccatg 780
gaacgtgagc aggacagcat cattatggat gccgcgacgc gtcgtaatct ggaaatcacc 840gaacgtgagc aggacagcat cattatggat gccgcgacgc gtcgtaatct ggaaatcacc 840
cagaacctgg cgggtggtgc ggaaaatacg ctggcttctg tgctcgactg caccgtcacg 900cagaacctgg cgggtggtgc ggaaaatacg ctggcttctg tgctcgactg caccgtcacg 900
ccgatgggca gccgtatgct gaaacgctgg ctgcatatgc cagtgcgcga tacccgcgtg 960ccgatgggca gccgtatgct gaaacgctgg ctgcatatgc cagtgcgcga tacccgcgtg 960
ttgcttgagc gccagcaaac tattggcgca ttgcaggatt tcaccgccgg gctacagccg 1020ttgcttgagc gccagcaaac tattggcgca ttgcaggatt tcaccgccgg gctacagccg 1020
gtactgcgtc aggtcggcga cctggaacgt attctggcac gtctggcttt acgaactgct 1080gtactgcgtc aggtcggcga cctggaacgt attctggcac gtctggcttt acgaactgct 1080
cgcccacgcg atctggcccg tatgcgccac gctttccagc aactgccgga gctgcgtgcg 1140cgcccacgcg atctggcccg tatgcgccac gctttccagc aactgccgga gctgcgtgcg 1140
cagttagaaa ctgtcgatag tgcaccggta caggcgctac gtgagaagat gggcgagttt 1200cagttagaaa ctgtcgatag tgcaccggta caggcgctac gtgagaagat gggcgagttt 1200
gccgagctgc gcgatctgct ggagcgagca atcatcgaca caccgccggt gctggtacgc 1260gccgagctgc gcgatctgct ggagcgagca atcatcgaca caccgccggt gctggtacgc 1260
gacggtggtg ttatcgcatc gggctataac gaagagctgg atgagtggcg cgcgctggct 1320gacggtggtg ttatcgcatc gggctataac gaagagctgg atgagtggcg cgcgctggct 1320
gacggcgcga ccgattatct ggagcgtctg tgcgtccgcg agcgtgaacg taccggcctg 1380gacggcgcga ccgattatct ggagcgtctg tgcgtccgcg agcgtgaacg taccggcctg 1380
gacacgctga aatgcggctt taatgcggtg cacggctact acattcaaat cagccgtggg 1440gacacgctga aatgcggctt taatgcggtg cacggctact aattcaaat cagccgtggg 1440
caaagccatc tggcacccat caactacatg cgtcgccaga cgctgaaaaa cgccgagcgc 1500caaagccatc tggcacccat caactacatg cgtcgccaga cgctgaaaaa cgccgagcgc 1500
tacatcattc cagagctaaa agagtacgaa gataaagttc tcacctcaaa aggcaaagca 1560tacatcattc cagagctaaa agagtacgaa gataaagttc tcacctcaaa aggcaaagca 1560
ctggcactgg aaaaacagct ttatgaagag ctgttcgacc tgctgttgcc gcatctggaa 1620ctggcactgg aaaaacagct ttatgaagag ctgttcgacc tgctgttgcc gcatctggaa 1620
gcgttgcaac agagcgcgag cgcgctggcg gaactcgacg tgctggttaa cctggcggaa 1680gcgttgcaac agagcgcgag cgcgctggcg gaactcgacg tgctggttaa cctggcggaa 1680
cgggcctata ccctgaacta cacctgcccg accttcattg ataaaccggg cattcgcatt 1740cgggcctata ccctgaacta cacctgcccg accttcattg ataaaccggg cattcgcatt 1740
accgaaggtc gccatccggt agttgaacaa gtactgaatg agccatttat cgccaacccg 1800accgaaggtc gccatccggt agttgaacaa gtactgaatg agccattat cgccaacccg 1800
ctgaatctgt cgccgcagcg ccgcatgttg atcatcaccg gtccgaacat gggcggtaaa 1860ctgaatctgt cgccgcagcg ccgcatgttg atcatcaccg gtccgaacat gggcggtaaa 1860
agtacctata tgcgccagac cgcactgatt gcgctgatgg cctacatcgg cagctatgta 1920agtacctata tgcgccagac cgcactgatt gcgctgatgg cctacatcgg cagctatgta 1920
ccggcacaaa aagtcgagat tggacctatc gatcgcatct ttacccgcgt aggcgcggca 1980ccggcacaaa aagtcgagat tggacctatc gatcgcatct ttacccgcgt aggcgcggca 1980
gatgacctgg cgtccgggcg ctcaaccttt atggtggaga tgactgaaac cgccaatatt 2040gatgacctgg cgtccgggcg ctcaaccttt atggtggaga tgactgaaac cgccaatatt 2040
ttacataacg ccaccgaata cagtctggtg ttaatggatg agatcgggcg tggaacgtcc 2100ttacataacg ccaccgaata cagtctggtg ttaatggatg agatcgggcg tggaacgtcc 2100
acctacgatg gtctgtcgct ggcgtgggcg tgcgcggaaa atctggcgaa taagattaag 2160acctacgatg gtctgtcgct ggcgtgggcg tgcgcggaaa atctggcgaa taagattaag 2160
gcattgacgt tatttgctac ccactatttc gagctgaccc agttaccgga gaaaatggaa 2220gcattgacgt tatttgctac ccactatttc gagctgaccc agttaccgga gaaaatggaa 2220
ggcgtcgcta acgtgcatct cgatgcactg gagcacggcg acaccattgc ctttatgcac 2280ggcgtcgcta acgtgcatct cgatgcactg gagcacggcg acaccattgc ctttatgcac 2280
agcgtgcagg atggcgcggc gagcaaaagc tacggcctgg cggttgcagc tctggcaggc 2340agcgtgcagg atggcgcggc gagcaaaagc tacggcctgg cggttgcagc tctggcaggc 2340
gtgccaaaag aggttattaa gcgcgcacgg caaaagctgc gtgagctgga aagcatttcg 2400gtgccaaaag aggttattaa gcgcgcacgg caaaagctgc gtgagctgga aagcatttcg 2400
ccgaacgccg ccgctacgca agtggatggt acgcaaatgt ctttgctgtc agtaccagaa 2460ccgaacgccg ccgctacgca agtggatggt acgcaaatgt ctttgctgtc agtaccagaa 2460
gaaacttcgc ctgcggtcga agctctggaa aatcttgatc cggattcact caccccgcgt 2520gaaacttcgc ctgcggtcga agctctggaa aatcttgatc cggattcact caccccgcgt 2520
caggcgctgg agtggattta tcgcttgaag agcctggtgt aa 2562caggcgctgg agtggatta tcgcttgaag agcctggtgt aa 2562
<210> 58<210> 58
<211> 853<211> 853
<212> PRT<212> PRT
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 58<400> 58
Met Ser Ala Ile Glu Asn Phe Asp Ala His Thr Pro Met Met Gln GlnMet Ser Ala Ile Glu Asn Phe Asp Ala His Thr Pro Met Met Gln Gln
1 5 10 151 5 10 15
Tyr Leu Arg Leu Lys Ala Gln His Pro Glu Ile Leu Leu Phe Tyr ArgTyr Leu Arg Leu Lys Ala Gln His Pro Glu Ile Leu Leu Phe Tyr Arg
20 25 30 20 25 30
Met Gly Asp Phe Tyr Glu Leu Phe Tyr Asp Asp Ala Lys Arg Ala SerMet Gly Asp Phe Tyr Glu Leu Phe Tyr Asp Asp Ala Lys Arg Ala Ser
35 40 45 35 40 45
Gln Leu Leu Asp Ile Ser Leu Thr Lys Arg Gly Ala Ser Ala Gly GluGln Leu Leu Asp Ile Ser Leu Thr Lys Arg Gly Ala Ser Ala Gly Glu
50 55 60 50 55 60
Pro Ile Pro Met Ala Gly Ile Pro Tyr His Ala Val Glu Asn Tyr LeuPro Ile Pro Met Ala Gly Ile Pro Tyr His Ala Val Glu Asn Tyr Leu
65 70 75 8065 70 75 80
Ala Lys Leu Val Asn Gln Gly Glu Ser Val Ala Ile Cys Glu Gln IleAla Lys Leu Val Asn Gln Gly Glu Ser Val Ala Ile Cys Glu Gln Ile
85 90 95 85 90 95
Gly Asp Pro Ala Thr Ser Lys Gly Pro Val Glu Arg Lys Val Val ArgGly Asp Pro Ala Thr Ser Lys Gly Pro Val Glu Arg Lys Val Val Arg
100 105 110 100 105 110
Ile Val Thr Pro Gly Thr Ile Ser Asp Glu Ala Leu Leu Gln Glu ArgIle Val Thr Pro Gly Thr Ile Ser Asp Glu Ala Leu Leu Gln Glu Arg
115 120 125 115 120 125
Gln Asp Asn Leu Leu Ala Ala Ile Trp Gln Asp Ser Lys Gly Phe GlyGln Asp Asn Leu Leu Ala Ala Ile Trp Gln Asp Ser Lys Gly Phe Gly
130 135 140 130 135 140
Tyr Ala Thr Leu Asp Ile Ser Ser Gly Arg Phe Arg Leu Ser Glu ProTyr Ala Thr Leu Asp Ile Ser Ser Gly Arg Phe Arg Leu Ser Glu Pro
145 150 155 160145 150 155 160
Ala Asp Arg Glu Thr Met Ala Ala Glu Leu Gln Arg Thr Asn Pro AlaAla Asp Arg Glu Thr Met Ala Ala Glu Leu Gln Arg Thr Asn Pro Ala
165 170 175 165 170 175
Glu Leu Leu Tyr Ala Glu Asp Phe Ala Glu Met Ser Leu Ile Glu GlyGlu Leu Leu Tyr Ala Glu Asp Phe Ala Glu Met Ser Leu Ile Glu Gly
180 185 190 180 185 190
Arg Arg Gly Leu Arg Arg Arg Pro Leu Trp Glu Phe Glu Ile Asp ThrArg Arg Gly Leu Arg Arg Arg Pro Leu Trp Glu Phe Glu Ile Asp Thr
195 200 205 195 200 205
Ala Arg Gln Gln Leu Asn Leu Gln Phe Gly Thr Arg Asp Leu Val GlyAla Arg Gln Gln Leu Asn Leu Gln Phe Gly Thr Arg Asp Leu Val Gly
210 215 220 210 215 220
Phe Gly Val Glu Asn Ala Pro Arg Gly Leu Cys Ala Ala Gly Cys LeuPhe Gly Val Glu Asn Ala Pro Arg Gly Leu Cys Ala Ala Gly Cys Leu
225 230 235 240225 230 235 240
Leu Gln Tyr Ala Lys Asp Thr Gln Arg Thr Thr Leu Pro His Ile ArgLeu Gln Tyr Ala Lys Asp Thr Gln Arg Thr Thr Leu Pro His Ile Arg
245 250 255 245 250 255
Ser Ile Thr Met Glu Arg Glu Gln Asp Ser Ile Ile Met Asp Ala AlaSer Ile Thr Met Glu Arg Glu Gln Asp Ser Ile Ile Met Asp Ala Ala
260 265 270 260 265 270
Thr Arg Arg Asn Leu Glu Ile Thr Gln Asn Leu Ala Gly Gly Ala GluThr Arg Arg Asn Leu Glu Ile Thr Gln Asn Leu Ala Gly Gly Ala Glu
275 280 285 275 280 285
Asn Thr Leu Ala Ser Val Leu Asp Cys Thr Val Thr Pro Met Gly SerAsn Thr Leu Ala Ser Val Leu Asp Cys Thr Val Thr Pro Met Gly Ser
290 295 300 290 295 300
Arg Met Leu Lys Arg Trp Leu His Met Pro Val Arg Asp Thr Arg ValArg Met Leu Lys Arg Trp Leu His Met Pro Val Arg Asp Thr Arg Val
305 310 315 320305 310 315 320
Leu Leu Glu Arg Gln Gln Thr Ile Gly Ala Leu Gln Asp Phe Thr AlaLeu Leu Glu Arg Gln Gln Thr Ile Gly Ala Leu Gln Asp Phe Thr Ala
325 330 335 325 330 335
Gly Leu Gln Pro Val Leu Arg Gln Val Gly Asp Leu Glu Arg Ile LeuGly Leu Gln Pro Val Leu Arg Gln Val Gly Asp Leu Glu Arg Ile Leu
340 345 350 340 345 350
Ala Arg Leu Ala Leu Arg Thr Ala Arg Pro Arg Asp Leu Ala Arg MetAla Arg Leu Ala Leu Arg Thr Ala Arg Pro Arg Asp Leu Ala Arg Met
355 360 365 355 360 365
Arg His Ala Phe Gln Gln Leu Pro Glu Leu Arg Ala Gln Leu Glu ThrArg His Ala Phe Gln Gln Leu Pro Glu Leu Arg Ala Gln Leu Glu Thr
370 375 380 370 375 380
Val Asp Ser Ala Pro Val Gln Ala Leu Arg Glu Lys Met Gly Glu PheVal Asp Ser Ala Pro Val Gln Ala Leu Arg Glu Lys Met Gly Glu Phe
385 390 395 400385 390 395 400
Ala Glu Leu Arg Asp Leu Leu Glu Arg Ala Ile Ile Asp Thr Pro ProAla Glu Leu Arg Asp Leu Leu Glu Arg Ala Ile Ile Asp Thr Pro Pro
405 410 415 405 410 415
Val Leu Val Arg Asp Gly Gly Val Ile Ala Ser Gly Tyr Asn Glu GluVal Leu Val Arg Asp Gly Gly Val Ile Ala Ser Gly Tyr Asn Glu Glu
420 425 430 420 425 430
Leu Asp Glu Trp Arg Ala Leu Ala Asp Gly Ala Thr Asp Tyr Leu GluLeu Asp Glu Trp Arg Ala Leu Ala Asp Gly Ala Thr Asp Tyr Leu Glu
435 440 445 435 440 445
Arg Leu Cys Val Arg Glu Arg Glu Arg Thr Gly Leu Asp Thr Leu LysArg Leu Cys Val Arg Glu Arg Glu Arg Thr Gly Leu Asp Thr Leu Lys
450 455 460 450 455 460
Cys Gly Phe Asn Ala Val His Gly Tyr Tyr Ile Gln Ile Ser Arg GlyCys Gly Phe Asn Ala Val His Gly Tyr Tyr Ile Gln Ile Ser Arg Gly
465 470 475 480465 470 475 480
Gln Ser His Leu Ala Pro Ile Asn Tyr Met Arg Arg Gln Thr Leu LysGln Ser His Leu Ala Pro Ile Asn Tyr Met Arg Arg Gln Thr Leu Lys
485 490 495 485 490 495
Asn Ala Glu Arg Tyr Ile Ile Pro Glu Leu Lys Glu Tyr Glu Asp LysAsn Ala Glu Arg Tyr Ile Ile Pro Glu Leu Lys Glu Tyr Glu Asp Lys
500 505 510 500 505 510
Val Leu Thr Ser Lys Gly Lys Ala Leu Ala Leu Glu Lys Gln Leu TyrVal Leu Thr Ser Lys Gly Lys Ala Leu Ala Leu Glu Lys Gln Leu Tyr
515 520 525 515 520 525
Glu Glu Leu Phe Asp Leu Leu Leu Pro His Leu Glu Ala Leu Gln GlnGlu Glu Leu Phe Asp Leu Leu Leu Pro His Leu Glu Ala Leu Gln Gln
530 535 540 530 535 540
Ser Ala Ser Ala Leu Ala Glu Leu Asp Val Leu Val Asn Leu Ala GluSer Ala Ser Ala Leu Ala Glu Leu Asp Val Leu Val Asn Leu Ala Glu
545 550 555 560545 550 555 560
Arg Ala Tyr Thr Leu Asn Tyr Thr Cys Pro Thr Phe Ile Asp Lys ProArg Ala Tyr Thr Leu Asn Tyr Thr Cys Pro Thr Phe Ile Asp Lys Pro
565 570 575 565 570 575
Gly Ile Arg Ile Thr Glu Gly Arg His Pro Val Val Glu Gln Val LeuGly Ile Arg Ile Thr Glu Gly Arg His Pro Val Val Glu Gln Val Leu
580 585 590 580 585 590
Asn Glu Pro Phe Ile Ala Asn Pro Leu Asn Leu Ser Pro Gln Arg ArgAsn Glu Pro Phe Ile Ala Asn Pro Leu Asn Leu Ser Pro Gln Arg Arg
595 600 605 595 600 605
Met Leu Ile Ile Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Tyr MetMet Leu Ile Ile Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Tyr Met
610 615 620 610 615 620
Arg Gln Thr Ala Leu Ile Ala Leu Met Ala Tyr Ile Gly Ser Tyr ValArg Gln Thr Ala Leu Ile Ala Leu Met Ala Tyr Ile Gly Ser Tyr Val
625 630 635 640625 630 635 640
Pro Ala Gln Lys Val Glu Ile Gly Pro Ile Asp Arg Ile Phe Thr ArgPro Ala Gln Lys Val Glu Ile Gly Pro Ile Asp Arg Ile Phe Thr Arg
645 650 655 645 650 655
Val Gly Ala Ala Asp Asp Leu Ala Ser Gly Arg Ser Thr Phe Met ValVal Gly Ala Ala Asp Asp Leu Ala Ser Gly Arg Ser Thr Phe Met Val
660 665 670 660 665 670
Glu Met Thr Glu Thr Ala Asn Ile Leu His Asn Ala Thr Glu Tyr SerGlu Met Thr Glu Thr Ala Asn Ile Leu His Asn Ala Thr Glu Tyr Ser
675 680 685 675 680 685
Leu Val Leu Met Asp Glu Ile Gly Arg Gly Thr Ser Thr Tyr Asp GlyLeu Val Leu Met Asp Glu Ile Gly Arg Gly Thr Ser Thr Tyr Asp Gly
690 695 700 690 695 700
Leu Ser Leu Ala Trp Ala Cys Ala Glu Asn Leu Ala Asn Lys Ile LysLeu Ser Leu Ala Trp Ala Cys Ala Glu Asn Leu Ala Asn Lys Ile Lys
705 710 715 720705 710 715 720
Ala Leu Thr Leu Phe Ala Thr His Tyr Phe Glu Leu Thr Gln Leu ProAla Leu Thr Leu Phe Ala Thr His Tyr Phe Glu Leu Thr Gln Leu Pro
725 730 735 725 730 735
Glu Lys Met Glu Gly Val Ala Asn Val His Leu Asp Ala Leu Glu HisGlu Lys Met Glu Gly Val Ala Asn Val His Leu Asp Ala Leu Glu His
740 745 750 740 745 750
Gly Asp Thr Ile Ala Phe Met His Ser Val Gln Asp Gly Ala Ala SerGly Asp Thr Ile Ala Phe Met His Ser Val Gln Asp Gly Ala Ala Ser
755 760 765 755 760 765
Lys Ser Tyr Gly Leu Ala Val Ala Ala Leu Ala Gly Val Pro Lys GluLys Ser Tyr Gly Leu Ala Val Ala Ala Leu Ala Gly Val Pro Lys Glu
770 775 780 770 775 780
Val Ile Lys Arg Ala Arg Gln Lys Leu Arg Glu Leu Glu Ser Ile SerVal Ile Lys Arg Ala Arg Gln Lys Leu Arg Glu Leu Glu Ser Ile Ser
785 790 795 800785 790 795 800
Pro Asn Ala Ala Ala Thr Gln Val Asp Gly Thr Gln Met Ser Leu LeuPro Asn Ala Ala Ala Thr Gln Val Asp Gly Thr Gln Met Ser Leu Leu
805 810 815 805 810 815
Ser Val Pro Glu Glu Thr Ser Pro Ala Val Glu Ala Leu Glu Asn LeuSer Val Pro Glu Glu Thr Ser Pro Ala Val Glu Ala Leu Glu Asn Leu
820 825 830 820 825 830
Asp Pro Asp Ser Leu Thr Pro Arg Gln Ala Leu Glu Trp Ile Tyr ArgAsp Pro Asp Ser Leu Thr Pro Arg Gln Ala Leu Glu Trp Ile Tyr Arg
835 840 845 835 840 845
Leu Lys Ser Leu ValLeu Lys Ser Leu Val
850 850
<210> 59<210> 59
<211> 2562<211> 2562
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 59<400> 59
atgagtgcaa tagaaaattt cgacgcccat acgcccatga tgcagcagta tctcaggctg 60atgagtgcaa tagaaaattt cgacgcccat acgcccatga tgcagcagta tctcaggctg 60
aaagcccagc atcccgagat cctgctgttt taccggatgg gtgattttta tgaactgttt 120aaagcccagc atcccgagat cctgctgttt taccggatgg gtgattttta tgaactgttt 120
tatgacgacg caaaacgcgc gtcgcaactg ctggatattt cactgaccaa acgcggtgct 180tatgacgacg caaaacgcgc gtcgcaactg ctggatattt cactgaccaa acgcggtgct 180
tcggcgggag agccgatccc gatggcgggg attccctacc atgcggtgga aaactatctc 240tcggcggggag agccgatccc gatggcgggg attccctacc atgcggtgga aaactatctc 240
gccaaactgg tgaatcaggg agagtccgtt gccatctgcg aacaaattgg cgatccggcg 300gccaaactgg tgaatcaggg agagtccgtt gccatctgcg aacaaattgg cgatccggcg 300
accagcaaag gtccggttga gcgcaaagtt gtgcgtatcg ttacgccagg caccatctgc 360accagcaaag gtccggttga gcgcaaagtt gtgcgtatcg ttacgccagg caccatctgc 360
gatgaagccc tgttgcagga gcgtcaggac aacctgctgg cggctatctg gcaggacagc 420gatgaagccc tgttgcagga gcgtcaggac aacctgctgg cggctatctg gcaggacagc 420
aaaggtttcg gctacgcgac gctggatatc tgctccgggc gttttcgcct gagcgaaccg 480aaaggtttcg gctacgcgac gctggatatc tgctccgggc gttttcgcct gagcgaaccg 480
gctgaccgcg aaacgatggc ggcagaactg caacgcacta atcctgcgga actgctgtat 540gctgaccgcg aaacgatggc ggcagaactg caacgcacta atcctgcgga actgctgtat 540
gcagaagatt ttgctgaaat gtcgttaatt gaaggccgtc gcggcctgcg ccgtcgcccg 600gcagaagatt ttgctgaaat gtcgttaatt gaaggccgtc gcggcctgcg ccgtcgcccg 600
ctgtgggagt ttgaaatcga caccgcgcgc cagcagttga atctgcaatt tgggacccgc 660ctgtgggagt ttgaaatcga caccgcgcgc cagcagttga atctgcaatt tgggacccgc 660
gatctggtcg gttttggcgt cgagaacgcg ccgcgcggac tttgtgctgc cggttgtctg 720gatctggtcg gttttggcgt cgagaacgcg ccgcgcggac tttgtgctgc cggttgtctg 720
ttgcagtatg cgaaagatac ccaacgtacg actctgccgc atattcgttc catcaccatg 780ttgcagtatg cgaaagatac ccaacgtacg actctgccgc atattcgttc catcaccatg 780
gaacgtgagc aggacagcat cattatggat gccgcgacgc gtcgtaatct ggaaatcacc 840gaacgtgagc aggacagcat cattatggat gccgcgacgc gtcgtaatct ggaaatcacc 840
cagaacctgg cgggtggtgc ggaaaatacg ctggcttctg tgctcgactg caccgtcacg 900cagaacctgg cgggtggtgc ggaaaatacg ctggcttctg tgctcgactg caccgtcacg 900
ccgatgggca gccgtatgct gaaacgctgg ctgcatatgc cagtgcgcga tacccgcgtg 960ccgatgggca gccgtatgct gaaacgctgg ctgcatatgc cagtgcgcga tacccgcgtg 960
ttgcttgagc gccagcaaac tattggcgca ttgcaggatt tcaccgccgg gctacagccg 1020ttgcttgagc gccagcaaac tattggcgca ttgcaggatt tcaccgccgg gctacagccg 1020
gtactgcgtc aggtcggcga cctggaacgt attctggcac gtctggcttt acgaactgct 1080gtactgcgtc aggtcggcga cctggaacgt attctggcac gtctggcttt acgaactgct 1080
cgcccacgcg atctggcccg tatgcgccac gctttccagc aactgccgga gctgcgtgcg 1140cgcccacgcg atctggcccg tatgcgccac gctttccagc aactgccgga gctgcgtgcg 1140
cagttagaaa ctgtcgatag tgcaccggta caggcgctac gtgagaagat gggcgagttt 1200cagttagaaa ctgtcgatag tgcaccggta caggcgctac gtgagaagat gggcgagttt 1200
gccgagctgc gcgatctgct ggagcgagca atcatcgaca caccgccggt gctggtacgc 1260gccgagctgc gcgatctgct ggagcgagca atcatcgaca caccgccggt gctggtacgc 1260
gacggtggtg ttatcgcatc gggctataac gaagagctgg atgagtggcg cgcgctggct 1320gacggtggtg ttatcgcatc gggctataac gaagagctgg atgagtggcg cgcgctggct 1320
gacggcgcga ccgattatct ggagcgtctg gaagtccgcg agcgtgaacg taccggcctg 1380gacggcgcga ccgattatct ggagcgtctg gaagtccgcg agcgtgaacg taccggcctg 1380
gacacgctga aagttggctt taatgcggtg cacggctact acattcaaat cagccgtggg 1440gacacgctga aagttggctt taatgcggtg cacggctact aattcaaat cagccgtggg 1440
caaagccatc tggcacccat caactacatg cgtcgccaga cgctgaaaaa cgccgagcgc 1500caaagccatc tggcacccat caactacatg cgtcgccaga cgctgaaaaa cgccgagcgc 1500
tacatcattc cagagctaaa agagtacgaa gataaagttc tcacctcaaa aggcaaagca 1560tacatcattc cagagctaaa agagtacgaa gataaagttc tcacctcaaa aggcaaagca 1560
ctggcactgg aaaaacagct ttatgaagag ctgttcgacc tgctgttgcc gcatctggaa 1620ctggcactgg aaaaacagct ttatgaagag ctgttcgacc tgctgttgcc gcatctggaa 1620
gcgttgcaac agagcgcgag cgcgctggcg gaactcgacg tgctggttaa cctggcggaa 1680gcgttgcaac agagcgcgag cgcgctggcg gaactcgacg tgctggttaa cctggcggaa 1680
cgggcctata ccctgaacta cacctgcccg accttcattg ataaaccggg cattcgcatt 1740cgggcctata ccctgaacta cacctgcccg accttcattg ataaaccggg cattcgcatt 1740
accgaaggtc gccatccggt agttgaacaa gtactgaatg agccatttat cgccaacccg 1800accgaaggtc gccatccggt agttgaacaa gtactgaatg agccattat cgccaacccg 1800
ctgaatctgt cgccgcagcg ccgctgcttg atcatcaccg gtccgaacat gggcggtaaa 1860ctgaatctgt cgccgcagcg ccgctgcttg atcatcaccg gtccgaacat gggcggtaaa 1860
agtacctata tgcgccagac cgcactgatt gcgctgatgg cctacatcgg cagctatgta 1920agtacctata tgcgccagac cgcactgatt gcgctgatgg cctacatcgg cagctatgta 1920
ccggcacaaa aagtcgagat tggacctatc gatcgcatct ttacccgcgt aggcgcggca 1980ccggcacaaa aagtcgagat tggacctatc gatcgcatct ttacccgcgt aggcgcggca 1980
gatgacctgg cgtccgggcg ctcaaccttt atggtggaga tgactgaaac cgccaatatt 2040gatgacctgg cgtccgggcg ctcaaccttt atggtggaga tgactgaaac cgccaatatt 2040
ttacataacg ccaccgaata cagtctggtg ttaatggatg agatcgggcg tggaacgtcc 2100ttacataacg ccaccgaata cagtctggtg ttaatggatg agatcgggcg tggaacgtcc 2100
acctacgatg gtctgtcgct ggcgtgggcg tgcgcggaaa atctggcgaa taagattaag 2160acctacgatg gtctgtcgct ggcgtgggcg tgcgcggaaa atctggcgaa taagattaag 2160
gcattgtgct tatttgctac ccactatttc gagctgaccc agttaccgga gaaaatggaa 2220gcattgtgct tatttgctac ccactatttc gagctgaccc agttaccgga gaaaatggaa 2220
ggcgtcgcta acgtgcatct cgatgcactg gagcacggcg acaccattgc ctttatgcac 2280ggcgtcgcta acgtgcatct cgatgcactg gagcacggcg acaccattgc ctttatgcac 2280
agcgtgcagg atggcgcggc gagcaaaagc tacggcctgg cggttgcagc tctggcaggc 2340agcgtgcagg atggcgcggc gagcaaaagc tacggcctgg cggttgcagc tctggcaggc 2340
gtgccaaaag aggttattaa gcgcgcacgg caaaagctgc gtgagctgga aagcatttcg 2400gtgccaaaag aggttattaa gcgcgcacgg caaaagctgc gtgagctgga aagcatttcg 2400
ccgaacgccg ccgctacgca agtggatggt acgcaaatgt ctttgctgtc agtaccagaa 2460ccgaacgccg ccgctacgca agtggatggt acgcaaatgt ctttgctgtc agtaccagaa 2460
gaaacttcgc ctgcggtcga agctctggaa aatcttgatc cggattcact caccccgcgt 2520gaaacttcgc ctgcggtcga agctctggaa aatcttgatc cggattcact caccccgcgt 2520
caggcgctgg agtggattta tcgcttgaag agcctggtgt aa 2562caggcgctgg agtggatta tcgcttgaag agcctggtgt aa 2562
<210> 60<210> 60
<211> 853<211> 853
<212> PRT<212> PRT
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 60<400> 60
Met Ser Ala Ile Glu Asn Phe Asp Ala His Thr Pro Met Met Gln GlnMet Ser Ala Ile Glu Asn Phe Asp Ala His Thr Pro Met Met Gln Gln
1 5 10 151 5 10 15
Tyr Leu Arg Leu Lys Ala Gln His Pro Glu Ile Leu Leu Phe Tyr ArgTyr Leu Arg Leu Lys Ala Gln His Pro Glu Ile Leu Leu Phe Tyr Arg
20 25 30 20 25 30
Met Gly Asp Phe Tyr Glu Leu Phe Tyr Asp Asp Ala Lys Arg Ala SerMet Gly Asp Phe Tyr Glu Leu Phe Tyr Asp Asp Ala Lys Arg Ala Ser
35 40 45 35 40 45
Gln Leu Leu Asp Ile Ser Leu Thr Lys Arg Gly Ala Ser Ala Gly GluGln Leu Leu Asp Ile Ser Leu Thr Lys Arg Gly Ala Ser Ala Gly Glu
50 55 60 50 55 60
Pro Ile Pro Met Ala Gly Ile Pro Tyr His Ala Val Glu Asn Tyr LeuPro Ile Pro Met Ala Gly Ile Pro Tyr His Ala Val Glu Asn Tyr Leu
65 70 75 8065 70 75 80
Ala Lys Leu Val Asn Gln Gly Glu Ser Val Ala Ile Cys Glu Gln IleAla Lys Leu Val Asn Gln Gly Glu Ser Val Ala Ile Cys Glu Gln Ile
85 90 95 85 90 95
Gly Asp Pro Ala Thr Ser Lys Gly Pro Val Glu Arg Lys Val Val ArgGly Asp Pro Ala Thr Ser Lys Gly Pro Val Glu Arg Lys Val Val Arg
100 105 110 100 105 110
Ile Val Thr Pro Gly Thr Ile Ser Asp Glu Ala Leu Leu Gln Glu ArgIle Val Thr Pro Gly Thr Ile Ser Asp Glu Ala Leu Leu Gln Glu Arg
115 120 125 115 120 125
Gln Asp Asn Leu Leu Ala Ala Ile Trp Gln Asp Ser Lys Gly Phe GlyGln Asp Asn Leu Leu Ala Ala Ile Trp Gln Asp Ser Lys Gly Phe Gly
130 135 140 130 135 140
Tyr Ala Thr Leu Asp Ile Ser Ser Gly Arg Phe Arg Leu Ser Glu ProTyr Ala Thr Leu Asp Ile Ser Ser Gly Arg Phe Arg Leu Ser Glu Pro
145 150 155 160145 150 155 160
Ala Asp Arg Glu Thr Met Ala Ala Glu Leu Gln Arg Thr Asn Pro AlaAla Asp Arg Glu Thr Met Ala Ala Glu Leu Gln Arg Thr Asn Pro Ala
165 170 175 165 170 175
Glu Leu Leu Tyr Ala Glu Asp Phe Ala Glu Met Ser Leu Ile Glu GlyGlu Leu Leu Tyr Ala Glu Asp Phe Ala Glu Met Ser Leu Ile Glu Gly
180 185 190 180 185 190
Arg Arg Gly Leu Arg Arg Arg Pro Leu Trp Glu Phe Glu Ile Asp ThrArg Arg Gly Leu Arg Arg Arg Pro Leu Trp Glu Phe Glu Ile Asp Thr
195 200 205 195 200 205
Ala Arg Gln Gln Leu Asn Leu Gln Phe Gly Thr Arg Asp Leu Val GlyAla Arg Gln Gln Leu Asn Leu Gln Phe Gly Thr Arg Asp Leu Val Gly
210 215 220 210 215 220
Phe Gly Val Glu Asn Ala Pro Arg Gly Leu Cys Ala Ala Gly Cys LeuPhe Gly Val Glu Asn Ala Pro Arg Gly Leu Cys Ala Ala Gly Cys Leu
225 230 235 240225 230 235 240
Leu Gln Tyr Ala Lys Asp Thr Gln Arg Thr Thr Leu Pro His Ile ArgLeu Gln Tyr Ala Lys Asp Thr Gln Arg Thr Thr Leu Pro His Ile Arg
245 250 255 245 250 255
Ser Ile Thr Met Glu Arg Glu Gln Asp Ser Ile Ile Met Asp Ala AlaSer Ile Thr Met Glu Arg Glu Gln Asp Ser Ile Ile Met Asp Ala Ala
260 265 270 260 265 270
Thr Arg Arg Asn Leu Glu Ile Thr Gln Asn Leu Ala Gly Gly Ala GluThr Arg Arg Asn Leu Glu Ile Thr Gln Asn Leu Ala Gly Gly Ala Glu
275 280 285 275 280 285
Asn Thr Leu Ala Ser Val Leu Asp Cys Thr Val Thr Pro Met Gly SerAsn Thr Leu Ala Ser Val Leu Asp Cys Thr Val Thr Pro Met Gly Ser
290 295 300 290 295 300
Arg Met Leu Lys Arg Trp Leu His Met Pro Val Arg Asp Thr Arg ValArg Met Leu Lys Arg Trp Leu His Met Pro Val Arg Asp Thr Arg Val
305 310 315 320305 310 315 320
Leu Leu Glu Arg Gln Gln Thr Ile Gly Ala Leu Gln Asp Phe Thr AlaLeu Leu Glu Arg Gln Gln Thr Ile Gly Ala Leu Gln Asp Phe Thr Ala
325 330 335 325 330 335
Gly Leu Gln Pro Val Leu Arg Gln Val Gly Asp Leu Glu Arg Ile LeuGly Leu Gln Pro Val Leu Arg Gln Val Gly Asp Leu Glu Arg Ile Leu
340 345 350 340 345 350
Ala Arg Leu Ala Leu Arg Thr Ala Arg Pro Arg Asp Leu Ala Arg MetAla Arg Leu Ala Leu Arg Thr Ala Arg Pro Arg Asp Leu Ala Arg Met
355 360 365 355 360 365
Arg His Ala Phe Gln Gln Leu Pro Glu Leu Arg Ala Gln Leu Glu ThrArg His Ala Phe Gln Gln Leu Pro Glu Leu Arg Ala Gln Leu Glu Thr
370 375 380 370 375 380
Val Asp Ser Ala Pro Val Gln Ala Leu Arg Glu Lys Met Gly Glu PheVal Asp Ser Ala Pro Val Gln Ala Leu Arg Glu Lys Met Gly Glu Phe
385 390 395 400385 390 395 400
Ala Glu Leu Arg Asp Leu Leu Glu Arg Ala Ile Ile Asp Thr Pro ProAla Glu Leu Arg Asp Leu Leu Glu Arg Ala Ile Ile Asp Thr Pro Pro
405 410 415 405 410 415
Val Leu Val Arg Asp Gly Gly Val Ile Ala Ser Gly Tyr Asn Glu GluVal Leu Val Arg Asp Gly Gly Val Ile Ala Ser Gly Tyr Asn Glu Glu
420 425 430 420 425 430
Leu Asp Glu Trp Arg Ala Leu Ala Asp Gly Ala Thr Asp Tyr Leu GluLeu Asp Glu Trp Arg Ala Leu Ala Asp Gly Ala Thr Asp Tyr Leu Glu
435 440 445 435 440 445
Arg Leu Glu Val Arg Glu Arg Glu Arg Thr Gly Leu Asp Thr Leu LysArg Leu Glu Val Arg Glu Arg Glu Arg Thr Gly Leu Asp Thr Leu Lys
450 455 460 450 455 460
Val Gly Phe Asn Ala Val His Gly Tyr Tyr Ile Gln Ile Ser Arg GlyVal Gly Phe Asn Ala Val His Gly Tyr Tyr Ile Gln Ile Ser Arg Gly
465 470 475 480465 470 475 480
Gln Ser His Leu Ala Pro Ile Asn Tyr Met Arg Arg Gln Thr Leu LysGln Ser His Leu Ala Pro Ile Asn Tyr Met Arg Arg Gln Thr Leu Lys
485 490 495 485 490 495
Asn Ala Glu Arg Tyr Ile Ile Pro Glu Leu Lys Glu Tyr Glu Asp LysAsn Ala Glu Arg Tyr Ile Ile Pro Glu Leu Lys Glu Tyr Glu Asp Lys
500 505 510 500 505 510
Val Leu Thr Ser Lys Gly Lys Ala Leu Ala Leu Glu Lys Gln Leu TyrVal Leu Thr Ser Lys Gly Lys Ala Leu Ala Leu Glu Lys Gln Leu Tyr
515 520 525 515 520 525
Glu Glu Leu Phe Asp Leu Leu Leu Pro His Leu Glu Ala Leu Gln GlnGlu Glu Leu Phe Asp Leu Leu Leu Pro His Leu Glu Ala Leu Gln Gln
530 535 540 530 535 540
Ser Ala Ser Ala Leu Ala Glu Leu Asp Val Leu Val Asn Leu Ala GluSer Ala Ser Ala Leu Ala Glu Leu Asp Val Leu Val Asn Leu Ala Glu
545 550 555 560545 550 555 560
Arg Ala Tyr Thr Leu Asn Tyr Thr Cys Pro Thr Phe Ile Asp Lys ProArg Ala Tyr Thr Leu Asn Tyr Thr Cys Pro Thr Phe Ile Asp Lys Pro
565 570 575 565 570 575
Gly Ile Arg Ile Thr Glu Gly Arg His Pro Val Val Glu Gln Val LeuGly Ile Arg Ile Thr Glu Gly Arg His Pro Val Val Glu Gln Val Leu
580 585 590 580 585 590
Asn Glu Pro Phe Ile Ala Asn Pro Leu Asn Leu Ser Pro Gln Arg ArgAsn Glu Pro Phe Ile Ala Asn Pro Leu Asn Leu Ser Pro Gln Arg Arg
595 600 605 595 600 605
Cys Leu Ile Ile Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Tyr MetCys Leu Ile Ile Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Tyr Met
610 615 620 610 615 620
Arg Gln Thr Ala Leu Ile Ala Leu Met Ala Tyr Ile Gly Ser Tyr ValArg Gln Thr Ala Leu Ile Ala Leu Met Ala Tyr Ile Gly Ser Tyr Val
625 630 635 640625 630 635 640
Pro Ala Gln Lys Val Glu Ile Gly Pro Ile Asp Arg Ile Phe Thr ArgPro Ala Gln Lys Val Glu Ile Gly Pro Ile Asp Arg Ile Phe Thr Arg
645 650 655 645 650 655
Val Gly Ala Ala Asp Asp Leu Ala Ser Gly Arg Ser Thr Phe Met ValVal Gly Ala Ala Asp Asp Leu Ala Ser Gly Arg Ser Thr Phe Met Val
660 665 670 660 665 670
Glu Met Thr Glu Thr Ala Asn Ile Leu His Asn Ala Thr Glu Tyr SerGlu Met Thr Glu Thr Ala Asn Ile Leu His Asn Ala Thr Glu Tyr Ser
675 680 685 675 680 685
Leu Val Leu Met Asp Glu Ile Gly Arg Gly Thr Ser Thr Tyr Asp GlyLeu Val Leu Met Asp Glu Ile Gly Arg Gly Thr Ser Thr Tyr Asp Gly
690 695 700 690 695 700
Leu Ser Leu Ala Trp Ala Cys Ala Glu Asn Leu Ala Asn Lys Ile LysLeu Ser Leu Ala Trp Ala Cys Ala Glu Asn Leu Ala Asn Lys Ile Lys
705 710 715 720705 710 715 720
Ala Leu Cys Leu Phe Ala Thr His Tyr Phe Glu Leu Thr Gln Leu ProAla Leu Cys Leu Phe Ala Thr His Tyr Phe Glu Leu Thr Gln Leu Pro
725 730 735 725 730 735
Glu Lys Met Glu Gly Val Ala Asn Val His Leu Asp Ala Leu Glu HisGlu Lys Met Glu Gly Val Ala Asn Val His Leu Asp Ala Leu Glu His
740 745 750 740 745 750
Gly Asp Thr Ile Ala Phe Met His Ser Val Gln Asp Gly Ala Ala SerGly Asp Thr Ile Ala Phe Met His Ser Val Gln Asp Gly Ala Ala Ser
755 760 765 755 760 765
Lys Ser Tyr Gly Leu Ala Val Ala Ala Leu Ala Gly Val Pro Lys GluLys Ser Tyr Gly Leu Ala Val Ala Ala Leu Ala Gly Val Pro Lys Glu
770 775 780 770 775 780
Val Ile Lys Arg Ala Arg Gln Lys Leu Arg Glu Leu Glu Ser Ile SerVal Ile Lys Arg Ala Arg Gln Lys Leu Arg Glu Leu Glu Ser Ile Ser
785 790 795 800785 790 795 800
Pro Asn Ala Ala Ala Thr Gln Val Asp Gly Thr Gln Met Ser Leu LeuPro Asn Ala Ala Ala Thr Gln Val Asp Gly Thr Gln Met Ser Leu Leu
805 810 815 805 810 815
Ser Val Pro Glu Glu Thr Ser Pro Ala Val Glu Ala Leu Glu Asn LeuSer Val Pro Glu Glu Thr Ser Pro Ala Val Glu Ala Leu Glu Asn Leu
820 825 830 820 825 830
Asp Pro Asp Ser Leu Thr Pro Arg Gln Ala Leu Glu Trp Ile Tyr ArgAsp Pro Asp Ser Leu Thr Pro Arg Gln Ala Leu Glu Trp Ile Tyr Arg
835 840 845 835 840 845
Leu Lys Ser Leu ValLeu Lys Ser Leu Val
850 850
<210> 61<210> 61
<211> 2562<211> 2562
<212> DNA<212>DNA
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 61<400> 61
atgagtgcaa tagaaaattt cgacgcccat acgcccatga tgcagcagta tctcaggctg 60atgagtgcaa tagaaaattt cgacgcccat acgcccatga tgcagcagta tctcaggctg 60
aaagcccagc atcccgagat cctgctgttt taccggatgg gtgattttta tgaactgttt 120aaagcccagc atcccgagat cctgctgttt taccggatgg gtgattttta tgaactgttt 120
tatgacgacg caaaacgcgc gtcgcaactg ctggatattt cactgaccaa acgcggtgct 180tatgacgacg caaaacgcgc gtcgcaactg ctggatattt cactgaccaa acgcggtgct 180
tcggcgggag agccgatccc gatggcgggg attccctacc atgcggtgga aaactatctc 240tcggcggggag agccgatccc gatggcgggg attccctacc atgcggtgga aaactatctc 240
gccaaactgg tgaatcaggg agagtccgtt gccatctgcg aacaaattgg cgatccggcg 300gccaaactgg tgaatcaggg agagtccgtt gccatctgcg aacaaattgg cgatccggcg 300
accagcaaag gtccggttga gcgcaaagtt gtgcgtatcg ttacgccagg caccatctgc 360accagcaaag gtccggttga gcgcaaagtt gtgcgtatcg ttacgccagg caccatctgc 360
gatgaagccc tgttgcagga gcgtcaggac aacctgctgg cggctatctg gcaggacagc 420gatgaagccc tgttgcagga gcgtcaggac aacctgctgg cggctatctg gcaggacagc 420
aaaggtttcg gctacgcgac gctggatatc tgctccgggc gttttcgctg cagcgaaccg 480aaaggtttcg gctacgcgac gctggatatc tgctccgggc gttttcgctg cagcgaaccg 480
gctgaccgcg aaacgatggc ggcagaactg caacgcacta atcctgcgga actgctgtat 540gctgaccgcg aaacgatggc ggcagaactg caacgcacta atcctgcgga actgctgtat 540
gcagaagatt ttgctgaaat gtcgttaatt gaaggccgtc gcggcctgcg ccgtcgcccg 600gcagaagatt ttgctgaaat gtcgttaatt gaaggccgtc gcggcctgcg ccgtcgcccg 600
ctgtgggagt ttgaaatcga caccgcgcgc cagcagttga atctgcaatt tgggacccgc 660ctgtgggagt ttgaaatcga caccgcgcgc cagcagttga atctgcaatt tgggacccgc 660
gatctggtcg gttttggcgt cgagaacgcg ccgcgctgcc tttgtgctgc cggttgtctg 720gatctggtcg gttttggcgt cgagaacgcg ccgcgctgcc tttgtgctgc cggttgtctg 720
ttgcagtatg cgaaagatac ccaacgtacg actctgccgc atattcgttc catcaccatg 780ttgcagtatg cgaaagatac ccaacgtacg actctgccgc atattcgttc catcaccatg 780
gaacgtgagc aggacagcat cattatggat gccgcgacgc gtcgtaatct ggaaatcacc 840gaacgtgagc aggacagcat cattatggat gccgcgacgc gtcgtaatct ggaaatcacc 840
cagaacctgg cgggtggtgc ggaaaatacg ctggcttctg tgctcgactg caccgtcacg 900cagaacctgg cgggtggtgc ggaaaatacg ctggcttctg tgctcgactg caccgtcacg 900
ccgatgggca gccgtatgct gaaacgctgg ctgcatatgc cagtgcgcga tacccgcgtg 960ccgatgggca gccgtatgct gaaacgctgg ctgcatatgc cagtgcgcga tacccgcgtg 960
ttgcttgagc gccagcaaac tattggcgca ttgcaggatt tcaccgccgg gctacagccg 1020ttgcttgagc gccagcaaac tattggcgca ttgcaggatt tcaccgccgg gctacagccg 1020
gtactgcgtc aggtcggcga cctggaacgt attctggcac gtctggcttt acgaactgct 1080gtactgcgtc aggtcggcga cctggaacgt attctggcac gtctggcttt acgaactgct 1080
cgcccacgcg atctggcccg tatgcgccac gctttccagc aactgccgga gctgcgtgcg 1140cgcccacgcg atctggcccg tatgcgccac gctttccagc aactgccgga gctgcgtgcg 1140
cagttagaaa ctgtcgatag tgcaccggta caggcgctac gtgagaagat gggcgagttt 1200cagttagaaa ctgtcgatag tgcaccggta caggcgctac gtgagaagat gggcgagttt 1200
gccgagctgc gcgatctgct ggagcgagca atcatcgaca caccgccggt gctggtacgc 1260gccgagctgc gcgatctgct ggagcgagca atcatcgaca caccgccggt gctggtacgc 1260
gacggtggtg ttatcgcatc gggctataac gaagagctgg atgagtggcg cgcgctggct 1320gacggtggtg ttatcgcatc gggctataac gaagagctgg atgagtggcg cgcgctggct 1320
gacggcgcga ccgattatct ggagcgtctg tgcgtccgcg agcgtgaacg taccggcctg 1380gacggcgcga ccgattatct ggagcgtctg tgcgtccgcg agcgtgaacg taccggcctg 1380
gacacgctga aatgcggctt taatgcggtg cacggctact acattcaaat cagccgtggg 1440gacacgctga aatgcggctt taatgcggtg cacggctact aattcaaat cagccgtggg 1440
caaagccatc tggcacccat caactacatg cgtcgccaga cgctgaaaaa cgccgagcgc 1500caaagccatc tggcacccat caactacatg cgtcgccaga cgctgaaaaa cgccgagcgc 1500
tacatcattc cagagctaaa agagtacgaa gataaagttc tcacctcaaa aggcaaagca 1560tacatcattc cagagctaaa agagtacgaa gataaagttc tcacctcaaa aggcaaagca 1560
ctggcactgg aaaaacagct ttatgaagag ctgttcgacc tgctgttgcc gcatctggaa 1620ctggcactgg aaaaacagct ttatgaagag ctgttcgacc tgctgttgcc gcatctggaa 1620
gcgttgcaac agagcgcgag cgcgctggcg gaactcgacg tgctggttaa cctggcggaa 1680gcgttgcaac agagcgcgag cgcgctggcg gaactcgacg tgctggttaa cctggcggaa 1680
cgggcctata ccctgaacta cacctgcccg accttcattg ataaaccggg cattcgcatt 1740cgggcctata ccctgaacta cacctgcccg accttcattg ataaaccggg cattcgcatt 1740
accgaaggtc gccatccggt agttgaacaa gtactgaatg agccatttat cgccaacccg 1800accgaaggtc gccatccggt agttgaacaa gtactgaatg agccattat cgccaacccg 1800
ctgaatctgt cgccgcagcg ccgctgcttg atcatcaccg gtccgaacat gggcggtaaa 1860ctgaatctgt cgccgcagcg ccgctgcttg atcatcaccg gtccgaacat gggcggtaaa 1860
agtacctata tgcgccagac cgcactgatt gcgctgatgg cctacatcgg cagctatgta 1920agtacctata tgcgccagac cgcactgatt gcgctgatgg cctacatcgg cagctatgta 1920
ccggcacaaa aagtcgagat tggacctatc gatcgcatct ttacccgcgt aggcgcggca 1980ccggcacaaa aagtcgagat tggacctatc gatcgcatct ttacccgcgt aggcgcggca 1980
gatgacctgg cgtccgggcg ctcaaccttt atggtggaga tgactgaaac cgccaatatt 2040gatgacctgg cgtccgggcg ctcaaccttt atggtggaga tgactgaaac cgccaatatt 2040
ttacataacg ccaccgaata cagtctggtg ttaatggatg agatcgggcg tggaacgtcc 2100ttacataacg ccaccgaata cagtctggtg ttaatggatg agatcgggcg tggaacgtcc 2100
acctacgatg gtctgtcgct ggcgtgggcg tgcgcggaaa atctggcgaa taagattaag 2160acctacgatg gtctgtcgct ggcgtgggcg tgcgcggaaa atctggcgaa taagattaag 2160
gcattgtgct tatttgctac ccactatttc gagctgaccc agttaccgga gaaaatggaa 2220gcattgtgct tatttgctac ccactatttc gagctgaccc agttaccgga gaaaatggaa 2220
ggcgtcgcta acgtgcatct cgatgcactg gagcacggcg acaccattgc ctttatgcac 2280ggcgtcgcta acgtgcatct cgatgcactg gagcacggcg acaccattgc ctttatgcac 2280
agcgtgcagg atggcgcggc gagcaaaagc tacggcctgg cggttgcagc tctggcaggc 2340agcgtgcagg atggcgcggc gagcaaaagc tacggcctgg cggttgcagc tctggcaggc 2340
gtgccaaaag aggttattaa gcgcgcacgg caaaagctgc gtgagctgga aagcatttcg 2400gtgccaaaag aggttattaa gcgcgcacgg caaaagctgc gtgagctgga aagcatttcg 2400
ccgaacgccg ccgctacgca agtggatggt acgcaaatgt ctttgctgtc agtaccagaa 2460ccgaacgccg ccgctacgca agtggatggt acgcaaatgt ctttgctgtc agtaccagaa 2460
gaaacttcgc ctgcggtcga agctctggaa aatcttgatc cggattcact caccccgcgt 2520gaaacttcgc ctgcggtcga agctctggaa aatcttgatc cggattcact caccccgcgt 2520
caggcgctgg agtggattta tcgcttgaag agcctggtgt aa 2562caggcgctgg agtggatta tcgcttgaag agcctggtgt aa 2562
<210> 62<210> 62
<211> 853<211> 853
<212> PRT<212> PRT
<213> 人工序列(Artificial sequence)<213> Artificial sequence (Artificial sequence)
<400> 62<400> 62
Met Ser Ala Ile Glu Asn Phe Asp Ala His Thr Pro Met Met Gln GlnMet Ser Ala Ile Glu Asn Phe Asp Ala His Thr Pro Met Met Gln Gln
1 5 10 151 5 10 15
Tyr Leu Arg Leu Lys Ala Gln His Pro Glu Ile Leu Leu Phe Tyr ArgTyr Leu Arg Leu Lys Ala Gln His Pro Glu Ile Leu Leu Phe Tyr Arg
20 25 30 20 25 30
Met Gly Asp Phe Tyr Glu Leu Phe Tyr Asp Asp Ala Lys Arg Ala SerMet Gly Asp Phe Tyr Glu Leu Phe Tyr Asp Asp Ala Lys Arg Ala Ser
35 40 45 35 40 45
Gln Leu Leu Asp Ile Ser Leu Thr Lys Arg Gly Ala Ser Ala Gly GluGln Leu Leu Asp Ile Ser Leu Thr Lys Arg Gly Ala Ser Ala Gly Glu
50 55 60 50 55 60
Pro Ile Pro Met Ala Gly Ile Pro Tyr His Ala Val Glu Asn Tyr LeuPro Ile Pro Met Ala Gly Ile Pro Tyr His Ala Val Glu Asn Tyr Leu
65 70 75 8065 70 75 80
Ala Lys Leu Val Asn Gln Gly Glu Ser Val Ala Ile Cys Glu Gln IleAla Lys Leu Val Asn Gln Gly Glu Ser Val Ala Ile Cys Glu Gln Ile
85 90 95 85 90 95
Gly Asp Pro Ala Thr Ser Lys Gly Pro Val Glu Arg Lys Val Val ArgGly Asp Pro Ala Thr Ser Lys Gly Pro Val Glu Arg Lys Val Val Arg
100 105 110 100 105 110
Ile Val Thr Pro Gly Thr Ile Cys Asp Glu Ala Leu Leu Gln Glu ArgIle Val Thr Pro Gly Thr Ile Cys Asp Glu Ala Leu Leu Gln Glu Arg
115 120 125 115 120 125
Gln Asp Asn Leu Leu Ala Ala Ile Trp Gln Asp Ser Lys Gly Phe GlyGln Asp Asn Leu Leu Ala Ala Ile Trp Gln Asp Ser Lys Gly Phe Gly
130 135 140 130 135 140
Tyr Ala Thr Leu Asp Ile Cys Ser Gly Arg Phe Arg Cys Ser Glu ProTyr Ala Thr Leu Asp Ile Cys Ser Gly Arg Phe Arg Cys Ser Glu Pro
145 150 155 160145 150 155 160
Ala Asp Arg Glu Thr Met Ala Ala Glu Leu Gln Arg Thr Asn Pro AlaAla Asp Arg Glu Thr Met Ala Ala Glu Leu Gln Arg Thr Asn Pro Ala
165 170 175 165 170 175
Glu Leu Leu Tyr Ala Glu Asp Phe Ala Glu Met Ser Leu Ile Glu GlyGlu Leu Leu Tyr Ala Glu Asp Phe Ala Glu Met Ser Leu Ile Glu Gly
180 185 190 180 185 190
Arg Arg Gly Leu Arg Arg Arg Pro Leu Trp Glu Phe Glu Ile Asp ThrArg Arg Gly Leu Arg Arg Arg Pro Leu Trp Glu Phe Glu Ile Asp Thr
195 200 205 195 200 205
Ala Arg Gln Gln Leu Asn Leu Gln Phe Gly Thr Arg Asp Leu Val GlyAla Arg Gln Gln Leu Asn Leu Gln Phe Gly Thr Arg Asp Leu Val Gly
210 215 220 210 215 220
Phe Gly Val Glu Asn Ala Pro Arg Cys Leu Cys Ala Ala Gly Cys LeuPhe Gly Val Glu Asn Ala Pro Arg Cys Leu Cys Ala Ala Gly Cys Leu
225 230 235 240225 230 235 240
Leu Gln Tyr Ala Lys Asp Thr Gln Arg Thr Thr Leu Pro His Ile ArgLeu Gln Tyr Ala Lys Asp Thr Gln Arg Thr Thr Leu Pro His Ile Arg
245 250 255 245 250 255
Ser Ile Thr Met Glu Arg Glu Gln Asp Ser Ile Ile Met Asp Ala AlaSer Ile Thr Met Glu Arg Glu Gln Asp Ser Ile Ile Met Asp Ala Ala
260 265 270 260 265 270
Thr Arg Arg Asn Leu Glu Ile Thr Gln Asn Leu Ala Gly Gly Ala GluThr Arg Arg Asn Leu Glu Ile Thr Gln Asn Leu Ala Gly Gly Ala Glu
275 280 285 275 280 285
Asn Thr Leu Ala Ser Val Leu Asp Cys Thr Val Thr Pro Met Gly SerAsn Thr Leu Ala Ser Val Leu Asp Cys Thr Val Thr Pro Met Gly Ser
290 295 300 290 295 300
Arg Met Leu Lys Arg Trp Leu His Met Pro Val Arg Asp Thr Arg ValArg Met Leu Lys Arg Trp Leu His Met Pro Val Arg Asp Thr Arg Val
305 310 315 320305 310 315 320
Leu Leu Glu Arg Gln Gln Thr Ile Gly Ala Leu Gln Asp Phe Thr AlaLeu Leu Glu Arg Gln Gln Thr Ile Gly Ala Leu Gln Asp Phe Thr Ala
325 330 335 325 330 335
Gly Leu Gln Pro Val Leu Arg Gln Val Gly Asp Leu Glu Arg Ile LeuGly Leu Gln Pro Val Leu Arg Gln Val Gly Asp Leu Glu Arg Ile Leu
340 345 350 340 345 350
Ala Arg Leu Ala Leu Arg Thr Ala Arg Pro Arg Asp Leu Ala Arg MetAla Arg Leu Ala Leu Arg Thr Ala Arg Pro Arg Asp Leu Ala Arg Met
355 360 365 355 360 365
Arg His Ala Phe Gln Gln Leu Pro Glu Leu Arg Ala Gln Leu Glu ThrArg His Ala Phe Gln Gln Leu Pro Glu Leu Arg Ala Gln Leu Glu Thr
370 375 380 370 375 380
Val Asp Ser Ala Pro Val Gln Ala Leu Arg Glu Lys Met Gly Glu PheVal Asp Ser Ala Pro Val Gln Ala Leu Arg Glu Lys Met Gly Glu Phe
385 390 395 400385 390 395 400
Ala Glu Leu Arg Asp Leu Leu Glu Arg Ala Ile Ile Asp Thr Pro ProAla Glu Leu Arg Asp Leu Leu Glu Arg Ala Ile Ile Asp Thr Pro Pro
405 410 415 405 410 415
Val Leu Val Arg Asp Gly Gly Val Ile Ala Ser Gly Tyr Asn Glu GluVal Leu Val Arg Asp Gly Gly Val Ile Ala Ser Gly Tyr Asn Glu Glu
420 425 430 420 425 430
Leu Asp Glu Trp Arg Ala Leu Ala Asp Gly Ala Thr Asp Tyr Leu GluLeu Asp Glu Trp Arg Ala Leu Ala Asp Gly Ala Thr Asp Tyr Leu Glu
435 440 445 435 440 445
Arg Leu Cys Val Arg Glu Arg Glu Arg Thr Gly Leu Asp Thr Leu LysArg Leu Cys Val Arg Glu Arg Glu Arg Thr Gly Leu Asp Thr Leu Lys
450 455 460 450 455 460
Cys Gly Phe Asn Ala Val His Gly Tyr Tyr Ile Gln Ile Ser Arg GlyCys Gly Phe Asn Ala Val His Gly Tyr Tyr Ile Gln Ile Ser Arg Gly
465 470 475 480465 470 475 480
Gln Ser His Leu Ala Pro Ile Asn Tyr Met Arg Arg Gln Thr Leu LysGln Ser His Leu Ala Pro Ile Asn Tyr Met Arg Arg Gln Thr Leu Lys
485 490 495 485 490 495
Asn Ala Glu Arg Tyr Ile Ile Pro Glu Leu Lys Glu Tyr Glu Asp LysAsn Ala Glu Arg Tyr Ile Ile Pro Glu Leu Lys Glu Tyr Glu Asp Lys
500 505 510 500 505 510
Val Leu Thr Ser Lys Gly Lys Ala Leu Ala Leu Glu Lys Gln Leu TyrVal Leu Thr Ser Lys Gly Lys Ala Leu Ala Leu Glu Lys Gln Leu Tyr
515 520 525 515 520 525
Glu Glu Leu Phe Asp Leu Leu Leu Pro His Leu Glu Ala Leu Gln GlnGlu Glu Leu Phe Asp Leu Leu Leu Pro His Leu Glu Ala Leu Gln Gln
530 535 540 530 535 540
Ser Ala Ser Ala Leu Ala Glu Leu Asp Val Leu Val Asn Leu Ala GluSer Ala Ser Ala Leu Ala Glu Leu Asp Val Leu Val Asn Leu Ala Glu
545 550 555 560545 550 555 560
Arg Ala Tyr Thr Leu Asn Tyr Thr Cys Pro Thr Phe Ile Asp Lys ProArg Ala Tyr Thr Leu Asn Tyr Thr Cys Pro Thr Phe Ile Asp Lys Pro
565 570 575 565 570 575
Gly Ile Arg Ile Thr Glu Gly Arg His Pro Val Val Glu Gln Val LeuGly Ile Arg Ile Thr Glu Gly Arg His Pro Val Val Glu Gln Val Leu
580 585 590 580 585 590
Asn Glu Pro Phe Ile Ala Asn Pro Leu Asn Leu Ser Pro Gln Arg ArgAsn Glu Pro Phe Ile Ala Asn Pro Leu Asn Leu Ser Pro Gln Arg Arg
595 600 605 595 600 605
Cys Leu Ile Ile Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Tyr MetCys Leu Ile Ile Thr Gly Pro Asn Met Gly Gly Lys Ser Thr Tyr Met
610 615 620 610 615 620
Arg Gln Thr Ala Leu Ile Ala Leu Met Ala Tyr Ile Gly Ser Tyr ValArg Gln Thr Ala Leu Ile Ala Leu Met Ala Tyr Ile Gly Ser Tyr Val
625 630 635 640625 630 635 640
Pro Ala Gln Lys Val Glu Ile Gly Pro Ile Asp Arg Ile Phe Thr ArgPro Ala Gln Lys Val Glu Ile Gly Pro Ile Asp Arg Ile Phe Thr Arg
645 650 655 645 650 655
Val Gly Ala Ala Asp Asp Leu Ala Ser Gly Arg Ser Thr Phe Met ValVal Gly Ala Ala Asp Asp Leu Ala Ser Gly Arg Ser Thr Phe Met Val
660 665 670 660 665 670
Glu Met Thr Glu Thr Ala Asn Ile Leu His Asn Ala Thr Glu Tyr SerGlu Met Thr Glu Thr Ala Asn Ile Leu His Asn Ala Thr Glu Tyr Ser
675 680 685 675 680 685
Leu Val Leu Met Asp Glu Ile Gly Arg Gly Thr Ser Thr Tyr Asp GlyLeu Val Leu Met Asp Glu Ile Gly Arg Gly Thr Ser Thr Tyr Asp Gly
690 695 700 690 695 700
Leu Ser Leu Ala Trp Ala Cys Ala Glu Asn Leu Ala Asn Lys Ile LysLeu Ser Leu Ala Trp Ala Cys Ala Glu Asn Leu Ala Asn Lys Ile Lys
705 710 715 720705 710 715 720
Ala Leu Cys Leu Phe Ala Thr His Tyr Phe Glu Leu Thr Gln Leu ProAla Leu Cys Leu Phe Ala Thr His Tyr Phe Glu Leu Thr Gln Leu Pro
725 730 735 725 730 735
Glu Lys Met Glu Gly Val Ala Asn Val His Leu Asp Ala Leu Glu HisGlu Lys Met Glu Gly Val Ala Asn Val His Leu Asp Ala Leu Glu His
740 745 750 740 745 750
Gly Asp Thr Ile Ala Phe Met His Ser Val Gln Asp Gly Ala Ala SerGly Asp Thr Ile Ala Phe Met His Ser Val Gln Asp Gly Ala Ala Ser
755 760 765 755 760 765
Lys Ser Tyr Gly Leu Ala Val Ala Ala Leu Ala Gly Val Pro Lys GluLys Ser Tyr Gly Leu Ala Val Ala Ala Leu Ala Gly Val Pro Lys Glu
770 775 780 770 775 780
Val Ile Lys Arg Ala Arg Gln Lys Leu Arg Glu Leu Glu Ser Ile SerVal Ile Lys Arg Ala Arg Gln Lys Leu Arg Glu Leu Glu Ser Ile Ser
785 790 795 800785 790 795 800
Pro Asn Ala Ala Ala Thr Gln Val Asp Gly Thr Gln Met Ser Leu LeuPro Asn Ala Ala Ala Thr Gln Val Asp Gly Thr Gln Met Ser Leu Leu
805 810 815 805 810 815
Ser Val Pro Glu Glu Thr Ser Pro Ala Val Glu Ala Leu Glu Asn LeuSer Val Pro Glu Glu Thr Ser Pro Ala Val Glu Ala Leu Glu Asn Leu
820 825 830 820 825 830
Asp Pro Asp Ser Leu Thr Pro Arg Gln Ala Leu Glu Trp Ile Tyr ArgAsp Pro Asp Ser Leu Thr Pro Arg Gln Ala Leu Glu Trp Ile Tyr Arg
835 840 845 835 840 845
Leu Lys Ser Leu ValLeu Lys Ser Leu Val
850 850
Claims (11)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910518166.5A CN112079903B (en) | 2019-06-14 | 2019-06-14 | Mutant of mismatching binding protein and coding gene thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910518166.5A CN112079903B (en) | 2019-06-14 | 2019-06-14 | Mutant of mismatching binding protein and coding gene thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112079903A CN112079903A (en) | 2020-12-15 |
CN112079903B true CN112079903B (en) | 2022-11-18 |
Family
ID=73734171
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910518166.5A Active CN112079903B (en) | 2019-06-14 | 2019-06-14 | Mutant of mismatching binding protein and coding gene thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112079903B (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115141282B (en) * | 2021-03-30 | 2024-09-03 | 中国科学院青岛生物能源与过程研究所 | Mismatch binding protein mutant and application thereof in high-throughput gene error correction of automatic workstation |
CN115820625A (en) * | 2022-12-05 | 2023-03-21 | 中国科学技术大学 | Method, device and preparation method and kit for removing mismatched DNA |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6294325B1 (en) * | 1996-07-05 | 2001-09-25 | The Mount Sinai School Of Medicine Of The City University Of New York | Cloning and expression of thermostable multi genes and proteins and uses thereof |
CN101365794A (en) * | 2005-08-12 | 2009-02-11 | 巴斯福植物科学有限公司 | Nucleic acid sequences encoding proteins associated with abiotic stress response and plant cells and plants with increased tolerance to environmental stress |
CN103755815A (en) * | 2014-01-17 | 2014-04-30 | 中国科学技术大学 | Novel fusion protein of mispairing binding protein and cellulose binding domain 3 and method thereof for removing errors in DNA (Desoxvribose Nucleic Acid) synthesis at high flux |
CN107109380A (en) * | 2014-10-07 | 2017-08-29 | 牛津纳米孔技术公司 | Enzyme through modification |
CN109868271A (en) * | 2019-03-21 | 2019-06-11 | 江苏师范大学 | DNA is carried out using chip synthetic oligonucleotide library to shuffle the method for library de novo formation |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20170016017A1 (en) * | 2014-07-31 | 2017-01-19 | Michael E Fromm | Method for increasing plant yields |
-
2019
- 2019-06-14 CN CN201910518166.5A patent/CN112079903B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6294325B1 (en) * | 1996-07-05 | 2001-09-25 | The Mount Sinai School Of Medicine Of The City University Of New York | Cloning and expression of thermostable multi genes and proteins and uses thereof |
CN101365794A (en) * | 2005-08-12 | 2009-02-11 | 巴斯福植物科学有限公司 | Nucleic acid sequences encoding proteins associated with abiotic stress response and plant cells and plants with increased tolerance to environmental stress |
CN103755815A (en) * | 2014-01-17 | 2014-04-30 | 中国科学技术大学 | Novel fusion protein of mispairing binding protein and cellulose binding domain 3 and method thereof for removing errors in DNA (Desoxvribose Nucleic Acid) synthesis at high flux |
CN107109380A (en) * | 2014-10-07 | 2017-08-29 | 牛津纳米孔技术公司 | Enzyme through modification |
CN109868271A (en) * | 2019-03-21 | 2019-06-11 | 江苏师范大学 | DNA is carried out using chip synthetic oligonucleotide library to shuffle the method for library de novo formation |
Non-Patent Citations (2)
Title |
---|
DNA化学合成中的错误去除;王冬梅;《生命的化学》;20121231;第32卷(第1期);34-38 * |
Error removal in microchip-synthesized DNA using immobilized MutS;Wan, Wen等;《NUCLEIC ACIDS RESEARCH》;20140514;第42卷(第12期);e102 * |
Also Published As
Publication number | Publication date |
---|---|
CN112079903A (en) | 2020-12-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102177236B (en) | RNA polymerase mutant with improved functions | |
CN113785053B (en) | Reverse transcriptase with increased enzymatic activity and use thereof | |
CN114561374B (en) | Thermophilic endonuclease mutant and preparation method and application thereof | |
CN103755815B (en) | A kind of method that mispairing associated proteins is wrong with the fusion protein of cellulose binding domain 3 and removal DNA synthesis thereof | |
WO2013138339A1 (en) | Gene shuffling methods | |
CN112079903B (en) | Mutant of mismatching binding protein and coding gene thereof | |
US20200354694A1 (en) | Ketoreductase mutant and application thereof | |
CN110846291A (en) | Amine dehydrogenase mutant with improved thermal stability and construction and application of genetically engineered bacterium thereof | |
CN111073871A (en) | DNA polymerase mutants with improved thermostability and methods for their construction and applications | |
CN116410952B (en) | Mutant Taq DNA polymerase, coding gene, recombinant expression vector, recombinant bacterium and application thereof | |
WO2014205882A1 (en) | High-fidelity dna polymerase, and preparation and use thereof | |
CN112359032B (en) | Mutant esterase and application thereof, recombinant vector and preparation method and application thereof, recombinant engineering bacteria and application thereof | |
CN115141282B (en) | Mismatch binding protein mutant and application thereof in high-throughput gene error correction of automatic workstation | |
WO2021081713A1 (en) | Transaminase mutant and application thereof | |
CN114829593B (en) | Chimeric DNA polymerase and application thereof | |
CN118308393A (en) | Expression system of escherichia coli without antibiotic resistance marker, construction method and application thereof | |
CN116004564B (en) | Reverse transcription mutant and application thereof | |
CN113564141B (en) | Single-cell genome amplification enzyme mutant and application thereof | |
CN114480342A (en) | Mutant PET hydrolase, recombinant vector, recombinant engineering bacterium and application thereof | |
CN114250206A (en) | Methyltransferase mutant, recombinant vector, recombinant engineering bacterium and application thereof | |
EP4036237A1 (en) | Pwo-neqssb polymerase, method of its preparation, recombinant plasmid, primers and the use of polymerase | |
CN112877308B (en) | DNA polymerase I large fragment with reverse transcriptase activity, and coding gene, preparation method and application thereof | |
WO2023082266A1 (en) | Chimeric dna polymerase and use thereof | |
CN114317485B (en) | Recombinant murine leukemia virus reverse transcriptase mutant, preparation method and application | |
WO2010105276A1 (en) | Method for constructing mutagenesis libraries in situ |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |