CN113355325B - Preparation method and application of humanized ACE2 gene modified mouse embryonic stem cell model - Google Patents
Preparation method and application of humanized ACE2 gene modified mouse embryonic stem cell model Download PDFInfo
- Publication number
- CN113355325B CN113355325B CN202110245653.6A CN202110245653A CN113355325B CN 113355325 B CN113355325 B CN 113355325B CN 202110245653 A CN202110245653 A CN 202110245653A CN 113355325 B CN113355325 B CN 113355325B
- Authority
- CN
- China
- Prior art keywords
- animal
- gene
- cell
- sgrna
- ace2
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 210000001671 embryonic stem cell Anatomy 0.000 title claims abstract description 47
- 101150054399 ace2 gene Proteins 0.000 title claims abstract description 36
- 238000002360 preparation method Methods 0.000 title claims abstract description 5
- 239000013598 vector Substances 0.000 claims abstract description 87
- 230000008685 targeting Effects 0.000 claims abstract description 64
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 60
- 108090000975 Angiotensin-converting enzyme 2 Proteins 0.000 claims abstract description 57
- 108091027544 Subgenomic mRNA Proteins 0.000 claims abstract description 49
- 239000012634 fragment Substances 0.000 claims abstract description 41
- 239000003814 drug Substances 0.000 claims abstract description 3
- 210000004027 cell Anatomy 0.000 claims description 137
- 241000699666 Mus <mouse, genus> Species 0.000 claims description 75
- 241000282414 Homo sapiens Species 0.000 claims description 67
- 108020004414 DNA Proteins 0.000 claims description 62
- 238000000034 method Methods 0.000 claims description 60
- 102000053723 Angiotensin-converting enzyme 2 Human genes 0.000 claims description 56
- 241001465754 Metazoa Species 0.000 claims description 56
- 241000699670 Mus sp. Species 0.000 claims description 30
- 210000001519 tissue Anatomy 0.000 claims description 29
- 238000011144 upstream manufacturing Methods 0.000 claims description 27
- 241000124008 Mammalia Species 0.000 claims description 23
- 241000283984 Rodentia Species 0.000 claims description 21
- 238000010171 animal model Methods 0.000 claims description 19
- 230000014509 gene expression Effects 0.000 claims description 16
- 210000004102 animal cell Anatomy 0.000 claims description 15
- 238000002744 homologous recombination Methods 0.000 claims description 13
- 230000006801 homologous recombination Effects 0.000 claims description 13
- 238000012216 screening Methods 0.000 claims description 13
- 238000010276 construction Methods 0.000 claims description 12
- 238000011160 research Methods 0.000 claims description 11
- 102000053602 DNA Human genes 0.000 claims description 9
- 238000010362 genome editing Methods 0.000 claims description 9
- 238000011577 humanized mouse model Methods 0.000 claims description 9
- 238000003780 insertion Methods 0.000 claims description 9
- 230000037431 insertion Effects 0.000 claims description 9
- 108091033409 CRISPR Proteins 0.000 claims description 6
- 101100433975 Homo sapiens ACE2 gene Proteins 0.000 claims description 6
- 238000003776 cleavage reaction Methods 0.000 claims description 6
- 239000003550 marker Substances 0.000 claims description 6
- 230000007017 scission Effects 0.000 claims description 6
- 201000010099 disease Diseases 0.000 claims description 5
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 5
- 210000004602 germ cell Anatomy 0.000 claims description 5
- 238000011161 development Methods 0.000 claims description 4
- 238000004519 manufacturing process Methods 0.000 claims description 4
- 238000010354 CRISPR gene editing Methods 0.000 claims description 3
- 108091026898 Leader sequence (mRNA) Proteins 0.000 claims description 3
- 230000015572 biosynthetic process Effects 0.000 claims description 3
- 210000002459 blastocyst Anatomy 0.000 claims description 3
- 210000001124 body fluid Anatomy 0.000 claims description 3
- 239000010839 body fluid Substances 0.000 claims description 3
- 238000012258 culturing Methods 0.000 claims description 3
- 239000000284 extract Substances 0.000 claims description 3
- 239000007924 injection Substances 0.000 claims description 3
- 238000002347 injection Methods 0.000 claims description 3
- 208000035199 Tetraploidy Diseases 0.000 claims description 2
- 229940079593 drug Drugs 0.000 claims description 2
- 238000011065 in-situ storage Methods 0.000 claims description 2
- 238000003786 synthesis reaction Methods 0.000 claims 2
- 230000001900 immune effect Effects 0.000 claims 1
- 230000002906 microbiologic effect Effects 0.000 claims 1
- 230000003285 pharmacodynamic effect Effects 0.000 claims 1
- 230000000144 pharmacologic effect Effects 0.000 claims 1
- 230000001225 therapeutic effect Effects 0.000 claims 1
- 238000005520 cutting process Methods 0.000 abstract description 9
- 150000007523 nucleic acids Chemical group 0.000 description 57
- 102000039446 nucleic acids Human genes 0.000 description 37
- 108020004707 nucleic acids Proteins 0.000 description 37
- 241000700159 Rattus Species 0.000 description 33
- 108091028043 Nucleic acid sequence Proteins 0.000 description 26
- 239000002773 nucleotide Substances 0.000 description 22
- 125000003729 nucleotide group Chemical group 0.000 description 22
- 239000013612 plasmid Substances 0.000 description 22
- 101000929928 Homo sapiens Angiotensin-converting enzyme 2 Proteins 0.000 description 19
- 241001678559 COVID-19 virus Species 0.000 description 18
- 238000012239 gene modification Methods 0.000 description 18
- 102000048657 human ACE2 Human genes 0.000 description 18
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 17
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 16
- 238000012163 sequencing technique Methods 0.000 description 16
- 238000006243 chemical reaction Methods 0.000 description 14
- 102000040430 polynucleotide Human genes 0.000 description 13
- 108091033319 polynucleotide Proteins 0.000 description 13
- 239000002157 polynucleotide Substances 0.000 description 13
- 239000000047 product Substances 0.000 description 13
- 108091026890 Coding region Proteins 0.000 description 12
- 241000699800 Cricetinae Species 0.000 description 11
- 238000000137 annealing Methods 0.000 description 11
- 238000004520 electroporation Methods 0.000 description 11
- 239000013604 expression vector Substances 0.000 description 11
- 150000001413 amino acids Chemical group 0.000 description 10
- 230000000295 complement effect Effects 0.000 description 10
- 102000004169 proteins and genes Human genes 0.000 description 10
- 238000005516 engineering process Methods 0.000 description 9
- 230000005017 genetic modification Effects 0.000 description 9
- 235000013617 genetically modified food Nutrition 0.000 description 9
- 229950010131 puromycin Drugs 0.000 description 9
- 101100378094 Mus musculus Ace2 gene Proteins 0.000 description 8
- 239000001963 growth medium Substances 0.000 description 8
- 238000012795 verification Methods 0.000 description 8
- 241000282412 Homo Species 0.000 description 7
- 238000010586 diagram Methods 0.000 description 7
- 230000000694 effects Effects 0.000 description 7
- 210000002257 embryonic structure Anatomy 0.000 description 7
- 238000002474 experimental method Methods 0.000 description 7
- 210000005260 human cell Anatomy 0.000 description 7
- 108090000765 processed proteins & peptides Proteins 0.000 description 7
- 210000001161 mammalian embryo Anatomy 0.000 description 6
- 239000000203 mixture Substances 0.000 description 6
- 210000003101 oviduct Anatomy 0.000 description 6
- 238000001890 transfection Methods 0.000 description 6
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 5
- 239000002253 acid Substances 0.000 description 5
- 239000000872 buffer Substances 0.000 description 5
- 238000012217 deletion Methods 0.000 description 5
- 230000037430 deletion Effects 0.000 description 5
- 238000007877 drug screening Methods 0.000 description 5
- 210000004072 lung Anatomy 0.000 description 5
- 210000004962 mammalian cell Anatomy 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 238000010172 mouse model Methods 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- 230000001105 regulatory effect Effects 0.000 description 5
- 230000002194 synthesizing effect Effects 0.000 description 5
- 108020003589 5' Untranslated Regions Proteins 0.000 description 4
- 102000012410 DNA Ligases Human genes 0.000 description 4
- 108010061982 DNA Ligases Proteins 0.000 description 4
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 4
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 4
- NWIBSHFKIJFRCO-WUDYKRTCSA-N Mytomycin Chemical compound C1N2C(C(C(C)=C(N)C3=O)=O)=C3[C@@H](COC(N)=O)[C@@]2(OC)[C@@H]2[C@H]1N2 NWIBSHFKIJFRCO-WUDYKRTCSA-N 0.000 description 4
- 241000283973 Oryctolagus cuniculus Species 0.000 description 4
- 238000013461 design Methods 0.000 description 4
- 210000003527 eukaryotic cell Anatomy 0.000 description 4
- 210000002950 fibroblast Anatomy 0.000 description 4
- 238000010166 immunofluorescence Methods 0.000 description 4
- 229920001184 polypeptide Polymers 0.000 description 4
- 102000004196 processed proteins & peptides Human genes 0.000 description 4
- 239000000523 sample Substances 0.000 description 4
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 3
- 241000282693 Cercopithecidae Species 0.000 description 3
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 3
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 3
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 3
- 241000699660 Mus musculus Species 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- 238000011529 RT qPCR Methods 0.000 description 3
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 3
- 229930006000 Sucrose Natural products 0.000 description 3
- 241000282887 Suidae Species 0.000 description 3
- 108700009124 Transcription Initiation Site Proteins 0.000 description 3
- 210000004504 adult stem cell Anatomy 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 235000011089 carbon dioxide Nutrition 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 230000018109 developmental process Effects 0.000 description 3
- 230000004927 fusion Effects 0.000 description 3
- 108010050848 glycylleucine Proteins 0.000 description 3
- 239000010410 layer Substances 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 238000003753 real-time PCR Methods 0.000 description 3
- 210000000813 small intestine Anatomy 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 210000000130 stem cell Anatomy 0.000 description 3
- 239000005720 sucrose Substances 0.000 description 3
- 238000011830 transgenic mouse model Methods 0.000 description 3
- 230000003612 virological effect Effects 0.000 description 3
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 241000282472 Canis lupus familiaris Species 0.000 description 2
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 2
- 241000206602 Eukaryota Species 0.000 description 2
- 108010046276 FLP recombinase Proteins 0.000 description 2
- 241001076388 Fimbria Species 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- 101000827703 Homo sapiens Polyphosphoinositide phosphatase Proteins 0.000 description 2
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- 102100023591 Polyphosphoinositide phosphatase Human genes 0.000 description 2
- 238000002123 RNA extraction Methods 0.000 description 2
- 241000315672 SARS coronavirus Species 0.000 description 2
- PXIPVTKHYLBLMZ-UHFFFAOYSA-N Sodium azide Chemical compound [Na+].[N-]=[N+]=[N-] PXIPVTKHYLBLMZ-UHFFFAOYSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 108091023045 Untranslated Region Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 230000024245 cell differentiation Effects 0.000 description 2
- 230000007910 cell fusion Effects 0.000 description 2
- 230000002759 chromosomal effect Effects 0.000 description 2
- 230000000052 comparative effect Effects 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 238000001976 enzyme digestion Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 210000005265 lung cell Anatomy 0.000 description 2
- 239000012139 lysis buffer Substances 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 229960004857 mitomycin Drugs 0.000 description 2
- 210000001672 ovary Anatomy 0.000 description 2
- 230000008506 pathogenesis Effects 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 230000003362 replicative effect Effects 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 230000009261 transgenic effect Effects 0.000 description 2
- 238000011282 treatment Methods 0.000 description 2
- 108010080629 tryptophan-leucine Proteins 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- 241001430294 unidentified retrovirus Species 0.000 description 2
- 210000004291 uterus Anatomy 0.000 description 2
- 230000009385 viral infection Effects 0.000 description 2
- 239000013603 viral vector Substances 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- BFSVOASYOCHEOV-UHFFFAOYSA-N 2-diethylaminoethanol Chemical compound CCN(CC)CCO BFSVOASYOCHEOV-UHFFFAOYSA-N 0.000 description 1
- FWBHETKCLVMNFS-UHFFFAOYSA-N 4',6-Diamino-2-phenylindol Chemical compound C1=CC(C(=N)N)=CC=C1C1=CC2=CC=C(C(N)=N)C=C2N1 FWBHETKCLVMNFS-UHFFFAOYSA-N 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 1
- HGRBNYQIMKTUNT-XVYDVKMFSA-N Ala-Asn-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HGRBNYQIMKTUNT-XVYDVKMFSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- HJCMDXDYPOUFDY-WHFBIAKZSA-N Ala-Gln Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O HJCMDXDYPOUFDY-WHFBIAKZSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- 108010076441 Ala-His-His Proteins 0.000 description 1
- ATAKEVCGTRZKLI-UWJYBYFXSA-N Ala-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 ATAKEVCGTRZKLI-UWJYBYFXSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- PXAFZDXYEIIUTF-LKTVYLICSA-N Ala-Trp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXAFZDXYEIIUTF-LKTVYLICSA-N 0.000 description 1
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- 108700028369 Alleles Proteins 0.000 description 1
- 208000031873 Animal Disease Models Diseases 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 1
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 1
- OGSQONVYSTZIJB-WDSOQIARSA-N Arg-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OGSQONVYSTZIJB-WDSOQIARSA-N 0.000 description 1
- XFXZKCRBBOVJKS-BVSLBCMMSA-N Arg-Phe-Trp Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 XFXZKCRBBOVJKS-BVSLBCMMSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 1
- UEONJSPBTSWKOI-CIUDSAMLSA-N Asn-Gln-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O UEONJSPBTSWKOI-CIUDSAMLSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- QEQVUHQQYDZUEN-GUBZILKMSA-N Asn-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N QEQVUHQQYDZUEN-GUBZILKMSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 1
- KNENKKKUYGEZIO-FXQIFTODSA-N Asn-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N KNENKKKUYGEZIO-FXQIFTODSA-N 0.000 description 1
- FTNRWCPWDWRPAV-BZSNNMDCSA-N Asn-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTNRWCPWDWRPAV-BZSNNMDCSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- ZNYKKCADEQAZKA-FXQIFTODSA-N Asn-Ser-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O ZNYKKCADEQAZKA-FXQIFTODSA-N 0.000 description 1
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 1
- RTFXPCYMDYBZNQ-SRVKXCTJSA-N Asn-Tyr-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O RTFXPCYMDYBZNQ-SRVKXCTJSA-N 0.000 description 1
- QUCCLIXMVPIVOB-BZSNNMDCSA-N Asn-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N QUCCLIXMVPIVOB-BZSNNMDCSA-N 0.000 description 1
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 1
- LTDGPJKGJDIBQD-LAEOZQHASA-N Asn-Val-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LTDGPJKGJDIBQD-LAEOZQHASA-N 0.000 description 1
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 1
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 1
- JRBVWZLHBGYZNY-QEJZJMRPSA-N Asp-Gln-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRBVWZLHBGYZNY-QEJZJMRPSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- RNAQPBOOJRDICC-BPUTZDHNSA-N Asp-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N RNAQPBOOJRDICC-BPUTZDHNSA-N 0.000 description 1
- BOXNGMVEVOGXOJ-UBHSHLNASA-N Asp-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N BOXNGMVEVOGXOJ-UBHSHLNASA-N 0.000 description 1
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 1
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 208000025721 COVID-19 Diseases 0.000 description 1
- AQGNHMOJWBZFQQ-UHFFFAOYSA-N CT 99021 Chemical compound CC1=CNC(C=2C(=NC(NCCNC=3N=CC(=CC=3)C#N)=NC=2)C=2C(=CC(Cl)=CC=2)Cl)=N1 AQGNHMOJWBZFQQ-UHFFFAOYSA-N 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 241000700198 Cavia Species 0.000 description 1
- 241000282994 Cervidae Species 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 108010067770 Endopeptidase K Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- MWLYSLMKFXWZPW-ZPFDUUQYSA-N Gln-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCC(N)=O MWLYSLMKFXWZPW-ZPFDUUQYSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 1
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 1
- GQTNWYFWSUFFRA-KKUMJFAQSA-N Gln-Met-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GQTNWYFWSUFFRA-KKUMJFAQSA-N 0.000 description 1
- HHRAEXBUNGTOGZ-IHRRRGAJSA-N Gln-Phe-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O HHRAEXBUNGTOGZ-IHRRRGAJSA-N 0.000 description 1
- MQJDLNRXBOELJW-KKUMJFAQSA-N Gln-Pro-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O MQJDLNRXBOELJW-KKUMJFAQSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 1
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 1
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 1
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 1
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 1
- JPUNZXVHHRZMNL-XIRDDKMYSA-N Glu-Pro-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JPUNZXVHHRZMNL-XIRDDKMYSA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- RZMXBFUSQNLEQF-QEJZJMRPSA-N Glu-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N RZMXBFUSQNLEQF-QEJZJMRPSA-N 0.000 description 1
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 1
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 1
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 1
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 1
- ONSARSFSJHTMFJ-STQMWFEESA-N Gly-Trp-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ONSARSFSJHTMFJ-STQMWFEESA-N 0.000 description 1
- 102000019058 Glycogen Synthase Kinase 3 beta Human genes 0.000 description 1
- 108010051975 Glycogen Synthase Kinase 3 beta Proteins 0.000 description 1
- 241000282575 Gorilla Species 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- ZNPRMNDAFQKATM-LKTVYLICSA-N His-Ala-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZNPRMNDAFQKATM-LKTVYLICSA-N 0.000 description 1
- RXVOMIADLXPJGW-GUBZILKMSA-N His-Asp-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RXVOMIADLXPJGW-GUBZILKMSA-N 0.000 description 1
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 1
- AKEDPWJFQULLPE-IUCAKERBSA-N His-Glu-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O AKEDPWJFQULLPE-IUCAKERBSA-N 0.000 description 1
- WJGSTIMGSIWHJX-HVTMNAMFSA-N His-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WJGSTIMGSIWHJX-HVTMNAMFSA-N 0.000 description 1
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 1
- ALPXXNRQBMRCPZ-MEYUZBJRSA-N His-Thr-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ALPXXNRQBMRCPZ-MEYUZBJRSA-N 0.000 description 1
- 102000003839 Human Proteins Human genes 0.000 description 1
- 108090000144 Human Proteins Proteins 0.000 description 1
- CWJQMCPYXNVMBS-STECZYCISA-N Ile-Arg-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N CWJQMCPYXNVMBS-STECZYCISA-N 0.000 description 1
- ZZHGKECPZXPXJF-PCBIJLKTSA-N Ile-Asn-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZZHGKECPZXPXJF-PCBIJLKTSA-N 0.000 description 1
- OVPYIUNCVSOVNF-KQXIARHKSA-N Ile-Gln-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N OVPYIUNCVSOVNF-KQXIARHKSA-N 0.000 description 1
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 1
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 1
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 1
- SVZFKLBRCYCIIY-CYDGBPFRSA-N Ile-Pro-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVZFKLBRCYCIIY-CYDGBPFRSA-N 0.000 description 1
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 1
- ZDNNDIJTUHQCAM-MXAVVETBSA-N Ile-Ser-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ZDNNDIJTUHQCAM-MXAVVETBSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- 241000713666 Lentivirus Species 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 1
- QKIBIXAQKAFZGL-GUBZILKMSA-N Leu-Cys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QKIBIXAQKAFZGL-GUBZILKMSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- ARRIJPQRBWRNLT-DCAQKATOSA-N Leu-Met-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ARRIJPQRBWRNLT-DCAQKATOSA-N 0.000 description 1
- NHRINZSPIUXYQZ-DCAQKATOSA-N Leu-Met-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N NHRINZSPIUXYQZ-DCAQKATOSA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- KTOIECMYZZGVSI-BZSNNMDCSA-N Leu-Phe-His Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KTOIECMYZZGVSI-BZSNNMDCSA-N 0.000 description 1
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 1
- HGZHSNBZDOLMLH-DCAQKATOSA-N Lys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N HGZHSNBZDOLMLH-DCAQKATOSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- HIIZIQUUHIXUJY-GUBZILKMSA-N Lys-Asp-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HIIZIQUUHIXUJY-GUBZILKMSA-N 0.000 description 1
- RDIILCRAWOSDOQ-CIUDSAMLSA-N Lys-Cys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RDIILCRAWOSDOQ-CIUDSAMLSA-N 0.000 description 1
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 1
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- RYOLKFYZBHMYFW-WDSOQIARSA-N Lys-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 RYOLKFYZBHMYFW-WDSOQIARSA-N 0.000 description 1
- GVKINWYYLOLEFQ-XIRDDKMYSA-N Lys-Trp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O GVKINWYYLOLEFQ-XIRDDKMYSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- 229940124647 MEK inhibitor Drugs 0.000 description 1
- 241000282560 Macaca mulatta Species 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- PZUUMQPMHBJJKE-AVGNSLFASA-N Met-Leu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N PZUUMQPMHBJJKE-AVGNSLFASA-N 0.000 description 1
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 1
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 1
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 1
- 241000282339 Mustela Species 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 102000011931 Nucleoproteins Human genes 0.000 description 1
- 108010061100 Nucleoproteins Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 238000010222 PCR analysis Methods 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 1
- UNLYPPYNDXHGDG-IHRRRGAJSA-N Phe-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UNLYPPYNDXHGDG-IHRRRGAJSA-N 0.000 description 1
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 1
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- GRVMHFCZUIYNKQ-UFYCRDLUSA-N Phe-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GRVMHFCZUIYNKQ-UFYCRDLUSA-N 0.000 description 1
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 1
- YTGGLKWSVIRECD-JBACZVJFSA-N Phe-Trp-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 YTGGLKWSVIRECD-JBACZVJFSA-N 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- FCCBQBZXIAZNIG-LSJOCFKGSA-N Pro-Ala-His Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O FCCBQBZXIAZNIG-LSJOCFKGSA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- KQCCDMFIALWGTL-GUBZILKMSA-N Pro-Asn-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 KQCCDMFIALWGTL-GUBZILKMSA-N 0.000 description 1
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 1
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 1
- UPJGUQPLYWTISV-GUBZILKMSA-N Pro-Gln-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UPJGUQPLYWTISV-GUBZILKMSA-N 0.000 description 1
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 1
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 description 1
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- DLZBBDSPTJBOOD-BPNCWPANSA-N Pro-Tyr-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O DLZBBDSPTJBOOD-BPNCWPANSA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 208000037847 SARS-CoV-2-infection Diseases 0.000 description 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- FKZSXTKZLPPHQU-GQGQLFGLSA-N Ser-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CO)N FKZSXTKZLPPHQU-GQGQLFGLSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- WMZVVNLPHFSUPA-BPUTZDHNSA-N Ser-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 WMZVVNLPHFSUPA-BPUTZDHNSA-N 0.000 description 1
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 1
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 1
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 1
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 1
- YJVJPJPHHFOVMG-VEVYYDQMSA-N Thr-Met-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YJVJPJPHHFOVMG-VEVYYDQMSA-N 0.000 description 1
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 1
- NLWDSYKZUPRMBJ-IEGACIPQSA-N Thr-Trp-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O NLWDSYKZUPRMBJ-IEGACIPQSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- GTNCSPKYWCJZAC-XIRDDKMYSA-N Trp-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GTNCSPKYWCJZAC-XIRDDKMYSA-N 0.000 description 1
- NFVQCNMGJILYMI-SZMVWBNQSA-N Trp-Met-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NFVQCNMGJILYMI-SZMVWBNQSA-N 0.000 description 1
- IYHRKILQAQWODS-VJBMBRPKSA-N Trp-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IYHRKILQAQWODS-VJBMBRPKSA-N 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 1
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 1
- YRBHLWWGSSQICE-IHRRRGAJSA-N Tyr-Asp-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O YRBHLWWGSSQICE-IHRRRGAJSA-N 0.000 description 1
- TZXFLDNBYYGLKA-BZSNNMDCSA-N Tyr-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 TZXFLDNBYYGLKA-BZSNNMDCSA-N 0.000 description 1
- XKDOQXAXKFQWQJ-SRVKXCTJSA-N Tyr-Cys-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O XKDOQXAXKFQWQJ-SRVKXCTJSA-N 0.000 description 1
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 1
- LMLBOGIOLHZXOT-JYJNAYRXSA-N Tyr-Glu-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O LMLBOGIOLHZXOT-JYJNAYRXSA-N 0.000 description 1
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 1
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 1
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 1
- AKRHKDCELJLTMD-BVSLBCMMSA-N Tyr-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N AKRHKDCELJLTMD-BVSLBCMMSA-N 0.000 description 1
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- PFMAFMPJJSHNDW-ZKWXMUAHSA-N Val-Cys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N PFMAFMPJJSHNDW-ZKWXMUAHSA-N 0.000 description 1
- YLHLNFUXDBOAGX-DCAQKATOSA-N Val-Cys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YLHLNFUXDBOAGX-DCAQKATOSA-N 0.000 description 1
- NYTKXWLZSNRILS-IFFSRLJSSA-N Val-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N)O NYTKXWLZSNRILS-IFFSRLJSSA-N 0.000 description 1
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 210000001766 X chromosome Anatomy 0.000 description 1
- 210000001015 abdomen Anatomy 0.000 description 1
- 210000000683 abdominal cavity Anatomy 0.000 description 1
- 230000003187 abdominal effect Effects 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 101150063416 add gene Proteins 0.000 description 1
- 108010031014 alanyl-histidyl-leucyl-leucine Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- -1 biomolecules Substances 0.000 description 1
- 229940098773 bovine serum albumin Drugs 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 235000011148 calcium chloride Nutrition 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 239000006143 cell culture medium Substances 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 238000005336 cracking Methods 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000009795 derivation Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 210000002308 embryonic cell Anatomy 0.000 description 1
- 108010030074 endodeoxyribonuclease MluI Proteins 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000003203 everyday effect Effects 0.000 description 1
- 230000004720 fertilization Effects 0.000 description 1
- 238000011010 flushing procedure Methods 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 238000010363 gene targeting Methods 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 238000002350 laparotomy Methods 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 210000003712 lysosome Anatomy 0.000 description 1
- 230000001868 lysosomic effect Effects 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 241001515942 marmosets Species 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 239000002829 mitogen activated protein kinase inhibitor Substances 0.000 description 1
- 210000000472 morula Anatomy 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 239000007923 nasal drop Substances 0.000 description 1
- 229940100662 nasal drops Drugs 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 210000002220 organoid Anatomy 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 210000004681 ovum Anatomy 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 230000001575 pathological effect Effects 0.000 description 1
- 108010073832 phenylalanyl-leucyl-leucyl-arginyl-asparagine Proteins 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 210000001778 pluripotent stem cell Anatomy 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 230000019525 primary metabolic process Effects 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000029058 respiratory gaseous exchange Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000012772 sequence design Methods 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 239000002356 single layer Substances 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 238000013517 stratification Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 230000009885 systemic effect Effects 0.000 description 1
- 230000002381 testicular Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 229940126585 therapeutic drug Drugs 0.000 description 1
- 210000003014 totipotent stem cell Anatomy 0.000 description 1
- 238000007862 touchdown PCR Methods 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 238000011269 treatment regimen Methods 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- 108010029599 tyrosyl-glutamyl-tryptophan Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 210000004340 zona pellucida Anatomy 0.000 description 1
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K67/00—Rearing or breeding animals, not otherwise provided for; New or modified breeds of animals
- A01K67/027—New or modified breeds of vertebrates
- A01K67/0275—Genetically modified vertebrates, e.g. transgenic
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
- C12N15/902—Stable introduction of foreign DNA into chromosome using homologous recombination
- C12N15/907—Stable introduction of foreign DNA into chromosome using homologous recombination in mammalian cells
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2207/00—Modified animals
- A01K2207/15—Humanized animals
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2217/00—Genetically modified animals
- A01K2217/07—Animals genetically altered by homologous recombination
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2227/00—Animals characterised by species
- A01K2227/10—Mammal
- A01K2227/105—Murine
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2267/00—Animals characterised by purpose
- A01K2267/03—Animal model, e.g. for test or diseases
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/10—Plasmid DNA
- C12N2800/106—Plasmid DNA for vertebrates
- C12N2800/107—Plasmid DNA for vertebrates for mammalian
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2810/00—Vectors comprising a targeting moiety
- C12N2810/10—Vectors comprising a non-peptidic targeting moiety
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Zoology (AREA)
- Biotechnology (AREA)
- Organic Chemistry (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Chemical & Material Sciences (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Environmental Sciences (AREA)
- Cell Biology (AREA)
- Mycology (AREA)
- Veterinary Medicine (AREA)
- Animal Behavior & Ethology (AREA)
- Animal Husbandry (AREA)
- Biodiversity & Conservation Biology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
Description
技术领域Technical Field
本发明涉及生物医药领域,尤其涉及一种人源化ACE2基因改造小鼠胚胎干细胞模型制备方法及应用。The present invention relates to the field of biomedicine, and in particular to a method for preparing a humanized ACE2 gene-modified mouse embryonic stem cell model and application thereof.
背景技术Background Art
虽然现阶段已有在Vero E6 cells等细胞模型上感染2019-nCoV进行药物筛选,但细胞模型的单一性远不能比拟动物模型优势。Although 2019-nCoV has been infected in cell models such as Vero E6 cells for drug screening, the singleness of cell models is far from comparable to the advantages of animal models.
实验动物疾病模型对于研究人类疾病发生的病因、发病机制、开发防治技术和治疗药物是不可缺少的研究工具。常见的实验动物包括小鼠、大鼠、豚鼠、地鼠(仓鼠)、兔、犬、猴、猪和鱼等。然而,人类和动物的基因和蛋白质序列还是存在不少差异,许多人类蛋白质不能与动物的同源蛋白质结合而产生生物活性,导致许多临床试验的结果也与动物实验的结果不相符。Experimental animal disease models are indispensable research tools for studying the causes and pathogenesis of human diseases, and developing prevention and treatment technologies and therapeutic drugs. Common experimental animals include mice, rats, guinea pigs, hamsters, rabbits, dogs, monkeys, pigs, and fish. However, there are still many differences in the gene and protein sequences between humans and animals. Many human proteins cannot bind to homologous proteins in animals to produce biological activity, resulting in the results of many clinical trials not being consistent with the results of animal experiments.
随着基因工程技术的不断发展和成熟,用人类细胞或基因替代或置换动物的内源性同类细胞或基因,以建立更接近人类的生物体系或疾病模型,建立人源化实验动物模型(humanized animal model1),已经为临床上新的治疗方法或手段提供了重要工具。其中基因人源化动物模型,即利用基因遗传操作技术,用人类正常或突变基因替换动物同类基因,可在动物体内建立更接近人类的正常或突变,体系的塞因大源化呦物模型。因大源化动物不但本身具有重要应用价值,如通过基因人源化还可改进和提升细胞人源化小鼠CN107815468 A2/28页模型,更重要的是,由于人类基因片段的存在,动物体内可表达或部分表达含有人类功能的蛋白质,从而大大减少人和动物的临床实验差异,为在动物水平进行药物筛选提供了可能。With the continuous development and maturity of genetic engineering technology, human cells or genes are used to replace or displace the endogenous cells or genes of animals to establish a biological system or disease model closer to humans, and to establish a humanized experimental animal model (humanized animal model1), which has provided an important tool for new clinical treatment methods or means. Among them, the gene humanized animal model, that is, using gene genetic manipulation technology, replaces the same gene of animals with normal or mutant genes of humans, and can establish a normal or mutant system closer to humans in the animal body. Because the humanized animal itself not only has important application value, such as through gene humanization, it can also improve and enhance the cell humanized mouse CN107815468 A2/28 page model, more importantly, due to the presence of human gene fragments, the animal body can express or partially express proteins containing human functions, thereby greatly reducing the clinical experimental differences between humans and animals, and providing the possibility for drug screening at the animal level.
2019-nCoV与SARS冠状病毒一样都以血管紧张素转化酶2(Angiotensin-converting enzyme 2,ACE2)作为感染人类的关键靶点,其中2019-nCoV除感染人类外还可感染多种哺乳动物如猴子、猪、兔子、白鼬、猩猩等,但小鼠和大鼠除外。猴子、猪以及兔子等作为动物模型感染2019-nCoV用作药物筛选而言,其生长周期较长且体型较大难以大批量操作,而相对容易大批量操作的小鼠模型却对2019-nCoV不易感。Like SARS coronavirus, 2019-nCoV uses angiotensin-converting enzyme 2 (ACE2) as a key target for infecting humans. In addition to infecting humans, 2019-nCoV can also infect a variety of mammals such as monkeys, pigs, rabbits, ferrets, and gorillas, but mice and rats are excluded. Monkeys, pigs, and rabbits are used as animal models to infect 2019-nCoV for drug screening, but their growth cycle is long and their body size is large, making it difficult to operate in large quantities. However, the mouse model, which is relatively easy to operate in large quantities, is not susceptible to 2019-nCoV.
发明内容Summary of the invention
由于人源ACE2和鼠源ACE2在基因序列上存在一定差异,有文献报道:病毒感染后,人源ACE2较鼠源ACE2对SARS-CoV的敏感性更强,病理症状更为明显。目前的临床研究急需更好的动物模型。Because there are certain differences in the gene sequences of human ACE2 and mouse ACE2, there are literature reports that after virus infection, human ACE2 is more sensitive to SARS-CoV than mouse ACE2, and the pathological symptoms are more obvious. Current clinical research urgently needs better animal models.
已有hACE2转基因小鼠被证实可感染2019-nCoV并出现肺部组织典型致病特征(https://www.biorxiv.org/content/10.1101/2020.02.07.939389v3#disqus_thread)。但hACE2转基因小鼠为全身性过表达hACE2小鼠模型,并不能模拟ACE2时空组织表达特征,本发明人源化ACE2小鼠模型为原小鼠Ace2位点特异性表达hACE2小鼠模型,可模拟原mACE2表达特征,具有组织特异性,更为严谨的模拟人感染2019-nCoV发病过程。It has been confirmed that hACE2 transgenic mice can be infected with 2019-nCoV and show typical pathogenic characteristics in lung tissue (https://www.biorxiv.org/content/10.1101/2020.02.07.939389v3#disqus_thread). However, hACE2 transgenic mice are systemic hACE2 overexpression mouse models and cannot simulate the spatiotemporal tissue expression characteristics of ACE2. The humanized ACE2 mouse model of the present invention is an original mouse Ace2 site-specific hACE2 mouse model, which can simulate the original mACE2 expression characteristics, has tissue specificity, and more rigorously simulates the pathogenesis of human infection with 2019-nCoV.
所以本公开将通过在小鼠Ace2位点表达人源化ACE2的方法大批量制备2019-nCoV易感小鼠人源化ACE2动物模型来对2019-nCoV进行药物筛选和疾病研究等应用。Therefore, the present invention will prepare a 2019-nCoV susceptible mouse humanized ACE2 animal model in large quantities by expressing humanized ACE2 at the mouse Ace2 site for drug screening and disease research on 2019-nCoV.
本发明的目的之一在于提供一种特异性的核酸片段序列。One of the purposes of the present invention is to provide a specific nucleic acid fragment sequence.
本发明另一个目的在于提供一种特异性的靶向ACE2基因的sgRNA序列。Another object of the present invention is to provide a sgRNA sequence that specifically targets the ACE2 gene.
本发明另一个目的在于提供一种特异性的靶向ACE2基因的靶向载体。Another object of the present invention is to provide a targeting vector that specifically targets the ACE2 gene.
本发明另一个目的在于提供一种ACE2基因人源化细胞株。Another object of the present invention is to provide an ACE2 gene humanized cell line.
本发明另一个目的在于提供一种ACE2基因人源化细胞株的构建方法。Another object of the present invention is to provide a method for constructing an ACE2 gene humanized cell line.
本发明另一个目的在于提供一种ACE2基因人源化细胞株。Another object of the present invention is to provide an ACE2 gene humanized cell line.
本发明另一个目的在于提供一种基因人源化动物的构建方法。Another object of the present invention is to provide a method for constructing a genetically humanized animal.
本发明还有一个目的在于提供一种人源化ACE2基因改造小鼠胚胎干细胞模型。Another object of the present invention is to provide a humanized ACE2 gene-modified mouse embryonic stem cell model.
一方面,本发明提供了一种核酸片段序列,所述核酸片段序列如SEQ ID NO:1或SEQ ID NO:2所示。本公开的核酸片段序列的靶向特异性和切割目的基因的效率都较好。In one aspect, the present invention provides a nucleic acid fragment sequence, and the nucleic acid fragment sequence is shown in SEQ ID NO: 1 or SEQ ID NO: 2. The nucleic acid fragment sequence disclosed in the present invention has good targeting specificity and efficiency in cutting target genes.
一些实施例中,合成序列如SEQ ID NO:1所示的核酸的上下游引物序列分别如SEQID NO:6和SEQ ID NO:7所示。一些实施例中,合成序列如SEQ ID NO:2所示的核酸的上下游引物序列分别如SEQ ID NO:4和SEQ ID NO:5所示。In some embodiments, the upstream and downstream primer sequences of the nucleic acid synthesized as SEQ ID NO: 1 are shown as SEQ ID NO: 6 and SEQ ID NO: 7, respectively. In some embodiments, the upstream and downstream primer sequences of the nucleic acid synthesized as SEQ ID NO: 2 are shown as SEQ ID NO: 4 and SEQ ID NO: 5, respectively.
一些实施例中,所述核酸为sgRNA。In some embodiments, the nucleic acid is sgRNA.
一些实施例中,所述sgRNA片段序列靶向的基因为动物ACE2基因,用于定点切割ACE2基因。In some embodiments, the gene targeted by the sgRNA fragment sequence is the animal ACE2 gene, which is used for site-specific cutting of the ACE2 gene.
一些实施例中,所述sgRNA片段序列用于构建人源化动物模型。In some embodiments, the sgRNA fragment sequence is used to construct a humanized animal model.
一些实施例中,所述sgRNA片段序列的切割点位于所述的动物ACE2基因1号外显子。In some embodiments, the cutting point of the sgRNA fragment sequence is located in
一些实施例中,所述切割点靶点处于所述的动物ACE2基因5’UTR后140bp碱基范围内。In some embodiments, the cleavage point target is within the 140bp base range after the 5'UTR of the animal ACE2 gene.
一些实施例中,所述动物为哺乳动物。In some embodiments, the animal is a mammal.
一些实施例中,所述哺乳动物为啮齿类动物。In some embodiments, the mammal is a rodent.
一些实施例中,所述啮齿类动物为鼠。In some embodiments, the rodent is a mouse.
另一方面,本发明提供了一种靶向载体,包含5’同源臂、人源ACE2基因片段和3’同源臂;所述的靶向载体靶向动物ACE2基因,所述动物ACE2基因含有与权利要求1或2所述的sgRNA片段序列相对应的切割点。所述5’同源臂为与在所述目标基因组基因座处的5’靶序列同源的5’同源臂。On the other hand, the present invention provides a targeting vector comprising a 5' homology arm, a human ACE2 gene fragment and a 3' homology arm; the targeting vector targets an animal ACE2 gene, and the animal ACE2 gene contains a cleavage point corresponding to the sgRNA fragment sequence of
一些实施例中,所述靶向载体用于将人源ACE2基因的CDS序列插入到动物基因的启动子和5’UTR区域序列后,利用动物目的基因启动子启动人目的基因表达。In some embodiments, the targeting vector is used to insert the CDS sequence of the human ACE2 gene into the promoter and 5'UTR region sequence of the animal gene, and then use the animal target gene promoter to initiate the expression of the human target gene.
一些实施例中,所述打靶载体还包含3’同源臂,所述3’同源臂为与在所述目标基因组基因座处的3’靶序列同源的3’同源臂。In some embodiments, the targeting vector further comprises a 3' homology arm, wherein the 3' homology arm is a 3' homology arm homologous to the 3' target sequence at the target genomic locus.
一些实施例中,所述5’同源臂序列如SEQ ID NO:13所示,3’同源臂的序列如SEQID NO:14所示。In some embodiments, the 5' homology arm sequence is shown as SEQ ID NO:13, and the 3' homology arm sequence is shown as SEQ ID NO:14.
一些实施例中,所述ACE2基因的CDS序列如SEQ ID NO:10所示。In some embodiments, the CDS sequence of the ACE2 gene is shown in SEQ ID NO:10.
一些实施例中,所述靶向载体还包含如SEQ ID NO:12所示的SV40 polyA序列,所述SV40 polyA序列位于所述的CDS序列之后。In some embodiments, the targeting vector further comprises an SV40 polyA sequence as shown in SEQ ID NO: 12, and the SV40 polyA sequence is located after the CDS sequence.
一些实施例中,所述靶向载体还包含筛选标志PGK-Puro。In some embodiments, the targeting vector further comprises a selection marker PGK-Puro.
一些实施例中,所述动物为哺乳动物。In some embodiments, the animal is a mammal.
一些实施例中,所述动物为鼠。In some embodiments, the animal is a mouse.
一些实施例中,所述靶向载体还包含pX330载体。In some embodiments, the targeting vector further comprises a pX330 vector.
另一方面,本发明提供了一种编码所述的sgRNA片段序列的DNA分子。On the other hand, the present invention provides a DNA molecule encoding the sgRNA fragment sequence.
另一方面,本发明提供了一种细胞,包含所述的sgRNA片段序列或所述的靶向载体或所述的DNA分子。On the other hand, the present invention provides a cell comprising the sgRNA fragment sequence or the targeting vector or the DNA molecule.
另一方面,本发明提供了所述的sgRNA片段序列在制备基因人源化细胞株或人源化动物模型中的应用。On the other hand, the present invention provides the use of the sgRNA fragment sequence in preparing a genetic humanized cell line or a humanized animal model.
一些实施例中,所述基因为ACE2。In some embodiments, the gene is ACE2.
一些实施例中,所述动物为哺乳动物。In some embodiments, the animal is a mammal.
一些实施例中,所述动物为啮齿类动物。In some embodiments, the animal is a rodent.
一些实施例中,所述动物为鼠。In some embodiments, the animal is a mouse.
另一方面,本发明提供了一种人源化动物细胞株的构建方法,所述方法中使用了所述的sgRNA片段序列。On the other hand, the present invention provides a method for constructing a humanized animal cell line, wherein the sgRNA fragment sequence is used.
一些实施例中,所述方法包括将人源目的基因导入动物细胞中,使得目的基因在动物细胞内原位表达人源全长目的基因的CDS。In some embodiments, the method includes introducing a human target gene into an animal cell, so that the target gene expresses the CDS of the human full-length target gene in situ in the animal cell.
一些实施例中,所述目的基因为ACE2。In some embodiments, the target gene is ACE2.
一些实施例中,所述动物为哺乳动物。In some embodiments, the animal is a mammal.
一些实施例中,所述动物为啮齿类动物。In some embodiments, the animal is a rodent.
一些实施例中,所述动物为鼠。In some embodiments, the animal is a mouse.
一些实施例中,所述细胞为胚胎干细胞。In some embodiments, the cells are embryonic stem cells.
一些实施例中,所述方法使用了基因编辑技术进行人源化动物细胞模型的构建。In some embodiments, the method uses gene editing technology to construct a humanized animal cell model.
一些实施例中,所述基因编辑技术包括基于胚胎干细胞的DNA同源重组技术、CRISPR/Cas9技术。In some embodiments, the gene editing technology includes DNA homologous recombination technology based on embryonic stem cells and CRISPR/Cas9 technology.
一些实施例中,所述基因编辑技术为基于CRISPR/Cas9的基因编辑技术进行ACE2人源化动物的构建。In some embodiments, the gene editing technology is based on CRISPR/Cas9 gene editing technology to construct ACE2 humanized animals.
一些实施例中,所述构建方法包括以下步骤:(1)构建权利要求3所述的靶向载体;(2)将构建的靶向载体和连接有sgRNA的载体导入动物来源的胚胎干细胞中;(3)将步骤(2)中的胚胎干细胞培养成克隆,即得。In some embodiments, the construction method comprises the following steps: (1) constructing the targeting vector of
一些实施例中,所述动物为哺乳动物。In some embodiments, the animal is a mammal.
一些实施例中,所述哺乳动物为啮齿类动物。In some embodiments, the mammal is a rodent.
一些实施例中,所述啮齿类动物为鼠。In some embodiments, the rodent is a mouse.
另一方面,本发明提供了所述的方法制备得到的ACE2基因人源化细胞株。On the other hand, the present invention provides an ACE2 gene humanized cell line prepared by the method described.
一些实施例中,所述细胞株为人源化小鼠胚胎干细胞,经特定分化或制备成小鼠模型后可表达人的全长ACE2蛋白,同时内源ACE2蛋白表达降低或消失。In some embodiments, the cell line is a humanized mouse embryonic stem cell, which can express the full-length human ACE2 protein after specific differentiation or preparation into a mouse model, while the expression of endogenous ACE2 protein is reduced or disappears.
另一方面,本发明提供了一种基因人源化动物模型的构建方法,包括将所述的动物细胞注入动物体内。On the other hand, the present invention provides a method for constructing a genetically humanized animal model, comprising injecting the animal cells into an animal.
一些实施例中,所述基因人源化动物模型的构建方法包括将所述的动物细胞通过四倍体补偿或囊胚注射方法形成动物个体。In some embodiments, the method for constructing the genetically humanized animal model includes forming animal individuals by tetraploid compensation or blastocyst injection of the animal cells.
另一方面,本发明提供了一种人源化鼠或其子孙的组织、体液、细胞、以及它们的破碎物或提取物,所述的人源化鼠是利用基因人源化动物模型的构建方法得到的。On the other hand, the present invention provides a humanized mouse or its offspring's tissue, body fluid, cell, and their fragments or extracts, wherein the humanized mouse is obtained by using the method for constructing a genetic humanized animal model.
利用本发明的方法成功得到了人源化ACE2基因改造小鼠。Humanized ACE2 gene-modified mice were successfully obtained using the method of the present invention.
在一些实施方式中,所述人源化ACE2基因改造小鼠在RNA水平上表达人ACE2并具有组织特异性。In some embodiments, the humanized ACE2 genetically modified mouse expresses human ACE2 at the RNA level and has tissue specificity.
在一些实施方式中,所述人源化ACE2基因改造小鼠在蛋白水平上表达人ACE2并具有组织特异性。In some embodiments, the humanized ACE2 genetically modified mouse expresses human ACE2 at the protein level and has tissue specificity.
在一些实施方式中,本发明研究显示利用本发明的方法获得的人源化ACE2基因改造小鼠对SARS-CoV-2易感。从而成功地获得了对2019-nCoV易感的小鼠,从而能够很好地为2019-nCoV的药物筛选或其他研究提供很好的动物模型。具有很好的现实意义。In some embodiments, the present invention shows that the humanized ACE2 gene-modified mice obtained by the method of the present invention are susceptible to SARS-CoV-2. Thus, mice susceptible to 2019-nCoV are successfully obtained, which can provide a good animal model for drug screening or other research on 2019-nCoV. It has great practical significance.
还有一方面,本发明提供了来源于所述的构建方法得到的人源化动物模型或其后代在制造人类抗体,或者作为药理学、免疫学、微生物学和医学研究的模型系统中的应用,或在生产和利用动物实验疾病模型,用于病原学研究和/或用于开发新的诊断策略和/或治疗策略中的应用,或在筛选、验证、评价或研究ACE2基因功能、ACE2抗体、针对ACE2靶点的药物、药效研究方面的用途。In still another aspect, the present invention provides the use of a humanized animal model or its progeny obtained by the construction method in the manufacture of human antibodies, or as a model system for pharmacology, immunology, microbiology and medical research, or in the production and use of animal experimental disease models for etiology research and/or for the development of new diagnostic strategies and/or treatment strategies, or in the screening, verification, evaluation or research of ACE2 gene function, ACE2 antibodies, drugs targeting ACE2 targets, and efficacy research.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1为pX330质粒图谱。Figure 1 is a plasmid map of pX330.
图2为pX330-sgRNA1构建测序结果图。Figure 2 shows the sequencing results of pX330-sgRNA1 construction.
图3为pX330-sgRNA2构建测序结果图。Figure 3 shows the sequencing results of pX330-sgRNA2 construction.
图4为pX330-sgRNA1-3切割效率验证结果。FIG4 is the result of verifying the cutting efficiency of pX330-sgRNA1-3.
图5为pX330-sgRNA3构建测序结果图。Figure 5 is a diagram showing the sequencing results of pX330-sgRNA3 construction.
图6为人源化ACE2打靶策略示意图。Figure 6 is a schematic diagram of the humanized ACE2 targeting strategy.
图7为人源化ACE2打靶载体测序结果信息。Figure 7 shows the sequencing results of the humanized ACE2 targeting vector.
图8为对比例1中的三片段连接的PCR鉴定图。FIG8 is a PCR identification diagram of the three-fragment connection in Comparative Example 1.
图9为人源化ACE2小鼠胚胎干细胞基因型的PCR鉴定结果图。FIG. 9 is a diagram showing the PCR identification results of the genotype of humanized ACE2 mouse embryonic stem cells.
图10为删除PGK-Puro后人源化ACE2小鼠胚胎干细胞基因型的PCR鉴定结果图。Figure 10 is a graph showing the PCR identification results of the humanized ACE2 mouse embryonic stem cell genotype after deleting PGK-Puro.
图11为人源化ACE2基因示意图。Figure 11 is a schematic diagram of the humanized ACE2 gene.
图12为本发明获得的人源化ACE2基因改造小鼠。FIG. 12 shows the humanized ACE2 gene-modified mice obtained by the present invention.
图13显示人源化ACE2基因改造小鼠在RNA水平上表达人ACE2并具有组织特异性。Figure 13 shows that humanized ACE2 gene-modified mice express human ACE2 at the RNA level and have tissue specificity.
图14的组织免疫荧光结果显示人ACE2在蛋白在人源化ACE2基因改造小鼠组织上特异表达。The tissue immunofluorescence results in Figure 14 show that human ACE2 protein is specifically expressed in the tissues of humanized ACE2 gene-modified mice.
图15显示本发明人源化ACE2基因改造小鼠对SARS-CoV-2易感。Figure 15 shows that the humanized ACE2 gene-modified mice of the present invention are susceptible to SARS-CoV-2.
具体实施方式DETAILED DESCRIPTION
以下通过具体的实施例进一步说明本发明的技术方案,具体实施例不代表对本发明保护范围的限制。其他人根据本发明理念所做出的一些非本质的修改和调整仍属于本发明的保护范围。The technical solution of the present invention is further described below by specific embodiments, which do not limit the protection scope of the present invention. Some non-essential modifications and adjustments made by others based on the concept of the present invention still fall within the protection scope of the present invention.
除非另有定义,本文中所使用的所有技术与科学术语的定义与本领域技术人员所熟悉的定义相同。此外,任何与所记载内容相似或均等的方法和材料皆可应用于本发明方法中,具体实施方式中描述了优选的方法和材料。Unless otherwise defined, the definitions of all technical and scientific terms used herein are the same as those familiar to those skilled in the art. In addition, any methods and materials similar or equivalent to those described herein can be applied to the methods of the present invention, and preferred methods and materials are described in the specific embodiments.
文中所用的“一”和“一种”指语法上不定冠词的释义,表示“一个”、“一种”或“多个”、“多种"(即“至少一个”、“至少一种”)。例如“一要素”指一种或种要素。As used herein, "a" and "an" refer to the grammatical indefinite article, meaning "one", "a kind" or "a plurality", "a variety" (i.e., "at least one", "at least one"). For example, "an element" refers to one or a kind of element.
“CDS”是编码序列(Coding sequence)的缩写,“编码序列”是指用于编码基因的多肽产物的任何核苷酸序列。相反,术语“非编码序列”是指不编码基因的多肽产物的任何核苷酸序列。"CDS" is the abbreviation of coding sequence, which refers to any nucleotide sequence that encodes the polypeptide product of a gene. In contrast, the term "non-coding sequence" refers to any nucleotide sequence that does not encode the polypeptide product of a gene.
术语“片段”将被理解为指长度比参考核酸短并且在共同部分包含与参考核酸相同的核苷酸序列的核苷酸序列。如果合适的话,根据本发明的这样的核酸片段可以被包含在更大的多核苷酸中,该片段是该更大的多核苷酸的组成成分。这样的片段包括长度在本发明的核酸的至少6、8、9、10、12、15、18、20、21、22、23、24、25、30、39、40、42、45、48、50、51、54、57、60、63、66、70、75、78、80、90、100、105、120、135、150、200、300、500、720、900、1000或1500个连续核苷酸范围内的寡核苷酸,或可选地由这样的寡核苷酸组成。The term "fragment" will be understood to refer to a nucleotide sequence that is shorter than a reference nucleic acid and that comprises the same nucleotide sequence as the reference nucleic acid in the common portion. If appropriate, such nucleic acid fragment according to the present invention can be included in a larger polynucleotide, and this fragment is a constituent of this larger polynucleotide. Such a fragment comprises an oligonucleotide having a length of at least 6,8,9,10,12,15,18,20,21,22,23,24,25,30,39,40,42,45,48,50,51,54,57,60,63,66,70,75,78,80,90,100,105,120,135,150,200,300,500,720,900,1000 or 1500 continuous nucleotides of nucleic acid of the present invention, or alternatively consists of such an oligonucleotide.
在本说明中,除非上下文另有要求,词语"包含"、"包括"将被理解为是指包括所述的步骤或要素或步骤和要素的集合,但并不排除任何其它的步骤或要素或步骤和要素的集合;即开放式限定。In this specification, unless the context requires otherwise, the words "comprise" and "include" will be understood to mean including the steps or elements or a collection of steps and elements, but not excluding any other steps or elements or a collection of steps and elements; that is, an open limitation.
“相对应”是指:(a)一多核苷酸具有与参考核苷酸序列的全部或者部分基本上相同或互补的核苷酸序列,或者一多核苷酸编码与肽或者蛋白质中的氨基酸序列完全相同的氨基酸序列;或(b)一肽或多肽具有一种氨基酸序列,该氨基酸序列基本上与参考肽或蛋白质中的氨基酸序列相同。"Corresponding to" means: (a) a polynucleotide has a nucleotide sequence that is substantially identical or complementary to all or part of a reference nucleotide sequence, or a polynucleotide encodes an amino acid sequence that is completely identical to the amino acid sequence in a peptide or protein; or (b) a peptide or polypeptide has an amino acid sequence that is substantially identical to the amino acid sequence in a reference peptide or protein.
术语“下游”是指位于参考核苷酸序列3’端的核苷酸序列。特别地,下游核苷酸序列通常涉及转录起始点之后的序列。例如,基因的翻译起始密码子位于转录起始位点的下游。The term "downstream" refers to a nucleotide sequence located at the 3' end of a reference nucleotide sequence. In particular, a downstream nucleotide sequence generally refers to a sequence after the transcription start point. For example, the translation start codon of a gene is located downstream of the transcription start site.
术语“上游”是指位于参考核苷酸序列5’端的核苷酸序列。特别地,上游核苷酸通常涉及位于编码序列或转录起始点的5’侧的序列。例如,大部分的启动子位于转录起始位点的上游。The term "upstream" refers to a nucleotide sequence located at the 5' end of a reference nucleotide sequence. In particular, upstream nucleotides generally refer to sequences located at the 5' side of a coding sequence or transcription start site. For example, most promoters are located upstream of the transcription start site.
“启动子”是指能够控制编码序列或功能性RNA的表达的DNA序列。一般而言,编码序列位于启动子序列的3’端。启动子可以整体源自天然基因,或由源自天然发现的不同启动子的不同元件组成,或甚至包括合成的DNA片段。本领域技术人员应该理解的是,不同的启动子可以指导基因在不同的组织或细胞类型中,或在发育的不同阶段,或应答于不同的环境或生理条件进行表达。导致基因在大多数细胞类型中在大多数的时间被表达的启动子通常称为“组成型启动子”。导致基因在特定的细胞类型中被表达的启动子通常称为“细胞特异性启动子”或“组织特异性启动子”。导致基因在特定的发育或细胞分化阶段被表达的启动子通常称为“发育特异性启动子”或“细胞分化特异性启动子”。在将细胞暴露于诱导启动子的药剂、生物分子、化学品、配体、光或类似物,或用这些物质对细胞进行处理之后,被诱导并导致基因被表达的启动子通常被称为“诱导型启动子”或“调控型启动子”。还应该认识的是,因为在大多数情况下,调控序列的准确界限还没有完全限定,所以不同长度的DNA片段可能具有相同的启动子活性。"Promoter" refers to a DNA sequence that can control the expression of a coding sequence or functional RNA. Generally speaking, the coding sequence is located at the 3' end of the promoter sequence. The promoter can be derived from a natural gene as a whole, or composed of different elements derived from different promoters found in nature, or even include synthetic DNA fragments. It should be understood by those skilled in the art that different promoters can direct genes to be expressed in different tissues or cell types, or at different stages of development, or in response to different environmental or physiological conditions. Promoters that cause genes to be expressed in most cell types at most times are generally referred to as "constitutive promoters". Promoters that cause genes to be expressed in specific cell types are generally referred to as "cell-specific promoters" or "tissue-specific promoters". Promoters that cause genes to be expressed at specific developmental or cell differentiation stages are generally referred to as "developmental specific promoters" or "cell differentiation specific promoters". After cells are exposed to agents, biomolecules, chemicals, ligands, light or the like that induce promoters, or cells are treated with these substances, promoters that are induced and cause genes to be expressed are generally referred to as "inducible promoters" or "regulatory promoters". It should also be recognized that because in most cases the exact boundaries of regulatory sequences have not been completely defined, DNA fragments of different lengths may have identical promoter activity.
术语“5’UTR”或“5'非编码序列”或“5'非翻译区(UTR)”是指位于编码序列上游(5')的DNA序列。The term "5'UTR" or "5' non-coding sequence" or "5' untranslated region (UTR)" refers to the DNA sequence located upstream (5') of a coding sequence.
术语“限制性核酸内切酶”和“限制性酶”是指结合并切割双链DNA内的特定核苷酸序列的酶。The terms "restriction endonuclease" and "restriction enzyme" refer to enzymes that bind to and cleave specific nucleotide sequences within double-stranded DNA.
术语“载体”意指核酸分子,其能转移它已连接的核酸分子。一种载体类型是“质粒”,其指环状双链DNA环,其他DNA区段可连接进去。另一类载体是病毒载体,其中其他DNA区段可连接到病毒基因组内。某些载体能在引入了这些载体的宿主细胞中自我复制(如具有细菌的复制起始点的细菌载体以及游离型哺乳动物载体)。其他载体(如非游离型哺乳动物载体)在引入到宿主细胞中后能够整合到宿主细胞的基因组中,因此随宿主基因组复制。此外,某些载体能够指导表达它们操作地连接的基因。The term "vector" means a nucleic acid molecule that is capable of transferring a nucleic acid molecule to which it has been linked. One type of vector is a "plasmid," which refers to a circular double-stranded DNA loop into which additional DNA segments can be linked. Another type of vector is a viral vector, in which additional DNA segments can be linked into the viral genome. Certain vectors are capable of self-replication in a host cell into which they are introduced (e.g., bacterial vectors with bacterial origins of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are capable of integrating into the genome of a host cell upon introduction into the host cell, thereby replicating with the host genome. In addition, certain vectors are capable of directing the expression of genes to which they are operatively linked.
一些载体本发明称为“重组表达载体”(或简称为“表达载体”)是指被设计成使得插入的核酸序列在转化入宿主后能够表达的载体、质粒或媒介。总的来说,重组DNA技术所使用的表达载体经常是质粒形式。本说明书的“质粒”和“载体”交换使用,因为质粒是载体最普遍使用的形式。但是,本发明意包括这种表达载体的其他形式,如病毒载体(如复制缺陷型反转录病毒、腺病毒和腺伴随病毒),其充当同等作用。Some vectors referred to herein as "recombinant expression vectors" (or simply "expression vectors") refer to vectors, plasmids or vectors designed to allow the inserted nucleic acid sequence to be expressed after transformation into a host. In general, expression vectors used in recombinant DNA technology are often in the form of plasmids. "Plasmid" and "vector" are used interchangeably in this specification because plasmids are the most commonly used form of vectors. However, the present invention is intended to include other forms of such expression vectors, such as viral vectors (such as replication-defective retroviruses, adenoviruses and adeno-associated viruses), which serve the same role.
术语“质粒”是指染色体外元件,它们常常携带不是作为细胞的中心代谢的一部分的基因,并且常常是环状双链DNA分子的形式。这样的元件可以是来自任何来源的自主复制序列、基因组整合序列、噬菌体或核苷酸序列,线性、环状或超螺旋的,单链或双链DNA或RNA,其中许多核苷酸序列已经被连接入或重组入独特的结构中,该结构能够将针对所选的基因产物的启动子片段和DNA序列以及合适的3’端非翻译序列导入细胞。The term "plasmid" refers to an extrachromosomal element that often carries genes that are not part of the central metabolism of the cell and is often in the form of a circular double-stranded DNA molecule. Such an element can be an autonomously replicating sequence, a genome integrating sequence, a phage, or a nucleotide sequence from any source, linear, circular or supercoiled, single-stranded or double-stranded DNA or RNA, in which many nucleotide sequences have been linked or recombined into a unique structure that is capable of introducing a promoter fragment and DNA sequence for a selected gene product and appropriate 3' non-translated sequences into the cell.
“靶向载体”或“打靶载体”是含有邻接所需遗传修饰的、与内源染色体核酸序列“同源的”序列的DNA构建体。所述侧翼同源序列(称为“同源臂”)借助所述同源臂和相应的内源序列之间存在的同源性,指导所述打靶载体定位于所述基因组中的特定染色体位置,并且通过称为“同源重组”的过程引入所述所需的遗传修饰。“靶向载体”和“打靶载体”某些时候可以通用。采用靶向载体以将插入核酸引入大鼠、真核、非大鼠真核、哺乳动物、非人类哺乳动物、人类、啮齿动物、非大鼠啮齿动物、小鼠或仓鼠核酸的靶基因座中。所述靶向载体包含所述插入核酸且进一步包含5’同源臂和3’同源臂,其侧接所述插入核酸。侧接所述插入核酸的同源臂对应于在大鼠、真核、非大鼠真核、哺乳动物、非人类哺乳动物、人类、啮齿动物、非大鼠啮齿动物、小鼠或仓鼠核酸的靶基因座内的区域。为了便于提及,在靶基因组基因座内的相应同源基因组区在本文中称为“靶位点”。例如,靶向载体可包含由与第一靶位点和第二靶位点互补的第一同源臂和第二同源臂侧接的第一插入核酸。因而,所述靶向载体由此有助于经由在细胞的基因组内的同源臂和互补靶位点之间发生的同源重组事件将插入核酸整合到大鼠、真核、非大鼠真核、哺乳动物、非人类哺乳动物、人类、啮齿动物、非大鼠啮动物、小鼠或仓鼠核酸的靶基因座中。"Targeting vector" or "targeting vector" is a DNA construct containing a sequence "homologous" to an endogenous chromosomal nucleic acid sequence adjacent to the desired genetic modification. The flanking homologous sequences (called "homology arms") guide the targeting vector to locate at a specific chromosomal position in the genome by means of the homology between the homology arms and the corresponding endogenous sequences, and introduce the desired genetic modification by a process called "homologous recombination." "Targeting vector" and "targeting vector" can be used interchangeably at some times. A targeting vector is used to introduce an insert nucleic acid into a target locus of a rat, eukaryotic, non-rat eukaryotic, mammal, non-human mammal, human, rodent, non-rat rodent, mouse or hamster nucleic acid. The targeting vector comprises the insert nucleic acid and further comprises a 5' homology arm and a 3' homology arm, which flanks the insert nucleic acid. The homology arms flanking the insert nucleic acid correspond to regions within the target locus of a rat, eukaryotic, non-rat eukaryotic, mammal, non-human mammal, human, rodent, non-rat rodent, mouse or hamster nucleic acid. For ease of reference, the corresponding homologous genomic region within the target genomic locus is referred to herein as a "target site". For example, a targeting vector may comprise a first insert nucleic acid flanked by a first homology arm complementary to the first target site and a second homology arm complementary to the first target site. Thus, the targeting vector thereby facilitates integration of the insert nucleic acid into the target locus of a rat, eukaryotic, non-rat eukaryotic, mammalian, non-human mammalian, human, rodent, non-rat rodent, mouse or hamster nucleic acid via a homologous recombination event occurring between the homology arms and the complementary target sites within the genome of the cell.
在一个实施方案中,所述大鼠、真核、非大鼠真核、哺乳动物、非人类哺乳动物、人类、啮齿动物、非大鼠啮齿动物、小鼠或仓鼠核酸的靶基因座包含与5’同源臂互补的第一核酸序列和与3’同源臂互补的第二核酸序列。In one embodiment, the target locus of the rat, eukaryotic, non-rat eukaryotic, mammalian, non-human mammalian, human, rodent, non-rat rodent, mouse or hamster nucleic acid comprises a first nucleic acid sequence that is complementary to the 5' homology arm and a second nucleic acid sequence that is complementary to the 3' homology arm.
载体可以通过本领域已知的方法导入期望的宿主细胞,例如转染、电穿孔、微注射、转导、细胞融合、DEAE葡聚糖、磷酸钙沉淀、脂转染(溶酶体融合)、使用基因枪或DNA载体转运蛋白(参见例如Wu et al.,1992,J.Biol.Chem.267:963-967;Wu和Wu,1988,J.Biol.Chem.263:14621-14624;和Hartmut etal.,1990年3月15日提交的加拿大专利申请2,012,311)。The vector can be introduced into the desired host cell by methods known in the art, such as transfection, electroporation, microinjection, transduction, cell fusion, DEAE dextran, calcium phosphate precipitation, lipofection (lysosome fusion), use of a gene gun or a DNA vector transporter (see, e.g., Wu et al., 1992, J. Biol. Chem. 267:963-967; Wu and Wu, 1988, J. Biol. Chem. 263:14621-14624; and Hartmut et al., Canadian Patent Application No. 2,012,311 filed March 15, 1990).
术语“转染”是指细胞摄取外源或异源RNA或DNA。当外源或异源RNA或DNA已被引入细胞内时,该细胞便被这样的RNA或DNA“转染”。当转染的RNA或DNA影响表型变化时,该细胞便被外源或异源RNA或DNA“转化”。转化RNA或DNA可以被整合入(共价连接入)构成细胞基因组的染色体DNA。The term "transfection" refers to the uptake of exogenous or heterologous RNA or DNA by a cell. When exogenous or heterologous RNA or DNA has been introduced into a cell, the cell is "transfected" by such RNA or DNA. When the transfected RNA or DNA affects a phenotypic change, the cell is "transformed" by the exogenous or heterologous RNA or DNA. The transforming RNA or DNA can be integrated into (covalently linked into) the chromosomal DNA that makes up the genome of the cell.
术语“同源性”或“同源的”指序列例如核苷酸或氨基酸序列的两条序列经最佳比对和比较具有至少约75%的核苷酸或氨基酸、至少约80%的核苷酸或氨基酸、至少约90-95%的核苷酸或氨基酸,例如超过97%的核苷酸或氨基酸是相同的。本领域技术人员将理解,对于最佳基因靶向而言,靶向构建体应含有与内源性DNA序列同源的臂(即“同源臂”);因此,能够在靶向构建体与被靶向的内源性序列之间发生同源重组。The term "homology" or "homologous" refers to two sequences, such as nucleotide or amino acid sequences, that are optimally aligned and compared with at least about 75% of the nucleotides or amino acids, at least about 80% of the nucleotides or amino acids, at least about 90-95% of the nucleotides or amino acids, such as more than 97% of the nucleotides or amino acids being identical. Those skilled in the art will appreciate that for optimal gene targeting, the targeting construct should contain arms that are homologous to endogenous DNA sequences (i.e., "homology arms"); therefore, homologous recombination can occur between the targeting construct and the targeted endogenous sequence.
如本文中所用,在两个区彼此共有足够水平的序列同一性时同源臂和靶位点(即同源基因组区)彼此互补,从而充当用于同源重组反应的底物。“同源性”是指DNA序列与相应或“互补”序列相同或共有序列同一性。在给定靶位点和在靶向载体上发现的相应同源臂之间的序列同一性可为允许同源重组发生的任何程度的序列同一性。例如,靶向载体的同源臂(或其片段)与靶位点(或其片段)共有的序列同一性的量可为至少51%、53%、57%、60%、65%、70%、75%、80%、83%、85%、87%、89%、91%、93%、95%、97%、98%、99%或100%序列同一性,因此所述序列经历同源重组。此外,在同源臂和互补靶位点之间同源的互补区可具有足以促进在裂解的识别位点处同源重组的任何长度。因此同源臂与在细胞的基因组内的相应靶位点具有足以进行同源重组的同源性。为了便于提及,同源臂在本文中作为5’同源臂和3’同源臂提到。该术语涉及在靶向载体中同源臂与插入核酸的相对位置。As used herein, homology arms and target sites (i.e., homologous genomic regions) are complementary to each other when two regions have a sufficient level of sequence identity, thereby serving as a substrate for homologous recombination reactions." homology " refers to that a DNA sequence is identical or has a common sequence identity to a corresponding or "complementary" sequence. The sequence identity between a given target site and the corresponding homology arms found on a targeting vector can be any degree of sequence identity that allows homologous recombination to occur. For example, the amount of the sequence identity shared by the homology arms (or its fragment) of the targeting vector and the target site (or its fragment) can be at least 51%, 53%, 57%, 60%, 65%, 70%, 75%, 80%, 83%, 85%, 87%, 89%, 91%, 93%, 95%, 97%, 98%, 99% or 100% sequence identity, so the sequence undergoes homologous recombination. In addition, the complementary region of homology between homology arms and complementary target sites can have any length that is enough to promote homologous recombination at the recognition site of cracking. Therefore, the homology arms have enough homology to carry out homologous recombination with the corresponding target site in the genome of the cell. For ease of reference, homology arms are mentioned in this article as 5' homology arms and 3' homology arms. The term relates to the relative position of homology arms and the inserted nucleic acid in the targeting vector.
一些实施例中,所述靶向载体的同源臂可具有足以促进与相应靶位点的同源重组事件的任何长度,包括例如至少5-10kb、5-15kb、10-20kb、20-30kb、30-40kb、40-50kb、50-60kb、60-70kb、70-80kb、80-90kb、90-100kb、100-110kb、110-120kb、120-130kb、130-140kb、140-150kb、150-160kb、160-170kb、170-180kb、180-190kb、190-200kb长或更长。如下文进一步详细概述,靶向载体可采用更大长度的靶向臂。在一个特定的实施方案中5’同源臂和3’同源臂的总和为至少10kb或5’同源臂和3’同源臂的总和为至少约16kb-约100kb或约30kb-约100kb,在其它实施方案中,所述ACE2的5’同源臂和3’同源臂的总和的尺寸为约10kb-约150kb、约10kb-约100kb、约10kb-约75kb、约20kb-约150kb、约20kb-约100kb、约20kb-约75kb、约30kb-约150kb、约30kb-约100kb、约30kb-约75kb、约40kb-约150kb、约40kb-约100kb、约40kb-约75kb、约50kb-约150kb、约50kb-约100kb、或约50kb-约75kb、约10kb-约30kb、约20kb-约40kb、约40kb-约60kb、约60kb-约80kb、约80kb-约100kb、约100kb-约120kb或约120kb-约150kb。In some embodiments, the homology arms of the targeting vector can have any length sufficient to promote homologous recombination events with corresponding target sites, including, for example, at least 5-10 kb, 5-15 kb, 10-20 kb, 20-30 kb, 30-40 kb, 40-50 kb, 50-60 kb, 60-70 kb, 70-80 kb, 80-90 kb, 90-100 kb, 100-110 kb, 110-120 kb, 120-130 kb, 130-140 kb, 140-150 kb, 150-160 kb, 160-170 kb, 170-180 kb, 180-190 kb, 190-200 kb long or longer. As further detailed below, targeting vectors can employ targeting arms of greater length. In a specific embodiment, the sum of the 5' homology arm and the 3' homology arm is at least 10 kb, or the sum of the 5' homology arm and the 3' homology arm is at least about 16 kb to about 100 kb or about 30 kb to about 100 kb. In other embodiments, the size of the sum of the 5' homology arm and the 3' homology arm of ACE2 is about 10 kb to about 150 kb, about 10 kb to about 100 kb, about 10 kb to about 75 kb, about 20 kb to about 150 kb, about 20 kb to about 100 kb, about 20 kb to about 75 kb, about 30 kb to about 150 kb, about 20 kb to about 100 kb, about 20 kb to about 75 kb, about 30 kb to about 15 kb, about 10 kb to about 30 kb, about 20 kb to about 40 kb, about 40 kb to about 60 kb, about 60 kb to about 80 kb, about 80 kb to about 100 kb, about 100 kb to about 120 kb, or about 120 kb to about 150 kb.
本文中的某些实施例涉及人源化基因编辑哺乳动物,所述人源化基因编辑哺乳动物的基因组包括编码人的全长ACE2蛋白的多聚核糖核酸。例如,所述多聚核糖核酸可操作地连接至启动子多聚核糖核酸。在一些实施例中,人源化基因编辑哺乳动物不表达全部的或部分的编码人源化基因编辑哺乳动物的内源性ACE2蛋白的多聚核糖核酸,并且所述编码人ACE2蛋白的多聚核糖核酸包括人ACE2蛋白基因的修饰。Certain embodiments herein relate to humanized gene-edited mammals, the genome of which includes a polyribonucleic acid encoding a full-length ACE2 protein of a human. For example, the polyribonucleic acid is operably linked to a promoter polyribonucleic acid. In some embodiments, the humanized gene-edited mammal does not express all or part of the polyribonucleic acid encoding the endogenous ACE2 protein of the humanized gene-edited mammal, and the polyribonucleic acid encoding the human ACE2 protein includes a modification of the human ACE2 protein gene.
一些实施例中,所述细胞为多潜能细胞、非多潜能细胞、哺乳动物细胞、人类细胞、非人类哺乳动物细胞、啮齿动物细胞、小鼠细胞、仓鼠细胞、非人类多潜能细胞、人类多潜能细胞、啮齿动物多潜能细胞或成纤维细胞或肺细胞。In some embodiments, the cell is a pluripotent cell, a non-pluripotent cell, a mammalian cell, a human cell, a non-human mammalian cell, a rodent cell, a mouse cell, a hamster cell, a non-human pluripotent cell, a human pluripotent cell, a rodent pluripotent cell, or a fibroblast or a lung cell.
在一些上述方法中,所述细胞为原代细胞或永生化细胞。在一些上述方法中,所述啮齿动物多潜能细胞为小鼠或大鼠胚胎干(ES)细胞。In some of the above methods, the cell is a primary cell or an immortalized cell. In some of the above methods, the rodent pluripotent cell is a mouse or rat embryonic stem (ES) cell.
在一些上述方法中,所述动物细胞或所述人类细胞为原代细胞或永生化细胞。In some of the above methods, the animal cell or the human cell is a primary cell or an immortalized cell.
在一些上述方法中,所述动物细胞或所述人类细胞为多潜能细胞。在一些上述方法中,所述动物多潜能细胞为小鼠胚胎干(ES)细胞。在一些上述方法中,所述人类多潜能细胞为人类胚胎干(ES)细胞、人类成人干细胞、发育受限的人类祖细胞或人类诱导的多潜能干(iPS)细胞。In some of the above methods, the animal cell or the human cell is a pluripotent cell. In some of the above methods, the animal pluripotent cell is a mouse embryonic stem (ES) cell. In some of the above methods, the human pluripotent cell is a human embryonic stem (ES) cell, a human adult stem cell, a developmentally restricted human progenitor cell or a human induced pluripotent stem (iPS) cell.
一些实施例中,本文中的某些实施例提供人源化基因编辑的细胞,尤其是还提供分离的人类和非人类全能或多潜能干细胞,尤其是小鼠胚胎干细胞,其能够在一种或多种体外连续基因修饰之后维持多潜能性且能够经由种系将所述靶向基因修饰传递到后代。In some embodiments, certain embodiments herein provide humanized gene-edited cells, and in particular, also provide isolated human and non-human totipotent or pluripotent stem cells, in particular mouse embryonic stem cells, which are capable of maintaining pluripotency after one or more continuous gene modifications in vitro and are capable of transmitting the targeted gene modifications to offspring via the germline.
本文使用的术语“胚胎干细胞”或"ES细胞”包括在引入胚胎中后能够促进发育胚胎的任何组织的源自胚胎的全能或多潜能细胞。本文使用的术语“多潜能细胞”包括具有发育成多于一种类型的分化细胞的能力的未分化细胞。术语“非多潜能细胞”包括不是多潜能细胞的细胞。The term "embryonic stem cell" or "ES cell" used herein includes an embryonic omnipotence or multipotency cell that can promote any tissue of the developing embryo after being introduced into the embryo. The term "pluripotent cell" used herein includes an undifferentiated cell with the ability to develop into more than one type of differentiated cell. The term "non-pluripotent cell" includes cells that are not pluripotent cells.
在一些上述方法中,所述靶向基因编辑同时包括在所述目标基因组基因座处缺失内源核酸序列或在所述目标基因组基因座处插入所述核酸。In some of the above methods, the targeted gene editing simultaneously comprises deleting an endogenous nucleic acid sequence at the target genomic locus or inserting the nucleic acid at the target genomic locus.
一些实施例中,所述基因修饰或基因编辑包括对细胞(例如,真核细胞、非大鼠真核细胞、哺乳动物细胞、类细胞、非人类哺乳动物细胞、多潜能细胞、非多潜能细胞、非人类多潜能细胞、人类多潜能细胞、人类ES细胞、人类成人干细胞、发育受限的人类祖细胞、人类iPS细胞、人类细胞、啮齿动物细胞、非大鼠啮齿动物细胞、大鼠细胞、小鼠细胞、仓鼠细胞、成纤维细胞或中国仓鼠卵巢(CHO)细胞)独立地实施的两种或多种修饰。第一修饰可通过电穿孔或本领域已知的任何其他方法实现。随后,采用合适的第二核酸构建体对同一细胞基因组进行第二修饰。所述第三修饰可通过第二电穿孔或本领域已知的任何其它方法实现。在各种实施方案中,在同一细胞的第一基因修饰和第二基因修饰之后,可使用例如连续电穿孔或本领域已知的任何其它合适方法(连续地)实现第三基因修饰、第四基因修饰、第五基因修饰、第六基因修饰等连修饰(一种基因修饰跟着另一基因修饰)。In some embodiments, the gene modification or gene editing includes two or more modifications independently implemented to cells (e.g., eukaryotic cells, non-rat eukaryotic cells, mammalian cells, cell-like cells, non-human mammalian cells, pluripotent cells, non-pluripotent cells, non-human pluripotent cells, human pluripotent cells, human ES cells, human adult stem cells, developmentally restricted human progenitor cells, human iPS cells, human cells, rodent cells, non-rat rodent cells, rat cells, mouse cells, hamster cells, fibroblasts or Chinese hamster ovary (CHO) cells). The first modification can be achieved by electroporation or any other method known in the art. Subsequently, the same cell genome is modified for the second time using a suitable second nucleic acid construct. The third modification can be achieved by a second electroporation or any other method known in the art. In various embodiments, after the first gene modification and the second gene modification of the same cell, the third gene modification, the fourth gene modification, the fifth gene modification, the sixth gene modification, etc. can be achieved (continuously) using, for example, continuous electroporation or any other suitable method known in the art.
一些实施例中,本发明是打靶载体同源重组方法进行基因的编辑,将外源核酸插入内源基因组中。In some embodiments, the present invention uses a targeting vector homologous recombination method to perform gene editing and insert exogenous nucleic acid into an endogenous genome.
一些实施例中,所述插入核酸包含插入同源或直系同源人类核酸序列或用其替换真核、非大鼠真核、哺乳动物、人类或非人类哺乳动物核酸序列。In some embodiments, the insert nucleic acid comprises inserting a homologous or orthologous human nucleic acid sequence or replacing a eukaryotic, non-rat eukaryotic, mammalian, human or non-human mammalian nucleic acid sequence with it.
一些实施例中,所述给定插入的多核苷酸可来自任何生物体,包括例如啮齿动物、非大鼠啮齿动物、大鼠、小鼠、仓鼠、哺乳动物、非人类哺乳动物、真核生物、非大鼠真核生物、人类、农业动物或家养动物。In some embodiments, the given inserted polynucleotide can be from any organism, including, for example, a rodent, a non-rat rodent, a rat, a mouse, a hamster, a mammal, a non-human mammal, a eukaryote, a non-rat eukaryote, a human, an agricultural animal, or a domesticated animal.
在特定的实施方案中,所述插入核酸可包含来自大鼠的核酸,其可包含基因组DNA的片段、CDNA、调控区或其任何部分或组合。在其它实施方案中,所述插入核酸可包括来自真核生物、非大鼠真核生物、哺乳动物、人类、非人类哺乳动物、啮齿动物、非大鼠啮齿动物、人类、大鼠、小鼠、仓鼠、兔、猪、牛、鹿、绵羊、山羊、小鸡、猫、狗、白融、灵长类动物(例如,狨猴、恒河猴)、家养哺乳动物或农业哺乳动物或任何其它目标生物体的核酸。如在本文中更详细地概述,在各种方法和组合物中采用的插入核酸可引起目标靶基因座的“人源化”。In a specific embodiment, the insertion nucleic acid may include a nucleic acid from a rat, which may include a fragment of a genomic DNA, a cDNA, a regulatory region, or any portion or combination thereof. In other embodiments, the insertion nucleic acid may include a nucleic acid from a eukaryote, a non-rat eukaryote, a mammal, a human, a non-human mammal, a rodent, a non-rat rodent, a human, a rat, a mouse, a hamster, a rabbit, a pig, a cow, a deer, a sheep, a goat, a chicken, a cat, a dog, a white melt, a primate (e.g., a marmoset, a rhesus monkey), a domestic mammal, or an agricultural mammal, or any other target organism. As outlined in more detail herein, the insertion nucleic acid used in various methods and compositions may cause "humanization" of the target locus.
在一个实施方案中,所述基因修饰为添加核酸序列。在一个实施方案中,所述插入核酸包含在编码序列中的基因修饰。在一个实施方案中,所述基因修饰包括编码序列的缺失突变。在一个实施方案中,所述基因修饰包括两种内源编码序列的融合。在一个实施方案中,所述插入核酸包含插入同源或直系同源人类核酸序列或用其替换真核、非大鼠真核、哺乳动物、人类或非人类哺乳动物核酸序列。在一个实施方案中,所述插入核酸包含在包含相应小鼠DNA序列的内源小鼠基因编码区插入同源或直系同源人类核酸序列或用其替换小鼠DNA序列。在一个实施方案中,所述插入核酸包含在包含相应小鼠DNA序列的内源小鼠基因编码区插入同源或直系同源人类核酸序列或用其替换小鼠DNA序列。在一个实施方案中,利用打靶载体,针对小鼠Ace2位点,在紧接mAce2的EXON1 CDS起始ATG位点插入hACE2序列。In one embodiment, the genetic modification is to add a nucleic acid sequence. In one embodiment, the insert nucleic acid comprises a genetic modification in a coding sequence. In one embodiment, the genetic modification comprises a deletion mutation of a coding sequence. In one embodiment, the genetic modification comprises a fusion of two endogenous coding sequences. In one embodiment, the insert nucleic acid comprises inserting a homologous or orthologous human nucleic acid sequence or replacing a eukaryotic, non-rat eukaryotic, mammalian, human or non-human mammalian nucleic acid sequence with it. In one embodiment, the insert nucleic acid comprises inserting a homologous or orthologous human nucleic acid sequence into an endogenous mouse gene coding region comprising a corresponding mouse DNA sequence or replacing a mouse DNA sequence with it. In one embodiment, the insert nucleic acid comprises inserting a homologous or orthologous human nucleic acid sequence into an endogenous mouse gene coding region comprising a corresponding mouse DNA sequence or replacing a mouse DNA sequence with it. In one embodiment, a targeting vector is used to insert a hACE2 sequence at the start ATG site of the EXON1 CDS immediately following mAce2 for the mouse Ace2 site.
在一个实施方案中,所述靶向载体的核酸序列可包含在整合到基因组中时将生成哺乳动物、人类或非人类哺乳动物ACE2基因座的区域的基因修饰的多核苷酸,其中在ACE2基因座处的基因修饰引起ACE2活性减小、ACE2活性增加或ACE2活性调整。在一个实施方案中,产生ACE2基因被完全替换。In one embodiment, the nucleic acid sequence of the targeting vector may include a polynucleotide that will generate a genetic modification of the region of the ACE2 locus of a mammal, a human or a non-human mammal when integrated into the genome, wherein the genetic modification at the ACE2 locus causes a decrease in ACE2 activity, an increase in ACE2 activity or an adjustment in ACE2 activity. In one embodiment, the ACE2 gene is completely replaced.
在一个实施方案中,所述插入核酸可包含调控元件,包括例如启动子、增强子或转录。In one embodiment, the insert nucleic acid may comprise a regulatory element, including, for example, a promoter, enhancer or transcription factor.
一些实施例中,给定插入的多核苷酸和/或哺乳动物、人类细胞或非人类哺乳动物基因座的相应替换区可为编码区、内含子、外显子、未转译区、调控区、启动子或增强子或其任何组合。In some embodiments, a given inserted polynucleotide and/or the corresponding replaced region of a mammalian, human cell, or non-human mammalian locus may be a coding region, an intron, an exon, an untranslated region, a regulatory region, a promoter, or an enhancer, or any combination thereof.
本文提供允许将一种或多种目标多核苷酸靶向整合到靶基因座中的方法,如上概述,引入序列,由此产生的基因编辑细胞。“引入”以使得序列进入细胞内部的方式而将序列呈递到细胞中(多肽或多核苷酸)。Provided herein are methods that allow for targeted integration of one or more polynucleotides of interest into a target locus, as outlined above, introduction of sequences, and gene-edited cells produced thereby."Introduction" is the presentation of a sequence into a cell (polypeptide or polynucleotide) in such a way that the sequence enters the interior of the cell.
来自任何生物体的任何细胞都可用于本文提供的方法中。在特定的实施方案中,所述细胞来自真核生物、非大鼠真核生物、哺乳动物、非人类哺乳动物、人类、啮齿动物、非大鼠啮齿动物、大鼠、小鼠或仓鼠。在特定的实施方案中,所述细胞为真核细胞、非大鼠真核细胞、多潜能细胞、非多潜能细胞、非人类多潜能细胞、非人类哺乳动物细胞、人类多潜能细胞、人类ES细胞、人类成人干细胞、发育受限的人类祖细胞、人类诱导的多潜能细胞(iPS)细胞、哺乳动物细胞、人类细胞、成纤维细胞、啮齿动物细胞、非大鼠啮齿动物细胞、大鼠细胞、小鼠细胞、小鼠ES细胞、仓鼠细胞或CHO细胞。Any cell from any organism can be used in the method provided herein.In a specific embodiment, the cell is from eukaryotic organism, non-rat eukaryotic organism, mammal, non-human mammal, mankind, rodent, non-rat rodent, rat, mouse or hamster.In a specific embodiment, the cell is eukaryotic cell, non-rat eukaryotic cell, pluripotent cell, non-pluripotent cell, non-human pluripotent cell, non-human mammal cell, human pluripotent cell, human ES cell, human adult stem cell, development-restricted human progenitor cell, human induced pluripotent cell (iPS) cell, mammalian cell, human cell, fibroblast, rodent cell, non-rat rodent cell, rat cell, mouse cell, mouse ES cell, hamster cell or CHO cell.
在一些实施方案中,在所述方法中采用的细胞具有稳定地并入其基因座中的DNA构建体。“稳定地并入”或“稳定地引入”是指将多核苷酸引入细胞中,因此核苷酸序列整合到细胞的基因组中且能够由其子代遗传。In some embodiments, the cell employed in the method has the DNA construct stably incorporated into its locus."Stably incorporated" or "stably introduced" refers to the introduction of a polynucleotide into a cell so that the nucleotide sequence is integrated into the genome of the cell and can be inherited by its progeny.
在一个实施方案中,向细胞中引入一种或多种多核苷酸通过电穿孔、胞浆内注射、病毒感染、腺病毒、慢病毒、反转录病毒、转染、脂质介导的转染介导,或者经由NucleofectionTM介导。In one embodiment, introduction of one or more polynucleotides into a cell is mediated by electroporation, intracytoplasmic injection, viral infection, adenovirus, lentivirus, retrovirus, transfection, lipid-mediated transfection, or via Nucleofection™.
在一个实施例中,所述表达构建体与所述被引入的核酸一起引入。In one embodiment, the expression construct is introduced together with the introduced nucleic acid.
在一个实施方案中,向所述细胞中引入所述一种或多种多核苷酸可经一段时间执行多次。在一个实施方案中,向所述细胞中引入所述一种或多种多核苷酸可经一段时间执行至少两次,经一段时间执行至少三次,经一段时间执行至少四次,经一段时间执行至少五次,经一段时间执行至少六次,经一段时间执行至少七次,经一段时间执行至少八次,经一段时间执行至少九次,经一段时间执行至少十次,经一段时间执行至少十一次,经一段时间执行至少十二次,经一段时间执行至少十三次,经一段时间执行至少十四次,经一段时间执行至少十五次,经一段时间执行至少十六次,经一段时间执行至少十七次,经一段时间执行至少十八次,经一段时间执行至少十九次或经一段时间执行至少二十次。In one embodiment, the introduction of the one or more polynucleotides into the cell can be performed multiple times over a period of time. In one embodiment, the introduction of the one or more polynucleotides into the cell can be performed at least twice over a period of time, at least three times over a period of time, at least four times over a period of time, at least five times over a period of time, at least six times over a period of time, at least seven times over a period of time, at least eight times over a period of time, at least nine times over a period of time, at least ten times over a period of time, at least eleven times over a period of time, at least twelve times over a period of time, at least thirteen times over a period of time, at least fourteen times over a period of time, at least fifteen times over a period of time, at least sixteen times over a period of time, at least seventeen times over a period of time, at least eighteen times over a period of time, at least nineteen times over a period of time, or at least twenty times over a period of time.
在一个实施方案中,所述靶向载体(含有引入核酸)与表达载体(含有sgRNA)同时引入细胞中。In one embodiment, the targeting vector (containing the introduced nucleic acid) and the expression vector (containing the sgRNA) are introduced into the cell simultaneously.
在一个实施例中,进一步提供用于制造人源化非人类动物的方法,其包括:(a)用包含插入核酸的靶向载体修饰多潜能细胞的基因组以形成供体细胞,所述插入核酸包含人类核酸序列;(b)将所述供体细胞引入宿主胚胎中;和(c)在代孕母体中孕育所述宿主胚胎,其中所述代孕母体生成包含所述人类核酸序列的子代。在一个实施方案中,所述供体细胞引入处于胚泡段或处于前桑椹胚阶段(即,4-细胞阶段或8-细胞阶段)的宿主胚胎中。在更进一步的实施方案中,所述基因修饰能够经由种系传递。In one embodiment, a method for making a humanized non-human animal is further provided, comprising: (a) modifying the genome of pluripotent cells to form donor cells with a targeting vector comprising an insertion nucleic acid, wherein the insertion nucleic acid comprises a human nucleic acid sequence; (b) the donor cell is introduced into a host embryo; and (c) the host embryo is bred in a surrogate mother, wherein the surrogate mother generates a progeny comprising the human nucleic acid sequence. In one embodiment, the donor cell is introduced into a host embryo in a blastocyst segment or in a pre-morula stage (i.e., 4-cell stage or 8-cell stage). In a further embodiment, the genetic modification can be transmitted via germline.
在一个特定实施例中,提供了用于制造人源化小鼠的方法,所述方法包括:(a)将包含ACE2基因片段的靶向载体与连接有sgRNA的表达载体引入小鼠胚胎细胞中,形成基因编辑的供体细胞;(b)将所述供体细胞引入小鼠胚胎中;和(c)在代孕母体中孕育所述小鼠胚胎,其中所述代孕母体生成包含所述人类ACE2序列的子代。In a specific embodiment, a method for making a humanized mouse is provided, the method comprising: (a) introducing a targeting vector comprising an ACE2 gene fragment and an expression vector connected to an sgRNA into mouse embryonic cells to form gene-edited donor cells; (b) introducing the donor cells into mouse embryos; and (c) gestating the mouse embryos in a surrogate mother, wherein the surrogate mother generates offspring comprising the human ACE2 sequence.
实施例1 Ace2基因sgRNA 1及pX330-sgRNA质粒构建Example 1 Construction of
合成识别靶位点的sgRNA1序列。Synthesize the sgRNA1 sequence that recognizes the target site.
sgRNA1序列(SEQ ID NO:1):5’-tactgctcagtccctcaccgagg-3’sgRNA1 sequence (SEQ ID NO:1): 5’-tactgctcagtccctcaccgagg-3’
针对sgRNA位点引入BbsI酶切位点合成sgRNA的上下游退火引物进行后续退火实验。合成sgRNA1的上下游单链引物序列如下:The upstream and downstream annealing primers for synthesizing sgRNA by introducing BbsI restriction site into sgRNA site were used for subsequent annealing experiments. The upstream and downstream single-stranded primer sequences for synthesizing sgRNA1 are as follows:
上游:5’-caccgtactgctcagtccctcaccg-3’(SEQ ID NO:6)Upstream: 5'-caccgtactgctcagtccctcaccg-3' (SEQ ID NO: 6)
下游:5’-aaaccggtgagggactgagcagtac-3’(SEQ ID NO:7)Downstream: 5’-aaaccggtgagggactgagcagtac-3’(SEQ ID NO:7)
pX330质粒来源:pX330载体图谱,参见图1。该质粒骨架来源淼灵质粒平台,货号P0123。Source of pX330 plasmid: See Figure 1 for the map of pX330 vector. The plasmid backbone is from Miaoling Plasmid Platform, catalog number P0123.
将上述sgRNA退火引物经退火后分别连接至pX330质粒(质粒先用BbsI线性化),获得表达载体pX330-sgRNA1。The above sgRNA annealing primers were ligated to the pX330 plasmid (the plasmid was linearized with BbsI first) after annealing to obtain the expression vector pX330-sgRNA1.
具体的连接反应体系如表1所示。The specific ligation reaction system is shown in Table 1.
表1连接反应体系Table 1 Ligation reaction system
反应条件为:16℃连接30min以上,转化至30μL TOP10感受态细胞中,然后取200μL涂布于Amp抗性的平板,37℃培养至少12小时后挑选2个克隆接种含有Amp抗性的LB培养基(5mL)中,37℃,250rpm摇培至少12小时。The reaction conditions are as follows: connect at 16°C for more than 30 min, transform into 30 μL TOP10 competent cells, then take 200 μL and spread on the Amp-resistant plate, culture at 37°C for at least 12 hours, select 2 clones and inoculate into LB medium (5 mL) containing Amp resistance, and culture at 37°C and 250 rpm for at least 12 hours.
随机挑选的克隆送测序公司进行测序验证,测序结果如图2所示,选择连接正确的表达载体pX330-sgRNA1进行后续实验。The randomly selected clones were sent to a sequencing company for sequencing verification. The sequencing results are shown in Figure 2. The correctly connected expression vector pX330-sgRNA1 was selected for subsequent experiments.
实施例2 Ace2基因sgRNA2及pX330-sgRNA2质粒构建Example 2 Construction of Ace2 gene sgRNA2 and pX330-sgRNA2 plasmid
合成识别靶位点的sgRNA2序列。Synthesize the sgRNA2 sequence that recognizes the target site.
sgRNA2-序列(SEQ ID NO:2):5’-cttggcattttcctcggtgaggg-3’sgRNA2-Sequence (SEQ ID NO:2): 5’-cttggcattttcctcggtgaggg-3’
针对sgRNA位点引入BbsI酶切位点合成sgRNA的上下游退火引物进行后续退火实验。合成sgRNA2的上下游单链引物序列如下:The upstream and downstream annealing primers for synthesizing sgRNA by introducing BbsI restriction site into sgRNA site were used for subsequent annealing experiments. The upstream and downstream single-stranded primer sequences for synthesizing sgRNA2 are as follows:
上游:5’-caccgcttggcattttcctcggtga-3’(SEQ ID NO:4)Upstream: 5’-caccgcttggcattttcctcggtga-3’(SEQ ID NO:4)
下游:5’-aaactcaccgaggaaaatgccaagc-3’(SEQ ID NO:5)Downstream: 5'-aaactcaccgaggaaaatgccaagc-3' (SEQ ID NO: 5)
pX330质粒来源:pX330载体图谱,参见图1。该质粒骨架来源淼灵质粒平台,货号P0123。Source of pX330 plasmid: See Figure 1 for the map of pX330 vector. The plasmid backbone is from Miaoling Plasmid Platform, catalog number P0123.
将上述sgRNA退火引物经退火后分别连接至pX330质粒(质粒先用BbsI线性化),获得表达载体pX330-sgRNA2。The above sgRNA annealing primers were ligated to the pX330 plasmid (the plasmid was linearized with BbsI) after annealing to obtain the expression vector pX330-sgRNA2.
具体的连接反应体系如表2所示。The specific ligation reaction system is shown in Table 2.
表2连接反应体系Table 2 Ligation reaction system
反应条件为:16℃连接30min以上,转化至30μL TOP10感受态细胞中,然后取200μL涂布于Amp抗性的平板,37℃培养至少12小时后挑选2个克隆接种含有Amp抗性的LB培养基(5mL)中,37℃,250rpm摇培至少12小时。The reaction conditions are as follows: connect at 16°C for more than 30 min, transform into 30 μL TOP10 competent cells, then take 200 μL and spread on the Amp-resistant plate, culture at 37°C for at least 12 hours, select 2 clones and inoculate into LB medium (5 mL) containing Amp resistance, and culture at 37°C and 250 rpm for at least 12 hours.
随机挑选的克隆送测序公司进行测序验证,测序结果如图3所示,选择连接正确的表达载体pX330-sgRNA2进行后续实验。The randomly selected clones were sent to a sequencing company for sequencing verification. The sequencing results are shown in Figure 3. The correctly connected expression vector pX330-sgRNA2 was selected for subsequent experiments.
实施例3 pX330-sgRNA切割效率鉴定Example 3 Identification of pX330-sgRNA Cutting Efficiency
分别将2μg实施例1、实施例2制备的pX330-sgRNA质粒利用脂质体3000(Lipofectamine 3000,invitrogen,货号L3000001)方法转染进小鼠胚胎干细胞,具体转染操作参考Lipofectamine 3000试剂操作说明。收取转染两天后的小鼠胚胎干细胞,用细胞基因组提取试剂盒进行基因组提取(天根,DP304-02)。随后在基因组切割位点两侧设计PCR上下游引物,上游引物(5arm-sgF:ggttttgatttggccataaaatgttagc(SEQ ID NO:10))和下游引物(3arm-sgR:attcccaggtccagtttcacctaag(SEQ ID NO:11))对提取所得基因组进行PCR反应(使用诺唯赞Phanta Max Super-FidelityDNAPolymerase)。然后利用T7核酸内切酶Ⅰ(T7EⅠ)(Biolabs,M0302L)进行pX330-sgRNA切割效率验证,原理T7核酸内切酶Ⅰ可识别并切割不完全配对DNA。2 μg of pX330-sgRNA plasmid prepared in Example 1 and Example 2 were transfected into mouse embryonic stem cells using Lipofectamine 3000 (Invitrogen, Cat. No. L3000001). For specific transfection operations, refer to the
T7 EⅠ实验具体操作步骤:Specific steps of T7 EⅠ experiment:
将上述从各转染pX330-sgRNA1和pX330-sgRNA2的细胞提取基因组,通过5arm-sgF+3arm-sgR扩增得到的PCR产物进行回收(天根,DP214-02),取1μg进行退火反应,体系如下所示:The genomes were extracted from the cells transfected with pX330-sgRNA1 and pX330-sgRNA2, and the PCR products obtained by 5arm-sgF+3arm-sgR amplification were recovered (Tian Gen, DP214-02), and 1 μg was taken for annealing reaction. The system is as follows:
表3Table 3
然后往退火产物中加入1μL T7核酸内切酶Ⅰ于37℃反应30min,后直接跑胶验证。跑胶结果如图4所示。实施例1、实施例2的sgRNA1和sgRNA2其切割条带亮度较强(sgRNA1可切割成约376bp和581bp两条带,sgRNA2可切割成约371bp和586bp两条带),具有较高切割效率。Then, 1 μL of T7 endonuclease I was added to the annealed product and reacted at 37°C for 30 min, and then directly run on the gel for verification. The gel running results are shown in Figure 4. The cleavage bands of sgRNA1 and sgRNA2 in Example 1 and Example 2 were brighter (sgRNA1 can be cleaved into two bands of about 376 bp and 581 bp, and sgRNA2 can be cleaved into two bands of about 371 bp and 586 bp), and had a higher cleavage efficiency.
由于切割效率主要通过图显示,为了显现实施例1、2的方案的效果,图4中同时列举了另一sgRNA(指代为sgRNA3)的切割效率。Since the cutting efficiency is mainly displayed through the graph, in order to show the effects of the schemes of Examples 1 and 2, the cutting efficiency of another sgRNA (referred to as sgRNA3) is also listed in FIG4 .
所列举的sgRNRA3序列(SEQ ID NO:3):5’-caagtgaactttgataagacagg-3’The listed sgRNRA3 sequence (SEQ ID NO:3): 5'-caagtgaactttgataagacagg-3'
合成sgRNA3的上下游单链引物序列如下:The upstream and downstream single-stranded primer sequences for synthesizing sgRNA3 are as follows:
上游:5’-caccgcaagtgaactttgataagac-3’(SEQ ID NO:8)Upstream: 5’-caccgcaagtgaactttgataagac-3’(SEQ ID NO:8)
下游:5’-aaacgtcttatcaaagttcacttgc-3’(SEQ ID NO:9)Downstream: 5'-aaacgtcttatcaaagttcacttgc-3' (SEQ ID NO: 9)
其余步骤如同实施例1。随机挑选克隆送测序公司进行测序验证,测序结果如图5所示,选择连接正确的表达载体pX330-sgRNA3进行后续实验。The remaining steps were the same as in Example 1. Randomly selected clones were sent to a sequencing company for sequencing verification. The sequencing results are shown in FIG5 . The correctly connected expression vector pX330-sgRNA3 was selected for subsequent experiments.
由于其他的sgRNA及pX330-sgRNA质粒构建非本发明实施例的重点,因此不作过多赘述和列举。Since the construction of other sgRNAs and pX330-sgRNA plasmids is not the focus of the embodiments of the present invention, they are not described or listed in detail.
实施例4打靶载体的设计Example 4 Design of Targeting Vector
将人ACE2基因(Gene ID:ID:59272)CDS蛋白编码序列(基于NCBI登录号为NM_001371415.1→NP_001358344.1的转录本,其CDS序列如hACE2-CDS SEQ ID NO:12所示,蛋白序列如hACE2-protein SEQ ID NO:13所示)插入到鼠Ace2的启动子和5’UTR区域序列后,利用鼠Ace2启动子启动人ACE2基因表达。同时,插入的人ACE2 CDS序列后加上SV40 polyA序列信号(SV40-polyA序列如SEQ ID NO:14所示)终止信号加强人ACE2 mRNA转录停止。After inserting the CDS protein coding sequence of the human ACE2 gene (Gene ID: ID: 59272) (based on the transcript with NCBI accession number NM_001371415.1→NP_001358344.1, its CDS sequence is shown in hACE2-CDS SEQ ID NO: 12, and the protein sequence is shown in hACE2-protein SEQ ID NO: 13) into the promoter and 5'UTR region sequence of mouse Ace2, the mouse Ace2 promoter was used to start the expression of human ACE2 gene. At the same time, the inserted human ACE2 CDS sequence was followed by an SV40 polyA sequence signal (SV40-polyA sequence is shown in SEQ ID NO: 14) termination signal to strengthen the termination of human ACE2 mRNA transcription.
hACE2-CDS序列如下(SEQ ID NO:12):The hACE2-CDS sequence is as follows (SEQ ID NO: 12):
atgtcaagctcttcctggctccttctcagccttgttgctgtaactgctgctcagtccaccattgaggaacaggccaagacatttttggacaagtttaaccacgaagccgaagacctgttctatcaaagttcacttgcttcttggaattataacaccaatattactgaagagaatgtccaaaacatgaataatgctggggacaaatggtctgcctttttaaaggaacagtccacacttgcccaaatgtatccactacaagaaattcagaatctcacagtcaagcttcagctgcaggctcttcagcaaaatgggtcttcagtgctctcagaagacaagagcaaacggttgaacacaattctaaatacaatgagcaccatctacagtactggaaaagtttgtaacccagataatccacaagaatgcttattacttgaaccaggtttgaatgaaataatggcaaacagtttagactacaatgagaggctctgggcttgggaaagctggagatctgaggtcggcaagcagctgaggccattatatgaagagtatgtggtcttgaaaaatgagatggcaagagcaaatcattatgaggactatggggattattggagaggagactatgaagtaaatggggtagatggctatgactacagccgcggccagttgattgaagatgtggaacatacctttgaagagattaaaccattatatgaacatcttcatgcctatgtgagggcaaagttgatgaatgcctatccttcctatatcagtccaattggatgcctccctgctcatttgcttggtgatatgtggggtagattttggacaaatctgtactctttgacagttccctttggacagaaaccaaacatagatgttactgatgcaatggtggaccaggcctgggatgcacagagaatattcaaggaggccgagaagttctttgtatctgttggtcttcctaatatgactcaaggattctgggaaaattccatgctaacggacccaggaaatgttcagaaagcagtctgccatcccacagcttgggacctggggaagggcgacttcaggatccttatgtgcacaaaggtgacaatggacgacttcctgacagctcatcatgagatggggcatatccagtatgatatggcatatgctgcacaaccttttctgctaagaaatggagctaatgaaggattccatgaagctgttggggaaatcatgtcactttctgcagccacacctaagcatttaaaatccattggtcttctgtcacccgattttcaagaagacaatgaaacagaaataaacttcctgctcaaacaagcactcacgattgttgggactctgccatttacttacatgttagagaagtggaggtggatggtctttaaaggggaaattcccaaagaccagtggatgaaaaagtggtgggagatgaagcgagagatagttggggtggtggaacctgtgccccatgatgaaacatactgtgaccccgcatctctgttccatgtttctaatgattactcattcattcgatattacacaaggaccctttaccaattccagtttcaagaagcactttgtcaagcagctaaacatgaaggccctctgcacaaatgtgacatctcaaactctacagaagctggacagaaactgttcaatatgctgaggcttggaaaatcagaaccctggaccctagcattggaaaatgttgtaggagcaaagaacatgaatgtaaggccactgctcaactactttgagcccttatttacctggctgaaagaccagaacaagaattcttttgtgggatggagtaccgactggagtccatatgcagaccaaagcatcaaagtgaggataagcctaaaatcagctcttggagataaagcatatgaatggaacgacaatgaaatgtacctgttccgatcatctgttgcatatgctatgaggcagtactttttaaaagtaaaaaatcagatgattctttttggggaggaggatgtgcgagtggctaatttgaaaccaagaatctcctttaatttctttgtcactgcacctaaaaatgtgtctgatatcattcctagaactgaagttgaaaaggccatcaggatgtcccggagccgtatcaatgatgctttccgtctgaatgacaacagcctagagtttctggggatacagccaacacttggacctcctaaccagccccctgtttccatatggctgattgtttttggagttgtgatgggagtgatagtggttggcattgtcatcctgatcttcactgggatcagagatcggaagaagaaaaataaagcaagaagtggagaaaatccttatgcctccatcgatattagcaaaggagaaaataatccaggattccaaaacactgatgatgttcagacctccttttagatgtcaagctcttcctggctccttctcagccttgttgctgtaactgctgctcagtccaccattgaggaacaggccaagacatttttggacaagtttaaccacgaagccgaagacctgttctatcaaagttcacttgcttcttggaattataacaccaatattactgaagagaatgtccaaaacatgaataatgctggggacaaaatgg tctgcctttttaaaggaacagtccacacttgcccaaatgtatccactacaagaaattcagaatctcacagtcaagcttcagctgcaggctcttca gcaaaatgggtcttcagtgctctcagaagacaagagcaaacggttgaacacaattctaaatacaatgagcaccatctacagtactggaaaagtttgtaacccagataatccacaagaatgcttattacttgaaccaggtttgaatgaaataatggcaaacagtttagactacaatgagaggctctgggcttgggaaagctggagatctgaggtcgg caagcagctgaggccattatatgaagagtatgtggtcttgaaaaatgagatggcaagagcaaatcattatgaggactatggggatt attggagaggagactatgaagtaaatggggtagatggctatgactacagccgcggccagttgattgaagatgtggaacatacctttgaagagattaaaccattatatgaacatcttcatgcctatgtgagggcaaagttgatgaatgcctatccttcctatatcagtccaattggatgcctccctgctcatttgcttggtgatatgtgggg tagattttggacaaatctgtactctttgacagttccctttggacagaaaccaaacatagatgttactgatgcaatggtggaccaggcctgg gatgcacagagaatattcaaggaggccgagaagttctttgtatctgttggtcttcctaatatgactcaaggattctgggaaaattccataacggacccaggaaatgttcagaaagcagtctgccatcccacagcttgggacctggggaagggcgacttcaggatccttatgtgcacaaaggtgacaatggacgacttcctgaca gctcatcatgagatggggcatatccagtatgatatggcatatgctgcacaaccttttctgctaagaaatggagctaatgaaggattccatgaagct gttggggaaatcatgtcactttctgcagccacacctaagcatttaaaatccattggtcttctgtcacccgattttcaagaagacaatgaaacagaaataaacttcctgctcaaacaagcactcacgattgttgggactctgccatttacttacatgttagagaagtggaggtggatggtctttaaaggggaaattcccaaagaccagtggatgaa aaagtggtgggagatgaagcgagagatagttggggtggtggaacctgtgccccatgatgaaacatactgtgaccccgcatctctgtt ccatgtttctaatgattactcattcattcgatattacacaaggaccctttaccaattccagtttcaagaagcactttgtcaagcagctaaaacatgaaggccctctgcacaaatgtgacatctcaaactctacagaagctggacagaaactgttcaatatgctgaggcttggaaaatcagaaccctggaccctagcattggaaaatgttgtaggag caaagaacatgaatgtaaggccactgctcaactactttgagcccttatttacctggctgaaagaccagaacaagaattcttttgtgg gatggagtaccgactggagtccatatgcagaccaaagcatcaaagtgaggataagcctaaaatcagctcttggagataaagcatatgaatggaacgacaatgaaatgtacctgttccgatcatctgttgcatatgctatgaggcagtactttttaaaagtaaaaaatcagatgattctttttggggaggaggatgtgcgagtggc taatttgaaaccaagaatctcctttaatttctttgtcactgcacctaaaaatgtgtctgatatcattcctagaactgaagttgaaaaggccatcagg atgtcccggagccgtatcaatgatgctttccgtctgaatgacaacagcctagagtttctggggatacagccaacacttggacctcctaaccagccccctgtttccatatggctgattgtttttggagttgtgatggggagtgatagtggttggcattgtcatcctgatcttcactggggatcagagatcggaagaagaaaaaata aagcaagaagtggagaaaatccttatgcctccatcgatattagcaaaggagaaaataatccaggattccaaaacactgatgatgttcagacctccttttag
hACE2-protein序列如下(SEQ ID NO:13):The hACE2-protein sequence is as follows (SEQ ID NO: 13):
SV40 polyA序列(SEQ ID NO:14):SV40 polyA sequence (SEQ ID NO: 14):
aacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttaaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatctta
根据序列设计,发明人进一步的设计了如图6所示的打靶方案和包含5’同源臂(5arm)、人ACE2基因片段、3’同源臂(3arm)的载体。其中5’同源臂为NCBI登录号为AC091606.8的第125683-126652位核苷酸(5’同源臂序列如SEQ ID NO:15所示),3’同源臂为NCBI登录号为AC091606.8的第126796-127766位核苷酸(3’同源臂序列如SEQ ID NO:16所示)。同时,PGK-嘌呤霉素异种基因包也被插入载体用做筛选需要,嘌呤霉素抗性基因(PGK-puro,SEQ ID NO:17)表达框两端设计有一对同向的frt位点(SEQ ID NO:18),因此可以利用FLP重组酶去除从而解决转基因引起的安全问题。构建的载体被进行完全序列验证。在靶向之前,通过AgeI,所述载体被线性化(打靶载体示意图和打靶策略示意图如图6所示)。According to the sequence design, the inventor further designed a targeting scheme as shown in Figure 6 and a vector comprising a 5' homology arm (5arm), a human ACE2 gene fragment, and a 3' homology arm (3arm). Wherein the 5' homology arm is the 125683-126652 nucleotides of the NCBI accession number AC091606.8 (the 5' homology arm sequence is shown in SEQ ID NO: 15), and the 3' homology arm is the 126796-127766 nucleotides of the NCBI accession number AC091606.8 (the 3' homology arm sequence is shown in SEQ ID NO: 16). At the same time, the PGK-puromycin heterologous gene package is also inserted into the vector for screening needs, and a pair of frt sites (SEQ ID NO: 18) are designed at both ends of the puromycin resistance gene (PGK-puro, SEQ ID NO: 17) expression frame, so FLP recombinase can be used to remove and solve the safety problems caused by transgenics. The constructed vector is fully sequence verified. Prior to targeting, the vector was linearized by AgeI (a schematic diagram of the targeting vector and a schematic diagram of the targeting strategy are shown in FIG6 ).
5’同源臂序列(SEQ ID NO:15):5' homology arm sequence (SEQ ID NO: 15):
ccctatggagtggagaagagtcttataattttttaaatgggcagagaaatgaatttatttttaatttttagagacagggtttctttgtatagctctagctgtctttgattggtagacaaagctgtcctcaaactcagagatcttccttcctttgtctcctgagtgctgggattaaaggcatggaccaccactgccctgccccattctctccattaattttaagtgaatgcttgcaaaagctcacttctttggtgaacagcttcctttacaaataagtacctttgccttcgtttttataggattcttaaaaagaaaaaaaagattcagccaggtggttgtggtgcacacctttaatcccagcagtcaggaggcagaggaaagcagatctcttgagtttgaggctagcctagtctacagagggagttccaggacagccaaggctacagagaggaactgtctaaaaacaccaagaaagagagaaaggagagagggagaggatggatagcttattgatagaattgtcagaaaaggctataagttccaatatgtgtcccatgatttctaagtctagccctttctgttatagtaaaatcatagtacaccctcctcctccagtgtatctttaacagcttttaaggaacatattaactaaatgtccaggttttgatttggccataaaatgttagcaaagctaaggttttctaggattaatgaataacatgtctttatttagtttacttaaaaaaatcattctaaaatatctgtttacatatctgtcctctccaggattaacttcatattggtccagcagcttgtttactgttctcttctgtttcttcttctgctttttttttcttctcttctcagtgcccaacccaagttcaaaggctgatgagagagaaaaactcatgaagagattttactctagggaaagttgctcagtggatgggatcttggcgcacggggaaagccctatggagtggagaagagtcttataattttttaaatgggcagagaaatgaatttatttttaatttttagagacagggtttctttgtatagctctagctgtctttgattggtagacaaagctgtcctcaaactcagagatcttccttcctttgtctcctgagtgctgggattaaaggcatggaccaccactgccctgccccattctctccatta attttaagtgaatgcttgcaaaagctc acttctttggtgaacagcttcctttacaaataagtacctttgccttcgtttttaggattcttaaaaagaaaaaaagattcagccaggtggttgtggtgcacacctttaatcccagcagtcaggaggcagaggaaagcagatctcttgagtttgaggctagcctagtctacagagggagttccaggacagccaagg ctacagagaggaactgtctaaaaacaccaagaaagagagaaagga gagagggagaggatggatagcttattgatagaattgtcagaaaaggctataagttccaatatgtgtcccatgatttctaagtctagccctttctgttatagtaaaatcatagtacaccctcctcctccagtgtatctttaacagcttttaaggaacatattaactaaatgtccaggttttgatttggccataaaatgttagcaaagctaaggtt ttctaggattaatgaataacatgtcttt atttagtttacttaaaaaaatcattctaaaatatctgtttacatatctgtcctctccaggattaacttcatattggtccagcagcttgtttactgttctcttcttctgtttcttcttctgctttttttttcttctcttctcagtgcccaacccaagttcaaaggctgatgagagagaaaaactcatgaagagatttttactc tagggaaagttgctcagtggatgggatcttggcgcacggggaaag
3’同源臂序列(SEQ ID NO:16):3' homology arm sequence (SEQ ID NO: 16):
GaattataatactaacattactgaagaaaatgcccaaaagatggtaagttcttgaggctacccagggggttattgattgcttcttaaagatcagaattactgcctataaaactggataaggaaatcatagagatctctcaagtgtgaggatgagtgactgcctctgtagctctgatcctagtctcccagatggctaaattcaattgaccttagagttcatctggaaaattgttatgaatgaattatttgcccagattccaaagatgagtgaaaatgtttaataaagttgccatcactattctcattatatttggtatgtaaagcattcatggaaatgttctaagtcgttattgagccaataattttctttagcttataatgccaacaggtctatccgagaactacaaatgacatattaactgaaaaatgcaactggggtttactgaaggcagcagcttagtaattaaggtaaccatggcttaggtgaaactggacctgggaattccttctttcattgacacagagctctgaggaatttccaaaggtcacagaagaaaagctataattaaactagtcccaaaaaatctcagcctactctgggaaagcagcatattttgtttgacaagtgcaaggacttagaacttttttttttctcactgatcctgaagtgccttttaagtatagttaagtggtggaaaattgagcaactatttaagaaaagactcttttttttcttcttccagcaatgctttccttcaaaacggtagcttcaaaacttcctgtcttttaaatgatcagggggctgtgtgtttaaattattgccattcatagaacagagtgggtctgaggatgcctgtttcctttgaaattctatgccccctcccagttttctaaaatttaagaaaccacagagactttgacaatgtagttgccaaatgagttgcttttaactgctctaatagtttggtcttGaattataatactaacattactgaagaaaatgcccaaaagatggtaagttcttgaggctacccagggggttattgattgcttcttaaagatcagaattactgcctataaaactggataaggaaatcatagagatctctcaagtgtgaggatgagtgactgcctctgtagctctgatcctagtctcccagatggctaaattcaattgaccttagagttcatctggaa aattgttatgaatgaa ttatttgcccagattccaaagatgagtgaaaatgtttaataaaagttgccatcactattctcattatttggtatgtaaagcattcatggaaatgttctaagtcgttatgagccaataattttctttagcttataatgccaacaggtctatccgagaactacaaatgacatattaactgaaaaatgcaactggggtttactgaagg cagcagcttagtaattaaggtaaccatggcttaggt gaaactggacctgggaattccttctttcattgacacagagctctgaggaatttccaaaggtcacagaagaaaagctataattaaactagtcccaaaaaatctcagcctactctgggaaagcagcatattttgtttgacaagtgcaaggacttagaacttttttttttctcactgatcctgaagtgccttttaagtatagttaagtggt ggaaaattgagcaactatttaagaaaagactcttt tttttcttcttccagcaatgctttccttcaaaacggtagcttcaaaacttcctgtcttttaaatgatcagggggctgtgtgtttaaattattgccattcatagaacagagtgggtctgaggatgcctgtttcctttgaaattctatgccccctcccagttttctaaaatttaagaaaccacagactttgacaatg tagttgccaaatgagttgcttttaactgctctaatagtttggtctt
PGK-puro序列(SEQ ID NO:17):PGK-puro sequence (SEQ ID NO: 17):
GggtaggggaggcgcttttcccaaggcagtctggagcatgcgctttagcagccccgctgggcacttggcgctacacaagtggcctctggcctcgcacacattccacatcccccggtaggcgccaaccggctccgttctttggtggccccttcgcgccaccttctactcctcccctagtcaggaagttcccccccgccccgcagctcgcgtcgtgcaggacgtgacaaatggaagtagcacgtctcactagtctcgtgcagatggacagcaccgctgagcaatggaagcgggtaggcctttggggcagcggccaatagcagctttgctccttcgctttctgggctcagaggctgggaaggggtgggtccgggggcgggctcaggggcgggctcaggggcggggcgggcgcccgaaggtcctccggaggcccggcattctgcacgcttcaaaagcgcacgtctgccgcgctgttctcctcttcctcatctccgggcctttcgacctgcagcccaagctagcttaccatgaccgagtacaagcccacggtgcgcctcgccacccgcgacgacgtccccagggccgtacgcaccctcgccgccgcgttcgccgactaccccgccacgcgccacaccgtcgatccggaccgccacatcgagcgggtcaccgagctgcaagaactcttcctcacgcgcgtcgggctcgacatcggcaaggtgtgggtcgcggacgacggcgccgcggtggcggtctggaccacgccggagagcgtcgaagcgggggcggtgttcgccgagatcggcccgcgcatggccgagttgagcggttcccggctggccgcgcagcaacagatggaaggcctcctggcgccgcaccggcccaaggagcccgcgtggttcctggccaccgtcggcgtctcgcccgaccaccagggcaagggtctgggcagcgccgtcgtgctccccggagtggaggcggccgagcgcgccggggtgcccgccttcctggagacctccgcgccccgcaacctccccttctacgagcggctcggcttcaccgtcaccgccgacgtcgaggtgcccgaaggaccgcgcacctggtgcatgacccgcaagcccggtgcctgaGggtaggggaggcgcttttcccaaggcagtctggagcatgcgctttagcagccccgctgggcacttggcgctacacaagtggcctctggcctcgcacacattccacatccccccggtaggcgccaaccggctccgttctttggtggccccttcgcgccaccttctactcctcccctagtcaggaagttcccccccgcccc gcagctcgcgtcgtgcaggacgtgacaaatggaagtagcacgtctcactagtctcgtgcagatggacagcaccgctgagcaa tggaagcgggtaggcctttggggcagcggccaatagcagctttgctccttcgctttctgggctcagaggctgggaaggggtgggtccgggggcgggctcaggggcgggctcaggggcggggcgggcgcccgaaggtcctccggaggcccggcattctgcacgcttcaaaagcgcacgtctgccgcg ctgttctcctcttcctcatctccgggcctttcgacctgcagcccaagctagcttaccatgaccgagtacaagcccacggtgcgcctcgccacccg cgacgacgtccccagggccgtacgcaccctcgccgccgcgttcgccgactaccccgccacgcgccacaccgtcgatccggaccgccacatcgagcgggtcaccgagctgcaagaactcttcctcacgcgcgtcgggctcgacatcggcaaggtgtgggtcgcggacgacggcgccgcggtgg cggtctggaccacgccggagagcgtcgaagcgggggcggtgttcgccgagatcggcccgcgcatggccgagttgagcggttcccggctggccgcgcagc aacagatggaaggcctcctggcgccgcaccggcccaaggagcccgcgtggttcctggccaccgtcggcgtctcgcccgaccaccagggcaagggtctgggcagcgccgtcgtgctccccggagtggaggcggccgagcgcgccggggtgcccgccttcctggagacctccgcgccccgcaacctcccc ttctacgagcggctcggcttcaccgtcaccgccgacgtcgaggtgcccgaaggaccgcgcacctggtgcatgacccgcaagcccggtgcctga
Frt序列(SEQ ID NO:18):gaagttcctattctctagaaagtataggaacttcFrt sequence (SEQ ID NO:18): gaagttcctattctctgaaagtataggaacttc
载体构建过程如下:The vector construction process is as follows:
设计扩增5’同源臂的搭桥片段的上游引物(5arm-pcrF:tcgcacacattccacatccaccggtccctatggagtggagaagagtctta(SEQ ID NO:19))和与其匹配的下游引物(5arm-pcrR:gaaggagccaggaagagcttgacatctttccccgtgcgccaagatcc(SEQ ID NO:20));Design the upstream primer (5arm-pcrF: tcgcacacattccacatccaccggtccctatggagtggagaagagtctta (SEQ ID NO: 19)) and the matching downstream primer (5arm-pcrR: gaaggagccaggaagagcttgacatctttccccgtgcgccaagatcc (SEQ ID NO: 20)) for amplifying the bridging fragment of the 5' homology arm;
设计扩增人ACE2 CDS的搭桥片段的上游引物(hACE2-F:ggatcttggcgcacggggaaagatgtcaagctcttcctggctccttc(SEQ ID NO:21))和与其匹配的下游引物(hACE2-R:cattataagctgcaataaacaagttctaaaaggaggtctgaacatcatc(SEQ ID NO:22));Design an upstream primer (hACE2-F: ggatcttggcgcacggggaaagatgtcaagctcttcctggctccttc (SEQ ID NO: 21)) and a matching downstream primer (hACE2-R: cattataagctgcaataaacaagttctaaaaggaggtctgaacatcatc (SEQ ID NO: 22)) for amplifying the bridging fragment of human ACE2 CDS;
设计扩增SV40 polyA的搭桥片段的上游引物(SV40-F:gatgatgttcagacctccttttagaacttgtttattgcagcttataatg(SEQ ID NO:23))和与其匹配的下游引物(SV40-R:AGAGAATAGGAACTTCGCACGCGTtaagatacattgatgagtttggac(SEQ ID NO:24));Design an upstream primer (SV40-F: gatgatgttcagacctccttttagaacttgtttattgcagcttataatg (SEQ ID NO: 23)) and a matching downstream primer (SV40-R: AGAGAATAGGAACTTCGCACGCGTtaagatacattgatgagtttggac (SEQ ID NO: 24)) for amplifying the bridge fragment of SV40 polyA;
设计扩增3’同源臂片段的上游引物(3arm-pcrF:tacgaagttatGtcgacgcGGCGCGCCgaattataatactaacattactg(SEQ ID NO:25))和与其匹配的下游引物(3arm-pcrR:tatgaccatgattacgccaagcttaagaccaaactattagagcagttaaaagc(SEQ ID NO:26))。5’同源臂与3’同源臂的扩增模板为C57BL6/J小鼠基因组,人ACE2的扩增模板为人肺细胞cDNA。PCR反应体系(使用诺唯赞Phanta Max Super-FidelityDNAPolymerase)和条件见表4:The upstream primer (3arm-pcrF: tacgaagttatGtcgacgcGGCGCGCCgaattataatactaacattactg (SEQ ID NO: 25)) and the matching downstream primer (3arm-pcrR: tatgaccatgattacgccaagcttaagaccaaactattagagcagttaaaagc (SEQ ID NO: 26)) for amplifying the 3' homology arm fragment were designed. The amplification templates of the 5' homology arm and the 3' homology arm were the C57BL6/J mouse genome, and the amplification template of human ACE2 was human lung cell cDNA. The PCR reaction system (using Novozyme Phanta Max Super-Fidelity DNA Polymerase) and conditions are shown in Table 4:
表4 PCR反应体系(50μL)Table 4 PCR reaction system (50 μL)
如是,将PCR扩增得到的产物片段5’同源臂、人ACE2 CDS、SV40 polyA使用搭桥PCR(overlap PCR)搭配Touch down PCR方法PCR成连续片段5arm-hACE-SV40;以及PCR扩增得到的3’同源臂回收后直接用于构建同源重组打靶载体,构建过程如下:If so, the 5' homology arm of the product fragment obtained by PCR amplification, human ACE2 CDS, and SV40 polyA are PCR-amplified into a continuous fragment 5arm-hACE-SV40 using the overlap PCR (overlap PCR) and touch down PCR method; and the 3' homology arm obtained by PCR amplification is directly used to construct a homologous recombination targeting vector after recovery. The construction process is as follows:
1.将5arm-hACE-SV40片段进行AgeI+MluI双酶切;3’同源臂片段进行AscI+HindIII双酶切后分别通过酶切连接的方法连接上去,从而得打靶载体。2.通过酶切鉴定后将获得的阳性打靶载体送去测序鉴定,经测序公司验证序列正确(如图7),从而成功获得人源化ACE2打靶载体,序列如下:1. The 5arm-hACE-SV40 fragment was double-digested with AgeI+MluI; the 3' homology arm fragment was double-digested with AscI+HindIII and then connected by enzyme digestion and ligation to obtain a targeting vector. 2. After enzyme digestion identification, the obtained positive targeting vector was sent for sequencing identification. The sequencing company verified that the sequence was correct (as shown in Figure 7), thus successfully obtaining a humanized ACE2 targeting vector, the sequence is as follows:
整个打靶载体序列(SEQ ID NO:27):The entire targeting vector sequence (SEQ ID NO: 27):
tcgcgcgtttcggtgatgacggtgaaaacctctgacacatgcagctcccggagacggtcacagcttgtctgtaagcggatgccgggagcagacaagcccgtcagggcgcgtcagcgggtgttggcgggtgtcggggctggcttaactatgcggcatcagagcagattgtactgagagtgcaccatatgtaccgggtaggggaggcgcttttcccaaggcagtctggagcatgcgctttagcagccccgctgggcacttggcgctacacaagtggcctctggcctcgcacacattccacatccaccggtccctatggagtggagaagagtcttataattttttaaatgggcagagaaatgaatttatttttaatttttagagacagggtttctttgtatagctctagctgtctttgattggtagacaaagctgtcctcaaactcagagatcttccttcctttgtctcctgagtgctgggattaaaggcatggaccaccactgccctgccccattctctccattaattttaagtgaatgcttgcaaaagctcacttctttggtgaacagcttcctttacaaataagtacctttgccttcgtttttataggattcttaaaaagaaaaaaaagattcagccaggtggttgtggtgcacacctttaatcccagcagtcaggaggcagaggaaagcagatctcttgagtttgaggctagcctagtctacagagggagttccaggacagccaaggctacagagaggaactgtctaaaaacaccaagaaagagagaaaggagagagggagaggatggatagcttattgatagaattgtcagaaaaggctataagttccaatatgtgtcccatgatttctaagtctagccctttctgttatagtaaaatcatagtacaccctcctcctccagtgtatctttaacagcttttaaggaacatattaactaaatgtccaggttttgatttggccataaaatgttagcaaagctaaggttttctaggattaatgaataacatgtctttatttagtttacttaaaaaaatcattctaaaatatctgtttacatatctgtcctctccaggattaacttcatattggtccagcagcttgtttactgttctcttctgtttcttcttctgctttttttttcttctcttctcagtgcccaacccaagttcaaaggctgatgagagagaaaaactcatgaagagattttactctagggaaagttgctcagtggatgggatcttggcgcacggggaaagatgtcaagctcttcctggctccttctcagccttgttgctgtaactgctgctcagtccaccattgaggaacaggccaagacatttttggacaagtttaaccacgaagccgaagacctgttctatcaaagttcacttgcttcttggaattataacaccaatattactgaagagaatgtccaaaacatgaataatgctggggacaaatggtctgcctttttaaaggaacagtccacacttgcccaaatgtatccactacaagaaattcagaatctcacagtcaagcttcagctgcaggctcttcagcaaaatgggtcttcagtgctctcagaagacaagagcaaacggttgaacacaattctaaatacaatgagcaccatctacagtactggaaaagtttgtaacccagataatccacaagaatgcttattacttgaaccaggtttgaatgaaataatggcaaacagtttagactacaatgagaggctctgggcttgggaaagctggagatctgaggtcggcaagcagctgaggccattatatgaagagtatgtggtcttgaaaaatgagatggcaagagcaaatcattatgaggactatggggattattggagaggagactatgaagtaaatggggtagatggctatgactacagccgcggccagttgattgaagatgtggaacatacctttgaagagattaaaccattatatgaacatcttcatgcctatgtgagggcaaagttgatgaatgcctatccttcctatatcagtccaattggatgcctccctgctcatttgcttggtgatatgtggggtagattttggacaaatctgtactctttgacagttccctttggacagaaaccaaacatagatgttactgatgcaatggtggaccaggcctgggatgcacagagaatattcaaggaggccgagaagttctttgtatctgttggtcttcctaatatgactcaaggattctgggaaaattccatgctaacggacccaggaaatgttcagaaagcagtctgccatcccacagcttgggacctggggaagggcgacttcaggatccttatgtgcacaaaggtgacaatggacgacttcctgacagctcatcatgagatggggcatatccagtatgatatggcatatgctgcacaaccttttctgctaagaaatggagctaatgaaggattccatgaagctgttggggaaatcatgtcactttctgcagccacacctaagcatttaaaatccattggtcttctgtcacccgattttcaagaagacaatgaaacagaaataaacttcctgctcaaacaagcactcacgattgttgggactctgccatttacttacatgttagagaagtggaggtggatggtctttaaaggggaaattcccaaagaccagtggatgaaaaagtggtgggagatgaagcgagagatagttggggtggtggaacctgtgccccatgatgaaacatactgtgaccccgcatctctgttccatgtttctaatgattactcattcattcgatattacacaaggaccctttaccaattccagtttcaagaagcactttgtcaagcagctaaacatgaaggccctctgcacaaatgtgacatctcaaactctacagaagctggacagaaactgttcaatatgctgaggcttggaaaatcagaaccctggaccctagcattggaaaatgttgtaggagcaaagaacatgaatgtaaggccactgctcaactactttgagcccttatttacctggctgaaagaccagaacaagaattcttttgtgggatggagtaccgactggagtccatatgcagaccaaagcatcaaagtgaggataagcctaaaatcagctcttggagataaagcatatgaatggaacgacaatgaaatgtacctgttccgatcatctgttgcatatgctatgaggcagtactttttaaaagtaaaaaatcagatgattctttttggggaggaggatgtgcgagtggctaatttgaaaccaagaatctcctttaatttctttgtcactgcacctaaaaatgtgtctgatatcattcctagaactgaagttgaaaaggccatcaggatgtcccggagccgtatcaatgatgctttccgtctgaatgacaacagcctagagtttctggggatacagccaacacttggacctcctaaccagccccctgtttccatatggctgattgtttttggagttgtgatgggagtgatagtggttggcattgtcatcctgatcttcactgggatcagagatcggaagaagaaaaataaagcaagaagtggagaaaatccttatgcctccatcgatattagcaaaggagaaaataatccaggattccaaaacactgatgatgttcagacctccttttagaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttaacgcgtgcgaagttcctattctctagaaagtataggaacttcatcgataccgggtaggggaggcgcttttcccaaggcagtctggagcatgcgctttagcagccccgctgggcacttggcgctacacaagtggcctctggcctcgcacacattccacatcccccggtaggcgccaaccggctccgttctttggtggccccttcgcgccaccttctactcctcccctagtcaggaagttcccccccgccccgcagctcgcgtcgtgcaggacgtgacaaatggaagtagcacgtctcactagtctcgtgcagatggacagcaccgctgagcaatggaagcgggtaggcctttggggcagcggccaatagcagctttgctccttcgctttctgggctcagaggctgggaaggggtgggtccgggggcgggctcaggggcgggctcaggggcggggcgggcgcccgaaggtcctccggaggcccggcattctgcacgcttcaaaagcgcacgtctgccgcgctgttctcctcttcctcatctccgggcctttcgacctgcagcccaagctagcttaccatgaccgagtacaagcccacggtgcgcctcgccacccgcgacgacgtccccagggccgtacgcaccctcgccgccgcgttcgccgactaccccgccacgcgccacaccgtcgatccggaccgccacatcgagcgggtcaccgagctgcaagaactcttcctcacgcgcgtcgggctcgacatcggcaaggtgtgggtcgcggacgacggcgccgcggtggcggtctggaccacgccggagagcgtcgaagcgggggcggtgttcgccgagatcggcccgcgcatggccgagttgagcggttcccggctggccgcgcagcaacagatggaaggcctcctggcgccgcaccggcccaaggagcccgcgtggttcctggccaccgtcggcgtctcgcccgaccaccagggcaagggtctgggcagcgccgtcgtgctccccggagtggaggcggccgagcgcgccggggtgcccgccttcctggagacctccgcgccccgcaacctccccttctacgagcggctcggcttcaccgtcaccgccgacgtcgaggtgcccgaaggaccgcgcacctggtgcatgacccgcaagcccggtgcctgaggtacctctcatgctggagttcttcgcccaccccaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttatcatcgatgaagttcctattctctagaaagtataggaacttctaacctcccgggtgacagataacttcgtataatgtatgctatacgaagttatgtcgacgcggcgcgccgaattataatactaacattactgaagaaaatgcccaaaagatggtaagttcttgaggctacccagggggttattgattgcttcttaaagatcagaattactgcctataaaactggataaggaaatcatagagatctctcaagtgtgaggatgagtgactgcctctgtagctctgatcctagtctcccagatggctaaattcaattgaccttagagttcatctggaaaattgttatgaatgaattatttgcccagattccaaagatgagtgaaaatgtttaataaagttgccatcactattctcattatatttggtatgtaaagcattcatggaaatgttctaagtcgttattgagccaataattttctttagcttataatgccaacaggtctatccgagaactacaaatgacatattaactgaaaaatgcaactggggtttactgaaggcagcagcttagtaattaaggtaaccatggcttaggtgaaactggacctgggaattccttctttcattgacacagagctctgaggaatttccaaaggtcacagaagaaaagctataattaaactagtcccaaaaaatctcagcctactctgggaaagcagcatattttgtttgacaagtgcaaggacttagaacttttttttttctcactgatcctgaagtgccttttaagtatagttaagtggtggaaaattgagcaactatttaagaaaagactcttttttttcttcttccagcaatgctttccttcaaaacggtagcttcaaaacttcctgtcttttaaatgatcagggggctgtgtgtttaaattattgccattcatagaacagagtgggtctgaggatgcctgtttcctttgaaattctatgccccctcccagttttctaaaatttaagaaaccacagagactttgacaatgtagttgccaaatgagttgcttttaactgctctaatagtttggtcttaagcttggcgtaatcatggtcatagctgtttcctgtgtgaaattgttatccgctcacaattccacacaacatacgagccggaagcataaagtgtaaagcctggggtgcctaatgagtgagctaactcacattaattgcgttgcgctcactgcccgctttccagtcgggaaacctgtcgtgccagctgcattaatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcgctcttccgcttcctcgctcactgactcgctgcgctcggtcgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatacggttatccacagaatcaggggataacgcaggaaagaacatgtgagcaaaaggccagcaaaaggccaggaaccgtaaaaaggccgcgttgctggcgtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccgacaggactataaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccggatacctgtccgcctttctcccttcgggaagcgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgtaggtcgttcgctccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaacccggtaagacacgacttatcgccactggcagcagccactggtaacaggattagcagagcgaggtatgtaggcggtgctacagagttcttgaagtggtggcctaactacggctacactagaagaacagtatttggtatctgcgctctgctgaagccagttaccttcggaaaaagagttggtagctcttgatccggcaaacaaaccaccgctggtagcggtggtttttttgtttgcaagcagcagattacgcgcagaaaaaaaggatctcaagaagatcctttgatcttttctacggggtctgacgctcagtggaacgaaaactcacgttaagggattttggtcatgagattatcaaaaaggatcttcacctagatccttttaaattaaaaatgaagttttaaatcaatctaaagtatatatgagtaaacttggtctgacagttaccaatgcttaatcagtgaggcacctatctcagcgatctgtctatttcgttcatccatagttgcctgactccccgtcgtgtagataactacgatacgggagggcttaccatctggccccagtgctgcaatgataccgcgagacccacgctcaccggctccagatttatcagcaataaaccagccagccggaagggccgagcgcagaagtggtcctgcaactttatccgcctccatccagtctattaattgttgccgggaagctagagtaagtagttcgccagttaatagtttgcgcaacgttgttgccattgctacaggcatcgtggtgtcacgctcgtcgtttggtatggcttcattcagctccggttcccaacgatcaaggcgagttacatgatcccccatgttgtgcaaaaaagcggttagctccttcggtcctccgatcgttgtcagaagtaagttggccgcagtgttatcactcatggttatggcagcactgcataattctcttactgtcatgccatccgtaagatgcttttctgtgactggtgagtactcaaccaagtcattctgagaatagtgtatgcggcgaccgagttgctcttgcccggcgtcaatacgggataataccgcgccacatagcagaactttaaaagtgctcatcattggaaaacgttcttcggggcgaaaactctcaaggatcttaccgctgttgagatccagttcgatgtaacccactcgtgcacccaactgatcttcagcatcttttactttcaccagcgtttctgggtgagcaaaaacaggaaggcaaaatgccgcaaaaaagggaataagggcgacacggaaatgttgaatactcatactcttcctttttcaatattattgaagcatttatcagggttattgtctcatgagcggatacatatttgaatgtatttagaaaaataaacaaataggggttccgcgcacatttccccgaaaagtgccacctgacgtctaagaaaccattattatcatgacattaacctataaaaataggcgtatcacgaggccctttcgtctcgcgcgtttcggtgatgacggtgaaaacctctgacacatgcagctcccggagacggtcacagcttgtctgtaagcggatgccgggagcagacaagcccgtcagggcgcgtcagcgggtgttggcgggtgtcggggctggcttaactatgcggcatcagagcagattgtactgagagtgcaccatat gtaccgggtaggggaggcgcttttcccaaggcagtctggagcatgcgctttagcagccccgctgggcacttggcgct acacaagtggcctctggcctcgcacacattccacatccaccggtccctatggagtggagaagagtcttataattttttaaatgggcagagaaatgaatttatttttaatttttagagacagggtttctttgtatagctctagctgtctttgattggtagacaaagctgtcctcaaactcagagatcttccttcctttgtctcctgagt gctgggattaaaggcatggaccaccactgcctgccccattctctccattaatttta agtgaatgcttgcaaaagctcacttctttggtgaacagcttcctttacaaataagtacctttgccttcgttttataggattcttaaaaagaaaaaaaaagattcagccaggtggttgtggtgcacacctttaatcccagcagtcaggaggcagaggaaagcagatctcttgagtttgaggctagcctagtctacaga gggagttccaggacagccaaggctacagagaggaactgtctaaaaacaccaagaaagagagaaagga gagagggagaggatggatagcttattgatagaattgtcagaaaaggctataagttccaatatgtgtcccatgatttctaagtctagccctttctgttatagtaaaatcatagtacaccctcctcctccagtgtatctttaacagcttttaaggaacatattaactaaatgtccaggttttgatttggccataaaatgttagcaaagctaaggtt ttctaggattaatgaataacatgtctttatttagtttacttaaaaaaatca ttctaaaatatctgtttacatatctgtcctctccaggattaacttcatattggtccagcagcttgtttactgttctcttcttctgtttcttcttctgctttttttttcttctcttctcagtgcccaacccaagttcaaaggctgatgagagagaaaaaactcatgaagagattttactctagggaaagttgctcagtgg atgggatcttggcgcacggggaaagatgtcaagctcttcctggctccttctcagccttgttgctgtaac tgctgctcagtccaccattgaggaacaggccaagacatttttggacaagtttaaccacgaagccgaagacctgttctatcaaagttcacttgcttcttggaattataacaccaatattactgaagagaatgtccaaaacatgaataatgctggggacaaatggtctgcctttttaaaggaacagtccacacttgcccaaatgtatccact acaagaaattcagaatctcacagtcaagcttcagctgcaggctcttcagcaaaat gggtcttcagtgctctcagaagacaagagcaaacggttgaacacaattctaaatacaatgagcaccatctacagtactggaaaagtttgtaacccagataatccacaagaatgcttattacttgaaccaggtttgaatgaaataatggcaaacagtttagactacaatgagaggctctgggcttgggaaagctggagatctgaggtcggcaagcag ctgaggccattatatgaagagtatgtggtcttgaaaaatgagatggcaa gagcaaatcattatgaggactatggggattattggagaggagactatgaagtaaatggggtagatggctatgactacagccgcggccagttgattgaagatgtggaacatacctttgaagagattaaaccattatatgaacatcttcatgcctatgtgagggcaaagttgatgaatgcctatccttcctatatcagtccaattggatgcctccctgctcatt tgcttggtgatatgtggggtagattttggacaaatctgtactc tttgacagttccctttggacagaaaccaaacatagatgttactgatgcaatggtggaccaggcctgggatgcacagagaatattcaaggaggccgagaagttctttgtatctgttggtcttcctaatatgactcaaggattctgggaaaattccatgctaacggacccaggaaatgttcagaaagcagtctgccatcccaca gcttgggacctggggaagggcgacttcaggatccttatgtgcacaaaggtgacaatggacga cttcctgacagctcatcatgagatggggcatatccagtatgatatggcatatgctgcacaaccttttctgctaagaaatggagctaagaaatggagctaatgaaggattccatgaagctgttggggaaatcatgtcactttctgcagccacaccctaagcatttaaaatccattggtcttctgtcacccgattttcaagaagacaatgaaacagaaataaacttcctg ctcaaacaagcactcacgattgttgggactctgccatttacttacatgtta gagaagtggaggtggatggtctttaaaggggaaattcccaaagaccagtggatgaaaaagtggtgggagatgaagcgagagatagttggggtggtggaacctgtgccccatgatgaaacatactgtgaccccgcatctctgttccatgtttctaatgattactcattcattcgatattacacaaggaccctttaccaattccagtttcaagaagc actttgtcaagcagctaaacatgaaggccctctgcacaaatgtgacatct caaactctacagaagctggacagaaactgttcaatatgctgaggcttggaaaatcagaaccctggaccctagcattggaaaatgttgtaggagcaaagaacatgaatgtaaggccactgctcaactactttgagcccttatttacctggctgaaagaccagaacaagaattcttttgtgggatggagtaccgactggagtccatatgcagaccaaagcat caaagtgaggataagcctaaaatcagctcttggagataaagcata tgaatggaacgacaatgaaatgtacctgttccgatcatctgttgcatatgctatgaggcagtactttttaaaagtaaaaaatcagatgattctttttggggaggaggatgtgcgagtggctaatttgaaaccaagaatctcctttaatttctttgtcactgcacctaaaaatgtgtctgatatcattcctagaactga agttgaaaaggccatcaggatgtcccggagccgtatcaatgatgctttccgtctgaatgacaacag cctagagtttctggggatacagccaacacttggacctcctaaccagccccctgtttccatatggctgattgtttttggagttgtgatgggagtgatagtggttggcattgtcatcctgatcttcactgggatcagagatcggaagaagaaaaataaagcaagaagtggagaaaaatccttatgcctccatcgatattagcaaaggaga aaataatccaggattccaaaacactgatgatgttcagacctccttttagaacttgttt attgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaaaagcattttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttaacgcgtgcgaagttcctattctctagaaagtataggaacttcatcgataccgggtaggggaggcgcttttcccaaggcagtctggagcatgc gctttagcagccccgctgggcacttggcgctacacaagtggcctctggcctcgcacacat tccacatcccccggtaggcgccaaccggctccgttctttggtggccccttcgcgccaccttctactcctcccctagtcaggaagttccccccgccccgcagctcgcgtcgtgcaggacgtgacaaatggaagtagcacgtctcactagtctcgtgcagatggacagcaccgctgagcaatggaagcggg taggcctttggggcagcggccaatagcagctttgctccttcgctttctgggctcagaggctgggaaggggtgggt ccgggggcgggctcaggggcgggctcaggggcggggcgggcgcccgaaggtcctccggaggcccggcattctgcacgcttcaaaagcgcacgtctgccgcgctgttctcctcttcctcatctccgggcctttcgacctgcagcccaagctagcttaccatgaccgagtacaagcccacggtgcgcctcgcc acccgcgacgacgtccccagggccgtacgcaccctcgccgccgcgttcgccgactaccccgccacgcgccaca ccgtcgatccggaccgccacatcgagcgggtcaccgagctgcaagaactcttcctcacgcgcgtcgggctcgacatcggcaaggtgtgggtcgcggacgacggcgccgcggtggcggtctggaccacgccggagagcgtcgaagcgggggcggtgttcgccgagatcggcccgcgcatggcc gagttgagcggttcccggctggccgcgcagcaacagatggaaggcctcctggcgccgcaccggcccaaggagcccgcgtggtt cctggccaccgtcggcgtctcgcccgaccaccagggcaagggtctgggcagcgccgtcgtgctccccggagtggaggcggccgagcgcgccggggtgcccgccttcctggagacctccgcgccccgcaacctccccttctacgagcggctcggcttcaccgtcaccgccgacgtcgaggtgccc gaaggaccgcgcacctggtgcatgacccgcaagcccggtgcctgaggtacctctcatgctggagttcttcgcccacccca acttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttatcatcgatgaagttcctattctctagaaagtataggaacttctaacctcccgggtgacagataacttcgtataatgtatgctatacga agttatgtcgacgcggcgcgccgaattataatactaacattactgaagaaaatgcc caaaagatggtaagttcttgaggctacccagggggttattgattgattgcttcttaaagatcagaattactgcctataaaactggataaggaaatcatagagatctctcaagtgtgaggatgagtgactgcctctgtagctctgatcctagtctcccagatggctaaattcaattgaccttagagttcatctggaaaattgttatgaatgaattatttgcccagattc caaagatgagtgaaaatgtttaataaagttgccatcacta ttctcattatttggtatgtaaaagcattcatggaaatgttctaagtcgttattgagccaataattttctttagcttataatgccaacaggtcttccgagaactacaaatgacatattaactgaaaaatgcaactggggtttaactgaaggcagcagcttagtaattaaggtaaccatggcttaggtgaaactggacctgggaattccttctttcat tgacacagagctctgaggaatttccaaaggtcacagaagaaaagctat aattaaactagtcccaaaaaatctcagcctactctgggaaagcagcatattttgtttgacaagtgcaaggacttagaactttttttttctcactgatcctgaagtgccttttaagtatagttaagtggtggaaaattgagcaactatttaagaaaagactctttttttttcttcttccagcaatgctttccttcaaaacggtag cttcaaaacttcctgtcttttaaatgatcagggggctgtgtgtttaaattattgccattca tagaacagagtgggtctgaggatgcctgtttcctttgaaattctatgccccctcccagttttctaaaatttaagaaaccacagagactttgacaatgtagttgccaaatgagttgcttttaactgctctaatagtttggtcttaagcttggcgtaatcatggtcatagctgtttcctgtgtgaaattgttatccg ctcacaattccacacaacatacgagccggaagcataaagtgtaaagcctggggtgcctaatgagtgagct aactcacattaattgcgttgcgctcactgcccgctttccagtcgggaaacctgtcgtgccagctgcattaatcggccaacgcgcggggagaggcggtttgcgtattgggcgctcttccgcttcctcgctcactgactcgctgcgctcggtcgttcggctgcggcgagcggtatcagct cactcaaaggcggtaatacggttatccacagaatcaggggataacgcaggaaagaacatgtgagcaaaaggccagcaaaagg ccaggaaccgtaaaaaggccgcgttgctggcgtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccgacaggactataaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccggatacctg tccgcctttctcccttcgggaagcgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgtaggt cgttcgctccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaacccggtaagacacgacttatcgccactggcagcagccactggtaacaggattagcagagcgaggtatgtaggcggtgctacagagttcttgaagtggtggcctaactacggctac actagaagaacagtatttggtatctgcgctctgctgaagccagttaccttcggaaaaagagtt ggtagctcttgatccggcaaacaaaccaccgctggtagcggtggtttttttgtttgcaagcagcagattacgcgcagaaaaaaaggatctcaagaagatcctttgatcttttctacggggtctgacgctcagtggaacgaaaactcacgttaagggattttggtcatgagattatcaaaaaggatcttcacctag atccttttaaattaaaaatgaagttttaaatcaatctaaagtatatatgagtaaacttggtctgacagtt accaatgcttaatcagtgaggcacctatctcagcgatctgtctatttcgttcatccatagttgcctgactccccgtcgtgtagataactacgatacgggagggcttaccatctggccccagtgctgcaatgataccgcgagacccacgctcaccggctccagatttatcagcaataaaccagccagccggaagggccgagcgcagaag tggtcctgcaactttatccgcctccatccagtctattaattgttgccgggaagcta gagtaagtagttcgccagttaatagtttgcgcaacgttgttgccattgctacaggcatcgtggtgtcacgctcgtcgtttggtatggcttcattcagctccggttcccaacgatcaaggcgagttacatgatcccccatgttgtgcaaaaaagcggttagctccttcggtcctccgatcgttgt cagaagtaagttggccgcagtgttatcactcatggttatggcagcactgcataattctcttactgtcatgccatccgtaag atgcttttctgtgactggtgagtactcaaccaagtcattctgagaatagtgtatgcggcgaccgagttgctcttgcccggcgtcaatacgggataataccgcgccacatagcagaactttaaaagtgctcatcattggaaaacgttcttcggggcgaaaactctcaaggatcttaccgctgttgagatccagttcgat gtaacccactcgtgcacccaactgatcttcagcatcttttactttcaccagcgtttctgggtgagca aaaacaggaaggcaaaatgccgcaaaaaagggaataagggcgacacggaaatgttgaatactcatactcttcctttttcaatattattgaagcatttatcagggttattgtctcatgagcggatacatatttgaatgtatttagaaaaataaacaaataggggttccgcgcacatttccccgaaaagtgccacctgacgtcta agaaaccattattatcatgacattaacctataaaaataggcgtatcacgaggccctttcgtc
对比例1Comparative Example 1
将PCR扩增得到的产物片段5’同源臂、人ACE2 CDS、SV40 polyA使用搭桥PCR(overlap PCR)方法PCR成连续片段5arm-hACE-SV40,PCR反应条件如表5所示,其余条件同实施例4。The product fragment 5' homology arm, human ACE2 CDS, and SV40 polyA obtained by PCR amplification were PCR-amplified into a continuous fragment 5arm-hACE-SV40 using the overlap PCR method. The PCR reaction conditions are shown in Table 5, and the other conditions are the same as Example 4.
表5 PCR反应体系(50μL)Table 5 PCR reaction system (50 μL)
结果如图8所示,并未能扩增出3片段连接片段。The results are shown in FIG8 , and no 3-fragment junction fragment could be amplified.
实施例5人源化ACE2小鼠胚胎干细胞的获得Example 5 Obtaining humanized ACE2 mouse embryonic stem cells
其中一个实施方式中,人源化ACE2小鼠胚胎干细胞的获得可按照如下方法进行:In one embodiment, obtaining humanized ACE2 mouse embryonic stem cells can be performed as follows:
C57BL6/J小鼠胚胎干细胞(所述的胚胎干细胞来源于已确立的胚胎干细胞系)从液氮冻存细胞库中复苏,特别使用代数为p10以内的小鼠胚胎干细胞,在6cm培养皿中培养生长3天。C57BL6/J mouse embryonic stem cells (the embryonic stem cells are derived from an established embryonic stem cell line) were revived from a liquid nitrogen frozen cell bank, and mouse embryonic stem cells within the generation of p10 were used, and cultured and grown in a 6 cm culture dish for 3 days.
通过配套使用NucleofectorTMIIs/2b电转仪以及Mouse ES CellKit(Lonza,VPH-1001)小鼠胚胎干细胞电转试剂盒,使用A023程序,约2×106个细胞被电穿孔,并且所述电转过程含有3μg线性化打靶载体以及1μg pX330-sgRNA1的100μL电转缓冲液中进行。所述转染细胞被种到3个6孔板的孔中,然后恢复36小时。恢复后,1μg/mL嘌呤霉素(默克公司)被加入到细胞培养基中。By using the Nucleofector TM IIs/2b electroporator and Mouse ES Cell Kit (Lonza, VPH-1001) mouse embryonic stem cell electroporation kit, using the A023 program, about 2×10 6 cells were electroporated, and the electroporation process was carried out in 100 μL electroporation buffer containing 3 μg of linearized targeting vector and 1 μg of pX330-sgRNA1. The transfected cells were seeded into three wells of a 6-well plate and then recovered for 36 hours. After recovery, 1 μg/mL puromycin (Merck) was added to the cell culture medium.
经过3-4天的筛选后,通过使用玻璃拉针吸取单克隆的方式挑取并培养嘌呤霉素抗性小鼠胚胎干细胞克隆进96孔板中培养。第二天将单克隆通过胰酶消化并分成两份培养,其中一份细胞在10μL的NP 40裂解缓冲液中于56℃下单独地裂解60分钟,随后在95℃下裂解10分钟。After 3-4 days of screening, puromycin-resistant mouse embryonic stem cell clones were picked and cultured in 96-well plates by using a glass needle to draw single clones. The next day, the single clones were trypsinized and divided into two cultures, one of which was lysed separately in 10 μL of NP 40 lysis buffer at 56°C for 60 minutes, followed by lysis at 95°C for 10 minutes.
其中,小鼠胚胎干细胞培养具体步骤:提前一天准备饲养层feeder细胞,按照需求铺在不同的孔板中,过夜培养形成单层。其中饲养层feeder细胞为经丝裂霉素C(MMC)法处理制备的小鼠胚胎成纤维细胞。使用含有LIF和2i(chir99021和pD0325901inhibitor)的mES培养基培养,每天更换新鲜培养基,可根据细胞的长势适当增加培养基,一般3天传一代,传代时用0.25%胰酶消化,按照6厘米盘30万、6孔板10万每孔左右的密度种植。Among them, the specific steps of mouse embryonic stem cell culture are: prepare the feeder layer feeder cells one day in advance, spread them in different well plates as needed, and culture them overnight to form a monolayer. The feeder layer feeder cells are mouse embryonic fibroblasts prepared by mitomycin C (MMC) method. Use mES culture medium containing LIF and 2i (chir99021 and pD0325901inhibitor) for culture, replace fresh culture medium every day, and increase the culture medium appropriately according to the growth of cells. Generally, one generation is passed every 3 days. During the passaging, use 0.25% trypsin to digest and plant at a density of about 300,000 per well in a 6-cm dish and 100,000 per well in a 6-well plate.
mES+LIF+2i培养基组成为:Knockout DMEM(1×,gibco)+15% FBS(FRONTBIOMEDICAL,0.22μm Millipore滤器过滤)+GlutaMAX(100×,gibco)+NEAA(100×,gibco)+P/S(P:50units,S:50mg/ml,Hyclone)+β-mercaptoethanol(gibco,使用浓度0.1mM)+LIF(1000units/ml,Millipore)+CHIR99021(GSK3β抑制剂,使用浓度3μM)+PD0325901(MEK抑制剂,使用浓度1μM)。The composition of mES+LIF+2i culture medium is: Knockout DMEM (1×, gibco) + 15% FBS (FRONTBIOMEDICAL, filtered through 0.22μm Millipore filter) + GlutaMAX (100×, gibco) + NEAA (100×, gibco) + P/S (P: 50 units, S: 50 mg/ml, Hyclone) + β-mercaptoethanol (gibco, used at a concentration of 0.1 mM) + LIF (1000 units/ml, Millipore) + CHIR99021 (GSK3β inhibitor, used at a concentration of 3μM) + PD0325901 (MEK inhibitor, used at a concentration of 1μM).
NP40裂解液的组成为:10mL TE(20mM Tris pH8.0,150mM NaCl,2mM EDTA)+0.5%NP40+10μL蛋白酶K(10mg/mL)。The composition of NP40 lysis buffer is: 10 mL TE (20 mM Tris pH 8.0, 150 mM NaCl, 2 mM EDTA) + 0.5% NP40 + 10 μL proteinase K (10 mg/mL).
上述方法中的pX330-sgRNA1替换成pX330-sgRNA2或pX330-sgRNA3,其他步骤同pX330-sgRNA1。In the above method, pX330-sgRNA1 was replaced with pX330-sgRNA2 or pX330-sgRNA3, and the other steps were the same as pX330-sgRNA1.
pX330-sgRNA1、pX330-sgRNA2和pX330-sgRNA3搭配打靶载体打靶效果如表6所示。The targeting effects of pX330-sgRNA1, pX330-sgRNA2 and pX330-sgRNA3 combined with targeting vectors are shown in Table 6.
表6.sgRNA1-3搭配打靶载体打靶效果(存活克隆数)Table 6. Targeting effect of sgRNA1-3 combined with targeting vector (number of surviving clones)
实施例6人源化ACE2小鼠胚胎干细胞基因型鉴定Example 6 Identification of humanized ACE2 mouse embryonic stem cell genotype
将实施例5最后得到的细胞裂解物用作基因型鉴定PCR筛选模板。通过使用PhantaMax Super-FidelityDNAPolymerase试剂(诺唯赞公司),根据制造商的说明书实施PCR筛选。PCR分析用于定向插入小鼠Ace2位点的具有5’同源臂和3’同源臂的HDR。由于ACE2基因位于X染色体上,使用的小鼠胚胎干细胞为XY雄性,所以不存在双等位基因靶向发生。The cell lysate obtained at the end of Example 5 was used as a genotype identification PCR screening template. PCR screening was performed according to the manufacturer's instructions using PhantaMax Super-Fidelity DNA Polymerase reagent (Novozyme). PCR analysis was used for directional insertion of HDR with 5' homology arms and 3' homology arms in the mouse Ace2 site. Since the ACE2 gene is located on the X chromosome, the mouse embryonic stem cells used are XY males, so there is no double allele targeting.
鉴定定向插入引物的上游位于嘌呤霉素抗性内部(Puro-F:aacctccccttctacgagc(SEQ ID NO:28)),与其配对的引物下游位于3’同源臂的外侧(3arm-outR:tacagccaggatctggatgtcagc(SEQ ID NO:29))。如果重组载体插入位置正确,则应出现一条长度应为1504bp的条带。此时小鼠胚胎干细胞Ace2位点基因组序列替换为SEQ ID NO:30:The upstream of the primer for identification of directional insertion is located inside the puromycin resistance (Puro-F: aacctccccttctacgagc (SEQ ID NO: 28)), and the downstream of the primer paired with it is located outside the 3' homology arm (3arm-outR: tacagccaggatctggatgtcagc (SEQ ID NO: 29)). If the insertion position of the recombinant vector is correct, a band with a length of 1504 bp should appear. At this time, the genomic sequence of the mouse embryonic stem cell Ace2 site is replaced with SEQ ID NO: 30:
ccctatggagtggagaagagtcttataattttttaaatgggcagagaaatgaatttatttttaatttttagagacagggtttctttgtatagctctagctgtctttgattggtagacaaagctgtcctcaaactcagagatcttccttcctttgtctcctgagtgctgggattaaaggcatggaccaccactgccctgccccattctctccattaattttaagtgaatgcttgcaaaagctcacttctttggtgaacagcttcctttacaaataagtacctttgccttcgtttttataggattcttaaaaagaaaaaaaagattcagccaggtggttgtggtgcacacctttaatcccagcagtcaggaggcagaggaaagcagatctcttgagtttgaggctagcctagtctacagagggagttccaggacagccaaggctacagagaggaactgtctaaaaacaccaagaaagagagaaaggagagagggagaggatggatagcttattgatagaattgtcagaaaaggctataagttccaatatgtgtcccatgatttctaagtctagccctttctgttatagtaaaatcatagtacaccctcctcctccagtgtatctttaacagcttttaaggaacatattaactaaatgtccaggttttgatttggccataaaatgttagcaaagctaaggttttctaggattaatgaataacatgtctttatttagtttacttaaaaaaatcattctaaaatatctgtttacatatctgtcctctccaggattaacttcatattggtccagcagcttgtttactgttctcttctgtttcttcttctgctttttttttcttctcttctcagtgcccaacccaagttcaaaggctgatgagagagaaaaactcatgaagagattttactctagggaaagttgctcagtggatgggatcttggcgcacggggaaagatgtcaagctcttcctggctccttctcagccttgttgctgtaactgctgctcagtccaccattgaggaacaggccaagacatttttggacaagtttaaccacgaagccgaagacctgttctatcaaagttcacttgcttcttggaattataacaccaatattactgaagagaatgtccaaaacatgaataatgctggggacaaatggtctgcctttttaaaggaacagtccacacttgcccaaatgtatccactacaagaaattcagaatctcacagtcaagcttcagctgcaggctcttcagcaaaatgggtcttcagtgctctcagaagacaagagcaaacggttgaacacaattctaaatacaatgagcaccatctacagtactggaaaagtttgtaacccagataatccacaagaatgcttattacttgaaccaggtttgaatgaaataatggcaaacagtttagactacaatgagaggctctgggcttgggaaagctggagatctgaggtcggcaagcagctgaggccattatatgaagagtatgtggtcttgaaaaatgagatggcaagagcaaatcattatgaggactatggggattattggagaggagactatgaagtaaatggggtagatggctatgactacagccgcggccagttgattgaagatgtggaacatacctttgaagagattaaaccattatatgaacatcttcatgcctatgtgagggcaaagttgatgaatgcctatccttcctatatcagtccaattggatgcctccctgctcatttgcttggtgatatgtggggtagattttggacaaatctgtactctttgacagttccctttggacagaaaccaaacatagatgttactgatgcaatggtggaccaggcctgggatgcacagagaatattcaaggaggccgagaagttctttgtatctgttggtcttcctaatatgactcaaggattctgggaaaattccatgctaacggacccaggaaatgttcagaaagcagtctgccatcccacagcttgggacctggggaagggcgacttcaggatccttatgtgcacaaaggtgacaatggacgacttcctgacagctcatcatgagatggggcatatccagtatgatatggcatatgctgcacaaccttttctgctaagaaatggagctaatgaaggattccatgaagctgttggggaaatcatgtcactttctgcagccacacctaagcatttaaaatccattggtcttctgtcacccgattttcaagaagacaatgaaacagaaataaacttcctgctcaaacaagcactcacgattgttgggactctgccatttacttacatgttagagaagtggaggtggatggtctttaaaggggaaattcccaaagaccagtggatgaaaaagtggtgggagatgaagcgagagatagttggggtggtggaacctgtgccccatgatgaaacatactgtgaccccgcatctctgttccatgtttctaatgattactcattcattcgatattacacaaggaccctttaccaattccagtttcaagaagcactttgtcaagcagctaaacatgaaggccctctgcacaaatgtgacatctcaaactctacagaagctggacagaaactgttcaatatgctgaggcttggaaaatcagaaccctggaccctagcattggaaaatgttgtaggagcaaagaacatgaatgtaaggccactgctcaactactttgagcccttatttacctggctgaaagaccagaacaagaattcttttgtgggatggagtaccgactggagtccatatgcagaccaaagcatcaaagtgaggataagcctaaaatcagctcttggagataaagcatatgaatggaacgacaatgaaatgtacctgttccgatcatctgttgcatatgctatgaggcagtactttttaaaagtaaaaaatcagatgattctttttggggaggaggatgtgcgagtggctaatttgaaaccaagaatctcctttaatttctttgtcactgcacctaaaaatgtgtctgatatcattcctagaactgaagttgaaaaggccatcaggatgtcccggagccgtatcaatgatgctttccgtctgaatgacaacagcctagagtttctggggatacagccaacacttggacctcctaaccagccccctgtttccatatggctgattgtttttggagttgtgatgggagtgatagtggttggcattgtcatcctgatcttcactgggatcagagatcggaagaagaaaaataaagcaagaagtggagaaaatccttatgcctccatcgatattagcaaaggagaaaataatccaggattccaaaacactgatgatgttcagacctccttttagaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttaacgcgtgcgaagttcctattctctagaaagtataggaacttcatcgataccgggtaggggaggcgcttttcccaaggcagtctggagcatgcgctttagcagccccgctgggcacttggcgctacacaagtggcctctggcctcgcacacattccacatcccccggtaggcgccaaccggctccgttctttggtggccccttcgcgccaccttctactcctcccctagtcaggaagttcccccccgccccgcagctcgcgtcgtgcaggacgtgacaaatggaagtagcacgtctcactagtctcgtgcagatggacagcaccgctgagcaatggaagcgggtaggcctttggggcagcggccaatagcagctttgctccttcgctttctgggctcagaggctgggaaggggtgggtccgggggcgggctcaggggcgggctcaggggcggggcgggcgcccgaaggtcctccggaggcccggcattctgcacgcttcaaaagcgcacgtctgccgcgctgttctcctcttcctcatctccgggcctttcgacctgcagcccaagctagcttaccatgaccgagtacaagcccacggtgcgcctcgccacccgcgacgacgtccccagggccgtacgcaccctcgccgccgcgttcgccgactaccccgccacgcgccacaccgtcgatccggaccgccacatcgagcgggtcaccgagctgcaagaactcttcctcacgcgcgtcgggctcgacatcggcaaggtgtgggtcgcggacgacggcgccgcggtggcggtctggaccacgccggagagcgtcgaagcgggggcggtgttcgccgagatcggcccgcgcatggccgagttgagcggttcccggctggccgcgcagcaacagatggaaggcctcctggcgccgcaccggcccaaggagcccgcgtggttcctggccaccgtcggcgtctcgcccgaccaccagggcaagggtctgggcagcgccgtcgtgctccccggagtggaggcggccgagcgcgccggggtgcccgccttcctggagacctccgcgccccgcaacctccccttctacgagcggctcggcttcaccgtcaccgccgacgtcgaggtgcccgaaggaccgcgcacctggtgcatgacccgcaagcccggtgcctgaggtacctctcatgctggagttcttcgcccaccccaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttatcatcgatgaagttcctattctctagaaagtataggaacttctaacctcccgggtgacagataacttcgtataatgtatgctatacgaagttatgtcgacgcggcgcgccgaattataatactaacattactgaagaaaatgcccaaaagatggtaagttcttgaggctacccagggggttattgattgcttcttaaagatcagaattactgcctataaaactggataaggaaatcatagagatctctcaagtgtgaggatgagtgactgcctctgtagctctgatcctagtctcccagatggctaaattcaattgaccttagagttcatctggaaaattgttatgaatgaattatttgcccagattccaaagatgagtgaaaatgtttaataaagttgccatcactattctcattatatttggtatgtaaagcattcatggaaatgttctaagtcgttattgagccaataattttctttagcttataatgccaacaggtctatccgagaactacaaatgacatattaactgaaaaatgcaactggggtttactgaaggcagcagcttagtaattaaggtaaccatggcttaggtgaaactggacctgggaattccttctttcattgacacagagctctgaggaatttccaaaggtcacagaagaaaagctataattaaactagtcccaaaaaatctcagcctactctgggaaagcagcatattttgtttgacaagtgcaaggacttagaacttttttttttctcactgatcctgaagtgccttttaagtatagttaagtggtggaaaattgagcaactatttaagaaaagactcttttttttcttcttccagcaatgctttccttcaaaacggtagcttcaaaacttcctgtcttttaaatgatcagggggctgtgtgtttaaattattgccattcatagaacagagtgggtctgaggatgcctgtttcctttgaaattctatgccccctcccagttttctaaaatttaagaaaccacagagactttgacaatgtagttgccaaatgagttgcttttaactgctctaatagtttggtctt(seq id no:30)ccctatggagtggagaagagtcttataattttttaaatgggcagagaaatgaatttatttttaatttttagagacagggtttctttgtatagctctagctgtctttgattggtagacaaagctgtcctcaaactcagagatcttccttcctttgtctcctgagtgctgggattaaaggcatggaccaccactgccctgccccattctctccattaatt ttaagtgaatgcttgcaaaagctcacttctttggtgaacagcttcctttacaaataagtacctttgccttcgtttttataggattcttaaaaagaaaaaaaaagattcagccaggtggttgtggtgcacacctttaatcccagcagtcaggag gcagaggaaagcagatctcttgagtttgaggctagcctagtctacagaggggagttccaggacagccaaggctacagagaggaactgtctaaaaacaccaagaaagagagaaaggagagagggagaggatggatagcttattgatagaattgtcagaaaaggctataagttccaatatgtgtcccatgatttctaagtctagccctttct gttatagtaaaatcatagtacaccctcctcctccagtgtatctttaacagcttttaaggaacatattaactaaatgtccaggttttgatttggccataaaatgttagcaaagctaaggttttctaggattaatgaataacatgtctttatttagtttactt aaaaaaatcattctaaaatatctgtttacatatctgtcctctccaggattaacttcatattggtccagcagcttgtttactgtctcttctgtttcttcttctgctttttttttcttctcttctcagtgcccaacccaagttcaaaggctgatgagagagaaaaactcatgaagagattttactctagggaaagtt gctcagtggatgggatcttggcgcacggggaaagatgtcaagctcttcctggctccttctcagccttgttgctgtaactgctgctcagtccaccattgaggaacaggccaagacatttttggacaagtttaaccacgaagccgaagacctgttctatcaaagttcacttgcttc ttggaattaacaccaatattactgaagagaatgtccaaaaacatgaataatgctggggacaaatggtctgcctttttaaaggaacagtccacacttgcccaaatgtatccactacaagaaattcagaatctcacagtcaagcttcagctgcaggctcttcagcaaaatgggtcttcagtgctctcagaagacaagagcaaacggttgaacac aattctaaatacaatgagcaccatctacagtactggaaaagtttgtaacccagataatccacaagaatgcttattacttgaaccaggtttgaatgaaataatggcaaacagtttagactacaatgagaggctctgggcttgggaaagctggagatctg aggtcggcaagcagctgaggccattatatgaagagtatgtggtcttgaaaaatgagatggcaagagcaaatcattatgaggactatggggattattggagaggagactatgaagtaaatggggtagatggctatgactacagccgcggccagttgattgaagatgtggaacatacctttgaagagattaaaccattatatgaacatcttcatgcctatgtgag ggcaaagttgatgaatgcctatccttcctatatcagtccaattggatgcctccctgctcatttgcttggtgatatgtggggtagattttggacaaatctgtactctttgacagttccctttggacagaaaccaaacatagatgttac tgatgcaatggtggaccaggcctgggatgcacagagaatattcaaggaggccgagaagttctttgtatctgttggtcttcctaatatgactcaaggattctgggaaaattccatgctaacggacccaggaaatgttcagaaagcagtctgccatcccacagcttgggacctggggaagggcgacttcaggatccttatgtgtgcaca aaggtgacaatggacgacttcctgacagctcatcatgagatggggcatatccagtatgatatggcatatgctgcacaaccttttctgctaagaaatggagctaatgaaggattccatgaagctgttggggaaatcatgtcactttctgcagccacacctaagcatt taaaatccattggtcttctgtcacccgattttcaagaagacaatgaaacagaaataaacttcctgctcaaacaagcactcacgattgttgggactctgccatttacttacatgttagagaagtggaggtggatggtctttaaaggggaaattcccaaagaccagtggatgaaaaaaagtggtgggagatgaagcgagatagttggggtggtgga acctgtgccccatgatgaaacatactgtgaccccgcatctctgttccatgtttctaatgattactcattcattcgatattacacaaggaccctttaccaattccagtttcaagaagcactttgtcaagcagctaaacatgaaggccctctgcacaa atgtgacatctcaaactctacagaagctggacagaaactgttcaatatgctgaggcttggaaaatcagaaccctggaccctagcattggaaaatgttgtaggagcaaagaacatgaatgtaaggccactgctcaactactttgagcccttatttacctggctgaaagaccagaacaagaagaattcttttgtggggatggagtaccgactggagtccatatgca gaccaaagcatcaaagtgaggataagcctaaaatcagctcttggagataaagcatatgaatggaacgacaatgaaatgtacctgttccgatcatctgttgcatatgctatgaggcagtactttttaaaagtaaaaaatcagatgattcttt ttggggaggaggatgtgcgagtggctaatttgaaaccaagaatctcctttaatttctttgtcactgcacctaaaaatgtgtctgatatcattcctagaactgaagttgaaaaggccatcaggatgtcccggagccgtatcaatgatgctttccgtctgaatgacaacagcctagagtttctggggatacagccaacacttggac ctcctaaccagccccctgtttccatatggctgattgtttttggagttgtgatggggagtgatagtggttggcattgtcatcctgatcttcactgggatcagagatcggaagaagaaaaataaagcaagaagtggagaaaatccttatgcctccatcgatattagcaa aggagaaaataatccaggattccaaaacactgatgatgttcagacctccttttagaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttaacgcgtgcgaagttcctattctctctaga aagtataggaacttcatcgataccgggtaggggaggcgcttttcccaaggcagtctggagcatgcgctttagcagccccgctgggcacttggcgctacacaagtggcctctggcctcgcacacattccacatcccccggtaggcgccaaccggctccgttctttgg tggccccttcgcgccaccttctactcctcccctagtcaggaagttcccccccgccccgcagctcgcgtcgtgcaggacgtgacaaatggaagtagcacgtctcactagtctcgtgcagatggacagcaccgctgagcaatggaagcgggtaggcctttggggcagcggccaatagcagctttgctcctt cgctttctgggctcagaggctgggaaggggtgggtccgggggcgggctcaggggcgggctcaggggcggggcgggcgcccgaaggtcctccggaggcccggcattctgcacgcttcaaaagcgcacgtctgccgcgctgttctcctcttcctcatctccgggcctttcgacctgcagccca agctagcttaccatgaccgagtacaagcccacggtgcgcctcgccacccgcgacgacgtccccagggccgtacgcaccctcgccgccgcgttcgccgactacccggccacgcgccacaccgtcgatccggaccgccacatcgagcgggtcaccgagctgcaagaactcttcctcacgcgcgtcgggct cgacatcggcaaggtgtgggtcgcggacgacggcgccgcggtggcggtctggaccacgccggagagcgtcgaagcgggggcggtgttcgccgagatcggcccgcgcatggccgagttgagcggttcccggctggccgcgcagcaacagatggaaggcctcctggcgccgcaccggcccaagga gcccgcgtggttcctggccaccgtcggcgtctcgcccgaccaccagggcaagggtctgggcagcgccgtcgtgctccccggagtggaggcggccgagcgcgccggggtgcccgccttcctggagacctccgcgccccgcaacctccccttctacgagcggctcggcttcaccgtcaccgccgac gtcgaggtgcccgaaggaccgcgcacctggtgcatgacccgcaagcccggtgcctgaggtacctctcatgctggagttcttcgcccaccccaacttgttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaaataaagcatttttttcactgcattctagttgtgtgg tttgtccaaactcatcaatgtatcttatcatcgatgaagttcctattctctagaaagtataggaacttctaacctcccgggtgacagataacttcgtataatgtatgctatacgaagttatgtcgacgcggcgcgccgaattataatactaacattactgaagaaaatgcccaaaagatggtaagttcttgaggctacccagggggtt attgattgcttcttaaagatcagaattactgcctataaaactggataaggaaatcatagagatctctcaagtgtgaggatgagtgactgcctctgtagctctgatcctagtctcccagatggctaaattcaattgaccttagagttcatctggaaaattgtta tgaatgaattatttgcccagattccaaagatgagtgaaaatgtttaataaagttgccatcactattctcattatatttggtatgtaaagcattcatggaaatgttctaagtcgttattgagccaataattttctttagcttataatgccaacaggtctatccgagaactacaaatgacatattaactgaaaatgcaactggggtttact gaaggcagcagcttagtaattaaggtaaccatggcttaggtgaaactggacctgggaattccttctttcattgacacagagctctgaggaatttccaaaggtcacagaagaaaagctataattaaactagtcccaaaaaatctcagcctactctgggaaa gcagcatattttgtttgacaagtgcaaggacttagaactttttttttctcactgatcctgaagtgccttttaagtatagttaagtggtggaaaattgagcaactatttaagaaaagactcttttttttcttccagcaatgctttccttcaaaacggtagcttcaaaacttcctgtcttttaaatgatcagggggct gtgtgtttaaattattgccattcatagaacagagtgggtctgaggatgcctgtttcctttgaaattctatgccccctcccagttttctaaaatttaagaaaccacagactttgacaatgtagttgccaaatgagttgcttttaactgctctaatagtttggtctt(seq id no:30)
PCR鉴定结果见图9,共鉴定了14株克隆,其中标注*号为阳性克隆,同时PCR产物经测序结果匹配一致。此PCR结果阳性的小鼠胚胎干细胞克隆为成功编辑的人源化小鼠胚胎干细胞模型,但其中还保留有PGK-Puro筛选标志,由于PGK-Puro两端具有一对同向的frt位点,因此可以利用FLP重组酶去除从而解决转基因引起的安全问题。The PCR identification results are shown in Figure 9. A total of 14 clones were identified, of which the ones marked with * are positive clones, and the PCR products are consistent with the sequencing results. The mouse embryonic stem cell clone with positive PCR results is a successfully edited humanized mouse embryonic stem cell model, but it still retains the PGK-Puro screening marker. Since there is a pair of frt sites in the same direction at both ends of PGK-Puro, it can be removed by FLP recombinase to solve the safety problem caused by transgenics.
PGK-Puro筛选标志的删除方法如下:The PGK-Puro screening marker is removed as follows:
将上述PCR结果阳性的小鼠胚胎干细胞继续培养至6cm盘,通过配套使用NucleofectorTMIIs/2b电转仪以及Mouse ES CellKit(Lonza,VPH-1001)小鼠胚胎干细胞电转试剂盒,使用A023程序,约2×106个细胞被电穿孔,并且所述电转过程含有pPGK-FLPo质粒(Addgene,13793)的100μL电转缓冲液中进行。所述转染细胞被种到6个12孔板的孔中,3天后,采取玻璃拉针吸取单克隆的方式挑取小鼠胚胎干细胞克隆进96孔板中培养,第二天将单克隆消化传代一分为二,一份进行嘌呤霉素筛选,一份正常培养。若成功删除PGK-Puro抗性筛选标志,则嘌呤霉素筛选后细胞不耐受死亡。嘌呤霉素不耐受相对应的克隆可使用上游引物(hACE2-F:tgatagtggttggcattgt(SEQ ID NO:31))和下游引物(3arm-outR:tacagccaggatctggatgtcagc(SEQ ID NO:29))进行基因组基因型PCR鉴定。若成功删除PGK-Puro筛选标志,则PCR产物长度为1532bp,删除PGK-Puro前的PCR产物长度为2863bp。基因型鉴定结果如图9所示,跑胶条带在1500bp处的为成功删除PGK-puro后的克隆,条带在3000bp处的为删除前克隆。嘌呤霉素不耐受相对应的克隆则为最终成功编辑为人源化ACE2打靶的小鼠胚胎干细胞,也就是小鼠胚胎干细胞最终获得如图10所示的人源化ACE2基因。最终删除PGK-puro后小鼠胚胎干细胞Ace2位点基因组序列替换为SEQ ID NO:32,如下所示:The mouse embryonic stem cells with positive PCR results were further cultured in 6 cm dishes and electroporated using the Nucleofector TM IIs/2b electroporator and Mouse ES Cell Kit (Lonza, VPH-1001) mouse embryonic stem cell electroporation kit, using the A023 program, about 2×10 6 cells were electroporated, and the electroporation process was carried out in 100 μL electroporation buffer containing pPGK-FLPo plasmid (Addgene, 13793). The transfected cells were seeded into 6 wells of 12-well plates. After 3 days, the mouse embryonic stem cell clones were picked up by glass needle aspiration and cultured in 96-well plates. The next day, the monoclonal digestion and passage were divided into two, one for puromycin screening and one for normal culture. If the PGK-Puro resistance screening marker is successfully deleted, the cells will die due to intolerance after puromycin screening. The clones corresponding to puromycin intolerance can be identified by genomic genotype PCR using upstream primers (hACE2-F: tgatagtggttggcattgt (SEQ ID NO: 31)) and downstream primers (3arm-outR: tacagccaggatctggatgtcagc (SEQ ID NO: 29)). If the PGK-Puro screening marker is successfully deleted, the length of the PCR product is 1532bp, and the length of the PCR product before deleting PGK-Puro is 2863bp. The genotype identification results are shown in Figure 9. The band at 1500bp is the clone after the successful deletion of PGK-puro, and the band at 3000bp is the clone before deletion. The clone corresponding to puromycin intolerance is the mouse embryonic stem cell that was finally successfully edited to target humanized ACE2, that is, the mouse embryonic stem cell finally obtained the humanized ACE2 gene as shown in Figure 10. After the final deletion of PGK-puro, the genomic sequence of the mouse embryonic stem cell Ace2 site was replaced with SEQ ID NO:32, as shown below:
ccctatggagtggagaagagtcttataattttttaaatgggcagagaaatgaatttatttttaatttttagagacagggtttctttgtatagctctagctgtctttgattggtagacaaagctgtcctcaaactcagagatcttccttcctttgtctcctgagtgctgggattaaaggcatggaccaccactgccctgccccattctctccattaattttaagtgaatgcttgcaaaagctcacttctttggtgaacagcttcctttacaaataagtacctttgccttcgtttttataggattcttaaaaagaaaaaaaagattcagccaggtggttgtggtgcacacctttaatcccagcagtcaggaggcagaggaaagcagatctcttgagtttgaggctagcctagtctacagagggagttccaggacagccaaggctacagagaggaactgtctaaaaacaccaagaaagagagaaaggagagagggagaggatggatagcttattgatagaattgtcagaaaaggctataagttccaatatgtgtcccatgatttctaagtctagccctttctgttatagtaaaatcatagtacaccctcctcctccagtgtatctttaacagcttttaaggaacatattaactaaatgtccaggttttgatttggccataaaatgttagcaaagctaaggttttctaggattaatgaataacatgtctttatttagtttacttaaaaaaatcattctaaaatatctgtttacatatctgtcctctccaggattaacttcatattggtccagcagcttgtttactgttctcttctgtttcttcttctgctttttttttcttctcttctcagtgcccaacccaagttcaaaggctgatgagagagaaaaactcatgaagagattttactctagggaaagttgctcagtggatgggatcttggcgcacggggaaagatgtcaagctcttcctggctccttctcagccttgttgctgtaactgctgctcagtccaccattgaggaacaggccaagacatttttggacaagtttaaccacgaagccgaagacctgttctatcaaagttcacttgcttcttggaattataacaccaatattactgaagagaatgtccaaaacatgaataatgctggggacaaatggtctgcctttttaaaggaacagtccacacttgcccaaatgtatccactacaagaaattcagaatctcacagtcaagcttcagctgcaggctcttcagcaaaatgggtcttcagtgctctcagaagacaagagcaaacggttgaacacaattctaaatacaatgagcaccatctacagtactggaaaagtttgtaacccagataatccacaagaatgcttattacttgaaccaggtttgaatgaaataatggcaaacagtttagactacaatgagaggctctgggcttgggaaagctggagatctgaggtcggcaagcagctgaggccattatatgaagagtatgtggtcttgaaaaatgagatggcaagagcaaatcattatgaggactatggggattattggagaggagactatgaagtaaatggggtagatggctatgactacagccgcggccagttgattgaagatgtggaacatacctttgaagagattaaaccattatatgaacatcttcatgcctatgtgagggcaaagttgatgaatgcctatccttcctatatcagtccaattggatgcctccctgctcatttgcttggtgatatgtggggtagattttggacaaatctgtactctttgacagttccctttggacagaaaccaaacatagatgttactgatgcaatggtggaccaggcctgggatgcacagagaatattcaaggaggccgagaagttctttgtatctgttggtcttcctaatatgactcaaggattctgggaaaattccatgctaacggacccaggaaatgttcagaaagcagtctgccatcccacagcttgggacctggggaagggcgacttcaggatccttatgtgcacaaaggtgacaatggacgacttcctgacagctcatcatgagatggggcatatccagtatgatatggcatatgctgcacaaccttttctgctaagaaatggagctaatgaaggattccatgaagctgttggggaaatcatgtcactttctgcagccacacctaagcatttaaaatccattggtcttctgtcacccgattttcaagaagacaatgaaacagaaataaacttcctgctcaaacaagcactcacgattgttgggactctgccatttacttacatgttagagaagtggaggtggatggtctttaaaggggaaattcccaaagaccagtggatgaaaaagtggtgggagatgaagcgagagatagttggggtggtggaacctgtgccccatgatgaaacatactgtgaccccgcatctctgttccatgtttctaatgattactcattcattcgatattacacaaggaccctttaccaattccagtttcaagaagcactttgtcaagcagctaaacatgaaggccctctgcacaaatgtgacatctcaaactctacagaagctggacagaaactgttcaatatgctgaggcttggaaaatcagaaccctggaccctagcattggaaaatgttgtaggagcaaagaacatgaatgtaaggccactgctcaactactttgagcccttatttacctggctgaaagaccagaacaagaattcttttgtgggatggagtaccgactggagtccatatgcagaccaaagcatcaaagtgaggataagcctaaaatcagctcttggagataaagcatatgaatggaacgacaatgaaatgtacctgttccgatcatctgttgcatatgctatgaggcagtactttttaaaagtaaaaaatcagatgattctttttggggaggaggatgtgcgagtggctaatttgaaaccaagaatctcctttaatttctttgtcactgcacctaaaaatgtgtctgatatcattcctagaactgaagttgaaaaggccatcaggatgtcccggagccgtatcaatgatgctttccgtctgaatgacaacagcctagagtttctggggatacagccaacacttggacctcctaaccagccccctgtttccatatggctgattgtttttggagttgtgatgggagtgatagtggttggcattgtcatcctgatcttcactgggatcagagatcggaagaagaaaaataaagcaagaagtggagaaaatccttatgcctccatcgatattagcaaaggagaaaataatccaggattccaaaacactgatgatgttcagacctccttttagaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttcacaaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttaacgcgtgcgaagttcctattctctagaaagtataggaacttctaacctcccgggtgacagataacttcgtataatgtatgctatacgaagttatgtcgacgcggcgcgccgaattataatactaacattactgaagaaaatgcccaaaagatggtaagttcttgaggctacccagggggttattgattgcttcttaaagatcagaattactgcctataaaactggataaggaaatcatagagatctctcaagtgtgaggatgagtgactgcctctgtagctctgatcctagtctcccagatggctaaattcaattgaccttagagttcatctggaaaattgttatgaatgaattatttgcccagattccaaagatgagtgaaaatgtttaataaagttgccatcactattctcattatatttggtatgtaaagcattcatggaaatgttctaagtcgttattgagccaataattttctttagcttataatgccaacaggtctatccgagaactacaaatgacatattaactgaaaaatgcaactggggtttactgaaggcagcagcttagtaattaaggtaaccatggcttaggtgaaactggacctgggaattccttctttcattgacacagagctctgaggaatttccaaaggtcacagaagaaaagctataattaaactagtcccaaaaaatctcagcctactctgggaaagcagcatattttgtttgacaagtgcaaggacttagaacttttttttttctcactgatcctgaagtgccttttaagtatagttaagtggtggaaaattgagcaactatttaagaaaagactcttttttttcttcttccagcaatgctttccttcaaaacggtagcttcaaaacttcctgtcttttaaatgatcagggggctgtgtgtttaaattattgccattcatagaacagagtgggtctgaggatgcctgtttcctttgaaattctatgccccctcccagttttctaaaatttaagaaaccacagagactttgacaatgtagttgccaaatgagttgcttttaactgctctaatagtttggtctt(seq id no:32)ccctatggagtggagaagagtcttataattttttaaatgggcagagaaatgaatttatttttaatttttagagacagggtttctttgtatagctctagctgtctttgattggtagacaaagctgtcctcaaactcagagatcttccttcctttgtctcctgagtgctgggattaaaggcatggaccaccactgccctgccccattctctccatta attttaagtgaatgcttgcaaaagctcacttctttggtgaacagcttcctttacaaataagtacctttgcct tcgtttttataggattcttaaaaagaaaaaaaagattcagccaggtggttgtggtgcacacctttaatcccagcagtcaggaggcagaggaaagcagatctcttgagtttgaggctagcctagtctacagagggagttccaggacagccaaggctacagagaggaactgtctaaaaacaccaagaaagagagaaaggaga gagggagaggatggatagcttattgatagaattgtcagaaaaggctataagttccaatatgtgtcccatgatttctaagtctagccc tttctgttatagtaaaatcatagtacaccctcctcctccagtgtatctttaacagcttttaaggaacatattaactaaatgtccaggttttgatttggccataaaatgttagcaaagctaaggttttctaggattaatgaataacatgtctttatttagtttacttaaaaaaatcattctaaaatatctgtttacatatctgtcctctc caggattaacttcatattggtccagcagcttgtttactgttctcttctgtttcttcttctgctttttttttcttctct tctcagtgcccaacccaagttcaaaggctgatgagagagaaaaactcatgaagagattttactctagggaaagttgctcagtggatgggatcttggcgcacggggaaagatgtcaagctcttcctggctccttctcagccttgttgctgtaactgctgctcagtccaccattgaggaacaggccaagacattttt ggacaagtttaaccacgaagccgaagacctgttctatcaaagttcacttgcttcttggaattataacaccaatattactgaagagaatgtcc aaaacatgaataatgctggggacaaatggtctgcctttttaaaggaacagtccacacttgcccaaatgtatccactacaagaaattcagaatctcacagtcaagcttcagctgcaggctcttcagcaaaaatgggtcttcagtgctctcagaagacaagagcaaacggttgaacacaattctaaatacaatgagcaccatctacagtactgg aaaagtttgtaacccagataatccacaagaatgcttattacttgaaccaggtttgaatgaaataatggcaaacagt ttagactacaatgagaggctctgggcttgggaaagctggagatctgaggtcggcaagcagctgaggccattatatgaagagtatgtggtcttgaaaaatgagatggcaagagcaaatcattatgaggactatggggattattggagaggagactatgaagtaaatggggtagatggctatgactacagccgcggccagttgattgaagatgtggaacatacctt tgaagagattaaaccattatatgaacatcttcatgcctatgtgagggcaaagttgatgaatgc ctatccttcctatatcagtccaattggatgcctccctgctcatttgcttggtgatatgtggggtagattttggacaaatctgtactctttgacagttccctttggacagaaaccaaacatagatgttatgatgcaatggtggaccaggcctgggatgcacagagaatattcaaggaggccgagaagttctttgtatctgt tggtcttcctaatatgactcaaggattctgggaaaattccatgctaacggacccaggaaatgttcagaaagcagtctgccatccca cagcttgggacctggggaagggcgacttcaggatccttatgtgcacaaaggtgacaatggacgacttcctgacagctcatcatgagatggggcatatccagtatgatatggcatatgctgcacaaccttttctgctaagaaatggagctaatgaaggattccatgaagctgttggggaaatcatgtcactttctgcagccacaccctaagcattta aaatccattggtcttctgtcacccgattttcaagaagacaatgaaacagaaataaacttcctgctcaaacaag cactcacgattgttgggactctgccatttacttacatgttagagaagtggaggtggatggtctttaaaggggaaattcccaaagaccagtggatgaaaaagtggtgggagatgaagcgagagatagttggggtggtggaacctgtgccccatgatgaaacatactgtgaccccgcatctctgttccatgttctaatgattactcattcattc gatattacacaaggaccctttaccaattccagtttcaagaagcactttgtcaagcagctaaacatgaaggcct ctgcacaaatgtgacatctcaaactctacagaagctggacagaaactgttcaatatgctgaggcttggaaaatcagaaccctggaccctagcattggaaaatgttgtaggagcaaagaacatgaatgtaaggccactgctcaactactttgagcccttatttacctggctgaaagaccagaacaagaattcttttgtgggatggagtaccgactgg agtccatatgcagaccaaagcatcaaagtgaggataagcctaaaatcagctcttggagataaagcatatga atggaacgacaatgaaatgtacctgttccgatcatctgttgcatatgctatgaggcagtactttttaaaagtaaaaatcagatgattctttttggggaggaggatgtgcgagtggctaatttgaaaccaagaatctcctttaatttctttgtcactgcacctaaaaatgtgtctgatatcattcctagaactgaagtt gaaaaggccatcaggatgtcccggagccgtatcaatgatgctttccgtctgaatgacaacagcctagagtttctggggatacagccaa cacttggacctcctaaccagccccctgtttccatatggctgattgtttttggagttgtgatggggagtgatagtggttggcattgtcatcctgatcttcactgggatcagagatcggaagaagaaaaataaagcaagaagtggagaaaatccttatgcctccatcgatattagcaaaggagaaaataatccaggattccaaaacact gatgatgttcagacctccttttagaacttgtttattgcagcttataatggttacaaataaagcaatagcatcacaaatttca caaataaagcatttttttcactgcattctagttgtggtttgtccaaactcatcaatgtatcttaacgcgtgcgaagttcctattctctgaaagtataggaacttctaacctcccgggtgacagataacttcgtataatgtatgctatacgaagttatgtcgacgcggcgcgccgaattataatactaacattactgaagaaaat gcccaaaagatggtaagttcttgaggctacccagggggttattgattgcttcttaaagatcagaattactgcctataaaact ggataaggaaatcatagagatctctcaagtgtgaggatgagtgactgcctctgtagctctgatcctagtctcccagatggctaaattcaattgaccttagagttcatctggaaaattgttatgaatgaattatttgcccagattccaaagatgagtgaaaatgtttaataaagttgccatcactattctcattatatttggtatgtaaagcattcat ggaaatgttctaagtcgttattgagccaataattttctttagcttataatgccaacaggtctatccgaga actacaaatgacatattaactgaaaaatgcaactggggtttactgaaggcagcagcttagtaattaaggtaaccatggcttaggtgaaactggacctgggaattccttctttcattgacacagagctctgaggaatttccaaaggtcacagaagaaaagctataattaaactagtcccaaaaaaatctcagcctactctgggaaagcagcatattttgtttga caagtgcaaggacttagaacttttttttttctcactgatcctgaagtgccttttaagtatagtta agtggtggaaaattgagcaactatttaagaaaagactctttttttcttcttccagcaatgctttccttcaaaacggtagcttcaaaacttcctgtcttttaaatgatcagggggctgtgtgtgtttaaattattgccattcatagaacagagtgggtctgaggatgcctgtttcctttgaaattctatgccccctcccag ttttctaaaatttaagaaaccacagagactttgacaatgtagttgccaaatgagttgcttttaactgctctaatagtttggtctt(seq id no:32)
实施例7人源化ACE2基因改造小鼠的形成Example 7 Formation of Humanized ACE2 Gene-Transformed Mice
1、腹腔注射7.5单位PMSG到4-10周龄B6C3F1雌鼠,48小时后注射hCG并与CD1种公鼠合笼,次日早上检查雌鼠阴栓并将有阴栓的雌鼠挑出,记录雌鼠相应的受精时间。1. Intraperitoneally inject 7.5 units of PMSG into 4-10 week old B6C3F1 female mice. 48 hours later, inject hCG and place them in a cage with CD1 male mice. The next morning, check the female mice for vaginal plugs and select the female mice with vaginal plugs. Record the corresponding fertilization time of the female mice.
2、下一日将孕鼠进行颈椎脱臼法安乐死,70%酒精消毒小鼠腹部,用辅料镊子和眼科剪刀剪开腹部皮肤和肌肉层,打开腹腔。用镊子抓住一个子宫角的上部,用剪刀在靠近输卵管的膜上开一个小口,将输卵管和卵巢连接处剪断,将输卵管和附带的子宫移至35mm培养皿中,用镊子固定好输卵管伞口端,用装填好M2培养液(Hogan,B.(1994).Manipulating the mouse embryo:a laboratory manual,2nd edn(Cold SpringHarbor,NY,Cold Spring Harbor Laboratory Press).)的冲洗针轻轻插入伞口,用0.1mL的M2培养液冲洗输卵管,用移卵管收集冲出的胚胎并用M2清洗3次,收集E1.5的小鼠2-细胞胚胎。2. The next day, the pregnant mice were euthanized by cervical dislocation, the abdomen of the mice was disinfected with 70% alcohol, and the abdominal skin and muscle layer were cut open with auxiliary forceps and ophthalmic scissors to open the abdominal cavity. The upper part of one uterine horn was grasped with forceps, and a small incision was made on the membrane near the fallopian tube with scissors, and the connection between the fallopian tube and the ovary was cut off. The fallopian tube and the attached uterus were moved to a 35 mm culture dish, and the fimbria of the fallopian tube was fixed with forceps. The flushing needle filled with M2 culture medium (Hogan, B. (1994). Manipulating the mouse embryo: a laboratory manual, 2nd edn (Cold Spring Harbor, NY, Cold Spring Harbor Laboratory Press).) was gently inserted into the fimbria, and the fallopian tube was flushed with 0.1 mL of M2 culture medium. The embryos flushed out were collected with an oviduct transfer tube and washed 3 times with M2 to collect E1.5 mouse 2-cell embryos.
3、对收集到的小鼠胚胎放入含0.1mM MgSO4,0.1mM CaCl2和0.3%牛血清白蛋白的0.3M甘露醇(Sigma-Aldrich Inc.,St.Louis,MO)中,用Cellfusion CF-150/B电融合仪和250-um融合槽(BLS Ltd.,Budapest,Hungary)进行60V 50微秒直流电融合,获得4倍体胚胎;放入KSOM培养基(Summers,M.C.,McGinnis,L.K.,Lawitts,J.A.,Raffin,M.,andBiggers,J.D.(2000).IVF of mouse ova in a simplex optimized mediumsupplemented with amino acids.Hum Reprod 15,1791-1801.),于CO2培养箱中培养24小时后用酸性台氏液(Sigma-Aldrich,T1788)去掉透明带,与胚胎干细胞(即本申请制备的人源化ACE2小鼠胚胎干细胞)聚集,形成嵌合体胚胎(Nagy,A.,Rossant,J.,Nagy,R.,Abramow-Newerly,W.,and Roder,J.C.(1993).Derivation of completely cellculture-derived mice from early-passage embryonic stem cells.Proc Natl AcadSci U S A 90,8424-8428.)。3. The collected mouse embryos were placed in 0.3 M mannitol (Sigma-Aldrich Inc., St. Louis, MO) containing 0.1 mM MgSO4, 0.1 mM CaCl2 and 0.3% bovine serum albumin, and fused with 60
4、于CO2培养箱中培养过夜,移植到E2.5天的假孕鼠子宫,17天后在将代孕鼠颈椎脱臼法安乐死后进行剖腹手术,存活和呼吸的新生鼠放入代乳鼠笼内,21天后断奶,获得完全来自胚胎干细胞的转基因小鼠,即得人源化ACE2基因改造小鼠(图12)。4. Cultured overnight in a CO2 incubator, transplanted into the uterus of a pseudo-pregnant mouse on day E2.5, and euthanized by cervical dislocation after 17 days, and then performed a laparotomy. The surrogate mice that were alive and breathing were placed in a suckling mouse cage and weaned after 21 days to obtain transgenic mice completely derived from embryonic stem cells, i.e., humanized ACE2 gene-modified mice (Figure 12).
相应的培养基配方如表7所示。The corresponding culture medium formula is shown in Table 7.
表7培养基配方Table 7 Culture medium formula
实施例8人源化ACE2基因改造小鼠的hACE2表达验证Example 8 Verification of hACE2 expression in humanized ACE2 gene-modified mice
取一只野生型小鼠以及一只人源化ACE2基因改造小鼠(实施例7制备得到的)的小肠、肺以及肾脏样品,用于RNA提取以及qPCR验证hACE2基因表达情况。组织样品加入Trizol试剂裂解。每毫升的Trizol裂解液中加入200μl氯仿,剧烈震荡后静置至分层,重复操作3次后,4℃,14000rpm离心15min,取上清400μl至预冷的400μl异丙醇中,颠倒混匀,冰上静置5min,4℃,14000pm离心10min,在EP管底部可见白色沉淀,为RNA。Small intestine, lung and kidney samples of a wild-type mouse and a humanized ACE2 gene-modified mouse (prepared in Example 7) were taken for RNA extraction and qPCR verification of hACE2 gene expression. Tissue samples were lysed by adding Trizol reagent. 200 μl of chloroform was added to each milliliter of Trizol lysate, and the mixture was allowed to stand until stratification after vigorous shaking. After repeating the
弃上清,加入预冷的70%乙醇溶液,洗涤一次,离心,去掉乙醇,将RNA晾干至无色透明,按照沉淀多少,加入适量RNase-free H2O,60℃水浴至完全溶解。分别取1μl,用10mMTris-HCl(pH 7.5)以及H2O稀释50倍测定纯度及浓度,OD260/280的值为2.0左右为宜。提完的RNA存放于-80℃,可长期保存。Discard the supernatant, add pre-cooled 70% ethanol solution, wash once, centrifuge, remove ethanol, dry RNA until colorless and transparent, add appropriate amount of RNase-free H2O according to the amount of precipitation, and bathe in 60℃ water until completely dissolved. Take 1μl respectively, dilute 50 times with 10mMTris-HCl (pH 7.5) and H2O to determine the purity and concentration, and the OD260/280 value is about 2.0. The extracted RNA is stored at -80℃ for long-term storage.
取2μg RNA进行反转,体系如表8所示。反转得到的cDNA样品稀释30倍后,进一步用来做荧光定量PCR检测基因表达情况,hACE2检测qPCR引物如下(hACE2-qF:GGTCTTCAGTGCTCTCAG(SEQ ID NO:33);hACE2-qR:GCATTCTTGTGGATTATCTGG(SEQ ID NO:34)),并可见人源化ACE2基因改造小鼠在RNA水平上表达人ACE2并具有组织特异性(图13)。2 μg RNA was taken for inversion, and the system is shown in Table 8. After the cDNA sample obtained by inversion was diluted 30 times, it was further used for fluorescence quantitative PCR to detect gene expression. The hACE2 detection qPCR primers are as follows (hACE2-qF: GGTCTTCAGTGCTCTCAG (SEQ ID NO: 33); hACE2-qR: GCATTCTTGTGGATTATCTGG (SEQ ID NO: 34)), and it can be seen that the humanized ACE2 gene-modified mice express human ACE2 at the RNA level and have tissue specificity (Figure 13).
表8Table 8
另外再取人源化ACE2基因改造小鼠的小肠、肺以及睾丸组织进行免疫荧光组织蛋白水平验证人ACE2表达情况。将小鼠组织在4%PFA中于4℃固定过夜。第二天,将组织在PBS中漂洗3次,并在4℃在PBS中的30%蔗糖中冷冻保护过夜。然后将组织在30%蔗糖/PBS和O.C.T.按1:1混合物中孵育2-4小时。冷冻嵌入培养基(Sakura,目录号4583)。接下来,将组织从蔗糖/OCT混合物中转移到低温模具中,并用O.C.T.填充。将包埋的组织在干冰上冷冻,然后保存在-80℃直至低温恒温器切片。使用低温恒温器将冷冻的类器官组织切成20μm切片,并收集在superfrost Ultra Plus载玻片上。将切片干燥过夜,然后用于免疫荧光。在室温下,将4%PFA直接后固定在载玻片上10分钟,然后在PBS中洗涤3x10次。使用疏水性PAP笔勾勒出组织区域的轮廓。使用含0.05%叠氮化钠的5%BSA/0.3%TX100的PBS封闭,并在室温下孵育30分钟。然后将组织与抗ACE2抗体(ET1611-58,Huabio)在4℃孵育过夜。PBS洗涤后,二抗(A11004,Invitrogen)孵育1小时,然后DAPI 2分钟。最后,将盖玻片安装在载玻片上进行观察,可从组织免疫荧光结果中看出人ACE2在蛋白表达水平上在人源化ACE2基因改造小鼠组织上特异表达(图14)。In addition, the small intestine, lung and testicular tissues of humanized ACE2 gene-modified mice were taken for immunofluorescence tissue protein level verification of human ACE2 expression. Mouse tissues were fixed in 4% PFA at 4°C overnight. The next day, the tissues were rinsed 3 times in PBS and cryoprotected in 30% sucrose in PBS at 4°C overnight. The tissues were then incubated in a 1:1 mixture of 30% sucrose/PBS and O.C.T. for 2-4 hours. Frozen embedding medium (Sakura, catalog number 4583). Next, the tissues were transferred from the sucrose/OCT mixture to a cryogenic mold and filled with O.C.T. The embedded tissues were frozen on dry ice and then stored at -80°C until cryostat sectioning. The frozen organoid tissues were cut into 20 μm slices using a cryostat and collected on superfrost Ultra Plus slides. The sections were dried overnight and then used for immunofluorescence. At room temperature, 4% PFA was directly post-fixed on the slide for 10 minutes, and then washed 3x10 times in PBS. The outline of the tissue area was outlined using a hydrophobic PAP pen. Blocked with 5% BSA/0.3% TX100 in PBS containing 0.05% sodium azide and incubated at room temperature for 30 minutes. The tissue was then incubated with anti-ACE2 antibody (ET1611-58, Huabio) at 4°C overnight. After washing with PBS, the secondary antibody (A11004, Invitrogen) was incubated for 1 hour, and then DAPI for 2 minutes. Finally, the coverslip was mounted on a slide for observation, and it can be seen from the tissue immunofluorescence results that human ACE2 is specifically expressed in humanized ACE2 genetically modified mouse tissues at the protein expression level (Figure 14).
实施例9 SARS-CoV-2感染人源化ACE2基因改造小鼠Example 9 SARS-CoV-2 infection of humanized ACE2 genetically modified mice
人源化ACE2基因改造小鼠通过滴鼻吸入30μl DMEM,内含2×106TCID50病毒载量的SARS-CoV-2,对照组则滴入30μl DMEM。分别在D1、D3、D5、D6、D7以及D9取感染小鼠的肺、气道和小肠组织用Trizol裂解并进行RNA提取以及荧光定量PCR检测SARS-CoV-2病毒滴度,RNA提取方法同上述hACE2表达检测实施例一致。提取得到的组织RNA将使用Probe One-step qRT-PCR Kit(Toyobo)试剂盒对SARS-CoV-2的核蛋白N基因进行检测。荧光定量PCR检测所用到的引物如下:正向引物:5’-GGGGAACTTCTCCTGCTAGAAT-3’(SEQ IDNO:35);反向引物:5’-CAGACATTTTGCTCTCAAGCTG-3’(SEQ ID NO:36),TaqMan探针序列为5’-FAM-TTGCTGCTGCTTGACAGATT-TAMRA-3’(SEQ ID NO:37),结果可见人源化ACE2基因改造小鼠经SARS-CoV-2感染后肺、气道及小肠在D1、D3病毒载量达到顶峰,随之下降,表明本发明人源化ACE2基因改造小鼠对SARS-CoV-2易感(图15),而对照组小鼠并未检测到病毒。图15中的折线表示平均值。The humanized ACE2 gene-modified mice inhaled 30 μl of DMEM containing 2×10 6 TCID 50 viral load of SARS-CoV-2 through nasal drops, while the control group was dripped with 30 μl of DMEM. The lung, airway and small intestinal tissues of the infected mice were taken on D1, D3, D5, D6, D7 and D9, and RNA was extracted and fluorescent quantitative PCR was used to detect the SARS-CoV-2 virus titer. The RNA extraction method was consistent with the above-mentioned hACE2 expression detection example. The extracted tissue RNA was used The Probe One-step qRT-PCR Kit (Toyobo) was used to detect the nucleoprotein N gene of SARS-CoV-2. The primers used for the fluorescence quantitative PCR test are as follows: forward primer: 5'-GGGGAACTTCTCCTGCTAGAAT-3' (SEQ ID NO: 35); reverse primer: 5'-CAGACATTTTGCTCTCAAGCTG-3' (SEQ ID NO: 36), and the TaqMan probe sequence is 5'-FAM-TTGCTGCTGCTTGACAGATT-TAMRA-3' (SEQ ID NO: 37). The results show that the humanized ACE2 gene-modified mice were infected with SARS-CoV-2. The lung, airway and small intestine reached a peak viral load on D1 and D3, and then decreased, indicating that the humanized ACE2 gene-modified mice of the present invention are susceptible to SARS-CoV-2 (Figure 15), while the control group mice did not detect the virus. The broken line in Figure 15 represents the average value.
序列表Sequence Listing
<110> 中国科学院广州生物医药与健康研究院<110> Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences
<120> 人源化ACE2基因改造小鼠胚胎干细胞模型制备方法及应用<120> Preparation method and application of humanized ACE2 gene-modified mouse embryonic stem cell model
<150> 202010152182.X<150> 202010152182.X
<151> 2020-03-06<151> 2020-03-06
<160> 37<160> 37
<170> SIPOSequenceListing 1.0<170> SIPOSequenceListing 1.0
<210> 1<210> 1
<211> 23<211> 23
<212> DNA<212> DNA
<213> Natural sequence<213> Natural sequence
<220><220>
<223> sgRNA1序列<223> sgRNA1 sequence
<400> 1<400> 1
tactgctcag tccctcaccg agg 23tactgctcag tccctcaccg agg 23
<210> 2<210> 2
<211> 23<211> 23
<212> DNA<212> DNA
<213> Natural sequence<213> Natural sequence
<220><220>
<223> sgRNA2序列<223> sgRNA2 sequence
<400> 2<400> 2
cttggcattt tcctcggtga ggg 23cttggcattt tcctcggtga ggg 23
<210> 3<210> 3
<211> 23<211> 23
<212> DNA<212> DNA
<213> Natural sequence<213> Natural sequence
<220><220>
<223> sgRNRA3序列<223> sgRNRA3 sequence
<400> 3<400> 3
caagtgaact ttgataagac agg 23caagtgaactttgataagac agg 23
<210> 4<210> 4
<211> 25<211> 25
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> 合成sgRNA2的上游单链引物序列<223> Synthesize the upstream single-stranded primer sequence of sgRNA2
<400> 4<400> 4
caccgcttgg cattttcctc ggtga 25caccgcttgg cattttcctc ggtga 25
<210> 5<210> 5
<211> 25<211> 25
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> 合成sgRNA2的下游单链引物序列<223> Synthesize the downstream single-stranded primer sequence of sgRNA2
<400> 5<400> 5
aaactcaccg aggaaaatgc caagc 25aaactcaccg aggaaaatgc caagc 25
<210> 6<210> 6
<211> 25<211> 25
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> 合成sgRNA1的上游单链引物序列<223> Synthesize the upstream single-stranded primer sequence of sgRNA1
<400> 6<400> 6
caccgtactg ctcagtccct caccg 25caccgtactg ctcagtccct caccg 25
<210> 7<210> 7
<211> 25<211> 25
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> 合成sgRNA1的下游单链引物序列<223> Synthesize the downstream single-stranded primer sequence of sgRNA1
<400> 7<400> 7
aaaccggtga gggactgagc agtac 25aaaccggtga gggactgagc agtac 25
<210> 8<210> 8
<211> 25<211> 25
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> 合成sgRNA3的上游单链引物序列<223> Synthesize the upstream single-stranded primer sequence of sgRNA3
<400> 8<400> 8
caccgcaagt gaactttgat aagac 25caccgcaagt gaactttgat aagac 25
<210> 9<210> 9
<211> 25<211> 25
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> 合成sgRNA3的下游单链引物序列<223> Synthesize the downstream single-stranded primer sequence of sgRNA3
<400> 9<400> 9
aaacgtctta tcaaagttca cttgc 25aaacgtctta tcaaagttca cttgc 25
<210> 10<210> 10
<211> 28<211> 28
<212> DNA<212> DNA
<213> Natural sequence<213> Natural sequence
<220><220>
<223> 5arm-sgF<223> 5arm-sgF
<400> 10<400> 10
ggttttgatt tggccataaa atgttagc 28ggttttgatt tggccataaa atgttagc 28
<210> 11<210> 11
<211> 25<211> 25
<212> DNA<212> DNA
<213> Natural sequence<213> Natural sequence
<220><220>
<223> 3arm-sgR<223> 3arm-sgR
<400> 11<400> 11
attcccaggt ccagtttcac ctaag 25attcccaggt ccagtttcac ctaag 25
<210> 12<210> 12
<211> 2418<211> 2418
<212> DNA<212> DNA
<213> Natural sequence<213> Natural sequence
<220><220>
<223> hACE2-CDS序列<223> hACE2-CDS sequence
<400> 12<400> 12
atgtcaagct cttcctggct ccttctcagc cttgttgctg taactgctgc tcagtccacc 60atgtcaagct cttcctggct ccttctcagc cttgttgctg taactgctgc tcagtccacc 60
attgaggaac aggccaagac atttttggac aagtttaacc acgaagccga agacctgttc 120attgaggaac aggccaagac atttttggac aagtttaacc acgaagccga agacctgttc 120
tatcaaagtt cacttgcttc ttggaattat aacaccaata ttactgaaga gaatgtccaa 180tatcaaagtt cacttgcttc ttggaattat aacaccaata ttactgaaga gaatgtccaa 180
aacatgaata atgctgggga caaatggtct gcctttttaa aggaacagtc cacacttgcc 240aacatgaata atgctgggga caaatggtct gcctttttaa aggaacagtc cacacttgcc 240
caaatgtatc cactacaaga aattcagaat ctcacagtca agcttcagct gcaggctctt 300caaatgtatc cactacaaga aattcagaat ctcacagtca agcttcagct gcaggctctt 300
cagcaaaatg ggtcttcagt gctctcagaa gacaagagca aacggttgaa cacaattcta 360cagcaaaatg ggtcttcagt gctctcagaa gacaagagca aacggttgaa cacaattcta 360
aatacaatga gcaccatcta cagtactgga aaagtttgta acccagataa tccacaagaa 420aatacaatga gcaccatcta cagtactgga aaagtttgta acccagataa tccacaagaa 420
tgcttattac ttgaaccagg tttgaatgaa ataatggcaa acagtttaga ctacaatgag 480tgcttattac ttgaaccagg tttgaatgaa ataatggcaa acagtttaga ctacaatgag 480
aggctctggg cttgggaaag ctggagatct gaggtcggca agcagctgag gccattatat 540aggctctggg cttgggaaag ctggagatct gaggtcggca agcagctgag gccattatat 540
gaagagtatg tggtcttgaa aaatgagatg gcaagagcaa atcattatga ggactatggg 600gaagagtatg tggtcttgaa aaatgagatg gcaagagcaa atcattatga ggactatggg 600
gattattgga gaggagacta tgaagtaaat ggggtagatg gctatgacta cagccgcggc 660gattattgga gaggagacta tgaagtaaat ggggtagatg gctatgacta cagccgcggc 660
cagttgattg aagatgtgga acataccttt gaagagatta aaccattata tgaacatctt 720cagttgattg aagatgtgga acataccttt gaagagatta aaccattata tgaacatctt 720
catgcctatg tgagggcaaa gttgatgaat gcctatcctt cctatatcag tccaattgga 780catgcctatg tgagggcaaa gttgatgaat gcctatcctt cctatatcag tccaattgga 780
tgcctccctg ctcatttgct tggtgatatg tggggtagat tttggacaaa tctgtactct 840tgcctccctg ctcatttgct tggtgatatg tggggtagat tttggacaaa tctgtactct 840
ttgacagttc cctttggaca gaaaccaaac atagatgtta ctgatgcaat ggtggaccag 900ttgacagttc cctttggaca gaaaccaaac atagatgtta ctgatgcaat ggtggaccag 900
gcctgggatg cacagagaat attcaaggag gccgagaagt tctttgtatc tgttggtctt 960gcctgggatg cacagagaat attcaaggag gccgagaagt tctttgtatc tgttggtctt 960
cctaatatga ctcaaggatt ctgggaaaat tccatgctaa cggacccagg aaatgttcag 1020cctaatatga ctcaaggatt ctgggaaaat tccatgctaa cggacccagg aaatgttcag 1020
aaagcagtct gccatcccac agcttgggac ctggggaagg gcgacttcag gatccttatg 1080aaagcagtct gccatcccac agcttgggac ctggggaagg gcgacttcag gatccttatg 1080
tgcacaaagg tgacaatgga cgacttcctg acagctcatc atgagatggg gcatatccag 1140tgcacaaagg tgacaatgga cgacttcctg acagctcatc atgagatggg gcatatccag 1140
tatgatatgg catatgctgc acaacctttt ctgctaagaa atggagctaa tgaaggattc 1200tatgatatgg catatgctgc acaacctttt ctgctaagaa atggagctaa tgaaggattc 1200
catgaagctg ttggggaaat catgtcactt tctgcagcca cacctaagca tttaaaatcc 1260catgaagctg ttggggaaat catgtcactt tctgcagcca cacctaagca tttaaaatcc 1260
attggtcttc tgtcacccga ttttcaagaa gacaatgaaa cagaaataaa cttcctgctc 1320attggtcttc tgtcacccga ttttcaagaa gacaatgaaa cagaaataaa cttcctgctc 1320
aaacaagcac tcacgattgt tgggactctg ccatttactt acatgttaga gaagtggagg 1380aaacaagcac tcacgattgt tgggactctg ccatttactt acatgttaga gaagtggagg 1380
tggatggtct ttaaagggga aattcccaaa gaccagtgga tgaaaaagtg gtgggagatg 1440tggatggtct ttaaagggga aattcccaaa gaccagtgga tgaaaaagtg gtgggatg 1440
aagcgagaga tagttggggt ggtggaacct gtgccccatg atgaaacata ctgtgacccc 1500aagcgagaga tagttggggt ggtggaacct gtgccccatg atgaaacata ctgtgacccc 1500
gcatctctgt tccatgtttc taatgattac tcattcattc gatattacac aaggaccctt 1560gcatctctgt tccatgtttc taatgattac tcattcattc gatattacac aaggaccctt 1560
taccaattcc agtttcaaga agcactttgt caagcagcta aacatgaagg ccctctgcac 1620taccaattcc agtttcaaga agcactttgt caagcagcta aacatgaagg ccctctgcac 1620
aaatgtgaca tctcaaactc tacagaagct ggacagaaac tgttcaatat gctgaggctt 1680aaatgtgaca tctcaaactc tacagaagct ggacagaaac tgttcaatat gctgaggctt 1680
ggaaaatcag aaccctggac cctagcattg gaaaatgttg taggagcaaa gaacatgaat 1740ggaaaatcag aaccctggac cctagcattg gaaaatgttg taggagcaaa gaacatgaat 1740
gtaaggccac tgctcaacta ctttgagccc ttatttacct ggctgaaaga ccagaacaag 1800gtaaggccac tgctcaacta ctttgagccc ttatttacct ggctgaaaga ccagaacaag 1800
aattcttttg tgggatggag taccgactgg agtccatatg cagaccaaag catcaaagtg 1860aattcttttg tgggatggag taccgactgg agtccatatg cagaccaaag catcaaagtg 1860
aggataagcc taaaatcagc tcttggagat aaagcatatg aatggaacga caatgaaatg 1920aggataagcc taaaatcagc tcttggagat aaagcatatg aatggaacga caatgaaatg 1920
tacctgttcc gatcatctgt tgcatatgct atgaggcagt actttttaaa agtaaaaaat 1980tacctgttcc gatcatctgt tgcatatgct atgaggcagt actttttaaa agtaaaaaat 1980
cagatgattc tttttgggga ggaggatgtg cgagtggcta atttgaaacc aagaatctcc 2040cagatgattc tttttgggga ggaggatgtg cgagtggcta atttgaaacc aagaatctcc 2040
tttaatttct ttgtcactgc acctaaaaat gtgtctgata tcattcctag aactgaagtt 2100tttaatttct ttgtcactgc acctaaaaat gtgtctgata tcattcctag aactgaagtt 2100
gaaaaggcca tcaggatgtc ccggagccgt atcaatgatg ctttccgtct gaatgacaac 2160gaaaaggcca tcaggatgtc ccggagccgt atcaatgatg ctttccgtct gaatgacaac 2160
agcctagagt ttctggggat acagccaaca cttggacctc ctaaccagcc ccctgtttcc 2220agcctagagt ttctggggat acagccaaca cttggacctc ctaaccagcc ccctgtttcc 2220
atatggctga ttgtttttgg agttgtgatg ggagtgatag tggttggcat tgtcatcctg 2280atatggctga ttgtttttgg agttgtgatg ggagtgatag tggttggcat tgtcatcctg 2280
atcttcactg ggatcagaga tcggaagaag aaaaataaag caagaagtgg agaaaatcct 2340atcttcactg ggatcagaga tcggaagaag aaaaataaag caagaagtgg agaaaatcct 2340
tatgcctcca tcgatattag caaaggagaa aataatccag gattccaaaa cactgatgat 2400tatgcctcca tcgatattag caaaggagaa aataatccag gattccaaaa cactgatgat 2400
gttcagacct ccttttag 2418gttcagacctccttttag 2418
<210> 13<210> 13
<211> 805<211> 805
<212> PRT<212> PRT
<213> Natural sequence<213> Natural sequence
<220><220>
<223> hACE2-protein序列<223> hACE2-protein sequence
<400> 13<400> 13
Met Ser Ser Ser Ser Trp Leu Leu Leu Ser Leu Val Ala Val Thr AlaMet Ser Ser Ser Ser Trp Leu Leu Leu Ser Leu Val Ala Val Thr Ala
1 5 10 151 5 10 15
Ala Gln Ser Thr Ile Glu Glu Gln Ala Lys Thr Phe Leu Asp Lys PheAla Gln Ser Thr Ile Glu Glu Gln Ala Lys Thr Phe Leu Asp Lys Phe
20 25 3020 25 30
Asn His Glu Ala Glu Asp Leu Phe Tyr Gln Ser Ser Leu Ala Ser TrpAsn His Glu Ala Glu Asp Leu Phe Tyr Gln Ser Ser Leu Ala Ser Trp
35 40 4535 40 45
Asn Tyr Asn Thr Asn Ile Thr Glu Glu Asn Val Gln Asn Met Asn AsnAsn Tyr Asn Thr Asn Ile Thr Glu Glu Asn Val Gln Asn Met Asn Asn
50 55 6050 55 60
Ala Gly Asp Lys Trp Ser Ala Phe Leu Lys Glu Gln Ser Thr Leu AlaAla Gly Asp Lys Trp Ser Ala Phe Leu Lys Glu Gln Ser Thr Leu Ala
65 70 75 8065 70 75 80
Gln Met Tyr Pro Leu Gln Glu Ile Gln Asn Leu Thr Val Lys Leu GlnGln Met Tyr Pro Leu Gln Glu Ile Gln Asn Leu Thr Val Lys Leu Gln
85 90 9585 90 95
Leu Gln Ala Leu Gln Gln Asn Gly Ser Ser Val Leu Ser Glu Asp LysLeu Gln Ala Leu Gln Gln Asn Gly Ser Ser Val Leu Ser Glu Asp Lys
100 105 110100 105 110
Ser Lys Arg Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr SerSer Lys Arg Leu Asn Thr Ile Leu Asn Thr Met Ser Thr Ile Tyr Ser
115 120 125115 120 125
Thr Gly Lys Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu LeuThr Gly Lys Val Cys Asn Pro Asp Asn Pro Gln Glu Cys Leu Leu Leu
130 135 140130 135 140
Glu Pro Gly Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn GluGlu Pro Gly Leu Asn Glu Ile Met Ala Asn Ser Leu Asp Tyr Asn Glu
145 150 155 160145 150 155 160
Arg Leu Trp Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln LeuArg Leu Trp Ala Trp Glu Ser Trp Arg Ser Glu Val Gly Lys Gln Leu
165 170 175165 170 175
Arg Pro Leu Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala ArgArg Pro Leu Tyr Glu Glu Tyr Val Val Leu Lys Asn Glu Met Ala Arg
180 185 190180 185 190
Ala Asn His Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr GluAla Asn His Tyr Glu Asp Tyr Gly Asp Tyr Trp Arg Gly Asp Tyr Glu
195 200 205195 200 205
Val Asn Gly Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile GluVal Asn Gly Val Asp Gly Tyr Asp Tyr Ser Arg Gly Gln Leu Ile Glu
210 215 220210 215 220
Asp Val Glu His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His LeuAsp Val Glu His Thr Phe Glu Glu Ile Lys Pro Leu Tyr Glu His Leu
225 230 235 240225 230 235 240
His Ala Tyr Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr IleHis Ala Tyr Val Arg Ala Lys Leu Met Asn Ala Tyr Pro Ser Tyr Ile
245 250 255245 250 255
Ser Pro Ile Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp GlySer Pro Ile Gly Cys Leu Pro Ala His Leu Leu Gly Asp Met Trp Gly
260 265 270260 265 270
Arg Phe Trp Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln LysArg Phe Trp Thr Asn Leu Tyr Ser Leu Thr Val Pro Phe Gly Gln Lys
275 280 285275 280 285
Pro Asn Ile Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp AlaPro Asn Ile Asp Val Thr Asp Ala Met Val Asp Gln Ala Trp Asp Ala
290 295 300290 295 300
Gln Arg Ile Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly LeuGln Arg Ile Phe Lys Glu Ala Glu Lys Phe Phe Val Ser Val Gly Leu
305 310 315 320305 310 315 320
Pro Asn Met Thr Gln Gly Phe Trp Glu Asn Ser Met Leu Thr Asp ProPro Asn Met Thr Gln Gly Phe Trp Glu Asn Ser Met Leu Thr Asp Pro
325 330 335325 330 335
Gly Asn Val Gln Lys Ala Val Cys His Pro Thr Ala Trp Asp Leu GlyGly Asn Val Gln Lys Ala Val Cys His Pro Thr Ala Trp Asp Leu Gly
340 345 350340 345 350
Lys Gly Asp Phe Arg Ile Leu Met Cys Thr Lys Val Thr Met Asp AspLys Gly Asp Phe Arg Ile Leu Met Cys Thr Lys Val Thr Met Asp Asp
355 360 365355 360 365
Phe Leu Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met AlaPhe Leu Thr Ala His His Glu Met Gly His Ile Gln Tyr Asp Met Ala
370 375 380370 375 380
Tyr Ala Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly PheTyr Ala Ala Gln Pro Phe Leu Leu Arg Asn Gly Ala Asn Glu Gly Phe
385 390 395 400385 390 395 400
His Glu Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro LysHis Glu Ala Val Gly Glu Ile Met Ser Leu Ser Ala Ala Thr Pro Lys
405 410 415405 410 415
His Leu Lys Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp AsnHis Leu Lys Ser Ile Gly Leu Leu Ser Pro Asp Phe Gln Glu Asp Asn
420 425 430420 425 430
Glu Thr Glu Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val GlyGlu Thr Glu Ile Asn Phe Leu Leu Lys Gln Ala Leu Thr Ile Val Gly
435 440 445435 440 445
Thr Leu Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val PheThr Leu Pro Phe Thr Tyr Met Leu Glu Lys Trp Arg Trp Met Val Phe
450 455 460450 455 460
Lys Gly Glu Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu MetLys Gly Glu Ile Pro Lys Asp Gln Trp Met Lys Lys Trp Trp Glu Met
465 470 475 480465 470 475 480
Lys Arg Glu Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu ThrLys Arg Glu Ile Val Gly Val Val Glu Pro Val Pro His Asp Glu Thr
485 490 495485 490 495
Tyr Cys Asp Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser PheTyr Cys Asp Pro Ala Ser Leu Phe His Val Ser Asn Asp Tyr Ser Phe
500 505 510500 505 510
Ile Arg Tyr Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu AlaIle Arg Tyr Tyr Thr Arg Thr Leu Tyr Gln Phe Gln Phe Gln Glu Ala
515 520 525515 520 525
Leu Cys Gln Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp IleLeu Cys Gln Ala Ala Lys His Glu Gly Pro Leu His Lys Cys Asp Ile
530 535 540530 535 540
Ser Asn Ser Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg LeuSer Asn Ser Thr Glu Ala Gly Gln Lys Leu Phe Asn Met Leu Arg Leu
545 550 555 560545 550 555 560
Gly Lys Ser Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly AlaGly Lys Ser Glu Pro Trp Thr Leu Ala Leu Glu Asn Val Val Gly Ala
565 570 575565 570 575
Lys Asn Met Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu PheLys Asn Met Asn Val Arg Pro Leu Leu Asn Tyr Phe Glu Pro Leu Phe
580 585 590580 585 590
Thr Trp Leu Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser ThrThr Trp Leu Lys Asp Gln Asn Lys Asn Ser Phe Val Gly Trp Ser Thr
595 600 605595 600 605
Asp Trp Ser Pro Tyr Ala Asp Gln Ser Ile Lys Val Arg Ile Ser LeuAsp Trp Ser Pro Tyr Ala Asp Gln Ser Ile Lys Val Arg Ile Ser Leu
610 615 620610 615 620
Lys Ser Ala Leu Gly Asp Lys Ala Tyr Glu Trp Asn Asp Asn Glu MetLys Ser Ala Leu Gly Asp Lys Ala Tyr Glu Trp Asn Asp Asn Glu Met
625 630 635 640625 630 635 640
Tyr Leu Phe Arg Ser Ser Val Ala Tyr Ala Met Arg Gln Tyr Phe LeuTyr Leu Phe Arg Ser Ser Val Ala Tyr Ala Met Arg Gln Tyr Phe Leu
645 650 655645 650 655
Lys Val Lys Asn Gln Met Ile Leu Phe Gly Glu Glu Asp Val Arg ValLys Val Lys Asn Gln Met Ile Leu Phe Gly Glu Glu Asp Val Arg Val
660 665 670660 665 670
Ala Asn Leu Lys Pro Arg Ile Ser Phe Asn Phe Phe Val Thr Ala ProAla Asn Leu Lys Pro Arg Ile Ser Phe Asn Phe Phe Val Thr Ala Pro
675 680 685675 680 685
Lys Asn Val Ser Asp Ile Ile Pro Arg Thr Glu Val Glu Lys Ala IleLys Asn Val Ser Asp Ile Ile Pro Arg Thr Glu Val Glu Lys Ala Ile
690 695 700690 695 700
Arg Met Ser Arg Ser Arg Ile Asn Asp Ala Phe Arg Leu Asn Asp AsnArg Met Ser Arg Ser Arg Ile Asn Asp Ala Phe Arg Leu Asn Asp Asn
705 710 715 720705 710 715 720
Ser Leu Glu Phe Leu Gly Ile Gln Pro Thr Leu Gly Pro Pro Asn GlnSer Leu Glu Phe Leu Gly Ile Gln Pro Thr Leu Gly Pro Pro Asn Gln
725 730 735725 730 735
Pro Pro Val Ser Ile Trp Leu Ile Val Phe Gly Val Val Met Gly ValPro Pro Val Ser Ile Trp Leu Ile Val Phe Gly Val Val Met Gly Val
740 745 750740 745 750
Ile Val Val Gly Ile Val Ile Leu Ile Phe Thr Gly Ile Arg Asp ArgIle Val Val Gly Ile Val Ile Leu Ile Phe Thr Gly Ile Arg Asp Arg
755 760 765755 760 765
Lys Lys Lys Asn Lys Ala Arg Ser Gly Glu Asn Pro Tyr Ala Ser IleLys Lys Lys Asn Lys Ala Arg Ser Gly Glu Asn Pro Tyr Ala Ser Ile
770 775 780770 775 780
Asp Ile Ser Lys Gly Glu Asn Asn Pro Gly Phe Gln Asn Thr Asp AspAsp Ile Ser Lys Gly Glu Asn Asn Pro Gly Phe Gln Asn Thr Asp Asp
785 790 795 800785 790 795 800
Val Gln Thr Ser PheVal Gln Thr Ser Phe
805805
<210> 14<210> 14
<211> 122<211> 122
<212> DNA<212> DNA
<213> Natural sequence<213> Natural sequence
<220><220>
<223> SV40 polyA序列<223> SV40 polyA sequence
<400> 14<400> 14
aacttgttta ttgcagctta taatggttac aaataaagca atagcatcac aaatttcaca 60aacttgttta ttgcagctta taatggttac aaataaagca atagcatcac aaatttcaca 60
aataaagcat ttttttcact gcattctagt tgtggtttgt ccaaactcat caatgtatct 120aataaagcat ttttttcact gcattctagt tgtggtttgt ccaaactcat caatgtatct 120
ta 122ta 122
<210> 15<210> 15
<211> 970<211> 970
<212> DNA<212> DNA
<213> Natural sequence<213> Natural sequence
<220><220>
<223> 5’同源臂序列<223> 5' homology arm sequence
<400> 15<400> 15
ccctatggag tggagaagag tcttataatt ttttaaatgg gcagagaaat gaatttattt 60ccctatggag tggagaagag tcttataatt ttttaaatgg gcagagaaat gaatttattt 60
ttaattttta gagacagggt ttctttgtat agctctagct gtctttgatt ggtagacaaa 120ttaattttta gagacagggt ttctttgtat agctctagct gtctttgatt ggtagacaaa 120
gctgtcctca aactcagaga tcttccttcc tttgtctcct gagtgctggg attaaaggca 180gctgtcctca aactcagaga tcttccttcc tttgtctcct gagtgctggg attaaaggca 180
tggaccacca ctgccctgcc ccattctctc cattaatttt aagtgaatgc ttgcaaaagc 240tggaccacca ctgccctgcc ccattctctc cattaatttt aagtgaatgc ttgcaaaagc 240
tcacttcttt ggtgaacagc ttcctttaca aataagtacc tttgccttcg tttttatagg 300tcacttcttt ggtgaacagc ttcctttaca aataagtacc tttgccttcg tttttatagg 300
attcttaaaa agaaaaaaaa gattcagcca ggtggttgtg gtgcacacct ttaatcccag 360attcttaaaa agaaaaaaaa gattcagcca ggtggttgtg gtgcacacct ttaatcccag 360
cagtcaggag gcagaggaaa gcagatctct tgagtttgag gctagcctag tctacagagg 420cagtcaggag gcagaggaaa gcagatctct tgagtttgag gctagcctag tctacagagg 420
gagttccagg acagccaagg ctacagagag gaactgtcta aaaacaccaa gaaagagaga 480gagttccagg acagccaagg ctacagagag gaactgtcta aaaacaccaa gaaagagaga 480
aaggagagag ggagaggatg gatagcttat tgatagaatt gtcagaaaag gctataagtt 540aaggagagag ggagaggatg gatagctttat tgatagaatt gtcagaaaag gctataagtt 540
ccaatatgtg tcccatgatt tctaagtcta gccctttctg ttatagtaaa atcatagtac 600ccaatatgtg tcccatgatt tctaagtcta gccctttctg ttatagtaaa atcatagtac 600
accctcctcc tccagtgtat ctttaacagc ttttaaggaa catattaact aaatgtccag 660accctcctcc tccagtgtat ctttaacagc ttttaaggaa catattaact aaatgtccag 660
gttttgattt ggccataaaa tgttagcaaa gctaaggttt tctaggatta atgaataaca 720gttttgattt ggccataaaa tgttagcaaa gctaaggttt tctaggatta atgaataaca 720
tgtctttatt tagtttactt aaaaaaatca ttctaaaata tctgtttaca tatctgtcct 780tgtctttatt tagtttactt aaaaaaatca ttctaaaata tctgtttaca tatctgtcct 780
ctccaggatt aacttcatat tggtccagca gcttgtttac tgttctcttc tgtttcttct 840ctccaggatt aacttcatat tggtccagca gcttgtttac tgttctcttc tgtttcttct 840
tctgcttttt ttttcttctc ttctcagtgc ccaacccaag ttcaaaggct gatgagagag 900tctgcttttt ttttcttctc ttctcagtgc ccaacccaag ttcaaaggct gatgagagag 900
aaaaactcat gaagagattt tactctaggg aaagttgctc agtggatggg atcttggcgc 960aaaaactcat gaagagattt tactctaggg aaagttgctc agtggatggg atcttggcgc 960
acggggaaag 970acggggaaag 970
<210> 16<210> 16
<211> 971<211> 971
<212> DNA<212> DNA
<213> Natural sequence<213> Natural sequence
<220><220>
<223> 3’同源臂序列<223> 3’ homology arm sequence
<400> 16<400> 16
gaattataat actaacatta ctgaagaaaa tgcccaaaag atggtaagtt cttgaggcta 60gaattataat actaacatta ctgaagaaaa tgcccaaaag atggtaagtt cttgaggcta 60
cccagggggt tattgattgc ttcttaaaga tcagaattac tgcctataaa actggataag 120cccagggggt tattgattgc ttcttaaaga tcagaattac tgcctataaa actggataag 120
gaaatcatag agatctctca agtgtgagga tgagtgactg cctctgtagc tctgatccta 180gaaatcatag agatctctca agtgtgagga tgagtgactg cctctgtagc tctgatccta 180
gtctcccaga tggctaaatt caattgacct tagagttcat ctggaaaatt gttatgaatg 240gtctcccaga tggctaaatt caattgacct tagagttcat ctggaaaatt gttatgaatg 240
aattatttgc ccagattcca aagatgagtg aaaatgttta ataaagttgc catcactatt 300aattatttgc ccagattcca aagatgagtg aaaatgttta ataaagttgc catcactatt 300
ctcattatat ttggtatgta aagcattcat ggaaatgttc taagtcgtta ttgagccaat 360ctcattatat ttggtatgta aagcattcat ggaaatgttc taagtcgtta ttgagccaat 360
aattttcttt agcttataat gccaacaggt ctatccgaga actacaaatg acatattaac 420aattttcttt agcttataat gccaacaggt ctatccgaga actacaaatg acatattaac 420
tgaaaaatgc aactggggtt tactgaaggc agcagcttag taattaaggt aaccatggct 480tgaaaaatgc aactggggtt tactgaaggc agcagcttag taattaaggt aaccatggct 480
taggtgaaac tggacctggg aattccttct ttcattgaca cagagctctg aggaatttcc 540taggtgaaac tggacctggg aattccttct ttcattgaca cagagctctg aggaatttcc 540
aaaggtcaca gaagaaaagc tataattaaa ctagtcccaa aaaatctcag cctactctgg 600aaaggtcaca gaagaaaagc tataattaaa ctagtcccaa aaaatctcag cctactctgg 600
gaaagcagca tattttgttt gacaagtgca aggacttaga actttttttt ttctcactga 660gaaagcagca tattttgttt gacaagtgca aggacttaga actttttttt ttctcactga 660
tcctgaagtg ccttttaagt atagttaagt ggtggaaaat tgagcaacta tttaagaaaa 720tcctgaagtg ccttttaagt atagttaagt ggtggaaaat tgagcaacta tttaagaaaa 720
gactcttttt tttcttcttc cagcaatgct ttccttcaaa acggtagctt caaaacttcc 780gactcttttt tttcttcttc cagcaatgct ttccttcaaa acggtagctt caaaacttcc 780
tgtcttttaa atgatcaggg ggctgtgtgt ttaaattatt gccattcata gaacagagtg 840tgtcttttaa atgatcaggg ggctgtgtgt ttaaattatt gccattcata gaacagagtg 840
ggtctgagga tgcctgtttc ctttgaaatt ctatgccccc tcccagtttt ctaaaattta 900ggtctgagga tgcctgtttc ctttgaaatt ctatgccccc tcccagtttt ctaaaattta 900
agaaaccaca gagactttga caatgtagtt gccaaatgag ttgcttttaa ctgctctaat 960agaaaccaca gagactttga caatgtagtt gccaaatgag ttgcttttaa ctgctctaat 960
agtttggtct t 971agtttggtct t 971
<210> 17<210> 17
<211> 1124<211> 1124
<212> DNA<212> DNA
<213> Natural sequence<213> Natural sequence
<220><220>
<223> PGK-puro序列<223> PGK-puro sequence
<400> 17<400> 17
gggtagggga ggcgcttttc ccaaggcagt ctggagcatg cgctttagca gccccgctgg 60gggtagggga ggcgcttttc ccaaggcagt ctggagcatg cgctttagca gccccgctgg 60
gcacttggcg ctacacaagt ggcctctggc ctcgcacaca ttccacatcc cccggtaggc 120gcacttggcg ctacacaagt ggcctctggc ctcgcacaca ttccacatcc cccggtaggc 120
gccaaccggc tccgttcttt ggtggcccct tcgcgccacc ttctactcct cccctagtca 180gccaaccggc tccgttcttt ggtggcccct tcgcgccacc ttctactcct cccctagtca 180
ggaagttccc ccccgccccg cagctcgcgt cgtgcaggac gtgacaaatg gaagtagcac 240ggaagttccc ccccgccccg cagctcgcgt cgtgcaggac gtgacaaatg gaagtagcac 240
gtctcactag tctcgtgcag atggacagca ccgctgagca atggaagcgg gtaggccttt 300gtctcactag tctcgtgcag atggacagca ccgctgagca atggaagcgg gtaggccttt 300
ggggcagcgg ccaatagcag ctttgctcct tcgctttctg ggctcagagg ctgggaaggg 360ggggcagcgg ccaatagcag ctttgctcct tcgctttctg ggctcagagg ctgggaaggg 360
gtgggtccgg gggcgggctc aggggcgggc tcaggggcgg ggcgggcgcc cgaaggtcct 420gtgggtccgg gggcgggctc aggggcgggc tcaggggcgg ggcgggcgcc cgaaggtcct 420
ccggaggccc ggcattctgc acgcttcaaa agcgcacgtc tgccgcgctg ttctcctctt 480ccggaggccc ggcattctgc acgcttcaaa agcgcacgtc tgccgcgctg ttctcctctt 480
cctcatctcc gggcctttcg acctgcagcc caagctagct taccatgacc gagtacaagc 540cctcatctcc gggcctttcg acctgcagcc caagctagct taccatgacc gagtacaagc 540
ccacggtgcg cctcgccacc cgcgacgacg tccccagggc cgtacgcacc ctcgccgccg 600ccacggtgcg cctcgccacc cgcgacgacg tccccagggc cgtacgcacc ctcgccgccg 600
cgttcgccga ctaccccgcc acgcgccaca ccgtcgatcc ggaccgccac atcgagcggg 660cgttcgccga ctaccccgcc acgcgccaca ccgtcgatcc ggaccgccac atcgagcggg 660
tcaccgagct gcaagaactc ttcctcacgc gcgtcgggct cgacatcggc aaggtgtggg 720tcaccgagct gcaagaactc ttcctcacgc gcgtcgggct cgacatcggc aaggtgtggg 720
tcgcggacga cggcgccgcg gtggcggtct ggaccacgcc ggagagcgtc gaagcggggg 780tcgcggacga cggcgccgcg gtggcggtct ggaccacgcc ggagagcgtc gaagcggggg 780
cggtgttcgc cgagatcggc ccgcgcatgg ccgagttgag cggttcccgg ctggccgcgc 840cggtgttcgc cgagatcggc ccgcgcatgg ccgagttgag cggttcccgg ctggccgcgc 840
agcaacagat ggaaggcctc ctggcgccgc accggcccaa ggagcccgcg tggttcctgg 900agcaacagat ggaaggcctc ctggcgccgc accggcccaa ggagcccgcg tggttcctgg 900
ccaccgtcgg cgtctcgccc gaccaccagg gcaagggtct gggcagcgcc gtcgtgctcc 960ccaccgtcgg cgtctcgccc gaccaccagg gcaagggtct gggcagcgcc gtcgtgctcc 960
ccggagtgga ggcggccgag cgcgccgggg tgcccgcctt cctggagacc tccgcgcccc 1020ccggagtgga ggcggccgag cgcgccgggg tgcccgcctt cctggagacc tccgcgcccc 1020
gcaacctccc cttctacgag cggctcggct tcaccgtcac cgccgacgtc gaggtgcccg 1080gcaacctccc cttctacgag cggctcggct tcaccgtcac cgccgacgtc gaggtgcccg 1080
aaggaccgcg cacctggtgc atgacccgca agcccggtgc ctga 1124aaggaccgcg cacctggtgc atgacccgca agcccggtgc ctga 1124
<210> 18<210> 18
<211> 34<211> 34
<212> DNA<212> DNA
<213> Natural sequence<213> Natural sequence
<220><220>
<223> Frt序列<223> Frt sequence
<400> 18<400> 18
gaagttccta ttctctagaa agtataggaa cttc 34gaagttccta ttctctagaa agtataggaa cttc 34
<210> 19<210> 19
<211> 50<211> 50
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> 5arm-pcrF<223> 5arm-pcrF
<400> 19<400> 19
tcgcacacat tccacatcca ccggtcccta tggagtggag aagagtctta 50tcgcacacat tccacatcca ccggtcccta tggagtggag aagagtctta 50
<210> 20<210> 20
<211> 47<211> 47
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> 5arm-pcrR<223> 5arm-pcrR
<400> 20<400> 20
gaaggagcca ggaagagctt gacatctttc cccgtgcgcc aagatcc 47gaaggagcca ggaagagctt gacatctttc cccgtgcgcc aagatcc 47
<210> 21<210> 21
<211> 47<211> 47
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> hACE2-F<223> hACE2-F
<400> 21<400> 21
ggatcttggc gcacggggaa agatgtcaag ctcttcctgg ctccttc 47ggatcttggc gcacggggaa agatgtcaag ctcttcctgg ctccttc 47
<210> 22<210> 22
<211> 49<211> 49
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> hACE2-R<223> hACE2-R
<400> 22<400> 22
cattataagc tgcaataaac aagttctaaa aggaggtctg aacatcatc 49cattataagc tgcaataaac aagttctaaa aggaggtctg aacatcatc 49
<210> 23<210> 23
<211> 49<211> 49
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> SV40-F<223> SV40-F
<400> 23<400> 23
gatgatgttc agacctcctt ttagaacttg tttattgcag cttataatg 49gatgatgttc agacctcctt ttagaacttg tttattgcag cttataatg 49
<210> 24<210> 24
<211> 48<211> 48
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> SV40-R<223> SV40-R
<400> 24<400> 24
agagaatagg aacttcgcac gcgttaagat acattgatga gtttggac 48agagaatagg aacttcgcac gcgttaagat acattgatga gtttggac 48
<210> 25<210> 25
<211> 50<211> 50
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> 3arm-pcrF<223> 3arm-pcrF
<400> 25<400> 25
tacgaagtta tgtcgacgcg gcgcgccgaa ttataatact aacattactg 50tacgaagtta tgtcgacgcg gcgcgccgaa ttataatact aacattactg 50
<210> 26<210> 26
<211> 53<211> 53
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> 3arm-pcrR<223> 3arm-pcrR
<400> 26<400> 26
tatgaccatg attacgccaa gcttaagacc aaactattag agcagttaaa agc 53tatgaccatg attacgccaa gcttaagacc aaactattag agcagttaaa agc 53
<210> 27<210> 27
<211> 8470<211> 8470
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> 打靶载体序列<223> Targeting vector sequence
<400> 27<400> 27
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180
accatatgta ccgggtaggg gaggcgcttt tcccaaggca gtctggagca tgcgctttag 240accatatgta ccgggtaggg gaggcgcttt tcccaaggca gtctggagca tgcgctttag 240
cagccccgct gggcacttgg cgctacacaa gtggcctctg gcctcgcaca cattccacat 300cagccccgct gggcacttgg cgctacacaa gtggcctctg gcctcgcaca cattccacat 300
ccaccggtcc ctatggagtg gagaagagtc ttataatttt ttaaatgggc agagaaatga 360ccaccggtcc ctatggagtg gagaagagtc ttataatttt ttaaatgggc agagaaatga 360
atttattttt aatttttaga gacagggttt ctttgtatag ctctagctgt ctttgattgg 420atttattttt aatttttaga gacagggttt ctttgtatag ctctagctgt ctttgattgg 420
tagacaaagc tgtcctcaaa ctcagagatc ttccttcctt tgtctcctga gtgctgggat 480tagacaaagc tgtcctcaaa ctcagagatc ttccttcctt tgtctcctga gtgctgggat 480
taaaggcatg gaccaccact gccctgcccc attctctcca ttaattttaa gtgaatgctt 540taaaggcatg gaccaccact gccctgcccc attctctcca ttaattttaa gtgaatgctt 540
gcaaaagctc acttctttgg tgaacagctt cctttacaaa taagtacctt tgccttcgtt 600gcaaaagctc acttctttgg tgaacagctt cctttacaaa taagtacctt tgccttcgtt 600
tttataggat tcttaaaaag aaaaaaaaga ttcagccagg tggttgtggt gcacaccttt 660tttataggat tcttaaaaag aaaaaaaaga ttcagccagg tggttgtggt gcacaccttt 660
aatcccagca gtcaggaggc agaggaaagc agatctcttg agtttgaggc tagcctagtc 720aatcccagca gtcaggaggc agaggaaagc agatctcttg agtttgaggc tagcctagtc 720
tacagaggga gttccaggac agccaaggct acagagagga actgtctaaa aacaccaaga 780tacagaggga gttccaggac agccaaggct acagagagga actgtctaaa aacaccaaga 780
aagagagaaa ggagagaggg agaggatgga tagcttattg atagaattgt cagaaaaggc 840aagagagaaa ggagagagggg agaggatgga tagcttattg atagaattgt cagaaaaggc 840
tataagttcc aatatgtgtc ccatgatttc taagtctagc cctttctgtt atagtaaaat 900tataagttcc aatatgtgtc ccatgatttc taagtctagc cctttctgtt atagtaaaat 900
catagtacac cctcctcctc cagtgtatct ttaacagctt ttaaggaaca tattaactaa 960catagtacac cctcctcctc cagtgtatct ttaacagctt ttaaggaaca tattaactaa 960
atgtccaggt tttgatttgg ccataaaatg ttagcaaagc taaggttttc taggattaat 1020atgtccaggt tttgatttgg ccataaaatg ttagcaaagc taaggttttc taggattaat 1020
gaataacatg tctttattta gtttacttaa aaaaatcatt ctaaaatatc tgtttacata 1080gaataacatg tctttattta gtttacttaa aaaaatcatt ctaaaatatc tgtttacata 1080
tctgtcctct ccaggattaa cttcatattg gtccagcagc ttgtttactg ttctcttctg 1140tctgtcctct ccaggattaa cttcatattg gtccagcagc ttgtttactg ttctcttctg 1140
tttcttcttc tgcttttttt ttcttctctt ctcagtgccc aacccaagtt caaaggctga 1200tttcttcttc tgcttttttt ttcttctctt ctcagtgccc aacccaagtt caaaggctga 1200
tgagagagaa aaactcatga agagatttta ctctagggaa agttgctcag tggatgggat 1260tgagagagaa aaactcatga agagatttta ctctagggaa agttgctcag tggatgggat 1260
cttggcgcac ggggaaagat gtcaagctct tcctggctcc ttctcagcct tgttgctgta 1320cttggcgcac ggggaaagat gtcaagctct tcctggctcc ttctcagcct tgttgctgta 1320
actgctgctc agtccaccat tgaggaacag gccaagacat ttttggacaa gtttaaccac 1380actgctgctc agtccaccat tgaggaacag gccaagacat ttttggacaa gtttaaccac 1380
gaagccgaag acctgttcta tcaaagttca cttgcttctt ggaattataa caccaatatt 1440gaagccgaag acctgttcta tcaaagttca cttgcttctt ggaattataa caccaatatt 1440
actgaagaga atgtccaaaa catgaataat gctggggaca aatggtctgc ctttttaaag 1500actgaagaga atgtccaaaa catgaataat gctggggaca aatggtctgc ctttttaaag 1500
gaacagtcca cacttgccca aatgtatcca ctacaagaaa ttcagaatct cacagtcaag 1560gaacagtcca cacttgccca aatgtatcca ctacaagaaa ttcagaatct cacagtcaag 1560
cttcagctgc aggctcttca gcaaaatggg tcttcagtgc tctcagaaga caagagcaaa 1620cttcagctgc aggctcttca gcaaaatggg tcttcagtgc tctcagaaga caagagcaaa 1620
cggttgaaca caattctaaa tacaatgagc accatctaca gtactggaaa agtttgtaac 1680cggttgaaca caattctaaa tacaatgagc accatctaca gtactggaaa agtttgtaac 1680
ccagataatc cacaagaatg cttattactt gaaccaggtt tgaatgaaat aatggcaaac 1740ccagataatc cacaagaatg cttattactt gaaccaggtt tgaatgaaat aatggcaaac 1740
agtttagact acaatgagag gctctgggct tgggaaagct ggagatctga ggtcggcaag 1800agtttagact acaatgagag gctctgggct tgggaaagct ggagatctga ggtcggcaag 1800
cagctgaggc cattatatga agagtatgtg gtcttgaaaa atgagatggc aagagcaaat 1860cagctgaggc cattatatga agagtatgtg gtcttgaaaa atgagatggc aagagcaaat 1860
cattatgagg actatgggga ttattggaga ggagactatg aagtaaatgg ggtagatggc 1920cattatgagg actatgggga ttatggaga ggagactatg aagtaaatgg ggtagatggc 1920
tatgactaca gccgcggcca gttgattgaa gatgtggaac atacctttga agagattaaa 1980tatgactaca gccgcggcca gttgattgaa gatgtggaac atacctttga agagattaaa 1980
ccattatatg aacatcttca tgcctatgtg agggcaaagt tgatgaatgc ctatccttcc 2040ccattatatg aacatcttca tgcctatgtg agggcaaagt tgatgaatgc ctatccttcc 2040
tatatcagtc caattggatg cctccctgct catttgcttg gtgatatgtg gggtagattt 2100tatatcagtc caattggatg cctccctgct catttgcttg gtgatatgtg gggtagattt 2100
tggacaaatc tgtactcttt gacagttccc tttggacaga aaccaaacat agatgttact 2160tggacaaatc tgtactcttt gacagttccc tttggacaga aaccaaacat agatgttatact 2160
gatgcaatgg tggaccaggc ctgggatgca cagagaatat tcaaggaggc cgagaagttc 2220gatgcaatgg tggaccaggc ctgggatgca cagagaatat tcaaggaggc cgagaagttc 2220
tttgtatctg ttggtcttcc taatatgact caaggattct gggaaaattc catgctaacg 2280tttgtatctg ttggtcttcc taatatgact caaggattct gggaaaattc catgctaacg 2280
gacccaggaa atgttcagaa agcagtctgc catcccacag cttgggacct ggggaagggc 2340gacccaggaa atgttcagaa agcagtctgc catcccacag cttggggacct ggggaagggc 2340
gacttcagga tccttatgtg cacaaaggtg acaatggacg acttcctgac agctcatcat 2400gacttcagga tccttatgtg cacaaaggtg acaatggacg acttcctgac agctcatcat 2400
gagatggggc atatccagta tgatatggca tatgctgcac aaccttttct gctaagaaat 2460gagatggggc atatccagta tgatatggca tatgctgcac aaccttttct gctaagaaat 2460
ggagctaatg aaggattcca tgaagctgtt ggggaaatca tgtcactttc tgcagccaca 2520ggagctaatg aaggattcca tgaagctgtt ggggaaatca tgtcactttc tgcagccaca 2520
cctaagcatt taaaatccat tggtcttctg tcacccgatt ttcaagaaga caatgaaaca 2580cctaagcatt taaaatccat tggtcttctg tcacccgatt ttcaagaaga caatgaaaca 2580
gaaataaact tcctgctcaa acaagcactc acgattgttg ggactctgcc atttacttac 2640gaaataaact tcctgctcaa acaagcactc acgattgttg ggactctgcc atttacttac 2640
atgttagaga agtggaggtg gatggtcttt aaaggggaaa ttcccaaaga ccagtggatg 2700atgttagaga agtggaggtg gatggtcttt aaaggggaaa ttcccaaaga ccagtggatg 2700
aaaaagtggt gggagatgaa gcgagagata gttggggtgg tggaacctgt gccccatgat 2760aaaaagtggt gggagatgaa gcgagagata gttggggtgg tggaacctgt gccccatgat 2760
gaaacatact gtgaccccgc atctctgttc catgtttcta atgattactc attcattcga 2820gaaacatact gtgaccccgc atctctgttc catgtttcta atgattactc attcattcga 2820
tattacacaa ggacccttta ccaattccag tttcaagaag cactttgtca agcagctaaa 2880tattacacaa ggacccttta ccaattccag tttcaagaag cactttgtca agcagctaaa 2880
catgaaggcc ctctgcacaa atgtgacatc tcaaactcta cagaagctgg acagaaactg 2940catgaaggcc ctctgcacaa atgtgacatc tcaaactcta cagaagctgg acagaaactg 2940
ttcaatatgc tgaggcttgg aaaatcagaa ccctggaccc tagcattgga aaatgttgta 3000ttcaatatgc tgaggcttgg aaaatcagaa ccctggaccc tagcattgga aaatgttgta 3000
ggagcaaaga acatgaatgt aaggccactg ctcaactact ttgagccctt atttacctgg 3060ggagcaaaga acatgaatgt aaggccactg ctcaactact ttgagccctt atttacctgg 3060
ctgaaagacc agaacaagaa ttcttttgtg ggatggagta ccgactggag tccatatgca 3120ctgaaagacc agaacaagaa ttcttttgtg ggatggagta ccgactggag tccatatgca 3120
gaccaaagca tcaaagtgag gataagccta aaatcagctc ttggagataa agcatatgaa 3180gaccaaagca tcaaagtgag gataagccta aaatcagctc ttggagataa agcatatgaa 3180
tggaacgaca atgaaatgta cctgttccga tcatctgttg catatgctat gaggcagtac 3240tggaacgaca atgaaatgta cctgttccga tcatctgttg catatgctat gaggcagtac 3240
tttttaaaag taaaaaatca gatgattctt tttggggagg aggatgtgcg agtggctaat 3300tttttaaaag taaaaaatca gatgattctt tttggggagg aggatgtgcg agtggctaat 3300
ttgaaaccaa gaatctcctt taatttcttt gtcactgcac ctaaaaatgt gtctgatatc 3360ttgaaaccaa gaatctcctt taatttcttt gtcactgcac ctaaaaatgt gtctgatatc 3360
attcctagaa ctgaagttga aaaggccatc aggatgtccc ggagccgtat caatgatgct 3420attcctagaa ctgaagttga aaaggccatc aggatgtccc ggagccgtat caatgatgct 3420
ttccgtctga atgacaacag cctagagttt ctggggatac agccaacact tggacctcct 3480ttccgtctga atgacaacag cctagagttt ctggggatac agccaacact tggacctcct 3480
aaccagcccc ctgtttccat atggctgatt gtttttggag ttgtgatggg agtgatagtg 3540aaccagcccc ctgtttccat atggctgatt gtttttggag ttgtgatggg agtgatagtg 3540
gttggcattg tcatcctgat cttcactggg atcagagatc ggaagaagaa aaataaagca 3600gttggcattg tcatcctgat cttcactggg atcagagatc ggaagaagaa aaataaagca 3600
agaagtggag aaaatcctta tgcctccatc gatattagca aaggagaaaa taatccagga 3660agaagtggag aaaatcctta tgcctccatc gatattagca aaggagaaaa taatccagga 3660
ttccaaaaca ctgatgatgt tcagacctcc ttttagaact tgtttattgc agcttataat 3720ttccaaaaca ctgatgatgt tcagacctcc ttttagaact tgtttatattgc agcttataat 3720
ggttacaaat aaagcaatag catcacaaat ttcacaaata aagcattttt ttcactgcat 3780ggttacaaat aaagcaatag catcacaaat ttcacaaata aagcattttt ttcactgcat 3780
tctagttgtg gtttgtccaa actcatcaat gtatcttaac gcgtgcgaag ttcctattct 3840tctagttgtg gtttgtccaa actcatcaat gtatcttaac gcgtgcgaag ttcctattct 3840
ctagaaagta taggaacttc atcgataccg ggtaggggag gcgcttttcc caaggcagtc 3900ctagaaagta taggaacttc atcgataccg ggtagggggag gcgcttttcc caaggcagtc 3900
tggagcatgc gctttagcag ccccgctggg cacttggcgc tacacaagtg gcctctggcc 3960tggagcatgc gctttagcag ccccgctggg cacttggcgc tacacaagtg gcctctggcc 3960
tcgcacacat tccacatccc ccggtaggcg ccaaccggct ccgttctttg gtggcccctt 4020tcgcacacat tccacatccc ccggtaggcg ccaaccggct ccgttctttg gtggcccctt 4020
cgcgccacct tctactcctc ccctagtcag gaagttcccc cccgccccgc agctcgcgtc 4080cgcgccacct tctactcctc ccctagtcag gaagttcccc cccgccccgc agctcgcgtc 4080
gtgcaggacg tgacaaatgg aagtagcacg tctcactagt ctcgtgcaga tggacagcac 4140gtgcaggacg tgacaaatgg aagtagcacg tctcactagt ctcgtgcaga tggacagcac 4140
cgctgagcaa tggaagcggg taggcctttg gggcagcggc caatagcagc tttgctcctt 4200cgctgagcaa tggaagcggg taggcctttg gggcagcggc caatagcagc tttgctcctt 4200
cgctttctgg gctcagaggc tgggaagggg tgggtccggg ggcgggctca ggggcgggct 4260cgctttctgg gctcagaggc tgggaagggg tgggtccggg ggcgggctca ggggcgggct 4260
caggggcggg gcgggcgccc gaaggtcctc cggaggcccg gcattctgca cgcttcaaaa 4320caggggcggg gcgggcgccc gaaggtcctc cggaggcccg gcattctgca cgcttcaaaa 4320
gcgcacgtct gccgcgctgt tctcctcttc ctcatctccg ggcctttcga cctgcagccc 4380gcgcacgtct gccgcgctgt tctcctcttc ctcatctccg ggcctttcga cctgcagccc 4380
aagctagctt accatgaccg agtacaagcc cacggtgcgc ctcgccaccc gcgacgacgt 4440aagctagctt accatgaccg agtacaagcc cacggtgcgc ctcgccaccc gcgacgacgt 4440
ccccagggcc gtacgcaccc tcgccgccgc gttcgccgac taccccgcca cgcgccacac 4500ccccagggcc gtacgcaccc tcgccgccgc gttcgccgac taccccgcca cgcgccacac 4500
cgtcgatccg gaccgccaca tcgagcgggt caccgagctg caagaactct tcctcacgcg 4560cgtcgatccg gaccgccaca tcgagcgggt caccgagctg caagaactct tcctcacgcg 4560
cgtcgggctc gacatcggca aggtgtgggt cgcggacgac ggcgccgcgg tggcggtctg 4620cgtcgggctc gacatcggca aggtgtgggt cgcggacgac ggcgccgcgg tggcggtctg 4620
gaccacgccg gagagcgtcg aagcgggggc ggtgttcgcc gagatcggcc cgcgcatggc 4680gaccacgccg gagagcgtcg aagcgggggc ggtgttcgcc gagatcggcc cgcgcatggc 4680
cgagttgagc ggttcccggc tggccgcgca gcaacagatg gaaggcctcc tggcgccgca 4740cgagttgagc ggttcccggc tggccgcgca gcaacagatg gaaggcctcc tggcgccgca 4740
ccggcccaag gagcccgcgt ggttcctggc caccgtcggc gtctcgcccg accaccaggg 4800ccggcccaag gagcccgcgt ggttcctggc caccgtcggc gtctcgcccg accaccaggg 4800
caagggtctg ggcagcgccg tcgtgctccc cggagtggag gcggccgagc gcgccggggt 4860caagggtctg ggcagcgccg tcgtgctccc cggagtggag gcggccgagc gcgccggggt 4860
gcccgccttc ctggagacct ccgcgccccg caacctcccc ttctacgagc ggctcggctt 4920gcccgccttc ctggagacct ccgcgccccg caacctcccc ttctacgagc ggctcggctt 4920
caccgtcacc gccgacgtcg aggtgcccga aggaccgcgc acctggtgca tgacccgcaa 4980caccgtcacc gccgacgtcg aggtgcccga aggaccgcgc acctggtgca tgacccgcaa 4980
gcccggtgcc tgaggtacct ctcatgctgg agttcttcgc ccaccccaac ttgtttattg 5040gcccggtgcc tgaggtacct ctcatgctgg agttcttcgc ccaccccaac ttgtttattg 5040
cagcttataa tggttacaaa taaagcaata gcatcacaaa tttcacaaat aaagcatttt 5100cagcttataa tggttacaaa taaagcaata gcatcacaaa tttcacaaat aaagcatttt 5100
tttcactgca ttctagttgt ggtttgtcca aactcatcaa tgtatcttat catcgatgaa 5160tttcactgca ttctagttgt ggtttgtcca aactcatcaa tgtatcttat catcgatgaa 5160
gttcctattc tctagaaagt ataggaactt ctaacctccc gggtgacaga taacttcgta 5220gttcctattc tctagaaagt ataggaactt ctaacctccc gggtgacaga taacttcgta 5220
taatgtatgc tatacgaagt tatgtcgacg cggcgcgccg aattataata ctaacattac 5280taatgtatgc tatacgaagt tatgtcgacg cggcgcgccg aattataata ctaacattac 5280
tgaagaaaat gcccaaaaga tggtaagttc ttgaggctac ccagggggtt attgattgct 5340tgaagaaaat gcccaaaaga tggtaagttc ttgaggctac ccagggggtt attgattgct 5340
tcttaaagat cagaattact gcctataaaa ctggataagg aaatcataga gatctctcaa 5400tcttaaagat cagaattact gcctataaaa ctggataagg aaatcataga gatctctcaa 5400
gtgtgaggat gagtgactgc ctctgtagct ctgatcctag tctcccagat ggctaaattc 5460gtgtgaggat gagtgactgc ctctgtagct ctgatcctag tctcccagat ggctaaattc 5460
aattgacctt agagttcatc tggaaaattg ttatgaatga attatttgcc cagattccaa 5520aattgacctt agagttcatc tggaaaattg ttatgaatga attatttgcc cagattccaa 5520
agatgagtga aaatgtttaa taaagttgcc atcactattc tcattatatt tggtatgtaa 5580agatgagtga aaatgtttaa taaagttgcc atcactattc tcattatatt tggtatgtaa 5580
agcattcatg gaaatgttct aagtcgttat tgagccaata attttcttta gcttataatg 5640agcattcatg gaaatgttct aagtcgttat tgagccaata attttcttta gcttataatg 5640
ccaacaggtc tatccgagaa ctacaaatga catattaact gaaaaatgca actggggttt 5700ccaacaggtc tatccgagaa ctacaaatga catattaact gaaaaatgca actggggttt 5700
actgaaggca gcagcttagt aattaaggta accatggctt aggtgaaact ggacctggga 5760actgaaggca gcagcttagt aattaaggta accatggctt aggtgaaact ggacctggga 5760
attccttctt tcattgacac agagctctga ggaatttcca aaggtcacag aagaaaagct 5820attccttctt tcattgacac agagctctga ggaatttcca aaggtcacag aagaaaagct 5820
ataattaaac tagtcccaaa aaatctcagc ctactctggg aaagcagcat attttgtttg 5880ataattaaac tagtcccaaa aaatctcagc ctactctggg aaagcagcat attttgtttg 5880
acaagtgcaa ggacttagaa cttttttttt tctcactgat cctgaagtgc cttttaagta 5940acaagtgcaa ggacttagaa cttttttttt tctcactgat cctgaagtgc cttttaagta 5940
tagttaagtg gtggaaaatt gagcaactat ttaagaaaag actctttttt ttcttcttcc 6000tagttaagtg gtggaaaatt gagcaactat ttaagaaaag actctttttt ttcttcttcc 6000
agcaatgctt tccttcaaaa cggtagcttc aaaacttcct gtcttttaaa tgatcagggg 6060agcaatgctt tccttcaaaa cggtagcttc aaaacttcct gtcttttaaa tgatcagggg 6060
gctgtgtgtt taaattattg ccattcatag aacagagtgg gtctgaggat gcctgtttcc 6120gctgtgtgtt taaattattg ccattcatag aacagagtgg gtctgaggat gcctgtttcc 6120
tttgaaattc tatgccccct cccagttttc taaaatttaa gaaaccacag agactttgac 6180tttgaaattc tatgccccct cccagttttc taaaatttaa gaaaccacag agactttgac 6180
aatgtagttg ccaaatgagt tgcttttaac tgctctaata gtttggtctt aagcttggcg 6240aatgtagttg ccaaatgagt tgcttttaac tgctctaata gtttggtctt aagcttggcg 6240
taatcatggt catagctgtt tcctgtgtga aattgttatc cgctcacaat tccacacaac 6300taatcatggt catagctgtt tcctgtgtga aattgttatc cgctcacaat tccacacaac 6300
atacgagccg gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca 6360atacgagccg gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca 6360
ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat 6420ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat 6420
taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc 6480taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc 6480
tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca 6540tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca 6540
aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca 6600aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca 6600
aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg 6660aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg 6660
ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg 6720ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg 6720
acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt 6780acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt 6780
ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt 6840ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt 6840
tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc 6900tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc 6900
tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt 6960tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt 6960
gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt 7020gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt 7020
agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc 7080agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc 7080
tacactagaa gaacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa 7140tacactagaa gaacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa 7140
agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt 7200agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt 7200
tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct 7260tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct 7260
acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta 7320acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta 7320
tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa 7380tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa 7380
agtatatatg agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc 7440agtatatatg agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc 7440
tcagcgatct gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact 7500tcagcgatct gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact 7500
acgatacggg agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc 7560acgatacggg agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc 7560
tcaccggctc cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt 7620tcaccggctc cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt 7620
ggtcctgcaa ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta 7680ggtcctgcaa ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta 7680
agtagttcgc cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg 7740agtagttcgc cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg 7740
tcacgctcgt cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt 7800tcacgctcgt cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt 7800
acatgatccc ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc 7860acatgatccc ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc 7860
agaagtaagt tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt 7920agaagtaagt tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt 7920
actgtcatgc catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc 7980actgtcatgc catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc 7980
tgagaatagt gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc 8040tgagaatagt gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc 8040
gcgccacata gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa 8100gcgccacata gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa 8100
ctctcaagga tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac 8160ctctcaagga tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac 8160
tgatcttcag catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa 8220tgatcttcag catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa 8220
aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt 8280aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt 8280
tttcaatatt attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa 8340tttcaatatt attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa 8340
tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct 8400tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct 8400
gacgtctaag aaaccattat tatcatgaca ttaacctata aaaataggcg tatcacgagg 8460gacgtctaag aaaccattat tatcatgaca ttaacctata aaaataggcg tatcacgagg 8460
ccctttcgtc 8470ccctttcgtc 8470
<210> 28<210> 28
<211> 19<211> 19
<212> DNA<212> DNA
<213> Natural sequence<213> Natural sequence
<220><220>
<223> Puro-F<223> Puro-F
<400> 28<400> 28
aacctcccct tctacgagc 19aacctccccttctacgagc 19
<210> 29<210> 29
<211> 24<211> 24
<212> DNA<212> DNA
<213> Natural sequence<213> Natural sequence
<220><220>
<223> 3arm-outR<223> 3arm-outR
<400> 29<400> 29
tacagccagg atctggatgt cagc 24tacagccagg atctggatgt cagc 24
<210> 30<210> 30
<211> 5922<211> 5922
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> 鼠胚胎干细胞Ace2位点基因组替换序列<223> Genomic replacement sequence of Ace2 locus in mouse embryonic stem cells
<400> 30<400> 30
ccctatggag tggagaagag tcttataatt ttttaaatgg gcagagaaat gaatttattt 60ccctatggag tggagaagag tcttataatt ttttaaatgg gcagagaaat gaatttattt 60
ttaattttta gagacagggt ttctttgtat agctctagct gtctttgatt ggtagacaaa 120ttaattttta gagacagggt ttctttgtat agctctagct gtctttgatt ggtagacaaa 120
gctgtcctca aactcagaga tcttccttcc tttgtctcct gagtgctggg attaaaggca 180gctgtcctca aactcagaga tcttccttcc tttgtctcct gagtgctggg attaaaggca 180
tggaccacca ctgccctgcc ccattctctc cattaatttt aagtgaatgc ttgcaaaagc 240tggaccacca ctgccctgcc ccattctctc cattaatttt aagtgaatgc ttgcaaaagc 240
tcacttcttt ggtgaacagc ttcctttaca aataagtacc tttgccttcg tttttatagg 300tcacttcttt ggtgaacagc ttcctttaca aataagtacc tttgccttcg tttttatagg 300
attcttaaaa agaaaaaaaa gattcagcca ggtggttgtg gtgcacacct ttaatcccag 360attcttaaaa agaaaaaaaa gattcagcca ggtggttgtg gtgcacacct ttaatcccag 360
cagtcaggag gcagaggaaa gcagatctct tgagtttgag gctagcctag tctacagagg 420cagtcaggag gcagaggaaa gcagatctct tgagtttgag gctagcctag tctacagagg 420
gagttccagg acagccaagg ctacagagag gaactgtcta aaaacaccaa gaaagagaga 480gagttccagg acagccaagg ctacagagag gaactgtcta aaaacaccaa gaaagagaga 480
aaggagagag ggagaggatg gatagcttat tgatagaatt gtcagaaaag gctataagtt 540aaggagagag ggagaggatg gatagctttat tgatagaatt gtcagaaaag gctataagtt 540
ccaatatgtg tcccatgatt tctaagtcta gccctttctg ttatagtaaa atcatagtac 600ccaatatgtg tcccatgatt tctaagtcta gccctttctg ttatagtaaa atcatagtac 600
accctcctcc tccagtgtat ctttaacagc ttttaaggaa catattaact aaatgtccag 660accctcctcc tccagtgtat ctttaacagc ttttaaggaa catattaact aaatgtccag 660
gttttgattt ggccataaaa tgttagcaaa gctaaggttt tctaggatta atgaataaca 720gttttgattt ggccataaaa tgttagcaaa gctaaggttt tctaggatta atgaataaca 720
tgtctttatt tagtttactt aaaaaaatca ttctaaaata tctgtttaca tatctgtcct 780tgtctttatt tagtttactt aaaaaaatca ttctaaaata tctgtttaca tatctgtcct 780
ctccaggatt aacttcatat tggtccagca gcttgtttac tgttctcttc tgtttcttct 840ctccaggatt aacttcatat tggtccagca gcttgtttac tgttctcttc tgtttcttct 840
tctgcttttt ttttcttctc ttctcagtgc ccaacccaag ttcaaaggct gatgagagag 900tctgcttttt ttttcttctc ttctcagtgc ccaacccaag ttcaaaggct gatgagagag 900
aaaaactcat gaagagattt tactctaggg aaagttgctc agtggatggg atcttggcgc 960aaaaactcat gaagagattt tactctaggg aaagttgctc agtggatggg atcttggcgc 960
acggggaaag atgtcaagct cttcctggct ccttctcagc cttgttgctg taactgctgc 1020acggggaaag atgtcaagct cttcctggct ccttctcagc cttgttgctg taactgctgc 1020
tcagtccacc attgaggaac aggccaagac atttttggac aagtttaacc acgaagccga 1080tcagtccacc attgaggaac aggccaagac atttttggac aagtttaacc acgaagccga 1080
agacctgttc tatcaaagtt cacttgcttc ttggaattat aacaccaata ttactgaaga 1140agacctgttc tatcaaagtt cacttgcttc ttggaattat aacaccaata ttactgaaga 1140
gaatgtccaa aacatgaata atgctgggga caaatggtct gcctttttaa aggaacagtc 1200gaatgtccaa aacatgaata atgctgggga caaatggtct gcctttttaa aggaacagtc 1200
cacacttgcc caaatgtatc cactacaaga aattcagaat ctcacagtca agcttcagct 1260cacacttgcc caaatgtatc cactacaaga aattcagaat ctcacagtca agcttcagct 1260
gcaggctctt cagcaaaatg ggtcttcagt gctctcagaa gacaagagca aacggttgaa 1320gcaggctctt cagcaaaatg ggtcttcagt gctctcagaa gacaagagca aacggttgaa 1320
cacaattcta aatacaatga gcaccatcta cagtactgga aaagtttgta acccagataa 1380cacaattcta aatacaatga gcaccatcta cagtactgga aaagtttgta acccagataa 1380
tccacaagaa tgcttattac ttgaaccagg tttgaatgaa ataatggcaa acagtttaga 1440tccacaagaa tgcttattac ttgaaccagg tttgaatgaa ataatggcaa acagtttaga 1440
ctacaatgag aggctctggg cttgggaaag ctggagatct gaggtcggca agcagctgag 1500ctacaatgag aggctctggg cttgggaaag ctggagatct gaggtcggca agcagctgag 1500
gccattatat gaagagtatg tggtcttgaa aaatgagatg gcaagagcaa atcattatga 1560gccattat gaagagtatg tggtcttgaa aaatgagatg gcaagagcaa atcattatga 1560
ggactatggg gattattgga gaggagacta tgaagtaaat ggggtagatg gctatgacta 1620ggactatggg gattattgga gaggagacta tgaagtaaat ggggtagatg gctatgacta 1620
cagccgcggc cagttgattg aagatgtgga acataccttt gaagagatta aaccattata 1680cagccgcggc cagttgattg aagatgtgga acataccttt gaagagatta aaccattata 1680
tgaacatctt catgcctatg tgagggcaaa gttgatgaat gcctatcctt cctatatcag 1740tgaacatctt catgcctatg tgagggcaaa gttgatgaat gcctatcctt cctatatcag 1740
tccaattgga tgcctccctg ctcatttgct tggtgatatg tggggtagat tttggacaaa 1800tccaattgga tgcctccctg ctcatttgct tggtgatatg tggggtagat tttggacaaa 1800
tctgtactct ttgacagttc cctttggaca gaaaccaaac atagatgtta ctgatgcaat 1860tctgtactct ttgacagttc cctttggaca gaaaccaaac atagatgtta ctgatgcaat 1860
ggtggaccag gcctgggatg cacagagaat attcaaggag gccgagaagt tctttgtatc 1920ggtggaccag gcctggggatg cacagagaat attcaaggag gccgagaagt tctttgtatc 1920
tgttggtctt cctaatatga ctcaaggatt ctgggaaaat tccatgctaa cggacccagg 1980tgttggtctt cctaatatga ctcaaggatt ctgggaaaat tccatgctaa cggacccagg 1980
aaatgttcag aaagcagtct gccatcccac agcttgggac ctggggaagg gcgacttcag 2040aaatgttcag aaagcagtct gccatcccac agcttgggac ctggggaagg gcgacttcag 2040
gatccttatg tgcacaaagg tgacaatgga cgacttcctg acagctcatc atgagatggg 2100gatccttatg tgcacaaagg tgacaatgga cgacttcctg acagctcatc atgagatggg 2100
gcatatccag tatgatatgg catatgctgc acaacctttt ctgctaagaa atggagctaa 2160gcatatccag tatgatatgg catatgctgc acaacctttt ctgctaagaa atggagctaa 2160
tgaaggattc catgaagctg ttggggaaat catgtcactt tctgcagcca cacctaagca 2220tgaaggattc catgaagctg ttggggaaat catgtcactt tctgcagcca cacctaagca 2220
tttaaaatcc attggtcttc tgtcacccga ttttcaagaa gacaatgaaa cagaaataaa 2280tttaaaatcc attggtcttc tgtcacccga ttttcaagaa gacaatgaaa cagaaataaa 2280
cttcctgctc aaacaagcac tcacgattgt tgggactctg ccatttactt acatgttaga 2340cttcctgctc aaacaagcac tcacgattgt tgggactctg ccatttactt acatgttaga 2340
gaagtggagg tggatggtct ttaaagggga aattcccaaa gaccagtgga tgaaaaagtg 2400gaagtggagg tggatggtct ttaaagggga aattcccaaa gaccagtgga tgaaaaagtg 2400
gtgggagatg aagcgagaga tagttggggt ggtggaacct gtgccccatg atgaaacata 2460gtgggagatg aagcgagaga tagttggggt ggtggaacct gtgccccatg atgaaacata 2460
ctgtgacccc gcatctctgt tccatgtttc taatgattac tcattcattc gatattacac 2520ctgtgacccc gcatctctgt tccatgtttc taatgattac tcattcattc gatattacac 2520
aaggaccctt taccaattcc agtttcaaga agcactttgt caagcagcta aacatgaagg 2580aaggaccctt taccaattcc agtttcaaga agcactttgt caagcagcta aacatgaagg 2580
ccctctgcac aaatgtgaca tctcaaactc tacagaagct ggacagaaac tgttcaatat 2640ccctctgcac aaatgtgaca tctcaaactc tacagaagct ggacagaaac tgttcaatat 2640
gctgaggctt ggaaaatcag aaccctggac cctagcattg gaaaatgttg taggagcaaa 2700gctgaggctt ggaaaatcag aaccctggac cctagcattg gaaaatgttg taggagcaaa 2700
gaacatgaat gtaaggccac tgctcaacta ctttgagccc ttatttacct ggctgaaaga 2760gaacatgaat gtaaggccac tgctcaacta ctttgagccc ttatttacct ggctgaaaga 2760
ccagaacaag aattcttttg tgggatggag taccgactgg agtccatatg cagaccaaag 2820ccagaacaag aattcttttg tgggatggag taccgactgg agtccatatg cagaccaaag 2820
catcaaagtg aggataagcc taaaatcagc tcttggagat aaagcatatg aatggaacga 2880catcaaagtg aggataagcc taaaatcagc tcttggagat aaagcatatg aatggaacga 2880
caatgaaatg tacctgttcc gatcatctgt tgcatatgct atgaggcagt actttttaaa 2940caatgaaatg tacctgttcc gatcatctgt tgcatatgct atgaggcagt actttttaaa 2940
agtaaaaaat cagatgattc tttttgggga ggaggatgtg cgagtggcta atttgaaacc 3000agtaaaaaat cagatgattc tttttgggga ggaggatgtg cgagtggcta atttgaaacc 3000
aagaatctcc tttaatttct ttgtcactgc acctaaaaat gtgtctgata tcattcctag 3060aagaatctcc tttaatttct ttgtcactgc acctaaaaat gtgtctgata tcattcctag 3060
aactgaagtt gaaaaggcca tcaggatgtc ccggagccgt atcaatgatg ctttccgtct 3120aactgaagtt gaaaaggcca tcaggatgtc ccggagccgt atcaatgatg ctttccgtct 3120
gaatgacaac agcctagagt ttctggggat acagccaaca cttggacctc ctaaccagcc 3180gaatgacaac agcctagagt ttctggggat acagccaaca cttggacctc ctaaccagcc 3180
ccctgtttcc atatggctga ttgtttttgg agttgtgatg ggagtgatag tggttggcat 3240ccctgtttcc atatggctga ttgtttttgg agttgtgatg ggagtgatag tggttggcat 3240
tgtcatcctg atcttcactg ggatcagaga tcggaagaag aaaaataaag caagaagtgg 3300tgtcatcctg atcttcactg ggatcagaga tcggaagaag aaaaataaag caagaagtgg 3300
agaaaatcct tatgcctcca tcgatattag caaaggagaa aataatccag gattccaaaa 3360agaaaatcct tatgcctcca tcgatattag caaaggagaa aataatccag gattccaaaa 3360
cactgatgat gttcagacct ccttttagaa cttgtttatt gcagcttata atggttacaa 3420cactgatgat gttcagacct ccttttagaa cttgtttatt gcagcttata atggttacaa 3420
ataaagcaat agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg 3480ataaagcaat agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg 3480
tggtttgtcc aaactcatca atgtatctta acgcgtgcga agttcctatt ctctagaaag 3540tggtttgtcc aaactcatca atgtatctta acgcgtgcga agttcctatt ctctagaaag 3540
tataggaact tcatcgatac cgggtagggg aggcgctttt cccaaggcag tctggagcat 3600tataggaact tcatcgatac cgggtagggg aggcgctttt cccaaggcag tctggagcat 3600
gcgctttagc agccccgctg ggcacttggc gctacacaag tggcctctgg cctcgcacac 3660gcgctttagc agccccgctg ggcacttggc gctacacaag tggcctctgg cctcgcacac 3660
attccacatc ccccggtagg cgccaaccgg ctccgttctt tggtggcccc ttcgcgccac 3720attccacatc ccccggtagg cgccaaccgg ctccgttctt tggtggcccc ttcgcgccac 3720
cttctactcc tcccctagtc aggaagttcc cccccgcccc gcagctcgcg tcgtgcagga 3780cttctactcc tcccctagtc aggaagttcc cccccgcccc gcagctcgcg tcgtgcagga 3780
cgtgacaaat ggaagtagca cgtctcacta gtctcgtgca gatggacagc accgctgagc 3840cgtgacaaat ggaagtagca cgtctcacta gtctcgtgca gatggacagc accgctgagc 3840
aatggaagcg ggtaggcctt tggggcagcg gccaatagca gctttgctcc ttcgctttct 3900aatggaagcg ggtaggcctt tggggcagcg gccaatagca gctttgctcc ttcgctttct 3900
gggctcagag gctgggaagg ggtgggtccg ggggcgggct caggggcggg ctcaggggcg 3960gggctcagag gctgggaagg ggtgggtccg ggggcgggct caggggcggg ctcaggggcg 3960
gggcgggcgc ccgaaggtcc tccggaggcc cggcattctg cacgcttcaa aagcgcacgt 4020gggcgggcgc ccgaaggtcc tccggaggcc cggcattctg cacgcttcaa aagcgcacgt 4020
ctgccgcgct gttctcctct tcctcatctc cgggcctttc gacctgcagc ccaagctagc 4080ctgccgcgct gttctcctct tcctcatctc cgggcctttc gacctgcagc ccaagctagc 4080
ttaccatgac cgagtacaag cccacggtgc gcctcgccac ccgcgacgac gtccccaggg 4140ttaccatgac cgagtacaag cccacggtgc gcctcgccac ccgcgacgac gtccccaggg 4140
ccgtacgcac cctcgccgcc gcgttcgccg actaccccgc cacgcgccac accgtcgatc 4200ccgtacgcac cctcgccgcc gcgttcgccg actaccccgc cacgcgccac accgtcgatc 4200
cggaccgcca catcgagcgg gtcaccgagc tgcaagaact cttcctcacg cgcgtcgggc 4260cggaccgcca catcgagcgg gtcaccgagc tgcaagaact cttcctcacg cgcgtcgggc 4260
tcgacatcgg caaggtgtgg gtcgcggacg acggcgccgc ggtggcggtc tggaccacgc 4320tcgacatcgg caaggtgtgg gtcgcggacg acggcgccgc ggtggcggtc tggaccacgc 4320
cggagagcgt cgaagcgggg gcggtgttcg ccgagatcgg cccgcgcatg gccgagttga 4380cggagagcgt cgaagcgggg gcggtgttcg ccgagatcgg cccgcgcatg gccgagttga 4380
gcggttcccg gctggccgcg cagcaacaga tggaaggcct cctggcgccg caccggccca 4440gcggttcccg gctggccgcg cagcaacaga tggaaggcct cctggcgccg caccggccca 4440
aggagcccgc gtggttcctg gccaccgtcg gcgtctcgcc cgaccaccag ggcaagggtc 4500aggagcccgc gtggttcctg gccaccgtcg gcgtctcgcc cgaccaccag ggcaagggtc 4500
tgggcagcgc cgtcgtgctc cccggagtgg aggcggccga gcgcgccggg gtgcccgcct 4560tgggcagcgc cgtcgtgctc cccggagtgg aggcggccga gcgcgccggg gtgcccgcct 4560
tcctggagac ctccgcgccc cgcaacctcc ccttctacga gcggctcggc ttcaccgtca 4620tcctggagac ctccgcgccc cgcaacctcc ccttctacga gcggctcggc ttcaccgtca 4620
ccgccgacgt cgaggtgccc gaaggaccgc gcacctggtg catgacccgc aagcccggtg 4680ccgccgacgt cgaggtgccc gaaggaccgc gcacctggtg catgacccgc aagcccggtg 4680
cctgaggtac ctctcatgct ggagttcttc gcccacccca acttgtttat tgcagcttat 4740cctgaggtac ctctcatgct ggagttcttc gcccaccccca acttgtttat tgcagctttat 4740
aatggttaca aataaagcaa tagcatcaca aatttcacaa ataaagcatt tttttcactg 4800aatggttaca aataaagcaa tagcatcaca aatttcacaa ataaagcatttttttcactg 4800
cattctagtt gtggtttgtc caaactcatc aatgtatctt atcatcgatg aagttcctat 4860cattctagtt gtggtttgtc caaactcatc aatgtatctt atcatcgatg aagttcctat 4860
tctctagaaa gtataggaac ttctaacctc ccgggtgaca gataacttcg tataatgtat 4920tctctagaaa gtataggaac ttctaacctc ccgggtgaca gataacttcg tataatgtat 4920
gctatacgaa gttatgtcga cgcggcgcgc cgaattataa tactaacatt actgaagaaa 4980gctatacgaa gttatgtcga cgcggcgcgc cgaattataa tactaacatt actgaagaaa 4980
atgcccaaaa gatggtaagt tcttgaggct acccaggggg ttattgattg cttcttaaag 5040atgcccaaaa gatggtaagt tcttgaggct acccaggggg ttattgattg cttcttaaag 5040
atcagaatta ctgcctataa aactggataa ggaaatcata gagatctctc aagtgtgagg 5100atcagaatta ctgcctataa aactggataa ggaaatcata gagatctctc aagtgtgagg 5100
atgagtgact gcctctgtag ctctgatcct agtctcccag atggctaaat tcaattgacc 5160atgagtgact gcctctgtag ctctgatcct agtctcccag atggctaaat tcaattgacc 5160
ttagagttca tctggaaaat tgttatgaat gaattatttg cccagattcc aaagatgagt 5220ttagagttca tctggaaaat tgttatgaat gaattatttg cccagattcc aaagatgagt 5220
gaaaatgttt aataaagttg ccatcactat tctcattata tttggtatgt aaagcattca 5280gaaaatgttt aataaagttg ccatcactat tctcattata tttggtatgt aaagcattca 5280
tggaaatgtt ctaagtcgtt attgagccaa taattttctt tagcttataa tgccaacagg 5340tggaaatgtt ctaagtcgtt attgagccaa taattttctt tagcttataa tgccaacagg 5340
tctatccgag aactacaaat gacatattaa ctgaaaaatg caactggggt ttactgaagg 5400tctatccgag aactacaaat gacatattaa ctgaaaaatg caactggggt ttactgaagg 5400
cagcagctta gtaattaagg taaccatggc ttaggtgaaa ctggacctgg gaattccttc 5460cagcagctta gtaattaagg taaccatggc ttaggtgaaa ctggacctgg gaattccttc 5460
tttcattgac acagagctct gaggaatttc caaaggtcac agaagaaaag ctataattaa 5520tttcattgac acagagctct gaggaatttc caaaggtcac agaagaaaag ctataattaa 5520
actagtccca aaaaatctca gcctactctg ggaaagcagc atattttgtt tgacaagtgc 5580actagtccca aaaaatctca gcctactctg ggaaagcagc atattttgtt tgacaagtgc 5580
aaggacttag aacttttttt tttctcactg atcctgaagt gccttttaag tatagttaag 5640aaggacttag aacttttttt tttctcactg atcctgaagt gccttttaag tatagttaag 5640
tggtggaaaa ttgagcaact atttaagaaa agactctttt ttttcttctt ccagcaatgc 5700tggtggaaaa ttgagcaact atttaagaaa agactctttt ttttcttctt ccagcaatgc 5700
tttccttcaa aacggtagct tcaaaacttc ctgtctttta aatgatcagg gggctgtgtg 5760tttccttcaa aacggtagct tcaaaacttc ctgtctttta aatgatcagg gggctgtgtg 5760
tttaaattat tgccattcat agaacagagt gggtctgagg atgcctgttt cctttgaaat 5820tttaaattat tgccattcat agaacagagt gggtctgagg atgcctgttt cctttgaaat 5820
tctatgcccc ctcccagttt tctaaaattt aagaaaccac agagactttg acaatgtagt 5880tctatgcccc ctcccagttt tctaaaattt aagaaaccac agagactttg acaatgtagt 5880
tgccaaatga gttgctttta actgctctaa tagtttggtc tt 5922tgccaaatga gttgctttta actgctctaa tagtttggtc tt 5922
<210> 31<210> 31
<211> 19<211> 19
<212> DNA<212> DNA
<213> Natural sequence<213> Natural sequence
<220><220>
<223> hACE2-F<223> hACE2-F
<400> 31<400> 31
tgatagtggt tggcattgt 19tgatagtggt tggcattgt 19
<210> 32<210> 32
<211> 4591<211> 4591
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> 最终删除PGK-puro后小鼠胚胎干细胞Ace2位点基因组替换序列<223> Genomic replacement sequence of the Ace2 locus in mouse embryonic stem cells after final deletion of PGK-puro
<400> 32<400> 32
ccctatggag tggagaagag tcttataatt ttttaaatgg gcagagaaat gaatttattt 60ccctatggag tggagaagag tcttataatt ttttaaatgg gcagagaaat gaatttattt 60
ttaattttta gagacagggt ttctttgtat agctctagct gtctttgatt ggtagacaaa 120ttaattttta gagacagggt ttctttgtat agctctagct gtctttgatt ggtagacaaa 120
gctgtcctca aactcagaga tcttccttcc tttgtctcct gagtgctggg attaaaggca 180gctgtcctca aactcagaga tcttccttcc tttgtctcct gagtgctggg attaaaggca 180
tggaccacca ctgccctgcc ccattctctc cattaatttt aagtgaatgc ttgcaaaagc 240tggaccacca ctgccctgcc ccattctctc cattaatttt aagtgaatgc ttgcaaaagc 240
tcacttcttt ggtgaacagc ttcctttaca aataagtacc tttgccttcg tttttatagg 300tcacttcttt ggtgaacagc ttcctttaca aataagtacc tttgccttcg tttttatagg 300
attcttaaaa agaaaaaaaa gattcagcca ggtggttgtg gtgcacacct ttaatcccag 360attcttaaaa agaaaaaaaa gattcagcca ggtggttgtg gtgcacacct ttaatcccag 360
cagtcaggag gcagaggaaa gcagatctct tgagtttgag gctagcctag tctacagagg 420cagtcaggag gcagaggaaa gcagatctct tgagtttgag gctagcctag tctacagagg 420
gagttccagg acagccaagg ctacagagag gaactgtcta aaaacaccaa gaaagagaga 480gagttccagg acagccaagg ctacagagag gaactgtcta aaaacaccaa gaaagagaga 480
aaggagagag ggagaggatg gatagcttat tgatagaatt gtcagaaaag gctataagtt 540aaggagagag ggagaggatg gatagctttat tgatagaatt gtcagaaaag gctataagtt 540
ccaatatgtg tcccatgatt tctaagtcta gccctttctg ttatagtaaa atcatagtac 600ccaatatgtg tcccatgatt tctaagtcta gccctttctg ttatagtaaa atcatagtac 600
accctcctcc tccagtgtat ctttaacagc ttttaaggaa catattaact aaatgtccag 660accctcctcc tccagtgtat ctttaacagc ttttaaggaa catattaact aaatgtccag 660
gttttgattt ggccataaaa tgttagcaaa gctaaggttt tctaggatta atgaataaca 720gttttgattt ggccataaaa tgttagcaaa gctaaggttt tctaggatta atgaataaca 720
tgtctttatt tagtttactt aaaaaaatca ttctaaaata tctgtttaca tatctgtcct 780tgtctttatt tagtttactt aaaaaaatca ttctaaaata tctgtttaca tatctgtcct 780
ctccaggatt aacttcatat tggtccagca gcttgtttac tgttctcttc tgtttcttct 840ctccaggatt aacttcatat tggtccagca gcttgtttac tgttctcttc tgtttcttct 840
tctgcttttt ttttcttctc ttctcagtgc ccaacccaag ttcaaaggct gatgagagag 900tctgcttttt ttttcttctc ttctcagtgc ccaacccaag ttcaaaggct gatgagagag 900
aaaaactcat gaagagattt tactctaggg aaagttgctc agtggatggg atcttggcgc 960aaaaactcat gaagagattt tactctaggg aaagttgctc agtggatggg atcttggcgc 960
acggggaaag atgtcaagct cttcctggct ccttctcagc cttgttgctg taactgctgc 1020acggggaaag atgtcaagct cttcctggct ccttctcagc cttgttgctg taactgctgc 1020
tcagtccacc attgaggaac aggccaagac atttttggac aagtttaacc acgaagccga 1080tcagtccacc attgaggaac aggccaagac atttttggac aagtttaacc acgaagccga 1080
agacctgttc tatcaaagtt cacttgcttc ttggaattat aacaccaata ttactgaaga 1140agacctgttc tatcaaagtt cacttgcttc ttggaattat aacaccaata ttactgaaga 1140
gaatgtccaa aacatgaata atgctgggga caaatggtct gcctttttaa aggaacagtc 1200gaatgtccaa aacatgaata atgctgggga caaatggtct gcctttttaa aggaacagtc 1200
cacacttgcc caaatgtatc cactacaaga aattcagaat ctcacagtca agcttcagct 1260cacacttgcc caaatgtatc cactacaaga aattcagaat ctcacagtca agcttcagct 1260
gcaggctctt cagcaaaatg ggtcttcagt gctctcagaa gacaagagca aacggttgaa 1320gcaggctctt cagcaaaatg ggtcttcagt gctctcagaa gacaagagca aacggttgaa 1320
cacaattcta aatacaatga gcaccatcta cagtactgga aaagtttgta acccagataa 1380cacaattcta aatacaatga gcaccatcta cagtactgga aaagtttgta acccagataa 1380
tccacaagaa tgcttattac ttgaaccagg tttgaatgaa ataatggcaa acagtttaga 1440tccacaagaa tgcttattac ttgaaccagg tttgaatgaa ataatggcaa acagtttaga 1440
ctacaatgag aggctctggg cttgggaaag ctggagatct gaggtcggca agcagctgag 1500ctacaatgag aggctctggg cttgggaaag ctggagatct gaggtcggca agcagctgag 1500
gccattatat gaagagtatg tggtcttgaa aaatgagatg gcaagagcaa atcattatga 1560gccattat gaagagtatg tggtcttgaa aaatgagatg gcaagagcaa atcattatga 1560
ggactatggg gattattgga gaggagacta tgaagtaaat ggggtagatg gctatgacta 1620ggactatggg gattattgga gaggagacta tgaagtaaat ggggtagatg gctatgacta 1620
cagccgcggc cagttgattg aagatgtgga acataccttt gaagagatta aaccattata 1680cagccgcggc cagttgattg aagatgtgga acataccttt gaagagatta aaccattata 1680
tgaacatctt catgcctatg tgagggcaaa gttgatgaat gcctatcctt cctatatcag 1740tgaacatctt catgcctatg tgagggcaaa gttgatgaat gcctatcctt cctatatcag 1740
tccaattgga tgcctccctg ctcatttgct tggtgatatg tggggtagat tttggacaaa 1800tccaattgga tgcctccctg ctcatttgct tggtgatatg tggggtagat tttggacaaa 1800
tctgtactct ttgacagttc cctttggaca gaaaccaaac atagatgtta ctgatgcaat 1860tctgtactct ttgacagttc cctttggaca gaaaccaaac atagatgtta ctgatgcaat 1860
ggtggaccag gcctgggatg cacagagaat attcaaggag gccgagaagt tctttgtatc 1920ggtggaccag gcctggggatg cacagagaat attcaaggag gccgagaagt tctttgtatc 1920
tgttggtctt cctaatatga ctcaaggatt ctgggaaaat tccatgctaa cggacccagg 1980tgttggtctt cctaatatga ctcaaggatt ctgggaaaat tccatgctaa cggacccagg 1980
aaatgttcag aaagcagtct gccatcccac agcttgggac ctggggaagg gcgacttcag 2040aaatgttcag aaagcagtct gccatcccac agcttgggac ctggggaagg gcgacttcag 2040
gatccttatg tgcacaaagg tgacaatgga cgacttcctg acagctcatc atgagatggg 2100gatccttatg tgcacaaagg tgacaatgga cgacttcctg acagctcatc atgagatggg 2100
gcatatccag tatgatatgg catatgctgc acaacctttt ctgctaagaa atggagctaa 2160gcatatccag tatgatatgg catatgctgc acaacctttt ctgctaagaa atggagctaa 2160
tgaaggattc catgaagctg ttggggaaat catgtcactt tctgcagcca cacctaagca 2220tgaaggattc catgaagctg ttggggaaat catgtcactt tctgcagcca cacctaagca 2220
tttaaaatcc attggtcttc tgtcacccga ttttcaagaa gacaatgaaa cagaaataaa 2280tttaaaatcc attggtcttc tgtcacccga ttttcaagaa gacaatgaaa cagaaataaa 2280
cttcctgctc aaacaagcac tcacgattgt tgggactctg ccatttactt acatgttaga 2340cttcctgctc aaacaagcac tcacgattgt tgggactctg ccatttactt acatgttaga 2340
gaagtggagg tggatggtct ttaaagggga aattcccaaa gaccagtgga tgaaaaagtg 2400gaagtggagg tggatggtct ttaaagggga aattcccaaa gaccagtgga tgaaaaagtg 2400
gtgggagatg aagcgagaga tagttggggt ggtggaacct gtgccccatg atgaaacata 2460gtgggagatg aagcgagaga tagttggggt ggtggaacct gtgccccatg atgaaacata 2460
ctgtgacccc gcatctctgt tccatgtttc taatgattac tcattcattc gatattacac 2520ctgtgacccc gcatctctgt tccatgtttc taatgattac tcattcattc gatattacac 2520
aaggaccctt taccaattcc agtttcaaga agcactttgt caagcagcta aacatgaagg 2580aaggaccctt taccaattcc agtttcaaga agcactttgt caagcagcta aacatgaagg 2580
ccctctgcac aaatgtgaca tctcaaactc tacagaagct ggacagaaac tgttcaatat 2640ccctctgcac aaatgtgaca tctcaaactc tacagaagct ggacagaaac tgttcaatat 2640
gctgaggctt ggaaaatcag aaccctggac cctagcattg gaaaatgttg taggagcaaa 2700gctgaggctt ggaaaatcag aaccctggac cctagcattg gaaaatgttg taggagcaaa 2700
gaacatgaat gtaaggccac tgctcaacta ctttgagccc ttatttacct ggctgaaaga 2760gaacatgaat gtaaggccac tgctcaacta ctttgagccc ttatttacct ggctgaaaga 2760
ccagaacaag aattcttttg tgggatggag taccgactgg agtccatatg cagaccaaag 2820ccagaacaag aattcttttg tgggatggag taccgactgg agtccatatg cagaccaaag 2820
catcaaagtg aggataagcc taaaatcagc tcttggagat aaagcatatg aatggaacga 2880catcaaagtg aggataagcc taaaatcagc tcttggagat aaagcatatg aatggaacga 2880
caatgaaatg tacctgttcc gatcatctgt tgcatatgct atgaggcagt actttttaaa 2940caatgaaatg tacctgttcc gatcatctgt tgcatatgct atgaggcagt actttttaaa 2940
agtaaaaaat cagatgattc tttttgggga ggaggatgtg cgagtggcta atttgaaacc 3000agtaaaaaat cagatgattc tttttgggga ggaggatgtg cgagtggcta atttgaaacc 3000
aagaatctcc tttaatttct ttgtcactgc acctaaaaat gtgtctgata tcattcctag 3060aagaatctcc tttaatttct ttgtcactgc acctaaaaat gtgtctgata tcattcctag 3060
aactgaagtt gaaaaggcca tcaggatgtc ccggagccgt atcaatgatg ctttccgtct 3120aactgaagtt gaaaaggcca tcaggatgtc ccggagccgt atcaatgatg ctttccgtct 3120
gaatgacaac agcctagagt ttctggggat acagccaaca cttggacctc ctaaccagcc 3180gaatgacaac agcctagagt ttctggggat acagccaaca cttggacctc ctaaccagcc 3180
ccctgtttcc atatggctga ttgtttttgg agttgtgatg ggagtgatag tggttggcat 3240ccctgtttcc atatggctga ttgtttttgg agttgtgatg ggagtgatag tggttggcat 3240
tgtcatcctg atcttcactg ggatcagaga tcggaagaag aaaaataaag caagaagtgg 3300tgtcatcctg atcttcactg ggatcagaga tcggaagaag aaaaataaag caagaagtgg 3300
agaaaatcct tatgcctcca tcgatattag caaaggagaa aataatccag gattccaaaa 3360agaaaatcct tatgcctcca tcgatattag caaaggagaa aataatccag gattccaaaa 3360
cactgatgat gttcagacct ccttttagaa cttgtttatt gcagcttata atggttacaa 3420cactgatgat gttcagacct ccttttagaa cttgtttatt gcagcttata atggttacaa 3420
ataaagcaat agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg 3480ataaagcaat agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg 3480
tggtttgtcc aaactcatca atgtatctta acgcgtgcga agttcctatt ctctagaaag 3540tggtttgtcc aaactcatca atgtatctta acgcgtgcga agttcctatt ctctagaaag 3540
tataggaact tctaacctcc cgggtgacag ataacttcgt ataatgtatg ctatacgaag 3600tataggaact tctaacctcc cgggtgacag ataacttcgt ataatgtatg ctatacgaag 3600
ttatgtcgac gcggcgcgcc gaattataat actaacatta ctgaagaaaa tgcccaaaag 3660ttatgtcgac gcggcgcgcc gaattataat actaacatta ctgaagaaaa tgcccaaaag 3660
atggtaagtt cttgaggcta cccagggggt tattgattgc ttcttaaaga tcagaattac 3720atggtaagtt cttgaggcta cccagggggt tattgattgc ttcttaaaga tcagaattac 3720
tgcctataaa actggataag gaaatcatag agatctctca agtgtgagga tgagtgactg 3780tgcctataaa actggataag gaaatcatag agatctctca agtgtgagga tgagtgactg 3780
cctctgtagc tctgatccta gtctcccaga tggctaaatt caattgacct tagagttcat 3840cctctgtagc tctgatccta gtctcccaga tggctaaatt caattgacct tagagttcat 3840
ctggaaaatt gttatgaatg aattatttgc ccagattcca aagatgagtg aaaatgttta 3900ctggaaaatt gttatgaatg aattatttgc ccagattcca aagatgagtg aaaatgttta 3900
ataaagttgc catcactatt ctcattatat ttggtatgta aagcattcat ggaaatgttc 3960ataaagttgc catcactatt ctcattatat ttggtatgta aagcattcat ggaaatgttc 3960
taagtcgtta ttgagccaat aattttcttt agcttataat gccaacaggt ctatccgaga 4020taagtcgtta ttgagccaat aattttcttt agcttataat gccaacaggt ctatccgaga 4020
actacaaatg acatattaac tgaaaaatgc aactggggtt tactgaaggc agcagcttag 4080actacaaatg acatattaac tgaaaaatgc aactggggtt tactgaaggc agcagcttag 4080
taattaaggt aaccatggct taggtgaaac tggacctggg aattccttct ttcattgaca 4140taattaaggt aaccatggct taggtgaaac tggacctggg aattccttct ttcattgaca 4140
cagagctctg aggaatttcc aaaggtcaca gaagaaaagc tataattaaa ctagtcccaa 4200cagagctctg aggaatttcc aaaggtcaca gaagaaaagc tataattaaa ctagtcccaa 4200
aaaatctcag cctactctgg gaaagcagca tattttgttt gacaagtgca aggacttaga 4260aaaatctcag cctactctgg gaaagcagca tattttgttt gacaagtgca aggacttaga 4260
actttttttt ttctcactga tcctgaagtg ccttttaagt atagttaagt ggtggaaaat 4320actttttttt ttctcactga tcctgaagtg ccttttaagt atagttaagt ggtggaaaat 4320
tgagcaacta tttaagaaaa gactcttttt tttcttcttc cagcaatgct ttccttcaaa 4380tgagcaacta tttaagaaaa gactcttttt tttcttcttc cagcaatgct ttccttcaaa 4380
acggtagctt caaaacttcc tgtcttttaa atgatcaggg ggctgtgtgt ttaaattatt 4440acggtagctt caaaacttcc tgtcttttaa atgatcaggg ggctgtgtgt ttaaattatt 4440
gccattcata gaacagagtg ggtctgagga tgcctgtttc ctttgaaatt ctatgccccc 4500gccattcata gaacagagtg ggtctgagga tgcctgtttc ctttgaaatt ctatgccccc 4500
tcccagtttt ctaaaattta agaaaccaca gagactttga caatgtagtt gccaaatgag 4560tcccagtttt ctaaaattta agaaaccaca gagactttga caatgtagtt gccaaatgag 4560
ttgcttttaa ctgctctaat agtttggtct t 4591ttgcttttaa ctgctctaat agtttggtct t 4591
<210> 33<210> 33
<211> 18<211> 18
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> hACE2-qF<223> hACE2-qF
<400> 33<400> 33
ggtcttcagt gctctcag 18ggtcttcagt gctctcag 18
<210> 34<210> 34
<211> 21<211> 21
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> hACE2-qR<223> hACE2-qR
<400> 34<400> 34
gcattcttgt ggattatctg g 21gcattcttgt ggattatctg g 21
<210> 35<210> 35
<211> 22<211> 22
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> 正向引物<223> Forward primer
<400> 35<400> 35
ggggaacttc tcctgctaga at 22ggggaacttc tcctgctaga at 22
<210> 36<210> 36
<211> 22<211> 22
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> 反向引物<223> Reverse primer
<400> 36<400> 36
cagacatttt gctctcaagc tg 22cagacatttt gctctcaagc tg 22
<210> 37<210> 37
<211> 20<211> 20
<212> DNA<212> DNA
<213> 人工序列(Artificial Sequence)<213> Artificial Sequence
<220><220>
<223> TaqMan 探针<223> TaqMan Probes
<400> 37<400> 37
ttgctgctgc ttgacagatt 20ttgctgctgc ttgacagatt 20
Claims (32)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010152182X | 2020-03-06 | ||
CN202010152182 | 2020-03-06 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113355325A CN113355325A (en) | 2021-09-07 |
CN113355325B true CN113355325B (en) | 2023-07-14 |
Family
ID=77524937
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110245653.6A Active CN113355325B (en) | 2020-03-06 | 2021-03-05 | Preparation method and application of humanized ACE2 gene modified mouse embryonic stem cell model |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113355325B (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113897363B (en) * | 2021-09-07 | 2024-03-29 | 干细胞转化研究中心有限公司 | Embryonic stem cell sublines, construction methods and applications of ACE2 gene knockout |
CN115074385A (en) * | 2022-06-24 | 2022-09-20 | 湖南丰晖生物科技有限公司 | Epigenetic vector for reducing expression level of new coronavirus receptor ACE2 |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1884303A (en) * | 2005-06-20 | 2006-12-27 | 中国医学科学院基础医学研究所 | SARS-CoV virus structural protein infusion protein and its high yield expression and purification and uses |
LT2986729T (en) * | 2013-04-16 | 2018-10-25 | Regeneron Pharmaceuticals, Inc. | Targeted modification of rat genome |
BR112016026245A2 (en) * | 2014-05-30 | 2018-02-20 | Regeneron Pharma | "rodent, and methods for making a humanized rodent, for making a humanized dpp4 mouse, for making a humanized transgenic rodent, and for determining the in vivo therapeutic efficacy of a human-specific dpp4 antagonist in the rodent?" |
CA3172866A1 (en) * | 2014-06-26 | 2015-12-30 | Regeneron Pharamaceuticals, Inc. | Methods and compositions for targeted genetic modifications and methods of use |
US10912287B2 (en) * | 2016-06-28 | 2021-02-09 | Biocytogen Pharmaceuticals (Beijing) Co., Ltd | Genetically modified mice expressing humanized PD-1 |
-
2021
- 2021-03-05 CN CN202110245653.6A patent/CN113355325B/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN113355325A (en) | 2021-09-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102386029B1 (en) | genome editing immune effector cells | |
US20020025561A1 (en) | Vectors for gene-self-assembly | |
CN106755092A (en) | GLCCI1 gene Cre-LoxP conditional gene knockout mouse model-based construction kit and construction method | |
CN113355325B (en) | Preparation method and application of humanized ACE2 gene modified mouse embryonic stem cell model | |
CN110042066A (en) | The FC- bait antibody display system of surface anchoring | |
CN113355323B (en) | Preparation method and application of humanized ACE2 gene modified mouse model | |
CN114933970B (en) | Toxoplasma gondii knockout strain lacking 6-phosphogluconate dehydrogenase 1 gene | |
KR20240021218A (en) | Novel type V RNA programmable endonuclease system | |
KR20150013503A (en) | Surface anchored light chain bait antibody display system | |
CN104651402B (en) | Universal gene targeting vector | |
CN104017827A (en) | Lentiviral plasmid expression vector as well as construction method and application of lentiviral plasmid expression vector | |
CN112626116B (en) | Method for site-specific integration of large-fragment exogenous DNA | |
US9850497B2 (en) | Gene targeting methods and tools | |
KR102624831B1 (en) | Production of transgenic dogs overexpressing muscle-specific PCK1 | |
KR101666692B1 (en) | Cells, nucleic acid constructs, cells comprising said constructs and methods utilizing said cells in the treatment of diseases | |
CN112322706A (en) | Specific human gene fragment, primer probe and application thereof | |
KR20230142740A (en) | OMNI-103 CRISPR Nuclease | |
CN116887854A (en) | OMNI-103 CRISPR nuclease | |
KR102468650B1 (en) | Recombinant vector inducing expression of T7 RNA polymerase and mRNA capping enzyme and uses thereof | |
CN113073097B (en) | CHO cell endogenous temperature-sensitive promoter and application thereof | |
CN113444722B (en) | Application of single base editing mediated splicing repair in preparation of medicines for treating spinal muscular atrophy | |
CN115216492B (en) | Preparation method and application of a mouse primary glioma model | |
KR20240021211A (en) | Nucleic acid coding for KLK2-GPI fusion protein, recombinant cells and uses thereof | |
JP3688118B2 (en) | Gene trap vector and gene trap method using this vector | |
CN112980800A (en) | CAR-T cell, construction method and application thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |