CN103992395B - 重组hpv-58型l1的vlp疫苗及其制备方法 - Google Patents
重组hpv-58型l1的vlp疫苗及其制备方法 Download PDFInfo
- Publication number
- CN103992395B CN103992395B CN201410054216.6A CN201410054216A CN103992395B CN 103992395 B CN103992395 B CN 103992395B CN 201410054216 A CN201410054216 A CN 201410054216A CN 103992395 B CN103992395 B CN 103992395B
- Authority
- CN
- China
- Prior art keywords
- leu
- gly
- val
- ser
- thr
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 229960005486 vaccine Drugs 0.000 title claims abstract description 54
- 238000002360 preparation method Methods 0.000 title claims description 7
- 101000911390 Homo sapiens Coagulation factor VIII Proteins 0.000 title description 5
- 102000057593 human F8 Human genes 0.000 title description 5
- 229940047431 recombinate Drugs 0.000 title description 5
- 230000006798 recombination Effects 0.000 claims abstract description 30
- 238000005215 recombination Methods 0.000 claims abstract description 29
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 11
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 11
- 239000002157 polynucleotide Substances 0.000 claims abstract description 11
- 208000015181 infectious disease Diseases 0.000 claims abstract description 9
- 101100102667 Human papillomavirus 58 L1 gene Proteins 0.000 claims description 48
- 108090000623 proteins and genes Proteins 0.000 claims description 37
- 230000014509 gene expression Effects 0.000 claims description 34
- 238000000034 method Methods 0.000 claims description 31
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 21
- 241000588724 Escherichia coli Species 0.000 claims description 12
- 229960002566 papillomavirus vaccine Drugs 0.000 claims description 10
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 9
- 239000013604 expression vector Substances 0.000 claims description 9
- 201000010099 disease Diseases 0.000 claims description 8
- 239000002671 adjuvant Substances 0.000 claims description 7
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims description 6
- 239000003814 drug Substances 0.000 claims description 5
- 229940079593 drug Drugs 0.000 claims description 4
- 239000003795 chemical substances by application Substances 0.000 claims description 3
- 239000000178 monomer Substances 0.000 claims description 3
- 101000936049 Vibrio cholerae serotype O1 (strain ATCC 39315 / El Tor Inaba N16961) Outer membrane lipoprotein Blc Proteins 0.000 claims 2
- 239000002131 composite material Substances 0.000 claims 1
- 239000012634 fragment Substances 0.000 abstract description 21
- 230000002068 genetic effect Effects 0.000 abstract description 21
- 238000013519 translation Methods 0.000 abstract description 3
- 208000022361 Human papillomavirus infectious disease Diseases 0.000 description 30
- 108091008146 restriction endonucleases Proteins 0.000 description 30
- 235000001014 amino acid Nutrition 0.000 description 22
- 150000001413 amino acids Chemical class 0.000 description 21
- 238000006243 chemical reaction Methods 0.000 description 21
- 102000004169 proteins and genes Human genes 0.000 description 21
- 235000018102 proteins Nutrition 0.000 description 19
- 210000004027 cell Anatomy 0.000 description 16
- 239000000203 mixture Substances 0.000 description 16
- 239000013612 plasmid Substances 0.000 description 14
- 241000894006 Bacteria Species 0.000 description 13
- 239000002245 particle Substances 0.000 description 13
- 108020004414 DNA Proteins 0.000 description 12
- 210000004899 c-terminal region Anatomy 0.000 description 12
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 12
- 230000029087 digestion Effects 0.000 description 11
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 10
- 241000700605 Viruses Species 0.000 description 10
- 239000002773 nucleotide Substances 0.000 description 10
- 125000003729 nucleotide group Chemical group 0.000 description 10
- 239000000047 product Substances 0.000 description 10
- 208000019065 cervical carcinoma Diseases 0.000 description 9
- 238000006467 substitution reaction Methods 0.000 description 9
- 108010065920 Insulin Lispro Proteins 0.000 description 8
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 8
- 230000003321 amplification Effects 0.000 description 8
- 108010050848 glycylleucine Proteins 0.000 description 8
- 108010064235 lysylglycine Proteins 0.000 description 8
- 230000003472 neutralizing effect Effects 0.000 description 8
- 238000003199 nucleic acid amplification method Methods 0.000 description 8
- 238000000746 purification Methods 0.000 description 8
- 239000000243 solution Substances 0.000 description 8
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 7
- 239000007853 buffer solution Substances 0.000 description 7
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 7
- 239000007788 liquid Substances 0.000 description 7
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 6
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 6
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 6
- WXVGISRWSYGEDK-KKUMJFAQSA-N Asn-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N WXVGISRWSYGEDK-KKUMJFAQSA-N 0.000 description 6
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 6
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 6
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 6
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 6
- 108010075254 C-Peptide Proteins 0.000 description 6
- YXPNKXFOBHRUBL-BJDJZHNGSA-N Cys-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N YXPNKXFOBHRUBL-BJDJZHNGSA-N 0.000 description 6
- 108010090461 DFG peptide Proteins 0.000 description 6
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 6
- LVSYIKGMLRHKME-IUCAKERBSA-N Gln-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N LVSYIKGMLRHKME-IUCAKERBSA-N 0.000 description 6
- QFXNFFZTMFHPST-DZKIICNBSA-N Gln-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)N)N QFXNFFZTMFHPST-DZKIICNBSA-N 0.000 description 6
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 6
- YRHZWVKUFWCEPW-GLLZPBPUSA-N Gln-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O YRHZWVKUFWCEPW-GLLZPBPUSA-N 0.000 description 6
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 6
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 6
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 6
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 6
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 6
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 6
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 6
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 6
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 6
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 6
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 6
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 6
- 206010028980 Neoplasm Diseases 0.000 description 6
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 6
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 6
- MMYUOSCXBJFUNV-QWRGUYRKSA-N Phe-Gly-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N MMYUOSCXBJFUNV-QWRGUYRKSA-N 0.000 description 6
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 6
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 6
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 6
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 6
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 6
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 6
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 6
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 6
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 6
- QNJZOAHSYPXTAB-VEVYYDQMSA-N Thr-Asn-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O QNJZOAHSYPXTAB-VEVYYDQMSA-N 0.000 description 6
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 6
- ODXKUIGEPAGKKV-KATARQTJSA-N Thr-Leu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O ODXKUIGEPAGKKV-KATARQTJSA-N 0.000 description 6
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 6
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 6
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 6
- WLBZWXXGSOLJBA-HOCLYGCPSA-N Trp-Gly-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 WLBZWXXGSOLJBA-HOCLYGCPSA-N 0.000 description 6
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 6
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 6
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 6
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 6
- UXODSMTVPWXHBT-ULQDDVLXSA-N Val-Phe-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N UXODSMTVPWXHBT-ULQDDVLXSA-N 0.000 description 6
- 108010041407 alanylaspartic acid Proteins 0.000 description 6
- 108010044940 alanylglutamine Proteins 0.000 description 6
- 239000002585 base Substances 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 6
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 6
- 108010010147 glycylglutamine Proteins 0.000 description 6
- 230000005847 immunogenicity Effects 0.000 description 6
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 6
- 108010038320 lysylphenylalanine Proteins 0.000 description 6
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 6
- 108010029020 prolylglycine Proteins 0.000 description 6
- 239000011347 resin Substances 0.000 description 6
- 229920005989 resin Polymers 0.000 description 6
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 6
- NTWOPSIUJBMNRI-KKUMJFAQSA-N Asn-Lys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTWOPSIUJBMNRI-KKUMJFAQSA-N 0.000 description 5
- CMBDUPIBCOEWNE-BJDJZHNGSA-N Asp-Leu-Asp-Gln Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CMBDUPIBCOEWNE-BJDJZHNGSA-N 0.000 description 5
- 102000012410 DNA Ligases Human genes 0.000 description 5
- 108010061982 DNA Ligases Proteins 0.000 description 5
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 5
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 5
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 5
- 102000003960 Ligases Human genes 0.000 description 5
- 108090000364 Ligases Proteins 0.000 description 5
- 241000699670 Mus sp. Species 0.000 description 5
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 5
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 5
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 5
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 5
- 238000004587 chromatography analysis Methods 0.000 description 5
- 239000013613 expression plasmid Substances 0.000 description 5
- 108010034529 leucyl-lysine Proteins 0.000 description 5
- 238000004519 manufacturing process Methods 0.000 description 5
- 239000002777 nucleoside Substances 0.000 description 5
- 150000003833 nucleoside derivatives Chemical class 0.000 description 5
- 230000002265 prevention Effects 0.000 description 5
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 5
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 4
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 4
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 4
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 4
- BDMIFVIWCNLDCT-CIUDSAMLSA-N Asn-Arg-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O BDMIFVIWCNLDCT-CIUDSAMLSA-N 0.000 description 4
- DNYRZPOWBTYFAF-IHRRRGAJSA-N Asn-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)O DNYRZPOWBTYFAF-IHRRRGAJSA-N 0.000 description 4
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 4
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 4
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 4
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 4
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 4
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 4
- LLRJPYJQNBMOOO-QEJZJMRPSA-N Asp-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N LLRJPYJQNBMOOO-QEJZJMRPSA-N 0.000 description 4
- KIQKJXYVGSYDFS-ZLUOBGJFSA-N Cys-Asn-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KIQKJXYVGSYDFS-ZLUOBGJFSA-N 0.000 description 4
- SRIRHERUAMYIOQ-CIUDSAMLSA-N Cys-Leu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SRIRHERUAMYIOQ-CIUDSAMLSA-N 0.000 description 4
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 4
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 4
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 4
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 4
- ARIORLIIMJACKZ-KKUMJFAQSA-N Glu-Pro-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ARIORLIIMJACKZ-KKUMJFAQSA-N 0.000 description 4
- WJZLEENECIOOSA-WDSKDSINSA-N Gly-Asn-Gln Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)O WJZLEENECIOOSA-WDSKDSINSA-N 0.000 description 4
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 4
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 4
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 4
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 4
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 4
- 241000238631 Hexapoda Species 0.000 description 4
- 241000341655 Human papillomavirus type 16 Species 0.000 description 4
- FADXGVVLSPPEQY-GHCJXIJMSA-N Ile-Cys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FADXGVVLSPPEQY-GHCJXIJMSA-N 0.000 description 4
- ZIPOVLBRVPXWJQ-SPOWBLRKSA-N Ile-Cys-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N ZIPOVLBRVPXWJQ-SPOWBLRKSA-N 0.000 description 4
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 4
- OEQKGSPBDVKYOC-ZKWXMUAHSA-N Ile-Gly-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OEQKGSPBDVKYOC-ZKWXMUAHSA-N 0.000 description 4
- URWXDJAEEGBADB-TUBUOCAGSA-N Ile-His-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N URWXDJAEEGBADB-TUBUOCAGSA-N 0.000 description 4
- 241000880493 Leptailurus serval Species 0.000 description 4
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 4
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 4
- NTXYXFDMIHXTHE-WDSOQIARSA-N Leu-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 NTXYXFDMIHXTHE-WDSOQIARSA-N 0.000 description 4
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 4
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 4
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 4
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 4
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 4
- XOMXAVJBLRROMC-IHRRRGAJSA-N Met-Asp-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOMXAVJBLRROMC-IHRRRGAJSA-N 0.000 description 4
- PNDCUTDWYVKBHX-IHRRRGAJSA-N Met-Asp-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PNDCUTDWYVKBHX-IHRRRGAJSA-N 0.000 description 4
- NSMXRFMGZYTFEX-KJEVXHAQSA-N Met-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCSC)N)O NSMXRFMGZYTFEX-KJEVXHAQSA-N 0.000 description 4
- 108091028043 Nucleic acid sequence Proteins 0.000 description 4
- 208000009608 Papillomavirus Infections Diseases 0.000 description 4
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 4
- WKLMCMXFMQEKCX-SLFFLAALSA-N Phe-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O WKLMCMXFMQEKCX-SLFFLAALSA-N 0.000 description 4
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 4
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 4
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 4
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 4
- QMABBZHZMDXHKU-FKBYEOEOSA-N Pro-Tyr-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QMABBZHZMDXHKU-FKBYEOEOSA-N 0.000 description 4
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 4
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 4
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 4
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 4
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 4
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 4
- SOACHCFYJMCMHC-BWBBJGPYSA-N Ser-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O SOACHCFYJMCMHC-BWBBJGPYSA-N 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 4
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 4
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 4
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 4
- MWUYSCVVPVITMW-IGNZVWTISA-N Tyr-Tyr-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 MWUYSCVVPVITMW-IGNZVWTISA-N 0.000 description 4
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 4
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 4
- HVRRJRMULCPNRO-BZSNNMDCSA-N Val-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 HVRRJRMULCPNRO-BZSNNMDCSA-N 0.000 description 4
- 238000001042 affinity chromatography Methods 0.000 description 4
- 108010011559 alanylphenylalanine Proteins 0.000 description 4
- 108010060035 arginylproline Proteins 0.000 description 4
- 201000011510 cancer Diseases 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- RWSXRVCMGQZWBV-WDSKDSINSA-N glutathione Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@@H](CS)C(=O)NCC(O)=O RWSXRVCMGQZWBV-WDSKDSINSA-N 0.000 description 4
- 108010020688 glycylhistidine Proteins 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- 238000009776 industrial production Methods 0.000 description 4
- 238000004255 ion exchange chromatography Methods 0.000 description 4
- 108010009298 lysylglutamic acid Proteins 0.000 description 4
- 239000000546 pharmaceutical excipient Substances 0.000 description 4
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 4
- 239000008363 phosphate buffer Substances 0.000 description 4
- 108010071207 serylmethionine Proteins 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- 238000011282 treatment Methods 0.000 description 4
- 108010080629 tryptophan-leucine Proteins 0.000 description 4
- 108010038745 tryptophylglycine Proteins 0.000 description 4
- 108010051110 tyrosyl-lysine Proteins 0.000 description 4
- FRXSZNDVFUDTIR-UHFFFAOYSA-N 6-methoxy-1,2,3,4-tetrahydroquinoline Chemical compound N1CCCC2=CC(OC)=CC=C21 FRXSZNDVFUDTIR-UHFFFAOYSA-N 0.000 description 3
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 3
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 3
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 3
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 3
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 3
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 3
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 3
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 3
- 241000701806 Human papillomavirus Species 0.000 description 3
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 3
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 3
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 3
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 3
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- YTGGLKWSVIRECD-JBACZVJFSA-N Phe-Trp-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 YTGGLKWSVIRECD-JBACZVJFSA-N 0.000 description 3
- SMCHPSMKAFIERP-FXQIFTODSA-N Pro-Asn-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 SMCHPSMKAFIERP-FXQIFTODSA-N 0.000 description 3
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 3
- QAOWNCQODCNURD-UHFFFAOYSA-L Sulfate Chemical compound [O-]S([O-])(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-L 0.000 description 3
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 3
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 3
- XIHGJKFSIDTDKV-LYARXQMPSA-N Thr-Phe-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XIHGJKFSIDTDKV-LYARXQMPSA-N 0.000 description 3
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 3
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 3
- 108010047495 alanylglycine Proteins 0.000 description 3
- 108010087924 alanylproline Proteins 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 238000002296 dynamic light scattering Methods 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 108010078144 glutaminyl-glycine Proteins 0.000 description 3
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 3
- 238000011017 operating method Methods 0.000 description 3
- 239000008194 pharmaceutical composition Substances 0.000 description 3
- 229920000642 polymer Polymers 0.000 description 3
- 230000009465 prokaryotic expression Effects 0.000 description 3
- 239000012264 purified product Substances 0.000 description 3
- 239000004094 surface-active agent Substances 0.000 description 3
- 230000014616 translation Effects 0.000 description 3
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 2
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 2
- 206010059313 Anogenital warts Diseases 0.000 description 2
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 2
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 2
- NYZGVTGOMPHSJW-CIUDSAMLSA-N Arg-Glu-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N NYZGVTGOMPHSJW-CIUDSAMLSA-N 0.000 description 2
- QCTOLCVIGRLMQS-HRCADAONSA-N Arg-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O QCTOLCVIGRLMQS-HRCADAONSA-N 0.000 description 2
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 2
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 2
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 2
- PIWWUBYJNONVTJ-ZLUOBGJFSA-N Asn-Asp-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N PIWWUBYJNONVTJ-ZLUOBGJFSA-N 0.000 description 2
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 2
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 2
- GIQCDTKOIPUDSG-GARJFASQSA-N Asn-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N)C(=O)O GIQCDTKOIPUDSG-GARJFASQSA-N 0.000 description 2
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 2
- KVPHTGVUMJGMCX-BIIVOSGPSA-N Asp-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)C(=O)O KVPHTGVUMJGMCX-BIIVOSGPSA-N 0.000 description 2
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 2
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 2
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 2
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 2
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 2
- 108010083946 Asp-Tyr-Leu-Lys Proteins 0.000 description 2
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 2
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 2
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 2
- 108090000565 Capsid Proteins Proteins 0.000 description 2
- 102100023321 Ceruloplasmin Human genes 0.000 description 2
- 206010008342 Cervix carcinoma Diseases 0.000 description 2
- CPTUXCUWQIBZIF-ZLUOBGJFSA-N Cys-Asn-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CPTUXCUWQIBZIF-ZLUOBGJFSA-N 0.000 description 2
- BUAUGQJXGNRTQE-AAEUAGOBSA-N Cys-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N BUAUGQJXGNRTQE-AAEUAGOBSA-N 0.000 description 2
- AZDQAZRURQMSQD-XPUUQOCRSA-N Cys-Val-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AZDQAZRURQMSQD-XPUUQOCRSA-N 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- 241000588722 Escherichia Species 0.000 description 2
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 2
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 2
- DFRYZTUPVZNRLG-KKUMJFAQSA-N Gln-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DFRYZTUPVZNRLG-KKUMJFAQSA-N 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- 108010024636 Glutathione Proteins 0.000 description 2
- LGQZOQRDEUIZJY-YUMQZZPRSA-N Gly-Cys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CS)NC(=O)CN)C(O)=O LGQZOQRDEUIZJY-YUMQZZPRSA-N 0.000 description 2
- JUBDONGMHASUCN-IUCAKERBSA-N Gly-Glu-His Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O JUBDONGMHASUCN-IUCAKERBSA-N 0.000 description 2
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 2
- AHEBIAHEZWQVHB-QTKMDUPCSA-N His-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O AHEBIAHEZWQVHB-QTKMDUPCSA-N 0.000 description 2
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 2
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 2
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 2
- GOFJOGXGMPHOGL-DCAQKATOSA-N Leu-Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C GOFJOGXGMPHOGL-DCAQKATOSA-N 0.000 description 2
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 2
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 2
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 2
- JLVVSXFLKOJNIY-UHFFFAOYSA-N Magnesium ion Chemical compound [Mg+2] JLVVSXFLKOJNIY-UHFFFAOYSA-N 0.000 description 2
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 2
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 2
- OWSLLRKCHLTUND-BZSNNMDCSA-N Phe-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OWSLLRKCHLTUND-BZSNNMDCSA-N 0.000 description 2
- CBENHWCORLVGEQ-HJOGWXRNSA-N Phe-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CBENHWCORLVGEQ-HJOGWXRNSA-N 0.000 description 2
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 2
- WWAQEUOYCYMGHB-FXQIFTODSA-N Pro-Asn-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 WWAQEUOYCYMGHB-FXQIFTODSA-N 0.000 description 2
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 2
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 2
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 2
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 2
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 2
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 2
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 2
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 206010037660 Pyrexia Diseases 0.000 description 2
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 2
- 229920002684 Sepharose Polymers 0.000 description 2
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 2
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 2
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 2
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 2
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 2
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 2
- YAAPRMFURSENOZ-KATARQTJSA-N Thr-Cys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O YAAPRMFURSENOZ-KATARQTJSA-N 0.000 description 2
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 2
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 2
- RPECVQBNONKZAT-WZLNRYEVSA-N Thr-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H]([C@@H](C)O)N RPECVQBNONKZAT-WZLNRYEVSA-N 0.000 description 2
- CJEHCEOXPLASCK-MEYUZBJRSA-N Thr-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=C(O)C=C1 CJEHCEOXPLASCK-MEYUZBJRSA-N 0.000 description 2
- VIWQOOBRKCGSDK-RYQLBKOJSA-N Trp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O VIWQOOBRKCGSDK-RYQLBKOJSA-N 0.000 description 2
- WPSYJHFHZYJXMW-JSGCOSHPSA-N Trp-Gln-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O WPSYJHFHZYJXMW-JSGCOSHPSA-N 0.000 description 2
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 2
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 2
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 2
- BXJQKVDPRMLGKN-PMVMPFDFSA-N Tyr-Trp-Leu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(O)=O)C1=CC=C(O)C=C1 BXJQKVDPRMLGKN-PMVMPFDFSA-N 0.000 description 2
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 description 2
- IRAUYEAFPFPVND-UVBJJODRSA-N Val-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 IRAUYEAFPFPVND-UVBJJODRSA-N 0.000 description 2
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 2
- 229910021502 aluminium hydroxide Inorganic materials 0.000 description 2
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 2
- 229960000723 ampicillin Drugs 0.000 description 2
- 239000000427 antigen Substances 0.000 description 2
- 102000036639 antigens Human genes 0.000 description 2
- 108091007433 antigens Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 239000001110 calcium chloride Substances 0.000 description 2
- 235000011148 calcium chloride Nutrition 0.000 description 2
- 229910001628 calcium chloride Inorganic materials 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 201000010881 cervical cancer Diseases 0.000 description 2
- 239000007795 chemical reaction product Substances 0.000 description 2
- 238000005336 cracking Methods 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 229940088598 enzyme Drugs 0.000 description 2
- 229960003180 glutathione Drugs 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 230000003053 immunization Effects 0.000 description 2
- 238000002649 immunization Methods 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 210000002429 large intestine Anatomy 0.000 description 2
- 238000011031 large-scale manufacturing process Methods 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 229910001425 magnesium ion Inorganic materials 0.000 description 2
- 239000002808 molecular sieve Substances 0.000 description 2
- 238000007747 plating Methods 0.000 description 2
- 239000002574 poison Substances 0.000 description 2
- 231100000614 poison Toxicity 0.000 description 2
- 238000004321 preservation Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000011321 prophylaxis Methods 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- 239000012744 reinforcing agent Substances 0.000 description 2
- 238000004153 renaturation Methods 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- URGAHOPLAPQHLN-UHFFFAOYSA-N sodium aluminosilicate Chemical compound [Na+].[Al+3].[O-][Si]([O-])=O.[O-][Si]([O-])=O URGAHOPLAPQHLN-UHFFFAOYSA-N 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 108010061238 threonyl-glycine Proteins 0.000 description 2
- 229910021642 ultra pure water Inorganic materials 0.000 description 2
- 239000012498 ultrapure water Substances 0.000 description 2
- 230000000007 visual effect Effects 0.000 description 2
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 2
- NHBKXEKEPDILRR-UHFFFAOYSA-N 2,3-bis(butanoylsulfanyl)propyl butanoate Chemical compound CCCC(=O)OCC(SC(=O)CCC)CSC(=O)CCC NHBKXEKEPDILRR-UHFFFAOYSA-N 0.000 description 1
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 1
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 1
- BVKZGUZCCUSVTD-UHFFFAOYSA-M Bicarbonate Chemical compound OC([O-])=O BVKZGUZCCUSVTD-UHFFFAOYSA-M 0.000 description 1
- BTBUEUYNUDRHOZ-UHFFFAOYSA-N Borate Chemical compound [O-]B([O-])[O-] BTBUEUYNUDRHOZ-UHFFFAOYSA-N 0.000 description 1
- 208000009458 Carcinoma in Situ Diseases 0.000 description 1
- 229940124957 Cervarix Drugs 0.000 description 1
- 208000000907 Condylomata Acuminata Diseases 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 229940124897 Gardasil Drugs 0.000 description 1
- GRHXUHCFENOCOS-ZPFDUUQYSA-N Glu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GRHXUHCFENOCOS-ZPFDUUQYSA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- NTNUEBVGKMVANB-NHCYSSNCSA-N Glu-Val-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O NTNUEBVGKMVANB-NHCYSSNCSA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 1
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 1
- 239000007995 HEPES buffer Substances 0.000 description 1
- 101710121996 Hexon protein p72 Proteins 0.000 description 1
- 108700005307 Human papillomavirus HPV L1 Proteins 0.000 description 1
- 241000722343 Human papillomavirus types Species 0.000 description 1
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 1
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- 239000007993 MOPS buffer Substances 0.000 description 1
- 101710125418 Major capsid protein Proteins 0.000 description 1
- 101710135729 Major capsid protein L1 Proteins 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 108090001074 Nucleocapsid Proteins Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 241001631646 Papillomaviridae Species 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- 108700011066 PreScission Protease Proteins 0.000 description 1
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 1
- NOXSEHJOXCWRHK-DCAQKATOSA-N Pro-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 NOXSEHJOXCWRHK-DCAQKATOSA-N 0.000 description 1
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 1
- UTCFSBBXPWKLTG-XKBZYTNZSA-N Thr-Cys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O UTCFSBBXPWKLTG-XKBZYTNZSA-N 0.000 description 1
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 1
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- 208000000260 Warts Diseases 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 238000005377 adsorption chromatography Methods 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 229910052783 alkali metal Inorganic materials 0.000 description 1
- -1 alkali metal salt Chemical class 0.000 description 1
- 230000000735 allogeneic effect Effects 0.000 description 1
- 229910052782 aluminium Inorganic materials 0.000 description 1
- 239000004411 aluminium Substances 0.000 description 1
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 1
- ILRRQNADMUWWFW-UHFFFAOYSA-K aluminium phosphate Chemical compound O1[Al]2OP1(=O)O2 ILRRQNADMUWWFW-UHFFFAOYSA-K 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 150000003863 ammonium salts Chemical class 0.000 description 1
- 238000005349 anion exchange Methods 0.000 description 1
- 150000001450 anions Chemical class 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 239000000337 buffer salt Substances 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000000711 cancerogenic effect Effects 0.000 description 1
- 231100000315 carcinogenic Toxicity 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 230000010307 cell transformation Effects 0.000 description 1
- 210000003679 cervix uteri Anatomy 0.000 description 1
- 239000003638 chemical reducing agent Substances 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 230000003750 conditioning effect Effects 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- BNIILDVGGAEEIG-UHFFFAOYSA-L disodium hydrogen phosphate Chemical compound [Na+].[Na+].OP([O-])([O-])=O BNIILDVGGAEEIG-UHFFFAOYSA-L 0.000 description 1
- 230000005611 electricity Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 210000000981 epithelium Anatomy 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 238000001125 extrusion Methods 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 239000000945 filler Substances 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 238000000227 grinding Methods 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 238000000265 homogenisation Methods 0.000 description 1
- 208000021145 human papilloma virus infection Diseases 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 238000012872 hydroxylapatite chromatography Methods 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 208000020082 intraepithelial neoplasia Diseases 0.000 description 1
- 230000009545 invasion Effects 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 239000010977 jade Substances 0.000 description 1
- 230000003902 lesion Effects 0.000 description 1
- 239000007791 liquid phase Substances 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000005374 membrane filtration Methods 0.000 description 1
- 229910000403 monosodium phosphate Inorganic materials 0.000 description 1
- 235000019799 monosodium phosphate Nutrition 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 210000000214 mouth Anatomy 0.000 description 1
- 210000004400 mucous membrane Anatomy 0.000 description 1
- 210000003928 nasal cavity Anatomy 0.000 description 1
- 230000009826 neoplastic cell growth Effects 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 238000010899 nucleation Methods 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 150000007523 nucleic acids Chemical class 0.000 description 1
- 239000003002 pH adjusting agent Substances 0.000 description 1
- 239000012071 phase Substances 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 description 1
- 229920000053 polysorbate 80 Polymers 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 108010065320 prolyl-lysyl-glutamyl-lysine Proteins 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 238000001814 protein method Methods 0.000 description 1
- 229940023143 protein vaccine Drugs 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000011218 seed culture Methods 0.000 description 1
- 201000010153 skin papilloma Diseases 0.000 description 1
- 239000002002 slurry Substances 0.000 description 1
- AJPJDKMHJJGVTQ-UHFFFAOYSA-M sodium dihydrogen phosphate Chemical compound [Na+].OP(O)([O-])=O AJPJDKMHJJGVTQ-UHFFFAOYSA-M 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 206010041823 squamous cell carcinoma Diseases 0.000 description 1
- 239000010414 supernatant solution Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 231100000033 toxigenic Toxicity 0.000 description 1
- 230000001551 toxigenic effect Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 238000002525 ultrasonication Methods 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 210000001215 vagina Anatomy 0.000 description 1
- 230000029812 viral genome replication Effects 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 230000004735 virus-associated carcinogenesis Effects 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/20011—Papillomaviridae
- C12N2710/20022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/20011—Papillomaviridae
- C12N2710/20023—Virus like particles [VLP]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2710/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA dsDNA viruses
- C12N2710/00011—Details
- C12N2710/20011—Papillomaviridae
- C12N2710/20034—Use of virus or viral component as vaccine, e.g. live-attenuated or inactivated virus, VLP, viral protein
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Virology (AREA)
- Peptides Or Proteins (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
Abstract
本发明提供一种新的编码重组的HPV58 L1蛋白的多核苷酸基因片段、包含该基因片段的载体、包括载体的宿主细胞,以及由该基因片段翻译表达的HPV58 L1蛋白五聚体或VLP,和由其组成的抗HPV58型感染的疫苗。
Description
技术领域
本发明涉及人类乳头状瘤病毒感染的预防和/或治疗。更具体而言,本发明涉及一种重组的人乳头瘤病毒58型L1蛋白,及由其组成的五聚体和VLP,含该蛋白的疫苗及其在预防HPV58型病毒感染,特别是在预防HPV58型病毒感染引起的宫颈癌疾病中的用途。
背景技术
人类乳头状瘤病毒(Human Papillomavirus,简称HPV)是通过密切接触而传播的DNA病毒。在人体组织中,HPV主要感染皮肤和黏膜组织。HPV DNA按病毒致癌能力的大小分为三类:(1)低危型HPV,包括HPV6,11,40,42,43,44,54,61,70.72.51,主要引起良性外生性疣,宫颈上皮内瘤变(cervical intraepithelial neoplasm,CIN)(2)高危型HPV,包括HPV16,18,31,33,35,39,43,51,52,56,58,59,68,73,82,主要引起宫颈上皮内中、高度瘤变(CINII、CTNIII)和宫颈浸润型鳞癌.最常见的是HPV 16、18。(3)可能致癌型,包括HPV 26,53,66(Munoz N,Bosch Fx,Sanjose S,et a1.Epidemiologie classification of humanpapilloma virus types associated with cervical cancer[J].N Engl J Med,2003,348:8)。宫颈癌是女性第二大恶性肿瘤,每年全球的发病大概在54万(2013年),约有24万例死亡,幸运的是,宫颈癌是唯一研制出疫苗的癌症。2006年6月8日,美国食品与药品管理局(FDA)正式批准美国Merck公司(即默沙东公司)生产的Gardasil HPV预防性疫苗上市;它是由酿酒酵母表达并纯化的HPV16/18/6/11L1VLP四价宫颈癌预防性疫苗,被批准用于预防6~26岁女孩和妇女HPV16、18、6、11型感染所引起的宫颈癌、癌前病变和生殖器疣,这是FDA通过的世界上第一个肿瘤疫苗(Villa,Costa et al.2005,Villa,Ault et al.2006,Bryan2007,Olsson,Villa et al.2007,Goldstone and Vuocolo 2012)。随后英国葛兰素史克(GSK)公司生产的商品名为Cervarix的HPV预防性疫苗也成功上市,它是由来源于昆虫表达系统的HPV16/18L1VLP二价宫颈癌预防性疫苗。但这两种预防性疫苗价格昂贵,极大限制了在发展中国家和落后地区的使用,因此开发一种低成本的高效价HPV疫苗就显得尤为重要(Jansen and Shaw 2004,Buonaguro,Tornesello et al.2009,Campo and Roden 2010,Frazer,Leggatt et al.2011,Hariri,Dunne et al.2011,Lehtinen and Dillner 2013,Shaw 2013)。
HPV属乳头多瘤空泡病毒科(Papovaviridae)乳头瘤病毒属,为无包膜DNA病毒。病毒基因组为双链闭环DNA,大小约为7.2~8kb,具有8个开放框。基因组按功能的不同可以分为三个区域:早期区(E),约4.5kb,编码E1、E2、E4~E7共6个与病毒复制,转录及转化有关的非结构蛋白;晚期区(L),约2.5kb,编码主要衣壳蛋白L1和次要衣壳蛋白L2;长调控区(LCR),位于L区末端与E区起始端之间,长约800~900bp,不编码任何蛋白,含DNA复制、表达调控元件。
HPV病毒颗粒直径为55~60nm,核衣壳呈20面体对称,由72个主要衣壳蛋白L1的五聚体及次要衣壳蛋白L2组成。大量研究证实,HPV L1蛋白是HPV疫苗的主要靶蛋白。在多种表达系统中表达的HPV L1蛋白无需L2蛋白辅助即可形成在形态结构与天然病毒颗粒相似的类病毒颗粒(Virus-L1keParticle,VLP)。重组HPV L1-VLP疫苗已经成功上市并用于预防HPV感染及由此导致的宫颈癌、尖锐湿疣等疾病,并充分证明了L1-VLP具有与野生同型病毒相同的抗原性和免疫原性。从组成VLP的三级结构看,其抗原决定簇均分布于组成VLP的基本结构单元五聚体的表面(Xiaojiang S.Chen,Robert L.Garcea,Ilya Goldberg,GregoryCasini and Stephen C.(2000).
HarrisonStructure of Small Virus-like Particles Assembled from theL1Protein of Human Papillomavirus 16.Molecular Cell,Vol.5,557–567.BrookeBishop,Jhimli Dasgupta,Michael Klein,Robert L.Garcea,Neil D.Christensen,RuiZhaoand Xiaojiang S.Chen.(2007).Crystal Structures of Four Types of HumanPapillomavirus L1Capsid Proteins.THE JOURNAL OF BIOLOGICAL CHEMISTRY VOL.282,NO.43,pp.31803–31811),说明HPV L1-VLP的抗原性和免疫原性来源于或取决于L1组成的五聚体。因此,重组L1蛋白五聚体与VLP一样具备完整的抗原表位,两者均可以作为抗原用来制备疫苗。
HPV疫苗研制的关键是能够大量高效制备HPV L1蛋白。目前较为常用的表达系统可以分为真核表达系统及原核表达系统。常用的真核表达系统有痘病毒表达系统、昆虫杆状病毒表达系统、酵母表达系统。在真核表达系统中所表达的HPV L1蛋白天然构象破坏少,能自发的形成VLP,往往只需进行简单的纯化即可获得VLP。但是由于真核表达系统的表达量低,培养成本高,给大规模工业化生产带来了极大困难。原核表达系统中利用大肠杆菌表达系统表达HPVL1蛋白已有报道。但是由于大肠杆菌所表达的HPV L1蛋白大多失去其天然构象,不能产生针对HPV的保护抗体。或者上述蛋白虽然通过包含体纯化,复性等步骤也可得到HPV的VLP,但是在复性过程中蛋白损失量大,得率低,难以在大规模生产上应用。HPVL1全长序列蛋白虽然也可以在大肠杆菌中以正确构象可溶性地表达,溶解于菌体的裂解上清中,但是表达量较低,而且上清中杂蛋白种类多且量大,要从中纯化出目的蛋白难度相当大,依然无法应用于大规模生产。
因此,本领域仍然需要成本低、纯度高、产量高、效果好的HPV L1蛋白生产技术和大规模工业化生产预防宫颈癌疫苗的新方法。
发明内容
本发明的目的是提供一种新的HPV58L1蛋白,及由其组成的五聚体蛋白颗粒及含该五聚体蛋白颗粒的疫苗。
本发明涉及提供一种新的编码重组的HPV58L1蛋白的多核苷酸基因片段、包含该基因片段的载体、包括载体的宿主细胞,以及由该基因片段翻译表达的HPV58L1蛋白五聚体和由该五聚体组成的抗HPV58型感染的疫苗。
本发明公开了一种重组的HPV58L1蛋白的氨基酸序列,所述蛋白的氨基酸序列在N端8-15个氨基酸全部或任意部分被2-10个氨基酸序列所取代,其氨基酸序列由G、S、A三者的任一组合方式;H4结构域被2-10个氨基酸序列取代,其氨基酸序列由G、S、A三者的任一组合方式。
本发明涉及的HPV58L1蛋白,进一步是其氨基酸序列C端截短0个、1个、2个至21个氨基酸。
本发明所述的HPV58L1蛋白,优选的被其氨基酸序列N端前8-15个氨基酸被GSGGG、ASASG或GSGAG取代,H4结构域优选的被GGGSG或GAGAS取代。
如实施例所述的HPV58L1蛋白,优选其氨基酸序列在N端前8、10、12或15个氨基酸优选被GSGGG或ASASG氨基酸序列取代。C端优选截短10-21个氨基酸,更优选截短10个、21个氨基酸。
如实施例所述的HPV58L1蛋白,其序列包含序列SEQ ID NO:2、SEQ ID NO:4、SEQID NO:6、SEQ ID NO:8或SEQ ID NO:10。
本发明公开一种编码蛋白的多核苷酸序列。本发明公开一种包含多核苷酸序列的基因的表达载体。本发明公开一种包含表达载体的细胞。
本发明公开一种HPV58L1蛋白五聚体,该蛋白五聚体由五个HPV58L1蛋白单体形成。
本发明公开一种HPV58L1蛋白病毒样颗粒(VLP),该病毒样颗粒为二十面体立体对称结构,由72个L1蛋白的五聚体组成。
本发明公开一种HPV疫苗,该疫苗包括HPV58L1蛋白五聚体或VLP和药用佐剂。
本发明公开一种HPV疫苗的制备方法,该方法为:
A.克隆或合成重组HPV58L1蛋白的基因片段;
B.在大肠杆菌或酵母表达系统中表达重组的HPV58L1蛋白;
C.纯化由HPV58L1蛋白组成的五聚体或进一步组装成VLP;
D.加入药用佐剂制成疫苗。
上述方法中步骤C的纯化优选利用亲和层析色谱纯化HPV58L1融合标签蛋白。
本发明公开HPV疫苗在制备预防和/或治疗包括HPV58感染及导致疾病的药物中的应用。
本发明第一方面提供一种编码重组HPV58L1蛋白的多核苷酸基因片段。
本发明第二方面提供了一种构建的表达载体,其包含本发明第一方面的编码重组HPV58L1蛋白的多核苷酸基因片段。所述载体适合驱动异源DNA在细菌、昆虫或哺乳动物细胞中翻译表达HPV58L1蛋白。在一个实施方案中,所述表达载体优选pGEX-6p-1或pGEX-4T-2。
本发明的第三方面提供了一种构建的工程菌细胞,该细胞包含本发明第一方面的多核苷酸基因片段,或第二方面的表达载体。所述的工程菌宿主细胞可以是细菌细胞,例如大肠杆菌,可以是真核细胞,例如酵母细胞,或者是昆虫细胞。
本发明第四方面提供了一种药用组合物,其包含采用本发明技术生产的表达产物HPV58L1蛋白、赋形剂或载体。
本发明同时提供了制备上述编码重组HPV58L1蛋白的多核苷酸序列、表达载体构建、工程菌细胞转化和药用组合物的方法。
本发明获得HPV58L1蛋白的方法,其包括在表达系统中表达重组的HPV58L1蛋白及其五聚体,然后将含有该重组蛋白的裂解上清进行纯化处理。具体获得HPV58L1蛋白五聚体的方法包括:
1.从临床样本中克隆或人工合成编码HPV58L1全长蛋白基因或截短蛋白基因的片段。
2.在大肠杆菌或酵母表达系统中表达重组的L1蛋白。
3.纯化HPV58L1重组蛋白。
在一个实施方案中,获得重组HPV58L1蛋白的优选方法包括:
1.N端1-15个氨基酸全部或任意部分被GSGGG取代、H4结构域被GGGSG序列取代的HPV58L1重组蛋白。
2.在大肠杆菌或酵母表达系统中表达重组的L1蛋白。
3.纯化HPV58L1重组蛋白。
在优选的实施例1方案中N端截短8个氨基酸被GSGGG取代,H4结构域被GGGSG序列取代,同时C端截短21个氨基酸,SEQ ID NO:2。
在一个优选的实施例2方案中N端截短8个氨基酸被GSGGG取代,H4结构域被GGGSG序列取代,同时C端截短10个氨基酸,SEQ ID NO:4。
在一个优选的实施例3方案中N端截短15个氨基酸被GSGAG取代,H4结构域被GAGSG序列取代,同时C端截短10个氨基酸,SEQ ID NO:6。
在一个优选的实施例4方案中N端截短15个氨基酸被ASASG取代,H4结构域被GGGSG序列取代,同时C端截短21个氨基酸,SEQ ID NO:8。
在一个优选的实施例5方案中N端截短8个氨基酸被GSGGG取代,H4结构域被GGGSG序列取代,同时C端保留不截短,SEQ ID NO:10。
在对比的实施例6方案中N端前8个氨基酸被GSGGG取代,H4结构域不取代,C端保留不截短。
本发明另提供了本发明药用组合物在制备预防或治疗HPV58型感染及导致疾病药物中的应用。
本发明还涉及一种预防宫颈癌或HPV感染的疫苗,其包含本发明重组的HPV58L1五聚体,或者由五聚体组成的VLP或多聚体,包括1、2、3、4、5……200个五聚体。优选该疫苗还包含至少一种选自HPV6L1五聚体,HPV11L1五聚体,HPV16L1五聚体,HPV18L1五聚体,HPV31L1五聚体,HPV35L1五聚体,HPV45L1五聚体,HPV52L1五聚体,HPV58L1五聚体,以及由上述五聚体组成的二价、三价、四价、五价、六价、七价、八价、九价或十价疫苗。该疫苗通常还包含疫苗用赋形剂或载体。
优选地,所述疫苗每剂含有本发明重组的HPV58L1蛋白的量为1μg-200μg,优选5μg-50μg。所述疫苗含有本发明重组的HPV58L1与HPV6L1按照0.5-2:1比例组成的疫苗,重组的HPV58L1与HPV11L1按照0.5-2:1比例组成的疫苗,重组的HPV58L1与HPV16L1按照0.5-2:1比例组成的疫苗,重组的HPV58L1与HPV18L1按照0.5-2:1比例组成的疫苗,重组的HPV58L1与HPV31L1按照0.5-2:1比例组成的疫苗,重组的HPV58L1与HPV33L1按照0.5-2:1比例组成的疫苗,重组的HPV58L1与HPV45L1按照0.5-2:1比例组成的疫苗,重组的HPV58L1与HPV52L1按照0.5-2:1比例组成的疫苗,以及由重组的HPV58L1五聚体与上述各型HPV L1五聚体组成的二价、三价、四价、五价、六价、七价疫苗或在此基础上进一步组合形成的九价或十价疫苗。
本发明还涉及一种制备用于预防宫颈癌或HPV感染疫苗的方法,其包含本发明重组的HPV58L1五聚体,或者由五聚体组成的多聚体与任选的一种或多种选自HPV58,6,11,16,18,31,33,45和52的HPV型别的五聚体及疫苗用载体或者赋形剂混合。
本发明进一步涉及包含本发明重组的HPV58L1五聚体,VLP或者由五聚体组成的多聚体在制备用于预防宫颈癌或HPV58感染疫苗中的用途。
本发明中相关术语的说明及解释
根据本发明,术语“大肠杆菌表达系统”是指由大肠杆菌(菌株)与表达载体组成,其中大肠杆菌(菌株)来源于市场上可得到的,在此举例但不限于:GI698,ER2566,BL21(DE3),B834(DE3),BLR(DE3)等。
根据本发明,术语“载体”一词指的是,可将某编码蛋白的多核苷酸插入其中并使蛋白获得表达的一种核酸运载工具。载体可以通过转化,转导或者转染宿主细胞,使其携带的遗传物质元件在宿主细胞中获得表达。举例来说,载体包括:质粒,噬菌体,柯斯质粒等。
根据本发明,术语“疫苗用赋形剂或载体”是指选自一种或多种,包括但不限于:pH调节剂,表面活性剂,佐剂,离子强度增强剂。例如,pH调节剂举例但不限于磷酸盐缓冲液,表面活性剂包括阳离子,阴离子或者非离子型表面活性剂。举例但不限于:Tween-80。佐剂举例但不限于氢氧化铝,磷酸铝、无定型羟基磷酸硫酸铝,氟氏完全佐剂。离子强度增强剂举例但不限于氯化钠。
根据本发明,术语“色谱层析”包括但不限于:离子交换色谱、疏水相互作用色谱、吸附层析法(例如羟基磷灰石色谱)、凝胶过滤(凝胶排阻)层析、亲和层析法。
根据本发明,术语“HPV58L1H4结构域”是指HPV58L1 DNA序列中“FG LTPPPSASLQDTYRFVTSQA ITCQKTAPPK E”,在Genebank登录号为AEI61781的L1蛋白序列中第429位氨基酸至461位氨基酸,或其它HPV58L1序列中与之对应的氨基酸位置。
根据本发明,在本发明获得的重组HPV58L1蛋白的方法中,缓冲液是指可在一定范围内维持pH值稳定的溶液,包括但不限于,Tris缓冲液,磷酸盐缓冲液,HEPES缓冲液,MOPS缓冲液等等。
根据本发明,所述原核宿主细胞破碎包括但不限于通过匀浆器破碎、均质机破碎、超声波处理、研磨、高压挤压、溶菌酶处理中的一项或者多项方法来实现;
根据本发明,在本发明获得的重组HPV58L1蛋白的方法中,所用的盐包括但不限于是中性盐,特别是碱金属盐、铵盐、盐酸盐、硫酸盐,硫酸盐,碳酸氢盐,磷酸盐或磷酸氢盐,特别是NaCI、KCI、NH4CI、(NH4)2S04中的一种或几种。优选NaCI。所用的还原剂包括但不限于DTT,2-巯基乙醇。所用量包括但不限于lOmM-lOOmM。
根据本发明,所述的疫苗可采用患者可接受的形式,包括但不限于注射或鼻腔或口腔吸入或者阴道给药,优选注射剂。
根据本发明,术语“价”是指组成疫苗的组分所包含的基因型的数量。举例来说HPV16和18型抗原组成的疫苗称为“二价”疫苗。
本发明人经研究发现,经过对HPV58L1蛋白N端、C端和H4区域的基因重组,再利用大肠杆菌表达系统进行表达即可获得大量的重组GST-HPV58L1五聚体融合蛋白,该GST-HPV58L1五聚体蛋白经亲和层析纯化后可得到高产率的HPV58L1五聚体蛋白,纯度至少85%以上。进一步纯化后的HPV58L1五聚体蛋白可达到98%以上的纯度并可诱导针对HPV58保护性抗体。本发明基于以上发明现已完成,为大规模工业化生产预防宫颈癌的疫苗提供了一种新方法。
本发明中在N端增加GSGGG或GSGAG,并且与GST蛋白相融合,可大大提高L1蛋白的可溶性,以及提高酶切效率,降低纯化成本,同时利用蛋白酶酶解方法切除GST蛋白,去除了外源杂质的引入;用GSGGG或GAGAS取代H4结构域,可以阻止L1蛋白聚合,从而得到均一、稳定的五聚体或VLP蛋白,例如通过实施例12的实验,发现不同氨基酸序列组成的蛋白产物稳定性不同。其中序列H4结构域经过取代的蛋白产物与H4结构域未经取代的蛋白产物相比更稳定;C端截短氨基酸能够有效避免C端降解、减少由于蛋白降解而导致的产物不纯,从而影响蛋白疫苗的稳定性。
本发明序列对H4结构域改造所得的HPV58L1蛋白形成五聚体,VLP或者由五聚体组成的多聚体,具有良好的免疫原性,可以诱导高滴度的针对HPV58的中和抗体,预防HPV58对人体的感染,是一种良好的疫苗形式。此外,本发明中采用的重组HPV58L1蛋白在保留全长HPV58L1蛋白的抗原性及五聚体颗粒组装能力的同时,易于在真核表达系统和原核表达系统中表达,增加蛋白的溶解性、提高产量、降低生产成本,可应用于大规模工业化生产。
在参考下列详述和附图后,本发明的这些和其它方面的有益效果将是显然的。此处公开的所有参考文献在此均完整引用作为参考。
附图说明
图1实施例制备的HPV58L1五聚体透射电镜观察(100,000倍)结果;结果显示,视野中可见直径为13nm左右的五聚体,颗粒大小与理论大小相符,均匀一致。
图2-A按照实施例1制备的五聚体的动态光散射观测结果,结果显示五聚体的粒径与粒度分布图。
图2-B按照实施例6制备的HPV58L1五聚体的动态光散射观测结果,结果显示五聚体的粒径与粒度分布图。
图3实施例1HPV58L1五聚体蛋白的高压液相分子筛色谱图,图中显示经高度纯化的五聚体蛋白纯度。
图4实施例制备的HPV58L1五聚体疫苗接种小鼠后,在第二次加强免疫3周后,检测中和抗体的平均滴度水平。
下面结合实施例对本发明进一步举例描述。这些实施例是非限制性的。
实施例l:具有HPV58L1序列2的工程菌的构建
1、HPV58L1基因全长由金唯智生物科技有限公司(GENEWIZ)合成,其核苷酸序列SEQ ID NO:1,其源自GeneBank,序列号为GenBank:M14119.1。
2、含有SEQ ID NO:1基因片段做PCR反应的模板。以正向引物序列:5’-CGCGGATCCGGAAAGGAAGATC CATTAAATAA A-3’;在H4结构的5’端引入限制性内切酶AccIII位点,AccIII位点序列为TCCGGA。下游包含XhoI内切酶位点,反向引物序列:
5’-GCTCTCCTCGAGTTA GGGCTTTGCT TTAAGGCCTG A-3’,其5’端引入限制性内切酶XhoI位点,XhoI位点序列为CTCGAG。经PCR反应扩增得到HPV58B。
3、含有SEQ ID NO:1基因片段做PCR反应的模板。以含有引入的限制性内切酶BamHI位点,BamH I位点序列为GGATCC,正向引物序列:5’-CGCGGAGGATCC GGA GGAGGACTAGTTATTTTATTTTGCGTC-3’;在H4结构的3’端引入限制性内切酶AccIII位点,AccIII位点序列为TCCGGA,反向引物序列:GCTCTCTCCGGA TCC TCC TCCTTGCCAGTCC TCCAAAATAT C;经PCR反应扩增扩增得到HPV58A。
4、HPV58A片段与HPV58B共同用限制性内切酶AccIII酶切,形成特异粘性末端,使用T4DNA连接酶连接HPV58A和HPV58B片段,使之形成一个删除了H4结构域的、原H4结构域被编码连接多肽GAGSG的核苷碱基序列取代的HPV58C。
5、表达质粒pGEX-4T-2用BamH I和XhoI酶切消化,再用T4连接酶连接HPV58c和表达质粒。BamH I/XhoI酶切鉴定得到插入L1蛋白基因片段的阳性表达克隆pGEX-4T-2-HPV58C。利用M13(+)/(-)引物,测得质粒中插入的目的核苷酸序列正确,其编码的氨基酸序列为SEQ ID NO:2。
6、连接产物经电转化或CaCl2法将重组质粒转化至大肠杆菌中,优选转化大肠杆菌BL21宿主细胞。将转化的BL21细胞涂于含有氨苄青霉素的LB平板培养基上,经37℃培养,挑取单克隆菌落接种于LB液体培养基中37℃培养12小时,从中取1ml菌液制备甘油管菌种于-70℃保存。
上述各步骤中PCR反应体系包含20μg DNA模板、lx PCR缓冲液、正向和反向特异引物(浓度均为0.2μΜ)、1.5mM镁离子、1.0单位的Taq DNA聚合酶;反应条件为:摄氏95℃变性5分钟,经36个PCR循环放大,每一循环为94℃30秒,55℃30秒,72℃2分钟,反应产物在72℃下温育10分钟,然后停止反应。
实施例2:具有HPV58L1序列4的工程菌的构建
1、HPV58L1基因全长的目标基因片断购自北京安贞医院妇科门诊含有野生型HPV58病毒的临床细胞样本废弃物,其核苷酸序列为SEQ ID NO:3(GenBank:FN870689.1)。
2、含有SEQ ID NO:3基因片段做PCR反应的模板。以正向引物序列:5’-CGCGGATCCGGAAAGGAAGATC CATTAAATAA A-3’;在H4结构的5’端引入限制性内切酶AccIII位点,AccIII位点序列为TCCGGA。下游包含XhoI内切酶位点,反向引物序列:
5’-GCTCTCCTCGAGTTATGCACGGGTA GTAGGGGCTG A-3’,其5’端引入限制性内切酶XhoI位点,XhoI位点序列为CTCGAG。经PCR反应,扩增得到HPV58B。
3、含有SEQ ID NO:3基因片段做PCR反应的模板。以含有引入的限制性内切酶BamHI位点正向引物序列:5’-CGCGGAGGATCC GGA GGA GGA CTAGTTATTTTATTTTGCGTC-3’;在H4结构的3’端引入限制性内切酶AccIII位点,AccIII位点序列为TCCGGA,反向引物序列:
GCTCTCTCCGGA TCC TCC TCC TTGCCAGTCC TCCAAAATAT C;经PCR反应,扩增得到HPV58A。
4、HPV58A片段与HPV58B共同用限制性内切酶AccIII酶切,形成特异粘性末端,使用T4DNA连接酶连接HPV58A和HPV58B片段,使之形成一个删除了H4结构域的、原H4结构域被编码连接多肽GGGSG的核苷碱基序列取代的HPV58C。
5、表达质粒pGEX-4T-2用BamH I和XhoI酶切消化,再用T4连接酶连接HPV58c和表达质粒。BamH I/XhoI酶切鉴定得到插入L1蛋白基因片段的阳性表达克隆pGEX-4T-2-HPV58C。利用M13(+)/(-)引物,测得质粒中插入的目的核苷酸序列正确,其编码的氨基酸序列为SEQ ID NO:4。
6、连接产物经电转化或CaCl2法将重组质粒转化至大肠杆菌中,优选转化大肠杆菌BL21宿主细胞。将转化的BL21细胞涂于含有氨苄青霉素的LB平板培养基上,经37℃培养,挑取单克隆菌落接种于LB液体培养基中37℃培养12小时,从中取1ml菌液制备甘油管菌种于-70℃保存。
上述各步骤中PCR反应体系包含20μg DNA模板、lx PCR缓冲液、正向和反向特异引物(浓度均为0.2μΜ)、1.5mM镁离子、1.0单位的Taq DNA聚合酶;反应条件为:摄氏95℃变性5分钟,经36个PCR循环放大,每一循环为94℃30秒,55℃30秒,72℃2分钟,反应产物在72℃下温育10分钟,然后停止反应。
实施例3:具有HPV58L1序列6的工程菌的构建
1、HPV58L1基因全长由金唯智生物科技有限公司(GENEWIZ)合成,其核苷酸序列SEQ ID NO:5,其源自GeneBank,序列号为GenBank:FN870694.1。
2、含有SEQ ID NO:3基因片段做PCR反应的模板。以正向引物序列:5’-CGCGGATCCGGAAAGGAAGATC CATTAAATAAA-3’;在H4结构的5’端引入限制性内切酶AccIII位点,AccIII位点序列为TCCGGA。下游包含XhoI内切酶位点,反向引物序列:
5’-GCTCTCCTCGAGTTATGCACGGGTA GTAGGGGCCG A-3’,其5’端引入限制性内切酶XhoI位点,XhoI位点序列为CTCGAG。PCR反应,扩增得到HPV58B。
3、含有SEQ ID NO:3基因片段做PCR反应的模板。以含有引入的限制性内切酶BamHI位点正向引物序列:5’-CGCGGAGGATCC GGA GCC GGA GCAGACGTAAACGTTTTCCAT-3’;在H4结构的3’端引入限制性内切酶AccIII位点,AccIII位点序列为TCCGGA,反向引物序列:
5’-GCTCTCTCCGGA TCC GGC TCC TTGCCAGTCC TCCAAAATAT C-3’;经PCR反应,扩增得到HPV58A。
4、HPV58A片段与HPV58B共同用限制性内切酶AccIII酶切,形成特异粘性末端,使用T4DNA连接酶连接HPV58A和HPV58B片段,使之形成一个删除了H4结构域的、原来的H4结构域被编码连接多肽GAGSG的核苷碱基序列取代的HPV58C。
5、表达质粒pGEX-4T-2用BamH I和XhoI酶切消化,再用T4连接酶连接HPV58c和表达质粒。BamH I/XhoI酶切鉴定得到插入L1蛋白基因片段的阳性表达克隆pGEX-4T-2-HPV58C。利用M13(+)/(-)引物,测得质粒中插入的目的核苷酸序列正确,其编码的氨基酸序列为SEQ ID NO:6。
其余操作步骤参照实施例1的方法。
实施例4:具有HPV58L1序列8的工程菌的构建
1、HPV58L1基因全长由金唯智生物科技有限公司(GENEWIZ)合成,其核苷酸序列SEQ ID NO:7,其源自GeneBank,序列号为GenBank:HE574702.1。
2、含有SEQ ID NO:7基因片段做PCR反应的模板。以正向引物序列:5’-CGCGGATCCGGAAAGGAAGATC CATTAAATAA A-3’;在H4结构的5’端引入限制性内切酶AccIII位点,AccIII位点序列为TCCGGA。下游包含XhoI内切酶位点,反向引物序列:
5’-GCTCTCCTCGAGTTAGGGCTTTGCT TTAAGGCCTG A-3’,其5’端引入限制性内切酶XhoI位点,XhoI位点序列为CTCGAG。经PCR反应,扩增得到HPV58B。
3、含有SEQ ID NO:7基因片段做PCR反应的模板。以含有引入的限制性内切酶NheI位点正向引物序列:5’-CGCGGAGCTAGCGCC TCC GGAGCAGACGTAAACGTTTTCCAT-3’;在H4结构的3’端引入限制性内切酶AccIII位点,AccIII位点序列为TCCGGA,反向引物序列:
5’-GCTCTCTCCGGA TCC TCC TCC TTGCCAGTCC TCCAAAATAT T-3’;经PCR反应,扩增得到HPV58A。
4、HPV58A片段与HPV58B共同用限制性内切酶AccIII酶切,形成特异粘性末端,使用T4DNA连接酶连接HPV58A和HPV58B片段,使之形成一个删除了H4结构域的、原来的H4结构域被编码连接多肽GGGSG的核苷碱基序列取代的HPV58C。
5、表达质粒pGEX-4T-2用BamH I和XhoI酶切消化,再用T4连接酶连接HPV58c和表达质粒。BamH I/XhoI酶切鉴定得到插入L1蛋白基因片段的阳性表达克隆pGEX-4T-2-HPV58C。利用M13(+)/(-)引物,测得质粒中插入的目的核苷酸序列正确,其编码的氨基酸序列为SEQ ID NO:8。
其余操作步骤参照实施例1的方法。
实施例5:具有HPV58L1序列10的工程菌的构建
1、HPV58L1基因全长由金唯智生物科技有限公司(GENEWIZ)合成,其核苷酸序列SEQ ID NO:9,其源自GeneBank,序列号为GenBank:AF335603.1。
2、含有SEQ ID NO:9基因片段做PCR反应的模板。以正向引物序列:5’-CGCGGATCCGGAAAGGAAGATC CATTAAATAA A-3’;在H4结构的5’端引入限制性内切酶AccIII位点,AccIII位点序列为TCCGGA。下游包含XhoI内切酶位点,反向引物序列:
5’-GCTCTCCTCGAGTTATTTTTTAACC TTTTTGCGTT T-3’,其5’端引入限制性内切酶XhoI位点,XhoI位点序列为CTCGAG。经PCR反应,扩增得到HPV58B。
3、含有SEQ ID NO:9基因片段做PCR反应的模板。以含有引入的限制性内切酶BamHI位点正向引物序列:5’-CGCGGAGGATCCGGA GGA GGACTAGCTATTTTATTTTGCGTC-3’;在H4结构的3’端引入限制性内切酶AccIII位点,AccIII位点序列为TCCGGA,反向引物序列:
5’-GCTCTCTCCGGA TCC TCC TCCTTGCCAGTCC TCCAAAATAT T-3’;经PCR反应,扩增得到HPV58A。
4、HPV58A片段与HPV58B共同用限制性内切酶AccIII酶切,形成特异粘性末端,使用T4DNA连接酶连接HPV58A和HPV58B片段,使之形成一个删除了H4结构域的、原来的H4结构域被编码连接多肽GGGSG的核苷碱基序列取代的HPV58C。
5、表达质粒pGEX-4T-2用BamH I和XhoI酶切消化,再用T4连接酶连接HPV58c和表达质粒。BamH I/XhoI酶切鉴定得到插入L1蛋白基因片段的阳性表达克隆pGEX-4T-2-HPV58C。利用M13(+)/(-)引物,测得质粒中插入的目的核苷酸序列正确,其编码的氨基酸序列为SEQ ID NO:10。
其余操作步骤参照实施例1的方法。
实施例6:具有HPV58L1序列11的工程菌的构建
以合成的含有SEQ ID NO:1基因片段做PCR反应的模板,N端前8个氨基酸被GSGGG取代,C端不截短氨基酸,按照上述实施例1的方法PCR扩增,得到其目的氨基酸序列为SEQID NO:11。
实施例7:重组HPV58L1蛋白的大量表达与纯化
蛋白表达:在-70℃中分别取出实施例1-6的冻存菌种,平板活化,37℃培养14-20h,挑菌苔于80mL种子培养基中,37℃培养10-12h;然后接种于50L种子罐在37℃下培养10-12h;之后接种于500L发酵罐中,发酵培养、诱导表达;诱导表达结束后离心收集菌体。用pH7.4磷酸盐缓冲液重悬菌体,之后破碎,破碎方法可用但不限于:高压匀浆、超声波破碎或溶菌酶溶解等化学或物理手段。离心,获得上清液。可采用Lowry法检测总蛋白量,用Elisa法检测L1含量。
蛋白纯化:上清液可用但不限于盐析、等电点沉淀、离子交换层析、亲和层析、分子筛等分离纯化方法,得到纯度98%的HPV58L1的五聚体。用电镜观察纯化产物,直径均为10nm左右的五聚体蛋白。
纯化方法具体步骤为将上清溶液中的L1蛋白经亲和色谱纯化:预装谷胱甘肽-琼脂糖树脂(GE公司生产的Glutathione Sepharose 4B)色谱柱。取浓度为50%的Glutathione Sepharose 4B树脂匀浆放入色谱柱中(每200ml蛋白清液需要5-10ml树脂匀浆)。用5-10倍的柱床体积的缓冲液A(组分为:50mmol/L的Tric-HCl,200mmol/的NaCl,1mmol/L的EDTA,pH8.0)洗涤树脂,然后将蛋白清液加入色谱柱中,与树脂混合均匀并在室温下作用20分钟后放出滤过液,用10倍柱床体积的缓冲液A洗涤树脂柱。将精确蛋白酶(Prescission Protease,简称PP酶)用缓冲液A稀释,上样并柱上循环酶切120min。放出酶切液,用适量的缓冲液A洗脱并收集目的蛋白。
离子交换色谱纯化:将上述收集的目的蛋白用Source Q或Mono Q(GE公司)阴离子交换柱进行离子交换层析,收集目的蛋白。
分子筛色谱纯化:将离子交换色谱收集的目的蛋白用分子量在10-600kDa的凝胶过滤介质进行分子筛层析,最终获得纯度大于98%的高纯度HPV58L1五聚体蛋白。
实施例8:HPV58L1五聚体的形态学检测
将实施例1-6构建的工程菌株送上海生工公司测序,测得质粒中插入的目的DNA序列,其编码的氨基酸序列为SEQID NO:2、SEQID NO:4、SEQID NO:6、SEQID NO:8、SEQID NO:10、SEQID NO:11所示,其中实施例1-5结果表明原来全长序列中H4结构域不复存在,取而代之的是编码连接多肽GGGSG或GAGSG序列的“GGA GGA GGA TCC GGA”或“GGA GCC GGA TCCGGA”核苷酸序列。
将实施例1-6方法获得的工程菌株采用实施例7的方法表达纯化得到的产物HPV58L1五聚体蛋白,透射电镜观察(100,000倍),结果显示,视野中可见直径为13nm左右的五聚体,颗粒大小与理论大小相符,均匀一致。其中实施1序列所得样品的电镜照片见附图1。
进行颗粒粒径测定。仪器为马尔文Zetasizer NanoZS的动态光散射粒径仪,使用算法为Regulation算法。样品经0.22u m滤膜过滤后进行测量,结果见表1。结果表明,六种五聚体平均粒径基本一致12.89-14.48nm,但分散性指数PdI(表明蛋白的均一性)存在显著差异,其中实施例1-5样品的分散性指数较小,说明样品很均一。实施例6样品的分散性指数较大,说明样品不均一。
表1 五聚体平均粒径和分散性指数
其中实施例1所制备的HPV58L1五聚体颗粒的水化分子动力学平均粒径为12.53nm,分散性指数PdI为0.034(具体见附图2-A);实施例6所制备的HPV58L1五聚体颗粒的水化分子动力学平均粒径为14.19nm,分散性指数PdI为0.168(具体见附图2-B)。
实施例9:蛋白原液纯度检测
色谱柱为TSK-GEL G3000SWxl,或相同填料,分离范围10KDa-500KDa的色谱柱;以pH6.8的0.1mol/l磷酸盐缓冲液(称取磷酸氢二钠25.8g,磷酸二氢钠4.37g,加超纯水使溶解,用磷酸调pH至6.8,超纯水定容成1000ml)为流动相;流速为1ml/min;检测波长280nm;柱温25℃,上样量不得小于20ug,样品主峰理论塔板数不低于1000,拖尾因子小于2.0,连续进样5针,峰面积的相对标准偏差不得大于3%。
取实施例7所得的HPV蛋白原液,分别稀释浓度为1mg/ml,上样量20ul注入高压液相色谱仪,按照上述方法检测,按面积百分比计算纯度,结果见下表2及附图3(实施1所制备的HPV58L1五聚体的分子筛层析色谱图),所有处理蛋白纯度均大于98%。
表2 纯化后蛋白原液的纯度
实施例10:含有HPV58L1五聚体蛋白疫苗的制备
分别将实施例1-6制备后经实施例7步骤纯化的HPV58L1五聚体蛋白原液用pH7.2的硼酸缓冲盐稀释成100μg/ml的蛋白液,取1ml稀释后的蛋白液加入50μg/ml氢氧化铝佐剂,充分混合吸附2小时,即获得40μg/ml的HPV58L1五聚体蛋白疫苗,于4℃避光保存。
实施例11:HPV58L1五聚体疫苗的免疫原性测定
小鼠的免疫原性:将实施例10制备的HPV58L1五聚体蛋白疫苗用疫苗稀释剂分别稀释成表1所示剂量,以每只0.5mL腹腔注射BALB/C小鼠,每个处理组10只。每3周免疫一次,共免疫2次。每次免疫后三周分别采取每只小鼠血清,采用假病毒细胞中和实验法分别测定每次免疫后的小鼠血清中针对HPV58的中和抗体滴度。结果如表3所示,在第二次加强免疫3周后,检测中和抗体的平均滴度水平如附图4所示。
表3假病毒细胞中和实验法检测HPV58L1五聚体蛋白中和抗体水平
结果表明,实施例制备的HPV58L1五聚体蛋白接种小鼠,第一次免疫3周后即可产生中和抗体;二次免疫后的中和抗体即可达到很高的水平,分别如表所示。实验结果证明,各个实施例所得样品制备的HPV L1五聚体蛋白疫苗可以在动物体内产生中和抗体。说明本发明制备的HPV L1五聚体蛋白疫苗在人体临床试验中具有免疫原性,即能预防或治疗HPV58病毒引起的疾病。
实施例12:HPV58L1蛋白产率比较
将实施1-6方法构建的工程菌参照实施例7的方法,制备HPV58L1蛋白,计算表达量(用Lowry法检测总蛋白量,用Elisa法检测L1含量),之后放置6-20小时比较蛋白的稳定性,结果如表4所示。
表4 蛋白含量及稳定性实验
结果表明,不同氨基酸序列组成的蛋白产物稳定性不同。其中SEQ ID NO:2、SEQID NO:4、SEQ ID NO:6、SEQ ID NO:8和SEQ ID NO:10,比H4结构域未改造的SEQ ID NO:11制备HPV58L1蛋白纯化产物,放置6-15小时比较蛋白的稳定性,含有SEQ ID NO:11序列的蛋白纯化产物在6小时即出现沉淀,含有SEQ ID NO:2、SEQ ID NO:4、SEQ ID NO:6SEQ ID NO:8和SEQ ID NO:10序列的蛋白纯化产物在15小时观察未出现沉淀,依然保持稳定。
SEQUENCE LISTING
<110> 北京康乐卫士生物技术股份有限公司
<120> 重组HPV-58型L1的VLP疫苗及其制备方法
<130> 2014
<160> 11
<170> PatentIn version 3.3
<210> 1
<211> 1575
<212> DNA
<213> 人工序列
<400> 1
atggtgctga ttttatgttg caccctagtt attttatttt gcgtcgcaga cgtaaacgtt 60
ttccatattt ttttgcagat gtccgtgtgg cggcctagtg aggccactgt gtacctgcct 120
cctgtgcctg tgtctaaggt tgtaagcact gatgaatatg tgtcacgcac aagcatttat 180
tattatgctg gcagttccag acttttggct gttggcaatc catatttttc catcaaaagt 240
cccaataaca ataaaaaagt attagttccc aaggtatcag gcttacagta tagggtgttt 300
agggtgcgtt tacctgatcc caataaattt ggttttcctg atacatcttt ttataaccct 360
gatacacaac gtttagtctg ggcatgtgta ggccttgaaa taggtagggg acagccattg 420
ggtgttggca taagtggtca tccttattta aataaatttg atgacactga aactggtaac 480
agatataccg cacagccagg gtctgataac agggaatgct tatctatgga ttataaacaa 540
acacaattat gtttaattgg ctgtaaacct cccactggtg agcattgggg taaaggtgtt 600
gcctgtaaca ataatgcagc tgctactgat tgtcctccat tggaactttt taattctatt 660
attgaggatg gtgacatggt agatacaggg tttggatgca tggactttgg tacattgcag 720
gctaataaaa gtgatgtgcc tattgatatt tgtaacagta catgcaaata tccagattat 780
ttaaaaatgg ccagtgaacc ttatggggat agtttgttct tttttcttag acgtgagcaa 840
atgtttgtta gacacttttt taatagggct ggaacccttg gcgagcctgt cccgaatgac 900
ctttatatta aagggtccgg taatactgca ggtatccaaa gtagtgcatt ttttccaact 960
cctagtggct ccatagttac ctcagaatca caactattta ataagcctta ttggctacag1020
cgtgcacaag gtcataacaa tgacatttgc tggggcaatc agttatttgt taccgtggtt1080
gataccactc gtagcactaa tatgacatta tgcactgaag taactaaaga agatacatat1140
aaaaataata attttaagga atatgtacgt catgttgaag aatatgactt acagtttgtt1200
tttcagcttt gcaaaattac actaactgca gaggtaatga catatataca tactatgaat1260
tcagatattt tggaggactg gcaatttggt ttaacacctc ctccgtctgc cagtttacag1320
gacacatata gatttgttac ctcccaggct attacttgcc aaaaaacagc accccctaaa1380
gaaaaggaag atccattaaa taaatatact ttttgggagg ttaacttaaa ggaaaagttt1440
tctgcagatc tggatcagtt tcctttggga cgaaagtttt tattacaatc aggccttaaa1500
gcaaagccca gactaaaacg ttcagcccct actacccgtg caccatccac caaacgcaaa1560
aaggttaaaa aataa 1575
<210> 2
<211> 472
<212> PRT
<213> 人工序列
<400> 2
Gly Ser Gly Gly Gly Leu Val Ile Leu Phe Cys Val Ala Asp Val Asn
1 5 10 15
Val Phe His Ile Phe Leu Gln Met Ser Val Trp Arg Pro Ser Glu Ala
20 25 30
Thr Val Tyr Leu Pro Pro Val Pro Val Ser Lys Val Val Ser Thr Asp
35 40 45
Glu Tyr Val Ser Arg Thr Ser Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg
50 55 60
Leu Leu Ala Val Gly Asn Pro Tyr Phe Ser Ile Lys Ser Pro Asn Asn
65 70 75 80
Asn Lys Lys Val Leu Val Pro Lys Val Ser Gly Leu Gln Tyr Arg Val
85 90 95
Phe Arg Val Arg Leu Pro Asp Pro Asn Lys Phe Gly Phe Pro Asp Thr
100 105 110
Ser Phe Tyr Asn Pro Asp Thr Gln Arg Leu Val Trp Ala Cys Val Gly
115 120 125
Leu Glu Ile Gly Arg Gly Gln Pro Leu Gly Val Gly Ile Ser Gly His
130 135 140
Pro Tyr Leu Asn Lys Phe Asp Asp Thr Glu Thr Gly Asn Arg Tyr Thr
145 150 155 160
Ala Gln Pro Gly Ser Asp Asn Arg Glu Cys Leu Ser Met Asp Tyr Lys
165 170 175
Gln Thr Gln Leu Cys Leu Ile Gly Cys Lys Pro Pro Thr Gly Glu His
180 185 190
Trp Gly Lys Gly Val Ala Cys Asn Asn Asn Ala Ala Ala Thr Asp Cys
195 200 205
Pro Pro Leu Glu Leu Phe Asn Ser Ile Ile Glu Asp Gly Asp Met Val
210 215 220
Asp Thr Gly Phe Gly Cys Met Asp Phe Gly Thr Leu Gln Ala Asn Lys
225 230 235 240
Ser Asp Val Pro Ile Asp Ile Cys Asn Ser Thr Cys Lys Tyr Pro Asp
245 250 255
Tyr Leu Lys Met Ala Ser Glu Pro Tyr Gly Asp Ser Leu Phe Phe Phe
260 265 270
Leu Arg Arg Glu Gln Met Phe Val Arg His Phe Phe Asn Arg Ala Gly
275 280 285
Thr Leu Gly Glu Pro Val Pro Asn Asp Leu Tyr Ile Lys Gly Ser Gly
290 295 300
Asn Thr Ala Gly Ile Gln Ser Ser Ala Phe Phe Pro Thr Pro Ser Gly
305 310 315 320
Ser Ile Val Thr Ser Glu Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu
325 330 335
Gln Arg Ala Gln Gly His Asn Asn Asp Ile Cys Trp Gly Asn Gln Leu
340 345 350
Phe Val Thr Val Val Asp Thr Thr Arg Ser Thr Asn Met Thr Leu Cys
355 360 365
Thr Glu Val Thr Lys Glu Asp Thr Tyr Lys Asn Asn Asn Phe Lys Glu
370 375 380
Tyr Val Arg His Val Glu Glu Tyr Asp Leu Gln Phe Val Phe Gln Leu
385 390 395 400
Cys Lys Ile Thr Leu Thr Ala Glu Val Met Thr Tyr Ile His Thr Met
405 410 415
Asn Ser Asp Ile Leu Glu Asp Trp Gln Gly Gly Gly Ser Gly Lys Glu
420 425 430
Asp Pro Leu Asn Lys Tyr Thr Phe Trp Glu Val Asn Leu Lys Glu Lys
435 440 445
Phe Ser Ala Asp Leu Asp Gln Phe Pro Leu Gly Arg Lys Phe Leu Leu
450 455 460
Gln Ser Gly Leu Lys Ala Lys Pro
465 470
<210> 3
<211> 1575
<212> DNA
<213> 人工序列
<400> 3
atggtgctga ttttatgttg caccctagtt attttatttt gcgtcgcaga cgtaaacgtt 60
ttccatattt ttttgcagat gtccgtgtgg cggcctagtg aggccactgt gtacctgcct 120
cctgtgcctg tgtctaaggt tgtaagcact gatgaatatg tgtcacgcac aagcatttat 180
tattatgctg gcagttccag acttttggct gttggcaatc catatttttc catcaaaagt 240
cccaataaca ataaaaaagt attagttccc aaggtatcag gcttacagta tagggtgttt 300
agggtgcgtt tacctgatcc caataaattt ggttttcctg atacatcttt ttataaccct 360
gatacacaac gtttagtctg ggcatgtgta ggccttgaaa taggtagggg acagccattg 420
ggtgttggca taagtggtca tccttattta aataaatttg atgacactga aactggtaac 480
agatataccg cacagccagg gtctgataac agggaatgct tatctatgga ttataaacaa 540
acacaattat gtttaattgg ctgtaaacct cccactggtg agcattgggg taaaggtgtt 600
gcctgtaaca ataatgcagc tgctactgat tgtcctccat tggaactttt taattctatt 660
attgaggatg gtgacatggt agatacaggg tttggatgca tggactttgg tacattgcag 720
gctaataaaa gtgatgtgcc tattgatatt tgtaacagta catgcaaata tccagattat 780
ttaaaaatgg ccagtgaacc ttatggggat agtttgttct tttttcttag acgtgagcaa 840
atgtttgtta gacacttttt taatagggct ggaacccttg gcgagcctgt cccgaatgac 900
ctttatatta aagggtccgg taatactgca ggtatccaaa gtagtgcatt ttttccaact 960
cctagtggct ctatagttac ctcagaatca caattattta ataagcctta ttggctacag1020
cgtgcacaag gtcataacaa tggcatttgc tggggcaatc agttatttgt taccgtggtt1080
gataccactc gtagcactaa tatgacatta tgcactgaag taactaaaga agatacatat1140
aaaaataata attttaagga atatgtacgt catgttgaag aatatgactt acagtttgtt1200
tttcagcttt gcaaaattac actaactgca gaggtaatga catatataca tactatgaat1260
tcagatattt tggaggactg gcaatttggt ttaacacctc ctccgtctgc cagtttacag1320
gacacatata gatttgttac ctcccaggct attacttgcc aaaaaacagc accccctaaa1380
gaaaaggaag atccattaaa taaatatact ttttgggagg ttaacttaaa ggaaaagttt1440
tctgcagatc tggatcagtt tcctttggga cgaaagtttt tattacaatc aggccttaaa1500
gcaaagccca gactaaaacg ttcagcccct actacccgtg caccatccac caaacgcaaa1560
aaggttaaaa aataa 1575
<210> 4
<211> 483
<212> PRT
<213> 人工序列
<400> 4
Gly Ser Gly Gly Gly Leu Val Ile Leu Phe Cys Val Ala Asp Val Asn
1 5 10 15
Val Phe His Ile Phe Leu Gln Met Ser Val Trp Arg Pro Ser Glu Ala
20 25 30
Thr Val Tyr Leu Pro Pro Val Pro Val Ser Lys Val Val Ser Thr Asp
35 40 45
Glu Tyr Val Ser Arg Thr Ser Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg
50 55 60
Leu Leu Ala Val Gly Asn Pro Tyr Phe Ser Ile Lys Ser Pro Asn Asn
65 70 75 80
Asn Lys Lys Val Leu Val Pro Lys Val Ser Gly Leu Gln Tyr Arg Val
85 90 95
Phe Arg Val Arg Leu Pro Asp Pro Asn Lys Phe Gly Phe Pro Asp Thr
100 105 110
Ser Phe Tyr Asn Pro Asp Thr Gln Arg Leu Val Trp Ala Cys Val Gly
115 120 125
Leu Glu Ile Gly Arg Gly Gln Pro Leu Gly Val Gly Ile Ser Gly His
130 135 140
Pro Tyr Leu Asn Lys Phe Asp Asp Thr Glu Thr Gly Asn Arg Tyr Thr
145 150 155 160
Ala Gln Pro Gly Ser Asp Asn Arg Glu Cys Leu Ser Met Asp Tyr Lys
165 170 175
Gln Thr Gln Leu Cys Leu Ile Gly Cys Lys Pro Pro Thr Gly Glu His
180 185 190
Trp Gly Lys Gly Val Ala Cys Asn Asn Asn Ala Ala Ala Thr Asp Cys
195 200 205
Pro Pro Leu Glu Leu Phe Asn Ser Ile Ile Glu Asp Gly Asp Met Val
210 215 220
Asp Thr Gly Phe Gly Cys Met Asp Phe Gly Thr Leu Gln Ala Asn Lys
225 230 235 240
Ser Asp Val Pro Ile Asp Ile Cys Asn Ser Thr Cys Lys Tyr Pro Asp
245 250 255
Tyr Leu Lys Met Ala Ser Glu Pro Tyr Gly Asp Ser Leu Phe Phe Phe
260 265 270
Leu Arg Arg Glu Gln Met Phe Val Arg His Phe Phe Asn Arg Ala Gly
275 280 285
Thr Leu Gly Glu Pro Val Pro Asn Asp Leu Tyr Ile Lys Gly Ser Gly
290 295 300
Asn Thr Ala Gly Ile Gln Ser Ser Ala Phe Phe Pro Thr Pro Ser Gly
305 310 315 320
Ser Ile Val Thr Ser Glu Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu
325 330 335
Gln Arg Ala Gln Gly His Asn Asn Gly Ile Cys Trp Gly Asn Gln Leu
340 345 350
Phe Val Thr Val Val Asp Thr Thr Arg Ser Thr Asn Met Thr Leu Cys
355 360 365
Thr Glu Val Thr Lys Glu Asp Thr Tyr Lys Asn Asn Asn Phe Lys Glu
370 375 380
Tyr Val Arg His Val Glu Glu Tyr Asp Leu Gln Phe Val Phe Gln Leu
385 390 395 400
Cys Lys Ile Thr Leu Thr Ala Glu Val Met Thr Tyr Ile His Thr Met
405 410 415
Asn Ser Asp Ile Leu Glu Asp Trp Gln Gly Gly Gly Ser Gly Lys Glu
420 425 430
Asp Pro Leu Asn Lys Tyr Thr Phe Trp Glu Val Asn Leu Lys Glu Lys
435 440 445
Phe Ser Ala Asp Leu Asp Gln Phe Pro Leu Gly Arg Lys Phe Leu Leu
450 455 460
Gln Ser Gly Leu Lys Ala Lys Pro Arg Leu Lys Arg Ser Ala Pro Thr
465 470 475 480
Thr Arg Ala
<210> 5
<211> 1575
<212> DNA
<213> 人工序列
<400> 5
atggtgctga ttttatgttg caccctagtt attttatttt gcgtcgcaga cgtaaacgtt 60
ttccatattt ttttgcagat gtccgtgtgg cggcctagtg aggccactgt gtacctgcct 120
cctgtgcctg tgtctaaggt tgtaagcact gatgaatatg tgtcacgcac aagcatttat 180
tattatgctg gcagttccag acttttggct gttggcaatc catatttttc catcaaaagt 240
cccaataaca ataagaaagt attagttccc aaggtatcag gcttacagta tagggtcttt 300
agggtgcgtt tacctgatcc caataaattt ggttttcctg atacatcttt ttataaccct 360
gatacacaac gtttggtctg ggcatgtgta ggccttgaaa taggtagggg acagccattg 420
ggtgtaggca taagtggtca tccttattta aataaatttg atgacactga aaccagtaac 480
agatatcccg cacagccagg gtctgataac agggaatgct tatctatgga ttataaacaa 540
acacaattat gtttaattgg ctgtaaacct cccactggtg agcattgggg taaaggtgtt 600
gcctgtaaca ataatgcagc tgctactgat tgtcctccat tggaactttt taattctatt 660
attgaggatg gtgacatggt agatacaggg tttggatgca tggactttgg tacattgcag 720
gctaataaaa gtgatgtgcc tattgatatt tgtaacagta catgcaaata tccagattat 780
ttaaaaatgg ccagtgaacc ttatggggat agtttgttct tttttcttag acgtgagcaa 840
atgtttgtta gacacttttt taatagggct ggaaaacttg gcgaggctgt cccagatgac 900
ctttatatta aagggtccgg taatactgca gttatccaaa gtagtgcatt ttttccaact 960
cctagtggct ctatagttac ctcagaatca caattattta ataagcctta ttggctacag1020
cgtgcacaag gtcataacaa tggcatttgc tggggcaatc agttatttgt taccgtggtt1080
gataccactc gtagcactaa tatgacatta tgcactgaag taaataagga aggtacatat1140
aaaaatgata attttaagga atatgtacgt catgttgaag aatatgactt acagtttgtt1200
tttcagcttt gcaaaattac actaactgca gaggtaatga catatataca tactatgaat1260
tcagatattt tggaggactg gcaatttggt ttaacacctc ctccgtctgc cagtttacag1320
gacacatata gatttgttac ctcccaggct attacttgcc aaaaaacagc accccctaaa1380
gaaaaggaag atccattaaa taaatatact ttttgggagg ttaacttaaa ggaaaagttt1440
tctgcggatc tggatcagtt tcctttggga cgaaagtttt tattacaatc aggccttaaa1500
gcaaagccca gactcaaacg ttcggcccct actacccgtg caccatccac caaacgcaaa1560
aaggttaaaa aataa 1575
<210> 6
<211> 476
<212> PRT
<213> 人工序列
<400> 6
Gly Ser Gly Ala Gly Ala Asp Val Asn Val Phe His Ile Phe Leu Gln
1 5 10 15
Met Ser Val Trp Arg Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val
20 25 30
Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ser Arg Thr Ser
35 40 45
Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg Leu Leu Ala Val Gly Asn Pro
50 55 60
Tyr Phe Ser Ile Lys Ser Pro Asn Asn Asn Lys Lys Val Leu Val Pro
65 70 75 80
Lys Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Val Arg Leu Pro Asp
85 90 95
Pro Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Asp Thr
100 105 110
Gln Arg Leu Val Trp Ala Cys Val Gly Leu Glu Ile Gly Arg Gly Gln
115 120 125
Pro Leu Gly Val Gly Ile Ser Gly His Pro Tyr Leu Asn Lys Phe Asp
130 135 140
Asp Thr Glu Thr Ser Asn Arg Tyr Pro Ala Gln Pro Gly Ser Asp Asn
145 150 155 160
Arg Glu Cys Leu Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Ile
165 170 175
Gly Cys Lys Pro Pro Thr Gly Glu His Trp Gly Lys Gly Val Ala Cys
180 185 190
Asn Asn Asn Ala Ala Ala Thr Asp Cys Pro Pro Leu Glu Leu Phe Asn
195 200 205
Ser Ile Ile Glu Asp Gly Asp Met Val Asp Thr Gly Phe Gly Cys Met
210 215 220
Asp Phe Gly Thr Leu Gln Ala Asn Lys Ser Asp Val Pro Ile Asp Ile
225 230 235 240
Cys Asn Ser Thr Cys Lys Tyr Pro Asp Tyr Leu Lys Met Ala Ser Glu
245 250 255
Pro Tyr Gly Asp Ser Leu Phe Phe Phe Leu Arg Arg Glu Gln Met Phe
260 265 270
Val Arg His Phe Phe Asn Arg Ala Gly Lys Leu Gly Glu Ala Val Pro
275 280 285
Asp Asp Leu Tyr Ile Lys Gly Ser Gly Asn Thr Ala Val Ile Gln Ser
290 295 300
Ser Ala Phe Phe Pro Thr Pro Ser Gly Ser Ile Val Thr Ser Glu Ser
305 310 315 320
Gln Leu Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn
325 330 335
Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp Thr
340 345 350
Thr Arg Ser Thr Asn Met Thr Leu Cys Thr Glu Val Asn Lys Glu Gly
355 360 365
Thr Tyr Lys Asn Asp Asn Phe Lys Glu Tyr Val Arg His Val Glu Glu
370 375 380
Tyr Asp Leu Gln Phe Val Phe Gln Leu Cys Lys Ile Thr Leu Thr Ala
385 390 395 400
Glu Val Met Thr Tyr Ile His Thr Met Asn Ser Asp Ile Leu Glu Asp
405 410 415
Trp Gln Gly Ala Gly Ser Gly Lys Glu Asp Pro Leu Asn Lys Tyr Thr
420 425 430
Phe Trp Glu Val Asn Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln
435 440 445
Phe Pro Leu Gly Arg Lys Phe Leu Leu Gln Ser Gly Leu Lys Ala Lys
450 455 460
Pro Arg Leu Lys Arg Ser Ala Pro Thr Thr Arg Ala
465 470 475
<210> 7
<211> 1575
<212> DNA
<213> 人工序列
<400> 7
atggtgctga ttttatgttg caccctagct attttatttt gcgtcgcaga cgtaaacgtt 60
ttccatattt ttttgcagat gtccgtgtgg cggcctagtg aggccactgt gtacctgcct 120
cctgtgcctg tgtctaaggt tgtaagcact gatgaatatg tgtcacgcac aagcatttat 180
tattatgctg gcagttccag acttttggct gttggcaatc catatttttc catcaaaagt 240
cccaataaca ataaaaaagt attagttccc aaggtatcag gcttacagta tagggtcttt 300
agggtgcgtt tacctgatcc caataaattt ggttttcctg atacatcttt ttataaccct 360
gatacacaac gtttggtctg ggcatgtgta ggccttaaaa taggtagggg acagccattg 420
ggtgttggcg taagtggtca tccttattta aataaatttg atgacactga aaccagtaac 480
agatatcccg cacagccagg gtctgataac agggaatgct tatctatgga ttataaacaa 540
acacaattat gtttaattgg ctgtaaacct cccactggtg agcattgggg taaaggtgtt 600
gcctgtaaca ataatgcagc tgctactgat tgtcctccat tggaactttt taattctatt 660
attgaggatg gtgacatggt agatacaggg tttggatgca tggactttgg tacattgcag 720
gctaataaaa gtgatgtgcc tattgatatt tgtaacagta catgcaaata tccagattat 780
ttaaaaatgg ccagtgaacc ttatggggat agtttgttct tttttcttag acgtgagcag 840
atgtttgtta gacacttttt taatagggct ggaaaacttg gcgaggctgt cccggatgac 900
ctttatatta aagggtccgg taatactgca gttatccaaa gtagtgcatt ttttccaact 960
cctagtggct ctatagttac ctcagaatca caattattta ataagcctta ttggctacag1020
cgtgcacaag gtcataacaa tggcatttgc tggggcaatc agttatttgt taccgtggtt1080
gataccactc gtagcactaa tatgacatta tgcactgaag taactaagga aggtacatat1140
aaaaatgata attttaagga atatgtacgt catgttgaag aatatgactt acagtttgtt1200
tttcagcttt gcaaaattac actaactgca gagataatga catatataca tactatggat1260
tccaatattt tggaggactg gcaatttggt ttaacacctc ctccgtctgc cagtttacag1320
gacacatata gatttgttac ctcccaggct attacttgcc aaaaaacagc accccctaaa1380
gaaaaggaag atccattaaa taaatatact ttttgggagg ttaacttaaa ggaaaagttt1440
tctgcagatc tagatcagtt tcctttggga ccaaagtttt tattacaatc aggccttaaa1500
gcaaagccca gactaaaacg ttcggcccct actacccgtg caccatccac caaacgcaaa1560
aaggttaaaa aataa 1575
<210> 8
<211> 465
<212> PRT
<213> 人工序列
<400> 8
Ala Ser Ala Ser Gly Ala Asp Val Asn Val Phe His Ile Phe Leu Gln
1 5 10 15
Met Ser Val Trp Arg Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val
20 25 30
Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ser Arg Thr Ser
35 40 45
Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg Leu Leu Ala Val Gly Asn Pro
50 55 60
Tyr Phe Ser Ile Lys Ser Pro Asn Asn Asn Lys Lys Val Leu Val Pro
65 70 75 80
Lys Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Val Arg Leu Pro Asp
85 90 95
Pro Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Asp Thr
100 105 110
Gln Arg Leu Val Trp Ala Cys Val Gly Leu Lys Ile Gly Arg Gly Gln
115 120 125
Pro Leu Gly Val Gly Val Ser Gly His Pro Tyr Leu Asn Lys Phe Asp
130 135 140
Asp Thr Glu Thr Ser Asn Arg Tyr Pro Ala Gln Pro Gly Ser Asp Asn
145 150 155 160
Arg Glu Cys Leu Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Ile
165 170 175
Gly Cys Lys Pro Pro Thr Gly Glu His Trp Gly Lys Gly Val Ala Cys
180 185 190
Asn Asn Asn Ala Ala Ala Thr Asp Cys Pro Pro Leu Glu Leu Phe Asn
195 200 205
Ser Ile Ile Glu Asp Gly Asp Met Val Asp Thr Gly Phe Gly Cys Met
210 215 220
Asp Phe Gly Thr Leu Gln Ala Asn Lys Ser Asp Val Pro Ile Asp Ile
225 230 235 240
Cys Asn Ser Thr Cys Lys Tyr Pro Asp Tyr Leu Lys Met Ala Ser Glu
245 250 255
Pro Tyr Gly Asp Ser Leu Phe Phe Phe Leu Arg Arg Glu Gln Met Phe
260 265 270
Val Arg His Phe Phe Asn Arg Ala Gly Lys Leu Gly Glu Ala Val Pro
275 280 285
Asp Asp Leu Tyr Ile Lys Gly Ser Gly Asn Thr Ala Val Ile Gln Ser
290 295 300
Ser Ala Phe Phe Pro Thr Pro Ser Gly Ser Ile Val Thr Ser Glu Ser
305 310 315 320
Gln Leu Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn
325 330 335
Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp Thr
340 345 350
Thr Arg Ser Thr Asn Met Thr Leu Cys Thr Glu Val Thr Lys Glu Gly
355 360 365
Thr Tyr Lys Asn Asp Asn Phe Lys Glu Tyr Val Arg His Val Glu Glu
370 375 380
Tyr Asp Leu Gln Phe Val Phe Gln Leu Cys Lys Ile Thr Leu Thr Ala
385 390 395 400
Glu Ile Met Thr Tyr Ile His Thr Met Asp Ser Asn Ile Leu Glu Asp
405 410 415
Trp Gln Gly Gly Gly Ser Gly Lys Glu Asp Pro Leu Asn Lys Tyr Thr
420 425 430
Phe Trp Glu Val Asn Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln
435 440 445
Phe Pro Leu Gly Pro Lys Phe Leu Leu Gln Ser Gly Leu Lys Ala Lys
450 455 460
Pro
465
<210> 9
<211> 1575
<212> DNA
<213> 人工序列
<400> 9
atggtgctga ttttatgttg caccctagct attttatttt gcgtcgcaga cgtaaacgtt 60
ttccatattt ttttgcagat gtccgtgtgg cggcctagtg aggccactgt gtacctgcct 120
cctgtgcctg tgtctaaggt tgtaagcact gatgaatatg tgtcacgcac aagcatttat 180
tattatgctg gcagttccag acttttggct gttggcaatc catatttttc catcaaaagt 240
cccaataaca ataaaaaagt attagttccc aaggtatcag gcttacagta tagggtcttt 300
agggtgcgtt tacctgatcc caataaattt ggttttcctg atacatcttt ttataaccct 360
gatacacaac gtttggtctg ggcatgtgta ggccttgaaa taggtagggg acagccattg 420
ggtgttggcg taagtggtca tccttgttta aataaatttg atgacactga aaccagtaac 480
agatatcccg cacagccagg gtctgataac agggaatgct tatctatgga ttataaacaa 540
acacaattat gtttaattgg ctgtaaacct cccactggtg agcattgggg taaaggtgtt 600
gcctgtaaca ataatgcagc tgctactgat tgtcctccat tggaactttt taattctatt 660
attgaggatg gtgacatggt agatacaggg tttggatgca tggactttgg tacattgcag 720
gctaataaaa gtgatgtgcc tattgatatt tgtaacagta catgcaaata tccagattat 780
ttaaaaatgg ccagtgaacc ttatggggat agtttgttct tttttcttag acgtgagcag 840
atgtttgtta gacacttttt taatagggct ggaaaacttg gcgaggctgt cccggatgac 900
ctttatatta aagggtccgg taatactgca gttatccaaa gtagtgcatt ttttccaact 960
cctagtggct ctatagttac ctcagaatca caattattta ataagcctta ttggctacag1020
cgtgcacaag gtcataacaa tggcatttgc tggggcaatc agttatttgt taccgtggtt1080
gataccactc gtagcactaa tatgacatta tgcactgaag taactaagga aggtacatat1140
aaaaatgata attttaagga atatgtacgt catgttgaag agtatgactt acagtttgtt1200
tttcagcttt gcaaaattac actaactgca gagataatga catatataca tactatggat1260
tccaatattt tggaggactg gcaatttggt ttaacacctc ctccgtctgc cagtttacag1320
gacacatata gatttgttac ctcccaggct attacttgcc aaaaaacagc accccctaaa1380
gaaaaggaag atccattaaa taaatatact ttttgggagg ttaacttaaa ggaaaagttt1440
tctgcagatc tagatcagtt tcctttggga cgaaagtttt tattacaatc aggccttaaa1500
gcaaagccca gactaaaacg ttcggcccct actacccgtg caccatccac caaacgcaaa1560
aaggttaaaa aataa 1575
<210> 10
<211> 493
<212> PRT
<213> 人工序列
<400> 10
Gly Ser Gly Gly Gly Leu Ala Ile Leu Phe Cys Val Ala Asp Val Asn
1 5 10 15
Val Phe His Ile Phe Leu Gln Met Ser Val Trp Arg Pro Ser Glu Ala
20 25 30
Thr Val Tyr Leu Pro Pro Val Pro Val Ser Lys Val Val Ser Thr Asp
35 40 45
Glu Tyr Val Ser Arg Thr Ser Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg
50 55 60
Leu Leu Ala Val Gly Asn Pro Tyr Phe Ser Ile Lys Ser Pro Asn Asn
65 70 75 80
Asn Lys Lys Val Leu Val Pro Lys Val Ser Gly Leu Gln Tyr Arg Val
85 90 95
Phe Arg Val Arg Leu Pro Asp Pro Asn Lys Phe Gly Phe Pro Asp Thr
100 105 110
Ser Phe Tyr Asn Pro Asp Thr Gln Arg Leu Val Trp Ala Cys Val Gly
115 120 125
Leu Glu Ile Gly Arg Gly Gln Pro Leu Gly Val Gly Val Ser Gly His
130 135 140
Pro Cys Leu Asn Lys Phe Asp Asp Thr Glu Thr Ser Asn Arg Tyr Pro
145 150 155 160
Ala Gln Pro Gly Ser Asp Asn Arg Glu Cys Leu Ser Met Asp Tyr Lys
165 170 175
Gln Thr Gln Leu Cys Leu Ile Gly Cys Lys Pro Pro Thr Gly Glu His
180 185 190
Trp Gly Lys Gly Val Ala Cys Asn Asn Asn Ala Ala Ala Thr Asp Cys
195 200 205
Pro Pro Leu Glu Leu Phe Asn Ser Ile Ile Glu Asp Gly Asp Met Val
210 215 220
Asp Thr Gly Phe Gly Cys Met Asp Phe Gly Thr Leu Gln Ala Asn Lys
225 230 235 240
Ser Asp Val Pro Ile Asp Ile Cys Asn Ser Thr Cys Lys Tyr Pro Asp
245 250 255
Tyr Leu Lys Met Ala Ser Glu Pro Tyr Gly Asp Ser Leu Phe Phe Phe
260 265 270
Leu Arg Arg Glu Gln Met Phe Val Arg His Phe Phe Asn Arg Ala Gly
275 280 285
Lys Leu Gly Glu Ala Val Pro Asp Asp Leu Tyr Ile Lys Gly Ser Gly
290 295 300
Asn Thr Ala Val Ile Gln Ser Ser Ala Phe Phe Pro Thr Pro Ser Gly
305 310 315 320
Ser Ile Val Thr Ser Glu Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu
325 330 335
Gln Arg Ala Gln Gly His Asn Asn Gly Ile Cys Trp Gly Asn Gln Leu
340 345 350
Phe Val Thr Val Val Asp Thr Thr Arg Ser Thr Asn Met Thr Leu Cys
355 360 365
Thr Glu Val Thr Lys Glu Gly Thr Tyr Lys Asn Asp Asn Phe Lys Glu
370 375 380
Tyr Val Arg His Val Glu Glu Tyr Asp Leu Gln Phe Val Phe Gln Leu
385 390 395 400
Cys Lys Ile Thr Leu Thr Ala Glu Ile Met Thr Tyr Ile His Thr Met
405 410 415
Asp Ser Asn Ile Leu Glu Asp Trp Gln Gly Gly Gly Ser Gly Lys Glu
420 425 430
Asp Pro Leu Asn Lys Tyr Thr Phe Trp Glu Val Asn Leu Lys Glu Lys
435 440 445
Phe Ser Ala Asp Leu Asp Gln Phe Pro Leu Gly Arg Lys Phe Leu Leu
450 455 460
Gln Ser Gly Leu Lys Ala Lys Pro Arg Leu Lys Arg Ser Ala Pro Thr
465 470 475 480
Thr Arg Ala Pro Ser Thr Lys Arg Lys Lys Val Lys Lys
485 490
<210> 11
<211> 521
<212> PRT
<213> 人工序列
<400> 11
Gly Ser Gly Gly Gly Leu Val Ile Leu Phe Cys Val Ala Asp Val Asn
1 5 10 15
Val Phe His Ile Phe Leu Gln Met Ser Val Trp Arg Pro Ser Glu Ala
20 25 30
Thr Val Tyr Leu Pro Pro Val Pro Val Ser Lys Val Val Ser Thr Asp
35 40 45
Glu Tyr Val Ser Arg Thr Ser Ile Tyr Tyr Tyr Ala Gly Ser Ser Arg
50 55 60
Leu Leu Ala Val Gly Asn Pro Tyr Phe Ser Ile Lys Ser Pro Asn Asn
65 70 75 80
Asn Lys Lys Val Leu Val Pro Lys Val Ser Gly Leu Gln Tyr Arg Val
85 90 95
Phe Arg Val Arg Leu Pro Asp Pro Asn Lys Phe Gly Phe Pro Asp Thr
100 105 110
Ser Phe Tyr Asn Pro Asp Thr Gln Arg Leu Val Trp Ala Cys Val Gly
115 120 125
Leu Glu Ile Gly Arg Gly Gln Pro Leu Gly Val Gly Ile Ser Gly His
130 135 140
Pro Tyr Leu Asn Lys Phe Asp Asp Thr Glu Thr Gly Asn Arg Tyr Thr
145 150 155 160
Ala Gln Pro Gly Ser Asp Asn Arg Glu Cys Leu Ser Met Asp Tyr Lys
165 170 175
Gln Thr Gln Leu Cys Leu Ile Gly Cys Lys Pro Pro Thr Gly Glu His
180 185 190
Trp Gly Lys Gly Val Ala Cys Asn Asn Asn Ala Ala Ala Thr Asp Cys
195 200 205
Pro Pro Leu Glu Leu Phe Asn Ser Ile Ile Glu Asp Gly Asp Met Val
210 215 220
Asp Thr Gly Phe Gly Cys Met Asp Phe Gly Thr Leu Gln Ala Asn Lys
225 230 235 240
Ser Asp Val Pro Ile Asp Ile Cys Asn Ser Thr Cys Lys Tyr Pro Asp
245 250 255
Tyr Leu Lys Met Ala Ser Glu Pro Tyr Gly Asp Ser Leu Phe Phe Phe
260 265 270
Leu Arg Arg Glu Gln Met Phe Val Arg His Phe Phe Asn Arg Ala Gly
275 280 285
Thr Leu Gly Glu Pro Val Pro Asn Asp Leu Tyr Ile Lys Gly Ser Gly
290 295 300
Asn Thr Ala Gly Ile Gln Ser Ser Ala Phe Phe Pro Thr Pro Ser Gly
305 310 315 320
Ser Ile Val Thr Ser Glu Ser Gln Leu Phe Asn Lys Pro Tyr Trp Leu
325 330 335
Gln Arg Ala Gln Gly His Asn Asn Asp Ile Cys Trp Gly Asn Gln Leu
340 345 350
Phe Val Thr Val Val Asp Thr Thr Arg Ser Thr Asn Met Thr Leu Cys
355 360 365
Thr Glu Val Thr Lys Glu Asp Thr Tyr Lys Asn Asn Asn Phe Lys Glu
370 375 380
Tyr Val Arg His Val Glu Glu Tyr Asp Leu Gln Phe Val Phe Gln Leu
385 390 395 400
Cys Lys Ile Thr Leu Thr Ala Glu Val Met Thr Tyr Ile His Thr Met
405 410 415
Asn Ser Asp Ile Leu Glu Asp Trp Gln Phe Gly Leu Thr Pro Pro Pro
420 425 430
Ser Ala Ser Leu Gln Asp Thr Tyr Arg Phe Val Thr Ser Gln Ala Ile
435 440 445
Thr Cys Gln Lys Thr Ala Pro Pro Lys Glu Lys Glu Asp Pro Leu Asn
450 455 460
Lys Tyr Thr Phe Trp Glu Val Asn Leu Lys Glu Lys Phe Ser Ala Asp
465 470 475 480
Leu Asp Gln Phe Pro Leu Gly Arg Lys Phe Leu Leu Gln Ser Gly Leu
485 490 495
Lys Ala Lys Pro Arg Leu Lys Arg Ser Ala Pro Thr Thr Arg Ala Pro
500 505 510
Ser Thr Lys Arg Lys Lys Val Lys Lys
515 520
Claims (11)
1.一种重组的HPV58L1蛋白,其特征在于所述蛋白的氨基酸序列如SEQ ID NO:2、SEQID NO:4、SEQ ID NO:6或SEQ ID NO:8所示。
2.编码权利要求1的蛋白的多核苷酸。
3.包含权利要求2的多核苷酸的表达载体。
4.包含权利要求3的表达载体的细胞。
5.一种HPV58L1蛋白五聚体,其特征在于该蛋白五聚体由五个相同的HPV58L1蛋白单体形成,所述HPV58L1蛋白单体的序列如SEQ ID NO:2、SEQ ID NO:4、SEQ ID NO:6或SEQ IDNO:8所示。
6.一种HPV58L1蛋白VLP,其特征在于该蛋白VLP由权利要求5所述的HPV58L1蛋白五聚体形成。
7.一种HPV疫苗,其特征在于该疫苗包括权利要求5所述的HPV58L1蛋白五聚体和药用佐剂。
8.一种HPV疫苗,其特征在于该疫苗包括权利要求6所述的HPV58L1蛋白VLP和药用佐剂。
9.如权利要求7所述的HPV疫苗的制备方法,其特征在于该方法为:
a)克隆或合成编码权利要求1所述的重组的HPV58L1蛋白的基因;
b)在大肠杆菌或酵母表达系统中表达重组的HPV58L1蛋白;
c)纯化由HPV58L1蛋白组成的五聚体;
d)HPV58L1蛋白五聚体加入药用佐剂制成疫苗。
10.如权利要求5所述的HPV58L1蛋白五聚体在制备预防HPV58感染及其导致的疾病的药物中的应用。
11.如权利要求7或8所述的HPV疫苗在制备预防HPV58感染及其导致的疾病的药物中的应用。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410054216.6A CN103992395B (zh) | 2013-02-18 | 2014-02-18 | 重组hpv-58型l1的vlp疫苗及其制备方法 |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2013100524727 | 2013-02-18 | ||
CN201310052472 | 2013-02-18 | ||
CN201310052472.7 | 2013-02-18 | ||
CN201410054216.6A CN103992395B (zh) | 2013-02-18 | 2014-02-18 | 重组hpv-58型l1的vlp疫苗及其制备方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103992395A CN103992395A (zh) | 2014-08-20 |
CN103992395B true CN103992395B (zh) | 2018-07-27 |
Family
ID=51306758
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410054216.6A Active CN103992395B (zh) | 2013-02-18 | 2014-02-18 | 重组hpv-58型l1的vlp疫苗及其制备方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103992395B (zh) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105713087B (zh) * | 2014-11-12 | 2020-05-08 | 北京康乐卫士生物技术股份有限公司 | 人乳头瘤病毒58型单克隆抗体及其应用 |
KR102351259B1 (ko) | 2015-12-04 | 2022-01-14 | 시아먼 유니버시티 | 58형 인유두종 바이러스 엘1 단백질의 돌연변이체 |
CN114716562B (zh) * | 2021-01-04 | 2023-11-10 | 中国医学科学院基础医学研究所 | 一种人乳头瘤病毒58型嵌合蛋白及其用途 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
MY139500A (en) * | 2003-11-12 | 2009-10-30 | Merck Sharp & Dohme | Optimized expression of hpv 58 l1 in yeast |
CN101116745B (zh) * | 2006-08-04 | 2011-05-11 | 长春百克生物科技股份公司 | 人乳头瘤病毒的病毒样颗粒疫苗 |
CN101245099A (zh) * | 2007-02-14 | 2008-08-20 | 马润林 | 重组人乳头瘤病毒l1衣壳蛋白的氨基酸序列及其应用 |
CN101481408A (zh) * | 2008-01-07 | 2009-07-15 | 马润林 | 防止高度聚合的重组人乳头瘤病毒l1蛋白的修饰序列 |
-
2014
- 2014-02-18 CN CN201410054216.6A patent/CN103992395B/zh active Active
Also Published As
Publication number | Publication date |
---|---|
CN103992395A (zh) | 2014-08-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101293918B (zh) | 截短的人乳头瘤病毒16型l1蛋白 | |
CN103936840B (zh) | 重组的人乳头瘤病毒33型l1蛋白及其用途 | |
CN105039359B (zh) | 16型重组人乳头瘤病毒病毒样颗粒及其制备方法 | |
CN102229660B (zh) | 截短的人乳头瘤病毒33型l1蛋白 | |
WO2008145020A1 (en) | A truncated l1 protein of human papillomavirus 11 | |
EP2589604B1 (en) | Truncated l1 protein of human papillomavirus type 52 | |
WO2008145021A1 (en) | A truncated l1 protein of human papillomavirus 6 | |
CN103992395B (zh) | 重组hpv-58型l1的vlp疫苗及其制备方法 | |
US9738691B2 (en) | Truncated L1 protein of human papillomavirus type 58 | |
KR20210018351A (ko) | 타입 39 인유두종 바이러스 l1 단백질의 돌연변이체 | |
CN104045696B (zh) | 重组的人乳头瘤病毒16型l1蛋白及其用途 | |
WO2008134935A1 (fr) | Protéines 18 l1 de type papillomavirus humain tronqué | |
CN103819543B (zh) | 重组的人乳头瘤病毒6型l1蛋白及其用途 | |
WO2021013060A1 (zh) | 嵌合的人乳头瘤病毒51型l1蛋白 | |
CN106399329B (zh) | 33型重组人乳头瘤病毒病毒样颗粒及其制备方法 | |
CN104045695B (zh) | 重组的人乳头瘤病毒18型l1蛋白及其用途 | |
WO2021013079A1 (zh) | 嵌合的人乳头瘤病毒56型l1蛋白 | |
WO2021013078A1 (zh) | 嵌合的人乳头瘤病毒52型l1蛋白 | |
WO2021013070A1 (zh) | 嵌合的人乳头瘤病毒35型l1蛋白 | |
CN104211782B (zh) | 截短的人乳头瘤病毒45型l1蛋白 | |
CN106701797A (zh) | 31型重组人乳头瘤病毒病毒样颗粒及其制备方法 | |
WO2004062584A2 (en) | Therapeutic and prophylactic vaccine for the treatment and prevention of papillomavirus infection | |
WO2021013063A1 (zh) | 嵌合的人乳头瘤病毒16型l1蛋白 | |
CN106701796A (zh) | 52型重组人乳头瘤病毒病毒样颗粒及其制备方法 | |
WO2021013077A1 (zh) | 嵌合的人乳头瘤病毒58型l1蛋白 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |