CN102199613A - 感染性丙型肝炎病毒高生产hcv突变体及其应用 - Google Patents
感染性丙型肝炎病毒高生产hcv突变体及其应用 Download PDFInfo
- Publication number
- CN102199613A CN102199613A CN201010139886XA CN201010139886A CN102199613A CN 102199613 A CN102199613 A CN 102199613A CN 201010139886X A CN201010139886X A CN 201010139886XA CN 201010139886 A CN201010139886 A CN 201010139886A CN 102199613 A CN102199613 A CN 102199613A
- Authority
- CN
- China
- Prior art keywords
- ala
- leu
- gly
- val
- thr
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 208000015181 infectious disease Diseases 0.000 title description 98
- 230000002458 infectious effect Effects 0.000 title description 46
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 98
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 96
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 96
- 108010076039 Polyproteins Proteins 0.000 claims abstract description 91
- 239000002243 precursor Substances 0.000 claims abstract description 90
- 239000004475 Arginine Substances 0.000 claims abstract description 43
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 claims abstract description 43
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 claims abstract description 29
- 241001091624 Hepatitis C virus JFH-1 Species 0.000 claims abstract description 16
- 241000711549 Hepacivirus C Species 0.000 claims description 218
- 239000002245 particle Substances 0.000 claims description 86
- 235000001014 amino acid Nutrition 0.000 claims description 67
- 150000001413 amino acids Chemical class 0.000 claims description 49
- 229940024606 amino acid Drugs 0.000 claims description 41
- 238000004519 manufacturing process Methods 0.000 claims description 40
- 229960005486 vaccine Drugs 0.000 claims description 34
- 239000002773 nucleotide Substances 0.000 claims description 31
- 125000003729 nucleotide group Chemical group 0.000 claims description 31
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 claims description 30
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 claims description 29
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 claims description 24
- 239000004473 Threonine Substances 0.000 claims description 24
- 239000004471 Glycine Substances 0.000 claims description 19
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims description 12
- 101800001014 Non-structural protein 5A Proteins 0.000 claims description 10
- 238000012258 culturing Methods 0.000 claims description 7
- 229930182817 methionine Natural products 0.000 claims description 6
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 claims description 4
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 claims description 4
- 235000003704 aspartic acid Nutrition 0.000 claims description 4
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 claims description 4
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 claims description 4
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 claims description 4
- 210000002845 virion Anatomy 0.000 claims description 4
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 claims description 2
- XUYPXLNMDZIRQH-LURJTMIESA-N N-acetyl-L-methionine Chemical compound CSCC[C@@H](C(O)=O)NC(C)=O XUYPXLNMDZIRQH-LURJTMIESA-N 0.000 claims 5
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 claims 1
- 230000009849 deactivation Effects 0.000 claims 1
- 230000004927 fusion Effects 0.000 claims 1
- 241000700605 Viruses Species 0.000 abstract description 108
- 125000003275 alpha amino acid group Chemical group 0.000 abstract description 66
- 238000006467 substitution reaction Methods 0.000 abstract description 48
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 abstract description 27
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 abstract description 6
- 238000004113 cell culture Methods 0.000 abstract description 5
- 210000004027 cell Anatomy 0.000 description 121
- 108090000623 proteins and genes Proteins 0.000 description 91
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 73
- 235000018102 proteins Nutrition 0.000 description 61
- 102000004169 proteins and genes Human genes 0.000 description 61
- 230000035772 mutation Effects 0.000 description 59
- 108091026890 Coding region Proteins 0.000 description 47
- 238000000034 method Methods 0.000 description 43
- 101710132601 Capsid protein Proteins 0.000 description 35
- 230000000694 effects Effects 0.000 description 33
- 108020004414 DNA Proteins 0.000 description 32
- 239000012228 culture supernatant Substances 0.000 description 32
- 235000004279 alanine Nutrition 0.000 description 30
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical group NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 25
- 230000010076 replication Effects 0.000 description 25
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 22
- 239000000126 substance Substances 0.000 description 22
- 239000004474 valine Substances 0.000 description 22
- 108060001084 Luciferase Proteins 0.000 description 21
- 108700008625 Reporter Genes Proteins 0.000 description 20
- 238000012216 screening Methods 0.000 description 20
- 230000003044 adaptive effect Effects 0.000 description 19
- 125000000539 amino acid group Chemical group 0.000 description 19
- 239000005089 Luciferase Substances 0.000 description 18
- 108010061238 threonyl-glycine Proteins 0.000 description 18
- 125000002987 valine group Chemical group [H]N([H])C([H])(C(*)=O)C([H])(C([H])([H])[H])C([H])([H])[H] 0.000 description 18
- 241000282326 Felis catus Species 0.000 description 16
- 101000600434 Homo sapiens Putative uncharacterized protein encoded by MIR7-3HG Proteins 0.000 description 16
- 102000014150 Interferons Human genes 0.000 description 16
- 108010050904 Interferons Proteins 0.000 description 16
- 102100037401 Putative uncharacterized protein encoded by MIR7-3HG Human genes 0.000 description 16
- 108010050848 glycylleucine Proteins 0.000 description 16
- 229940079322 interferon Drugs 0.000 description 16
- 210000004748 cultured cell Anatomy 0.000 description 15
- 239000003550 marker Substances 0.000 description 15
- 239000013612 plasmid Substances 0.000 description 15
- 238000012360 testing method Methods 0.000 description 15
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 14
- 108010052090 Renilla Luciferases Proteins 0.000 description 14
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 14
- 101710144111 Non-structural protein 3 Proteins 0.000 description 13
- 239000002671 adjuvant Substances 0.000 description 13
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 13
- 239000000427 antigen Substances 0.000 description 13
- 108091007433 antigens Proteins 0.000 description 13
- 102000036639 antigens Human genes 0.000 description 13
- 239000012634 fragment Substances 0.000 description 13
- 102220053976 rs727503323 Human genes 0.000 description 13
- 239000013598 vector Substances 0.000 description 13
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical group C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 12
- 125000000404 glutamine group Chemical group N[C@@H](CCC(N)=O)C(=O)* 0.000 description 12
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 11
- 239000004472 Lysine Substances 0.000 description 11
- 230000009385 viral infection Effects 0.000 description 11
- 238000003556 assay Methods 0.000 description 10
- 208000005176 Hepatitis C Diseases 0.000 description 9
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 9
- 101800001554 RNA-directed RNA polymerase Proteins 0.000 description 9
- 108091027544 Subgenomic mRNA Proteins 0.000 description 9
- 108091036066 Three prime untranslated region Proteins 0.000 description 9
- 238000004458 analytical method Methods 0.000 description 9
- 230000015572 biosynthetic process Effects 0.000 description 9
- 238000006243 chemical reaction Methods 0.000 description 9
- 229940079593 drug Drugs 0.000 description 9
- 239000003814 drug Substances 0.000 description 9
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 9
- 101710118188 DNA-binding protein HU-alpha Proteins 0.000 description 8
- 101710144128 Non-structural protein 2 Proteins 0.000 description 8
- 101710199667 Nuclear export protein Proteins 0.000 description 8
- 108010047495 alanylglycine Proteins 0.000 description 8
- 108010093581 aspartyl-proline Proteins 0.000 description 8
- 108010016616 cysteinylglycine Proteins 0.000 description 8
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 8
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 8
- 239000002609 medium Substances 0.000 description 8
- 230000035755 proliferation Effects 0.000 description 8
- 108010053725 prolylvaline Proteins 0.000 description 8
- 241000710188 Encephalomyocarditis virus Species 0.000 description 7
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 7
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 7
- 101800001020 Non-structural protein 4A Proteins 0.000 description 7
- 101800001019 Non-structural protein 4B Proteins 0.000 description 7
- 101710172711 Structural protein Proteins 0.000 description 7
- 108010005233 alanylglutamic acid Proteins 0.000 description 7
- 239000001963 growth medium Substances 0.000 description 7
- 108010057821 leucylproline Proteins 0.000 description 7
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 6
- 108010065920 Insulin Lispro Proteins 0.000 description 6
- 102000006992 Interferon-alpha Human genes 0.000 description 6
- 108010047761 Interferon-alpha Proteins 0.000 description 6
- 108010079364 N-glycylalanine Proteins 0.000 description 6
- 208000037581 Persistent Infection Diseases 0.000 description 6
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 6
- 239000002253 acid Substances 0.000 description 6
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 6
- 238000010586 diagram Methods 0.000 description 6
- 150000002308 glutamine derivatives Chemical group 0.000 description 6
- 108010078144 glutaminyl-glycine Proteins 0.000 description 6
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 6
- 108010077112 prolyl-proline Proteins 0.000 description 6
- 102220038185 rs144829356 Human genes 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- 238000001890 transfection Methods 0.000 description 6
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 5
- 101710125507 Integrase/recombinase Proteins 0.000 description 5
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 5
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 5
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 5
- 101710185720 Putative ethidium bromide resistance protein Proteins 0.000 description 5
- 108010067390 Viral Proteins Proteins 0.000 description 5
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 5
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 5
- 108010087924 alanylproline Proteins 0.000 description 5
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 5
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 5
- 230000005540 biological transmission Effects 0.000 description 5
- 239000002299 complementary DNA Substances 0.000 description 5
- 238000004520 electroporation Methods 0.000 description 5
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 5
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 5
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 5
- 208000006454 hepatitis Diseases 0.000 description 5
- 230000005764 inhibitory process Effects 0.000 description 5
- 238000003670 luciferase enzyme activity assay Methods 0.000 description 5
- 108010051242 phenylalanylserine Proteins 0.000 description 5
- 108010090894 prolylleucine Proteins 0.000 description 5
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 5
- 125000000341 threoninyl group Chemical group [H]OC([H])(C([H])([H])[H])C([H])(N([H])[H])C(*)=O 0.000 description 5
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 5
- 230000009466 transformation Effects 0.000 description 5
- 230000003612 virological effect Effects 0.000 description 5
- GVJHHUAWPYXKBD-UHFFFAOYSA-N (±)-α-Tocopherol Chemical compound OC1=C(C)C(C)=C2OC(CCCC(C)CCCC(C)CCCC(C)C)(C)CCC2=C1C GVJHHUAWPYXKBD-UHFFFAOYSA-N 0.000 description 4
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 4
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 4
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 4
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 4
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 4
- 241000880493 Leptailurus serval Species 0.000 description 4
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 4
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 4
- 241001465754 Metazoa Species 0.000 description 4
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 4
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 4
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 4
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 4
- 108010068380 arginylarginine Proteins 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 238000011156 evaluation Methods 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 4
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 4
- 108010089804 glycyl-threonine Proteins 0.000 description 4
- 231100000283 hepatitis Toxicity 0.000 description 4
- 238000007918 intramuscular administration Methods 0.000 description 4
- 108010054155 lysyllysine Proteins 0.000 description 4
- OHDXDNUPVVYWOV-UHFFFAOYSA-N n-methyl-1-(2-naphthalen-1-ylsulfanylphenyl)methanamine Chemical compound CNCC1=CC=CC=C1SC1=CC=CC2=CC=CC=C12 OHDXDNUPVVYWOV-UHFFFAOYSA-N 0.000 description 4
- 239000000546 pharmaceutical excipient Substances 0.000 description 4
- 239000000047 product Substances 0.000 description 4
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 4
- 108010031719 prolyl-serine Proteins 0.000 description 4
- 108010029020 prolylglycine Proteins 0.000 description 4
- 108010015796 prolylisoleucine Proteins 0.000 description 4
- 108010071207 serylmethionine Proteins 0.000 description 4
- 239000000725 suspension Substances 0.000 description 4
- 150000003679 valine derivatives Chemical group 0.000 description 4
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 3
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 3
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 3
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 3
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 3
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 3
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 3
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Chemical group OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 3
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 3
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 3
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 3
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 3
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 3
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 3
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 3
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 3
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 3
- ISQOVWDWRUONJH-YESZJQIVSA-N His-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O ISQOVWDWRUONJH-YESZJQIVSA-N 0.000 description 3
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 3
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical group OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 3
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical group OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 3
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 3
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 3
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 3
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 3
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 3
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 3
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 3
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 3
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 3
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 3
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 3
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 3
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 3
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 3
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 3
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 3
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 3
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 3
- QAXCHNZDPLSFPC-PJODQICGSA-N Trp-Ala-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QAXCHNZDPLSFPC-PJODQICGSA-N 0.000 description 3
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 3
- LABUITCFCAABSV-UHFFFAOYSA-N Val-Ala-Tyr Natural products CC(C)C(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-UHFFFAOYSA-N 0.000 description 3
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 3
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 3
- 239000004480 active ingredient Substances 0.000 description 3
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 3
- 108010060035 arginylproline Proteins 0.000 description 3
- 235000009582 asparagine Nutrition 0.000 description 3
- 229960001230 asparagine Drugs 0.000 description 3
- 125000000613 asparagine group Chemical group N[C@@H](CC(N)=O)C(=O)* 0.000 description 3
- 108010077245 asparaginyl-proline Proteins 0.000 description 3
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 3
- 108010038633 aspartylglutamate Proteins 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 239000000839 emulsion Substances 0.000 description 3
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 3
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 3
- 108010010147 glycylglutamine Proteins 0.000 description 3
- 108010015792 glycyllysine Proteins 0.000 description 3
- 108010081551 glycylphenylalanine Proteins 0.000 description 3
- 108010084389 glycyltryptophan Proteins 0.000 description 3
- 108010037850 glycylvaline Proteins 0.000 description 3
- 108010018006 histidylserine Proteins 0.000 description 3
- 230000028993 immune response Effects 0.000 description 3
- 238000012744 immunostaining Methods 0.000 description 3
- 230000000415 inactivating effect Effects 0.000 description 3
- 238000001990 intravenous administration Methods 0.000 description 3
- 108010000761 leucylarginine Proteins 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 108010064235 lysylglycine Proteins 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 239000013600 plasmid vector Substances 0.000 description 3
- 235000021251 pulses Nutrition 0.000 description 3
- 238000010839 reverse transcription Methods 0.000 description 3
- 238000003757 reverse transcription PCR Methods 0.000 description 3
- 238000007920 subcutaneous administration Methods 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 239000000829 suppository Substances 0.000 description 3
- 230000002103 transcriptional effect Effects 0.000 description 3
- 230000014616 translation Effects 0.000 description 3
- 108010080629 tryptophan-leucine Proteins 0.000 description 3
- 108010029384 tryptophyl-histidine Proteins 0.000 description 3
- 108010038745 tryptophylglycine Proteins 0.000 description 3
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 2
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 2
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 2
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 2
- UHMQKOBNPRAZGB-CIUDSAMLSA-N Ala-Glu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N UHMQKOBNPRAZGB-CIUDSAMLSA-N 0.000 description 2
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 2
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 2
- FAJIYNONGXEXAI-CQDKDKBSSA-N Ala-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 FAJIYNONGXEXAI-CQDKDKBSSA-N 0.000 description 2
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 2
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 2
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 2
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 2
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 2
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 2
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 2
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 2
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 2
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 2
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 2
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 2
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 2
- 235000002198 Annona diversifolia Nutrition 0.000 description 2
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 2
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 2
- XPSGESXVBSQZPL-SRVKXCTJSA-N Arg-Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XPSGESXVBSQZPL-SRVKXCTJSA-N 0.000 description 2
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 2
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 2
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 2
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 2
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 2
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 2
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 2
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 2
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 2
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 2
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 2
- NYLBGYLHBDFRHL-VEVYYDQMSA-N Asp-Arg-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NYLBGYLHBDFRHL-VEVYYDQMSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 2
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 2
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 2
- 241000282828 Camelus bactrianus Species 0.000 description 2
- 241000282836 Camelus dromedarius Species 0.000 description 2
- 241000283707 Capra Species 0.000 description 2
- 108010035563 Chloramphenicol O-acetyltransferase Proteins 0.000 description 2
- 102000009016 Cholera Toxin Human genes 0.000 description 2
- 108010049048 Cholera Toxin Proteins 0.000 description 2
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 2
- CVLIHKBUPSFRQP-WHFBIAKZSA-N Cys-Gly-Ala Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C)C(O)=O CVLIHKBUPSFRQP-WHFBIAKZSA-N 0.000 description 2
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 2
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 2
- 229920002307 Dextran Polymers 0.000 description 2
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 2
- WMOMPXKOKASNBK-PEFMBERDSA-N Gln-Asn-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WMOMPXKOKASNBK-PEFMBERDSA-N 0.000 description 2
- NROSLUJMIQGFKS-IUCAKERBSA-N Gln-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N NROSLUJMIQGFKS-IUCAKERBSA-N 0.000 description 2
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 2
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 2
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 2
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 2
- KIMXNQXJJWWVIN-AVGNSLFASA-N Glu-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O KIMXNQXJJWWVIN-AVGNSLFASA-N 0.000 description 2
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 2
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 2
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 2
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 2
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 2
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 2
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 2
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 2
- PEZZSFLFXXFUQD-XPUUQOCRSA-N Gly-Cys-Val Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O PEZZSFLFXXFUQD-XPUUQOCRSA-N 0.000 description 2
- VUUOMYFPWDYETE-WDSKDSINSA-N Gly-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN VUUOMYFPWDYETE-WDSKDSINSA-N 0.000 description 2
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 2
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 2
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 2
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 2
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 2
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 2
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- IMRNSEPSPFQNHF-STQMWFEESA-N Gly-Ser-Trp Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O IMRNSEPSPFQNHF-STQMWFEESA-N 0.000 description 2
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 2
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 2
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 2
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 2
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 2
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 2
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 2
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 2
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- 208000037319 Hepatitis infectious Diseases 0.000 description 2
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 2
- VTZYMXGGXOFBMX-DJFWLOJKSA-N His-Ile-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O VTZYMXGGXOFBMX-DJFWLOJKSA-N 0.000 description 2
- BZAQOPHNBFOOJS-DCAQKATOSA-N His-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O BZAQOPHNBFOOJS-DCAQKATOSA-N 0.000 description 2
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 2
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 2
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 2
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 2
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 2
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 2
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 2
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 2
- 102000002227 Interferon Type I Human genes 0.000 description 2
- 108010014726 Interferon Type I Proteins 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- 241000282838 Lama Species 0.000 description 2
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 2
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 2
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 2
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 2
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 2
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 2
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 2
- HGUUMQWGYCVPKG-DCAQKATOSA-N Leu-Pro-Cys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HGUUMQWGYCVPKG-DCAQKATOSA-N 0.000 description 2
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 2
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 2
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- SSYOBDBNBQBSQE-SRVKXCTJSA-N Lys-Cys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O SSYOBDBNBQBSQE-SRVKXCTJSA-N 0.000 description 2
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 2
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 2
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 2
- TVOOGUNBIWAURO-KATARQTJSA-N Lys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N)O TVOOGUNBIWAURO-KATARQTJSA-N 0.000 description 2
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- 229930195725 Mannitol Natural products 0.000 description 2
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 2
- 108010086093 Mung Bean Nuclease Proteins 0.000 description 2
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 2
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 2
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- 229930193140 Neomycin Natural products 0.000 description 2
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 2
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 2
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 2
- GKRCCTYAGQPMMP-IHRRRGAJSA-N Phe-Ser-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GKRCCTYAGQPMMP-IHRRRGAJSA-N 0.000 description 2
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 2
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 2
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 2
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 2
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 2
- LCWXSALTPTZKNM-CIUDSAMLSA-N Pro-Cys-Glu Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O LCWXSALTPTZKNM-CIUDSAMLSA-N 0.000 description 2
- HQVPQXMCQKXARZ-FXQIFTODSA-N Pro-Cys-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O HQVPQXMCQKXARZ-FXQIFTODSA-N 0.000 description 2
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 2
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 2
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 2
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 2
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 2
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 2
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 2
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 2
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 2
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 2
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 2
- XSXABUHLKPUVLX-JYJNAYRXSA-N Pro-Ser-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O XSXABUHLKPUVLX-JYJNAYRXSA-N 0.000 description 2
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 2
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 2
- 238000002123 RNA extraction Methods 0.000 description 2
- IWUCXVSUMQZMFG-AFCXAGJDSA-N Ribavirin Chemical compound N1=C(C(=O)N)N=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 IWUCXVSUMQZMFG-AFCXAGJDSA-N 0.000 description 2
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 2
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 2
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 2
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 2
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 2
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 2
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 2
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 2
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 2
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 2
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 2
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 2
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 2
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 2
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 2
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 2
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 2
- 229920002472 Starch Polymers 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 2
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 2
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 2
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 2
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 2
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 2
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 2
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 2
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 2
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 2
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 2
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 2
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 2
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 2
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 2
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 2
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 2
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 2
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 2
- CXUFDWZBHKUGKK-CABZTGNLSA-N Trp-Ala-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O)=CNC2=C1 CXUFDWZBHKUGKK-CABZTGNLSA-N 0.000 description 2
- OAZLRFLMQASGNW-PMVMPFDFSA-N Trp-His-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CC4=CC=C(C=C4)O)C(=O)O)N OAZLRFLMQASGNW-PMVMPFDFSA-N 0.000 description 2
- RCMHSGRBJCMFLR-BPUTZDHNSA-N Trp-Met-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 RCMHSGRBJCMFLR-BPUTZDHNSA-N 0.000 description 2
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 2
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 2
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 2
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 2
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 2
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 2
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 2
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 2
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 2
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 2
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 2
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 2
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 2
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 2
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 2
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 2
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 2
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 2
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 2
- QTXGUIMEHKCPBH-FHWLQOOXSA-N Val-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 QTXGUIMEHKCPBH-FHWLQOOXSA-N 0.000 description 2
- CFIBZQOLUDURST-IHRRRGAJSA-N Val-Tyr-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N CFIBZQOLUDURST-IHRRRGAJSA-N 0.000 description 2
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 2
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- 208000036142 Viral infection Diseases 0.000 description 2
- 229930003427 Vitamin E Natural products 0.000 description 2
- 108010070783 alanyltyrosine Proteins 0.000 description 2
- NWMHDZMRVUOQGL-CZEIJOLGSA-N almurtide Chemical compound OC(=O)CC[C@H](C(N)=O)NC(=O)[C@H](C)NC(=O)CO[C@@H]([C@H](O)[C@H](O)CO)[C@@H](NC(C)=O)C=O NWMHDZMRVUOQGL-CZEIJOLGSA-N 0.000 description 2
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 2
- 210000000628 antibody-producing cell Anatomy 0.000 description 2
- 239000003443 antiviral agent Substances 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 2
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 230000037429 base substitution Effects 0.000 description 2
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 239000000969 carrier Substances 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 108010069495 cysteinyltyrosine Proteins 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- 239000002552 dosage form Substances 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- WIGCFUFOHFEKBI-UHFFFAOYSA-N gamma-tocopherol Natural products CC(C)CCCC(C)CCCC(C)CCCC1CCC2C(C)C(O)C(C)C(C)C2O1 WIGCFUFOHFEKBI-UHFFFAOYSA-N 0.000 description 2
- 230000014509 gene expression Effects 0.000 description 2
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 2
- 108010040856 glutamyl-cysteinyl-alanine Proteins 0.000 description 2
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 239000008187 granular material Substances 0.000 description 2
- 239000005090 green fluorescent protein Substances 0.000 description 2
- 208000005252 hepatitis A Diseases 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 210000004408 hybridoma Anatomy 0.000 description 2
- 210000000987 immune system Anatomy 0.000 description 2
- 239000003112 inhibitor Substances 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 210000000265 leukocyte Anatomy 0.000 description 2
- 238000001638 lipofection Methods 0.000 description 2
- 230000007774 longterm Effects 0.000 description 2
- 239000012139 lysis buffer Substances 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- HQKMJHAJHXVSDF-UHFFFAOYSA-L magnesium stearate Chemical compound [Mg+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O HQKMJHAJHXVSDF-UHFFFAOYSA-L 0.000 description 2
- 239000000594 mannitol Substances 0.000 description 2
- 235000010355 mannitol Nutrition 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 229960004927 neomycin Drugs 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 2
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 2
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 2
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 229960000329 ribavirin Drugs 0.000 description 2
- HZCAHMRRMINHDJ-DBRKOABJSA-N ribavirin Natural products O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1N=CN=C1 HZCAHMRRMINHDJ-DBRKOABJSA-N 0.000 description 2
- 229930182490 saponin Natural products 0.000 description 2
- 150000007949 saponins Chemical class 0.000 description 2
- 235000017709 saponins Nutrition 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 238000002741 site-directed mutagenesis Methods 0.000 description 2
- 239000003381 stabilizer Substances 0.000 description 2
- 239000008107 starch Substances 0.000 description 2
- 235000019698 starch Nutrition 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 229940126585 therapeutic drug Drugs 0.000 description 2
- 230000001225 therapeutic effect Effects 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 108010036387 trimethionine Proteins 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- 108010044292 tryptophyltyrosine Proteins 0.000 description 2
- 125000001493 tyrosinyl group Chemical group [H]OC1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 2
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 2
- 235000019165 vitamin E Nutrition 0.000 description 2
- 229940046009 vitamin E Drugs 0.000 description 2
- 239000011709 vitamin E Substances 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- HDTRYLNUVZCQOY-UHFFFAOYSA-N α-D-glucopyranosyl-α-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(O)C(O)C(CO)O1 HDTRYLNUVZCQOY-UHFFFAOYSA-N 0.000 description 1
- NTUPOKHATNSWCY-PMPSAXMXSA-N (2s)-2-[[(2s)-1-[(2r)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C([C@@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=CC=C1 NTUPOKHATNSWCY-PMPSAXMXSA-N 0.000 description 1
- ZCPBEAHAVUJKAE-UHTWSYAYSA-N (2s)-2-[[(2s)-2-[[(2r)-2-[(2-aminoacetyl)amino]-3-phenylpropanoyl]amino]propanoyl]amino]butanedioic acid Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](NC(=O)CN)CC1=CC=CC=C1 ZCPBEAHAVUJKAE-UHTWSYAYSA-N 0.000 description 1
- YYGNTYWPHWGJRM-UHFFFAOYSA-N (6E,10E,14E,18E)-2,6,10,15,19,23-hexamethyltetracosa-2,6,10,14,18,22-hexaene Chemical compound CC(C)=CCCC(C)=CCCC(C)=CCCC=C(C)CCC=C(C)CCC=C(C)C YYGNTYWPHWGJRM-UHFFFAOYSA-N 0.000 description 1
- BFSVOASYOCHEOV-UHFFFAOYSA-N 2-diethylaminoethanol Chemical compound CCN(CC)CCO BFSVOASYOCHEOV-UHFFFAOYSA-N 0.000 description 1
- 108020005345 3' Untranslated Regions Proteins 0.000 description 1
- 108020003589 5' Untranslated Regions Proteins 0.000 description 1
- 108010042708 Acetylmuramyl-Alanyl-Isoglutamine Proteins 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 108010000239 Aequorin Proteins 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- SDMAQFGBPOJFOM-GUBZILKMSA-N Ala-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SDMAQFGBPOJFOM-GUBZILKMSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- KVWLTGNCJYDJET-LSJOCFKGSA-N Ala-Arg-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KVWLTGNCJYDJET-LSJOCFKGSA-N 0.000 description 1
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- WJRXVTCKASUIFF-FXQIFTODSA-N Ala-Cys-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WJRXVTCKASUIFF-FXQIFTODSA-N 0.000 description 1
- HFBFSOAKPUZCCO-ZLUOBGJFSA-N Ala-Cys-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HFBFSOAKPUZCCO-ZLUOBGJFSA-N 0.000 description 1
- FRFDXQWNDZMREB-ACZMJKKPSA-N Ala-Cys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRFDXQWNDZMREB-ACZMJKKPSA-N 0.000 description 1
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 1
- CXZFXHGJJPVUJE-CIUDSAMLSA-N Ala-Cys-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O)N CXZFXHGJJPVUJE-CIUDSAMLSA-N 0.000 description 1
- OILNWMNBLIHXQK-ZLUOBGJFSA-N Ala-Cys-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O OILNWMNBLIHXQK-ZLUOBGJFSA-N 0.000 description 1
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 1
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 1
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 1
- GRIFPSOFWFIICX-GOPGUHFVSA-N Ala-His-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GRIFPSOFWFIICX-GOPGUHFVSA-N 0.000 description 1
- LBFXVAXPDOBRKU-LKTVYLICSA-N Ala-His-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LBFXVAXPDOBRKU-LKTVYLICSA-N 0.000 description 1
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- ZKEHTYWGPMMGBC-XUXIUFHCSA-N Ala-Leu-Leu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O ZKEHTYWGPMMGBC-XUXIUFHCSA-N 0.000 description 1
- MLNSNVLOEIYJIU-ZUDIRPEPSA-N Ala-Leu-Thr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLNSNVLOEIYJIU-ZUDIRPEPSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 1
- XIWKVCDQMCNKOZ-UVBJJODRSA-N Ala-Met-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N XIWKVCDQMCNKOZ-UVBJJODRSA-N 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 1
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 1
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- UCDOXFBTMLKASE-HERUPUMHSA-N Ala-Ser-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N UCDOXFBTMLKASE-HERUPUMHSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- ISCYZXFOCXWUJU-KZVJFYERSA-N Ala-Thr-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O ISCYZXFOCXWUJU-KZVJFYERSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- ZVWXMTTZJKBJCI-BHDSKKPTSA-N Ala-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 ZVWXMTTZJKBJCI-BHDSKKPTSA-N 0.000 description 1
- FSXDWQGEWZQBPJ-HERUPUMHSA-N Ala-Trp-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FSXDWQGEWZQBPJ-HERUPUMHSA-N 0.000 description 1
- LFFOJBOTZUWINF-ZANVPECISA-N Ala-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O)=CNC2=C1 LFFOJBOTZUWINF-ZANVPECISA-N 0.000 description 1
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 1
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 1
- BHFOJPDOQPWJRN-XDTLVQLUSA-N Ala-Tyr-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CCC(N)=O)C(O)=O BHFOJPDOQPWJRN-XDTLVQLUSA-N 0.000 description 1
- JNJHNBXBGNJESC-KKXDTOCCSA-N Ala-Tyr-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JNJHNBXBGNJESC-KKXDTOCCSA-N 0.000 description 1
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 1
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 1
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 1
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 102100024321 Alkaline phosphatase, placental type Human genes 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- 108010039224 Amidophosphoribosyltransferase Proteins 0.000 description 1
- 241000272814 Anser sp. Species 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- KJGNDQCYBNBXDA-GUBZILKMSA-N Arg-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N KJGNDQCYBNBXDA-GUBZILKMSA-N 0.000 description 1
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 1
- XEPSCVXTCUUHDT-AVGNSLFASA-N Arg-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N XEPSCVXTCUUHDT-AVGNSLFASA-N 0.000 description 1
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 1
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 1
- YUIGJDNAGKJLDO-JYJNAYRXSA-N Arg-Arg-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YUIGJDNAGKJLDO-JYJNAYRXSA-N 0.000 description 1
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- HJAICMSAKODKRF-GUBZILKMSA-N Arg-Cys-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O HJAICMSAKODKRF-GUBZILKMSA-N 0.000 description 1
- YWENWUYXQUWRHQ-LPEHRKFASA-N Arg-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O YWENWUYXQUWRHQ-LPEHRKFASA-N 0.000 description 1
- YHSNASXGBPAHRL-BPUTZDHNSA-N Arg-Cys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N YHSNASXGBPAHRL-BPUTZDHNSA-N 0.000 description 1
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 1
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 1
- ZEAYJGRKRUBDOB-GARJFASQSA-N Arg-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZEAYJGRKRUBDOB-GARJFASQSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- YBIAYFFIVAZXPK-AVGNSLFASA-N Arg-His-Arg Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YBIAYFFIVAZXPK-AVGNSLFASA-N 0.000 description 1
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 1
- CRCCTGPNZUCAHE-DCAQKATOSA-N Arg-His-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 CRCCTGPNZUCAHE-DCAQKATOSA-N 0.000 description 1
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 1
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- OGSQONVYSTZIJB-WDSOQIARSA-N Arg-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OGSQONVYSTZIJB-WDSOQIARSA-N 0.000 description 1
- DIIGDGJKTMLQQW-IHRRRGAJSA-N Arg-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N DIIGDGJKTMLQQW-IHRRRGAJSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- JOADBFCFJGNIKF-GUBZILKMSA-N Arg-Met-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O JOADBFCFJGNIKF-GUBZILKMSA-N 0.000 description 1
- YLVGUOGAFAJMKP-JYJNAYRXSA-N Arg-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YLVGUOGAFAJMKP-JYJNAYRXSA-N 0.000 description 1
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 1
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 1
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 1
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 1
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 1
- QHVRVUNEAIFTEK-SZMVWBNQSA-N Arg-Pro-Trp Chemical compound N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O QHVRVUNEAIFTEK-SZMVWBNQSA-N 0.000 description 1
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- JQHASVQBAKRJKD-GUBZILKMSA-N Arg-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JQHASVQBAKRJKD-GUBZILKMSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 1
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 1
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 1
- LYJXHXGPWDTLKW-HJGDQZAQSA-N Arg-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O LYJXHXGPWDTLKW-HJGDQZAQSA-N 0.000 description 1
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 1
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 1
- XOZYYXMHMIEJET-XIRDDKMYSA-N Arg-Trp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XOZYYXMHMIEJET-XIRDDKMYSA-N 0.000 description 1
- XRLOBFSLPCHYLQ-ULQDDVLXSA-N Arg-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O XRLOBFSLPCHYLQ-ULQDDVLXSA-N 0.000 description 1
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 1
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 1
- FTMRPIVPSDVGCC-GUBZILKMSA-N Arg-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FTMRPIVPSDVGCC-GUBZILKMSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 1
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 1
- UTSMXMABBPFVJP-SZMVWBNQSA-N Arg-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UTSMXMABBPFVJP-SZMVWBNQSA-N 0.000 description 1
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 1
- GOVUDFOGXOONFT-VEVYYDQMSA-N Asn-Arg-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GOVUDFOGXOONFT-VEVYYDQMSA-N 0.000 description 1
- GMCOADLDNLGOFE-ZLUOBGJFSA-N Asn-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N GMCOADLDNLGOFE-ZLUOBGJFSA-N 0.000 description 1
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 1
- HLTLEIXYIJDFOY-ZLUOBGJFSA-N Asn-Cys-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O HLTLEIXYIJDFOY-ZLUOBGJFSA-N 0.000 description 1
- SPIPSJXLZVTXJL-ZLUOBGJFSA-N Asn-Cys-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O SPIPSJXLZVTXJL-ZLUOBGJFSA-N 0.000 description 1
- XWFPGQVLOVGSLU-CIUDSAMLSA-N Asn-Gln-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XWFPGQVLOVGSLU-CIUDSAMLSA-N 0.000 description 1
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- UYXXMIZGHYKYAT-NHCYSSNCSA-N Asn-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N UYXXMIZGHYKYAT-NHCYSSNCSA-N 0.000 description 1
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 1
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 1
- LVHMEJJWEXBMKK-GMOBBJLQSA-N Asn-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N LVHMEJJWEXBMKK-GMOBBJLQSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- CDGHMJJJHYKMPA-DLOVCJGASA-N Asn-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)N)N CDGHMJJJHYKMPA-DLOVCJGASA-N 0.000 description 1
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 1
- XHTUGJCAEYOZOR-UBHSHLNASA-N Asn-Ser-Trp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XHTUGJCAEYOZOR-UBHSHLNASA-N 0.000 description 1
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 1
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 1
- IPPFAOCLQSGHJV-WFBYXXMGSA-N Asn-Trp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O IPPFAOCLQSGHJV-WFBYXXMGSA-N 0.000 description 1
- UPAGTDJAORYMEC-VHWLVUOQSA-N Asn-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)N)N UPAGTDJAORYMEC-VHWLVUOQSA-N 0.000 description 1
- RDLYUKRPEJERMM-XIRDDKMYSA-N Asn-Trp-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O RDLYUKRPEJERMM-XIRDDKMYSA-N 0.000 description 1
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 1
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 1
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 1
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- SLHOOKXYTYAJGQ-XVYDVKMFSA-N Asp-Ala-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 SLHOOKXYTYAJGQ-XVYDVKMFSA-N 0.000 description 1
- CXBOKJPLEYUPGB-FXQIFTODSA-N Asp-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N CXBOKJPLEYUPGB-FXQIFTODSA-N 0.000 description 1
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 1
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- ZRAOLTNMSCSCLN-ZLUOBGJFSA-N Asp-Cys-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)O ZRAOLTNMSCSCLN-ZLUOBGJFSA-N 0.000 description 1
- WEDGJJRCJNHYSF-SRVKXCTJSA-N Asp-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N WEDGJJRCJNHYSF-SRVKXCTJSA-N 0.000 description 1
- WLKVEEODTPQPLI-ACZMJKKPSA-N Asp-Gln-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WLKVEEODTPQPLI-ACZMJKKPSA-N 0.000 description 1
- ZSJFGGSPCCHMNE-LAEOZQHASA-N Asp-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N ZSJFGGSPCCHMNE-LAEOZQHASA-N 0.000 description 1
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- AITKTFCQOBRJTG-CIUDSAMLSA-N Asp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N AITKTFCQOBRJTG-CIUDSAMLSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- XSXVLWBWIPKUSN-UHFFFAOYSA-N Asp-Leu-Glu-Asp Chemical compound OC(=O)CC(N)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(O)=O)C(O)=O XSXVLWBWIPKUSN-UHFFFAOYSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 1
- WDMNFNXKGSLIOB-GUBZILKMSA-N Asp-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N WDMNFNXKGSLIOB-GUBZILKMSA-N 0.000 description 1
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 1
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 1
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 1
- BKOIIURTQAJHAT-GUBZILKMSA-N Asp-Pro-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 BKOIIURTQAJHAT-GUBZILKMSA-N 0.000 description 1
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 1
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 1
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 1
- XOASPVGNFAMYBD-WFBYXXMGSA-N Asp-Trp-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O XOASPVGNFAMYBD-WFBYXXMGSA-N 0.000 description 1
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- GZYDPEJSZYZWEF-MXAVVETBSA-N Asp-Val-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O GZYDPEJSZYZWEF-MXAVVETBSA-N 0.000 description 1
- 241000271566 Aves Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 101150045282 CD81 gene Proteins 0.000 description 1
- 102000011632 Caseins Human genes 0.000 description 1
- 108010076119 Caseins Proteins 0.000 description 1
- 241000700198 Cavia Species 0.000 description 1
- 108010039939 Cell Wall Skeleton Proteins 0.000 description 1
- 206010008909 Chronic Hepatitis Diseases 0.000 description 1
- 208000006154 Chronic hepatitis C Diseases 0.000 description 1
- 108090000600 Claudin-1 Proteins 0.000 description 1
- AMRLSQGGERHDHJ-FXQIFTODSA-N Cys-Ala-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMRLSQGGERHDHJ-FXQIFTODSA-N 0.000 description 1
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 1
- PKNIZMPLMSKROD-BIIVOSGPSA-N Cys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N PKNIZMPLMSKROD-BIIVOSGPSA-N 0.000 description 1
- DEVDFMRWZASYOF-ZLUOBGJFSA-N Cys-Asn-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DEVDFMRWZASYOF-ZLUOBGJFSA-N 0.000 description 1
- GSNRZJNHMVMOFV-ACZMJKKPSA-N Cys-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N GSNRZJNHMVMOFV-ACZMJKKPSA-N 0.000 description 1
- ATPDEYTYWVMINF-ZLUOBGJFSA-N Cys-Cys-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O ATPDEYTYWVMINF-ZLUOBGJFSA-N 0.000 description 1
- MGAWEOHYNIMOQJ-ACZMJKKPSA-N Cys-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MGAWEOHYNIMOQJ-ACZMJKKPSA-N 0.000 description 1
- YZKOXEJTLWZOQL-GUBZILKMSA-N Cys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N YZKOXEJTLWZOQL-GUBZILKMSA-N 0.000 description 1
- VKAWJBQTFCBHQY-GUBZILKMSA-N Cys-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N VKAWJBQTFCBHQY-GUBZILKMSA-N 0.000 description 1
- HQZGVYJBRSISDT-BQBZGAKWSA-N Cys-Gly-Arg Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQZGVYJBRSISDT-BQBZGAKWSA-N 0.000 description 1
- GUKYYUFHWYRMEU-WHFBIAKZSA-N Cys-Gly-Asp Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O GUKYYUFHWYRMEU-WHFBIAKZSA-N 0.000 description 1
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 1
- SKSJPIBFNFPTJB-NKWVEPMBSA-N Cys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CS)N)C(=O)O SKSJPIBFNFPTJB-NKWVEPMBSA-N 0.000 description 1
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 1
- YKKHFPGOZXQAGK-QWRGUYRKSA-N Cys-Gly-Tyr Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YKKHFPGOZXQAGK-QWRGUYRKSA-N 0.000 description 1
- XVLMKWWVBNESPX-XVYDVKMFSA-N Cys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N XVLMKWWVBNESPX-XVYDVKMFSA-N 0.000 description 1
- KCSDYJSCUWLILX-BJDJZHNGSA-N Cys-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N KCSDYJSCUWLILX-BJDJZHNGSA-N 0.000 description 1
- MRVSLWQRNWEROS-SVSWQMSJSA-N Cys-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CS)N MRVSLWQRNWEROS-SVSWQMSJSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- XZKJEOMFLDVXJG-KATARQTJSA-N Cys-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)N)O XZKJEOMFLDVXJG-KATARQTJSA-N 0.000 description 1
- VDUPGIDTWNQAJD-CIUDSAMLSA-N Cys-Lys-Cys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O VDUPGIDTWNQAJD-CIUDSAMLSA-N 0.000 description 1
- QVLKXRMFNGHDRO-FXQIFTODSA-N Cys-Met-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O QVLKXRMFNGHDRO-FXQIFTODSA-N 0.000 description 1
- PGBLJHDDKCVSTC-CIUDSAMLSA-N Cys-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O PGBLJHDDKCVSTC-CIUDSAMLSA-N 0.000 description 1
- UDDITVWSXPEAIQ-IHRRRGAJSA-N Cys-Phe-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UDDITVWSXPEAIQ-IHRRRGAJSA-N 0.000 description 1
- LHJDLVVQRJIURS-SRVKXCTJSA-N Cys-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N LHJDLVVQRJIURS-SRVKXCTJSA-N 0.000 description 1
- ZOKPRHVIFAUJPV-GUBZILKMSA-N Cys-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O ZOKPRHVIFAUJPV-GUBZILKMSA-N 0.000 description 1
- RAGIABZNLPZBGS-FXQIFTODSA-N Cys-Pro-Cys Chemical compound N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(O)=O RAGIABZNLPZBGS-FXQIFTODSA-N 0.000 description 1
- NITLUESFANGEIW-BQBZGAKWSA-N Cys-Pro-Gly Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O NITLUESFANGEIW-BQBZGAKWSA-N 0.000 description 1
- XCDDSPYIMNXECQ-NAKRPEOUSA-N Cys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CS XCDDSPYIMNXECQ-NAKRPEOUSA-N 0.000 description 1
- TXGDWPBLUFQODU-XGEHTFHBSA-N Cys-Pro-Thr Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O TXGDWPBLUFQODU-XGEHTFHBSA-N 0.000 description 1
- CMYVIUWVYHOLRD-ZLUOBGJFSA-N Cys-Ser-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CMYVIUWVYHOLRD-ZLUOBGJFSA-N 0.000 description 1
- BCFXQBXXDSEHRS-FXQIFTODSA-N Cys-Ser-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BCFXQBXXDSEHRS-FXQIFTODSA-N 0.000 description 1
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 1
- LKHMGNHQULEPFY-ACZMJKKPSA-N Cys-Ser-Glu Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O LKHMGNHQULEPFY-ACZMJKKPSA-N 0.000 description 1
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 1
- SRZZZTMJARUVPI-JBDRJPRFSA-N Cys-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N SRZZZTMJARUVPI-JBDRJPRFSA-N 0.000 description 1
- ABLQPNMKLMFDQU-BIIVOSGPSA-N Cys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CS)N)C(=O)O ABLQPNMKLMFDQU-BIIVOSGPSA-N 0.000 description 1
- FANFRJOFTYCNRG-JYBASQMISA-N Cys-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CS)N)O FANFRJOFTYCNRG-JYBASQMISA-N 0.000 description 1
- DRXOWZZHCSBUOI-YJRXYDGGSA-N Cys-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N)O DRXOWZZHCSBUOI-YJRXYDGGSA-N 0.000 description 1
- VXDXZGYXHIADHF-YJRXYDGGSA-N Cys-Tyr-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VXDXZGYXHIADHF-YJRXYDGGSA-N 0.000 description 1
- DGQJGBDBFVGLGL-ZKWXMUAHSA-N Cys-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DGQJGBDBFVGLGL-ZKWXMUAHSA-N 0.000 description 1
- AZDQAZRURQMSQD-XPUUQOCRSA-N Cys-Val-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AZDQAZRURQMSQD-XPUUQOCRSA-N 0.000 description 1
- QQAYIVHVRFJICE-AEJSXWLSSA-N Cys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N QQAYIVHVRFJICE-AEJSXWLSSA-N 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- IGXWBGJHJZYPQS-SSDOTTSWSA-N D-Luciferin Chemical compound OC(=O)[C@H]1CSC(C=2SC3=CC=C(O)C=C3N=2)=N1 IGXWBGJHJZYPQS-SSDOTTSWSA-N 0.000 description 1
- 108010041986 DNA Vaccines Proteins 0.000 description 1
- 229940021995 DNA vaccine Drugs 0.000 description 1
- CYCGRDQQIOGCKX-UHFFFAOYSA-N Dehydro-luciferin Natural products OC(=O)C1=CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 CYCGRDQQIOGCKX-UHFFFAOYSA-N 0.000 description 1
- 235000017274 Diospyros sandwicensis Nutrition 0.000 description 1
- 108090000204 Dipeptidase 1 Proteins 0.000 description 1
- 206010059866 Drug resistance Diseases 0.000 description 1
- 102100027723 Endogenous retrovirus group K member 6 Rec protein Human genes 0.000 description 1
- 101710091045 Envelope protein Proteins 0.000 description 1
- 241000283086 Equidae Species 0.000 description 1
- 241000620209 Escherichia coli DH5[alpha] Species 0.000 description 1
- BJGNCJDXODQBOB-UHFFFAOYSA-N Fivefly Luciferin Natural products OC(=O)C1CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 BJGNCJDXODQBOB-UHFFFAOYSA-N 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- REJJNXODKSHOKA-ACZMJKKPSA-N Gln-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N REJJNXODKSHOKA-ACZMJKKPSA-N 0.000 description 1
- DTCCMDYODDPHBG-ACZMJKKPSA-N Gln-Ala-Cys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O DTCCMDYODDPHBG-ACZMJKKPSA-N 0.000 description 1
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 1
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 1
- KZKBJEUWNMQTLV-XDTLVQLUSA-N Gln-Ala-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZKBJEUWNMQTLV-XDTLVQLUSA-N 0.000 description 1
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 1
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 1
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- SXIJQMBEVYWAQT-GUBZILKMSA-N Gln-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXIJQMBEVYWAQT-GUBZILKMSA-N 0.000 description 1
- CRRFJBGUGNNOCS-PEFMBERDSA-N Gln-Asp-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CRRFJBGUGNNOCS-PEFMBERDSA-N 0.000 description 1
- UICOTGULOUGGLC-NUMRIWBASA-N Gln-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UICOTGULOUGGLC-NUMRIWBASA-N 0.000 description 1
- UVAOVENCIONMJP-GUBZILKMSA-N Gln-Cys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O UVAOVENCIONMJP-GUBZILKMSA-N 0.000 description 1
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 1
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 1
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 1
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- GQTNWYFWSUFFRA-KKUMJFAQSA-N Gln-Met-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GQTNWYFWSUFFRA-KKUMJFAQSA-N 0.000 description 1
- PDXIOFXRBVDSHD-JBACZVJFSA-N Gln-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CCC(=O)N)N PDXIOFXRBVDSHD-JBACZVJFSA-N 0.000 description 1
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 1
- XUMFMAVDHQDATI-DCAQKATOSA-N Gln-Pro-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XUMFMAVDHQDATI-DCAQKATOSA-N 0.000 description 1
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 1
- OREPWMPAUWIIAM-ZPFDUUQYSA-N Gln-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N OREPWMPAUWIIAM-ZPFDUUQYSA-N 0.000 description 1
- DYVMTEWCGAVKSE-HJGDQZAQSA-N Gln-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O DYVMTEWCGAVKSE-HJGDQZAQSA-N 0.000 description 1
- XIYWAJQIWLXXAF-XKBZYTNZSA-N Gln-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XIYWAJQIWLXXAF-XKBZYTNZSA-N 0.000 description 1
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 1
- OUBUHIODTNUUTC-WDCWCFNPSA-N Gln-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OUBUHIODTNUUTC-WDCWCFNPSA-N 0.000 description 1
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 1
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 1
- SAHTWBLTLJWAQA-XIRDDKMYSA-N Gln-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N SAHTWBLTLJWAQA-XIRDDKMYSA-N 0.000 description 1
- WIMVKDYAKRAUCG-IHRRRGAJSA-N Gln-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WIMVKDYAKRAUCG-IHRRRGAJSA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- UBRQJXFDVZNYJP-AVGNSLFASA-N Gln-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UBRQJXFDVZNYJP-AVGNSLFASA-N 0.000 description 1
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 1
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- SBYVDRJAXWSXQL-AVGNSLFASA-N Glu-Asn-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SBYVDRJAXWSXQL-AVGNSLFASA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- NADWTMLCUDMDQI-ACZMJKKPSA-N Glu-Asp-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N NADWTMLCUDMDQI-ACZMJKKPSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- GZWOBWMOMPFPCD-CIUDSAMLSA-N Glu-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N GZWOBWMOMPFPCD-CIUDSAMLSA-N 0.000 description 1
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- SAEBUDRWKUXLOM-ACZMJKKPSA-N Glu-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O SAEBUDRWKUXLOM-ACZMJKKPSA-N 0.000 description 1
- KLJMRPIBBLTDGE-ACZMJKKPSA-N Glu-Cys-Asn Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O KLJMRPIBBLTDGE-ACZMJKKPSA-N 0.000 description 1
- PNAOVYHADQRJQU-GUBZILKMSA-N Glu-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N PNAOVYHADQRJQU-GUBZILKMSA-N 0.000 description 1
- ZXLZWUQBRYGDNS-CIUDSAMLSA-N Glu-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N ZXLZWUQBRYGDNS-CIUDSAMLSA-N 0.000 description 1
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 1
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- BRKUZSLQMPNVFN-SRVKXCTJSA-N Glu-His-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BRKUZSLQMPNVFN-SRVKXCTJSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- ZGEJRLJEAMPEDV-SRVKXCTJSA-N Glu-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N ZGEJRLJEAMPEDV-SRVKXCTJSA-N 0.000 description 1
- JHSRJMUJOGLIHK-GUBZILKMSA-N Glu-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N JHSRJMUJOGLIHK-GUBZILKMSA-N 0.000 description 1
- ZTVGZOIBLRPQNR-KKUMJFAQSA-N Glu-Met-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZTVGZOIBLRPQNR-KKUMJFAQSA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- ZKONLKQGTNVAPR-DCAQKATOSA-N Glu-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N ZKONLKQGTNVAPR-DCAQKATOSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- TZXOPHFCAATANZ-QEJZJMRPSA-N Glu-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N TZXOPHFCAATANZ-QEJZJMRPSA-N 0.000 description 1
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- MIWJDJAMMKHUAR-ZVZYQTTQSA-N Glu-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N MIWJDJAMMKHUAR-ZVZYQTTQSA-N 0.000 description 1
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 1
- 102000053187 Glucuronidase Human genes 0.000 description 1
- 108010060309 Glucuronidase Proteins 0.000 description 1
- SXRSQZLOMIGNAQ-UHFFFAOYSA-N Glutaraldehyde Chemical compound O=CCCCC=O SXRSQZLOMIGNAQ-UHFFFAOYSA-N 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- RQZGFWKQLPJOEQ-YUMQZZPRSA-N Gly-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN)CN=C(N)N RQZGFWKQLPJOEQ-YUMQZZPRSA-N 0.000 description 1
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 1
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 1
- DUYYPIRFTLOAJQ-YUMQZZPRSA-N Gly-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN DUYYPIRFTLOAJQ-YUMQZZPRSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 1
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 1
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 1
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 1
- GVVKYKCOFMMTKZ-WHFBIAKZSA-N Gly-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)CN GVVKYKCOFMMTKZ-WHFBIAKZSA-N 0.000 description 1
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- UPADCCSMVOQAGF-LBPRGKRZSA-N Gly-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)CN)C(O)=O)=CNC2=C1 UPADCCSMVOQAGF-LBPRGKRZSA-N 0.000 description 1
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 1
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 1
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 1
- ZOTGXWMKUFSKEU-QXEWZRGKSA-N Gly-Ile-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O ZOTGXWMKUFSKEU-QXEWZRGKSA-N 0.000 description 1
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 1
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- JPAACTMBBBGAAR-HOTGVXAUSA-N Gly-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)CC(C)C)C(O)=O)=CNC2=C1 JPAACTMBBBGAAR-HOTGVXAUSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 1
- QGDOOCIPHSSADO-STQMWFEESA-N Gly-Met-Phe Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGDOOCIPHSSADO-STQMWFEESA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 1
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 1
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 1
- ISSDODCYBOWWIP-GJZGRUSLSA-N Gly-Pro-Trp Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISSDODCYBOWWIP-GJZGRUSLSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- RCHFYMASWAZQQZ-ZANVPECISA-N Gly-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CN)=CNC2=C1 RCHFYMASWAZQQZ-ZANVPECISA-N 0.000 description 1
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 1
- SFOXOSKVTLDEDM-HOTGVXAUSA-N Gly-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)CN)=CNC2=C1 SFOXOSKVTLDEDM-HOTGVXAUSA-N 0.000 description 1
- PASHZZBXZYEXFE-LSDHHAIUSA-N Gly-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)CN)C(=O)O PASHZZBXZYEXFE-LSDHHAIUSA-N 0.000 description 1
- GWNIGUKSRJBIHX-STQMWFEESA-N Gly-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN)O GWNIGUKSRJBIHX-STQMWFEESA-N 0.000 description 1
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 1
- COZMNNJEGNPDED-HOCLYGCPSA-N Gly-Val-Trp Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O COZMNNJEGNPDED-HOCLYGCPSA-N 0.000 description 1
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- 206010019786 Hepatitis non-A non-B Diseases 0.000 description 1
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 1
- UCDWNBFOZCZSNV-AVGNSLFASA-N His-Arg-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O UCDWNBFOZCZSNV-AVGNSLFASA-N 0.000 description 1
- CJGDTAHEMXLRMB-ULQDDVLXSA-N His-Arg-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CJGDTAHEMXLRMB-ULQDDVLXSA-N 0.000 description 1
- ZPVJJPAIUZLSNE-DCAQKATOSA-N His-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O ZPVJJPAIUZLSNE-DCAQKATOSA-N 0.000 description 1
- LYSVCKOXIDKEEL-SRVKXCTJSA-N His-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LYSVCKOXIDKEEL-SRVKXCTJSA-N 0.000 description 1
- VYMGAXSNYUFVCK-GUBZILKMSA-N His-Gln-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N VYMGAXSNYUFVCK-GUBZILKMSA-N 0.000 description 1
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 1
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 1
- STOOMQFEJUVAKR-KKUMJFAQSA-N His-His-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CNC=N1 STOOMQFEJUVAKR-KKUMJFAQSA-N 0.000 description 1
- QMUHTRISZMFKAY-MXAVVETBSA-N His-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N QMUHTRISZMFKAY-MXAVVETBSA-N 0.000 description 1
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 1
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 1
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 1
- LVWIJITYHRZHBO-IXOXFDKPSA-N His-Leu-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LVWIJITYHRZHBO-IXOXFDKPSA-N 0.000 description 1
- CTEMYIWDSVICKS-WDSOQIARSA-N His-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N CTEMYIWDSVICKS-WDSOQIARSA-N 0.000 description 1
- SGLXGEDPYJPGIQ-ACRUOGEOSA-N His-Phe-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N SGLXGEDPYJPGIQ-ACRUOGEOSA-N 0.000 description 1
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 1
- KAXZXLSXFWSNNZ-XVYDVKMFSA-N His-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KAXZXLSXFWSNNZ-XVYDVKMFSA-N 0.000 description 1
- CWSZWFILCNSNEX-CIUDSAMLSA-N His-Ser-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CWSZWFILCNSNEX-CIUDSAMLSA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- UOYGZBIPZYKGSH-SRVKXCTJSA-N His-Ser-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N UOYGZBIPZYKGSH-SRVKXCTJSA-N 0.000 description 1
- ILUVWFTXAUYOBW-CUJWVEQBSA-N His-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N)O ILUVWFTXAUYOBW-CUJWVEQBSA-N 0.000 description 1
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 1
- DEMIXZCKUXVEBO-BWAGICSOSA-N His-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O DEMIXZCKUXVEBO-BWAGICSOSA-N 0.000 description 1
- KECFCPNPPYCGBL-PMVMPFDFSA-N His-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CN=CN4)N KECFCPNPPYCGBL-PMVMPFDFSA-N 0.000 description 1
- BCSGDNGNHKBRRJ-ULQDDVLXSA-N His-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N BCSGDNGNHKBRRJ-ULQDDVLXSA-N 0.000 description 1
- GYXDQXPCPASCNR-NHCYSSNCSA-N His-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N GYXDQXPCPASCNR-NHCYSSNCSA-N 0.000 description 1
- CGAMSLMBYJHMDY-ONGXEEELSA-N His-Val-Gly Chemical compound CC(C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N CGAMSLMBYJHMDY-ONGXEEELSA-N 0.000 description 1
- FBOMZVOKCZMDIG-XQQFMLRXSA-N His-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N FBOMZVOKCZMDIG-XQQFMLRXSA-N 0.000 description 1
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- WZPIKDWQVRTATP-SYWGBEHUSA-N Ile-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 WZPIKDWQVRTATP-SYWGBEHUSA-N 0.000 description 1
- UNDGQKWQNSTPPW-CYDGBPFRSA-N Ile-Arg-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N UNDGQKWQNSTPPW-CYDGBPFRSA-N 0.000 description 1
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 1
- RSDHVTMRXSABSV-GHCJXIJMSA-N Ile-Asn-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N RSDHVTMRXSABSV-GHCJXIJMSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- XLDYDEDTGMHUCZ-GHCJXIJMSA-N Ile-Asp-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N XLDYDEDTGMHUCZ-GHCJXIJMSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- CCHSQWLCOOZREA-GMOBBJLQSA-N Ile-Asp-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N CCHSQWLCOOZREA-GMOBBJLQSA-N 0.000 description 1
- CTHAJJYOHOBUDY-GHCJXIJMSA-N Ile-Cys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N CTHAJJYOHOBUDY-GHCJXIJMSA-N 0.000 description 1
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 1
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 1
- OVPYIUNCVSOVNF-KQXIARHKSA-N Ile-Gln-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N OVPYIUNCVSOVNF-KQXIARHKSA-N 0.000 description 1
- HTDRTKMNJRRYOJ-SIUGBPQLSA-N Ile-Gln-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HTDRTKMNJRRYOJ-SIUGBPQLSA-N 0.000 description 1
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 1
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 1
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 1
- AREBLHSMLMRICD-PYJNHQTQSA-N Ile-His-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AREBLHSMLMRICD-PYJNHQTQSA-N 0.000 description 1
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 1
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 1
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 1
- UWLHDGMRWXHFFY-HPCHECBXSA-N Ile-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N1CCC[C@@H]1C(=O)O)N UWLHDGMRWXHFFY-HPCHECBXSA-N 0.000 description 1
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 1
- RFMDODRWJZHZCR-BJDJZHNGSA-N Ile-Lys-Cys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(O)=O RFMDODRWJZHZCR-BJDJZHNGSA-N 0.000 description 1
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 1
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 1
- KTTMFLSBTNBAHL-MXAVVETBSA-N Ile-Phe-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N KTTMFLSBTNBAHL-MXAVVETBSA-N 0.000 description 1
- USXAYNCLFSUSBA-MGHWNKPDSA-N Ile-Phe-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N USXAYNCLFSUSBA-MGHWNKPDSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 1
- VISRCHQHQCLODA-NAKRPEOUSA-N Ile-Pro-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N VISRCHQHQCLODA-NAKRPEOUSA-N 0.000 description 1
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 1
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 1
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 1
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- RKQAYOWLSFLJEE-SVSWQMSJSA-N Ile-Thr-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N RKQAYOWLSFLJEE-SVSWQMSJSA-N 0.000 description 1
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- KWHFUMYCSPJCFQ-NGTWOADLSA-N Ile-Thr-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N KWHFUMYCSPJCFQ-NGTWOADLSA-N 0.000 description 1
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- BLFXHAFTNYZEQE-VKOGCVSHSA-N Ile-Trp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BLFXHAFTNYZEQE-VKOGCVSHSA-N 0.000 description 1
- MGUTVMBNOMJLKC-VKOGCVSHSA-N Ile-Trp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C(C)C)C(=O)O)N MGUTVMBNOMJLKC-VKOGCVSHSA-N 0.000 description 1
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 1
- GNXGAVNTVNOCLL-SIUGBPQLSA-N Ile-Tyr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GNXGAVNTVNOCLL-SIUGBPQLSA-N 0.000 description 1
- NSPNUMNLZNOPAQ-SJWGOKEGSA-N Ile-Tyr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N NSPNUMNLZNOPAQ-SJWGOKEGSA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 1
- SWNRZNLXMXRCJC-VKOGCVSHSA-N Ile-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 SWNRZNLXMXRCJC-VKOGCVSHSA-N 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 108020005350 Initiator Codon Proteins 0.000 description 1
- 102000004310 Ion Channels Human genes 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 241000282842 Lama glama Species 0.000 description 1
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 1
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 1
- NTRAGDHVSGKUSF-AVGNSLFASA-N Leu-Arg-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NTRAGDHVSGKUSF-AVGNSLFASA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 1
- CUXRXAIAVYLVFD-ULQDDVLXSA-N Leu-Arg-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUXRXAIAVYLVFD-ULQDDVLXSA-N 0.000 description 1
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 1
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- PIHFVNPEAHFNLN-KKUMJFAQSA-N Leu-Cys-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N PIHFVNPEAHFNLN-KKUMJFAQSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 1
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 1
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 1
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 1
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- SUYRAPCRSCCPAK-VFAJRCTISA-N Leu-Trp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUYRAPCRSCCPAK-VFAJRCTISA-N 0.000 description 1
- WPIKRJDRQVFRHP-TUSQITKMSA-N Leu-Trp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O WPIKRJDRQVFRHP-TUSQITKMSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 239000012097 Lipofectamine 2000 Substances 0.000 description 1
- DDWFXDSYGUXRAY-UHFFFAOYSA-N Luciferin Natural products CCc1c(C)c(CC2NC(=O)C(=C2C=C)C)[nH]c1Cc3[nH]c4C(=C5/NC(CC(=O)O)C(C)C5CC(=O)O)CC(=O)c4c3C DDWFXDSYGUXRAY-UHFFFAOYSA-N 0.000 description 1
- 239000006142 Luria-Bertani Agar Substances 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- BTSXLXFPMZXVPR-DLOVCJGASA-N Lys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BTSXLXFPMZXVPR-DLOVCJGASA-N 0.000 description 1
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 1
- YRWCPXOFBKTCFY-NUTKFTJISA-N Lys-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N YRWCPXOFBKTCFY-NUTKFTJISA-N 0.000 description 1
- ALSRJRIWBNENFY-DCAQKATOSA-N Lys-Arg-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O ALSRJRIWBNENFY-DCAQKATOSA-N 0.000 description 1
- JBRWKVANRYPCAF-XIRDDKMYSA-N Lys-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N JBRWKVANRYPCAF-XIRDDKMYSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 1
- RDIILCRAWOSDOQ-CIUDSAMLSA-N Lys-Cys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RDIILCRAWOSDOQ-CIUDSAMLSA-N 0.000 description 1
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- ZASPELYMPSACER-HOCLYGCPSA-N Lys-Gly-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZASPELYMPSACER-HOCLYGCPSA-N 0.000 description 1
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 1
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 1
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 1
- AHFOKDZWPPGJAZ-SRVKXCTJSA-N Lys-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N AHFOKDZWPPGJAZ-SRVKXCTJSA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 1
- AEIIJFBQVGYVEV-YESZJQIVSA-N Lys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCCN)N)C(=O)O AEIIJFBQVGYVEV-YESZJQIVSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- SVSQSPICRKBMSZ-SRVKXCTJSA-N Lys-Pro-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O SVSQSPICRKBMSZ-SRVKXCTJSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 1
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 1
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 1
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 1
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 1
- ZEDVFJPQNNBMST-CYDGBPFRSA-N Met-Arg-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZEDVFJPQNNBMST-CYDGBPFRSA-N 0.000 description 1
- ZAJNRWKGHWGPDQ-SDDRHHMPSA-N Met-Arg-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N ZAJNRWKGHWGPDQ-SDDRHHMPSA-N 0.000 description 1
- MDXAULHWGWETHF-SRVKXCTJSA-N Met-Arg-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCNC(N)=N MDXAULHWGWETHF-SRVKXCTJSA-N 0.000 description 1
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 1
- HDNOQCZWJGGHSS-VEVYYDQMSA-N Met-Asn-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HDNOQCZWJGGHSS-VEVYYDQMSA-N 0.000 description 1
- QWTGQXGNNMIUCW-BPUTZDHNSA-N Met-Asn-Trp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QWTGQXGNNMIUCW-BPUTZDHNSA-N 0.000 description 1
- YLLWCSDBVGZLOW-CIUDSAMLSA-N Met-Gln-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O YLLWCSDBVGZLOW-CIUDSAMLSA-N 0.000 description 1
- UOENBSHXYCHSAU-YUMQZZPRSA-N Met-Gln-Gly Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UOENBSHXYCHSAU-YUMQZZPRSA-N 0.000 description 1
- PHWSCIFNNLLUFJ-NHCYSSNCSA-N Met-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N PHWSCIFNNLLUFJ-NHCYSSNCSA-N 0.000 description 1
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 1
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 1
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 1
- WWWGMQHQSAUXBU-BQBZGAKWSA-N Met-Gly-Asn Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O WWWGMQHQSAUXBU-BQBZGAKWSA-N 0.000 description 1
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 1
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 1
- LQMHZERGCQJKAH-STQMWFEESA-N Met-Gly-Phe Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LQMHZERGCQJKAH-STQMWFEESA-N 0.000 description 1
- BCRQJDMZQUHQSV-STQMWFEESA-N Met-Gly-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BCRQJDMZQUHQSV-STQMWFEESA-N 0.000 description 1
- RVYDCISQIGHAFC-ZPFDUUQYSA-N Met-Ile-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O RVYDCISQIGHAFC-ZPFDUUQYSA-N 0.000 description 1
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- AWGBEIYZPAXXSX-RWMBFGLXSA-N Met-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N AWGBEIYZPAXXSX-RWMBFGLXSA-N 0.000 description 1
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 1
- VWWGEKCAPBMIFE-SRVKXCTJSA-N Met-Met-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O VWWGEKCAPBMIFE-SRVKXCTJSA-N 0.000 description 1
- CNAGWYQWQDMUGC-IHRRRGAJSA-N Met-Phe-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CNAGWYQWQDMUGC-IHRRRGAJSA-N 0.000 description 1
- KRLKICLNEICJGV-STQMWFEESA-N Met-Phe-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 KRLKICLNEICJGV-STQMWFEESA-N 0.000 description 1
- XIGAHPDZLAYQOS-SRVKXCTJSA-N Met-Pro-Pro Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 XIGAHPDZLAYQOS-SRVKXCTJSA-N 0.000 description 1
- SOAYQFDWEIWPPR-IHRRRGAJSA-N Met-Ser-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SOAYQFDWEIWPPR-IHRRRGAJSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 1
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 1
- WYNIRYZIFZGWQD-BPUTZDHNSA-N Met-Trp-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WYNIRYZIFZGWQD-BPUTZDHNSA-N 0.000 description 1
- CULGJGUDIJATIP-STQMWFEESA-N Met-Tyr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 CULGJGUDIJATIP-STQMWFEESA-N 0.000 description 1
- CNFMPVYIVQUJOO-NHCYSSNCSA-N Met-Val-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O CNFMPVYIVQUJOO-NHCYSSNCSA-N 0.000 description 1
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 125000003047 N-acetyl group Chemical group 0.000 description 1
- 108700015872 N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine Proteins 0.000 description 1
- 108700020354 N-acetylmuramyl-threonyl-isoglutamine Proteins 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 108700020796 Oncogene Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 241000282579 Pan Species 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 1
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 1
- JOXIIFVCSATTDH-IHPCNDPISA-N Phe-Asn-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JOXIIFVCSATTDH-IHPCNDPISA-N 0.000 description 1
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 1
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 1
- QEPZQAPZKIPVDV-KKUMJFAQSA-N Phe-Cys-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N QEPZQAPZKIPVDV-KKUMJFAQSA-N 0.000 description 1
- VJEZWOSKRCLHRP-MELADBBJSA-N Phe-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O VJEZWOSKRCLHRP-MELADBBJSA-N 0.000 description 1
- OWCLJDXHHZUNEL-IHRRRGAJSA-N Phe-Cys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OWCLJDXHHZUNEL-IHRRRGAJSA-N 0.000 description 1
- ABQFNJAFONNUTH-FHWLQOOXSA-N Phe-Gln-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N ABQFNJAFONNUTH-FHWLQOOXSA-N 0.000 description 1
- XXAOSEUPEMQJOF-KKUMJFAQSA-N Phe-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XXAOSEUPEMQJOF-KKUMJFAQSA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- ONORAGIFHNAADN-LLLHUVSDSA-N Phe-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N ONORAGIFHNAADN-LLLHUVSDSA-N 0.000 description 1
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 1
- OXKJSGGTHFMGDT-UFYCRDLUSA-N Phe-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C1=CC=CC=C1 OXKJSGGTHFMGDT-UFYCRDLUSA-N 0.000 description 1
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 1
- CKJACGQPCPMWIT-UFYCRDLUSA-N Phe-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CKJACGQPCPMWIT-UFYCRDLUSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 1
- MRWOVVNKSXXLRP-IHPCNDPISA-N Phe-Ser-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MRWOVVNKSXXLRP-IHPCNDPISA-N 0.000 description 1
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 1
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- PTDAGKJHZBGDKD-OEAJRASXSA-N Phe-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O PTDAGKJHZBGDKD-OEAJRASXSA-N 0.000 description 1
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 1
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 1
- YTGGLKWSVIRECD-JBACZVJFSA-N Phe-Trp-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 YTGGLKWSVIRECD-JBACZVJFSA-N 0.000 description 1
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 1
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 1
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 1
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 1
- KQCCDMFIALWGTL-GUBZILKMSA-N Pro-Asn-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 KQCCDMFIALWGTL-GUBZILKMSA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- AIZVVCMAFRREQS-GUBZILKMSA-N Pro-Cys-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AIZVVCMAFRREQS-GUBZILKMSA-N 0.000 description 1
- DIZLUAZLNDFDPR-CIUDSAMLSA-N Pro-Cys-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 DIZLUAZLNDFDPR-CIUDSAMLSA-N 0.000 description 1
- SZZBUDVXWZZPDH-BQBZGAKWSA-N Pro-Cys-Gly Chemical compound OC(=O)CNC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 SZZBUDVXWZZPDH-BQBZGAKWSA-N 0.000 description 1
- OLTFZQIYCNOBLI-DCAQKATOSA-N Pro-Cys-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O OLTFZQIYCNOBLI-DCAQKATOSA-N 0.000 description 1
- XJROSHJRQTXWAE-XGEHTFHBSA-N Pro-Cys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XJROSHJRQTXWAE-XGEHTFHBSA-N 0.000 description 1
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 1
- FISHYTLIMUYTQY-GUBZILKMSA-N Pro-Gln-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 FISHYTLIMUYTQY-GUBZILKMSA-N 0.000 description 1
- XZONQWUEBAFQPO-HJGDQZAQSA-N Pro-Gln-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZONQWUEBAFQPO-HJGDQZAQSA-N 0.000 description 1
- KTFZQPLSPLWLKN-KKUMJFAQSA-N Pro-Gln-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KTFZQPLSPLWLKN-KKUMJFAQSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 1
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- QGOZJLYCGRYYRW-KKUMJFAQSA-N Pro-Glu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QGOZJLYCGRYYRW-KKUMJFAQSA-N 0.000 description 1
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 1
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- BFXZQMWKTYWGCF-PYJNHQTQSA-N Pro-His-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BFXZQMWKTYWGCF-PYJNHQTQSA-N 0.000 description 1
- AQSMZTIEJMZQEC-DCAQKATOSA-N Pro-His-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O AQSMZTIEJMZQEC-DCAQKATOSA-N 0.000 description 1
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- YAZNFQUKPUASKB-DCAQKATOSA-N Pro-Lys-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O YAZNFQUKPUASKB-DCAQKATOSA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 1
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 description 1
- XZBYTHCRAVAXQQ-DCAQKATOSA-N Pro-Met-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O XZBYTHCRAVAXQQ-DCAQKATOSA-N 0.000 description 1
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 1
- APIAILHCTSBGLU-JYJNAYRXSA-N Pro-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@@H]2CCCN2 APIAILHCTSBGLU-JYJNAYRXSA-N 0.000 description 1
- WLJYLAQSUSIQNH-GUBZILKMSA-N Pro-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@@H]1CCCN1 WLJYLAQSUSIQNH-GUBZILKMSA-N 0.000 description 1
- DYMPSOABVJIFBS-IHRRRGAJSA-N Pro-Phe-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CS)C(=O)O DYMPSOABVJIFBS-IHRRRGAJSA-N 0.000 description 1
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 1
- BUEIYHBJHCDAMI-UFYCRDLUSA-N Pro-Phe-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BUEIYHBJHCDAMI-UFYCRDLUSA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- HOTVCUAVDQHUDB-UFYCRDLUSA-N Pro-Phe-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 HOTVCUAVDQHUDB-UFYCRDLUSA-N 0.000 description 1
- HWLKHNDRXWTFTN-GUBZILKMSA-N Pro-Pro-Cys Chemical compound C1C[C@H](NC1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CS)C(=O)O HWLKHNDRXWTFTN-GUBZILKMSA-N 0.000 description 1
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 1
- SVXXJYJCRNKDDE-AVGNSLFASA-N Pro-Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CN=CN1 SVXXJYJCRNKDDE-AVGNSLFASA-N 0.000 description 1
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 1
- QAAYIXYLEMRULP-SRVKXCTJSA-N Pro-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 QAAYIXYLEMRULP-SRVKXCTJSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- BJCXXMGGPHRSHV-GUBZILKMSA-N Pro-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BJCXXMGGPHRSHV-GUBZILKMSA-N 0.000 description 1
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 1
- GXWRTSIVLSQACD-RCWTZXSCSA-N Pro-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1)O GXWRTSIVLSQACD-RCWTZXSCSA-N 0.000 description 1
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 1
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 1
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- YIPFBJGBRCJJJD-FHWLQOOXSA-N Pro-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 YIPFBJGBRCJJJD-FHWLQOOXSA-N 0.000 description 1
- SNSYSBUTTJBPDG-OKZBNKHCSA-N Pro-Trp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N4CCC[C@@H]4C(=O)O SNSYSBUTTJBPDG-OKZBNKHCSA-N 0.000 description 1
- BVTYXOFTHDXSNI-IHRRRGAJSA-N Pro-Tyr-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 BVTYXOFTHDXSNI-IHRRRGAJSA-N 0.000 description 1
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 description 1
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 1
- DGDCSVGVWWAJRS-AVGNSLFASA-N Pro-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 DGDCSVGVWWAJRS-AVGNSLFASA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- 101710188315 Protein X Proteins 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 241000242743 Renilla reniformis Species 0.000 description 1
- -1 SEQ ID NO: 2 Amino acid Chemical group 0.000 description 1
- 241000242583 Scyphozoa Species 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 1
- BKOKTRCZXRIQPX-ZLUOBGJFSA-N Ser-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N BKOKTRCZXRIQPX-ZLUOBGJFSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 1
- JJKSSJVYOVRJMZ-FXQIFTODSA-N Ser-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)CN=C(N)N JJKSSJVYOVRJMZ-FXQIFTODSA-N 0.000 description 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 1
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 1
- TUYBIWUZWJUZDD-ACZMJKKPSA-N Ser-Cys-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(N)=O TUYBIWUZWJUZDD-ACZMJKKPSA-N 0.000 description 1
- INCNPLPRPOYTJI-JBDRJPRFSA-N Ser-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N INCNPLPRPOYTJI-JBDRJPRFSA-N 0.000 description 1
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- GYXVUTAOICLGKJ-ACZMJKKPSA-N Ser-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N GYXVUTAOICLGKJ-ACZMJKKPSA-N 0.000 description 1
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 1
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 1
- IOVBCLGAJJXOHK-SRVKXCTJSA-N Ser-His-His Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IOVBCLGAJJXOHK-SRVKXCTJSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 1
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 1
- RQXDSYQXBCRXBT-GUBZILKMSA-N Ser-Met-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RQXDSYQXBCRXBT-GUBZILKMSA-N 0.000 description 1
- NIOYDASGXWLHEZ-CIUDSAMLSA-N Ser-Met-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOYDASGXWLHEZ-CIUDSAMLSA-N 0.000 description 1
- FOOZNBRFRWGBNU-DCAQKATOSA-N Ser-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N FOOZNBRFRWGBNU-DCAQKATOSA-N 0.000 description 1
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 1
- QSHKTZVJGDVFEW-GUBZILKMSA-N Ser-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N QSHKTZVJGDVFEW-GUBZILKMSA-N 0.000 description 1
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 1
- HJAXVYLCKDPPDF-SRVKXCTJSA-N Ser-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N HJAXVYLCKDPPDF-SRVKXCTJSA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- AABIBDJHSKIMJK-FXQIFTODSA-N Ser-Ser-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O AABIBDJHSKIMJK-FXQIFTODSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- WMZVVNLPHFSUPA-BPUTZDHNSA-N Ser-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 WMZVVNLPHFSUPA-BPUTZDHNSA-N 0.000 description 1
- UQGAAZXSCGWMFU-UBHSHLNASA-N Ser-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N UQGAAZXSCGWMFU-UBHSHLNASA-N 0.000 description 1
- BCAVNDNYOGTQMQ-AAEUAGOBSA-N Ser-Trp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O BCAVNDNYOGTQMQ-AAEUAGOBSA-N 0.000 description 1
- VAIWUNAAPZZGRI-IHPCNDPISA-N Ser-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N VAIWUNAAPZZGRI-IHPCNDPISA-N 0.000 description 1
- YXEYTHXDRDAIOJ-CWRNSKLLSA-N Ser-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N)C(=O)O YXEYTHXDRDAIOJ-CWRNSKLLSA-N 0.000 description 1
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 1
- NERYDXBVARJIQS-JYBASQMISA-N Ser-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N)O NERYDXBVARJIQS-JYBASQMISA-N 0.000 description 1
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 1
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- SGZVZUCRAVSPKQ-FXQIFTODSA-N Ser-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N SGZVZUCRAVSPKQ-FXQIFTODSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- LSHUNRICNSEEAN-BPUTZDHNSA-N Ser-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CO)N LSHUNRICNSEEAN-BPUTZDHNSA-N 0.000 description 1
- 229920002125 Sokalan® Polymers 0.000 description 1
- 241000272534 Struthio camelus Species 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- BHEOSNUKNHRBNM-UHFFFAOYSA-N Tetramethylsqualene Natural products CC(=C)C(C)CCC(=C)C(C)CCC(C)=CCCC=C(C)CCC(C)C(=C)CCC(C)C(C)=C BHEOSNUKNHRBNM-UHFFFAOYSA-N 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 1
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 1
- JMQUAZXYFAEOIH-XGEHTFHBSA-N Thr-Arg-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)O JMQUAZXYFAEOIH-XGEHTFHBSA-N 0.000 description 1
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 1
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 1
- IRKWVRSEQFTGGV-VEVYYDQMSA-N Thr-Asn-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IRKWVRSEQFTGGV-VEVYYDQMSA-N 0.000 description 1
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 1
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 1
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 1
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 1
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 1
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 1
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 1
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 1
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 1
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 1
- NRBUKAHTWRCUEQ-XGEHTFHBSA-N Thr-Cys-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(O)=O NRBUKAHTWRCUEQ-XGEHTFHBSA-N 0.000 description 1
- MMTOHPRBJKEZHT-BWBBJGPYSA-N Thr-Cys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O MMTOHPRBJKEZHT-BWBBJGPYSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 1
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 1
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 1
- MQUZMZBFKCHVOB-HJGDQZAQSA-N Thr-Gln-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O MQUZMZBFKCHVOB-HJGDQZAQSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- VULNJDORNLBPNG-SWRJLBSHSA-N Thr-Glu-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VULNJDORNLBPNG-SWRJLBSHSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 1
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 1
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- WBCCCPZIJIJTSD-TUBUOCAGSA-N Thr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H]([C@@H](C)O)N WBCCCPZIJIJTSD-TUBUOCAGSA-N 0.000 description 1
- UDNVOQMPQBEITB-MEYUZBJRSA-N Thr-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UDNVOQMPQBEITB-MEYUZBJRSA-N 0.000 description 1
- YUOCMLNTUZAGNF-KLHWPWHYSA-N Thr-His-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N)O YUOCMLNTUZAGNF-KLHWPWHYSA-N 0.000 description 1
- YDWLCDQXLCILCZ-BWAGICSOSA-N Thr-His-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YDWLCDQXLCILCZ-BWAGICSOSA-N 0.000 description 1
- LCCSEJSPBWKBNT-OSUNSFLBSA-N Thr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N LCCSEJSPBWKBNT-OSUNSFLBSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- XUGYQLFEJYZOKQ-NGTWOADLSA-N Thr-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XUGYQLFEJYZOKQ-NGTWOADLSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- CJXURNZYNHCYFD-WDCWCFNPSA-N Thr-Lys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CJXURNZYNHCYFD-WDCWCFNPSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 1
- QFCQNHITJPRQTB-IEGACIPQSA-N Thr-Lys-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O QFCQNHITJPRQTB-IEGACIPQSA-N 0.000 description 1
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 1
- JMBRNXUOLJFURW-BEAPCOKYSA-N Thr-Phe-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N)O JMBRNXUOLJFURW-BEAPCOKYSA-N 0.000 description 1
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 1
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 1
- NBIIPOKZPUGATB-BWBBJGPYSA-N Thr-Ser-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O NBIIPOKZPUGATB-BWBBJGPYSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- XZUBGOYOGDRYFC-XGEHTFHBSA-N Thr-Ser-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O XZUBGOYOGDRYFC-XGEHTFHBSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- SOUPNXUJAJENFU-SWRJLBSHSA-N Thr-Trp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O SOUPNXUJAJENFU-SWRJLBSHSA-N 0.000 description 1
- XEVHXNLPUBVQEX-DVJZZOLTSA-N Thr-Trp-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N)O XEVHXNLPUBVQEX-DVJZZOLTSA-N 0.000 description 1
- JNKAYADBODLPMQ-HSHDSVGOSA-N Thr-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)=CNC2=C1 JNKAYADBODLPMQ-HSHDSVGOSA-N 0.000 description 1
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 1
- BZTSQFWJNJYZSX-JRQIVUDYSA-N Thr-Tyr-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O BZTSQFWJNJYZSX-JRQIVUDYSA-N 0.000 description 1
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 1
- RPECVQBNONKZAT-WZLNRYEVSA-N Thr-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H]([C@@H](C)O)N RPECVQBNONKZAT-WZLNRYEVSA-N 0.000 description 1
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 1
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 1
- SBYQHZCMVSPQCS-RCWTZXSCSA-N Thr-Val-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O SBYQHZCMVSPQCS-RCWTZXSCSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 description 1
- MQVGIFJSFFVGFW-XEGUGMAKSA-N Trp-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MQVGIFJSFFVGFW-XEGUGMAKSA-N 0.000 description 1
- MJBBMTOGSOSAKJ-HJXMPXNTSA-N Trp-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MJBBMTOGSOSAKJ-HJXMPXNTSA-N 0.000 description 1
- ICNFHVUVCNWUAB-SZMVWBNQSA-N Trp-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ICNFHVUVCNWUAB-SZMVWBNQSA-N 0.000 description 1
- DEZKIRSBKKXUEV-NYVOZVTQSA-N Trp-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N DEZKIRSBKKXUEV-NYVOZVTQSA-N 0.000 description 1
- WPSYJHFHZYJXMW-JSGCOSHPSA-N Trp-Gln-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O WPSYJHFHZYJXMW-JSGCOSHPSA-N 0.000 description 1
- MDDYTWOFHZFABW-SZMVWBNQSA-N Trp-Gln-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 MDDYTWOFHZFABW-SZMVWBNQSA-N 0.000 description 1
- NXJZCPKZIKTYLX-XEGUGMAKSA-N Trp-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NXJZCPKZIKTYLX-XEGUGMAKSA-N 0.000 description 1
- SNJAPSVIPKUMCK-NWLDYVSISA-N Trp-Glu-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SNJAPSVIPKUMCK-NWLDYVSISA-N 0.000 description 1
- WLBZWXXGSOLJBA-HOCLYGCPSA-N Trp-Gly-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 WLBZWXXGSOLJBA-HOCLYGCPSA-N 0.000 description 1
- WSGPBCAGEGHKQJ-BBRMVZONSA-N Trp-Gly-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WSGPBCAGEGHKQJ-BBRMVZONSA-N 0.000 description 1
- HNIWONZFMIPCCT-SIXJUCDHSA-N Trp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N HNIWONZFMIPCCT-SIXJUCDHSA-N 0.000 description 1
- YDTKYBHPRULROG-LTHWPDAASA-N Trp-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N YDTKYBHPRULROG-LTHWPDAASA-N 0.000 description 1
- CXPJPTFWKXNDKV-NUTKFTJISA-N Trp-Leu-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CXPJPTFWKXNDKV-NUTKFTJISA-N 0.000 description 1
- CMXACOZDEJYZSK-XIRDDKMYSA-N Trp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N CMXACOZDEJYZSK-XIRDDKMYSA-N 0.000 description 1
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 1
- CFMGQWYCEJDTDG-XIRDDKMYSA-N Trp-Lys-Cys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(O)=O)=CNC2=C1 CFMGQWYCEJDTDG-XIRDDKMYSA-N 0.000 description 1
- NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 1
- RQLNEFOBQAVGSY-WDSOQIARSA-N Trp-Met-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQLNEFOBQAVGSY-WDSOQIARSA-N 0.000 description 1
- ACGIVBXINJFALS-HKUYNNGSSA-N Trp-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N ACGIVBXINJFALS-HKUYNNGSSA-N 0.000 description 1
- UEFHVUQBYNRNQC-SFJXLCSZSA-N Trp-Phe-Thr Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=CC=C1 UEFHVUQBYNRNQC-SFJXLCSZSA-N 0.000 description 1
- GFUOTIPYXKAPAH-BVSLBCMMSA-N Trp-Pro-Phe Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GFUOTIPYXKAPAH-BVSLBCMMSA-N 0.000 description 1
- OJKVFAWXPGCJMF-BPUTZDHNSA-N Trp-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CO)C(=O)O OJKVFAWXPGCJMF-BPUTZDHNSA-N 0.000 description 1
- GEGYPBOPIGNZIF-CWRNSKLLSA-N Trp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O GEGYPBOPIGNZIF-CWRNSKLLSA-N 0.000 description 1
- GSCPHMSPGQSZJT-JYBASQMISA-N Trp-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O GSCPHMSPGQSZJT-JYBASQMISA-N 0.000 description 1
- SEXRBCGSZRCIPE-LYSGOOTNSA-N Trp-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O SEXRBCGSZRCIPE-LYSGOOTNSA-N 0.000 description 1
- RKISDJMICOREEL-QRTARXTBSA-N Trp-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RKISDJMICOREEL-QRTARXTBSA-N 0.000 description 1
- CYLQUSBOSWCHTO-BPUTZDHNSA-N Trp-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N CYLQUSBOSWCHTO-BPUTZDHNSA-N 0.000 description 1
- PALLCTDPFINNMM-JQHSSLGASA-N Trp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N PALLCTDPFINNMM-JQHSSLGASA-N 0.000 description 1
- MXKUGFHWYYKVDV-SZMVWBNQSA-N Trp-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(C)C)C(O)=O MXKUGFHWYYKVDV-SZMVWBNQSA-N 0.000 description 1
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 1
- OOEUVMFKKZYSRX-LEWSCRJBSA-N Tyr-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OOEUVMFKKZYSRX-LEWSCRJBSA-N 0.000 description 1
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 1
- KSVMDJJCYKIXTK-IGNZVWTISA-N Tyr-Ala-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 KSVMDJJCYKIXTK-IGNZVWTISA-N 0.000 description 1
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 1
- AKXBNSZMYAOGLS-STQMWFEESA-N Tyr-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AKXBNSZMYAOGLS-STQMWFEESA-N 0.000 description 1
- QNJYPWZACBACER-KKUMJFAQSA-N Tyr-Asp-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O QNJYPWZACBACER-KKUMJFAQSA-N 0.000 description 1
- NLMXVDDEQFKQQU-CFMVVWHZSA-N Tyr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NLMXVDDEQFKQQU-CFMVVWHZSA-N 0.000 description 1
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 1
- YLRLHDFMMWDYTK-KKUMJFAQSA-N Tyr-Cys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 YLRLHDFMMWDYTK-KKUMJFAQSA-N 0.000 description 1
- BVDHHLMIZFCAAU-BZSNNMDCSA-N Tyr-Cys-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BVDHHLMIZFCAAU-BZSNNMDCSA-N 0.000 description 1
- UMXSDHPSMROQRB-YJRXYDGGSA-N Tyr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UMXSDHPSMROQRB-YJRXYDGGSA-N 0.000 description 1
- YWXMGBUGMLJMIP-IHPCNDPISA-N Tyr-Cys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC3=CC=C(C=C3)O)N YWXMGBUGMLJMIP-IHPCNDPISA-N 0.000 description 1
- CKHQKYHIZCRTAP-SOUVJXGZSA-N Tyr-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CKHQKYHIZCRTAP-SOUVJXGZSA-N 0.000 description 1
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 1
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 1
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 1
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 1
- JKUZFODWJGEQAP-KBPBESRZSA-N Tyr-Gly-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O JKUZFODWJGEQAP-KBPBESRZSA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- SFSZDJHNAICYSD-PMVMPFDFSA-N Tyr-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CC4=CC=C(C=C4)O)N SFSZDJHNAICYSD-PMVMPFDFSA-N 0.000 description 1
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 1
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 1
- OHOVFPKXPZODHS-SJWGOKEGSA-N Tyr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OHOVFPKXPZODHS-SJWGOKEGSA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 1
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 1
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 1
- KZOZXAYPVKKDIO-UFYCRDLUSA-N Tyr-Met-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 KZOZXAYPVKKDIO-UFYCRDLUSA-N 0.000 description 1
- KHUVIWRRFMPVHD-JYJNAYRXSA-N Tyr-Met-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O KHUVIWRRFMPVHD-JYJNAYRXSA-N 0.000 description 1
- BGFCXQXETBDEHP-BZSNNMDCSA-N Tyr-Phe-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O BGFCXQXETBDEHP-BZSNNMDCSA-N 0.000 description 1
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 1
- FGVFBDZSGQTYQX-UFYCRDLUSA-N Tyr-Phe-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O FGVFBDZSGQTYQX-UFYCRDLUSA-N 0.000 description 1
- VPEFOFYNHBWFNQ-UFYCRDLUSA-N Tyr-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 VPEFOFYNHBWFNQ-UFYCRDLUSA-N 0.000 description 1
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- PLVVHGFEMSDRET-IHPCNDPISA-N Tyr-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC3=CC=C(C=C3)O)N PLVVHGFEMSDRET-IHPCNDPISA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 1
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 1
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 1
- AKKYBQGHUAWPJR-MNSWYVGCSA-N Tyr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)O AKKYBQGHUAWPJR-MNSWYVGCSA-N 0.000 description 1
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 1
- ABSXSJZNRAQDDI-KJEVXHAQSA-N Tyr-Val-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ABSXSJZNRAQDDI-KJEVXHAQSA-N 0.000 description 1
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- 108010064997 VPY tripeptide Proteins 0.000 description 1
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- LABUITCFCAABSV-BPNCWPANSA-N Val-Ala-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-BPNCWPANSA-N 0.000 description 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 1
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 1
- XPYNXORPPVTVQK-SRVKXCTJSA-N Val-Arg-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N XPYNXORPPVTVQK-SRVKXCTJSA-N 0.000 description 1
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 1
- GNWUWQAVVJQREM-NHCYSSNCSA-N Val-Asn-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GNWUWQAVVJQREM-NHCYSSNCSA-N 0.000 description 1
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 1
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 1
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 1
- ICFRWCLVYFKHJV-FXQIFTODSA-N Val-Cys-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N ICFRWCLVYFKHJV-FXQIFTODSA-N 0.000 description 1
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 1
- IRLYZKKNBFPQBW-XGEHTFHBSA-N Val-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N)O IRLYZKKNBFPQBW-XGEHTFHBSA-N 0.000 description 1
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 1
- AAOPYWQQBXHINJ-DZKIICNBSA-N Val-Gln-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AAOPYWQQBXHINJ-DZKIICNBSA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- RFKJNTRMXGCKFE-FHWLQOOXSA-N Val-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC(C)C)C(O)=O)=CNC2=C1 RFKJNTRMXGCKFE-FHWLQOOXSA-N 0.000 description 1
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 1
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 1
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 1
- OJPRSVJGNCAKQX-SRVKXCTJSA-N Val-Met-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OJPRSVJGNCAKQX-SRVKXCTJSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 1
- QPPZEDOTPZOSEC-RCWTZXSCSA-N Val-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N)O QPPZEDOTPZOSEC-RCWTZXSCSA-N 0.000 description 1
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 1
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 1
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- SUGRIIAOLCDLBD-ZOBUZTSGSA-N Val-Trp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SUGRIIAOLCDLBD-ZOBUZTSGSA-N 0.000 description 1
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 1
- JPBGMZDTPVGGMQ-ULQDDVLXSA-N Val-Tyr-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JPBGMZDTPVGGMQ-ULQDDVLXSA-N 0.000 description 1
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- 108010084455 Zeocin Proteins 0.000 description 1
- 108010036951 achatin I Proteins 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- HDTRYLNUVZCQOY-LIZSDCNHSA-N alpha,alpha-trehalose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-LIZSDCNHSA-N 0.000 description 1
- ILRRQNADMUWWFW-UHFFFAOYSA-K aluminium phosphate Chemical compound O1[Al]2OP1(=O)O2 ILRRQNADMUWWFW-UHFFFAOYSA-K 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 108010080488 arginyl-arginyl-leucine Proteins 0.000 description 1
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 1
- 238000010420 art technique Methods 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 230000002238 attenuated effect Effects 0.000 description 1
- 239000012752 auxiliary agent Substances 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 102000006635 beta-lactamase Human genes 0.000 description 1
- VEZXCJBBBCKRPI-UHFFFAOYSA-N beta-propiolactone Chemical compound O=C1CCO1 VEZXCJBBBCKRPI-UHFFFAOYSA-N 0.000 description 1
- 239000011230 binding agent Substances 0.000 description 1
- 239000012888 bovine serum Substances 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 238000010805 cDNA synthesis kit Methods 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 239000002775 capsule Substances 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 239000005018 casein Substances 0.000 description 1
- BECPQYXYKAMYBN-UHFFFAOYSA-N casein, tech. Chemical compound NCCCCC(C(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(CC(C)C)N=C(O)C(CCC(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(C(C)O)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(COP(O)(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(N)CC1=CC=CC=C1 BECPQYXYKAMYBN-UHFFFAOYSA-N 0.000 description 1
- 235000021240 caseins Nutrition 0.000 description 1
- 230000032823 cell division Effects 0.000 description 1
- 230000007910 cell fusion Effects 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 210000004520 cell wall skeleton Anatomy 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 208000019425 cirrhosis of liver Diseases 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 238000000975 co-precipitation Methods 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000007123 defense Effects 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 238000004090 dissolution Methods 0.000 description 1
- PRAKJMSDJKAYCZ-UHFFFAOYSA-N dodecahydrosqualene Natural products CC(C)CCCC(C)CCCC(C)CCCCC(C)CCCC(C)CCCC(C)C PRAKJMSDJKAYCZ-UHFFFAOYSA-N 0.000 description 1
- 239000003937 drug carrier Substances 0.000 description 1
- 241001493065 dsRNA viruses Species 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000012737 fresh medium Substances 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 1
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 208000010710 hepatitis C virus infection Diseases 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 230000005745 host immune response Effects 0.000 description 1
- 238000005984 hydrogenation reaction Methods 0.000 description 1
- 230000002163 immunogen Effects 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 229940072221 immunoglobulins Drugs 0.000 description 1
- 239000003018 immunosuppressive agent Substances 0.000 description 1
- 229940125721 immunosuppressive agent Drugs 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 229940031551 inactivated vaccine Drugs 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 230000001524 infective effect Effects 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000010255 intramuscular injection Methods 0.000 description 1
- 239000007927 intramuscular injection Substances 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- FZWBNHMXJMCXLU-BLAUPYHCSA-N isomaltotriose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OC[C@@H]1[C@@H](O)[C@H](O)[C@@H](O)[C@@H](OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C=O)O1 FZWBNHMXJMCXLU-BLAUPYHCSA-N 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- NLYAJNPCOHFWQQ-UHFFFAOYSA-N kaolin Chemical compound O.O.O=[Al]O[Si](=O)O[Si](=O)O[Al]=O NLYAJNPCOHFWQQ-UHFFFAOYSA-N 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 125000001909 leucine group Chemical group [H]N(*)C(C(*)=O)C([H])([H])C(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 229940124590 live attenuated vaccine Drugs 0.000 description 1
- 229940023012 live-attenuated vaccine Drugs 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 201000007270 liver cancer Diseases 0.000 description 1
- 208000014018 liver neoplasm Diseases 0.000 description 1
- 238000004020 luminiscence type Methods 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- ZLNQQNXFFQJAID-UHFFFAOYSA-L magnesium carbonate Chemical compound [Mg+2].[O-]C([O-])=O ZLNQQNXFFQJAID-UHFFFAOYSA-L 0.000 description 1
- 239000001095 magnesium carbonate Substances 0.000 description 1
- 229910000021 magnesium carbonate Inorganic materials 0.000 description 1
- 235000019359 magnesium stearate Nutrition 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000000691 measurement method Methods 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 108010034507 methionyltryptophan Proteins 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- JMUHBNWAORSSBD-WKYWBUFDSA-N mifamurtide Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@@H](OC(=O)CCCCCCCCCCCCCCC)COP(O)(=O)OCCNC(=O)[C@H](C)NC(=O)CC[C@H](C(N)=O)NC(=O)[C@H](C)NC(=O)[C@@H](C)O[C@H]1[C@H](O)[C@@H](CO)OC(O)[C@@H]1NC(C)=O JMUHBNWAORSSBD-WKYWBUFDSA-N 0.000 description 1
- 229960005225 mifamurtide Drugs 0.000 description 1
- 108700007621 mifamurtide Proteins 0.000 description 1
- 239000002480 mineral oil Substances 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 238000011022 operating instruction Methods 0.000 description 1
- TWNQGVIAIRXVLR-UHFFFAOYSA-N oxo(oxoalumanyloxy)alumane Chemical compound O=[Al]O[Al]=O TWNQGVIAIRXVLR-UHFFFAOYSA-N 0.000 description 1
- 239000006179 pH buffering agent Substances 0.000 description 1
- 238000007911 parenteral administration Methods 0.000 description 1
- 238000002205 phenol-chloroform extraction Methods 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- CWCMIVBLVUHDHK-ZSNHEYEWSA-N phleomycin D1 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC[C@@H](N=1)C=1SC=C(N=1)C(=O)NCCCCNC(N)=N)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C CWCMIVBLVUHDHK-ZSNHEYEWSA-N 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 239000002504 physiological saline solution Substances 0.000 description 1
- 239000006187 pill Substances 0.000 description 1
- 108010025488 pinealon Proteins 0.000 description 1
- 108010031345 placental alkaline phosphatase Proteins 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 229920001515 polyalkylene glycol Polymers 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 229920000136 polysorbate Polymers 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 239000000955 prescription drug Substances 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 230000000069 prophylactic effect Effects 0.000 description 1
- 229960000380 propiolactone Drugs 0.000 description 1
- 108020001775 protein parts Proteins 0.000 description 1
- 229950010131 puromycin Drugs 0.000 description 1
- FGVVTMRZYROCTH-UHFFFAOYSA-N pyridine-2-thiol N-oxide Chemical compound [O-][N+]1=CC=CC=C1S FGVVTMRZYROCTH-UHFFFAOYSA-N 0.000 description 1
- 229960002026 pyrithione Drugs 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 238000003753 real-time PCR Methods 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- CVHZOJJKTDOEJC-UHFFFAOYSA-N saccharin Chemical compound C1=CC=C2C(=O)NS(=O)(=O)C2=C1 CVHZOJJKTDOEJC-UHFFFAOYSA-N 0.000 description 1
- 150000003354 serine derivatives Chemical group 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 235000020183 skimmed milk Nutrition 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 229940031439 squalene Drugs 0.000 description 1
- TUHBEKDERLKLEC-UHFFFAOYSA-N squalene Natural products CC(=CCCC(=CCCC(=CCCC=C(/C)CCC=C(/C)CC=C(C)C)C)C)C TUHBEKDERLKLEC-UHFFFAOYSA-N 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 238000010254 subcutaneous injection Methods 0.000 description 1
- 239000007929 subcutaneous injection Substances 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 230000019635 sulfation Effects 0.000 description 1
- 238000005670 sulfation reaction Methods 0.000 description 1
- 238000013268 sustained release Methods 0.000 description 1
- 239000012730 sustained-release form Substances 0.000 description 1
- 239000003826 tablet Substances 0.000 description 1
- 229940021747 therapeutic vaccine Drugs 0.000 description 1
- 210000001541 thymus gland Anatomy 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- GPRLSGONYQIRFK-MNYXATJNSA-N triton Chemical compound [3H+] GPRLSGONYQIRFK-MNYXATJNSA-N 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 241000712461 unidentified influenza virus Species 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 238000002255 vaccination Methods 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 235000015112 vegetable and seed oil Nutrition 0.000 description 1
- 239000008158 vegetable oil Substances 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 230000029812 viral genome replication Effects 0.000 description 1
- 229960004854 viral vaccine Drugs 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 239000000080 wetting agent Substances 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N7/00—Viruses; Bacteriophages; Compositions thereof; Preparation or purification thereof
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K39/12—Viral antigens
- A61K39/29—Hepatitis virus
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P1/00—Drugs for disorders of the alimentary tract or the digestive system
- A61P1/16—Drugs for disorders of the alimentary tract or the digestive system for liver or gallbladder disorders, e.g. hepatoprotective agents, cholagogues, litholytics
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
- A61P31/14—Antivirals for RNA viruses
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
- C07K14/08—RNA viruses
- C07K14/18—Togaviridae; Flaviviridae
- C07K14/1816—Flaviviridae, e.g. pestivirus, mucosal disease virus, bovine viral diarrhoea virus, classical swine fever virus (hog cholera virus), border disease virus
- C07K14/1833—Hepatitis C; Hepatitis NANB
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/24011—Flaviviridae
- C12N2770/24211—Hepacivirus, e.g. hepatitis C virus, hepatitis G virus
- C12N2770/24221—Viruses as such, e.g. new isolates, mutants or their genomic sequences
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/24011—Flaviviridae
- C12N2770/24211—Hepacivirus, e.g. hepatitis C virus, hepatitis G virus
- C12N2770/24222—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/24011—Flaviviridae
- C12N2770/24211—Hepacivirus, e.g. hepatitis C virus, hepatitis G virus
- C12N2770/24241—Use of virus, viral particle or viral elements as a vector
- C12N2770/24243—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2770/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses positive-sense
- C12N2770/00011—Details
- C12N2770/24011—Flaviviridae
- C12N2770/24211—Hepacivirus, e.g. hepatitis C virus, hepatitis G virus
- C12N2770/24251—Methods of production or purification of viral material
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Virology (AREA)
- Medicinal Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Biochemistry (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Gastroenterology & Hepatology (AREA)
- Molecular Biology (AREA)
- Communicable Diseases (AREA)
- Microbiology (AREA)
- Immunology (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Pharmacology & Pharmacy (AREA)
- Animal Behavior & Ethology (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Mycology (AREA)
- Epidemiology (AREA)
- Oncology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
本发明的目的在于提供在细胞培养系统中显示出病毒高生产能的HCV株。本发明提供核酸,该核酸编码包含1个以上氨基酸取代的丙型肝炎病毒JFH1株的前体多聚蛋白,当以序列表的SEQ ID NO:2所示的氨基酸序列为基准时,上述前体多聚蛋白中至少第862位的谷氨酰胺被取代成精氨酸。
Description
技术领域
本发明涉及感染性丙型肝炎病毒高生产HCV突变体、其基因组核酸和导入有其基因组核酸的细胞。本发明还涉及生产感染性HCV颗粒的方法、抗HCV药物的筛选方法。
背景技术
丙型肝炎病毒(Hepatitis C virus;HCV)于1989年由Choo等人发现并确定为非甲非乙型肝炎的病因病毒(非专利文献1)。在感染HCV形成慢性肝炎后,HCV仍持续感染向肝硬化、甚至肝癌转变。据报道,全球有约1亿7千万HCV感染者,日本就存在约200万HCV感染者。HCV的主要感染途径为血液感染。自从可以进行输血用血液的筛选后,新的感染者锐减,但仍然存在许多病毒携带者(virus carrier)。
目前,慢性丙型肝炎的治疗方法主要是给予PEG化干扰素、或者将PEG化干扰素与抗病毒药利巴韦林结合使用。迄今为止,人们将HCV分为6个基因型,在日本主要是基因型1b和2a的HCV感染病例。特别是基因型1b的HCV,其现实状况是:通过给予干扰素和利巴韦林并不能将病毒从体内完全去除、治疗效果并不充分(非专利文献2和3)。因此,人们希望开发以预防丙型肝炎发病或消除HCV病毒为目的的、新的抗病毒药或疫苗。
病毒疫苗的种类有:使用病毒蛋白作为抗原的成分疫苗、使用病毒颗粒作为抗原的疫苗、以及使用编码病毒蛋白的基因作为抗原的DNA疫苗。以病毒颗粒作为抗原的疫苗有:减毒活疫苗和灭活疫苗。制造以病毒颗粒作为抗原的疫苗时,必需有制造高纯度的病毒颗粒的系统,在该系统中必需有病毒颗粒的高生产培养系统。
丙型肝炎病毒(HCV)是具有约9.6kb+链的单链RNA作为基因组的病毒。HCV的单链RNA基因组编码包含10种蛋白(核心蛋白、E1、E2、p7、NS2、NS3、NS4A、NS4B、NS5A和NS5B)的单根多聚蛋白(前体多聚蛋白(polyprotein precursor))。由HCV RNA基因组翻译的前体多聚蛋白被切成各个蛋白,发挥病毒蛋白的功能。
有人开发了利用培养细胞系统自主复制HCV RNA的复制子系统,并在多种HCV研究中使用。典型的亚基因组复制子为:将HCV基因组的结构蛋白区重组到耐药基因等标记基因中、并在其下游插入有EMCV(脑心肌炎病毒)的IRES的复制子。在导入有该亚基因组复制子RNA的培养细胞内确认到HCV RNA的复制(专利文献1)。通过研究HCV亚基因组复制子的复制,表明HCV基因组的遗传突变有时会显示出提高复制子的复制效率的效果,这种遗传突变被称作适应性突变(adaptive mutation)(专利文献1)。
有资料显示:来自基因型1b的Con1株的亚基因组复制子pFK-I389neo/NS3-39/wt(Con1/wt)的突变体、即在NS3-NS5A区具有适应性突变的NK5.1株(Con1/NK5.1)与野生型Con1/wt相比具有约10倍的增殖能力(非专利文献4)。另一方面,在分析具有来自JFH1的亚基因组复制子的复制子复制细胞中所含的复制子的核苷酸序列的论文(非专利文献5)中记载着:6个克隆中,有5个克隆虽然在各自的来自HCV基因组的区中确认到若干个突变,但在这些突变中并没有确认到共同的突变,而剩下的1个克隆是没有发生氨基酸突变的碱基突变,这表明:尽管JFH1株在Huh7细胞中没有适应性突变,但仍可增殖。
关于细胞培养系统中的HCV生产,Wakita等人研究表明:将从重症肝炎患者中分离的、属于基因型2a的来自HCV株JFH1的全长基因组HCV复制子导入Huh-7细胞中,可以生产感染性HCV颗粒(专利文献2和3、以及非专利文献6)。另外,Kaul等人报道了:JFH1株的NS5A蛋白的突变会带来较野生型JFH1株高约10倍的病毒生产量(非专利文献7)。
有报道称:在细胞培养系统中HCV JFH1株病毒的病毒颗粒生产能力为4.6×104FFU/mL(非专利文献8),这与所报道的、在细胞培养系统中流感病毒的病毒颗粒生产能力为约4×109PFU/mL(非专利文献9)相比是非常低的。为了制造使用HCV颗粒作为抗原的疫苗,人们要求开发病毒颗粒生产能力更高的HCV株。
专利文献1:国际公开WO2004/104198
专利文献2:国际公开WO2005/080575
专利文献3:国际公开WO2006/22422
非专利文献1:Choo等人,Science(1989)244(4902)第359-362页
非专利文献2:Fried等人,N.Engl.J.Med.(2002)第347卷,No.13第975-982页
非专利文献3:Lusida等人,J.Clin.Microbiol.(2001)39(11)第3858-3864页
非专利文献4:Krieger等人,J.Virol.(2001)70:4614-4624
非专利文献5:Kato等人,Gastroenterology(2003)125:1808-1817
非专利文献6:Wakita等人,Nat Med.(2005)11(7)第791-796页
非专利文献7:Kaul等人,J.Virol.(2007)81(23)第13168-13179页
非专利文献8:Zhong等人,Proc.Natl.Acad.Sci.U.S.A.(2005)102(26)第9294-9299页
非专利文献9:Tree等人,Vaccine(2001)19(25-26)第3444-3450页
发明内容
发明所要解决的课题
本发明的目的在于提供:在细胞培养系统中显示出病毒高生产能的HCV株。
解决课题的方法
本发明人等为了解决上述课题反复进行了深入研究,结果发现:若干个氨基酸突变会显著提高JFH1株的病毒生产能力,从而完成了本发明。
即,本发明包含下述[1]~[14]。
[1]核酸,该核酸编码包含1个以上氨基酸取代的丙型肝炎病毒JFH1株的前体多聚蛋白,当以序列表的SEQ ID NO:2所示的氨基酸序列为基准时,上述前体多聚蛋白中至少第862位的谷氨酰胺被取代成精氨酸。
[2]上述[1]所述的核酸,其中,上述前体多聚蛋白为选自下述(a)~(f)的前体多聚蛋白。
(a)前体多聚蛋白,其中,当以序列表的SEQ ID NO:2所示的氨基酸序列为基准时,第74位的赖氨酸被取代成苏氨酸、第297位的酪氨酸被取代成组氨酸、第330位的丙氨酸被取代成苏氨酸、第395位的丝氨酸被取代成脯氨酸、第417位的天冬酰胺被取代成丝氨酸、第483位的天冬氨酸被取代成甘氨酸、第501位的丙氨酸被取代成苏氨酸、第862位的谷氨酰胺被取代成精氨酸、第931位的谷氨酰胺被取代成精氨酸、以及第961位的丝氨酸被取代成丙氨酸;
(b)前体多聚蛋白,其中,当以序列表的SEQ ID NO:2所示的氨基酸序列为基准时,第31位的缬氨酸被取代成丙氨酸、第74位的赖氨酸被取代成苏氨酸、第451位的甘氨酸被取代成精氨酸、第756位的缬氨酸被取代成丙氨酸、第786位的缬氨酸被取代成丙氨酸、以及第862位的谷氨酰胺被取代成精氨酸;
(c)前体多聚蛋白,其中,当以序列表的SEQ ID NO:2所示的氨基酸序列为基准时,第74位的赖氨酸被取代成苏氨酸、第451位的甘氨酸被取代成精氨酸、第756位的缬氨酸被取代成丙氨酸、第786位的缬氨酸被取代成丙氨酸、以及第862位的谷氨酰胺被取代成精氨酸;
(d)前体多聚蛋白,其中,当以序列表的SEQ ID NO:2所示的氨基酸序列为基准时,第31位的缬氨酸被取代成丙氨酸、第74位的赖氨酸被取代成苏氨酸、第451位的甘氨酸被取代成精氨酸、第786位的缬氨酸被取代成丙氨酸、以及第862位的谷氨酰胺被取代成精氨酸;
(e)前体多聚蛋白,其中,当以序列表的SEQ ID NO:2所示的氨基酸序列为基准时,第31位的缬氨酸被取代成丙氨酸、第74位的赖氨酸被取代成苏氨酸、第451位的甘氨酸被取代成精氨酸、第756位的缬氨酸被取代成丙氨酸、以及第862位的谷氨酰胺被取代成精氨酸;
(f)前体多聚蛋白,其中,当以序列表的SEQ ID NO:2所示的氨基酸序列为基准时,仅第862位的谷氨酰胺被取代成精氨酸。
[3]上述[2]所述的核酸,该核酸包含序列表的SEQ ID NO:3、SEQID NO:4或SEQ ID NO:5所示的核苷酸序列。
[4]上述[1]或[2]所述的核酸,其中,编码报道蛋白的核酸被插入在编码上述前体多聚蛋白的NS5A蛋白的区内。
[5]上述[4]所述的核酸,其中,当以序列表的SEQ ID NO:2所示的氨基酸序列为基准时,上述报道蛋白被整合到第2394位氨基酸残基与第2395位氨基酸残基之间,作为融合蛋白被翻译。
[6]上述[5]所述的核酸,该核酸包含序列表的SEQ ID NO:6或SEQ ID NO:7所示的核苷酸序列。
[7]丙型肝炎病毒颗粒,该病毒颗粒包含上述[1]或[2]所述的核酸。
[8]培养细胞,该培养细胞生产上述[7]所述的丙型肝炎病毒颗粒。
[9]丙型肝炎病毒疫苗,该疫苗是将上述[7]所述的丙型肝炎病毒颗粒灭活而得到的。
本发明还包含以下发明。
[10]丙型肝炎病毒颗粒,该病毒颗粒包含上述[4]所述的核酸。
[11]培养细胞,该培养细胞生产上述[10]所述的丙型肝炎病毒颗粒。
[12]载体,该载体包含上述[1]~[6]中任一项所述的核酸。
[13]筛选抗丙型肝炎病毒物质的方法,该方法具备下述步骤:在受检物质的存在下,培养生产包含上述[4]或[5]所述的核酸的丙型肝炎病毒颗粒的培养细胞的步骤;以及检测所得培养物中的上述报道蛋白,当上述报道蛋白的表达量低时,判定为上述受检物质具有抗丙型肝炎病毒活性的步骤。
[14]抗丙型肝炎病毒抗体,该抗体将上述[7]所述的丙型肝炎病毒颗粒识别为抗原。
发明效果
通过本发明,能够提供感染性HCV颗粒的高产生株。通过使用该感染性HCV颗粒的高产生株,可以提供高HCV产生系统。
附图说明
图1显示为了获得JFH1适应突变体而进行的实验流程。图中,“C”表示编码核心(core)蛋白的区、“E1”表示编码E1蛋白的区、“E2”表示编码E2蛋白的区、“p7”表示编码p7蛋白的区、“2”表示编码NS2蛋白的区、“3”表示编码NS3蛋白的区、“4A”表示编码NS4A蛋白的区、“4B”表示编码NS4B蛋白的区、“5A”表示编码NS5A蛋白的区、“5B”表示编码NS5B蛋白的区,另外,与C(核心)相邻的5’末端部分表示5非翻译区,与5B(NS5B)相邻的3’末端部分表示3’非翻译区(图9、10和15中亦同)。
图2显示将JFH1病毒感染细胞继代培养2年而得到的JFH1适应突变体(JFH1a)的复制能力。
图3显示JFH1a与JFH1wt的性状比较。纵轴为与未添加IFN的对照进行比较的相对复制率(relative replication)(%)。“o”为JFH1wt的数据,“■”为JFH1a的数据。
图4显示通过解析6个克隆的序列发现的JFH1a中的氨基酸突变。
图4中,在6个克隆中的2个以上克隆中发现的氨基酸突变上注上“*”。
图5是显示用于解析复制能力和感染性的野生型JFH1wt和HCV突变体的前体多聚蛋白编码区的结构和突变导入位点的概略图。进行了突变解析的区(AgeI-SpeI片段)以灰色显示。突变导入位点以星号显示。
图6显示野生型JFH1wt与突变体的感染性比较结果。WT表示JFH1wt、A/WT表示JFH1-A/WT、B/WT表示JFH1-B/WT、Mut5表示JFH1-mut5(本申请说明书和附图的其他部分亦同)。A:转染后的细胞内核心蛋白量的比较,B:释放到培养上清中的核心蛋白量的比较,C:培养上清的感染滴度的比较,D:比活性(相对比感染滴度。比活性=(培养上清的感染滴度)/培养上清中的核心蛋白量))的比较。棒图A~C从左到右分别显示24小时后、48小时后、72小时后和96小时后的数据。
图7显示转染后的JFH1wt及其突变体的感染滴度在长期感染系统中的时间变化。*:JFH1a、△:JFH1-B/WT、×:JFH1-Mut5、□:JFH1-A/WT、◇:JFH1wt。
图8是显示JFH1wt和JFH1突变体在细胞感染的72小时后形成的转化灶(フォ一カス,focus)的大小的照片。转化灶的大小表示感染的传播能力。A:JFH1-A/WT,B:JFH1-B/WT,C:JFH1a,D:JFH1-Mut5,E:JFH1wt。
图9显示在JFH1-B/WT的6个位置氨基酸突变中仅1个位置氨基酸突变恢复成野生型氨基酸的6种突变体的前体多聚蛋白编码区的结构图。星号显示保持JFH1-B/WT的氨基酸突变的位点。
图10显示将在JFH1-B/WT的6个位置见到的氨基酸突变各1个导入野生型JFH1wt中的突变体的前体多聚蛋白编码区的结构图。星号显示导入有来自JFH1-B/WT的氨基酸突变的位点。
图11显示图9所示的突变体HCV(克隆)的感染滴度和病毒生产量。A:显示各突变体的培养上清中的感染滴度,这代表感染性的细胞外释放水平。B:显示各突变体的释放到培养上清中的细胞外核心蛋白量。31-、74-、451-、756-、786-、862-、451+、WT、B/WT分别表示31-(A31V)、74-(T74K)、451-(R451G)、756-(A756V)、786-(A786V)、862-(R862Q)、451+(G451R)、JFH1wt和JFH1-B/WT(本申请说明书和附图的其他部分亦同)。
图12显示图10所示的突变体HCV(克隆)的感染滴度和病毒生产量。A:显示各突变体的培养上清中的感染滴度,这表示感染性的细胞外释放水平。B:显示各突变体的释放到培养上清中的细胞外核心蛋白量。31+、74+、451+、756+、786+、862+、WT、B/WT分别表示31+(V31A)、74+(K74T)、451+(G451R)、756+(V756A)、786+(V786A)、862+(Q862R)、JFH1wt和JFH1-B/WT(本申请说明书和附图的其他部分亦同)。
图13显示图9所示的突变体HCV(克隆)在长期培养(长期感染)中其细胞外核心蛋白量和感染滴度随时间的变化。这还显示各克隆在长期感染中的增殖曲线。A:显示各突变体的培养上清中的细胞外核心蛋白量。B:显示各突变体的培养上清中的细胞外感染滴度。
图14显示图10所示的突变体HCV(克隆)在长期培养(长期感染)中其细胞外核心蛋白量和感染滴度随时间的变化。这还显示各克隆在长期感染中的增殖曲线。A:显示各突变体的培养上清中的细胞外核心蛋白量。B:显示各突变体的培养上清中的细胞外感染滴度。
图15显示在全长基因组HCV序列中整合有报道基因的复制子的结构图。报道基因(Rluc)被插入在复制子的前体多聚蛋白编码区(核心~NS5B)内的第2394位氨基酸与第2395位氨基酸之间。
图16显示整合有报道基因的野生型JFH1wt、突变体JFH1-A/WT-Rlu和JFH1-B/WT-Rluc的培养上清中的感染滴度。图中,WT表示JFH1wt,WT-Rluc、A/WT-Rluc和B/WT-Rluc表示分别整合有Rluc基因的JFH1wt、JFH1-A/WT和JFH1-B/WT。
图17显示使JFH-A/WT-Rluc(图17A)和JFH-B/WT-Rluc(图17B)以100FFU、50FFU、25FFU、12FFU、6FFU、3FFU和0FFU感染Huh7.5.1细胞,并于72小时后测定萤光素酶活性的结果、以及检测依赖于病毒量的萤光素酶活性。
图18显示使用JFH1-A/WT-Rluc和JFH1-B/WT-Rluc病毒的培养细胞感染增殖系统研究干扰素的抗HCV作用的结果。纵轴显示由感染滴度算出的感染抑制率(%)。IFN-α的用量(浓度)从左起依次为100U/ml(白棒)、20、4、1、0U/ml。A:通过萤光素酶分析得到的、干扰素存在下的萤光素酶活性(RLU)的抑制率。B:干扰素存在下的感染滴度(FFU/ml)的抑制率。
具体实施方式
本发明人等通过用JFH1株的HCV全长复制子复制系统进行长达2年的培养,从这些培养细胞中筛选病毒颗粒的增殖能力有所提高的适应突变体,发现了JFH1病毒高生产株,并制作了带有表达报道基因的全长HCV基因组的高感染性病毒颗粒,从而完成了本发明。
本发明涉及高产生性HCV JFH1突变体,该HCV JFH1突变体能够从包含HCV的全长基因组序列、持续复制全长基因组序列、并产生感染性病毒的Huh7细胞中分离。
本发明可以采用该领域技术范围内的分子生物学和病毒学的现有技术来实施。上述技术在文献中已作了充分说明。例如参照Sambrook等人、Molecular Cloning:A Laboratory Manual Cold SpringHarbor Laboratory(第3版,2001)和Mahy等人、Virology:a practicalapproach(1985,IRL PRESS)。
本说明书中引用的所有出版物、专利和专利申请,其全部内容均通过引用而援用在本说明书中。
(1)来自HCV JFH1基因组序列的突变体核酸
本发明涉及核酸,该核酸具有HCV JFH1突变体病毒的基因组序列,所述HCV JFH1突变体在其基因组中导入有使其病毒颗粒生产能力显著提高的适应性突变。本发明的核酸优选包含HCV全长基因组序列。
更具体而言,本发明的核酸是编码在丙型肝炎病毒JFH1株的前体多聚蛋白(优选包含SEQ ID NO:2所示的氨基酸序列的前体多聚蛋白)中导入有氨基酸突变的前体多聚蛋白的核酸;进一步具体而言,本发明的核酸是编码在前体多聚蛋白的从核心到NS2的区包含1个以上氨基酸取代的丙型肝炎病毒JFH1株的前体多聚蛋白的核酸。
本发明的核酸所编码的前体多聚蛋白包含HCV的结构蛋白和非结构蛋白。HCV的结构蛋白是指核心、E1、E2和p7,它们构成HCV的病毒颗粒部分。核心是指核心蛋白,E1和E2是包膜蛋白,p7是形成在宿主细胞膜上发挥作用的离子通道的蛋白。HCV的非结构蛋白是指NS2、NS3、NS4A、NS4B、NS5A和NS5B,它们是具有参与病毒基因组的复制或HCV蛋白加工的活性的酶蛋白。已知HCV中有各种基因型,但已知各种基因型的HCV的基因组具有同样的基因结构(例如可参照图1)。由本发明的核酸编码的前体多聚蛋白优选从N末端到C末端依次包含核心、E1、E2、p7、NS2、NS3、NS4A、NS4B、NS5A和NS5B蛋白部分。由本发明的核酸编码的前体多聚蛋白可以进一步包含选择标记蛋白或报道蛋白等异种蛋白。
本发明的核酸中所含的全长基因组序列在5’末端包含5’非翻译区、在其3’侧包含前体多聚蛋白编码区、在其3’侧以及3’末端包含3’非翻译区。全长基因组序列可以是从5’侧到3’侧依次包含5’非翻译区、核心蛋白编码序列、E1蛋白编码序列、E2蛋白编码序列、p7蛋白编码序列、NS2蛋白编码序列、NS3蛋白编码序列、NS4A蛋白编码序列、NS4B蛋白编码序列、NS5A蛋白编码序列、NS5B蛋白编码序列以及3’非翻译区的序列。
HCV的5’非翻译区(还称作5’UTR或5’NTR)是提供用于蛋白翻译的内部核糖体进入位点(IRES)和复制所必需的元件的、从全长HCV基因组的N末端起约340个核苷酸的区域。
HCV的3’非翻译区(还称作3’UTR或3’NTR)具有辅助HCV复制的功能,除包含polyU区外,还包含约100个核苷酸的附加区(追加領域)。
在本发明中,“复制子RNA”是指具有在细胞内进行自身复制(自主复制)的能力的RNA。由于被导入细胞中的复制子RNA进行自身复制、且其RNA拷贝在细胞分裂后被分配到子细胞中,所以如果使用复制子RNA则可以稳定地导入到细胞中。当本发明的核酸是包含在5’末端含有5’非翻译区、在其3’侧含有前体多聚蛋白编码区、在其3’侧及3’末端含有3’非翻译区的全长基因组序列的RNA(全长基因组RNA)时,本发明的核酸是复制子RNA。
在本发明中,“核酸”中包含RNA和DNA。在本说明书中,“蛋白编码区”、“编码蛋白的序列”是指编码预定蛋白的氨基酸序列、可以包含也可以不包含起始密码子和终止密码子的核苷酸序列。“前体多聚蛋白编码区”、“编码前体多聚蛋白的序列”也同样理解。
本说明书中,在核酸为RNA的情况下,引用序列表的SEQ ID NO来确定RNA的核苷酸序列或碱基时,该SEQ ID NO所示的核苷酸序列中的“T”(胸腺嘧啶)被读作“U”(尿嘧啶)。
本说明书中,“以序列表的SEQ ID NO:2所示的氨基酸序列为基准时第“Y”位的(氨基酸)”是指在SEQ ID NO:2所示的氨基酸序列中,以N末端的第1氨基酸(甲硫氨酸)作为第1位时,该氨基酸是位于第“Y”位的氨基酸残基。
本发明中,丙型肝炎病毒的JFH1株是指由Wakita等人从重症肝炎患者中分离的、属于基因型2a的HCV株(例如WO2005/080575)。HCV的“基因型”是指由Simmonds等人按照国际分类进行分类的基因型。丙型肝炎病毒JFH1株的前体多聚蛋白的氨基酸序列优选为由GenBank Accession No.AB047639中公开的全长基因组序列所编码的序列(SEQ ID NO:2)。JFH1株的全长基因组序列优选为GenBankAccession No.AB047639中公开的核苷酸序列(SEQ ID NO:1)。
本发明的核酸的优选方式为:编码包含1个以上氨基酸取代的丙型肝炎病毒JFH1株的前体多聚蛋白的核酸,当以序列表的SEQ ID NO:2所示的氨基酸序列为基准时,上述1个以上氨基酸取代包含至少一个第862位的谷氨酰胺取代成精氨酸。即,本发明的核酸优选为:编码包含1个以上氨基酸取代的丙型肝炎病毒JFH1株的前体多聚蛋白的核酸,其特征在于:当以序列表的SEQ ID NO:2所示的氨基酸序列为基准时,上述前体多聚蛋白的第862位的谷氨酰胺被取代成精氨酸。该核酸更优选:在5’末端包含5’非翻译区、在其3’侧包含前体多聚蛋白编码区、在其3’侧及3’末端包含3’非翻译区。该前体多聚蛋白编码序列可以进一步包含编码选择标记蛋白或报道蛋白等异种蛋白的核苷酸序列。
导入到该前体多聚蛋白中的1个以上的氨基酸取代至少包含上述第862位的谷氨酰胺取代成精氨酸(Q862R),还优选进一步附加包含下述(1)-(13)中的1个以上的氨基酸取代:
(1)第31位的缬氨酸取代成丙氨酸(V31A)、
(2)第74位的赖氨酸取代成苏氨酸(K47T)、
(3)第297位的酪氨酸取代成组氨酸(Y297H)、
(4)第330位的丙氨酸取代成苏氨酸(A330T)、
(5)第395位的丝氨酸取代成脯氨酸(S395P)、
(6)第417位的天冬酰胺取代成丝氨酸(N417S)、
(7)第451位的甘氨酸取代成精氨酸(G451R)、
(8)第483位的天冬氨酸取代成甘氨酸(D483G)、
(9)第501位的丙氨酸取代成苏氨酸(A501T)、
(10)第756位的缬氨酸取代成丙氨酸(V756A)、
(11)第786位的缬氨酸取代成丙氨酸(V786A)、
(12)第931位的谷氨酰胺取代成精氨酸(Q931R)、以及
(13)第961位的丝氨酸取代成丙氨酸(S961A)。
在本说明书中,例如氨基酸突变Q862R是指第862位的氨基酸残基从Q(谷氨酰胺)取代成R(精氨酸)的突变。其他表示氨基酸突变的书写也同样理解。需要说明的是,其中,氨基酸按照生物学领域中通常使用的氨基酸的单字母符号(Sambrook等人,Molecular Cloning:ALaboratory Manual Second Edition,1989)来记载。
在本说明书中,氨基酸或氨基酸残基以生物学领域通常使用的氨基酸的单字母符号或三字母符号来记载,这也包含进行了氢氧化、糖链加成、硫酸化等翻译后修饰的氨基酸。
如果使用本发明的核酸,则可以制作能够生产病毒颗粒生产能力已显著提高的JFH1突变体病毒的复制子RNA。
本发明核酸的优选例子有:编码前体多聚蛋白的核酸,所述前体多聚蛋白在丙型肝炎病毒JFH1株的前体多聚蛋白的氨基酸序列(优选SEQ ID NO:2所示的氨基酸序列)中导入有下述取代,即以SEQ ID NO:2所示的氨基酸序列为基准时,第74位的赖氨酸取代成苏氨酸、第297位的酪氨酸取代成组氨酸、第330位的丙氨酸取代成苏氨酸、第395位的丝氨酸取代成脯氨酸、第417位的天冬酰胺取代成丝氨酸、第483位的天冬氨酸取代成甘氨酸、第501位的丙氨酸取代成苏氨酸、第862位的谷氨酰胺取代成精氨酸、第931位的谷氨酰胺取代成精氨酸、以及第961位的丝氨酸取代成丙氨酸。该核酸的优选例子见SEQID NO:3。
本发明核酸的另一优选例子有:编码前体多聚蛋白的核酸,所述前体多聚蛋白在丙型肝炎病毒JFH1株的前体多聚蛋白的氨基酸序列(优选SEQ ID NO:2所示的氨基酸序列)中导入有下述取代,即以SEQID NO:2所示的氨基酸序列为基准时,第31位的缬氨酸取代成丙氨酸、第74位的赖氨酸取代成苏氨酸、第451位的甘氨酸取代成精氨酸、第756位的缬氨酸取代成丙氨酸、第786位的缬氨酸取代成丙氨酸、以及第862位的谷氨酰胺取代成精氨酸。该核酸的优选例子见SEQID NO:4。
本发明核酸的另一优选例子有:编码前体多聚蛋白的核酸,所述前体多聚蛋白在丙型肝炎病毒JFH1株的前体多聚蛋白的氨基酸序列(优选SEQ ID NO:2所示的氨基酸序列)中导入有下述取代,即以SEQID NO:2所示的氨基酸序列为基准时,第862位的谷氨酰胺取代成精氨酸。该核酸的优选例子见SEQ ID NO:5。
本发明核酸的另一优选例子有:编码前体多聚蛋白的核酸,所述前体多聚蛋白在丙型肝炎病毒JFH1株的前体多聚蛋白的氨基酸序列(优选SEQ ID NO:2所示的氨基酸序列)中导入有下述取代,即以SEQID NO:2所示的氨基酸序列为基准时,第74位的赖氨酸取代成苏氨酸、第451位的甘氨酸取代成精氨酸、第756位的缬氨酸取代成丙氨酸、第786位的缬氨酸取代成丙氨酸、第862位的谷氨酰胺取代成精氨酸。
本发明核酸的另一优选例子有:编码前体多聚蛋白的核酸,所述前体多聚蛋白在丙型肝炎病毒JFH1株的前体多聚蛋白的氨基酸序列(优选SEQ ID NO:2所示的氨基酸序列)中导入有下述取代,即以SEQID NO:2所示的氨基酸序列为基准时,第31位的缬氨酸取代成丙氨酸、第74位的赖氨酸取代成苏氨酸、第451位的甘氨酸取代成精氨酸、第786位的缬氨酸取代成丙氨酸、第862位的谷氨酰胺取代成精氨酸。
本发明核酸的另一优选例子有:编码前体多聚蛋白的核酸,所述前体多聚蛋白在丙型肝炎病毒JFH1株的前体多聚蛋白的氨基酸序列(优选SEQ ID NO:2所示的氨基酸序列)中导入有下述取代,即以SEQID NO:2所示的氨基酸序列为基准时,第31位的缬氨酸取代成丙氨酸、第74位的赖氨酸取代成苏氨酸、第451位的甘氨酸取代成精氨酸、第756位的缬氨酸取代成丙氨酸、第862位的谷氨酰胺取代成精氨酸。
上述作为本发明的核酸的复制子RNA、或由该核酸制作的复制子RNA、特别是全长基因组复制子RNA(全长基因组HCV RNA),与野生型JFH1株的复制子RNA相比具有显著提高的病毒生产能力。在本说明书中,病毒生产能力优选为培养细胞系统中的病毒颗粒生产能力(优选感染性病毒颗粒生产能力)。本发明的核酸或由该核酸制作的复制子RNA例如与野生型JFH1株的全长基因组复制子RNA相比,具有2倍以上、优选10倍以上、典型的是10倍~10,000倍、例如10倍~1,000倍的病毒生产能力。作为本发明的核酸的全长基因组复制子RNA与来自JFH1株的全长基因组复制子RNA相比,具有2倍以上、优选10倍以上的病毒生产能力,所述JFH1株编码SEQ ID NO:2所示的氨基酸序列的第2440位的缬氨酸被取代成亮氨酸的前体多聚蛋白。野生型JFH1株的全长基因组序列见SEQ ID NO:1。SEQ ID NO:2所示的序列是根据SEQ ID NO:1所示的野生型JFH1株的全长基因组序列编码的前体多聚蛋白的氨基酸序列。
病毒生产能力可以通过测定培养上清的感染滴度来确定。感染滴度的测定可以按照任意的方法进行,但在本说明书中,以通过转化灶法测定的培养上清的感染滴度为基准。具体可以按照后述实施例中记载的方法确定感染滴度。
本发明的核酸或由该核酸制作的复制子RNA显示出高病毒颗粒形成效率。该性质对用于病毒疫苗制造等所必需的病毒蛋白的大量生产也有利。该病毒颗粒形成效率还可以以算出的比活性(=(培养上清中的感染滴度)/(培养上清中的核心蛋白量);相对比感染滴度)的值为指标。具体可以按照后述实施例中所述的方法来确定比活性。
在本发明的核酸中,包含SEQ ID NO:3~5所示的核苷酸序列的核酸(全长基因组复制子RNA)在病毒生产能力方面特别优异。另外,含有包含JFH1株的5’非翻译区、编码根据SEQ ID NO:3~5所示的核苷酸序列编码的突变型前体多聚蛋白的序列和JFH1株的3’非翻译区的全长基因组序列的核酸(全长基因组复制子RNA)也具有高病毒生产能力。
本发明的核酸可以包含编码选择标记蛋白或报道蛋白等异种蛋白的核苷酸序列、例如标记基因。标记基因包含:可以赋予细胞只选择表达该基因的细胞的选择性的选择标记基因(编码选择标记蛋白的核苷酸序列)和编码成为其基因表达指标的基因产物的报道基因(编码报道蛋白的核苷酸序列)。在本发明中,对优选的选择标记基因的例子没有限定,可以列举新霉素耐性基因、胸苷激酶基因、卡那霉素耐性基因、吡啶硫胺素耐性基因、腺苷酰基转移酶基因、Zeocin耐性基因、嘌呤霉素耐性基因等。本发明中,对优选的报道基因的例子没有限定,可以列举来自转座子Tn9的氯霉素乙酰转移酶基因、来自大肠杆菌的β葡糖醛酸酶或β半乳糖苷酶基因、萤光素酶基因、绿色萤光蛋白基因、来自水母的水母素(ィクリォン,aequorin)基因、分泌型胎盘碱性磷酸酶(SEAP)基因等。
本发明的核酸在前体多聚蛋白编码区内可以包含编码选择标记蛋白或报道蛋白等异种蛋白的核苷酸序列、例如标记基因。该情况下,对插入在前体多聚蛋白内的选择标记蛋白或报道蛋白等异种蛋白没有限定,但优选报道蛋白,更优选萤光素酶,进一步优选海肾萤光素酶(ゥミシィタケルシフェラ一ゼ)。编码海肾萤光素酶的基因的核苷酸序列的例子见SEQ ID NO:9。
以该海肾萤光素酶为代表的选择标记蛋白或报道蛋白等异种蛋白(优选报道蛋白、更优选萤光素酶)被插入前体多聚蛋白内时,优选插入在以SEQ ID NO:2所示的氨基酸序列为基准时的第2394位氨基酸残基与第2395位氨基酸残基之间。具有包含编码如此操作插入有异种蛋白的前体多聚蛋白的序列的全长基因组核酸的病毒颗粒与野生型JFH1株的病毒颗粒相比,感染性(感染滴度)高达5倍以上、优选10倍或10倍以上。编码在第2394位氨基酸残基与第2395位氨基酸残基之间插入有异种蛋白的前体多聚蛋白的HCV全长基因组序列的优选例子有:SEQ ID NO:6和SEQ ID NO:7。
本发明的核酸还优选进一步包含IRES序列。本发明中的“IRES序列”是指可以使核糖体结合在RNA内部并开始翻译的内部核糖体进入位点。以下,对本发明中的IRES序列的优选例子没有限定,可以列举EMCV IRES(脑心肌炎病毒的内部核糖体进入位点)、FMDVIRES、HCV IRES等。当本发明的核酸包含IRES序列时,优选在HCV基因组序列的5’非翻译区(5’NTR)与编码核心蛋白的核酸序列之间依次插入报道基因(编码报道蛋白的核酸序列)和IRES序列。
本发明的核酸可以通过采用本领域技术人员所公知的基因工程学技术将上述引起1个以上氨基酸取代的碱基取代导入编码HCVJFH1株的前体多聚蛋白的核酸中来制作。对编码HCV JFH1株的前体多聚蛋白的核酸没有限定,例如可以是包含SEQ ID NO:1所示的核苷酸序列的DNA、或包含该DNA的重组载体(例如重组质粒载体)。
关于上述引起氨基酸取代的碱基取代,当根据在生物学领域周知的遗传密码表比较取代部位的氨基酸密码子与取代前的氨基酸密码子时可以容易地进行确定。
本发明还提供包含本发明的核酸的载体。包含本发明的核酸的载体可以是重组载体,但更优选为表达载体。本发明的核酸优选插入在载体中的转录启动子的下游。本发明的核酸通过与转录启动子进行功能性连接,在转录启动子的控制下被整合。对转录启动子没有限定,可以列举T7启动子、SP6启动子、T3启动子等,但特别优选T7启动子。对载体没有限定,可以使用pUC19(TaKaRa社)、pBR322(TaKaRa社)、pGEM-T、pGEM-T Easy、pGEM-3Z(均由Promega社制)、pSP72(Promega社)、pCRII(Invitrogen社)、pT7Blue(Novagen社)等。来自表达载体的HCV复制子RNA的合成例如可以使用MEGAscript T7试剂盒(Ambion社)等来进行。制作的HCV复制子RNA可以利用本领域技术人员所周知的RNA提取法或纯化法等进行提取、纯化。
(2)产生感染性HCV颗粒的细胞的制作
本发明还涉及使用(1)中记载的本发明的突变体核酸生产的丙型肝炎病毒颗粒。该丙型肝炎病毒颗粒优选为感染性病毒颗粒。
本发明的HCV颗粒(优选感染性HCV颗粒)可以通过将包含上述(1)的核酸的全长基因组RNA导入到细胞中进行培养来制作。即,本发明还提供包含上述(1)中记载的本发明的核酸的丙型肝炎病毒颗粒。
导入RNA的细胞只要是允许HCV颗粒形成的细胞即可,但优选为培养细胞。上述细胞的例子有:Huh7细胞、HepG2细胞、IMY-N9细胞、HeLa细胞、293细胞等或它们的派生株等培养细胞。更优选的例子有:Huh7细胞等来自肝脏的培养细胞,进一步优选Huh7细胞和Huh7细胞的派生株(例如Huh7.5细胞、Huh7.5.1细胞等)。另外,还可以列举使CD81基因和/或Claudin1基因在Huh7细胞、HepG2细胞、IMY-N9细胞、HeLa细胞或293细胞中表达的细胞,其中优选使用Huh7细胞或Huh7细胞的派生株。在本发明中,“派生株”是指由该细胞衍生的细胞株,通常是指该细胞的亚克隆株。
作为将RNA导入到细胞中的方法,可以使用公知的任意方法。上述方法的例子有:磷酸钙共沉淀法、DEAE葡聚糖法、脂质转染法、微量注射法、电穿孔法,但优选脂质转染法和电穿孔法,进一步优选电穿孔法。
细胞的病毒颗粒产生能力还可以使用抗释放到培养液中的构成HCV病毒颗粒的要素(例如核心蛋白、E1蛋白或E2蛋白)的抗体来测定。另外,通过使用了特异性引物的RT-PCR法扩增、测定培养液中的HCV病毒颗粒所含有的HCV基因组RNA,也可以间接检测HCV病毒颗粒的存在。
所制作的病毒是否具有感染能力,这可如下判断:培养通过上述方法导入有HCV RNA的细胞,将所得上清添加在HCV容许性细胞(例如Huh7)中,例如在48小时后将细胞用抗核心抗体进行免疫染色,数出感染细胞数,或者使细胞提取物在SDS-聚丙烯酰胺凝胶中进行电泳,利用蛋白质印迹检测核心(core)蛋白,即可判断是否具有感染能力。需要说明的是,此处也将由导入有JFH1株的基因组RNA的细胞生成的感染性HCV颗粒称作JFH1病毒。
通过将如上制作的、导入有全长基因组RNA的细胞进行定期继代培养,可以得到持续产生感染性HCV颗粒的细胞。这种细胞株可以长期培养。
本发明还涉及生产如上制作的JFH1突变体的丙型肝炎病毒颗粒的细胞、优选培养细胞。
(3)适应性突变的解析
通过将由上述(2)制作的、持续生产HCV颗粒的细胞株继续进行继代培养,在HCV基因组中引起适应性突变,期待HCV颗粒生产显著提高。继代培养通常要进行十几次(1~2个月),但在本发明中,由于适应性突变的导入,继续继代培养1年、进一步优选2年。
需要说明的是,分析表明:利用适应性突变的组合,RNA复制效率达到200倍以上,反之,被抑制在1/5以下,只增加适应性突变的数目未必就好,呈现出复杂的情况(Lohmann V等人.J Virol 77:3007-3019,2003.)。另外,若HCV株不同,则适应性突变的效果也不同,所以适应性突变通过何种理由影响HCV基因组的复制效率,其细节尚不清楚。(1)的本发明核酸可以是通过导入该适应性突变而得到的适应突变体。
(4)HCV颗粒的应用
(2)中得到的HCV颗粒适合用作疫苗、用作用于制作抗HCV抗体的抗原。
具体而言,还可以将HCV颗粒直接用作疫苗,但也可以利用该领域已知的方法将其毒性减弱或灭活后使用。病毒的灭活可以通过将福尔马林、β-丙内酯、戊二醛等灭活剂例如添加混合在病毒悬浮液中,使其与病毒反应而达成(Appaiahgari,M.B. & Vrati,S.,Vaccine,22:3669-3675,2004)。因此,本发明还涉及将(2)中得到的HCV颗粒灭活而得到的丙型肝炎病毒疫苗。
本发明的疫苗通常可以制成溶液或悬浮液的任一种形式进行给药。本发明的疫苗还可以制成适合溶解或悬浮于液体中的固体的形态。制备物以乳浊液的形式进行制备、或者可以在脂质体中胶囊化。HCV颗粒等活性免疫原性成分经常与药学上可接受的、适合于活性成分的赋形剂混合。在适当的赋形剂中,例如有水、生理盐水、葡萄糖、甘油、乙醇等以及它们的混合物。并且,根据需要,疫苗可以含有少量的助剂(例如加湿剂或乳化剂)、pH缓冲剂和/或提高疫苗效能的佐剂。对有效的佐剂的例子没有限定,包含下述佐剂:氢氧化铝、N-乙酰基-胞壁酰-L-苏氨酰-D-异谷氨酰胺(thr-MDP)、N-乙酰基-正-胞壁酰-L-丙氨酰-D-异谷氨酰胺(CGP11637、称作nor-MDP)、N-乙酰基-胞壁酰-L-丙氨酰-D-异谷氨酰胺酰-L-丙氨酸-2-(1’-2’-二棕榈酰-sn-甘油基-3-羟基磷酰氧基)-乙胺(CGP19835A、称作MTP-PE)和RIBI。RIBI是在2%角鲨烯/Tween(注册商标)80乳剂中含有从细菌中提取的3种成分、即单磷脂A、海藻糖二霉菌酸和细胞壁骨架(HPL+TDM+CWS)的佐剂。佐剂的效能通过测定通过给予由HCV颗粒制成的疫苗而产生的抗体量即可确定。
本发明的疫苗通常是通过胃肠外给药、例如通过皮下注射或肌内注射等注射进行给药。适合于其他给药方式的其他剂型有:栓剂和根据情况进行口服给药的处方药。
根据需要,可以在HCV疫苗中加入具有佐剂活性的一种以上的化合物。佐剂是该免疫系统的非特异性刺激因子。佐剂增强宿主对HCV疫苗的免疫应答。该技术领域中公知的佐剂的具体例子有:弗氏完全和不完全佐剂、维生素E、非离子嵌段聚合物、胞壁酰二肽、皂苷、矿物油、植物油和卡波普(Carbopol)。特别适合粘膜使用的佐剂例如有:大肠杆菌(E.coli)易热性毒素(LT)或霍乱(Cholera)毒素(CT)。其他适当的佐剂例如有:氢氧化铝、磷酸铝或氧化铝、油性乳剂(例如Bayol(注册商标)或Marcol 52(注册商标))、皂苷或维生素E增溶剂。因此,在优选方式中本发明的疫苗包含佐剂。
例如在皮下、皮内、肌肉内、静脉内给药的注射剂中,本发明的HCV疫苗和医药上可接受的载体或稀释剂的其他具体例子中,稳定化剂、碳水化合物(例如山梨醇、甘露醇、淀粉、蔗糖、葡萄糖、葡聚糖)、白蛋白或酪蛋白等蛋白、牛血清或脱脂乳等含蛋白的物质、以及缓冲液(例如磷酸缓冲液)等可以一起给药。
栓剂中使用的现有粘合剂和载体中,例如可以包含聚亚烷基二醇或三甘油。这种栓剂可以由含有0.5%~50%范围、优选1%~20%范围的活性成分的混合物形成。口服处方药含有通常使用的赋形剂。该赋形剂例如有:药用级的甘露醇、乳糖、淀粉、硬脂酸镁、糖精钠、纤维素、碳酸镁等。
本发明的疫苗制成溶液、悬浮液、片剂、丸剂、胶囊剂、缓释处方剂或粉末剂的形态,含有10%~95%、优选25%~70%的活性成分(病毒颗粒或其一部分)。
本发明的疫苗以适于给药剂型的方法、以及具有预防和/或治疗效果的量进行给药。适合的给药量通常为每次给予0.01μg~100,000μg范围的抗原,这取决于所处置的患者、该患者免疫系统中的抗体合成能力和期望的防御程度,还取决于口服、皮下、皮内、肌肉内、静脉内给药途径等给药途径。
本发明的疫苗可以按照单独给药时间表进行给药、或者优选按照联合给药时间表进行给药。按照联合给药时间表进行给药时,接种开始时分别给予1~10种物质,接着按照维持和/或强化免疫应答所必需的时间间隔进行给药,例如第2次给药可在1~4个月后进行分别的给药。根据需要,几个月后可以继续给药。给药的要旨也至少部分性地根据个体的必要性来决定,取决于医生的判断。
并且,本发明的含有HCV颗粒的疫苗可以和其他免疫控制剂(例如免疫球蛋白、)同时给药。
本发明的疫苗还有以下使用方法:将疫苗给予健康人,在健康人体内诱导对HCV的免疫应答,对于新的HCV感染则预防性使用。并且,还有下述使用方法:将疫苗给予感染了HCV的患者,在机体内诱导对HCV的强免疫反应,从而用作消除HCV的治疗性疫苗。
本发明的HCV颗粒可用作用于制作抗体的抗原。通过将本发明的HCV颗粒给予哺乳类或鸟类,可以制作抗体。哺乳类动物有:小鼠、大鼠、兔、山羊、绵羊、马、牛、豚鼠、单峰驼、双峰驼、美洲驼等。单峰驼、双峰驼和大羊驼(Lama)适合制作仅包含H链的抗体。鸟类动物的例子有:鸡、鹅、鸵鸟等。采取给予了本发明HCV颗粒的动物的血清,按照已知方法可以获得抗体。
使用经本发明的HCV颗粒免疫的动物的细胞,可以制作产生单克隆抗体产生细胞的杂交瘤。杂交瘤的制造方法是周知的,可以采用Antibodies:A Laboratory Manual(Cold Spring Harbor Laboratory,1988)中记载的方法。
单克隆抗体产生细胞可以通过细胞融合而生成,还可以利用通过导入癌基因DNA或感染Epstein-Barr病毒使B淋巴细胞不死化等其他方法而生成。
通过上述方法得到的单克隆抗体或多克隆抗体对HCV的诊断或治疗、预防有用。因此,将本发明的HCV颗粒识别为抗原的抗丙型肝炎病毒抗体也包含在本发明范围内。
使用本发明的已附加表位的HCV颗粒制作的抗体可与医药上可接受的溶解剂、添加剂、稳定剂、缓冲液等同时给药。给药途径可以是任一种的给药途径,但优选为皮下、皮内、肌肉内给药,更优选静脉内给药。
(5)在筛选抗HCV药物中的应用
在进行HCV治疗药的开发时,除黑猩猩外,不存在反映病毒感染的有效动物、也不存在高效的体外病毒培养系统,所以无法充分进行药物评价成为障害。但近年来有人开发了可以评价HCV-RNA的复制的亚基因组HCV复制子系统(Lohmann,V.等人,Science,285:110-113,1999),作为与抑制病毒复制有关的HCV抑制剂的筛选系统取得了重要进步。
但是,上述亚基因组HCV复制子系统中存在无法评价HCV结构蛋白的功能的问题。实际上,已知作为HCV结构蛋白之一的核心蛋白会影响宿主的转录因子。因此,评价HCV感染的细胞中发出的现象时,仅凭亚基因组HCV复制子系统是不够的。在使用亚基因组HCV复制子系统进行筛选而选择的药物中,预测有时也无法充分抑制HCV的复制。
因此,为了解决上述亚基因组HCV复制子系统的问题,人们使用HCV N株(基因型1b)、HCV Con-1(基因型1b)、HCV H77株(基因型1a)开发了全长基因组HCV复制子系统(Ikeda,M等人,J.Virol.,76:2997-3006,2002;Pietschmann,T等人,J.Virol.76:4008-4021,2002;Blight,KJ等人,J.Virol.77:3181-3190,2003)。但是,即使将包含上述HCV株的结构蛋白的全长RNA导入细胞中,也没有确认到病毒颗粒释放到培养液中(Blight,KJ等人,J.Virol.77:3181-3190,2003)。因此,在该全长基因组HCV复制子系统中,出现了无法筛选在病毒的释放、感染过程中起作用的治疗药的问题。
使用HCV复制子来筛选抗HCV药物时,在受检物质的存在下培养感染性HCV颗粒和HCV感染容许细胞、例如Huh7细胞,测定HCV的复制和/或颗粒产生,从而评价受检物质的抗HCV效果。为了监测HCV的复制和颗粒产生,必需采用PCR或RNA印迹法测定HCV基因组量、或者通过EIA法或细胞免疫染色来检出测定核心蛋白或非结构蛋白(例如NS3蛋白)(Aoyagi,K.等人,J.Clin.Microbiol.,37:1802-1808,1999)。但是,由于上述测定方法的操作繁杂、难以高流通量化、成本增加,所以在抗HCV药物的筛选中人们希望开发简便且廉价的评价方法。因此,有人想出了下述方法:制作将报道基因整合到全长基因组HCV中的复制子,该复制子进行自身复制,监测由其基因组中的报道基因翻译的报道蛋白。例如,制作在JFH-1、J6CF/JFH1(Jc-1)和Con1/JFH1的5’NTR与编码核心蛋白的基因之间插入有萤光素酶基因和EMCV IRES作为报道基因的载体、Luc-JFH1、Luc-Jc1和Luc-Con1,研究其功能(Koutsoudakis,G.等人,J.Virol.80:5308-5320,2006)。制作带有上述报道选择性全长基因组HCV复制子的病毒,使其感染Huh7细胞时,则在感染细胞中作为报道基因的萤光素酶基因表达,合成萤光素酶。因此,通过测定萤光素酶活性,可以测定感染的效果,所以省去了测定HCV的基因组量或蛋白的时间,非常方便。
但是,报道基因等外来基因的插入使基因组长度增加,所以复制效率容易大幅降低。实际上,Luc-JFH1与JFH1的相比结果是,其复制能力低5倍,而且感染滴度也低3~10倍(Koutsoudakis,G.等人.J.Virol.80:5308-5320,2006),为了在筛选中使用带有表达报道基因的全长HCV基因组的病毒颗粒,必需开发感染滴度更高的HCV病毒。
相对于此,在本发明中成功制作了来自JFH1突变体的全长基因组复制子,所述JFH1突变体中虽然导入有报道基因但仍保持高复制能力。通过使用本发明的该全长基因组复制子,可以提供高效率的筛选方法。上述筛选方法也包含在本发明范围内。
在该筛选中,可以有效使用上述的、具有全长基因组序列的HCVRNA(全长基因组复制子RNA),所述全长基因组序列在前体多聚蛋白编码序列内、特别是以SEQ ID NO:2所示的氨基酸序列为基准时相当于第2394位氨基酸残基与第2395位氨基酸残基之间的位置插入有标记基因。作为标记基因,优选报道蛋白。
适合在本发明的该筛选中使用的、来自整合有报道蛋白编码序列的JFH1突变体的全长基因组复制子的结构可以是:从5’侧到3’侧依次包含本发明的JFH1适应突变体的5’非翻译区、报道蛋白编码序列、EMCV(脑心肌炎病毒,Encephalomyocarditis virus)的IRES序列、JFH1适应突变体的核心蛋白编码序列、E1蛋白编码序列、E2蛋白编码序列、p7蛋白编码序列、NS2蛋白编码序列、NS3蛋白编码序列、NS4A蛋白编码序列、NS4B蛋白编码序列、NS5A蛋白编码序列、NS5B蛋白编码序列以及3’非翻译区的核酸。
上述复制子的更优选的结构可以是:从5’侧到3’侧依次包含本发明的JFH1适应突变体的5’非翻译区、核心蛋白编码序列、E1蛋白编码序列、E2蛋白编码序列、p7蛋白编码序列、NS2蛋白编码序列、NS3蛋白编码序列、NS4A蛋白编码序列、NS4B蛋白编码序列、编码在NS5A蛋白中功能性(符合读框的)插入有报道蛋白的蛋白的序列、NS5B蛋白编码序列以及3’非翻译区的核酸。
作为本发明的JFH1适应突变体,可以适当使用上述(1)所述的本发明的核酸。
特别优选的上述复制子的结构可以是:编码在从HCV前体多聚蛋白的N末端起第2394位氨基酸残基与2395位氨基酸残基之间功能性(符合读框的)插入有报道蛋白的蛋白的核酸。
报道蛋白的例子有:萤光素酶、分泌型碱性磷酸酶、绿色萤光蛋白(GFP)、β-内酰胺酶、氯霉素乙酰转移酶、新霉素磷酰转移酶与萤光素酶的融合蛋白等。更优选萤光素酶,进一步优选海肾萤光素酶。编码海肾萤光素酶的基因的核苷酸序列的一个例子见SEQ ID NO:9。
在上述全长基因组HCV中整合有报道基因的复制子中,特别优选的序列为包含SEQ ID NO:6或SEQ ID NO:7所示的核苷酸序列的核酸,其中,当该核酸为RNA时,核苷酸序列中的碱基符号“T”读作“U”。在本发明的感染性HCV颗粒的制造中可以使用HCV基因组RNA或HCV基因组DNA。通过使用上述全长基因组复制子HCVRNA,可以提供以萤光素酶活性为指标的、高灵敏度的HCV感染测定系统。
使用在本发明的全长基因组HCV RNA中整合有编码报道蛋白的序列的复制子进行的筛选方法例如可以是筛选抗丙型肝炎病毒物质的方法,该方法包括:如上操作将该复制子导入到培养细胞中,制作生产丙型肝炎病毒颗粒的培养细胞,之后在受检物质的存在下培养(i)所得的生产丙型肝炎病毒颗粒的培养细胞或(ii)从该细胞释放到培养上清中的丙型肝炎病毒颗粒与丙型肝炎病毒感受性细胞(HCV感染容许细胞)的组合,测定所得培养物中的报道蛋白。上述筛选方法还可用作药物评价系统。
上述药物评价系统的具体例子有:筛选具有抗HCV作用的物质的方法,该方法是通过下述(1)~(3)来评价受检物质的抗HCV效果:(1)在受检物质的存在下培养感染性HCV颗粒和HCV感染容许细胞、例如Huh7细胞,所述感染性HCV颗粒包含如上操作将报道基因整合在全长基因组HCV中的复制子作为基因组;(2)测定随着HCV的复制、颗粒产生而产生的报道蛋白;(3)比较所产生的报道蛋白的水平和未加入受检物质的对照中的报道蛋白的检测水平。
本发明的筛选方法的另一例子为:评价受检物质的抗HCV效果的方法,该方法是通过下述(1)~(3)进行评价:(1)在受检物质的存在下培养感染性HCV颗粒产生细胞,该细胞包含如上操作将报道基因整合到全长基因组HCV中的复制子作为基因组;(2)测定随着HCV的复制、颗粒产生而产生的报道蛋白;(3)比较所产生的报道蛋白的水平和未加入受检物质的对照中的报道蛋白的水平。
更具体而言,上述筛选方法可以是筛选抗丙型肝炎病毒物质的方法,该方法包括下述步骤:在受检物质的存在下培养生产丙型肝炎病毒颗粒的培养细胞的步骤,所述丙型肝炎病毒颗粒包含插入有编码报道蛋白的核酸的JFH1突变体的全长基因组HCV RNA、即本发明的核酸;以及测定所得培养物中的该报道蛋白,当上述报道蛋白的表达量低时判定为上述受检物质具有抗丙型肝炎病毒活性的步骤。
(6)SEQ ID NO的概要
SEQ ID NO:1是野生型JFH1(JFH1wt)的全长基因组序列
SEQ ID NO:2是根据野生型JFH1(JFH1wt)的全长基因组序列编码的、前体多聚蛋白的氨基酸序列
SEQ ID NO:3是突变体JFH1-A/WT的全长基因组序列。第341位~9442位核苷酸是前体多聚蛋白编码序列。
SEQ ID NO:4是突变体JFH1-B/WT的全长基因组序列。第341位~9442位核苷酸是前体多聚蛋白编码序列。
SEQ ID NO:5是突变体JFH1-Q862R的全长基因组序列。第341位~9442位核苷酸是前体多聚蛋白编码序列。
SEQ ID NO:6是突变体JFH1-A/WT-Rluc的全长基因组序列。第341位~9442位核苷酸是前体多聚蛋白编码序列。
SEQ ID NO:7是突变体JFH1-B/WT-Rluc的全长基因组序列
SEQ ID NO:8是突变体JFH1wt-Rluc的全长基因组序列
SEQ ID NO:9是海肾萤光素酶基因的全长序列
SEQ ID NO:10~18为PCR引物
实施例
以下,使用实施例进一步具体地说明本发明。但本发明的技术范围并不受这些实施例的限定。
实施例1:用于高生产JFH-1病毒颗粒的JFH1适应突变体的获得
使用作为质粒DNA的pJFH-1(Wakita,T.等人,Nat.Med.,11(2005)第791-796页和国际公开WO2004/104198)作为DNA源,所述质粒DNA是将从重症肝炎的日本人患者中分离的基因型2a的丙型肝炎病毒(HCV)JFH1株(JP2002-171978A)的基因组RNA全区的cDNA(全基因组cDNA(full-genome cDNA);SEQ ID NO:1)克隆到插入有T7启动子的pUC19质粒载体中的T7启动子序列下游的EcoRI-XbaI位点而得到的。用XbaI切断pJFH-1,之后加入绿豆核酸酶20U(反应液总量为50μl),在30℃下温育30分钟,从而使Xba I切断末端变得平滑。接着,进行苯酚氯仿萃取、乙醇沉淀,得到除去了切断末端的CTGA4个碱基的XbaI切断片段。以该DNA片段为模板,使用MEGAscriptT7试剂盒(Ambion社)来合成RNA。如下操作将如此合成的JFH1株的全长基因组HCV RNA(full-length genomic HCV RNA)导入细胞中。
前1天,将1×106个Huh7细胞接种在10cm的培养皿中,用不含抗生素的培养基进行培养。在30μl脂质转染胺2000(Invitrogen社)和OPTI-MEM(Invitrogen社)的混合液中加入悬浮在1ml OPTI-MEM中的6μg JFH1 RNA,使之在室温下反应20分钟,形成RNA-脂质转染胺复合体。向前1天准备的Huh7细胞中添加该RNA-脂质转染胺复合体。24小时后将上清更换成新的培养基。之后继续培养2年。该继代培养期间远远长于用于得到适应突变体(culture-adapted variant)的通常的培养期间、即1~2个月(十几次继代)。由该继代培养结束后的细胞生产的病毒株命名为JFH-1a。另一方面,由野生型JFH1株合成全长基因组JFH1 RNA,与上述同样将其导入Huh-7.5.1细胞中。由培养刚刚开始后的野生型JFH1 RNA导入细胞生产的病毒株命名为JFH1wt。图1显示在本实施例中进行的实验流程。
实施例2:作为JFH1适应突变体(adapted JFH1 variant)的JFH1a的性
状解析(characterization)
在病毒感染24小时前,将Huh7.5.1细胞按2×104个/孔(well)接种在24孔平板中。接下来,在37℃下用实施例1中制作的JFH1wt和JFH1a病毒颗粒按M.O.I(感染复数)0.006感染Huh7.5.1细胞2小时。除去病毒液,加入新的培养基,在37℃下继续培养7天。经时回收细胞,提取总RNA(total RNA)。提取总RNA时使用市售的RNA提取试剂ISOGEN(ニッポンジ一ン社),向cDNA转换时使用ReverTra AceqPCR RT试剂盒(TOYOBO社),按照附带的操作指南进行提取。PCR反应按照通过SYBR GreenI检测进行的两步骤RT-PCR来进行。通过使用Light Cycler(Roche社)分析所得的PCR产物,确定细胞内的HCVRNA量。用于测定JFH1a基因组的引物序列为按照扩增HCV的NS3区的方式设计的引物,即5’-CTTTGACTCCGTGATCGACC-3’(SEQ IDNO:10)和5’-CCCTGTCTTCCTCTACCTG-3’(SEQ ID NO:11)。使用5’-TGGCACCCAGCACAATGAA-3’(SEQ ID NO:12)和5’-CTAAGTCATAGTCCGCCTAGAAGCA-3’(SEQ ID NO:13)作为扩增标准化用的肌动蛋白基因的引物,同样通过两步骤RT-PCR来定量,由所得数据算出每100ng总RNA的HCV RNA拷贝数(图2)。其结果,在培养的第6天,JFH1a显示出JFH1wt的约1,000倍高的复制能力。
接下来,分析JFH1wt和JFH1a的干扰素感受性。在病毒感染24小时前,在24孔平板的每孔中接种3×104个Huh7.5.1细胞。第二天,使JFH1wt和JFH1a以0.006的M.O.I.感染细胞2小时。感染后用PBS(-)清洗细胞3次,用含有图3中记载的浓度(0、0.16、0.8、4、20、100IU/ml)的干扰素α(IFN-α)(Universal Type I干扰素,PBL Interferon Source,NJ)的培养基培养72小时。通过定量的PCR来定量用图3中记载的IFN-α浓度处理的细胞内的HCV RNA量。由所得数据算出相对于未添加干扰素(IFN)的对照的相对复制率(%)。由其结果判定:JFH1a和野生型的JFH1wt显示出同样的IFN感受性(图3)。
实施例3:JFH1a突变的解析
在本实施例中,为了根据JFH1a的高病毒生产能力研究重要的适应性突变,首先分析JFH1a的基因组序列。使用ISOGEN-LS(NIPPONGENE社),从实施例2中得到的JFH1a病毒感染细胞中提取总RNA,通过逆转录反应合成cDNA。该cDNA合成的逆转录反应使用特异性引物A9482:5’-GGAACAGTTAGCTATGGAGTGTACC-3’(SEQ IDNO:16),使用Transcriptor first Strand cDNA合成试剂盒(Roche社)来进行。逆转录的程序按照附录的操作指南来进行。以所得的cDNA为模板,通过PCR反应扩增编码从核心蛋白到NS3蛋白的序列。使用S58:5’-TGTCTTCACGCAGAAAGCGCCTAG-3’(SEQ ID NO:17)和AS4639:5’-CTGAGCTGGTATTATGGAGACGTCC-3’(SEQ ID NO:18)作为PCR用引物。将通过PCR反应得到的DNA片段连接在pGEM-T Easy载体(Promega社)中,转化成大肠杆菌DH5α,之后在含有氨苄青霉素的LB琼脂培养基上培养,筛选转化大肠杆菌。取出6个集落,在LB培养基中培养一夜,使用Wizard Plus SV Miniprep DNA纯化系统(Promega)提取纯化质粒,确认了PCR扩增的DNA片段的核苷酸序列。
其结果,在JFH1a的前体多聚蛋白的从核心蛋白到NS3蛋白的区(前体多聚蛋白的N末端侧的一半),与JFH1的前体多聚蛋白序列(SEQID NO:2)相比发现多个氨基酸取代(图4)。还显示出6个克隆中的2个以上克隆存在共同的氨基酸突变(在图4中,用“*”显示)。
实施例4:突变体质粒的构建
构建具有为了使实施例3所示的JFH1a病毒具有高产生量所必需的适应性突变的质粒。如图4所示,由在6个克隆的核苷酸序列中共同发现的突变氨基酸的图式可以判定:JFH1a至少由2个突变株构成。此处将上述2个突变株分别称作组A、组B。选择Clone5-2作为组A、选择Clone5-4作为组B,制作两种的突变体嵌合。将Clone5-2和Clone5-4用限制酶AgeI和SpeI消化,得到包含经PCR扩增的5’侧的突变的区的DNA片段。通过连接反应使上述DNA片段与同样用AgeI和SpeI进行酶处理而得到的pJFH1载体片段结合,从而分别得到pJFH1-A/WT、pJFH1-B/WT。
图5是显示制作的突变体质粒中的突变导入位点的概略图。由突变体质粒pJFH1-A/WT表达的HCV突变体JFH1-A/WT具有全长基因组序列(SEQ ID NO:3),所述全长基因组序列编码在野生型JFH1(也称作JFH1wt)病毒的前体多聚蛋白的氨基酸序列(SEQ ID NO:2)的N末端侧的一半(从核心到NS3的一部分)导入有K74T、Y297H、A330T、S395P、N417S、D483G、A501T、Q862R、Q931R和S961A这10个氨基酸取代的蛋白。由突变体质粒pJFH1-B/WT表达的HCV突变体JFH1-B/WT具有全长基因组序列(SEQ ID NO:4),所述全长基因组序列编码在野生型JFH1(也称作JFH1wt)病毒的前体多聚蛋白的氨基酸序列(SEQ ID NO:2)的N末端侧的一半(从核心到NS3)导入有V31A、K74T、G451R、V756A、V786A和Q862R这6个氨基酸取代的蛋白。
需要说明的是,使用将在JFH1wt的前体多聚蛋白的氨基酸序列中导入有V2440L氨基酸取代的HCV病毒突变体JFH1-mut5的全长基因组序列在T7RNA启动子的控制下克隆化的质粒作为对照。据报道,该病毒JFH1-mut5与JFH1wt相比,病毒生产能力高10倍(Kaul等人,J.Virol.(2007)81:13168-13179)。
实施例5:HCV适应突变体的HCV生产能力的解析
比较野生型JFH1wt及其3种适应突变体JFH1-A/WT、JFH1-B/WT和JFH1-mut5的病毒颗粒生产能力。
首先,以pJFH-1和实施例4中制作的突变体质粒为模板,利用实施例1所示的方法合成上述4种病毒JFH1wt、JFH1-A/WT、JFH1-B/WT和JFH1-mut5的全长基因组HCV RNA。接下来,将各4μg合成的4种HCV RNA与100μl以5×106细胞/ml的密度悬浮在Microporation试剂盒(Degital Bio社)中所含的缓冲液R中的Huh7.5.1细胞混合,使用MicroPorator(Digital Bio社),在以1350V(脉冲电压)、30ms(脉冲宽度)进行1次脉冲的条件下进行电穿孔。将细胞悬浮于10ml培养基中,在6孔平板的每孔中接种2ml(2×105细胞)。电穿孔后于4、24、48、72、96小时回收细胞和培养上清,使用Ortho(ォ一ソ)HCV抗原IRMA试验(Aoyagi等人,J.Clin.Microbiol.,37(1999)第1802-1808页)来定量新产生的细胞内核心蛋白量(图6A)。同样操作,测定各时间点的培养上清中的核心蛋白量(图6B)。转染效率的校正使用4小时后的细胞内核心蛋白量来进行。
另外,利用病毒效价测定法(转化灶形成实验,focus forming assay)测定JFH1wt、JFH1-A/WT、JFH1-B/WT和JFH1-mut5的感染滴度。通过病毒效价测定法(转化灶形成实验)测定培养上清中的病毒感染效价。更具体而言,在96孔平板的每孔中接种6×103个Huh7.5.1细胞,第二天,用经培养基逐步稀释的病毒溶液感染细胞,在37℃下培养72小时。病毒感染细胞的检测通过抗原抗体反应的免疫染色法来进行。将感染72小时后的细胞在室温下、在10%福尔马林-PBS(-)溶液中固定20分钟,之后在室温下用0.5%Triton X-PBS(-)处理10分钟。之后,加入用5%脱脂乳-PBS(-)稀释的抗HCV-核心(克隆CP14)单克隆抗体(300倍稀释物)作为1次抗体,使之在室温下反应1小时。再用PBS(-)清洗3次,之后加入HRP标记的山羊抗小鼠抗体(300倍稀释物),使之在室温下反应1小时。用PBS(-)清洗3次,之后加入KonicaImmunostain HRP-1000(コニカミノルタ社),在显微镜下测定染成蓝色的病毒抗原阳性细胞集团(免疫转化灶)的数目(图6C)。
根据所得的核心蛋白量和感染滴度,通过下式算出比活性(相对比感染滴度(relative specific infectivity))。比活性=(培养上清中的感染滴度)/(培养上清中的核心蛋白量)。其结果见图6D。
在Huh7.5.1细胞中,JFH1-A/WT和JFH1-B/WT与野生型JFH1wt相比显示出高达100倍以上的感染滴度、与JFH1-mut5相比显示出高达10倍以上的感染滴度(图6C)。显示感染滴度高和病毒蛋白向细胞外的释放量高的这些结果意思是指这些病毒将大量的感染性病毒颗粒释放到培养上清中。即,显示JFH1-A/WT和JFH1-B/WT具有非常高的病毒生产能力(图6B、6C)。
另外,如图6D所示,还显示出JFH1-B/WT的比活性明显高。该结果表明:JFH1-B/WT的感染力强、或者可以非常高效地形成病毒颗粒。如上所述,高效率的病毒颗粒形成能力是可以在以疫苗制造等为目的的HCV病毒颗粒生产中有效利用的、非常优异的性质。
实施例6:适应突变体病毒的感染传播(infection transmission)的解析
接下来,分析JFH1wt、JFH1a、JFH1-A/WT、JFH1-B/WT和JFH1-mut5这5种HCV病毒的感染传播力。在进行病毒感染20~24小时前,在6孔平板的每孔中接种1×105个Huh7.5.1细胞。第二天,将这5种病毒用0.001的M.O.I.(100FFU/ml,1ml)在37℃下感染2小时。2小时后除去病毒液,加入2ml新的培养基,将细胞在37℃下继续培养23天。每隔3~4天取20%左右的该细胞进行继代培养,每次回收上清并保存在-80℃下。利用已述的病毒效价测定法(转化灶形成实验)测定回收的培养上清中的病毒感染滴度。其结果,转染后JFH1a和JFH1-B/WT的病毒感染滴度急速上升,感染传播迅速,这表明上述2种病毒具有高感染传播力(图7)。
为了确认JFH1-B/WT的感染传播力高,用5种病毒(各50FFU)感染Huh7.5.1细胞(6×103细胞),比较在感染72小时后形成的转化灶的大小。将转化灶按照病毒效价测定法(转化灶形成实验)的程序进行染色、观察。其结果,如图8所示,显示出JFH1a、JFH1-B/WT的转化灶大小特别大、感染传播力格外高。
实施例7:适应突变体病毒JFH1-R/WT的解析
详细分析具有病毒高生产能、且感染传播力高的JFH1适应突变体病毒JFH1-B/WT所具有的6个位置氨基酸突变(氨基酸取代)。在基因的点突变导入中通常采用位点定向诱变法。突变体的制作是按照附带的操作指南,使用QuickChange II XL位点定向诱变试剂盒(Stratagene社),以将JFH1-B/WT或JFH1wt的全长基因组序列克隆化的质粒为模板,使用点突变导入用引物来进行。如此操作导入到HCV基因组序列中的点突变通过使用DNA序列分析仪进行序列测定即可确认。
将制作的6个位置氨基酸突变(V31A、K74T、G451R、V756A、V786A和Q862R)中的任一个恢复成野生型氨基酸的突变体和将该6个位置氨基酸突变中的任一个导入JFH1wt(野生型)中的突变体分别见图9和图10。
将上述每1个位置氨基酸突变恢复成野生型氨基酸的碱基突变导入JFH1-B/WT全长基因组序列中得到的6种HCV突变体命名为31-(A31V)、74-(T74K)、451-(R451G)、756-(A756V)、786-(A786V)和862-(R862Q)(图9)。31-(A31V)是将氨基酸取代A31V导入JFH1-B/WT中得到的突变体,74-(T74K)是将氨基酸取代T74K导入JFH1-B/WT中得到的突变体,451-(R451G)是将氨基酸取代R451G导入JFH1-B/WT中得到的突变体,756-(A756V)是将氨基酸取代A756V导入JFH1-B/WT中得到的突变体,786-(A786V)是将氨基酸取代A786V导入JFH1-B/WT中得到的突变体,862-(R862Q)是将氨基酸取代R862Q导入JFH1-B/WT中得到的突变体。进行与实施例4相同的操作,制作上述突变体的全长基因组序列被克隆化的突变体质粒。
另外,将引起上述每1个位置氨基酸突变的碱基突变导入野生型JFH1wt全长基因组序列中得到的6种HCV突变体命名为31+(V31A)、74+(K74T)、451+(G451R)、756+(V756A)、786+(V786A)和862+(Q862R)(图10)。31+(V31A)是将氨基酸取代V31A导入JFH1wt中得到的突变体,74+(K74T)是将氨基酸取代K74T导入JFH1wt中得到的突变体,451+(G451R)是将氨基酸取代G451R导入JFH1wt中得到的突变体,756+(V756A)是将氨基酸取代V756A导入JFH1wt中得到的突变体,786+(V786A)是将氨基酸取代V786A导入JFH1wt中得到的突变体,862+(Q862R)是将氨基酸取代Q862R导入JFH1wt中得到的突变体。进行与实施例4相同的操作,制作上述突变体的全长基因组序列被克隆化的突变体质粒。再以制作的突变体质粒为模板,通过实施例1所示的方法合成全长基因组HCV RNA。
接着,通过电穿孔将各4μg的图9所示的6种突变体病毒31-(A31V)、74-(T74K)、451-(R451G)、756-(A756V)、786-(A786V)和862-(R862Q)的全长基因组HCV RNA、以及图10所示的突变体病毒451+(G451R)的全长基因组HCV RNA、JFH1wt和JFH1-B/WT的全长基因组HCV RNA分别转染到Huh7.5.1细胞(1×106)中。将转染的细胞悬浮于10ml培养基中,在6孔平板上各接种2ml(2×105细胞)。测定转染后24、48、72、96小时的培养上清中的病毒感染滴度(FFU/ml)和核心蛋白量(μg/ml)(图11)。如图11A和11B所示,将第451位的氨基酸残基恢复成野生型的G(甘氨酸)时、或者将第862位的氨基酸残基恢复成野生型的R(精氨酸)时,比活性显著降低。由此表明:G451R的突变和R862Q的突变对提高病毒生产能力至关重要。
同样,通过电穿孔将各4μg的图10所示的6种突变体病毒31+(V31A)、74+(K74T)、451+(G451R)、756+(V756A)、786+(V786A)和862+(Q862R)的全长基因组HCV RNA、以及JFH1wt和JFH1-B/WT的全长基因组HCV RNA分别转染到Huh7.5.1细胞(1×106)中。将转染的细胞悬浮于10ml培养基中,在6孔平板上各接种2ml(2×105细胞)。测定转染后24、48、72、96小时的培养上清中的病毒感染滴度(FFU/ml)、核心蛋白量(μg/ml)(图12)。培养上清中的感染滴度显示:通过将K74T、G451R、Q862R的氨基酸突变分别导入JFH1wt中,病毒生产能力有所提高(图12A)。另外,通过导入Q862R突变,细胞外核心蛋白量增加至JFH1wt的10倍(图12B)。
上述测定结果表明:通过导入G451R突变,与JFH1wt相比病毒感染力和病毒生产能力有所提高。另外还显示:K74T突变和Q862R突变也提高了病毒生产能力。但是,仅凭这些突变还不能超过JFH1-B/WT。
并且,为了研究在长期感染中病毒感染传播力随时间的变化,将由各突变体质粒合成的全长基因组HCV RNA转染到Huh7.5.1细胞中,用产生的各感染性病毒颗粒感染Huh7.5.1细胞(M.O.I.0.001),进行长期培养,经时测定病毒生产量和感染滴度。31-(A31V)、74-(T74K)、451-(R451G)、756-(A756V)、786-(A786V)、862-(R862Q)、451+(G451R)、JFH1wt和JFH1-B/WT的结果汇总在图13中,而31+(V31A)、74+(K74T)、451+(G451R)、756+(V756A)、786+(V786A)、862+(Q862R)、JFH1wt和JFH1-B/WT的结果汇总在图14中。
其结果表明:在将第451位的氨基酸残基恢复成野生型的G(甘氨酸)的突变体(451-(R451G))中,培养上清中的核心蛋白量的增加时间延迟(图13A),所以G451R突变与感染传播力有关。
如图14所示,培养上清中核心蛋白量和感染滴度的增加的图形显示出:K74T、G451R、Q862R的突变有助于提高感染传播力(图14A和14B)。特别是G451R在核心蛋白量和感染滴度两个方面均显示出显著的增加,表明病毒颗粒产生能力大幅提高。
根据以上解析可知:K74T、G451R、Q862R突变使HCV病毒生产能力增强。需要说明的是,突变体862+(Q862R)的全长基因组序列见SEQ ID NO:5。
实施例8:将报道基因整合在全长基因组序列中的突变体的制作
为了容易检测HCV的感染和增殖,制作包含整合有萤光素酶基因作为报道基因的全长HCV基因组序列的突变体。制作的突变体的结构图见图15。
具体而言,如下制作将编码HCV前体多聚蛋白的DNA片段功能性连接在T7启动子下游的质粒载体,所述HCV前体多聚蛋白在从丙型肝炎病毒JFH1wt(野生型)、JFH1-A/WT和JFH1-B/WT(适应突变体)的前体多聚蛋白的N末端的第1氨基酸起计数的、第2394位氨基酸残基(氨基酸2394位)与第2395位氨基酸残基(氨基酸2395位)之间整合有由311个氨基酸残基形成的海肾(Renilla reniformis)萤光素酶。
首先,以插入在质粒pGL4.27(Promega社)中的海肾萤光素酶基因(SEQ ID NO:9)为模板,使用末端具有XhoI位点(ctcgag)的2个引物5’-ctcgagATGGCTTCCAAGGTGTACGACCCC-3’(SEQ ID NO:14)和5’-ctcgaGCTGCTCGTTCTTCAGCACGCGCTC-3’(SEQ ID NO:15),扩增海肾萤光素酶基因片段。用XhoI消化已扩增的基因片段。
另一方面,将JFH1wt、JFH1-A/WT和JFH1-B/WT的全长基因组序列被克隆化的质粒用识别核苷酸序列5’-CCTCGAGG-3’的限制酶AbsI消化,在该限制位点插入上述得到的海肾萤光素酶基因扩增产物的XhoI消化片段,进行克隆,筛选具有功能性连接有海肾萤光素酶的载体的克隆。将如此操作得到的海肾萤光素酶(Rluc)基因导入突变体分别命名为JFH1wt-Rluc、JFH1-A/WT-Rluc、JFH1-B/WT-RLuc。被克隆化在载体中的JFH1-A/WT-Rluc的全长基因组序列(SEQ ID NO:5)、JFH1-B/WT-RLuc的全长基因组序列(SEQ ID NO:6)和JFH1wt-Rluc的全长基因组序列(SEQ ID NO:7)通过核苷酸序列测定来确认。
接下来,将上述的序列被克隆化的重组载体用XbaI消化,切出插入片段,用绿豆核酸酶处理,之后使用MEGAscript T7试剂盒(Ambion社)合成全长基因组序列HCV RNA。将由JFH1wt、JFH1wt-Rluc、JFH1-A/WT-Rluc和JFH1-B/WT-Rluc合成的HCV RNA转染到HuH7.5.1细胞中,72小时后测定培养上清中的感染滴度。感染滴度的测定通过使用抗HCV-核心(CP14)单克隆抗体进行细胞染色、计测转化灶数目来进行。
其结果,将Rluc基因整合在野生型JFH1wt中时,病毒生产能力与野生型JFH1wt相比低约10倍(图16)。另一方面,将Rluc基因整合在突变体JFH1-A/WT或JFH1-B/WT中时,与JFH1wt-Rluc相比感染滴度高约100倍(图16)。
进一步分析由整合有Rluc基因的全长基因组序列生产的HCV颗粒的量与萤光素酶活性的相关。在48孔平板上按1.0×104个/孔接种Huh7.5.1细胞,24小时后用JFH-A/WT-Rluc和JFH-B/WT-Rluc以100、50、25、12、6、3和0FFU(转化灶形成单元)感染2小时。感染后,将细胞用PBS清洗2次,每孔加入200μL新鲜培养基。病毒感染72小时后从平板中回收细胞,测定萤光素酶的活性。萤光素酶的活性通过使用海肾萤光素酶分析系统(Promega社),按照附带的操作指南进行测定。具体如下:除去培养上清,用200μL PBS清洗2次,加入200μL试剂盒中所含的溶解缓冲液(裂解缓冲液)(海肾萤光素酶分析系统,Promega社),在室温下搅拌15分钟,使细胞溶解。将20μL溶解液移至萤光素酶分析平板中,加入100μl基质,用Glomax发光计(Promega社)测定发光。其结果,检测出依赖于病毒量的萤光素酶活性。(图17)。
实施例9:干扰素对HCV感染和增殖的抑制效果
关于干扰素对HCV感染和增殖的抑制效果,以公知的干扰素作为受试药物进行实验,确认使用了在全长HCV基因组序列中整合有报道基因的JFH1突变体(实施例8)的抗HCV物质的筛选系统的有效性。
在病毒感染的24小时前,在两块(2セット)48孔平板的每孔中接种1.2×104个Huh7.5.1细胞。第二天,向细胞中加入100FFU的JFH-A/WT-Rluc或JFH-B/WT-Rluc病毒,感染2小时。感染后将细胞用PBS(-)清洗2次,用含有图18中记载的浓度(0、1、4、20、100U/ml)的IFN-α(Universal Type I干扰素,PBL Interferon Source,NJ)的培养基培养72小时。通过病毒效价测定法(转化灶形成实验)测定上述两块病毒感染平板中的一块平板的病毒感染滴度,测定另一块平板的萤光素酶活性。其结果见图18。
干扰素α剂量依赖性地抑制HCV的感染(图18B)。另外,在萤光素酶分析中显示:所示的萤光素酶活性与感染滴度之间高度相关(图18A)。该结果表明:如果使用整合有Rluc基因的JFH1wt及其突变体,则可以以萤光素酶活性为指标,通过测定感染抑制率,高效率筛选干扰素等的抗HCV物质。
【序列表的自由文本】
SEQ ID NO:3~8为JFH1突变体。
SEQ ID NO:10~18为引物。
CPCH1060918 序列表
<110>国立大学法人东京大学
学校法人日本大学
中国科学院微生物研究所
国立感染症研究所长
财团法人东京都医学研究机构
东丽株式会社
<120>感染性丙型肝炎病毒高生产HCV突变体及其应用
<130>PH-4259CN
<160>18
<170>PatentIn version 3.4
<210>1
<211>9678
<212>DNA
<213>丙型肝炎病毒JFH1株
<220>
<221>CDS
<222>(341)..(9442)
<400>1
acctgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60
cttcacgcag aaagcgccta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120
ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 180
aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg cgtgcccccg 240
caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300
cgcttgcgag tgccccggga ggtctcgtag accgtgcacc atg agc aca aat cct 355
Met Ser Thr Asn Pro
1 5
aaa cct caa aga aaa acc aaa aga aac acc aac cgt cgc cca gaa gac 403
Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro Glu Asp
10 15 20
gtt aag ttc ccg ggc ggc ggc cag atc gtt ggc gga gta tac ttg ttg 451
Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly Val Tyr Leu Leu
25 30 35
ccg cgc agg ggc ccc agg ttg ggt gtg cgc acg aca agg aaa act tcg 499
Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Thr Thr Arg Lys Thr Ser
40 45 50
gag cgg tcc cag cca cgt ggg aga cgc cag ccc atc ccc aaa gat cgg 547
Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro Lys Asp Arg
55 60 65
cgc tcc act ggc aag gcc tgg gga aaa cca ggt cgc ccc tgg ccc cta 595
Arg Ser Thr Gly Lys Ala Trp Gly Lys Pro Gly Arg Pro Trp Pro Leu
70 75 80 85
tat ggg aat gag gga ctc ggc tgg gca gga tgg ctc ctg tcc ccc cga 643
Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg
90 95 100
ggc tct cgc ccc tcc tgg ggc ccc act gac ccc cgg cat agg tcg cgc 691
Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro Arg His Arg Ser Arg
105 110 115
aac gtg ggt aaa gtc atc gac acc cta acg tgt ggc ttt gcc gac ctc 739
Asn Val Gly Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu
120 125 130
atg ggg tac atc ccc gtc gta ggc gcc ccg ctt agt ggc gcc gcc aga 787
Met Gly Tyr Ile Pro Val Val Gly Ala Pro Leu Ser Gly Ala Ala Arg
135 140 145
gct gtc gcg cac ggc gtg aga gtc ctg gag gac ggg gtt aat tat gca 835
Ala Val Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala
150 155 160 165
aca ggg aac cta ccc ggt ttc ccc ttt tct atc ttc ttg ctg gcc ctg 883
Thr Gly Asn Leu Pro Gly Phe Pro Phe Ser Ile Phe Leu Leu Ala Leu
170 175 180
ttg tcc tgc atc acc gtt ccg gtc tct gct gcc cag gtg aag aat acc 931
Leu Ser Cys Ile Thr Val Pro Val Ser Ala Ala Gln Val Lys Asn Thr
185 190 195
agt agc agc tac atg gtg acc aat gac tgc tcc aat gac agc atc act 979
Ser Ser Ser Tyr Met Val Thr Asn Asp Cys Ser Asn Asp Ser Ile Thr
200 205 210
tgg cag ctc gag gct gcg gtt ctc cac gtc ccc ggg tgc gtc ccg tgc 1027
Trp Gln Leu Glu Ala Ala Val Leu His Val Pro Gly Cys Val Pro Cys
215 220 225
gag aga gtg ggg aat acg tca cgg tgt tgg gtg cca gtc tcg cca aac 1075
Glu Arg Val Gly Asn Thr Ser Arg Cys Trp Val Pro Val Ser Pro Asn
230 235 240 245
atg gct gtg cgg cag ccc ggt gcc ctc acg cag ggt ctg cgg acg cac 1123
Met Ala Val Arg Gln Pro Gly Ala Leu Thr Gln Gly Leu Arg Thr His
250 255 260
atc gat atg gtt gtg atg tcc gcc acc ttc tgc tct gct ctc tac gtg 1171
Ile Asp Met Val Val Met Ser Ala Thr Phe Cys Ser Ala Leu Tyr Val
265 270 275
ggg gac ctc tgt ggc ggg gtg atg ctc gcg gcc cag gtg ttc atc gtc 1219
Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala Gln Val Phe Ile Val
280 285 290
tcg ccg cag tac cac tgg ttt gtg caa gaa tgc aat tgc tcc atc tac 1267
Ser Pro Gln Tyr His Trp Phe Val Gln Glu Cys Asn Cys Ser Ile Tyr
295 300 305
cct ggc acc atc act gga cac cgc atg gca tgg gac atg atg atg aac 1315
Pro Gly Thr Ile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn
310 315 320 325
tgg tcg ccc acg gcc acc atg atc ctg gcg tac gtg atg cgc gtc ccc 1363
Trp Ser Pro Thr Ala Thr Met Ile Leu Ala Tyr Val Met Arg Val Pro
330 335 340
gag gtc atc ata gac atc gtt agc ggg gct cac tgg ggc gtc atg ttc 1411
Glu Val Ile Ile Asp Ile Val Ser Gly Ala His Trp Gly Val Met Phe
345 350 355
ggc ttg gcc tac ttc tct atg cag gga gcg tgg gcg aag gtc att gtc 1459
Gly Leu Ala Tyr Phe Ser Met Gln Gly Ala Trp Ala Lys Val Ile Val
360 365 370
atc ctt ctg ctg gcc gct ggg gtg gac gcg ggc acc acc acc gtt gga 1507
Ile Leu Leu Leu Ala Ala Gly Val Asp Ala Gly Thr Thr Thr Val Gly
375 380 385
ggc gct gtt gca cgt tcc acc aac gtg att gcc ggc gtg ttc agc cat 1555
Gly Ala Val Ala Arg Ser Thr Asn Val Ile Ala Gly Val Phe Ser His
390 395 400 405
ggc cct cag cag aac att cag ctc att aac acc aac ggc agt tgg cac 1603
Gly Pro Gln Gln Asn Ile Gln Leu Ile Asn Thr Asn Gly Ser Trp His
410 415 420
atc aac cgt act gcc ttg aat tgc aat gac tcc ttg aac acc ggc ttt 1651
Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Asn Thr Gly Phe
425 430 435
ctc gcg gcc ttg ttc tac acc aac cgc ttt aac tcg tca ggg tgt cca l699
Leu Ala Ala Leu Phe Tyr Thr Asn Arg Phe Asn Ser Ser Gly Cys Pro
440 445 450
ggg cgc ctg tcc gcc tgc cgc aac atc gag gct ttc cgg ata ggg tgg 1747
Gly Arg Leu Ser Ala Cys Arg Asn Ile Glu Ala Phe Arg Ile Gly Trp
455 460 465
ggc acc cta cag tac gag gat aat gtc acc aat cca gag gat atg agg 1795
Gly Thr Leu Gln Tyr Glu Asp Asn Val Thr Asn Pro Glu Asp Met Arg
470 475 480 485
ccg tac tgc tgg cac tac ccc cca aag ccg tgt ggc gta gtc ccc gcg 1843
Pro Tyr Cys Trp His Tyr Pro Pro Lys Pro Cys Gly Val Val Pro Ala
490 495 500
agg tct gtg tgt ggc cca gtg tac tgt ttc acc ccc agc ccg gta gta 1891
Arg Ser Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val
505 510 515
gtg ggc acg acc gac aga cgt gga gtg ccc acc tac aca tgg gga gag 1939
Val Gly Thr Thr Asp Arg Arg Gly Val Pro Thr Tyr Thr Trp Gly Glu
520 525 530
aat gag aca gat gtc ttc cta ctg aac agc acc cga ccg ccg cag ggc 1987
Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr Arg Pro Pro Gln Gly
535 540 545
tca tgg ttc ggc tgc acg tgg atg aac tcc act ggt ttc acc aag act 2035
Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys Thr
550 555 560 565
tgt ggc gcg cca cct tgc cgc acc aga gct gac ttc aac gcc agc acg 2083
Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp Phe Asn Ala Ser Thr
570 575 580
gac ttg ttg tgc cct acg gat tgt ttt agg aag cat cct gat gcc act 2131
Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys His Pro Asp Ala Thr
585 590 595
tat att aag tgt ggt tct ggg ccc tgg ctc aca cca aag tgc ctg gtc 2179
Tyr Ile Lys Cys Gly Ser Gly Pro Trp Leu Thr Pro Lys Cys Leu Val
600 605 610
cac tac cct tac aga ctc tgg cat tac ccc tgc aca gtc aat ttt acc 2227
His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr
615 620 625
atc ttc aag ata aga atg tat gta ggg ggg gtt gag cac agg ctc acg 2275
Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Thr
630 635 640 645
gcc gca tgc aac ttc act cgt ggg gat cgc tgc gac ttg gag gac agg 2323
Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys Asp Leu Glu Asp Arg
650 655 660
gac agg agt cag ctg tct cct ctg ttg cac tct acc acg gaa tgg gcc 2371
Asp Arg Ser Gln Leu Ser Pro Leu Leu His Ser Thr Thr Glu Trp Ala
665 670 675
atc ctg ccc tgc acc tac tca gac tta ccc gct ttg tca act ggt ctt 2419
Ile Leu Pro Cys Thr Tyr Ser Asp Leu Pro Ala Leu Ser Thr Gly Leu
680 685 690
ctc cac ctt cac cag aac atc gtg gac gta caa tac atg tat ggc ctc 2467
Leu His Leu His Gln Asn Ile Val Asp Val Gln Tyr Met Tyr Gly Leu
695 700 705
tca cct gct atc aca aaa tac gtc gtt cga tgg gag tgg gtg gta ctc 2515
Ser Pro Ala Ile Thr Lys Tyr Val Val Arg Trp Glu Trp Val Val Leu
710 715 720 725
tta ttc ctg ctc tta gcg gac gcc aga gtc tgc gcc tgc ttg tgg atg 2563
Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys Ala Cys Leu Trp Met
730 735 740
ctc atc ttg ttg ggc cag gcc gaa gca gca ttg gag aag ttg gtc gtc 261l
Leu Ile Leu Leu Gly Gln Ala Glu Ala Ala Leu Glu Lys Leu Val Val
745 750 755
ttg cac gct gcg agt gcg gct aac tgc cat ggc ctc cta tat ttt gcc 2659
Leu His Ala Ala Ser Ala Ala Asn Cys His Gly Leu Leu Tyr Phe Ala
760 765 770
atc ttc ttc gtg gca gct tgg cac atc agg ggt cgg gtg gtc ccc ttg 2707
Ile Phe Phe Val Ala Ala Trp His Ile Arg Gly Arg Val Val Pro Leu
775 780 785
acc acc tat tgc ctc act ggc cta tgg ccc ttc tgc cta ctg ctc atg 2755
Thr Thr Tyr Cys Leu Thr Gly Leu Trp Pro Phe Cys Leu Leu Leu Met
790 795 800 805
gca ctg ccc cgg cag gct tat gcc tat gac gca cct gtg cac gga cag 2803
Ala Leu Pro Arg Gln Ala Tyr Ala Tyr Asp Ala Pro Val His Gly Gln
810 815 820
ata ggc gtg ggt ttg ttg ata ttg atc acc ctc ttc aca ctc acc ccg 2851
Ile Gly Val Gly Leu Leu Ile Leu Ile Thr Leu Phe Thr Leu Thr Pro
825 830 835
ggg tat aag acc ctc ctc ggc cag tgt ctg tgg tgg ttg tgc tat ctc 2899
Gly Tyr Lys Thr Leu Leu Gly Gln Cys Leu Trp Trp Leu Cys Tyr Leu
840 845 850
ctg acc ctg ggg gaa gcc atg att cag gag tgg gta cca ccc atg cag 2947
Leu Thr Leu Gly Glu Ala Met Ile Gln Glu Trp Val Pro Pro Met Gln
855 860 865
gtg cgc ggc ggc cgc gat ggc atc gcg tgg gcc gtc act ata ttc tgc 2995
Val Arg Gly Gly Arg Asp Gly Ile Ala Trp Ala Val Thr Ile Phe Cys
870 875 880 885
ccg ggt gtg gtg ttt gac att acc aaa tgg ctt ttg gcg ttg ctt ggg 3043
Pro Gly Val Val Phe Asp Ile Thr Lys Trp Leu Leu Ala Leu Leu Gly
890 895 900
cct gct tac ctc tta agg gcc gct ttg aca cat gtg ccg tac ttc gtc 3091
Pro Ala Tyr Leu Leu Arg Ala Ala Leu Thr His Val Pro Tyr Phe Val
905 910 915
aga gct cac gct ctg ata agg gta tgc gct ttg gtg aag cag ctc gcg 3139
Arg Ala His Ala Leu Ile Arg Val Cys Ala Leu Val Lys Gln Leu Ala
920 925 930
ggg ggt agg tat gtt cag gtg gcg cta ttg gcc ctt ggc agg tgg act 3187
Gly Gly Arg Tyr Val Gln Val Ala Leu Leu Ala Leu Gly Arg Trp Thr
935 940 945
ggc acc tac atc tat gac cac ctc aca cct atg tcg gac tgg gcc gct 3235
Gly Thr Tyr Ile Tyr Asp His Leu Thr Pro Met Ser Asp Trp Ala Ala
950 955 960 965
agc ggc ctg cgc gac tta gcg gtc gcc gtg gaa ccc atc atc ttc agt 3283
Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu Pro Ile Ile Phe Ser
970 975 980
ccg atg gag aag aag gtc atc gtc tgg gga gcg gag acg gct gca tgt 3331
Pro Met Glu Lys Lys Val Ile Val Trp Gly Ala Glu Thr Ala Ala Cys
985 990 995
ggg gac att cta cat gga ctt ccc gtg tcc gcc cga ctc ggc cag 3376
Gly Asp Ile Leu His Gly Leu Pro Val Ser Ala Arg Leu Gly Gln
1000 1005 1010
gag atc ctc ctc ggc cca gct gat ggc tac acc tcc aag ggg tgg 3421
Glu Ile Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser Lys Gly Trp
1015 1020 1025
aag ctc ctt gct ccc atc act gct tat gcc cag caa aca cga ggc 3466
Lys Leu Leu Ala Pro Ile Thr Ala Tyr Ala Gln Gln Thr Arg Gly
1030 1035 1040
ctc ctg ggc gcc ata gtg gtg agt atg acg ggg cgt gac agg aca 3511
Leu Leu Gly Ala Ile Val Val Ser Met Thr Gly Arg Asp Arg Thr
1045 1050 1055
gaa cag gcc ggg gaa gtc caa atc ctg tcc aca gtc tct cag tcc 3556
Glu Gln Ala Gly Glu Val Gln Ile Leu Ser Thr Val Ser Gln Ser
1060 1065 1070
ttc ctc gga aca acc atc tcg ggg gtt ttg tgg act gtt tac cac 3601
Phe Leu Gly Thr Thr Ile Ser Gly Val Leu Trp Thr Val Tyr His
1075 1080 1085
gga gct ggc aac aag act cta gcc ggc tta cgg ggt ccg gtc acg 3646
Gly Ala Gly Asn Lys Thr Leu Ala Gly Leu Arg Gly Pro Val Thr
1090 1095 1100
cag atg tac tcg agt gct gag ggg gac ttg gta ggc tgg ccc agc 3691
Gln Met Tyr Ser Ser Ala Glu Gly Asp Leu Val Gly Trp Pro Ser
1105 1110 1115
ccc cct ggg acc aag tct ttg gag ccg tgc aag tgt gga gcc gtc 3736
Pro Pro Gly Thr Lys Ser Leu Glu Pro Cys Lys Cys Gly Ala Val
1120 1125 1130
gac cta tat ctg gtc acg cgg aac gct gat gtc atc ccg gct cgg 3781
Asp Leu Tyr Leu Val Thr Arg Asn Ala Asp Val Ile Pro Ala Arg
1135 1140 1145
aga cgc ggg gac aag cgg gga gca ttg ctc tcc ccg aga ccc att 3826
Arg Arg Gly Asp Lys Arg Gly Ala Leu Leu Ser Pro Arg Pro Ile
1150 1155 1160
tcg acc ttg aag ggg tcc tcg ggg ggg ccg gtg ctc tgc cct agg 3871
Ser Thr Leu Lys Gly Ser Ser Gly Gly Pro Val Leu Cys Pro Arg
1165 1170 1175
ggc cac gtc gtt ggg ctc ttc cga gca gct gtg tgc tct cgg ggc 3916
Gly His Val Val Gly Leu Phe Arg Ala Ala Val Cys Ser Arg Gly
1180 1185 1190
gtg gcc aaa tcc atc gat ttc atc ccc gtt gag aca ctc gac gtt 3961
Val Ala Lys Ser Ile Asp Phe Ile Pro Val Glu Thr Leu Asp Val
1195 1200 1205
gtt aca agg tct ccc act ttc agt gac aac agc acg cca ccg gct 4006
Val Thr Arg Ser Pro Thr Phe Ser Asp Asn Ser Thr Pro Pro Ala
1210 1215 1220
gtg ccc cag acc tat cag gtc ggg tac ttg cat gct cca act ggc 4051
Val Pro Gln Thr Tyr Gln Val Gly Tyr Leu His Ala Pro Thr Gly
1225 1230 1235
agt gga aag agc acc aag gtc cct gtc gcg tat gcc gcc cag ggg 4096
Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr Ala Ala Gln Gly
1240 1245 1250
tac aaa gta cta gtg ctt aac ccc tcg gta gct gcc acc ctg ggg 4141
Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly
1255 1260 1265
ttt ggg gcg tac cta tcc aag gca cat ggc atc aat ccc aac att 4186
Phe Gly Ala Tyr Leu Ser Lys Ala His Gly Ile Asn Pro Asn Ile
1270 1275 1280
agg act gga gtc agg acc gtg atg acc ggg gag gcc atc acg tac 4231
Arg Thr Gly Val Arg Thr Val Met Thr Gly Glu Ala Ile Thr Tyr
1285 1290 1295
tcc aca tat ggc aaa ttt ctc gcc gat ggg ggc tgc gct agc ggc 4276
Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ala Ser Gly
1300 1305 1310
gcc tat gac atc atc ata tgc gat gaa tgc cac gct gtg gat gct 4321
Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ala Val Asp Ala
1315 1320 1325
acc tcc att ctc ggc atc gga acg gtc ctt gat caa gca gag aca 4366
Thr Ser Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Ala Glu Thr
1330 1335 1340
gcc ggg gtc aga cta act gtg ctg gct acg gcc aca ccc ccc ggg 4411
Ala Gly Val Arg Leu Thr Val Leu Ala Thr Ala Thr Pro Pro Gly
1345 1350 1355
tca gtg aca acc ccc cat ccc gat ata gaa gag gta ggc ctc ggg 4456
Ser Val Thr Thr Pro His Pro Asp Ile Glu Glu Val Gly Leu Gly
1360 1365 1370
cgg gag ggt gag atc ccc ttc tat ggg agg gcg att ccc cta tcc 4501
Arg Glu Gly Glu Ile Pro Phe Tyr Gly Arg Ala Ile Pro Leu Ser
1375 1380 1385
tgc atc aag gga ggg aga cac ctg att ttc tgc cac tca aag aaa 4546
Cys Ile Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Lys
1390 1395 1400
aag tgt gac gag ctc gcg gcg gcc ctt cgg ggc atg ggc ttg aat 4591
Lys Cys Asp Glu Leu Ala Ala Ala Leu Arg Gly Met Gly Leu Asn
1405 1410 1415
gcc gtg gca tac tat aga ggg ttg gac gtc tcc ata ata cca gct 4636
Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser Ile Ile Pro Ala
1420 1425 1430
cag gga gat gtg gtg gtc gtc gcc acc gac gcc ctc atg acg ggg 4681
Gln Gly Asp Val Val Val Val Ala Thr Asp Ala Leu Met Thr Gly
1435 1440 1445
tac act gga gac ttt gac tcc gtg atc gac tgc aat gta gcg gtc 4726
Tyr Thr Gly Asp Phe Asp Ser Val Ile Asp Cys Asn Val Ala Val
1450 1455 1460
acc caa gct gtc gac ttc agc ctg gac ccc acc ttc act ata acc 4771
Thr Gln Ala Val Asp Phe Ser Leu Asp Pro Thr Phe Thr Ile Thr
1465 1470 1475
aca cag act gtc cca caa gac gct gtc tca cgc agt cag cgc cgc 4816
Thr Gln Thr Val Pro Gln Asp Ala Val Ser Arg Ser Gln Arg Arg
1480 1485 1490
ggg cgc aca ggt aga gga aga cag ggc act tat agg tat gtt tcc 4861
Gly Arg Thr Gly Arg Gly Arg Gln Gly Thr Tyr Arg Tyr Val Ser
1495 1500 1505
act ggt gaa cga gcc tca gga atg ttt gac agt gta gtg ctt tgt 4906
Thr Gly Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val Leu Cys
1510 1515 1520
gag tgc tac gac gca ggg gct gcg tgg tac gat ctc aca cca gcg 4951
Glu Cys Tyr Asp Ala Gly Ala Ala Trp Tyr Asp Leu Thr Pro Ala
1525 1530 1535
gag acc acc gtc agg ctt aga gcg tat ttc aac acg ccc ggc cta 4996
Glu Thr Thr Val Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu
1540 1545 1550
ccc gtg tgt caa gac cat ctt gaa ttt tgg gag gca gtt ttc acc 5041
Pro Val Cys Gln Asp His Leu Glu Phe Trp Glu Ala Val Phe Thr
1555 1560 1565
ggc ctc aca cac ata gac gcc cac ttc ctc tcc caa aca aag caa 5086
Gly Leu Thr His Ile Asp Ala His Phe Leu Ser Gln Thr Lys Gln
1570 1575 1580
gcg ggg gag aac ttc gcg tac cta gta gcc tac caa gct acg gtg 5131
Ala Gly Glu Asn Phe Ala Tyr Leu Val Ala Tyr Gln Ala Thr Val
1585 1590 1595
tgc gcc aga gcc aag gcc cct ccc ccg tcc tgg gac gcc atg tgg 5176
Cys Ala Arg Ala Lys Ala Pro Pro Pro Ser Trp Asp Ala Met Trp
1600 1605 1610
aag tgc ctg gcc cga ctc aag cct acg ctt gcg ggc ccc aca cct 5221
Lys Cys Leu Ala Arg Leu Lys Pro Thr Leu Ala Gly Pro Thr Pro
1615 1620 1625
ctc ctg tac cgt ttg ggc cct att acc aat gag gtc acc ctc aca 5266
Leu Leu Tyr Arg Leu Gly Pro Ile Thr Asn Glu Val Thr Leu Thr
1630 1635 1640
cac cct ggg acg aag tac atc gcc aca tgc atg caa gct gac ctt 5311
His Pro Gly Thr Lys Tyr Ile Ala Thr Cys Met Gln Ala Asp Leu
1645 1650 1655
gag gtc atg acc agc acg tgg gtc cta gct gga gga gtc ctg gca 5356
Glu Val Met Thr Ser Thr Trp Val Leu Ala Gly Gly Val Leu Ala
1660 1665 1670
gcc gtc gcc gca tat tgc ctg gcg act gga tgc gtt tcc atc atc 5401
Ala Val Ala Ala Tyr Cys Leu Ala Thr Gly Cys Val Ser Ile Ile
1675 1680 1685
ggc cgc ttg cac gtc aac cag cga gtc gtc gtt gcg ccg gat aag 5446
Gly Arg Leu His Val Asn Gln Arg Val Val Val Ala Pro Asp Lys
1690 1695 1700
gag gtc ctg tat gag gct ttt gat gag atg gag gaa tgc gcc tct 5491
Glu Val Leu Tyr Glu Ala Phe Asp Glu Met Glu Glu Cys Ala Ser
1705 1710 1715
agg gcg gct ctc atc gaa gag ggg cag cgg ata gcc gag atg ttg 5536
Arg Ala Ala Leu Ile Glu Glu Gly Gln Arg I1e Ala Glu Met Leu
1720 1725 1730
aag tcc aag atc caa ggc ttg ctg cag cag gcc tct aag cag gcc 5581
Lys Ser Lys Ile Gln Gly Leu Leu Gln Gln Ala Ser Lys Gln Ala
1735 1740 1745
cag gac ata caa ccc gct atg cag gct tca tgg ccc aaa gtg gaa 5626
Gln Asp Ile Gln Pro Ala Met Gln Ala Ser Trp Pro Lys Val Glu
1750 1755 1760
caa ttt tgg gcc aga cac atg tgg aac ttc att agc ggc atc caa 5671
Gln Phe Trp Ala Arg His Met Trp Asn Phe Ile Ser Gly Ile Gln
1765 1770 1775
tac ctc gca gga ttg tca aca ctg cca ggg aac ccc gcg gtg gct 5716
Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala
1780 1785 1790
tcc atg atg gca ttc agt gcc gcc ctc acc agt ccg ttg tcg acc 5761
Ser Met Met Ala Phe Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr
1795 1800 1805
agt acc acc atc ctt ctc aac atc atg gga ggc tgg tta gcg tcc 5806
Ser Thr Thr Ile Leu Leu Asn Ile Met Gly Gly Trp Leu Ala Ser
1810 1815 1820
cag atc gca cca ccc gcg ggg gcc acc ggc ttt gtc gtc agt ggc 5851
Gln Ile Ala Pro Pro Ala Gly Ala Thr Gly Phe Val Val Ser Gly
1825 1830 1835
ctg gtg ggg gct gcc gtg ggc agc ata ggc ctg ggt aag gtg ctg 5896
Leu Val Gly Ala Ala Val Gly Ser Ile Gly Leu Gly Lys Val Leu
1840 1845 1850
gtg gac atc ctg gca gga tat ggt gcg ggc att tcg ggg gcc ctc 5941
Val Asp Ile Leu Ala Gly Tyr Gly Ala Gly Ile Ser Gly Ala Leu
1855 1860 1865
gtc gca ttc aag atc atg tct ggc gag aag ccc tct atg gaa gat 5986
Val Ala Phe Lys Ile Met Ser Gly Glu Lys Pro Ser Met Glu Asp
1870 1875 1880
gtc atc aat cta ctg cct ggg atc ctg tct ccg gga gcc ctg gtg 6031
Val Ile Asn Leu Leu Pro Gly Ile Leu Ser Pro Gly Ala Leu Val
1885 1890 1895
gtg ggg gtc atc tgc gcg gcc att ctg cgc cgc cac gtg gga ccg 6076
Val Gly Val Ile Cys Ala Ala Ile Leu Arg Arg His Val Gly Pro
1900 1905 1910
ggg gag ggc gcg gtc caa tgg atg aac agg ctt att gcc ttt gct 6121
Gly Glu Gly Ala Val Gln Trp Met Asn Arg Leu Ile Ala Phe Ala
1915 1920 1925
tcc aga gga aac cac gtc gcc cct act cac tac gtg acg gag tcg 6166
Ser Arg Gly Asn His Val Ala Pro Thr His Tyr Val Thr Glu Ser
1930 1935 1940
gat gcg tcg cag cgt gtg acc caa cta ctt ggc tct ctt act ata 6211
Asp Ala Ser Gln Arg Val Thr Gln Leu Leu Gly Ser Leu Thr Ile
1945 1950 1955
acc agc cta ctc aga aga ctc cac aat tgg ata act gag gac tgc 6256
Thr Ser Leu Leu Arg Arg Leu His Asn Trp Ile Thr Glu Asp Cys
1960 1965 1970
ccc atc cca tgc tcc gga tcc tgg ctc cgc gac gtg tgg gac tgg 6301
Pro Ile Pro Cys Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp
1975 1980 1985
gtt tgc acc atc ttg aca gac ttc aaa aat tgg ctg acc tct aaa 6346
Val Cys Thr Ile Leu Thr Asp Phe Lys Asn Trp Leu Thr Ser Lys
1990 1995 2000
ttg ttc ccc aag ctg ccc ggc ctc ccc ttc atc tct tgt caa aag 6391
Leu Phe Pro Lys Leu Pro Gly Leu Pro Phe Ile Ser Cys Gln Lys
2005 2010 2015
ggg tac aag ggt gtg tgg gcc ggc act ggc atc atg acc acg cgc 6436
Gly Tyr Lys Gly Val Trp Ala Gly Thr Gly Ile Met Thr Thr Arg
2020 2025 2030
tgc cct tgc ggc gcc aac atc tct ggc aat gtc cgc ctg ggc tct 6481
Cys Pro Cys Gly Ala Asn Ile Ser Gly Asn Val Arg Leu Gly Ser
2035 2040 2045
atg agg atc aca ggg cct aaa acc tgc atg aac acc tgg cag ggg 6526
Met Arg Ile Thr Gly Pro Lys Thr Cys Met Asn Thr Trp Gln Gly
2050 2055 2060
acc ttt cct atc aat tgc tac acg gag ggc cag tgc gcg ccg aaa 6571
Thr Phe Pro Ile Asn Cys Tyr Thr Glu Gly Gln Cys Ala Pro Lys
2065 2070 2075
ccc ccc acg aac tac aag acc gcc atc tgg agg gtg gcg gcc tcg 6616
Pro Pro Thr Asn Tyr Lys Thr Ala Ile Trp Arg Val Ala Ala Ser
2080 2085 2090
gag tac gcg gag gtg acg cag cat ggg tcg tac tcc tat gta aca 6661
Glu Tyr Ala Glu Val Thr Gln His Gly Ser Tyr Ser Tyr Val Thr
2095 2100 2105
gga ctg acc act gac aat ctg aaa att cct tgc caa cta cct tct 6706
Gly Leu Thr Thr Asp Asn Leu Lys Ile Pro Cys Gln Leu Pro Ser
2110 2115 2120
cca gag ttt ttc tcc tgg gtg gac ggt gtg cag atc cat agg ttt 6751
Pro Glu Phe Phe Ser Trp Val Asp Gly Val Gln Ile His Arg Phe
2125 2130 2135
gca ccc aca cca aag ccg ttt ttc cgg gat gag gtc tcg ttc tgc 6796
Ala Pro Thr Pro Lys Pro Phe Phe Arg Asp Glu Val Ser Phe Cys
2140 2145 2150
gtt ggg ctt aat tcc tat gct gtc ggg tcc cag ctt ccc tgt gaa 6841
Val Gly Leu Asn Ser Tyr Ala Val Gly Ser Gln Leu Pro Cys Glu
2155 2160 2165
cct gag ccc gac gca gac gta ttg agg tcc atg cta aca gat ccg 6886
Pro Glu Pro Asp Ala Asp Val Leu Arg Ser Met Leu Thr Asp Pro
2170 2175 2180
ccc cac atc acg gcg gag act gcg gcg cgg cgc ttg gca cgg gga 6931
Pro His Ile Thr Ala Glu Thr Ala Ala Arg Arg Leu Ala Arg Gly
2185 2190 2195
tca cct cca tct gag gcg agc tcc tca gtg agc cag cta tca gca 6976
Ser Pro Pro Ser Glu Ala Ser Ser Ser Val Ser Gln Leu Ser Ala
2200 2205 2210
ccg tcg ctg cgg gcc acc tgc acc acc cac agc aac acc tat gac 7021
Pro Ser Leu Arg Ala Thr Cys Thr Thr His Ser Asn Thr Tyr Asp
2215 2220 2225
gtg gac atg gtc gat gcc aac ctg ctc atg gag ggc ggt gtg gct 7066
Val Asp Met Val Asp Ala Asn Leu Leu Met Glu Gly Gly Val Ala
2230 2235 2240
cag aca gag cct gag tcc agg gtg ccc gtt ctg gac ttt ctc gag 7111
Gln Thr Glu Pro Glu Ser Arg Val Pro Val Leu Asp Phe Leu Glu
2245 2250 2255
cca atg gcc gag gaa gag agc gac ctt gag ccc tca ata cca tcg 7156
Pro Met Ala Glu Glu Glu Ser Asp Leu Glu Pro Ser Ile Pro Ser
2260 2265 2270
gag tgc atg ctc ccc agg agc ggg ttt cca cgg gcc tta ccg gct 7201
Glu Cys Met Leu Pro Arg Ser Gly Phe Pro Arg Ala Leu Pro Ala
2275 2280 2285
tgg gca cgg cct gac tac aac ccg ccg ctc gtg gaa tcg tgg agg 7246
Trp Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Ser Trp Arg
2290 2295 2300
agg cca gat tac caa ccg ccc acc gtt gct ggt tgt gct ctc ccc 7291
Arg Pro Asp Tyr Gln Pro Pro Thr Val Ala Gly Cys Ala Leu Pro
2305 2310 2315
ccc ccc aag aag gcc ccg acg cct ccc cca agg aga cgc cgg aca 7336
Pro Pro Lys Lys Ala Pro Thr Pro Pro Pro Arg Arg Arg Arg Thr
2320 2325 2330
gtg ggt ctg agc gag agc acc ata tca gaa gcc ctc cag caa ctg 7381
Val Gly Leu Ser Glu Ser Thr Ile Ser Glu Ala Leu Gln Gln Leu
2335 2340 2345
gcc atc aag acc ttt ggc cag ccc ccc tcg agc ggt gat gca ggc 7426
Ala Ile Lys Thr Phe Gly Gln Pro Pro Ser Ser Gly Asp Ala Gly
2350 2355 2360
tcg tcc acg ggg gcg ggc gcc gcc gaa tcc ggc ggt ccg acg tcc 7471
Ser Ser Thr Gly Ala Gly Ala Ala Glu Ser Gly Gly Pro Thr Ser
2365 2370 2375
cct ggt gag ccg gcc ccc tca gag aca ggt tcc gcc tcc tct atg 7516
Pro Gly Glu Pro Ala Pro Ser Glu Thr Gly Ser Ala Ser Ser Met
2380 2385 2390
ccc ccc ctc gag ggg gag cct gga gat ccg gac ctg gag tct gat 7561
Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Glu Ser Asp
2395 2400 2405
cag gta gag ctt caa cct ccc ccc cag ggg ggg ggg gta gct ccc 7606
Gln Val Glu Leu Gln Pro Pro Pro Gln Gly Gly Gly Val Ala Pro
2410 2415 2420
ggt tcg ggc tcg ggg tct tgg tct act tgc tcc gag gag gac gat 7651
Gly Ser Gly Ser Gly Ser Trp Ser Thr Cys Ser Glu Glu Asp Asp
2425 2430 2435
acc acc gtg tgc tgc tcc atg tca tac tcc tgg acc ggg gct cta 7696
Thr Thr Val Cys Cys Ser Met Ser Tyr Ser Trp Thr Gly Ala Leu
2440 2445 2450
ata act ccc tgt agc ccc gaa gag gaa aag ttg cca atc aac cct 7741
Ile Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro Ile Asn Pro
2455 2460 2465
ttg agt aac tcg ctg ttg cga tac cat aac aag gtg tac tgt aca 7786
Leu Ser Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr Cys Thr
2470 2475 2480
aca tca aag agc gcc tca cag agg gct aaa aag gta act ttt gac 7831
Thr Ser Lys Ser Ala Ser Gln Arg Ala Lys Lys Val Thr Phe Asp
2485 2490 2495
agg acg caa gtg ctc gac gcc cat tat gac tca gtc tta aag gac 7876
Arg Thr Gln Val Leu Asp Ala His Tyr Asp Ser Val Leu Lys Asp
2500 2505 2510
atc aag cta gcg gct tcc aag gtc agc gca agg ctc ctc acc ttg 7921
Ile Lys Leu Ala Ala Ser Lys Val Ser Ala Arg Leu Leu Thr Leu
2515 2520 2525
gag gag gcg tgc cag ttg act cca ccc cat tct gca aga tcc aag 7966
Glu Glu Ala Cys Gln Leu Thr Pro Pro His Ser Ala Arg Ser Lys
2530 2535 2540
tat gga ttc ggg gcc aag gag gtc cgc agc ttg tcc ggg agg gcc 8011
Tyr Gly Phe Gly Ala Lys Glu Val Arg Ser Leu Ser Gly Arg Ala
2545 2550 2555
gtt aac cac atc aag tcc gtg tgg aag gac ctc ctg gaa gac cca 8056
Val Asn His Ile Lys Ser Val Trp Lys Asp Leu Leu Glu Asp Pro
2560 2565 2570
caa aca cca att ccc aca acc atc atg gcc aaa aat gag gtg ttc 8101
Gln Thr Pro Ile Pro Thr Thr Ile Met Ala Lys Asn Glu Val Phe
2575 2580 2585
tgc gtg gac ccc gcc aag ggg ggt aag aaa cca gct cgc ctc atc 8146
Cys Val Asp Pro Ala Lys Gly Gly Lys Lys Pro Ala Arg Leu Ile
2590 2595 2600
gtt tac cct gac ctc ggc gtc cgg gtc tgc gag aaa atg gcc ctc 8191
Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu
2605 2610 2615
tat gac att aca caa aag ctt cct cag gcg gta atg gga gct tcc 8236
Tyr Asp Ile Thr Gln Lys Leu Pro Gln Ala Val Met Gly Ala Ser
2620 2625 2630
tat ggc ttc cag tac tcc cct gcc caa cgg gtg gag tat ctc ttg 8281
Tyr Gly Phe Gln Tyr Ser Pro Ala Gln Arg Val Glu Tyr Leu Leu
2635 2640 2645
aaa gca tgg gcg gaa aag aag gac ccc atg ggt ttt tcg tat gat 8326
Lys Ala Trp Ala Glu Lys Lys Asp Pro Met Gly Phe Ser Tyr Asp
2650 2655 2660
acc cga tgc ttc gac tca acc gtc act gag aga gac atc agg acc 8371
Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Arg Asp Ile Arg Thr
2665 2670 2675
gag gag tcc ata tac cag gcc tgc tcc ctg ccc gag gag gcc cgc 8416
Glu Glu Ser Ile Tyr Gln Ala Cys Ser Leu Pro Glu Glu Ala Arg
2680 2685 2690
act gcc ata cac tcg ctg act gag aga ctt tac gta gga ggg ccc 8461
Thr Ala Ile His Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly Pro
2695 2700 2705
atg ttc aac agc aag ggt caa acc tgc ggt tac aga cgt tgc cgc 8506
Met Phe Asn Ser Lys Gly Gln Thr Cys Gly Tyr Arg Arg Cys Arg
2710 2715 2720
gcc agc ggg gtg cta acc act agc atg ggt aac acc atc aca tgc 8551
Ala Ser Gly Val Leu Thr Thr Ser Met Gly Asn Thr Ile Thr Cys
2725 2730 2735
tat gtg aaa gcc cta gcg gcc tgc aag gct gcg ggg ata gtt gcg 8596
Tyr Val Lys Ala Leu Ala Ala Cys Lys Ala Ala Gly Ile Val Ala
2740 2745 2750
ccc aca atg ctg gta tgc ggc gat gac cta gta gtc atc tca gaa 8641
Pro Thr Met Leu Val Cys Gly Asp Asp Leu Val Val Ile Ser Glu
2755 2760 2765
agc cag ggg act gag gag gac gag cgg aac ctg aga gcc ttc acg 8686
Ser Gln Gly Thr Glu Glu Asp Glu Arg Asn Leu Arg Ala Phe Thr
2770 2775 2780
gag gcc atg acc agg tac tct gcc cct cct ggt gat ccc ccc aga 8731
Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Arg
2785 2790 2795
ccg gaa tat gac ctg gag cta ata aca tcc tgt tcc tca aat gtg 8776
Pro Glu Tyr Asp Leu Glu Leu Ile Thr Ser Cys Ser Ser Asn Val
2800 2805 2810
tct gtg gcg ttg ggc ccg cgg ggc cgc cgc aga tac tac ctg acc 8821
Ser Val Ala Leu Gly Pro Arg Gly Arg Arg Arg Tyr Tyr Leu Thr
2815 2820 2825
aga gac cca acc act cca ctc gcc cgg gct gcc tgg gaa aca gtt 8866
Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr Val
2830 2835 2840
aga cac tcc cct atc aat tca tgg ctg gga aac atc atc cag tat 8911
Arg His Ser Pro Ile Asn Ser Trp Leu Gly Asn Ile Ile Gln Tyr
2845 2850 2855
gct cca acc ata tgg gtt cgc atg gtc cta atg aca cac ttc ttc 8956
Ala Pro Thr Ile Trp Val Arg Met Val Leu Met Thr His Phe Phe
2860 2865 2870
tcc att ctc atg gtc caa gac acc ctg gac cag aac ctc aac ttt 9001
Ser Ile Leu Met Val Gln Asp Thr Leu Asp Gln Asn Leu Asn Phe
2875 2880 2885
gag atg tat gga tca gta tac tcc gtg aat cct ttg gac ctt cca 9046
Glu Met Tyr Gly Ser Val Tyr Ser Val Asn Pro Leu Asp Leu Pro
2890 2895 2900
gcc ata att gag agg tta cac ggg ctt gac gcc ttt tct atg cac 9091
Ala Ile Ile Glu Arg Leu His Gly Leu Asp Ala Phe Ser Met His
2905 2910 2915
aca tac tct cac cac gaa ctg acg cgg gtg gct tca gcc ctc aga 9136
Thr Tyr Ser His His Glu Leu Thr Arg Val Ala Ser Ala Leu Arg
2920 2925 2930
aaa ctt ggg gcg cca ccc ctc agg gtg tgg aag agt cgg gct cgc 9181
Lys Leu Gly Ala Pro Pro Leu Arg Val Trp Lys Ser Arg Ala Arg
2935 2940 2945
gca gtc agg gcg tcc ctc atc tcc cgt gga ggg aaa gcg gcc gtt 9226
Ala Val Arg Ala Ser Leu Ile Ser Arg Gly Gly Lys Ala Ala Val
2950 2955 2960
tgc ggc cga tat ctc ttc aat tgg gcg gtg aag acc aag ctc aaa 9271
Cys Gly Arg Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu Lys
2965 2970 2975
ctc act cca ttg ccg gag gcg cgc cta ctg gac tta tcc agt tgg 9316
Leu Thr Pro Leu Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp
2980 2985 2990
ttc acc gtc ggc gcc ggc ggg ggc gac att ttt cac agc gtg tcg 9361
Phe Thr Val Gly Ala Gly Gly Gly Asp Ile Phe His Ser Val Ser
2995 3000 3005
cgc gcc cga ccc cgc tca tta ctc ttc ggc cta ctc cta ctt ttc 9406
Arg Ala Arg Pro Arg Ser Leu Leu Phe Gly Leu Leu Leu Leu Phe
3010 3015 3020
gta ggg gta ggc ctc ttc cta ctc ccc gct cgg tag agcggcacac 9452
Val Gly Val Gly Leu Phe Leu Leu Pro Ala Arg
3025 3030
actaggtaca ctccatagct aactgttcct tttttttttt tttttttttt tttttttttt 9512
tttttttttt ttcttttttt tttttttccc tctttcttcc cttctcatct tattctactt 9572
tctttcttgg tggctccatc ttagccctag tcacggctag ctgtgaaagg tccgtgagcc 9632
gcatgactgc agagagtgcc gtaactggtc tctctgcaga tcatgt 9678
<210>2
<211>3033
<212>PRT
<213>丙型肝炎病毒JFH1株
<400>2
Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn
1 5 10 15
Arg Arg Pro Glu Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly
20 25 30
Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Thr
35 40 45
Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro
50 55 60
Ile Pro Lys Asp Arg Arg Ser Thr Gly Lys Ala Trp Gly Lys Pro Gly
65 70 75 80
Arg Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp
85 90 95
Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro
100 105 110
Arg His Arg Ser Arg Asn Val Gly Lys Val Ile Asp Thr Leu Thr Cys
115 120 125
Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Val Val Gly Ala Pro Leu
130 135 140
Ser Gly Ala Ala Arg Ala Val Ala His Gly Val Arg Val Leu Glu Asp
145 150 155 160
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Phe Pro Phe Ser Ile
165 170 175
Phe Leu Leu Ala Leu Leu Ser Cys Ile Thr Val Pro Val Ser Ala Ala
180 185 190
Gln Val Lys Asn Thr Ser Ser Ser Tyr Met Val Thr Asn Asp Cys Ser
195 200 205
Asn Asp Ser Ile Thr Trp Gln Leu Glu Ala Ala Val Leu His Val Pro
210 215 220
Gly Cys Val Pro Cys Glu Arg Val Gly Asn Thr Ser Arg Cys Trp Val
225 230 235 240
Pro Val Ser Pro Asn Met Ala Val Arg Gln Pro Gly Ala Leu Thr Gln
245 250 255
Gly Leu Arg Thr His Ile Asp Met Val Val Met Ser Ala Thr Phe Cys
260 265 270
Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala
275 280 285
Gln Val Phe Ile Val Ser Pro Gln Tyr His Trp Phe Val Gln Glu Cys
290 295 300
Asn Cys Ser Ile Tyr Pro Gly Thr Ile Thr Gly His Arg Met Ala Trp
305 310 315 320
Asp Met Met Met Asn Trp Ser Pro Thr Ala Thr Met Ile Leu Ala Tyr
325 330 335
Val Met Arg Val Pro Glu Val Ile Ile Asp Ile Val Ser Gly Ala His
340 345 350
Trp Gly Val Met Phe Gly Leu Ala Tyr Phe Ser Met Gln Gly Ala Trp
355 360 365
Ala Lys Val Ile Val Ile Leu Leu Leu Ala Ala Gly Val Asp Ala Gly
370 375 380
Thr Thr Thr Val Gly Gly Ala Val Ala Arg Ser Thr Asn Val Ile Ala
385 390 395 400
Gly Val Phe Ser His Gly Pro Gln Gln Asn Ile Gln Leu Ile Asn Thr
405 410 415
Asn Gly Ser Trp His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser
420 425 430
Leu Asn Thr Gly Phe Leu Ala Ala Leu Phe Tyr Thr Asn Arg Phe Asn
435 440 445
Ser Ser Gly Cys Pro Gly Arg Leu Ser Ala Cys Arg Asn Ile Glu Ala
450 455 460
Phe Arg Ile Gly Trp Gly Thr Leu Gln Tyr Glu Asp Asn Val Thr Asn
465 470 475 480
Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Lys Pro Cys
485 490 495
Gly Val Val Pro Ala Arg Ser Val Cys Gly Pro Val Tyr Cys Phe Thr
500 505 510
Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Arg Gly Val Pro Thr
515 520 525
Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr
530 535 540
Arg Pro Pro Gln Gly Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Thr
545 550 555 560
Gly Phe Thr Lys Thr Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp
565 570 575
Phe Asn Ala Ser Thr Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys
580 585 590
His Pro Asp Ala Thr Tyr Ile Lys Cys Gly Ser Gly Pro Trp Leu Thr
595 600 605
Pro Lys Cys Leu Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys
610 615 620
Thr Val Asn Phe Thr Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val
625 630 635 640
Glu His Arg Leu Thr Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys
645 650 655
Asp Leu Glu Asp Arg Asp Arg Ser Gln Leu Ser Pro Leu Leu His Ser
660 665 670
Thr Thr Glu Trp Ala Ile Leu Pro Cys Thr Tyr Ser Asp Leu Pro Ala
675 680 685
Leu Ser Thr Gly Leu Leu His Leu His Gln Asn Ile Val Asp Val Gln
690 695 700
Tyr Met Tyr Gly Leu Ser Pro Ala Ile Thr Lys Tyr Val Val Arg Trp
705 710 715 720
Glu Trp Val Val Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys
725 730 735
Ala Cys Leu Trp Met Leu Ile Leu Leu Gly Gln Ala Glu Ala Ala Leu
740 745 750
Glu Lys Leu Val Val Leu His Ala Ala Ser Ala Ala Asn Cys His Gly
755 760 765
Leu Leu Tyr Phe Ala Ile Phe Phe Val Ala Ala Trp His Ile Arg Gly
770 775 780
Arg Val Val Pro Leu Thr Thr Tyr Cys Leu Thr Gly Leu Trp Pro Phe
785 790 795 800
Cys Leu Leu Leu Met Ala Leu Pro Arg Gln Ala Tyr Ala Tyr Asp Ala
805 810 815
Pro Val His Gly Gln Ile Gly Val Gly Leu Leu Ile Leu Ile Thr Leu
820 825 830
Phe Thr Leu Thr Pro Gly Tyr Lys Thr Leu Leu Gly Gln Cys Leu Trp
835 840 845
Trp Leu Cys Tyr Leu Leu Thr Leu Gly Glu Ala Met Ile Gln Glu Trp
850 855 860
Val Pro Pro Met Gln Val Arg Gly Gly Arg Asp Gly Ile Ala Trp Ala
865 870 875 880
Val Thr Ile Phe Cys Pro Gly Val Val Phe Asp Ile Thr Lys Trp Leu
885 890 895
Leu Ala Leu Leu Gly Pro Ala Tyr Leu Leu Arg Ala Ala Leu Thr His
900 905 910
Val Pro Tyr Phe Val Arg Ala His Ala Leu Ile Arg Val Cys Ala Leu
915 920 925
Val Lys Gln Leu Ala Gly Gly Arg Tyr Val Gln Val Ala Leu Leu Ala
930 935 940
Leu Gly Arg Trp Thr Gly Thr Tyr Ile Tyr Asp His Leu Thr Pro Met
945 950 955 960
Ser Asp Trp Ala Ala Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu
965 970 975
Pro Ile Ile Phe Ser Pro Met Glu Lys Lys Val Ile Val Trp Gly Ala
980 985 990
Glu Thr Ala Ala Cys Gly Asp Ile Leu His Gly Leu Pro Val Ser Ala
995 1000 1005
Arg Leu Gly Gln Glu Ile Leu Leu Gly Pro Ala Asp Gly Tyr Thr
1010 1015 1020
Ser Lys Gly Trp Lys Leu Leu Ala Pro Ile Thr Ala Tyr Ala Gln
1025 1030 1035
Gln Thr Arg Gly Leu Leu Gly Ala Ile Val Val Ser Met Thr Gly
1040 1045 1050
Arg Asp Arg Thr Glu Gln Ala Gly Glu Val Gln Ile Leu Ser Thr
1055 1060 1065
Val Ser Gln Ser Phe Leu Gly Thr Thr Ile Ser Gly Val Leu Trp
1070 1075 1080
Thr Val Tyr His Gly Ala Gly Asn Lys Thr Leu Ala Gly Leu Arg
1085 1090 1095
Gly Pro Val Thr Gln Met Tyr Ser Ser Ala Glu Gly Asp Leu Val
1100 1105 1110
Gly Trp Pro Ser Pro Pro Gly Thr Lys Ser Leu Glu Pro Cys Lys
1115 1120 1125
Cys Gly Ala Val Asp Leu Tyr Leu Val Thr Arg Asn Ala Asp Val
1130 1135 1140
Ile Pro Ala Arg Arg Arg Gly Asp Lys Arg Gly Ala Leu Leu Ser
1145 1150 1155
Pro Arg Pro Ile Ser Thr Leu Lys Gly Ser Ser Gly Gly Pro Val
1160 1165 1170
Leu Cys Pro Arg Gly His Val Val Gly Leu Phe Arg Ala Ala Val
1175 1180 1185
Cys Ser Arg Gly Val Ala Lys Ser Ile Asp Phe Ile Pro Val Glu
1190 1195 1200
Thr Leu Asp Val Val Thr Arg Ser Pro Thr Phe Ser Asp Asn Ser
1205 1210 1215
Thr Pro Pro Ala Val Pro Gln Thr Tyr Gln Val Gly Tyr Leu His
1220 1225 1230
Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr
1235 1240 1245
Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala
1250 1255 1260
Ala Thr Leu Gly Phe Gly Ala Tyr Leu Ser Lys Ala His Gly Ile
1265 1270 1275
Asn Pro Asn Ile Arg Thr Gly Val Arg Thr Val Met Thr Gly Glu
1280 1285 1290
Ala Ile Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly
1295 1300 1305
Cys Ala Ser Gly Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His
1310 1315 1320
Ala Val Asp Ala Thr Ser Ile Leu Gly Ile Gly Thr Val Leu Asp
1325 1330 1335
Gln Ala Glu Thr Ala Gly Val Arg Leu Thr Val Leu Ala Thr Ala
1340 1345 1350
Thr Pro Pro Gly Ser Val Thr Thr Pro His Pro Asp Ile Glu Glu
1355 1360 1365
Val Gly Leu Gly Arg Glu Gly Glu Ile Pro Phe Tyr Gly Arg Ala
1370 1375 1380
Ile Pro Leu Ser Cys Ile Lys Gly Gly Arg His Leu Ile Phe Cys
1385 1390 1395
His Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Ala Leu Arg Gly
1400 1405 1410
Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser
1415 1420 1425
Ile Ile Pro Ala Gln Gly Asp Val Val Val Val Ala Thr Asp Ala
1430 1435 1440
Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val Ile Asp Cys
1445 1450 1455
Asn Val Ala Val Thr Gln Ala Val Asp Phe Ser Leu Asp Pro Thr
1460 1465 1470
Phe Thr Ile Thr Thr Gln Thr Val Pro Gln Asp Ala Val Ser Arg
1475 1480 1485
Ser Gln Arg Arg Gly Arg Thr Gly Arg Gly Arg Gln Gly Thr Tyr
1490 1495 1500
Arg Tyr Val Ser Thr Gly Glu Arg Ala Ser Gly Met Phe Asp Ser
1505 1510 1515
Val Val Leu Cys Glu Cys Tyr Asp Ala Gly Ala Ala Trp Tyr Asp
1520 1525 1530
Leu Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Phe Asn
1535 1540 1545
Thr Pro Gly Leu Pro Val Cys Gln Asp His Leu Glu Phe Trp Glu
1550 1555 1560
Ala Val Phe Thr Gly Leu Thr His Ile Asp Ala His Phe Leu Ser
1565 1570 1575
Gln Thr Lys Gln Ala Gly Glu Asn Phe Ala Tyr Leu Val Ala Tyr
1580 1585 1590
Gln Ala Thr Val Cys Ala Arg Ala Lys Ala Pro Pro Pro Ser Trp
1595 1600 1605
Asp Ala Met Trp Lys Cys Leu Ala Arg Leu Lys Pro Thr Leu Ala
1610 1615 1620
Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Pro Ile Thr Asn Glu
1625 1630 1635
Val Thr Leu Thr His Pro Gly Thr Lys Tyr Ile Ala Thr Cys Met
1640 1645 1650
Gln Ala Asp Leu Glu Val Met Thr Ser Thr Trp Val Leu Ala Gly
1655 1660 1665
Gly Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala Thr Gly Cys
1670 1675 1680
Val Ser Ile Ile Gly Arg Leu His Val Asn Gln Arg Val Val Val
1685 1690 1695
Ala Pro Asp Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met Glu
1700 1705 1710
Glu Cys Ala Ser Arg Ala Ala Leu Ile Glu Glu Gly Gln Arg Ile
1715 1720 1725
Ala Glu Met Leu Lys Ser Lys Ile Gln Gly Leu Leu Gln Gln Ala
1730 1735 1740
Ser Lys Gln Ala Gln Asp Ile Gln Pro Ala Met Gln Ala Ser Trp
1745 1750 1755
Pro Lys Val Glu Gln Phe Trp Ala Arg His Met Trp Asn Phe Ile
1760 1765 1770
Ser Gly Ile Gln Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn
1775 1780 1785
Pro Ala Val Ala Ser Met Met Ala Phe Ser Ala Ala Leu Thr Ser
1790 1795 1800
Pro Leu Ser Thr Ser Thr Thr Ile Leu Leu Asn Ile Met Gly Gly
1805 1810 1815
Trp Leu Ala Ser Gln Ile Ala Pro Pro Ala Gly Ala Thr Gly Phe
1820 1825 1830
Val Val Ser Gly Leu Val Gly Ala Ala Val Gly Ser Ile Gly Leu
1835 1840 1845
Gly Lys Val Leu Val Asp Ile Leu Ala Gly Tyr Gly Ala Gly Ile
1850 1855 1860
Ser Gly Ala Leu Val Ala Phe Lys Ile Met Ser Gly Glu Lys Pro
1865 1870 1875
Ser Met Glu Asp Val Ile Asn Leu Leu Pro Gly Ile Leu Ser Pro
1880 1885 1890
Gly Ala Leu Val Val Gly Val Ile Cys Ala Ala Ile Leu Arg Arg
1895 1900 1905
His Val Gly Pro Gly Glu Gly Ala Val Gln Trp Met Asn Arg Leu
1910 1915 1920
Ile Ala Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr His Tyr
1925 1930 1935
Val Thr Glu Ser Asp Ala Ser Gln Arg Val Thr Gln Leu Leu Gly
1940 1945 1950
Ser Leu Thr Ile Thr Ser Leu Leu Arg Arg Leu His Asn Trp Ile
1955 1960 1965
Thr Glu Asp Cys Pro Ile Pro Cys Ser Gly Ser Trp Leu Arg Asp
1970 1975 1980
Val Trp Asp Trp Val Cys Thr Ile Leu Thr Asp Phe Lys Asn Trp
1985 1990 1995
Leu Thr Ser Lys Leu Phe Pro Lys Leu Pro Gly Leu Pro Phe Ile
2000 2005 2010
Ser Cys Gln Lys Gly Tyr Lys Gly Val Trp Ala Gly Thr Gly Ile
2015 2020 2025
Met Thr Thr Arg Cys Pro Cys Gly Ala Asn Ile Ser Gly Asn Val
2030 2035 2040
Arg Leu Gly Ser Met Arg Ile Thr Gly Pro Lys Thr Cys Met Asn
2045 2050 2055
Thr Trp Gln Gly Thr Phe Pro Ile Asn Cys Tyr Thr Glu Gly Gln
2060 2065 2070
Cys Ala Pro Lys Pro Pro Thr Asn Tyr Lys Thr Ala Ile Trp Arg
2075 2080 2085
Val Ala Ala Ser Glu Tyr Ala Glu Val Thr Gln His Gly Ser Tyr
2090 2095 2100
Ser Tyr Val Thr Gly Leu Thr Thr Asp Asn Leu Lys Ile Pro Cys
2105 2110 2115
Gln Leu Pro Ser Pro Glu Phe Phe Ser Trp Val Asp Gly Val Gln
2120 2125 2130
Ile His Arg Phe Ala Pro Thr Pro Lys Pro Phe Phe Arg Asp Glu
2135 2140 2145
Val Ser Phe Cys Val Gly Leu Asn Ser Tyr Ala Val Gly Ser Gln
2150 2155 2160
Leu Pro Cys Glu Pro Glu Pro Asp Ala Asp Val Leu Arg Ser Met
2165 2170 2175
Leu Thr Asp Pro Pro His Ile Thr Ala Glu Thr Ala Ala Arg Arg
2180 2185 2190
Leu Ala Arg Gly Ser Pro Pro Ser Glu Ala Ser Ser Ser Val Ser
2195 2200 2205
Gln Leu Ser Ala Pro Ser Leu Arg Ala Thr Cys Thr Thr His Ser
2210 2215 2220
Asn Thr Tyr Asp Val Asp Met Val Asp Ala Asn Leu Leu Met Glu
2225 2230 2235
Gly Gly Val Ala Gln Thr Glu Pro Glu Ser Arg Val Pro Val Leu
2240 2245 2250
Asp Phe Leu Glu Pro Met Ala Glu Glu Glu Ser Asp Leu Glu Pro
2255 2260 2265
Ser Ile Pro Ser Glu Cys Met Leu Pro Arg Ser Gly Phe Pro Arg
2270 2275 2280
A1a Leu Pro Ala Trp Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val
2285 2290 2295
Glu Ser Trp Arg Arg Pro Asp Tyr Gln Pro Pro Thr Val Ala Gly
2300 2305 2310
Cys Ala Leu Pro Pro Pro Lys Lys Ala Pro Thr Pro Pro Pro Arg
2315 2320 2325
Arg Arg Arg Thr Val Gly Leu Ser Glu Ser Thr Ile Ser Glu Ala
2330 2335 2340
Leu Gln Gln Leu Ala Ile Lys Thr Phe Gly Gln Pro Pro Ser Ser
2345 2350 2355
Gly Asp Ala Gly Ser Ser Thr Gly Ala Gly Ala Ala Glu Ser Gly
2360 2365 2370
Gly Pro Thr Ser Pro Gly Glu Pro Ala Pro Ser Glu Thr Gly Ser
2375 2380 2385
Ala Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp
2390 2395 2400
Leu Glu Ser Asp Gln Val Glu Leu Gln Pro Pro Pro Gln Gly Gly
2405 2410 2415
Gly Val Ala Pro Gly Ser Gly Ser Gly Ser Trp Ser Thr Cys Ser
2420 2425 2430
Glu Glu Asp Asp Thr Thr Val Cys Cys Ser Met Ser Tyr Ser Trp
2435 2440 2445
Thr Gly Ala Leu Ile Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu
2450 2455 2460
Pro Ile Asn Pro Leu Ser Asn Ser Leu Leu Arg Tyr His Asn Lys
2465 2470 2475
Val Tyr Cys Thr Thr Ser Lys Ser Ala Ser Gln Arg Ala Lys Lys
2480 2485 2490
Val Thr Phe Asp Arg Thr Gln Val Leu Asp Ala His Tyr Asp Ser
2495 2500 2505
Val Leu Lys Asp Ile Lys Leu Ala Ala Ser Lys Val Ser Ala Arg
2510 2515 2520
Leu Leu Thr Leu Glu Glu Ala Cys Gln Leu Thr Pro Pro His Ser
2525 2530 2535
Ala Arg Ser Lys Tyr Gly Phe Gly Ala Lys Glu Val Arg Ser Leu
2540 2545 2550
Ser Gly Arg Ala Val Asn His Ile Lys Ser Val Trp Lys Asp Leu
2555 2560 2565
Leu Glu Asp Pro Gln Thr Pro Ile Pro Thr Thr Ile Met Ala Lys
2570 2575 2580
Asn Glu Val Phe Cys Val Asp Pro Ala Lys Gly Gly Lys Lys Pro
2585 2590 2595
Ala Arg Leu Ile Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu
2600 2605 2610
Lys Met Ala Leu Tyr Asp Ile Thr Gln Lys Leu Pro Gln Ala Val
2615 2620 2625
Met Gly Ala Ser Tyr Gly Phe Gln Tyr Ser Pro Ala Gln Arg Val
2630 2635 2640
Glu Tyr Leu Leu Lys Ala Trp Ala Glu Lys Lys Asp Pro Met Gly
2645 2650 2655
Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Arg
2660 2665 2670
Asp Ile Arg Thr Glu Glu Ser Ile Tyr Gln Ala Cys Ser Leu Pro
2675 2680 2685
Glu Glu Ala Arg Thr Ala Ile His Ser Leu Thr Glu Arg Leu Tyr
2690 2695 2700
Val Gly Gly Pro Met Phe Asn Ser Lys Gly Gln Thr Cys Gly Tyr
2705 2710 2715
Arg Arg Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Met Gly Asn
2720 2725 2730
Thr Ile Thr Cys Tyr Val Lys Ala Leu Ala Ala Cys Lys Ala Ala
2735 2740 2745
Gly Ile Val Ala Pro Thr Met Leu Val Cys Gly Asp Asp Leu Val
2750 2755 2760
Val Ile Ser Glu Ser Gln Gly Thr Glu Glu Asp Glu Arg Asn Leu
2765 2770 2775
Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly
2780 2785 2790
Asp Pro Pro Arg Pro Glu Tyr Asp Leu Glu Leu Ile Thr Ser Cys
2795 2800 2805
Ser Ser Asn Val Ser Val Ala Leu Gly Pro Arg Gly Arg Arg Arg
2810 2815 2820
Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala
2825 2830 2835
Trp Glu Thr Val Arg His Ser Pro Ile Asn Ser Trp Leu Gly Asn
2840 2845 2850
Ile Ile Gln Tyr Ala Pro Thr Ile Trp Val Arg Met Val Leu Met
2855 2860 2865
Thr His Phe Phe Ser Ile Leu Met Val Gln Asp Thr Leu Asp Gln
2870 2875 2880
Asn Leu Asn Phe Glu Met Tyr Gly Ser Val Tyr Ser Val Asn Pro
2885 2890 2895
Leu Asp Leu Pro Ala Ile Ile Glu Arg Leu His Gly Leu Asp Ala
2900 2905 2910
Phe Ser Met His Thr Tyr Ser His His Glu Leu Thr Arg Val Ala
2915 2920 2925
Ser Ala Leu Arg Lys Leu Gly Ala Pro Pro Leu Arg Val Trp Lys
2930 2935 2940
Ser Arg Ala Arg Ala Val Arg Ala Ser Leu Ile Ser Arg Gly Gly
2945 2950 2955
Lys Ala Ala Val Cys Gly Arg Tyr Leu Phe Asn Trp Ala Val Lys
2960 2965 2970
Thr Lys Leu Lys Leu Thr Pro Leu Pro Glu Ala Arg Leu Leu Asp
2975 2980 2985
Leu Ser Ser Trp Phe Thr Val Gly Ala Gly Gly Gly Asp Ile Phe
2990 2995 3000
His Ser Val Ser Arg Ala Arg Pro Arg Ser Leu Leu Phe Gly Leu
3005 3010 3015
Leu Leu Leu Phe Val Gly Val Gly Leu Phe Leu Leu Pro Ala Arg
3020 3025 3030
<210>3
<211>9678
<212>DNA
<213>人工序列
<220>
<223>JFH1突变体
<220>
<223>JFH1-A/WT
<400>3
acctgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60
cttcacgcag aaagcgccta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120
ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtgcaccgg aattgccggg 180
aagactgggt cctttcttgg atacacccac tctatgcccg gccatttggg cgtgcccccg 240
caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300
cgcttgcgag tgccccggga ggtctcgtag accgtgcacc atgagcacaa atcctaaacc 360
tcaaagaaaa accaaaagaa acaccaaccg tcgcccagaa gacgttaagt tcccgggcgg 420
cggccagatc gttggcggag tatacttgtt gccgcgcagg ggccccaggt tgggtgtgcg 480
cacgacaagg aaaacttcgg agcggtccca gccacgtggg agacgccagc ccatccccaa 540
agatcggcgc tccactggca cggcctgggg taaaccaggt cgcccctggc ccctatatgg 600
gaatgaggga ctcggctggg caggatggct cctgtccccc cgaggctctc gcccctcctg 660
gggccccact gacccccggc ataggtcgcg caacgtgggt aaagtcatcg acaccctaac 720
gtgtggcttt gccgacctca tggggtacat ccccgtcgta ggcgccccgc ttagtggcgc 780
cgccagagct gtcgcgcacg gcgtgagagt cctggaggac ggggttaatt atgcaacagg 840
gaacctacct ggtttcccct tttctatctt cttgctggcc ctgttgtcct gcatcaccgt 900
tccggtctct gctgcccagg tgaagaatac cagtagcagc tacatggtga ccaatgactg 960
ctccaatgac agcatcactt ggcagctcga ggctgcggtt ctccacgtcc ccgggtgcgt 1020
cccgtgcgag agagtgggga atacgtcacg gtgttgggtg ccagtctcgc caaacatggc 1080
tgtgcggcag cccggtgccc tcacgcaggg tctgcggacg cacatcgata tggttgtgat 1140
gtccgccacc ttctgctctg ctctctacgt gggggacctc tgtggcgggg tgatgctcgc 1200
ggcccaggtg ttcatcgtct cgccgcagca ccactggttt gtgcaggaat gcaattgctc 1260
catctaccct ggcaccatca ctggacaccg catggcatgg gacatgatga tgaactggtc 1320
gcccacgacc accatgatcc tggcgtacgt gatgcgcgtc cccgaggtca tcatagacat 1380
cgttagcggg gctcactggg gcgtcatgtt cggcttggcc tacttctcta tgcagggagc 1440
gtgggcgaag gtcattgtca tccttctgct ggccgctggg gtggacgcgg gcaccaccac 1500
cgttggaggc gccgttgcac gtcccaccaa cgtgattgcc ggcgtgttca gccatggccc 1560
tcagcagaac attcagctca ttaacaccag cggcagttgg cacatcaacc gtactgcctt 1620
gaattgcaat gactccttga acaccggctt tctcgcggcc ttgttctaca ccaaccgctt 1680
taactcgtca gggtgtccag ggcgcctgtc cgcctgccgc aacatcgagg ctttccggat 1740
agggtggggc accctacagt acgaggataa tgtcaccaat ccagagggta tgaggccgta 1800
ctgctggcac taccccccaa agccgtgtgg cgtagtcccc acgaggtctg tgtgtggccc 1860
agtgtactgt ttcaccccca gcccggtagt agtgggcacg accgacagac gtggagtgcc 1920
cacctacaca tggggagaga atgagacaga tgtcttccta ctgaacagca cccgaccgcc 1980
gcagggctca tggttcggct gcacgtggat gaactccact ggtttcacca agacttgtgg 2040
cgcgccacct tgccgcacca gagctgactt caacgccagc acggacttgt tgtgccctac 2100
ggattgtttt aggaagcatc ctgatgccac ttatattaag tgtggttctg ggccctggct 2160
cacaccaaag tgcctggtcc actaccctta cagactctgg cattacccct gcacagtcaa 2220
ttttaccatc ttcaagataa gaatgtatgt agggggggtt gagcacaggc tcacggccgc 2280
atgcaacttc actcgtgggg atcgctgcga cttggaggac agggacagga gtcagctgtc 2340
tcctctgttg cactctacca cggaatgggc catcctgccc tgcacctact cagacttacc 2400
cgctttgtca actggtcttc tccaccttca ccagaacatc gtggacgtac aatacatgta 2460
tggcctctca cctgctatca caaaatacgt cgttcgatgg gagtgggtgg tactcttatt 2520
cctgctctta gcggacgcca gagtctgcgc ctgcttgtgg atgctcatct tgttgggcca 2580
ggccgaagca gcattggaga agttggtcgt cttgcacgct gcgagtgcgg ctaactgcca 2640
tggcctccta tattttgcca tcttcttcgt ggcagcttgg cacatcaggg gtcgggtggt 2700
ccccttgacc acctattgcc ttactggcct atggcccttc tgcctactgc tcatggcact 2760
gccccggcag gcttatgcct atgacgcacc tgtgcacgga cagataggcg tgggtttgtt 2820
gatattgatc accctcttca cactcacccc ggggtataag accctcctcg gccagtgtct 2880
gtggtggttg tgctatctcc tgaccctggg ggaagccatg attcgggagt gggtaccacc 2940
catgcaggtg cgcggcggcc gcgatggcat cgcgtgggcc gtcactatat tctgcccggg 3000
cgtggtgttt gacattacca aatggctttt ggcgttgctt gggcctgctt acctcttaag 3060
ggccgctttg acccatgtgc cgtacttcgt cagagctcac gctctgataa gggtatgcgc 3120
tttggtgaag cggctcgcgg ggggtaggta tgttcaggtg gcgctgttgg cccttggcag 3180
gtggactggc acctacatct atgaccacct cacacctatg gcggactggg ccgctagcgg 3240
cctgcgcgac ttagcggtcg ccgtggaacc catcatcttc agtccgatgg agaagaaggt 3300
catcgtctgg ggagcggaga cggctgcatg tggggacatt ctacatggac ttcccgtgtc 3360
cgcccgactc ggccaggaga tcctcctcgg cccagctgat ggctacacct ccaaggggtg 3420
gaagctcctt gctcccatca ctgcttatgc ccagcaaaca cgaggcctcc tgggcgccat 3480
agtggtgagt atgacggggc gtgacaggac agaacaggcc ggggaagtcc aaatcctgtc 3540
cacagtctct cagtccttcc tcggaacaac catctcgggg gttttgtgga ctgtttacca 3600
cggagctggc aacaagactc tagccggctt acgaggtccg gtcacgcaga tgtactcgag 3660
tgctgagggg gacttggtag gctggcccag cccccctggg accaagtctt tggagccgtg 3720
caagtgtgga gccgtcgacc tatatctggt cacgcggaac gctgatgtca tcccggctcg 3780
gagacgcggg gataagcggg gagcattgct ctccccgaga cccatttcga ccttgaaggg 3840
gtcctcgggg gggccggtgc tctgccctag gggccacgtc gttgggctct tccgagcagc 3900
tgtgtgctct cggggcgtgg ccaaatccat cgatttcatc cccgttgaga cactcgacgt 3960
tgttacaagg tctcccactt tcagtgacaa cagcacgcca ccggctgtgc cccagaccta 4020
tcaggtcggg tacttgcatg ctccaactgg cagtggaaag agcaccaagg tccctgtcgc 4080
gtatgccgcc caggggtaca aagtactagt gcttaacccc tcggtagctg ccaccctggg 4140
gtttggggcg tacctatcca aggcacatgg catcaatccc aacattagga ctggagtcag 4200
gaccgtgatg accggggagg ccatcacgta ctccacatat ggcaaatttc tcgccgatgg 4260
gggctgcgct agcggcgcct atgacatcat catatgcgat gaatgccacg ctgtggatgc 4320
tacctccatt ctcggcatcg gaacggtcct tgatcaagca gagacagccg gggtcagact 4380
aactgtgctg gctacggcca caccccccgg gtcagtgaca accccccatc ccgatataga 4440
agaggtaggc ctcgggcggg agggtgagat ccccttctat gggagggcga ttcccctatc 4500
ctgcatcaag ggagggagac acctgatttt ctgccactca aagaaaaagt gtgacgagct 4560
cgcggcggcc cttcggggca tgggcttgaa tgccgtggca tactatagag ggttggacgt 4620
ctccataata ccagctcagg gagatgtggt ggtcgtcgcc accgacgccc tcatgacggg 4680
gtacactgga gactttgact ccgtgatcga ctgcaatgta gcggtcaccc aagctgtcga 4740
cttcagcctg gaccccacct tcactataac cacacagact gtcccacaag acgctgtctc 4800
acgcagtcag cgccgcgggc gcacaggtag aggaagacag ggcacttata ggtatgtttc 4860
cactggtgaa cgagcctcag gaatgtttga cagtgtagtg ctttgtgagt gctacgacgc 4920
aggggctgcg tggtacgatc tcacaccagc ggagaccacc gtcaggctta gagcgtattt 4980
caacacgccc ggcctacccg tgtgtcaaga ccatcttgaa ttttgggagg cagttttcac 5040
cggcctcaca cacatagacg cccacttcct ctcccaaaca aagcaagcgg gggagaactt 5100
cgcgtaccta gtagcctacc aagctacggt gtgcgccaga gccaaggccc ctcccccgtc 5160
ctgggacgcc atgtggaagt gcctggcccg actcaagcct acgcttgcgg gccccacacc 5220
tctcctgtac cgtttgggcc ctattaccaa tgaggtcacc ctcacacacc ctgggacgaa 5280
gtacatcgcc acatgcatgc aagctgacct tgaggtcatg accagcacgt gggtcctagc 5340
tggaggagtc ctggcagccg tcgccgcata ttgcctggcg actggatgcg tttccatcat 5400
cggccgcttg cacgtcaacc agcgagtcgt cgttgcgccg gataaggagg tcctgtatga 5460
ggcttttgat gagatggagg aatgcgcctc tagggcggct ctcatcgaag aggggcagcg 5520
gatagccgag atgttgaagt ccaagatcca aggcttgctg cagcaggcct ctaagcaggc 5580
ccaggacata caacccgcta tgcaggcttc atggcccaaa gtggaacaat tttgggccag 5640
acacatgtgg aacttcatta gcggcatcca atacctcgca ggattgtcaa cactgccagg 5700
gaaccccgcg gtggcttcca tgatggcatt cagtgccgcc ctcaccagtc cgttgtcgac 5760
cagtaccacc atccttctca acatcatggg aggctggtta gcgtcccaga tcgcaccacc 5820
cgcgggggcc accggctttg tcgtcagtgg cctggtgggg gctgccgtgg gcagcatagg 5880
cctgggtaag gtgctggtgg acatcctggc aggatatggt gcgggcattt cgggggccct 5940
cgtcgcattc aagatcatgt ctggcgagaa gccctctatg gaagatgtca tcaatctact 6000
gcctgggatc ctgtctccgg gagccctggt ggtgggggtc atctgcgcgg ccattctgcg 6060
ccgccacgtg ggaccggggg agggcgcggt ccaatggatg aacaggctta ttgcctttgc 6120
ttccagagga aaccacgtcg cccctactca ctacgtgacg gagtcggatg cgtcgcagcg 6180
tgtgacccaa ctacttggct ctcttactat aaccagccta ctcagaagac tccacaattg 6240
gataactgag gactgcccca tcccatgctc cggatcctgg ctccgcgacg tgtgggactg 6300
ggtttgcacc atcttgacag acttcaaaaa ttggctgacc tctaaattgt tccccaagct 6360
gcccggcctc cccttcatct cttgtcaaaa ggggtacaag ggtgtgtggg ccggcactgg 6420
catcatgacc acgcgctgcc cttgcggcgc caacatctct ggcaatgtcc gcctgggctc 6480
tatgaggatc acagggccta aaacctgcat gaacacctgg caggggacct ttcctatcaa 6540
ttgctacacg gagggccagt gcgcgccgaa accccccacg aactacaaga ccgccatctg 6600
gagggtggcg gcctcggagt acgcggaggt gacgcagcat gggtcgtact cctatgtaac 6660
aggactgacc actgacaatc tgaaaattcc ttgccaacta ccttctccag agtttttctc 6720
ctgggtggac ggtgtgcaga tccataggtt tgcacccaca ccaaagccgt ttttccggga 6780
tgaggtctcg ttctgcgttg ggcttaattc ctatgctgtc gggtcccagc ttccctgtga 6840
acctgagccc gacgcagacg tattgaggtc catgctaaca gatccgcccc acatcacggc 6900
ggagactgcg gcgcggcgct tggcacgggg atcacctcca tctgaggcga gctcctcagt 6960
gagccagcta tcagcaccgt cgctgcgggc cacctgcacc acccacagca acacctatga 7020
cgtggacatg gtcgatgcca acctgctcat ggagggcggt gtggctcaga cagagcctga 7080
gtccagggtg cccgttctgg actttctcga gccaatggcc gaggaagaga gcgaccttga 7140
gccctcaata ccatcggagt gcatgctccc caggagcggg tttccacggg ccttaccggc 7200
ttgggcacgg cctgactaca acccgccgct cgtggaatcg tggaggaggc cagattacca 7260
accgcccacc gttgctggtt gtgctctccc cccccccaag aaggccccga cgcctccccc 7320
aaggagacgc cggacagtgg gtctgagcga gagcaccata tcagaagccc tccagcaact 7380
ggccatcaag acctttggcc agcccccctc gagcggtgat gcaggctcgt ccacgggggc 7440
gggcgccgcc gaatccggcg gtccgacgtc ccctggtgag ccggccccct cagagacagg 7500
ttccgcctcc tctatgcccc ccctcgaggg ggagcctgga gatccggacc tggagtctga 7560
tcaggtagag cttcaacctc ccccccaggg ggggggggta gctcccggtt cgggctcggg 7620
gtcttggtct acttgctccg aggaggacga taccaccgtg tgctgctcca tgtcatactc 7680
ctggaccggg gctctaataa ctccctgtag ccccgaagag gaaaagttgc caatcaaccc 7740
tttgagtaac tcgctgttgc gataccataa caaggtgtac tgtacaacat caaagagcgc 7800
ctcacagagg gctaaaaagg taacttttga caggacgcaa gtgctcgacg cccattatga 7860
ctcagtctta aaggacatca agctagcggc ttccaaggtc agcgcaaggc tcctcacctt 7920
ggaggaggcg tgccagttga ctccacccca ttctgcaaga tccaagtatg gattcggggc 7980
caaggaggtc cgcagcttgt ccgggagggc cgttaaccac atcaagtccg tgtggaagga 8040
cctcctggaa gacccacaaa caccaattcc cacaaccatc atggccaaaa atgaggtgtt 8100
ctgcgtggac cccgccaagg ggggtaagaa accagctcgc ctcatcgttt accctgacct 8160
cggcgtccgg gtctgcgaga aaatggccct ctatgacatt acacaaaagc ttcctcaggc 8220
ggtaatggga gcttcctatg gcttccagta ctcccctgcc caacgggtgg agtatctctt 8280
gaaagcatgg gcggaaaaga aggaccccat gggtttttcg tatgataccc gatgcttcga 8340
ctcaaccgtc actgagagag acatcaggac cgaggagtcc atataccagg cctgctccct 8400
gcccgaggag gcccgcactg ccatacactc gctgactgag agactttacg taggagggcc 8460
catgttcaac agcaagggtc aaacctgcgg ttacagacgt tgccgcgcca gcggggtgct 8520
aaccactagc atgggtaaca ccatcacatg ctatgtgaaa gccctagcgg cctgcaaggc 8580
tgcggggata gttgcgccca caatgctggt atgcggcgat gacctagtag tcatctcaga 8640
aagccagggg actgaggagg acgagcggaa cctgagagcc ttcacggagg ccatgaccag 8700
gtactctgcc cctcctggtg atccccccag accggaatat gacctggagc taataacatc 8760
ctgttcctca aatgtgtctg tggcgttggg cccgcggggc cgccgcagat actacctgac 8820
cagagaccca accactccac tcgcccgggc tgcctgggaa acagttagac actcccctat 8880
caattcatgg ctgggaaaca tcatccagta tgctccaacc atatgggttc gcatggtcct 8940
aatgacacac ttcttctcca ttctcatggt ccaagacacc ctggaccaga acctcaactt 9000
tgagatgtat ggatcagtat actccgtgaa tcctttggac cttccagcca taattgagag 9060
gttacacggg cttgacgcct tttctatgca cacatactct caccacgaac tgacgcgggt 9120
ggcttcagcc ctcagaaaac ttggggcgcc acccctcagg gtgtggaaga gtcgggctcg 9180
cgcagtcagg gcgtccctca tctcccgtgg agggaaagcg gccgtttgcg gccgatatct 9240
cttcaattgg gcggtgaaga ccaagctcaa actcactcca ttgccggagg cgcgcctact 9300
ggacttatcc agttggttca ccgtcggcgc cggcgggggc gacatttttc acagcgtgtc 9360
gcgcgcccga ccccgctcat tactcttcgg cctactccta cttttcgtag gggtaggcct 9420
cttcctactc cccgctcggt agagcggcac acactaggta cactccatag ctaactgttc 9480
cttttttttt tttttttttt tttttttttt tttttttttt ttttcttttt tttttttttc 9540
cctctttctt cccttctcat cttattctac tttctttctt ggtggctcca tcttagccct 9600
agtcacggct agctgtgaaa ggtccgtgag ccgcatgact gcagagagtg ccgtaactgg 9660
tctctctgca gatcatgt 9678
<210>4
<211>9678
<212>DNA
<213>人工序列
<220>
<223>JFH1突变体
<220>
<223>JFH1-B/WT
<400>4
acctgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60
cttcacgcag aaagcgccta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120
ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 180
aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg cgtgcccccg 240
caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300
cgcttgcgag tgccccggga ggtctcgtag accgtgcacc atgagcacaa atcctaaacc 360
tcaaagaaaa accaaaagaa acaccaaccg tcgcccagaa gacgttaagt tcccgggcgg 420
cggccagatc gctggcggag tatacttgtt gccgcgcagg ggccccaggt tgggtgtgcg 480
cacgacaagg aaaacttcgg agcggtccca gccacgtggg agacgccagc ccatccccaa 540
agatcggcgc tccactggca cggcctgggg aaaaccaggt cgcccctggc ccctatatgg 600
gaatgaggga ctcggctggg caggatggct cctgtccccc cgaggctctc gcccctcctg 660
gggccccact gacccccggc ataggtcgcg caacgtgggt aaagtcatcg acaccctaac 720
gtgtggcttt gccgacctca tggggtacat ccccgtcgta ggcgccccgc ttagtggcgc 780
cgccagagct gtcgcgcacg gcgtgagagt cctggaggac ggggttaatt atgcaacagg 840
gaacctaccc ggtttcccct tttctatctt cttgctggcc ctgttgtcct gcatcaccgt 900
tccggtctct gctgcccagg tgaagaatac cagtagcagc tacatggtga ccaatgactg 960
ctccaatgac agcatcactt ggcagctcga ggctgcagtt ctccacgtcc ccgggtgcgt 1020
cccgtgcgag agagtgggga atacgtcacg gtgttgggtg ccagtctcgc caaacatggc 1080
tgtgcggcag cccggtgccc tcacgcaggg tctgcggacg cacatcgata tggttgtgat 1140
gtccgccacc ttctgctctg ctctctacgt gggggacctc tgtggcgggg tgatgctcgc 1200
ggcccaggtg ttcatcgtct cgccgcagta ccactggttt gtgcaggaat gcaattgctc 1260
catctaccct ggcaccatca ctggacaccg catggcatgg gacatgatga tgaactggtc 1320
gcccacggcc accatgatcc tggcgtacgt gatgcgcgtc cccgaggtca tcatagacat 1380
cgttagcggg gctcactggg gcgtcatgtt cggcttggcc tacttctcta tgcagggagc 1440
gtgggcgaag gtcattgtca tccttctgct ggccgctggg gtggacgcgg gcaccaccac 1500
cgttgggggc gctgttgcac gttccaccaa cgtgattgcc ggcgtgttca gccatggccc 1560
tcagcagaac attcagctca ttaacaccaa cggcagctgg cacatcaacc gtactgcctt 1620
gaattgcaat gactccttga acaccggctt tctcgcggcc ttgttctaca ccaaccgctt 1680
taactcgtca aggtgtccag ggcgcctgtc cgcctgccgc aacatcgagg ctttccggat 1740
agggtggggc accctacagt acgaggataa tgtcaccaat ccagaggata tgaggccgta 1800
ctgctggcac taccccccaa agccgtgtgg cgtagtcccc gcgaggtctg tgtgtggccc 1860
agtgtactgt ttcactccca gcccggtagt agtgggcacg accgacagac gtggagtgcc 1920
cacctacaca tggggagaga atgagacaga tgtcttccta ctgaacagca cccgaccgcc 1980
acagggctca tggttcggct gcacgtggat gaactccact ggtttcacca agacttgtgg 2040
cgcgccacct tgccgcacca gagctgactt caacgccagc acggacttgt tgtgccctac 2100
ggattgtttt aggaagcatc ctgatgccac ttatattaag tgtggttctg ggccctggct 2160
cacaccaaag tgcctggtcc actaccctta cagactctgg cattacccct gcacagtcaa 2220
ttttaccatc ttcaagataa gaatgtatgt agggggggtt gagcacaggc tcacggccgc 2280
atgcaacttc actcgtgggg atcgctgcga cttggaggac agggacagga gtcagctgtc 2340
tcctctgttg cactctacca cggaatgggc catcctgccc tgcacctact cagacttacc 2400
cgctttgtca actggtcttc tccaccttca ccagaacatc gtggacgtac aatacatgta 2460
tggcctctca cctgctatca caaaatacgt cgttcgatgg gagtgggtgg tactcttatt 2520
cctgctctta gcggacgcca gagtctgcgc ctgcctgtgg atgctcatct tgttgggcca 2580
ggccgaagca gcattggaga agttggccgt cttgcacgct gcgagtgcgg ctaactgcca 2640
tggcctccta tattttgcca tcttcttcgt ggcagcttgg cacatcaggg gtcgggcggt 2700
ccccttgacc acctattgcc tcactggcct atggcccttc tgcctactgc tcatggcact 2760
gccccggcag gcttatgcct atgacgcacc tgtgcacgga cagataggcg tgggtttgtt 2820
gatattgatc accctcttca cactcacccc ggggtataag accctcctcg gccagtgtct 2880
gtggtggttg tgctatctcc tgaccctggg ggaagccatg attcgggagt gggtaccacc 2940
catgcaggtg cgcggcggcc gcgatggcat cgcgtgggcc gtcactatat tctgcccggg 3000
tgtggtgttt gacattacca aatggctttt ggcgttgctt gggcctgctt acctcttaag 3060
ggccgctttg acacatgtgc cgtacttcgt cagagctcac gctctgataa gggtatgcgc 3120
tttggtgaag cagctcgcgg ggggtaggta tgttcaggtg gcgctattgg cccttggcag 3180
gtggactggc acctacatct atgaccacct cacacctatg tcggactggg ccgctagcgg 3240
cctgcgcgac ttagcggtcg ccgtggaacc catcatcttc agtccgatgg agaagaaggt 3300
catcgtctgg ggagcggaga cggctgcatg tggggacatt ctacatggac ttcccgtgtc 3360
cgcccgactc ggccaggaga tcctcctcgg cccagctgat ggctacacct ccaaggggtg 3420
gaagctcctt gctcccatca ctgcttatgc ccagcaaaca cgaggcctcc tgggcgccat 3480
agtggtgagt atgacggggc gtgacaggac agaacaggcc ggggaagtcc aaatcctgtc 3540
cacagtctct cagtccttcc tcggaacaac catctcgggg gttttgtgga ctgtttacca 3600
cggagctggc aacaagactc tagccggctt acggggtccg gtcacgcaga tgtactcgag 3660
tgctgagggg gacttggtag gctggcccag cccccctggg accaagtctt tggagccgtg 3720
caagtgtgga gccgtcgacc tatatctggt cacgcggaac gctgatgtta tcccggctcg 3780
gagacgcggg gacaagcggg gagcattgct ctccccgaga cccatttcga ccttgaaggg 3840
gtcctcgggg gggccggtgc tctgccctag gggccacgtc gttgggctct tccgagcagc 3900
tgtgtgctct cggggcgtgg ccaaatccat cgatttcatc cccgttgaga cactcgacgt 3960
tgttacaagg tctcccacct tcagtgacaa cagcacgcca ccggctgtgc cccagaccta 4020
tcaggtcggg tacttgcatg ctccaactgg cagtggaaag agcaccaagg tccctgtcgc 4080
gtatgccgcc caggggtaca aagtactagt gcttaacccc tcggtagctg ccaccctggg 4140
gtttggggcg tacctatcca aggcacatgg catcaatccc aacattagga ctggagtcag 4200
gaccgtgatg accggggagg ccatcacgta ctccacatat ggcaaatttc tcgccgatgg 4260
gggctgcgct agcggcgcct atgacatcat catatgcgat gaatgccacg ctgtggatgc 4320
tacctccatt ctcggcatcg gaacggtcct tgatcaagca gagacagccg gggtcagact 4380
aactgtgctg gctacggcca caccccccgg gtcagtgaca accccccatc ccgatataga 4440
agaggtaggc ctcgggcggg agggtgagat ccccttctat gggagggcga ttcccctatc 4500
ctgcatcaag ggagggagac acctgatttt ctgccactca aagaaaaagt gtgacgagct 4560
cgcggcggcc cttcggggca tgggcttgaa tgccgtggca tactatagag ggttggacgt 4620
ctccataata ccagctcagg gagatgtggt ggtcgtcgcc accgacgccc tcatgacggg 4680
gtacactgga gactttgact ccgtgatcga ctgcaatgta gcggtcaccc aagctgtcga 4740
cttcagcctg gaccccacct tcactataac cacacagact gtcccacaag acgctgtctc 4800
acgcagtcag cgccgcgggc gcacaggtag aggaagacag ggcacttata ggtatgtttc 4860
cactggtgaa cgagcctcag gaatgtttga cagtgtagtg ctttgtgagt gctacgacgc 4920
aggggctgcg tggtacgatc tcacaccagc ggagaccacc gtcaggctta gagcgtattt 4980
caacacgccc ggcctacccg tgtgtcaaga ccatcttgaa ttttgggagg cagttttcac 5040
cggcctcaca cacatagacg cccacttcct ctcccaaaca aagcaagcgg gggagaactt 5100
cgcgtaccta gtagcctacc aagctacggt gtgcgccaga gccaaggccc ctcccccgtc 5160
ctgggacgcc atgtggaagt gcctggcccg actcaagcct acgcttgcgg gccccacacc 5220
tctcctgtac cgtttgggcc ctattaccaa tgaggtcacc ctcacacacc ctgggacgaa 5280
gtacatcgcc acatgcatgc aagctgacct tgaggtcatg accagcacgt gggtcctagc 5340
tggaggagtc ctggcagccg tcgccgcata ttgcctggcg actggatgcg tttccatcat 5400
cggccgcttg cacgtcaacc agcgagtcgt cgttgcgccg gataaggagg tcctgtatga 5460
ggcttttgat gagatggagg aatgcgcctc tagggcggct ctcatcgaag aggggcagcg 5520
gatagccgag atgttgaagt ccaagatcca aggcttgctg cagcaggcct ctaagcaggc 5580
ccaggacata caacccgcta tgcaggcttc atggcccaaa gtggaacaat tttgggccag 5640
acacatgtgg aacttcatta gcggcatcca atacctcgca ggattgtcaa cactgccagg 5700
gaaccccgcg gtggcttcca tgatggcatt cagtgccgcc ctcaccagtc cgttgtcgac 5760
cagtaccacc atccttctca acatcatggg aggctggtta gcgtcccaga tcgcaccacc 5820
cgcgggggcc accggctttg tcgtcagtgg cctggtgggg gctgccgtgg gcagcatagg 5880
cctgggtaag gtgctggtgg acatcctggc aggatatggt gcgggcattt cgggggccct 5940
cgtcgcattc aagatcatgt ctggcgagaa gccctctatg gaagatgtca tcaatctact 6000
gcctgggatc ctgtctccgg gagccctggt ggtgggggtc atctgcgcgg ccattctgcg 6060
ccgccacgtg ggaccggggg agggcgcggt ccaatggatg aacaggctta ttgcctttgc 6120
ttccagagga aaccacgtcg cccctactca ctacgtgacg gagtcggatg cgtcgcagcg 6180
tgtgacccaa ctacttggct ctcttactat aaccagccta ctcagaagac tccacaattg 6240
gataactgag gactgcccca tcccatgctc cggatcctgg ctccgcgacg tgtgggactg 6300
ggtttgcacc atcttgacag acttcaaaaa ttggctgacc tctaaattgt tccccaagct 6360
gcccggcctc cccttcatct cttgtcaaaa ggggtacaag ggtgtgtggg ccggcactgg 6420
catcatgacc acgcgctgcc cttgcggcgc caacatctct ggcaatgtcc gcctgggctc 6480
tatgaggatc acagggccta aaacctgcat gaacacctgg caggggacct ttcctatcaa 6540
ttgctacacg gagggccagt gcgcgccgaa accccccacg aactacaaga ccgccatctg 6600
gagggtggcg gcctcggagt acgcggaggt gacgcagcat gggtcgtact cctatgtaac 6660
aggactgacc actgacaatc tgaaaattcc ttgccaacta ccttctccag agtttttctc 6720
ctgggtggac ggtgtgcaga tccataggtt tgcacccaca ccaaagccgt ttttccggga 6780
tgaggtctcg ttctgcgttg ggcttaattc ctatgctgtc gggtcccagc ttccctgtga 6840
acctgagccc gacgcagacg tattgaggtc catgctaaca gatccgcccc acatcacggc 6900
ggagactgcg gcgcggcgct tggcacgggg atcacctcca tctgaggcga gctcctcagt 6960
gagccagcta tcagcaccgt cgctgcgggc cacctgcacc acccacagca acacctatga 7020
cgtggacatg gtcgatgcca acctgctcat ggagggcggt gtggctcaga cagagcctga 7080
gtccagggtg cccgttctgg actttctcga gccaatggcc gaggaagaga gcgaccttga 7140
gccctcaata ccatcggagt gcatgctccc caggagcggg tttccacggg ccttaccggc 7200
ttgggcacgg cctgactaca acccgccgct cgtggaatcg tggaggaggc cagattacca 7260
accgcccacc gttgctggtt gtgctctccc cccccccaag aaggccccga cgcctccccc 7320
aaggagacgc cggacagtgg gtctgagcga gagcaccata tcagaagccc tccagcaact 7380
ggccatcaag acctttggcc agcccccctc gagcggtgat gcaggctcgt ccacgggggc 7440
gggcgccgcc gaatccggcg gtccgacgtc ccctggtgag ccggccccct cagagacagg 7500
ttccgcctcc tctatgcccc ccctcgaggg ggagcctgga gatccggacc tggagtctga 7560
tcaggtagag cttcaacctc ccccccaggg ggggggggta gctcccggtt cgggctcggg 7620
gtcttggtct acttgctccg aggaggacga taccaccgtg tgctgctcca tgtcatactc 7680
ctggaccggg gctctaataa ctccctgtag ccccgaagag gaaaagttgc caatcaaccc 7740
tttgagtaac tcgctgttgc gataccataa caaggtgtac tgtacaacat caaagagcgc 7800
ctcacagagg gctaaaaagg taacttttga caggacgcaa gtgctcgacg cccattatga 7860
ctcagtctta aaggacatca agctagcggc ttccaaggtc agcgcaaggc tcctcacctt 7920
ggaggaggcg tgccagttga ctccacccca ttctgcaaga tccaagtatg gattcggggc 7980
caaggaggtc cgcagcttgt ccgggagggc cgttaaccac atcaagtccg tgtggaagga 8040
cctcctggaa gacccacaaa caccaattcc cacaaccatc atggccaaaa atgaggtgtt 8100
ctgcgtggac cccgccaagg ggggtaagaa accagctcgc ctcatcgttt accctgacct 8160
cggcgtccgg gtctgcgaga aaatggccct ctatgacatt acacaaaagc ttcctcaggc 8220
ggtaatggga gcttcctatg gcttccagta ctcccctgcc caacgggtgg agtatctctt 8280
gaaagcatgg gcggaaaaga aggaccccat gggtttttcg tatgataccc gatgcttcga 8340
ctcaaccgtc actgagagag acatcaggac cgaggagtcc atataccagg cctgctccct 8400
gcccgaggag gcccgcactg ccatacactc gctgactgag agactttacg taggagggcc 8460
catgttcaac agcaagggtc aaacctgcgg ttacagacgt tgccgcgcca gcggggtgct 8520
aaccactagc atgggtaaca ccatcacatg ctatgtgaaa gccctagcgg cctgcaaggc 8580
tgcggggata gttgcgccca caatgctggt atgcggcgat gacctagtag tcatctcaga 8640
aagccagggg actgaggagg acgagcggaa cctgagagcc ttcacggagg ccatgaccag 8700
gtactctgcc cctcctggtg atccccccag accggaatat gacctggagc taataacatc 8760
ctgttcctca aatgtgtctg tggcgttggg cccgcggggc cgccgcagat actacctgac 8820
cagagaccca accactccac tcgcccgggc tgcctgggaa acagttagac actcccctat 8880
caattcatgg ctgggaaaca tcatccagta tgctccaacc atatgggttc gcatggtcct 8940
aatgacacac ttcttctcca ttctcatggt ccaagacacc ctggaccaga acctcaactt 9000
tgagatgtat ggatcagtat actccgtgaa tcctttggac cttccagcca taattgagag 9060
gttacacggg cttgacgcct tttctatgca cacatactct caccacgaac tgacgcgggt 9120
ggcttcagcc ctcagaaaac ttggggcgcc acccctcagg gtgtggaaga gtcgggctcg 9180
cgcagtcagg gcgtccctca tctcccgtgg agggaaagcg gccgtttgcg gccgatatct 9240
cttcaattgg gcggtgaaga ccaagctcaa actcactcca ttgccggagg cgcgcctact 9300
ggacttatcc agttggttca ccgtcggcgc cggcgggggc gacatttttc acagcgtgtc 9360
gcgcgcccga ccccgctcat tactcttcgg cctactccta cttttcgtag gggtaggcct 9420
cttcctactc cccgctcggt agagcggcac acactaggta cactccatag ctaactgttc 9480
cttttttttt tttttttttt tttttttttt tttttttttt ttttcttttt tttttttttc 9540
cctctttctt cccttctcat cttattctac tttctttctt ggtggctcca tcttagccct 9600
agtcacggct agctgtgaaa ggtccgtgag ccgcatgact gcagagagtg ccgtaactgg 9660
tctctctgca gatcatgt 9678
<210>5
<211>9678
<212>DNA
<213>人工序列
<220>
<223>JFH1突变体
<220>
<223>JFH1-Q862R
<400>5
acctgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60
cttcacgcag aaagcgccta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120
ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 180
aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg cgtgcccccg 240
caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300
cgcttgcgag tgccccggga ggtctcgtag accgtgcacc atgagcacaa atcctaaacc 360
tcaaagaaaa accaaaagaa acaccaaccg tcgcccagaa gacgttaagt tcccgggcgg 420
cggccagatc gttggcggag tatacttgtt gccgcgcagg ggccccaggt tgggtgtgcg 480
cacgacaagg aaaacttcgg agcggtccca gccacgtggg agacgccagc ccatccccaa 540
agatcggcgc tccactggca aggcctgggg aaaaccaggt cgcccctggc ccctatatgg 600
gaatgaggga ctcggctggg caggatggct cctgtccccc cgaggctctc gcccctcctg 660
gggccccact gacccccggc ataggtcgcg caacgtgggt aaagtcatcg acaccctaac 720
gtgtggcttt gccgacctca tggggtacat ccccgtcgta ggcgccccgc ttagtggcgc 780
cgccagagct gtcgcgcacg gcgtgagagt cctggaggac ggggttaatt atgcaacagg 840
gaacctaccc ggtttcccct tttctatctt cttgctggcc ctgttgtcct gcatcaccgt 900
tccggtctct gctgcccagg tgaagaatac cagtagcagc tacatggtga ccaatgactg 960
ctccaatgac agcatcactt ggcagctcga ggctgcggtt ctccacgtcc ccgggtgcgt 1020
cccgtgcgag agagtgggga atacgtcacg gtgttgggtg ccagtctcgc caaacatggc 1080
tgtgcggcag cccggtgccc tcacgcaggg tctgcggacg cacatcgata tggttgtgat 1140
gtccgccacc ttctgctctg ctctctacgt gggggacctc tgtggcgggg tgatgctcgc 1200
ggcccaggtg ttcatcgtct cgccgcagta ccactggttt gtgcaagaat gcaattgctc 1260
catctaccct ggcaccatca ctggacaccg catggcatgg gacatgatga tgaactggtc 1320
gcccacggcc accatgatcc tggcgtacgt gatgcgcgtc cccgaggtca tcatagacat 1380
cgttagcggg gctcactggg gcgtcatgtt cggcttggcc tacttctcta tgcagggagc 1440
gtgggcgaag gtcattgtca tccttctgct ggccgctggg gtggacgcgg gcaccaccac 1500
cgttggaggc gctgttgcac gttccaccaa cgtgattgcc ggcgtgttca gccatggccc 1560
tcagcagaac attcagctca ttaacaccaa cggcagttgg cacatcaacc gtactgcctt 1620
gaattgcaat gactccttga acaccggctt tctcgcggcc ttgttctaca ccaaccgctt 1680
taactcgtca gggtgtccag ggcgcctgtc cgcctgccgc aacatcgagg ctttccggat 1740
agggtggggc accctacagt acgaggataa tgtcaccaat ccagaggata tgaggccgta 1800
ctgctggcac taccccccaa agccgtgtgg cgtagtcccc gcgaggtctg tgtgtggccc 1860
agtgtactgt ttcaccccca gcccggtagt agtgggcacg accgacagac gtggagtgcc 1920
cacctacaca tggggagaga atgagacaga tgtcttccta ctgaacagca cccgaccgcc 1980
gcagggctca tggttcggct gcacgtggat gaactccact ggtttcacca agacttgtgg 2040
cgcgccacct tgccgcacca gagctgactt caacgccagc acggacttgt tgtgccctac 2100
ggattgtttt aggaagcatc ctgatgccac ttatattaag tgtggttctg ggccctggct 2160
cacaccaaag tgcctggtcc actaccctta cagactctgg cattacccct gcacagtcaa 2220
ttttaccatc ttcaagataa gaatgtatgt agggggggtt gagcacaggc tcacggccgc 2280
atgcaacttc actcgtgggg atcgctgcga cttggaggac agggacagga gtcagctgtc 2340
tcctctgttg cactctacca cggaatgggc catcctgccc tgcacctact cagacttacc 2400
cgctttgtca actggtcttc tccaccttca ccagaacatc gtggacgtac aatacatgta 2460
tggcctctca cctgctatca caaaatacgt cgttcgatgg gagtgggtgg tactcttatt 2520
cctgctctta gcggacgcca gagtctgcgc ctgcttgtgg atgctcatct tgttgggcca 2580
ggccgaagca gcattggaga agttggtcgt cttgcacgct gcgagtgcgg ctaactgcca 2640
tggcctccta tattttgcca tcttcttcgt ggcagcttgg cacatcaggg gtcgggtggt 2700
ccccttgacc acctattgcc tcactggcct atggcccttc tgcctactgc tcatggcact 2760
gccccggcag gcttatgcct atgacgcacc tgtgcacgga cagataggcg tgggtttgtt 2820
gatattgatc accctcttca cactcacccc ggggtataag accctcctcg gccagtgtct 2880
gtggtggttg tgctatctcc tgaccctggg ggaagccatg attcgggagt gggtaccacc 2940
catgcaggtg cgcggcggcc gcgatggcat cgcgtgggcc gtcactatat tctgcccggg 3000
tgtggtgttt gacattacca aatggctttt ggcgttgctt gggcctgctt acctcttaag 3060
ggccgctttg acacatgtgc cgtacttcgt cagagctcac gctctgataa gggtatgcgc 3120
tttggtgaag cagctcgcgg ggggtaggta tgttcaggtg gcgctattgg cccttggcag 3180
gtggactggc acctacatct atgaccacct cacacctatg tcggactggg ccgctagcgg 3240
cctgcgcgac ttagcggtcg ccgtggaacc catcatcttc agtccgatgg agaagaaggt 3300
catcgtctgg ggagcggaga cggctgcatg tggggacatt ctacatggac ttcccgtgtc 3360
cgcccgactc ggccaggaga tcctcctcgg cccagctgat ggctacacct ccaaggggtg 3420
gaagctcctt gctcccatca ctgcttatgc ccagcaaaca cgaggcctcc tgggcgccat 3480
agtggtgagt atgacggggc gtgacaggac agaacaggcc ggggaagtcc aaatcctgtc 3540
cacagtctct cagtccttcc tcggaacaac catctcgggg gttttgtgga ctgtttacca 3600
cggagctggc aacaagactc tagccggctt acggggtccg gtcacgcaga tgtactcgag 3660
tgctgagggg gacttggtag gctggcccag cccccctggg accaagtctt tggagccgtg 3720
caagtgtgga gccgtcgacc tatatctggt cacgcggaac gctgatgtca tcccggctcg 3780
gagacgcggg gacaagcggg gagcattgct ctccccgaga cccatttcga ccttgaaggg 3840
gtcctcgggg gggccggtgc tctgccctag gggccacgtc gttgggctct tccgagcagc 3900
tgtgtgctct cggggcgtgg ccaaatccat cgatttcatc cccgttgaga cactcgacgt 3960
tgttacaagg tctcccactt tcagtgacaa cagcacgcca ccggctgtgc cccagaccta 4020
tcaggtcggg tacttgcatg ctccaactgg cagtggaaag agcaccaagg tccctgtcgc 4080
gtatgccgcc caggggtaca aagtactagt gcttaacccc tcggtagctg ccaccctggg 4140
gtttggggcg tacctatcca aggcacatgg catcaatccc aacattagga ctggagtcag 4200
gaccgtgatg accggggagg ccatcacgta ctccacatat ggcaaatttc tcgccgatgg 4260
gggctgcgct agcggcgcct atgacatcat catatgcgat gaatgccacg ctgtggatgc 4320
tacctccatt ctcggcatcg gaacggtcct tgatcaagca gagacagccg gggtcagact 4380
aactgtgctg gctacggcca caccccccgg gtcagtgaca accccccatc ccgatataga 4440
agaggtaggc ctcgggcggg agggtgagat ccccttctat gggagggcga ttcccctatc 4500
ctgcatcaag ggagggagac acctgatttt ctgccactca aagaaaaagt gtgacgagct 4560
cgcggcggcc cttcggggca tgggcttgaa tgccgtggca tactatagag ggttggacgt 4620
ctccataata ccagctcagg gagatgtggt ggtcgtcgcc accgacgccc tcatgacggg 4680
gtacactgga gactttgact ccgtgatcga ctgcaatgta gcggtcaccc aagctgtcga 4740
cttcagcctg gaccccacct tcactataac cacacagact gtcccacaag acgctgtctc 4800
acgcagtcag cgccgcgggc gcacaggtag aggaagacag ggcacttata ggtatgtttc 4860
cactggtgaa cgagcctcag gaatgtttga cagtgtagtg ctttgtgagt gctacgacgc 4920
aggggctgcg tggtacgatc tcacaccagc ggagaccacc gtcaggctta gagcgtattt 4980
caacacgccc ggcctacccg tgtgtcaaga ccatcttgaa ttttgggagg cagttttcac 5040
cggcctcaca cacatagacg cccacttcct ctcccaaaca aagcaagcgg gggagaactt 5100
cgcgtaccta gtagcctacc aagctacggt gtgcgccaga gccaaggccc ctcccccgtc 5160
ctgggacgcc atgtggaagt gcctggcccg actcaagcct acgcttgcgg gccccacacc 5220
tctcctgtac cgtttgggcc ctattaccaa tgaggtcacc ctcacacacc ctgggacgaa 5280
gtacatcgcc acatgcatgc aagctgacct tgaggtcatg accagcacgt gggtcctagc 5340
tggaggagtc ctggcagccg tcgccgcata ttgcctggcg actggatgcg tttccatcat 5400
cggccgcttg cacgtcaacc agcgagtcgt cgttgcgccg gataaggagg tcctgtatga 5460
ggcttttgat gagatggagg aatgcgcctc tagggcggct ctcatcgaag aggggcagcg 5520
gatagccgag atgttgaagt ccaagatcca aggcttgctg cagcaggcct ctaagcaggc 5580
ccaggacata caacccgcta tgcaggcttc atggcccaaa gtggaacaat tttgggccag 5640
acacatgtgg aacttcatta gcggcatcca atacctcgca ggattgtcaa cactgccagg 5700
gaaccccgcg gtggcttcca tgatggcatt cagtgccgcc ctcaccagtc cgttgtcgac 5760
cagtaccacc atccttctca acatcatggg aggctggtta gcgtcccaga tcgcaccacc 5820
cgcgggggcc accggctttg tcgtcagtgg cctggtgggg gctgccgtgg gcagcatagg 5880
cctgggtaag gtgctggtgg acatcctggc aggatatggt gcgggcattt cgggggccct 5940
cgtcgcattc aagatcatgt ctggcgagaa gccctctatg gaagatgtca tcaatctact 6000
gcctgggatc ctgtctccgg gagccctggt ggtgggggtc atctgcgcgg ccattctgcg 6060
ccgccacgtg ggaccggggg agggcgcggt ccaatggatg aacaggctta ttgcctttgc 6120
ttccagagga aaccacgtcg cccctactca ctacgtgacg gagtcggatg cgtcgcagcg 6180
tgtgacccaa ctacttggct ctcttactat aaccagccta ctcagaagac tccacaattg 6240
gataactgag gactgcccca tcccatgctc cggatcctgg ctccgcgacg tgtgggactg 6300
ggtttgcacc atcttgacag acttcaaaaa ttggctgacc tctaaattgt tccccaagct 6360
gcccggcctc cccttcatct cttgtcaaaa ggggtacaag ggtgtgtggg ccggcactgg 6420
catcatgacc acgcgctgcc cttgcggcgc caacatctct ggcaatgtcc gcctgggctc 6480
tatgaggatc acagggccta aaacctgcat gaacacctgg caggggacct ttcctatcaa 6540
ttgctacacg gagggccagt gcgcgccgaa accccccacg aactacaaga ccgccatctg 6600
gagggtggcg gcctcggagt acgcggaggt gacgcagcat gggtcgtact cctatgtaac 6660
aggactgacc actgacaatc tgaaaattcc ttgccaacta ccttctccag agtttttctc 6720
ctgggtggac ggtgtgcaga tccataggtt tgcacccaca ccaaagccgt ttttccggga 6780
tgaggtctcg ttctgcgttg ggcttaattc ctatgctgtc gggtcccagc ttccctgtga 6840
acctgagccc gacgcagacg tattgaggtc catgctaaca gatccgcccc acatcacggc 6900
ggagactgcg gcgcggcgct tggcacgggg atcacctcca tctgaggcga gctcctcagt 6960
gagccagcta tcagcaccgt cgctgcgggc cacctgcacc acccacagca acacctatga 7020
cgtggacatg gtcgatgcca acctgctcat ggagggcggt gtggctcaga cagagcctga 7080
gtccagggtg cccgttctgg actttctcga gccaatggcc gaggaagaga gcgaccttga 7140
gccctcaata ccatcggagt gcatgctccc caggagcggg tttccacggg ccttaccggc 7200
ttgggcacgg cctgactaca acccgccgct cgtggaatcg tggaggaggc cagattacca 7260
accgcccacc gttgctggtt gtgctctccc cccccccaag aaggccccga cgcctccccc 7320
aaggagacgc cggacagtgg gtctgagcga gagcaccata tcagaagccc tccagcaact 7380
ggccatcaag acctttggcc agcccccctc gagcggtgat gcaggctcgt ccacgggggc 7440
gggcgccgcc gaatccggcg gtccgacgtc ccctggtgag ccggccccct cagagacagg 7500
ttccgcctcc tctatgcccc ccctcgaggg ggagcctgga gatccggacc tggagtctga 7560
tcaggtagag cttcaacctc ccccccaggg ggggggggta gctcccggtt cgggctcggg 7620
gtcttggtct acttgctccg aggaggacga taccaccgtg tgctgctcca tgtcatactc 7680
ctggaccggg gctctaataa ctccctgtag ccccgaagag gaaaagttgc caatcaaccc 7740
tttgagtaac tcgctgttgc gataccataa caaggtgtac tgtacaacat caaagagcgc 7800
ctcacagagg gctaaaaagg taacttttga caggacgcaa gtgctcgacg cccattatga 7860
ctcagtctta aaggacatca agctagcggc ttccaaggtc agcgcaaggc tcctcacctt 7920
ggaggaggcg tgccagttga ctccacccca ttctgcaaga tccaagtatg gattcggggc 7980
caaggaggtc cgcagcttgt ccgggagggc cgttaaccac atcaagtccg tgtggaagga 8040
cctcctggaa gacccacaaa caccaattcc cacaaccatc atggccaaaa atgaggtgtt 8100
ctgcgtggac cccgccaagg ggggtaagaa accagctcgc ctcatcgttt accctgacct 8160
cggcgtccgg gtctgcgaga aaatggccct ctatgacatt acacaaaagc ttcctcaggc 8220
ggtaatggga gcttcctatg gcttccagta ctcccctgcc caacgggtgg agtatctctt 8280
gaaagcatgg gcggaaaaga aggaccccat gggtttttcg tatgataccc gatgcttcga 8340
ctcaaccgtc actgagagag acatcaggac cgaggagtcc atataccagg cctgctccct 8400
gcccgaggag gcccgcactg ccatacactc gctgactgag agactttacg taggagggcc 8460
catgttcaac agcaagggtc aaacctgcgg ttacagacgt tgccgcgcca gcggggtgct 8520
aaccactagc atgggtaaca ccatcacatg ctatgtgaaa gccctagcgg cctgcaaggc 8580
tgcggggata gttgcgccca caatgctggt atgcggcgat gacctagtag tcatctcaga 8640
aagccagggg actgaggagg acgagcggaa cctgagagcc ttcacggagg ccatgaccag 8700
gtactctgcc cctcctggtg atccccccag accggaatat gacctggagc taataacatc 8760
ctgttcctca aatgtgtctg tggcgttggg cccgcggggc cgccgcagat actacctgac 8820
cagagaccca accactccac tcgcccgggc tgcctgggaa acagttagac actcccctat 8880
caattcatgg ctgggaaaca tcatccagta tgctccaacc atatgggttc gcatggtcct 8940
aatgacacac ttcttctcca ttctcatggt ccaagacacc ctggaccaga acctcaactt 9000
tgagatgtat ggatcagtat actccgtgaa tcctttggac cttccagcca taattgagag 9060
gttacacggg cttgacgcct tttctatgca cacatactct caccacgaac tgacgcgggt 9120
ggcttcagcc ctcagaaaac ttggggcgcc acccctcagg gtgtggaaga gtcgggctcg 9180
cgcagtcagg gcgtccctca tctcccgtgg agggaaagcg gccgtttgcg gccgatatct 9240
cttcaattgg gcggtgaaga ccaagctcaa actcactcca ttgccggagg cgcgcctact 9300
ggacttatcc agttggttca ccgtcggcgc cggcgggggc gacatttttc acagcgtgtc 9360
gcgcgcccga ccccgctcat tactcttcgg cctactccta cttttcgtag gggtaggcct 9420
cttcctactc cccgctcggt agagcggcac acactaggta cactccatag ctaactgttc 9480
cttttttttt tttttttttt tttttttttt tttttttttt ttttcttttt tttttttttc 9540
cctctttctt cccttctcat cttattctac tttctttctt ggtggctcca tcttagccct 9600
agtcacggct agctgtgaaa ggtccgtgag ccgcatgact gcagagagtg ccgtaactgg 9660
tctctctgca gatcatgt 9678
<210>6
<211>10617
<212>DNA
<213>人工序列
<220>
<223>JFH1突变体
<220>
<223>JFH1-A/WT-Rluc
<400>6
acctgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60
cttcacgcag aaagcgccta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120
ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtgcaccgg aattgccggg 180
aagactgggt cctttcttgg atacacccac tctatgcccg gccatttggg cgtgcccccg 240
caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300
cgcttgcgag tgccccggga ggtctcgtag accgtgcacc atgagcacaa atcctaaacc 360
tcaaagaaaa accaaaagaa acaccaaccg tcgcccagaa gacgttaagt tcccgggcgg 420
cggccagatc gttggcggag tatacttgtt gccgcgcagg ggccccaggt tgggtgtgcg 480
cacgacaagg aaaacttcgg agcggtccca gccacgtggg agacgccagc ccatccccaa 540
agatcggcgc tccactggca cggcctgggg taaaccaggt cgcccctggc ccctatatgg 600
gaatgaggga ctcggctggg caggatggct cctgtccccc cgaggctctc gcccctcctg 660
gggccccact gacccccggc ataggtcgcg caacgtgggt aaagtcatcg acaccctaac 720
gtgtggcttt gccgacctca tggggtacat ccccgtcgta ggcgccccgc ttagtggcgc 780
cgccagagct gtcgcgcacg gcgtgagagt cctggaggac ggggttaatt atgcaacagg 840
gaacctacct ggtttcccct tttctatctt cttgctggcc ctgttgtcct gcatcaccgt 900
tccggtctct gctgcccagg tgaagaatac cagtagcagc tacatggtga ccaatgactg 960
ctccaatgac agcatcactt ggcagctcga ggctgcggtt ctccacgtcc ccgggtgcgt 1020
cccgtgcgag agagtgggga atacgtcacg gtgttgggtg ccagtctcgc caaacatggc 1080
tgtgcggcag cccggtgccc tcacgcaggg tctgcggacg cacatcgata tggttgtgat 1140
gtccgccacc ttctgctctg ctctctacgt gggggacctc tgtggcgggg tgatgctcgc 1200
ggcccaggtg ttcatcgtct cgccgcagca ccactggttt gtgcaggaat gcaattgctc 1260
catctaccct ggcaccatca ctggacaccg catggcatgg gacatgatga tgaactggtc 1320
gcccacgacc accatgatcc tggcgtacgt gatgcgcgtc cccgaggtca tcatagacat 1380
cgttagcggg gctcactggg gcgtcatgtt cggcttggcc tacttctcta tgcagggagc 1440
gtgggcgaag gtcattgtca tccttctgct ggccgctggg gtggacgcgg gcaccaccac 1500
cgttggaggc gccgttgcac gtcccaccaa cgtgattgcc ggcgtgttca gccatggccc 1560
tcagcagaac attcagctca ttaacaccag cggcagttgg cacatcaacc gtactgcctt 1620
gaattgcaat gactccttga acaccggctt tctcgcggcc ttgttctaca ccaaccgctt 1680
taactcgtca gggtgtccag ggcgcctgtc cgcctgccgc aacatcgagg ctttccggat 1740
agggtggggc accctacagt acgaggataa tgtcaccaat ccagagggta tgaggccgta 1800
ctgctggcac taccccccaa agccgtgtgg cgtagtcccc acgaggtctg tgtgtggccc 1860
agtgtactgt ttcaccccca gcccggtagt agtgggcacg accgacagac gtggagtgcc 1920
cacctacaca tggggagaga atgagacaga tgtcttccta ctgaacagca cccgaccgcc 1980
gcagggctca tggttcggct gcacgtggat gaactccact ggtttcacca agacttgtgg 2040
cgcgccacct tgccgcacca gagctgactt caacgccagc acggacttgt tgtgccctac 2100
ggattgtttt aggaagcatc ctgatgccac ttatattaag tgtggttctg ggccctggct 2160
cacaccaaag tgcctggtcc actaccctta cagactctgg cattacccct gcacagtcaa 2220
ttttaccatc ttcaagataa gaatgtatgt agggggggtt gagcacaggc tcacggccgc 2280
atgcaacttc actcgtgggg atcgctgcga cttggaggac agggacagga gtcagctgtc 2340
tcctctgttg cactctacca cggaatgggc catcctgccc tgcacctact cagacttacc 2400
cgctttgtca actggtcttc tccaccttca ccagaacatc gtggacgtac aatacatgta 2460
tggcctctca cctgctatca caaaatacgt cgttcgatgg gagtgggtgg tactcttatt 2520
cctgctctta gcggacgcca gagtctgcgc ctgcttgtgg atgctcatct tgttgggcca 2580
ggccgaagca gcattggaga agttggtcgt cttgcacgct gcgagtgcgg ctaactgcca 2640
tggcctccta tattttgcca tcttcttcgt ggcagcttgg cacatcaggg gtcgggtggt 2700
ccccttgacc acctattgcc ttactggcct atggcccttc tgcctactgc tcatggcact 2760
gccccggcag gcttatgcct atgacgcacc tgtgcacgga cagataggcg tgggtttgtt 2820
gatattgatc accctcttca cactcacccc ggggtataag accctcctcg gccagtgtct 2880
gtggtggttg tgctatctcc tgaccctggg ggaagccatg attcgggagt gggtaccacc 2940
catgcaggtg cgcggcggcc gcgatggcat cgcgtgggcc gtcactatat tctgcccggg 3000
cgtggtgttt gacattacca aatggctttt ggcgttgctt gggcctgctt acctcttaag 3060
ggccgctttg acccatgtgc cgtacttcgt cagagctcac gctctgataa gggtatgcgc 3120
tttggtgaag cggctcgcgg ggggtaggta tgttcaggtg gcgctgttgg cccttggcag 3180
gtggactggc acctacatct atgaccacct cacacctatg gcggactggg ccgctagcgg 3240
cctgcgcgac ttagcggtcg ccgtggaacc catcatcttc agtccgatgg agaagaaggt 3300
catcgtctgg ggagcggaga cggctgcatg tggggacatt ctacatggac ttcccgtgtc 3360
cgcccgactc ggccaggaga tcctcctcgg cccagctgat ggctacacct ccaaggggtg 3420
gaagctcctt gctcccatca ctgcttatgc ccagcaaaca cgaggcctcc tgggcgccat 3480
agtggtgagt atgacggggc gtgacaggac agaacaggcc ggggaagtcc aaatcctgtc 3540
cacagtctct cagtccttcc tcggaacaac catctcgggg gttttgtgga ctgtttacca 3600
cggagctggc aacaagactc tagccggctt acgaggtccg gtcacgcaga tgtactcgag 3660
tgctgagggg gacttggtag gctggcccag cccccctggg accaagtctt tggagccgtg 3720
caagtgtgga gccgtcgacc tatatctggt cacgcggaac gctgatgtca tcccggctcg 3780
gagacgcggg gataagcggg gagcattgct ctccccgaga cccatttcga ccttgaaggg 3840
gtcctcgggg gggccggtgc tctgccctag gggccacgtc gttgggctct tccgagcagc 3900
tgtgtgctct cggggcgtgg ccaaatccat cgatttcatc cccgttgaga cactcgacgt 3960
tgttacaagg tctcccactt tcagtgacaa cagcacgcca ccggctgtgc cccagaccta 4020
tcaggtcggg tacttgcatg ctccaactgg cagtggaaag agcaccaagg tccctgtcgc 4080
gtatgccgcc caggggtaca aagtactagt gcttaacccc tcggtagctg ccaccctggg 4140
gtttggggcg tacctatcca aggcacatgg catcaatccc aacattagga ctggagtcag 4200
gaccgtgatg accggggagg ccatcacgta ctccacatat ggcaaatttc tcgccgatgg 4260
gggctgcgct agcggcgcct atgacatcat catatgcgat gaatgccacg ctgtggatgc 4320
tacctccatt ctcggcatcg gaacggtcct tgatcaagca gagacagccg gggtcagact 4380
aactgtgctg gctacggcca caccccccgg gtcagtgaca accccccatc ccgatataga 4440
agaggtaggc ctcgggcggg agggtgagat ccccttctat gggagggcga ttcccctatc 4500
ctgcatcaag ggagggagac acctgatttt ctgccactca aagaaaaagt gtgacgagct 4560
cgcggcggcc cttcggggca tgggcttgaa tgccgtggca tactatagag ggttggacgt 4620
ctccataata ccagctcagg gagatgtggt ggtcgtcgcc accgacgccc tcatgacggg 4680
gtacactgga gactttgact ccgtgatcga ctgcaatgta gcggtcaccc aagctgtcga 4740
cttcagcctg gaccccacct tcactataac cacacagact gtcccacaag acgctgtctc 4800
acgcagtcag cgccgcgggc gcacaggtag aggaagacag ggcacttata ggtatgtttc 4860
cactggtgaa cgagcctcag gaatgtttga cagtgtagtg ctttgtgagt gctacgacgc 4920
aggggctgcg tggtacgatc tcacaccagc ggagaccacc gtcaggctta gagcgtattt 4980
caacacgccc ggcctacccg tgtgtcaaga ccatcttgaa ttttgggagg cagttttcac 5040
cggcctcaca cacatagacg cccacttcct ctcccaaaca aagcaagcgg gggagaactt 5100
cgcgtaccta gtagcctacc aagctacggt gtgcgccaga gccaaggccc ctcccccgtc 5160
ctgggacgcc atgtggaagt gcctggcccg actcaagcct acgcttgcgg gccccacacc 5220
tctcctgtac cgtttgggcc ctattaccaa tgaggtcacc ctcacacacc ctgggacgaa 5280
gtacatcgcc acatgcatgc aagctgacct tgaggtcatg accagcacgt gggtcctagc 5340
tggaggagtc ctggcagccg tcgccgcata ttgcctggcg actggatgcg tttccatcat 5400
cggccgcttg cacgtcaacc agcgagtcgt cgttgcgccg gataaggagg tcctgtatga 5460
ggcttttgat gagatggagg aatgcgcctc tagggcggct ctcatcgaag aggggcagcg 5520
gatagccgag atgttgaagt ccaagatcca aggcttgctg cagcaggcct ctaagcaggc 5580
ccaggacata caacccgcta tgcaggcttc atggcccaaa gtggaacaat tttgggccag 5640
acacatgtgg aacttcatta gcggcatcca atacctcgca ggattgtcaa cactgccagg 5700
gaaccccgcg gtggcttcca tgatggcatt cagtgccgcc ctcaccagtc cgttgtcgac 5760
cagtaccacc atccttctca acatcatggg aggctggtta gcgtcccaga tcgcaccacc 5820
cgcgggggcc accggctttg tcgtcagtgg cctggtgggg gctgccgtgg gcagcatagg 5880
cctgggtaag gtgctggtgg acatcctggc aggatatggt gcgggcattt cgggggccct 5940
cgtcgcattc aagatcatgt ctggcgagaa gccctctatg gaagatgtca tcaatctact 6000
gcctgggatc ctgtctccgg gagccctggt ggtgggggtc atctgcgcgg ccattctgcg 6060
ccgccacgtg ggaccggggg agggcgcggt ccaatggatg aacaggctta ttgcctttgc 6120
ttccagagga aaccacgtcg cccctactca ctacgtgacg gagtcggatg cgtcgcagcg 6180
tgtgacccaa ctacttggct ctcttactat aaccagccta ctcagaagac tccacaattg 6240
gataactgag gactgcccca tcccatgctc cggatcctgg ctccgcgacg tgtgggactg 6300
ggtttgcacc atcttgacag acttcaaaaa ttggctgacc tctaaattgt tccccaagct 6360
gcccggcctc cccttcatct cttgtcaaaa ggggtacaag ggtgtgtggg ccggcactgg 6420
catcatgacc acgcgctgcc cttgcggcgc caacatctct ggcaatgtcc gcctgggctc 6480
tatgaggatc acagggccta aaacctgcat gaacacctgg caggggacct ttcctatcaa 6540
ttgctacacg gagggccagt gcgcgccgaa accccccacg aactacaaga ccgccatctg 6600
gagggtggcg gcctcggagt acgcggaggt gacgcagcat gggtcgtact cctatgtaac 6660
aggactgacc actgacaatc tgaaaattcc ttgccaacta ccttctccag agtttttctc 6720
ctgggtggac ggtgtgcaga tccataggtt tgcacccaca ccaaagccgt ttttccggga 6780
tgaggtctcg ttctgcgttg ggcttaattc ctatgctgtc gggtcccagc ttccctgtga 6840
acctgagccc gacgcagacg tattgaggtc catgctaaca gatccgcccc acatcacggc 6900
ggagactgcg gcgcggcgct tggcacgggg atcacctcca tctgaggcga gctcctcagt 6960
gagccagcta tcagcaccgt cgctgcgggc cacctgcacc acccacagca acacctatga 7020
cgtggacatg gtcgatgcca acctgctcat ggagggcggt gtggctcaga cagagcctga 7080
gtccagggtg cccgttctgg actttctcga gccaatggcc gaggaagaga gcgaccttga 7140
gccctcaata ccatcggagt gcatgctccc caggagcggg tttccacggg ccttaccggc 7200
ttgggcacgg cctgactaca acccgccgct cgtggaatcg tggaggaggc cagattacca 7260
accgcccacc gttgctggtt gtgctctccc cccccccaag aaggccccga cgcctccccc 7320
aaggagacgc cggacagtgg gtctgagcga gagcaccata tcagaagccc tccagcaact 7380
ggccatcaag acctttggcc agcccccctc gagcggtgat gcaggctcgt ccacgggggc 7440
gggcgccgcc gaatccggcg gtccgacgtc ccctggtgag ccggccccct cagagacagg 7500
ttccgcctcc tctatgcccc ccctcgagat ggcttccaag gtgtacgacc ccgagcaacg 7560
caaacgcatg atcactgggc ctcagtggtg ggctcgctgc aagcaaatga acgtgctgga 7620
ctccttcatc aactactatg attccgagaa gcacgccgag aacgccgtga tttttctgca 7680
tggtaacgct gcctccagct acctgtggag gcacgtcgtg cctcacatcg agcccgtggc 7740
tagatgcatc atccctgatc tgatcggaat gggtaagtcc ggcaagagcg ggaatggctc 7800
atatcgcctc ctggatcact acaagtacct caccgcttgg ttcgagctgc tgaaccttcc 7860
aaagaaaatc atctttgtgg gccacgactg gggggcttgt ctggcctttc actactccta 7920
cgagcaccaa gacaagatca aggccatcgt ccatgctgag agtgtcgtgg acgtgatcga 7980
gtcctgggac gagtggcctg acatcgagga ggatatcgcc ctgatcaaga gcgaagaggg 8040
cgagaaaatg gtgcttgaga ataacttctt cgtcgagacc atgctcccaa gcaagatcat 8100
gcggaaactg gagcctgagg agttcgctgc ctacctggag ccattcaagg agaagggcga 8160
ggttagacgg cctaccctct cctggcctcg cgagatccct ctcgttaagg gaggcaagcc 8220
cgacgtcgtc cagattgtcc gcaactacaa cgcctacctt cgggccagcg acgatctgcc 8280
taagatgttc atcgagtccg accctgggtt cttttccaac gctattgtcg agggagctaa 8340
gaagttccct aacaccgagt tcgtgaaggt gaagggcctc cacttcagcc aggaggacgc 8400
tccagatgaa atgggtaagt acatcaagag cttcgtggag cgcgtgctga agaacgagca 8460
gctcgagggg gagcctggag atccggacct ggagtctgat caggtagagc ttcaacctcc 8520
cccccagggg gggggggtag ctcccggttc gggctcgggg tcttggtcta cttgctccga 8580
ggaggacgat accaccgtgt gctgctccat gtcatactcc tggaccgggg ctctaataac 8640
tccctgtagc cccgaagagg aaaagttgcc aatcaaccct ttgagtaact cgctgttgcg 8700
ataccataac aaggtgtact gtacaacatc aaagagcgcc tcacagaggg ctaaaaaggt 8760
aacttttgac aggacgcaag tgctcgacgc ccattatgac tcagtcttaa aggacatcaa 8820
gctagcggct tccaaggtca gcgcaaggct cctcaccttg gaggaggcgt gccagttgac 8880
tccaccccat tctgcaagat ccaagtatgg attcggggcc aaggaggtcc gcagcttgtc 8940
cgggagggcc gttaaccaca tcaagtccgt gtggaaggac ctcctggaag acccacaaac 9000
accaattccc acaaccatca tggccaaaaa tgaggtgttc tgcgtggacc ccgccaaggg 9060
gggtaagaaa ccagctcgcc tcatcgttta ccctgacctc ggcgtccggg tctgcgagaa 9120
aatggccctc tatgacatta cacaaaagct tcctcaggcg gtaatgggag cttcctatgg 9180
cttccagtac tcccctgccc aacgggtgga gtatctcttg aaagcatggg cggaaaagaa 9240
ggaccccatg ggtttttcgt atgatacccg atgcttcgac tcaaccgtca ctgagagaga 9300
catcaggacc gaggagtcca tataccaggc ctgctccctg cccgaggagg cccgcactgc 9360
catacactcg ctgactgaga gactttacgt aggagggccc atgttcaaca gcaagggtca 9420
aacctgcggt tacagacgtt gccgcgccag cggggtgcta accactagca tgggtaacac 9480
catcacatgc tatgtgaaag ccctagcggc ctgcaaggct gcggggatag ttgcgcccac 9540
aatgctggta tgcggcgatg acctagtagt catctcagaa agccagggga ctgaggagga 9600
cgagcggaac ctgagagcct tcacggaggc catgaccagg tactctgccc ctcctggtga 9660
tccccccaga ccggaatatg acctggagct aataacatcc tgttcctcaa atgtgtctgt 9720
ggcgttgggc ccgcggggcc gccgcagata ctacctgacc agagacccaa ccactccact 9780
cgcccgggct gcctgggaaa cagttagaca ctcccctatc aattcatggc tgggaaacat 9840
catccagtat gctccaacca tatgggttcg catggtccta atgacacact tcttctccat 9900
tctcatggtc caagacaccc tggaccagaa cctcaacttt gagatgtatg gatcagtata 9960
ctccgtgaat cctttggacc ttccagccat aattgagagg ttacacgggc ttgacgcctt 10020
ttctatgcac acatactctc accacgaact gacgcgggtg gcttcagccc tcagaaaact 10080
tggggcgcca cccctcaggg tgtggaagag tcgggctcgc gcagtcaggg cgtccctcat 10140
ctcccgtgga gggaaagcgg ccgtttgcgg ccgatatctc ttcaattggg cggtgaagac 10200
caagctcaaa ctcactccat tgccggaggc gcgcctactg gacttatcca gttggttcac 10260
cgtcggcgcc ggcgggggcg acatttttca cagcgtgtcg cgcgcccgac cccgctcatt 10320
actcttcggc ctactcctac ttttcgtagg ggtaggcctc ttcctactcc ccgctcggta 10380
gagcggcaca cactaggtac actccatagc taactgttcc tttttttttt tttttttttt 10440
tttttttttt tttttttttt tttctttttt ttttttttcc ctctttcttc ccttctcatc 10500
ttattctact ttctttcttg gtggctccat cttagcccta gtcacggcta gctgtgaaag 10560
gtccgtgagc cgcatgactg cagagagtgc cgtaactggt ctctctgcag atcatgt 10617
<210>7
<211>10617
<212>DNA
<213>人工序列
<220>
<223>JFH1突变体
<220>
<223>JFH1-B/WT-RLuc
<400>7
acctgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60
cttcacgcag aaagcgccta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120
ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 180
aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg cgtgcccccg 240
caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300
cgcttgcgag tgccccggga ggtctcgtag accgtgcacc atgagcacaa atcctaaacc 360
tcaaagaaaa accaaaagaa acaccaaccg tcgcccagaa gacgttaagt tcccgggcgg 420
cggccagatc gctggcggag tatacttgtt gccgcgcagg ggccccaggt tgggtgtgcg 480
cacgacaagg aaaacttcgg agcggtccca gccacgtggg agacgccagc ccatccccaa 540
agatcggcgc tccactggca cggcctgggg aaaaccaggt cgcccctggc ccctatatgg 600
gaatgaggga ctcggctggg caggatggct cctgtccccc cgaggctctc gcccctcctg 660
gggccccact gacccccggc ataggtcgcg caacgtgggt aaagtcatcg acaccctaac 720
gtgtggcttt gccgacctca tggggtacat ccccgtcgta ggcgccccgc ttagtggcgc 780
cgccagagct gtcgcgcacg gcgtgagagt cctggaggac ggggttaatt atgcaacagg 840
gaacctaccc ggtttcccct tttctatctt cttgctggcc ctgttgtcct gcatcaccgt 900
tccggtctct gctgcccagg tgaagaatac cagtagcagc tacatggtga ccaatgactg 960
ctccaatgac agcatcactt ggcagctcga ggctgcagtt ctccacgtcc ccgggtgcgt 1020
cccgtgcgag agagtgggga atacgtcacg gtgttgggtg ccagtctcgc caaacatggc 1080
tgtgcggcag cccggtgccc tcacgcaggg tctgcggacg cacatcgata tggttgtgat 1140
gtccgccacc ttctgctctg ctctctacgt gggggacctc tgtggcgggg tgatgctcgc 1200
ggcccaggtg ttcatcgtct cgccgcagta ccactggttt gtgcaggaat gcaattgctc 1260
catctaccct ggcaccatca ctggacaccg catggcatgg gacatgatga tgaactggtc 1320
gcccacggcc accatgatcc tggcgtacgt gatgcgcgtc cccgaggtca tcatagacat 1380
cgttagcggg gctcactggg gcgtcatgtt cggcttggcc tacttctcta tgcagggagc 1440
gtgggcgaag gtcattgtca tccttctgct ggccgctggg gtggacgcgg gcaccaccac 1500
cgttgggggc gctgttgcac gttccaccaa cgtgattgcc ggcgtgttca gccatggccc 1560
tcagcagaac attcagctca ttaacaccaa cggcagctgg cacatcaacc gtactgcctt 1620
gaattgcaat gactccttga acaccggctt tctcgcggcc ttgttctaca ccaaccgctt 1680
taactcgtca aggtgtccag ggcgcctgtc cgcctgccgc aacatcgagg ctttccggat 1740
agggtggggc accctacagt acgaggataa tgtcaccaat ccagaggata tgaggccgta 1800
ctgctggcac taccccccaa agccgtgtgg cgtagtcccc gcgaggtctg tgtgtggccc 1860
agtgtactgt ttcactccca gcccggtagt agtgggcacg accgacagac gtggagtgcc 1920
cacctacaca tggggagaga atgagacaga tgtcttccta ctgaacagca cccgaccgcc 1980
acagggctca tggttcggct gcacgtggat gaactccact ggtttcacca agacttgtgg 2040
cgcgccacct tgccgcacca gagctgactt caacgccagc acggacttgt tgtgccctac 2100
ggattgtttt aggaagcatc ctgatgccac ttatattaag tgtggttctg ggccctggct 2160
cacaccaaag tgcctggtcc actaccctta cagactctgg cattacccct gcacagtcaa 2220
ttttaccatc ttcaagataa gaatgtatgt agggggggtt gagcacaggc tcacggccgc 2280
atgcaacttc actcgtgggg atcgctgcga cttggaggac agggacagga gtcagctgtc 2340
tcctctgttg cactctacca cggaatgggc catcctgccc tgcacctact cagacttacc 2400
cgctttgtca actggtcttc tccaccttca ccagaacatc gtggacgtac aatacatgta 2460
tggcctctca cctgctatca caaaatacgt cgttcgatgg gagtgggtgg tactcttatt 2520
cctgctctta gcggacgcca gagtctgcgc ctgcctgtgg atgctcatct tgttgggcca 2580
ggccgaagca gcattggaga agttggccgt cttgcacgct gcgagtgcgg ctaactgcca 2640
tggcctccta tattttgcca tcttcttcgt ggcagcttgg cacatcaggg gtcgggcggt 2700
ccccttgacc acctattgcc tcactggcct atggcccttc tgcctactgc tcatggcact 2760
gccccggcag gcttatgcct atgacgcacc tgtgcacgga cagataggcg tgggtttgtt 2820
gatattgatc accctcttca cactcacccc ggggtataag accctcctcg gccagtgtct 2880
gtggtggttg tgctatctcc tgaccctggg ggaagccatg attcgggagt gggtaccacc 2940
catgcaggtg cgcggcggcc gcgatggcat cgcgtgggcc gtcactatat tctgcccggg 3000
tgtggtgttt gacattacca aatggctttt ggcgttgctt gggcctgctt acctcttaag 3060
ggccgctttg acacatgtgc cgtacttcgt cagagctcac gctctgataa gggtatgcgc 3120
tttggtgaag cagctcgcgg ggggtaggta tgttcaggtg gcgctattgg cccttggcag 3180
gtggactggc acctacatct atgaccacct cacacctatg tcggactggg ccgctagcgg 3240
cctgcgcgac ttagcggtcg ccgtggaacc catcatcttc agtccgatgg agaagaaggt 3300
catcgtctgg ggagcggaga cggctgcatg tggggacatt ctacatggac ttcccgtgtc 3360
cgcccgactc ggccaggaga tcctcctcgg cccagctgat ggctacacct ccaaggggtg 3420
gaagctcctt gctcccatca ctgcttatgc ccagcaaaca cgaggcctcc tgggcgccat 3480
agtggtgagt atgacggggc gtgacaggac agaacaggcc ggggaagtcc aaatcctgtc 3540
cacagtctct cagtccttcc tcggaacaac catctcgggg gttttgtgga ctgtttacca 3600
cggagctggc aacaagactc tagccggctt acggggtccg gtcacgcaga tgtactcgag 3660
tgctgagggg gacttggtag gctggcccag cccccctggg accaagtctt tggagccgtg 3720
caagtgtgga gccgtcgacc tatatctggt cacgcggaac gctgatgtta tcccggctcg 3780
gagacgcggg gacaagcggg gagcattgct ctccccgaga cccatttcga ccttgaaggg 3840
gtcctcgggg gggccggtgc tctgccctag gggccacgtc gttgggctct tccgagcagc 3900
tgtgtgctct cggggcgtgg ccaaatccat cgatttcatc cccgttgaga cactcgacgt 3960
tgttacaagg tctcccacct tcagtgacaa cagcacgcca ccggctgtgc cccagaccta 4020
tcaggtcggg tacttgcatg ctccaactgg cagtggaaag agcaccaagg tccctgtcgc 4080
gtatgccgcc caggggtaca aagtactagt gcttaacccc tcggtagctg ccaccctggg 4140
gtttggggcg tacctatcca aggcacatgg catcaatccc aacattagga ctggagtcag 4200
gaccgtgatg accggggagg ccatcacgta ctccacatat ggcaaatttc tcgccgatgg 4260
gggctgcgct agcggcgcct atgacatcat catatgcgat gaatgccacg ctgtggatgc 4320
tacctccatt ctcggcatcg gaacggtcct tgatcaagca gagacagccg gggtcagact 4380
aactgtgctg gctacggcca caccccccgg gtcagtgaca accccccatc ccgatataga 4440
agaggtaggc ctcgggcggg agggtgagat ccccttctat gggagggcga ttcccctatc 4500
ctgcatcaag ggagggagac acctgatttt ctgccactca aagaaaaagt gtgacgagct 4560
cgcggcggcc cttcggggca tgggcttgaa tgccgtggca tactatagag ggttggacgt 4620
ctccataata ccagctcagg gagatgtggt ggtcgtcgcc accgacgccc tcatgacggg 4680
gtacactgga gactttgact ccgtgatcga ctgcaatgta gcggtcaccc aagctgtcga 4740
cttcagcctg gaccccacct tcactataac cacacagact gtcccacaag acgctgtctc 4800
acgcagtcag cgccgcgggc gcacaggtag aggaagacag ggcacttata ggtatgtttc 4860
cactggtgaa cgagcctcag gaatgtttga cagtgtagtg ctttgtgagt gctacgacgc 4920
aggggctgcg tggtacgatc tcacaccagc ggagaccacc gtcaggctta gagcgtattt 4980
caacacgccc ggcctacccg tgtgtcaaga ccatcttgaa ttttgggagg cagttttcac 5040
cggcctcaca cacatagacg cccacttcct ctcccaaaca aagcaagcgg gggagaactt 5100
cgcgtaccta gtagcctacc aagctacggt gtgcgccaga gccaaggccc ctcccccgtc 5160
ctgggacgcc atgtggaagt gcctggcccg actcaagcct acgcttgcgg gccccacacc 5220
tctcctgtac cgtttgggcc ctattaccaa tgaggtcacc ctcacacacc ctgggacgaa 5280
gtacatcgcc acatgcatgc aagctgacct tgaggtcatg accagcacgt gggtcctagc 5340
tggaggagtc ctggcagccg tcgccgcata ttgcctggcg actggatgcg tttccatcat 5400
cggccgcttg cacgtcaacc agcgagtcgt cgttgcgccg gataaggagg tcctgtatga 5460
ggcttttgat gagatggagg aatgcgcctc tagggcggct ctcatcgaag aggggcagcg 5520
gatagccgag atgttgaagt ccaagatcca aggcttgctg cagcaggcct ctaagcaggc 5580
ccaggacata caacccgcta tgcaggcttc atggcccaaa gtggaacaat tttgggccag 5640
acacatgtgg aacttcatta gcggcatcca atacctcgca ggattgtcaa cactgccagg 5700
gaaccccgcg gtggcttcca tgatggcatt cagtgccgcc ctcaccagtc cgttgtcgac 5760
cagtaccacc atccttctca acatcatggg aggctggtta gcgtcccaga tcgcaccacc 5820
cgcgggggcc accggctttg tcgtcagtgg cctggtgggg gctgccgtgg gcagcatagg 5880
cctgggtaag gtgctggtgg acatcctggc aggatatggt gcgggcattt cgggggccct 5940
cgtcgcattc aagatcatgt ctggcgagaa gccctctatg gaagatgtca tcaatctact 6000
gcctgggatc ctgtctccgg gagccctggt ggtgggggtc atctgcgcgg ccattctgcg 6060
ccgccacgtg ggaccggggg agggcgcggt ccaatggatg aacaggctta ttgcctttgc 6120
ttccagagga aaccacgtcg cccctactca ctacgtgacg gagtcggatg cgtcgcagcg 6180
tgtgacccaa ctacttggct ctcttactat aaccagccta ctcagaagac tccacaattg 6240
gataactgag gactgcccca tcccatgctc cggatcctgg ctccgcgacg tgtgggactg 6300
ggtttgcacc atcttgacag acttcaaaaa ttggctgacc tctaaattgt tccccaagct 6360
gcccggcctc cccttcatct cttgtcaaaa ggggtacaag ggtgtgtggg ccggcactgg 6420
catcatgacc acgcgctgcc cttgcggcgc caacatctct ggcaatgtcc gcctgggctc 6480
tatgaggatc acagggccta aaacctgcat gaacacctgg caggggacct ttcctatcaa 6540
ttgctacacg gagggccagt gcgcgccgaa accccccacg aactacaaga ccgccatctg 6600
gagggtggcg gcctcggagt acgcggaggt gacgcagcat gggtcgtact cctatgtaac 6660
aggactgacc actgacaatc tgaaaattcc ttgccaacta ccttctccag agtttttctc 6720
ctgggtggac ggtgtgcaga tccataggtt tgcacccaca ccaaagccgt ttttccggga 6780
tgaggtctcg ttctgcgttg ggcttaattc ctatgctgtc gggtcccagc ttccctgtga 6840
acctgagccc gacgcagacg tattgaggtc catgctaaca gatccgcccc acatcacggc 6900
ggagactgcg gcgcggcgct tggcacgggg atcacctcca tctgaggcga gctcctcagt 6960
gagccagcta tcagcaccgt cgctgcgggc cacctgcacc acccacagca acacctatga 7020
cgtggacatg gtcgatgcca acctgctcat ggagggcggt gtggctcaga cagagcctga 7080
gtccagggtg cccgttctgg actttctcga gccaatggcc gaggaagaga gcgaccttga 7140
gccctcaata ccatcggagt gcatgctccc caggagcggg tttccacggg ccttaccggc 7200
ttgggcacgg cctgactaca acccgccgct cgtggaatcg tggaggaggc cagattacca 7260
accgcccacc gttgctggtt gtgctctccc cccccccaag aaggccccga cgcctccccc 7320
aaggagacgc cggacagtgg gtctgagcga gagcaccata tcagaagccc tccagcaact 7380
ggccatcaag acctttggcc agcccccctc gagcggtgat gcaggctcgt ccacgggggc 7440
gggcgccgcc gaatccggcg gtccgacgtc ccctggtgag ccggccccct cagagacagg 7500
ttccgcctcc tctatgcccc ccctcgagat ggcttccaag gtgtacgacc ccgagcaacg 7560
caaacgcatg atcactgggc ctcagtggtg ggctcgctgc aagcaaatga acgtgctgga 7620
ctccttcatc aactactatg attccgagaa gcacgccgag aacgccgtga tttttctgca 7680
tggtaacgct gcctccagct acctgtggag gcacgtcgtg cctcacatcg agcccgtggc 7740
tagatgcatc atccctgatc tgatcggaat gggtaagtcc ggcaagagcg ggaatggctc 7800
atatcgcctc ctggatcact acaagtacct caccgcttgg ttcgagctgc tgaaccttcc 7860
aaagaaaatc atctttgtgg gccacgactg gggggcttgt ctggcctttc actactccta 7920
cgagcaccaa gacaagatca aggccatcgt ccatgctgag agtgtcgtgg acgtgatcga 7980
gtcctgggac gagtggcctg acatcgagga ggatatcgcc ctgatcaaga gcgaagaggg 8040
cgagaaaatg gtgcttgaga ataacttctt cgtcgagacc atgctcccaa gcaagatcat 8100
gcggaaactg gagcctgagg agttcgctgc ctacctggag ccattcaagg agaagggcga 8160
ggttagacgg cctaccctct cctggcctcg cgagatccct ctcgttaagg gaggcaagcc 8220
cgacgtcgtc cagattgtcc gcaactacaa cgcctacctt cgggccagcg acgatctgcc 8280
taagatgttc atcgagtccg accctgggtt cttttccaac gctattgtcg agggagctaa 8340
gaagttccct aacaccgagt tcgtgaaggt gaagggcctc cacttcagcc aggaggacgc 8400
tccagatgaa atgggtaagt acatcaagag cttcgtggag cgcgtgctga agaacgagca 8460
gctcgagggg gagcctggag atccggacct ggagtctgat caggtagagc ttcaacctcc 8520
cccccagggg gggggggtag ctcccggttc gggctcgggg tcttggtcta cttgctccga 8580
ggaggacgat accaccgtgt gctgctccat gtcatactcc tggaccgggg ctctaataac 8640
tccctgtagc cccgaagagg aaaagttgcc aatcaaccct ttgagtaact cgctgttgcg 8700
ataccataac aaggtgtact gtacaacatc aaagagcgcc tcacagaggg ctaaaaaggt 8760
aacttttgac aggacgcaag tgctcgacgc ccattatgac tcagtcttaa aggacatcaa 8820
gctagcggct tccaaggtca gcgcaaggct cctcaccttg gaggaggcgt gccagttgac 8880
tccaccccat tctgcaagat ccaagtatgg attcggggcc aaggaggtcc gcagcttgtc 8940
cgggagggcc gttaaccaca tcaagtccgt gtggaaggac ctcctggaag acccacaaac 9000
accaattccc acaaccatca tggccaaaaa tgaggtgttc tgcgtggacc ccgccaaggg 9060
gggtaagaaa ccagctcgcc tcatcgttta ccctgacctc ggcgtccggg tctgcgagaa 9120
aatggccctc tatgacatta cacaaaagct tcctcaggcg gtaatgggag cttcctatgg 9180
cttccagtac tcccctgccc aacgggtgga gtatctcttg aaagcatggg cggaaaagaa 9240
ggaccccatg ggtttttcgt atgatacccg atgcttcgac tcaaccgtca ctgagagaga 9300
catcaggacc gaggagtcca tataccaggc ctgctccctg cccgaggagg cccgcactgc 9360
catacactcg ctgactgaga gactttacgt aggagggccc atgttcaaca gcaagggtca 9420
aacctgcggt tacagacgtt gccgcgccag cggggtgcta accactagca tgggtaacac 9480
catcacatgc tatgtgaaag ccctagcggc ctgcaaggct gcggggatag ttgcgcccac 9540
aatgctggta tgcggcgatg acctagtagt catctcagaa agccagggga ctgaggagga 9600
cgagcggaac ctgagagcct tcacggaggc catgaccagg tactctgccc ctcctggtga 9660
tccccccaga ccggaatatg acctggagct aataacatcc tgttcctcaa atgtgtctgt 9720
ggcgttgggc ccgcggggcc gccgcagata ctacctgacc agagacccaa ccactccact 9780
cgcccgggct gcctgggaaa cagttagaca ctcccctatc aattcatggc tgggaaacat 9840
catccagtat gctccaacca tatgggttcg catggtccta atgacacact tcttctccat 9900
tctcatggtc caagacaccc tggaccagaa cctcaacttt gagatgtatg gatcagtata 9960
ctccgtgaat cctttggacc ttccagccat aattgagagg ttacacgggc ttgacgcctt 10020
ttctatgcac acatactctc accacgaact gacgcgggtg gcttcagccc tcagaaaact 10080
tggggcgcca cccctcaggg tgtggaagag tcgggctcgc gcagtcaggg cgtccctcat 10140
ctcccgtgga gggaaagcgg ccgtttgcgg ccgatatctc ttcaattggg cggtgaagac 10200
caagctcaaa ctcactccat tgccggaggc gcgcctactg gacttatcca gttggttcac 10260
cgtcggcgcc ggcgggggcg acatttttca cagcgtgtcg cgcgcccgac cccgctcatt 10320
actcttcggc ctactcctac ttttcgtagg ggtaggcctc ttcctactcc ccgctcggta 10380
gagcggcaca cactaggtac actccatagc taactgttcc tttttttttt tttttttttt 10440
tttttttttt tttttttttt tttctttttt ttttttttcc ctctttcttc ccttctcatc 10500
ttattctact ttctttcttg gtggctccat cttagcccta gtcacggcta gctgtgaaag 10560
gtccgtgagc cgcatgactg cagagagtgc cgtaactggt ctctctgcag atcatgt 10617
<210>8
<211>10617
<212>DNA
<213>人工序列
<220>
<223>JFH1突变体
<220>
<223>JFH1wt-Rluc
<400>8
acctgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60
cttcacgcag aaagcgccta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 120
ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 180
aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg cgtgcccccg 240
caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 300
cgcttgcgag tgccccggga ggtctcgtag accgtgcacc atgagcacaa atcctaaacc 360
tcaaagaaaa accaaaagaa acaccaaccg tcgcccagaa gacgttaagt tcccgggcgg 420
cggccagatc gttggcggag tatacttgtt gccgcgcagg ggccccaggt tgggtgtgcg 480
cacgacaagg aaaacttcgg agcggtccca gccacgtggg agacgccagc ccatccccaa 540
agatcggcgc tccactggca aggcctgggg aaaaccaggt cgcccctggc ccctatatgg 600
gaatgaggga ctcggctggg caggatggct cctgtccccc cgaggctctc gcccctcctg 660
gggccccact gacccccggc ataggtcgcg caacgtgggt aaagtcatcg acaccctaac 720
gtgtggcttt gccgacctca tggggtacat ccccgtcgta ggcgccccgc ttagtggcgc 780
cgccagagct gtcgcgcacg gcgtgagagt cctggaggac ggggttaatt atgcaacagg 840
gaacctaccc ggtttcccct tttctatctt cttgctggcc ctgttgtcct gcatcaccgt 900
tccggtctct gctgcccagg tgaagaatac cagtagcagc tacatggtga ccaatgactg 960
ctccaatgac agcatcactt ggcagctcga ggctgcggtt ctccacgtcc ccgggtgcgt 1020
cccgtgcgag agagtgggga atacgtcacg gtgttgggtg ccagtctcgc caaacatggc 1080
tgtgcggcag cccggtgccc tcacgcaggg tctgcggacg cacatcgata tggttgtgat 1140
gtccgccacc ttctgctctg ctctctacgt gggggacctc tgtggcgggg tgatgctcgc 1200
ggcccaggtg ttcatcgtct cgccgcagta ccactggttt gtgcaagaat gcaattgctc 1260
catctaccct ggcaccatca ctggacaccg catggcatgg gacatgatga tgaactggtc 1320
gcccacggcc accatgatcc tggcgtacgt gatgcgcgtc cccgaggtca tcatagacat 1380
cgttagcggg gctcactggg gcgtcatgtt cggcttggcc tacttctcta tgcagggagc 1440
gtgggcgaag gtcattgtca tccttctgct ggccgctggg gtggacgcgg gcaccaccac 1500
cgttggaggc gctgttgcac gttccaccaa cgtgattgcc ggcgtgttca gccatggccc 1560
tcagcagaac attcagctca ttaacaccaa cggcagttgg cacatcaacc gtactgcctt 1620
gaattgcaat gactccttga acaccggctt tctcgcggcc ttgttctaca ccaaccgctt 1680
taactcgtca gggtgtccag ggcgcctgtc cgcctgccgc aacatcgagg ctttccggat 1740
agggtggggc accctacagt acgaggataa tgtcaccaat ccagaggata tgaggccgta 1800
ctgctggcac taccccccaa agccgtgtgg cgtagtcccc gcgaggtctg tgtgtggccc 1860
agtgtactgt ttcaccccca gcccggtagt agtgggcacg accgacagac gtggagtgcc 1920
cacctacaca tggggagaga atgagacaga tgtcttccta ctgaacagca cccgaccgcc 1980
gcagggctca tggttcggct gcacgtggat gaactccact ggtttcacca agacttgtgg 2040
cgcgccacct tgccgcacca gagctgactt caacgccagc acggacttgt tgtgccctac 2100
ggattgtttt aggaagcatc ctgatgccac ttatattaag tgtggttctg ggccctggct 2160
cacaccaaag tgcctggtcc actaccctta cagactctgg cattacccct gcacagtcaa 2220
ttttaccatc ttcaagataa gaatgtatgt agggggggtt gagcacaggc tcacggccgc 2280
atgcaacttc actcgtgggg atcgctgcga cttggaggac agggacagga gtcagctgtc 2340
tcctctgttg cactctacca cggaatgggc catcctgccc tgcacctact cagacttacc 2400
cgctttgtca actggtcttc tccaccttca ccagaacatc gtggacgtac aatacatgta 2460
tggcctctca cctgctatca caaaatacgt cgttcgatgg gagtgggtgg tactcttatt 2520
cctgctctta gcggacgcca gagtctgcgc ctgcttgtgg atgctcatct tgttgggcca 2580
ggccgaagca gcattggaga agttggtcgt cttgcacgct gcgagtgcgg ctaactgcca 2640
tggcctccta tattttgcca tcttcttcgt ggcagcttgg cacatcaggg gtcgggtggt 2700
ccccttgacc acctattgcc tcactggcct atggcccttc tgcctactgc tcatggcact 2760
gccccggcag gcttatgcct atgacgcacc tgtgcacgga cagataggcg tgggtttgtt 2820
gatattgatc accctcttca cactcacccc ggggtataag accctcctcg gccagtgtct 2880
gtggtggttg tgctatctcc tgaccctggg ggaagccatg attcaggagt gggtaccacc 2940
catgcaggtg cgcggcggcc gcgatggcat cgcgtgggcc gtcactatat tctgcccggg 3000
tgtggtgttt gacattacca aatggctttt ggcgttgctt gggcctgctt acctcttaag 3060
ggccgctttg acacatgtgc cgtacttcgt cagagctcac gctctgataa gggtatgcgc 3120
tttggtgaag cagctcgcgg ggggtaggta tgttcaggtg gcgctattgg cccttggcag 3180
gtggactggc acctacatct atgaccacct cacacctatg tcggactggg ccgctagcgg 3240
cctgcgcgac ttagcggtcg ccgtggaacc catcatcttc agtccgatgg agaagaaggt 3300
catcgtctgg ggagcggaga cggctgcatg tggggacatt ctacatggac ttcccgtgtc 3360
cgcccgactc ggccaggaga tcctcctcgg cccagctgat ggctacacct ccaaggggtg 3420
gaagctcctt gctcccatca ctgcttatgc ccagcaaaca cgaggcctcc tgggcgccat 3480
agtggtgagt atgacggggc gtgacaggac agaacaggcc ggggaagtcc aaatcctgtc 3540
cacagtctct cagtccttcc tcggaacaac catctcgggg gttttgtgga ctgtttacca 3600
cggagctggc aacaagactc tagccggctt acggggtccg gtcacgcaga tgtactcgag 3660
tgctgagggg gacttggtag gctggcccag cccccctggg accaagtctt tggagccgtg 3720
caagtgtgga gccgtcgacc tatatctggt cacgcggaac gctgatgtca tcccggctcg 3780
gagacgcggg gacaagcggg gagcattgct ctccccgaga cccatttcga ccttgaaggg 3840
gtcctcgggg gggccggtgc tctgccctag gggccacgtc gttgggctct tccgagcagc 3900
tgtgtgctct cggggcgtgg ccaaatccat cgatttcatc cccgttgaga cactcgacgt 3960
tgttacaagg tctcccactt tcagtgacaa cagcacgcca ccggctgtgc cccagaccta 4020
tcaggtcggg tacttgcatg ctccaactgg cagtggaaag agcaccaagg tccctgtcgc 4080
gtatgccgcc caggggtaca aagtactagt gcttaacccc tcggtagctg ccaccctggg 4140
gtttggggcg tacctatcca aggcacatgg catcaatccc aacattagga ctggagtcag 4200
gaccgtgatg accggggagg ccatcacgta ctccacatat ggcaaatttc tcgccgatgg 4260
gggctgcgct agcggcgcct atgacatcat catatgcgat gaatgccacg ctgtggatgc 4320
tacctccatt ctcggcatcg gaacggtcct tgatcaagca gagacagccg gggtcagact 4380
aactgtgctg gctacggcca caccccccgg gtcagtgaca accccccatc ccgatataga 4440
agaggtaggc ctcgggcggg agggtgagat ccccttctat gggagggcga ttcccctatc 4500
ctgcatcaag ggagggagac acctgatttt ctgccactca aagaaaaagt gtgacgagct 4560
cgcggcggcc cttcggggca tgggcttgaa tgccgtggca tactatagag ggttggacgt 4620
ctccataata ccagctcagg gagatgtggt ggtcgtcgcc accgacgccc tcatgacggg 4680
gtacactgga gactttgact ccgtgatcga ctgcaatgta gcggtcaccc aagctgtcga 4740
cttcagcctg gaccccacct tcactataac cacacagact gtcccacaag acgctgtctc 4800
acgcagtcag cgccgcgggc gcacaggtag aggaagacag ggcacttata ggtatgtttc 4860
cactggtgaa cgagcctcag gaatgtttga cagtgtagtg ctttgtgagt gctacgacgc 4920
aggggctgcg tggtacgatc tcacaccagc ggagaccacc gtcaggctta gagcgtattt 4980
caacacgccc ggcctacccg tgtgtcaaga ccatcttgaa ttttgggagg cagttttcac 5040
cggcctcaca cacatagacg cccacttcct ctcccaaaca aagcaagcgg gggagaactt 5100
cgcgtaccta gtagcctacc aagctacggt gtgcgccaga gccaaggccc ctcccccgtc 5160
ctgggacgcc atgtggaagt gcctggcccg actcaagcct acgcttgcgg gccccacacc 5220
tctcctgtac cgtttgggcc ctattaccaa tgaggtcacc ctcacacacc ctgggacgaa 5280
gtacatcgcc acatgcatgc aagctgacct tgaggtcatg accagcacgt gggtcctagc 5340
tggaggagtc ctggcagccg tcgccgcata ttgcctggcg actggatgcg tttccatcat 5400
cggccgcttg cacgtcaacc agcgagtcgt cgttgcgccg gataaggagg tcctgtatga 5460
ggcttttgat gagatggagg aatgcgcctc tagggcggct ctcatcgaag aggggcagcg 5520
gatagccgag atgttgaagt ccaagatcca aggcttgctg cagcaggcct ctaagcaggc 5580
ccaggacata caacccgcta tgcaggcttc atggcccaaa gtggaacaat tttgggccag 5640
acacatgtgg aacttcatta gcggcatcca atacctcgca ggattgtcaa cactgccagg 5700
gaaccccgcg gtggcttcca tgatggcatt cagtgccgcc ctcaccagtc cgttgtcgac 5760
cagtaccacc atccttctca acatcatggg aggctggtta gcgtcccaga tcgcaccacc 5820
cgcgggggcc accggctttg tcgtcagtgg cctggtgggg gctgccgtgg gcagcatagg 5880
cctgggtaag gtgctggtgg acatcctggc aggatatggt gcgggcattt cgggggccct 5940
cgtcgcattc aagatcatgt ctggcgagaa gccctctatg gaagatgtca tcaatctact 6000
gcctgggatc ctgtctccgg gagccctggt ggtgggggtc atctgcgcgg ccattctgcg 6060
ccgccacgtg ggaccggggg agggcgcggt ccaatggatg aacaggctta ttgcctttgc 6120
ttccagagga aaccacgtcg cccctactca ctacgtgacg gagtcggatg cgtcgcagcg 6180
tgtgacccaa ctacttggct ctcttactat aaccagccta ctcagaagac tccacaattg 6240
gataactgag gactgcccca tcccatgctc cggatcctgg ctccgcgacg tgtgggactg 6300
ggtttgcacc atcttgacag acttcaaaaa ttggctgacc tctaaattgt tccccaagct 6360
gcccggcctc cccttcatct cttgtcaaaa ggggtacaag ggtgtgtggg ccggcactgg 6420
catcatgacc acgcgctgcc cttgcggcgc caacatctct ggcaatgtcc gcctgggctc 6480
tatgaggatc acagggccta aaacctgcat gaacacctgg caggggacct ttcctatcaa 6540
ttgctacacg gagggccagt gcgcgccgaa accccccacg aactacaaga ccgccatctg 6600
gagggtggcg gcctcggagt acgcggaggt gacgcagcat gggtcgtact cctatgtaac 6660
aggactgacc actgacaatc tgaaaattcc ttgccaacta ccttctccag agtttttctc 6720
ctgggtggac ggtgtgcaga tccataggtt tgcacccaca ccaaagccgt ttttccggga 6780
tgaggtctcg ttctgcgttg ggcttaattc ctatgctgtc gggtcccagc ttccctgtga 6840
acctgagccc gacgcagacg tattgaggtc catgctaaca gatccgcccc acatcacggc 6900
ggagactgcg gcgcggcgct tggcacgggg atcacctcca tctgaggcga gctcctcagt 6960
gagccagcta tcagcaccgt cgctgcgggc cacctgcacc acccacagca acacctatga 7020
cgtggacatg gtcgatgcca acctgctcat ggagggcggt gtggctcaga cagagcctga 7080
gtccagggtg cccgttctgg actttctcga gccaatggcc gaggaagaga gcgaccttga 7140
gccctcaata ccatcggagt gcatgctccc caggagcggg tttccacggg ccttaccggc 7200
ttgggcacgg cctgactaca acccgccgct cgtggaatcg tggaggaggc cagattacca 7260
accgcccacc gttgctggtt gtgctctccc cccccccaag aaggccccga cgcctccccc 7320
aaggagacgc cggacagtgg gtctgagcga gagcaccata tcagaagccc tccagcaact 7380
ggccatcaag acctttggcc agcccccctc gagcggtgat gcaggctcgt ccacgggggc 7440
gggcgccgcc gaatccggcg gtccgacgtc ccctggtgag ccggccccct cagagacagg 7500
ttccgcctcc tctatgcccc ccctcgagat ggcttccaag gtgtacgacc ccgagcaacg 7560
caaacgcatg atcactgggc ctcagtggtg ggctcgctgc aagcaaatga acgtgctgga 7620
ctccttcatc aactactatg attccgagaa gcacgccgag aacgccgtga tttttctgca 7680
tggtaacgct gcctccagct acctgtggag gcacgtcgtg cctcacatcg agcccgtggc 7740
tagatgcatc atccctgatc tgatcggaat gggtaagtcc ggcaagagcg ggaatggctc 7800
atatcgcctc ctggatcact acaagtacct caccgcttgg ttcgagctgc tgaaccttcc 7860
aaagaaaatc atctttgtgg gccacgactg gggggcttgt ctggcctttc actactccta 7920
cgagcaccaa gacaagatca aggccatcgt ccatgctgag agtgtcgtgg acgtgatcga 7980
gtcctgggac gagtggcctg acatcgagga ggatatcgcc ctgatcaaga gcgaagaggg 8040
cgagaaaatg gtgcttgaga ataacttctt cgtcgagacc atgctcccaa gcaagatcat 8100
gcggaaactg gagcctgagg agttcgctgc ctacctggag ccattcaagg agaagggcga 8160
ggttagacgg cctaccctct cctggcctcg cgagatccct ctcgttaagg gaggcaagcc 8220
cgacgtcgtc cagattgtcc gcaactacaa cgcctacctt cgggccagcg acgatctgcc 8280
taagatgttc atcgagtccg accctgggtt cttttccaac gctattgtcg agggagctaa 8340
gaagttccct aacaccgagt tcgtgaaggt gaagggcctc cacttcagcc aggaggacgc 8400
tccagatgaa atgggtaagt acatcaagag cttcgtggag cgcgtgctga agaacgagca 8460
gctcgagggg gagcctggag atccggacct ggagtctgat caggtagagc ttcaacctcc 8520
cccccagggg gggggggtag ctcccggttc gggctcgggg tcttggtcta cttgctccga 8580
ggaggacgat accaccgtgt gctgctccat gtcatactcc tggaccgggg ctctaataac 8640
tccctgtagc cccgaagagg aaaagttgcc aatcaaccct ttgagtaact cgctgttgcg 8700
ataccataac aaggtgtact gtacaacatc aaagagcgcc tcacagaggg ctaaaaaggt 8760
aacttttgac aggacgcaag tgctcgacgc ccattatgac tcagtcttaa aggacatcaa 8820
gctagcggct tccaaggtca gcgcaaggct cctcaccttg gaggaggcgt gccagttgac 8880
tccaccccat tctgcaagat ccaagtatgg attcggggcc aaggaggtcc gcagcttgtc 8940
cgggagggcc gttaaccaca tcaagtccgt gtggaaggac ctcctggaag acccacaaac 9000
accaattccc acaaccatca tggccaaaaa tgaggtgttc tgcgtggacc ccgccaaggg 9060
gggtaagaaa ccagctcgcc tcatcgttta ccctgacctc ggcgtccggg tctgcgagaa 9120
aatggccctc tatgacatta cacaaaagct tcctcaggcg gtaatgggag cttcctatgg 9180
cttccagtac tcccctgccc aacgggtgga gtatctcttg aaagcatggg cggaaaagaa 9240
ggaccccatg ggtttttcgt atgatacccg atgcttcgac tcaaccgtca ctgagagaga 9300
catcaggacc gaggagtcca tataccaggc ctgctccctg cccgaggagg cccgcactgc 9360
catacactcg ctgactgaga gactttacgt aggagggccc atgttcaaca gcaagggtca 9420
aacctgcggt tacagacgtt gccgcgccag cggggtgcta accactagca tgggtaacac 9480
catcacatgc tatgtgaaag ccctagcggc ctgcaaggct gcggggatag ttgcgcccac 9540
aatgctggta tgcggcgatg acctagtagt catctcagaa agccagggga ctgaggagga 9600
cgagcggaac ctgagagcct tcacggaggc catgaccagg tactctgccc ctcctggtga 9660
tccccccaga ccggaatatg acctggagct aataacatcc tgttcctcaa atgtgtctgt 9720
ggcgttgggc ccgcggggcc gccgcagata ctacctgacc agagacccaa ccactccact 9780
cgcccgggct gcctgggaaa cagttagaca ctcccctatc aattcatggc tgggaaacat 9840
catccagtat gctccaacca tatgggttcg catggtccta atgacacact tcttctccat 9900
tctcatggtc caagacaccc tggaccagaa cctcaacttt gagatgtatg gatcagtata 9960
ctccgtgaat cctttggacc ttccagccat aattgagagg ttacacgggc ttgacgcctt 10020
ttctatgcac acatactctc accacgaact gacgcgggtg gcttcagccc tcagaaaact 10080
tggggcgcca cccctcaggg tgtggaagag tcgggctcgc gcagtcaggg cgtccctcat 10140
ctcccgtgga gggaaagcgg ccgtttgcgg ccgatatctc ttcaattggg cggtgaagac 10200
caagctcaaa ctcactccat tgccggaggc gcgcctactg gacttatcca gttggttcac 10260
cgtcggcgcc ggcgggggcg acatttttca cagcgtgtcg cgcgcccgac cccgctcatt 10320
actcttcggc ctactcctac ttttcgtagg ggtaggcctc ttcctactcc ccgctcggta 10380
gagcggcaca cactaggtac actccatagc taactgttcc tttttttttt tttttttttt 10440
tttttttttt tttttttttt tttctttttt ttttttttcc ctctttcttc ccttctcatc 10500
ttattctact ttctttcttg gtggctccat cttagcccta gtcacggcta gctgtgaaag 10560
gtccgtgagc cgcatgactg cagagagtgc cgtaactggt ctctctgcag atcatgt 10617
<210>9
<211>933
<212>DNA
<213>海肾
<220>
<223>海肾荧光素酶
<400>9
atggcttcca aggtgtacga ccccgagcaa cgcaaacgca tgatcactgg gcctcagtgg 60
tgggctcgct gcaagcaaat gaacgtgctg gactccttca tcaactacta tgattccgag 120
aagcacgccg agaacgccgt gatttttctg catggtaacg ctgcctccag ctacctgtgg 180
aggcacgtcg tgcctcacat cgagcccgtg gctagatgca tcatccctga tctgatcgga 240
atgggtaagt ccggcaagag cgggaatggc tcatatcgcc tcctggatca ctacaagtac 300
ctcaccgctt ggttcgagct gctgaacctt ccaaagaaaa tcatctttgt gggccacgac 360
tggggggctt gtctggcctt tcactactcc tacgagcacc aagacaagat caaggccatc 420
gtccatgctg agagtgtcgt ggacgtgatc gagtcctggg acgagtggcc tgacatcgag 480
gaggatatcg ccctgatcaa gagcgaagag ggcgagaaaa tggtgcttga gaataacttc 540
ttcgtcgaga ccatgctccc aagcaagatc atgcggaaac tggagcctga ggagttcgct 600
gcctacctgg agccattcaa ggagaagggc gaggttagac ggcctaccct ctcctggcct 660
cgcgagatcc ctctcgttaa gggaggcaag cccgacgtcg tccagattgt ccgcaactac 720
aacgcctacc ttcgggccag cgacgatctg cctaagatgt tcatcgagtc cgaccctggg 780
ttcttttcca acgctattgt cgagggagct aagaagttcc ctaacaccga gttcgtgaag 840
gtgaagggcc tccacttcag ccaggaggac gctccagatg aaatgggtaa gtacatcaag 900
agcttcgtgg agcgcgtgct gaagaacgag cag 933
<210>10
<211>20
<212>DNA
<213>人工序列
<220>
<223>引物
<400>10
ctttgactcc gtgatcgacc 20
<210>11
<211>19
<212>DNA
<213>人工序列
<220>
<223>引物
<400>11
ccctgtcttc ctctacctg 19
<210>12
<211>19
<212>DNA
<213>人工序列
<220>
<223>引物
<400>12
tggcacccag cacaatgaa 19
<210>13
<211>25
<212>DNA
<213>人工序列
<220>
<223>引物
<400>13
ctaagtcata gtccgcctag aagca 25
<210>14
<211>30
<212>DNA
<213>人工序列
<220>
<223>引物
<400>14
ctcgagatgg cttccaaggt gtacgacccc 30
<210>15
<211>30
<212>DNA
<213>人工序列
<220>
<223>引物
<400>15
ctcgagctgc tcgttcttca gcacgcgctc 30
<210>16
<211>25
<212>DNA
<213>人工序列
<220>
<223>引物
<400>16
ggaacagtta gctatggagt gtacc 25
<210>17
<211>24
<212>DNA
<213>人工序列
<220>
<223>引物
<400>17
tgtcttcacg cagaaagcgc ctag 24
<210>18
<211>25
<212>DNA
<213>人工序列
<220>
<223>引物
<400>18
ctgagctggt attatggaga cgtcc 25
Claims (9)
1.核酸,该核酸编码包含1个以上氨基酸取代的丙型肝炎病毒JFH1株的前体多聚蛋白,当以序列表的SEQ ID NO:2所示的氨基酸序列为基准时,上述前体多聚蛋白中至少第862位的谷氨酰胺被取代成精氨酸。
2.权利要求1所述的核酸,其中上述前体多聚蛋白为选自下述(a)~(f)的前体多聚蛋白:
(a)前体多聚蛋白,其中,当以序列表的SEQ ID NO:2所示的氨基酸序列为基准时,第74位的赖氨酸被取代成苏氨酸、第297位的酪氨酸被取代成组氨酸、第330位的丙氨酸被取代成苏氨酸、第395位的丝氨酸被取代成脯氨酸、第417位的天冬酰胺被取代成丝氨酸、第483位的天冬氨酸被取代成甘氨酸、第501位的丙氨酸被取代成苏氨酸、第862位的谷氨酰胺被取代成精氨酸、第931位的谷氨酰胺被取代成精氨酸、以及第961位的丝氨酸被取代成丙氨酸;
(b)前体多聚蛋白,其中,当以序列表的SEQ ID NO:2所示的氨基酸序列为基准时,第31位的缬氨酸被取代成丙氨酸、第74位的赖氨酸被取代成苏氨酸、第451位的甘氨酸被取代成精氨酸、第756位的缬氨酸被取代成丙氨酸、第786位的缬氨酸被取代成丙氨酸、以及第862位的谷氨酰胺被取代成精氨酸;
(c)前体多聚蛋白,其中,当以序列表的SEQ ID NO:2所示的氨基酸序列为基准时,第74位的赖氨酸被取代成苏氨酸、第451位的甘氨酸被取代成精氨酸、第756位的缬氨酸被取代成丙氨酸、第786位的缬氨酸被取代成丙氨酸、以及第862位的谷氨酰胺被取代成精氨酸;
(d)前体多聚蛋白,其中,当以序列表的SEQ ID NO:2所示的氨基酸序列为基准时,第31位的缬氨酸被取代成丙氨酸、第74位的赖氨酸被取代成苏氨酸、第451位的甘氨酸被取代成精氨酸、第786位的缬氨酸被取代成丙氨酸、以及第862位的谷氨酰胺被取代成精氨酸;
(e)前体多聚蛋白,其中,当以序列表的SEQ ID NO:2所示的氨基酸序列为基准时,第31位的缬氨酸被取代成丙氨酸、第74位的赖氨酸被取代成苏氨酸、第451位的甘氨酸被取代成精氨酸、第756位的缬氨酸被取代成丙氨酸、以及第862位的谷氨酰胺被取代成精氨酸;
(f)前体多聚蛋白,其中,当以序列表的SEQ ID NO:2所示的氨基酸序列为基准时,仅第862位的谷氨酰胺被取代成精氨酸。
3.权利要求2所述的核酸,该核酸包含序列表的SEQ ID NO:3、4或5所示的核苷酸序列。
4.权利要求1或2所述的核酸,其中,编码报道蛋白的核酸被插入在编码上述前体多聚蛋白的NS5A蛋白的区内。
5.权利要求4所述的核酸,其中,当以序列表的SEQ ID NO:2所示的氨基酸序列为基准时,上述报道蛋白被整合在第2394位氨基酸残基与第2395位氨基酸残基之间,作为融合蛋白被翻译。
6.权利要求5所述的核酸,该核酸包含序列表的SEQ ID NO:6或7所示的核苷酸序列。
7.丙型肝炎病毒颗粒,该病毒颗粒包含权利要求1或2所述的核酸。
8.培养细胞,该培养细胞生产权利要求7所述的丙型肝炎病毒颗粒。
9.丙型肝炎病毒疫苗,该疫苗是将权利要求7所述的丙型肝炎病毒颗粒灭活而得到的。
Priority Applications (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201010139886XA CN102199613A (zh) | 2010-03-25 | 2010-03-25 | 感染性丙型肝炎病毒高生产hcv突变体及其应用 |
EP11759547.0A EP2551345A4 (en) | 2010-03-25 | 2011-03-25 | HCV VERSION WITH HIGH PRODUCTIVITY FOR INFECTIOUS HEPATITIS C VIRUSES AND USE THEREOF |
JP2012507076A JP5816614B2 (ja) | 2010-03-25 | 2011-03-25 | 感染性c型肝炎ウイルス高生産hcv変異体及びその利用 |
CA2794359A CA2794359A1 (en) | 2010-03-25 | 2011-03-25 | Infectious hepatitis c virus-high producing hcv variants and use thereof |
US13/636,904 US9057048B2 (en) | 2010-03-25 | 2011-03-25 | Infectious hepatitis C virus—high producing HCV variants and use thereof |
PCT/JP2011/057271 WO2011118743A1 (ja) | 2010-03-25 | 2011-03-25 | 感染性c型肝炎ウイルス高生産hcv変異体及びその利用 |
CN201180015783.5A CN103339254B (zh) | 2010-03-25 | 2011-03-25 | 感染性丙型肝炎病毒高生产hcv突变体及其应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201010139886XA CN102199613A (zh) | 2010-03-25 | 2010-03-25 | 感染性丙型肝炎病毒高生产hcv突变体及其应用 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN102199613A true CN102199613A (zh) | 2011-09-28 |
Family
ID=44660599
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201010139886XA Pending CN102199613A (zh) | 2010-03-25 | 2010-03-25 | 感染性丙型肝炎病毒高生产hcv突变体及其应用 |
CN201180015783.5A Expired - Fee Related CN103339254B (zh) | 2010-03-25 | 2011-03-25 | 感染性丙型肝炎病毒高生产hcv突变体及其应用 |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201180015783.5A Expired - Fee Related CN103339254B (zh) | 2010-03-25 | 2011-03-25 | 感染性丙型肝炎病毒高生产hcv突变体及其应用 |
Country Status (6)
Country | Link |
---|---|
US (1) | US9057048B2 (zh) |
EP (1) | EP2551345A4 (zh) |
JP (1) | JP5816614B2 (zh) |
CN (2) | CN102199613A (zh) |
CA (1) | CA2794359A1 (zh) |
WO (1) | WO2011118743A1 (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103261411A (zh) * | 2010-10-08 | 2013-08-21 | 株式会社先端生命科学研究所 | 丙型肝炎病毒基因 |
CN105164251A (zh) * | 2012-09-28 | 2015-12-16 | 国立大学法人神户大学 | C型肝炎病毒颗粒形成促进剂和c型肝炎病毒颗粒的产生方法 |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008141651A1 (en) | 2007-05-18 | 2008-11-27 | Hvidovre Hospital | Efficient cell culture system for hepatitis c virus genotype 5a |
WO2009080052A1 (en) | 2007-12-20 | 2009-07-02 | Hvidovre Hospital | Efficient cell culture system for hepatitis c virus genotype 6a |
WO2010017818A1 (en) | 2008-08-15 | 2010-02-18 | Hvidovre Hospital | Efficient cell culture system for hepatitis c virus genotype 2b |
US8506969B2 (en) | 2008-08-15 | 2013-08-13 | Hvidovre Hospital | Efficient cell culture system for hepatitis C virus genotype 7a |
EP2344647A1 (en) | 2008-10-03 | 2011-07-20 | Hvidovre Hospital | Hepatitis c virus expressing reporter tagged ns5a protein |
JP6589159B2 (ja) * | 2014-04-01 | 2019-10-16 | 共立製薬株式会社 | 新規ネコモルビリウイルス株、不活化ワクチン製剤、並びにネコモルビリウイルス感染症予防方法 |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4880116B2 (ja) | 2000-12-01 | 2012-02-22 | 財団法人 東京都医学総合研究所 | 劇症c型肝炎ウイルス株の遺伝子 |
ES2347237T3 (es) | 2003-05-26 | 2010-10-27 | Toray Industries, Inc. | Constructo de acido nucleico que contiene el virus de la hepatitis c (vhc) del acido nucleico de origen genomico de genotipo 2a y celula que tiene el constructo de acido nucleico transferido a su interior. |
CN102206663B (zh) | 2004-02-20 | 2014-02-05 | 公益财团法人东京都医学综合研究所 | 含人丙型肝炎病毒全长基因组的核酸构建物及其用途 |
AU2007288129B2 (en) | 2006-08-25 | 2013-03-07 | The Macfarlane Burnet Institute For Medical Research And Public Health Limited | Recombinant HCV E2 glycoprotein |
WO2010017818A1 (en) * | 2008-08-15 | 2010-02-18 | Hvidovre Hospital | Efficient cell culture system for hepatitis c virus genotype 2b |
CN102361977B (zh) * | 2008-12-26 | 2014-09-24 | 东丽株式会社 | 衍生自丙型肝炎病毒的核酸以及通过使用该核酸各自制备的表达载体、转化细胞和丙型肝炎病毒颗粒 |
CN101748149B (zh) * | 2009-12-24 | 2012-09-05 | 中国人民解放军军事医学科学院微生物流行病研究所 | 含有丙肝病毒基因组的质粒、细胞、药物筛选系统和方法 |
-
2010
- 2010-03-25 CN CN201010139886XA patent/CN102199613A/zh active Pending
-
2011
- 2011-03-25 EP EP11759547.0A patent/EP2551345A4/en not_active Withdrawn
- 2011-03-25 WO PCT/JP2011/057271 patent/WO2011118743A1/ja active Application Filing
- 2011-03-25 CN CN201180015783.5A patent/CN103339254B/zh not_active Expired - Fee Related
- 2011-03-25 US US13/636,904 patent/US9057048B2/en not_active Expired - Fee Related
- 2011-03-25 JP JP2012507076A patent/JP5816614B2/ja not_active Expired - Fee Related
- 2011-03-25 CA CA2794359A patent/CA2794359A1/en not_active Abandoned
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103261411A (zh) * | 2010-10-08 | 2013-08-21 | 株式会社先端生命科学研究所 | 丙型肝炎病毒基因 |
CN103261411B (zh) * | 2010-10-08 | 2015-01-28 | 株式会社先端生命科学研究所 | 丙型肝炎病毒基因 |
CN105164251A (zh) * | 2012-09-28 | 2015-12-16 | 国立大学法人神户大学 | C型肝炎病毒颗粒形成促进剂和c型肝炎病毒颗粒的产生方法 |
Also Published As
Publication number | Publication date |
---|---|
CA2794359A1 (en) | 2011-09-29 |
US9057048B2 (en) | 2015-06-16 |
JPWO2011118743A1 (ja) | 2013-07-04 |
EP2551345A1 (en) | 2013-01-30 |
CN103339254B (zh) | 2016-04-27 |
CN103339254A (zh) | 2013-10-02 |
WO2011118743A1 (ja) | 2011-09-29 |
JP5816614B2 (ja) | 2015-11-18 |
EP2551345A4 (en) | 2014-01-29 |
US20130078277A1 (en) | 2013-03-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP4903929B2 (ja) | C型肝炎ウイルス細胞培養系、c型肝炎ウイルス−rna−構築物、細胞培養系または構築物の使用、c型肝炎ウイルス−rna−構築物の細胞培養に適合した突然変異体を獲得する方法、c型肝炎ウイルス−全長ゲノム、c型肝炎ウイルス−部分ゲノム、または任意のc型肝炎ウイルス−構築物の突然変異体の製法、細胞培養に適合したc型肝炎ウイルス−構築物、その突然変異体、c型肝炎ウイルス−全長ゲノムの突然変異体、c型肝炎ウイルス粒子またはウイルス様粒子、およびこれで感染した細胞 | |
US8945584B2 (en) | Cell culture system of a hepatitis C genotype 3a and 2a chimera | |
CN103339254B (zh) | 感染性丙型肝炎病毒高生产hcv突变体及其应用 | |
JP4921164B2 (ja) | ヒトc型肝炎ウイルスの全長ゲノムを含む核酸構築物及び該核酸構築物を導入した組換え全長ウイルスゲノム複製細胞、並びにc型肝炎ウイルス粒子の作製方法 | |
HK1206062A1 (zh) | 衍生自丙型肝炎病毒的核酸以及通过使用该核酸各自制备的表达载体、转化细胞和丙型肝炎病毒颗粒 | |
JP6026419B2 (ja) | 遺伝子型3aのC型肝炎ウイルスゲノム由来の核酸を含む核酸構築物 | |
CN103703133B (zh) | 来自丙型肝炎病毒j6cf株基因组的突变体复制子 | |
US9234184B2 (en) | Nucleic acid construct comprising nucleic acid derived from genome of hepatitis C virus of genotype 1B, hepatitis C virus genome-replicating cells transfected with the same, and method for producing infectious hepatitis C virus particles | |
HK1156355B (zh) | 含有来自丙型肝炎病毒的嵌合基因的核酸 | |
HK1156355A1 (zh) | 含有来自丙型肝炎病毒的嵌合基因的核酸 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20110928 |