CN1414103A - Human serine/threonine protein kitase, its coding sequence, preparation method and use - Google Patents
Human serine/threonine protein kitase, its coding sequence, preparation method and use Download PDFInfo
- Publication number
- CN1414103A CN1414103A CN02136497A CN02136497A CN1414103A CN 1414103 A CN1414103 A CN 1414103A CN 02136497 A CN02136497 A CN 02136497A CN 02136497 A CN02136497 A CN 02136497A CN 1414103 A CN1414103 A CN 1414103A
- Authority
- CN
- China
- Prior art keywords
- polypeptide
- leu
- pro
- sequence
- arg
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108090000623 proteins and genes Proteins 0.000 title claims description 84
- 102000004169 proteins and genes Human genes 0.000 title claims description 70
- 108091026890 Coding region Proteins 0.000 title description 16
- 238000002360 preparation method Methods 0.000 title description 4
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 title 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 title 1
- 239000004473 Threonine Substances 0.000 title 1
- 229920001184 polypeptide Polymers 0.000 claims abstract description 66
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 66
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 66
- 239000002773 nucleotide Substances 0.000 claims abstract description 42
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 42
- 238000000034 method Methods 0.000 claims abstract description 17
- 235000018102 proteins Nutrition 0.000 claims description 61
- 239000012634 fragment Substances 0.000 claims description 50
- 210000004027 cell Anatomy 0.000 claims description 47
- 230000014509 gene expression Effects 0.000 claims description 16
- 101001059454 Homo sapiens Serine/threonine-protein kinase MARK2 Proteins 0.000 claims description 15
- 102100028904 Serine/threonine-protein kinase MARK2 Human genes 0.000 claims description 15
- 150000007523 nucleic acids Chemical class 0.000 claims description 13
- 238000006243 chemical reaction Methods 0.000 claims description 10
- 108020004707 nucleic acids Proteins 0.000 claims description 10
- 102000039446 nucleic acids Human genes 0.000 claims description 10
- 239000013604 expression vector Substances 0.000 claims description 8
- 239000003814 drug Substances 0.000 claims description 7
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 6
- 238000009396 hybridization Methods 0.000 claims description 6
- 230000003321 amplification Effects 0.000 claims description 5
- 230000000692 anti-sense effect Effects 0.000 claims description 5
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 5
- 102000001253 Protein Kinase Human genes 0.000 claims description 3
- 230000008859 change Effects 0.000 claims description 3
- 239000003112 inhibitor Substances 0.000 claims description 3
- 108060006633 protein kinase Proteins 0.000 claims description 3
- 125000003275 alpha amino acid group Chemical group 0.000 claims 2
- 238000000926 separation method Methods 0.000 claims 2
- 241000894006 Bacteria Species 0.000 claims 1
- 239000000556 agonist Substances 0.000 claims 1
- 238000012258 culturing Methods 0.000 claims 1
- 230000000968 intestinal effect Effects 0.000 claims 1
- 239000002299 complementary DNA Substances 0.000 abstract description 26
- 101000798314 Danio rerio Aurora kinase B Proteins 0.000 abstract description 6
- 108091033319 polynucleotide Proteins 0.000 abstract description 4
- 102000040430 polynucleotide Human genes 0.000 abstract description 4
- 239000002157 polynucleotide Substances 0.000 abstract description 4
- 230000008569 process Effects 0.000 abstract description 3
- 238000013518 transcription Methods 0.000 abstract description 3
- 230000035897 transcription Effects 0.000 abstract description 3
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 42
- 239000013615 primer Substances 0.000 description 39
- 150000001413 amino acids Chemical group 0.000 description 31
- 235000001014 amino acid Nutrition 0.000 description 23
- 108091008146 restriction endonucleases Proteins 0.000 description 18
- 239000013598 vector Substances 0.000 description 18
- 239000000523 sample Substances 0.000 description 16
- 238000005520 cutting process Methods 0.000 description 15
- 239000000047 product Substances 0.000 description 14
- 230000006870 function Effects 0.000 description 12
- 229960000789 guanidine hydrochloride Drugs 0.000 description 12
- PJJJBBJSCAKJQF-UHFFFAOYSA-N guanidinium chloride Chemical compound [Cl-].NC(N)=[NH2+] PJJJBBJSCAKJQF-UHFFFAOYSA-N 0.000 description 12
- 230000000694 effects Effects 0.000 description 11
- 239000013612 plasmid Substances 0.000 description 10
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 9
- 108020004414 DNA Proteins 0.000 description 9
- 238000012217 deletion Methods 0.000 description 9
- 230000037430 deletion Effects 0.000 description 9
- 241000588724 Escherichia coli Species 0.000 description 8
- 108091034117 Oligonucleotide Proteins 0.000 description 8
- 239000012535 impurity Substances 0.000 description 8
- 230000036961 partial effect Effects 0.000 description 8
- 230000004952 protein activity Effects 0.000 description 8
- 108010026333 seryl-proline Proteins 0.000 description 8
- 238000007796 conventional method Methods 0.000 description 7
- 108010050848 glycylleucine Proteins 0.000 description 7
- 230000001105 regulatory effect Effects 0.000 description 7
- 238000013519 translation Methods 0.000 description 7
- 230000014616 translation Effects 0.000 description 7
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 6
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 6
- 108010062796 arginyllysine Proteins 0.000 description 6
- 230000001580 bacterial effect Effects 0.000 description 6
- 201000010099 disease Diseases 0.000 description 6
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 6
- 230000002441 reversible effect Effects 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- 102100030011 Endoribonuclease Human genes 0.000 description 5
- 101710199605 Endoribonuclease Proteins 0.000 description 5
- 241001465754 Metazoa Species 0.000 description 5
- 108091028043 Nucleic acid sequence Proteins 0.000 description 5
- 101710113029 Serine/threonine-protein kinase Proteins 0.000 description 5
- 230000037396 body weight Effects 0.000 description 5
- 210000004899 c-terminal region Anatomy 0.000 description 5
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 5
- 238000001962 electrophoresis Methods 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 239000000126 substance Substances 0.000 description 5
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 4
- OQCPATDFWYYDDX-HGNGGELXSA-N Ala-Gln-His Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OQCPATDFWYYDDX-HGNGGELXSA-N 0.000 description 4
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 4
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 4
- SMYXEYRYCLIPIL-ZLUOBGJFSA-N Cys-Cys-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O SMYXEYRYCLIPIL-ZLUOBGJFSA-N 0.000 description 4
- 102000004127 Cytokines Human genes 0.000 description 4
- 108090000695 Cytokines Proteins 0.000 description 4
- 239000003155 DNA primer Substances 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 4
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 4
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 4
- WZBLRQQCDYYRTD-SIXJUCDHSA-N His-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N WZBLRQQCDYYRTD-SIXJUCDHSA-N 0.000 description 4
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 4
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 4
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 4
- 241000880493 Leptailurus serval Species 0.000 description 4
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 4
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 4
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 4
- IPTUBUUIFRZMJK-ACRUOGEOSA-N Lys-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 IPTUBUUIFRZMJK-ACRUOGEOSA-N 0.000 description 4
- QZPXMHVKPHJNTR-DCAQKATOSA-N Met-Leu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O QZPXMHVKPHJNTR-DCAQKATOSA-N 0.000 description 4
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 4
- 108700026244 Open Reading Frames Proteins 0.000 description 4
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 4
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 4
- AFWBWPCXSWUCLB-WDSKDSINSA-N Pro-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 AFWBWPCXSWUCLB-WDSKDSINSA-N 0.000 description 4
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- 108091081024 Start codon Proteins 0.000 description 4
- NYQIZWROIMIQSL-VEVYYDQMSA-N Thr-Pro-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O NYQIZWROIMIQSL-VEVYYDQMSA-N 0.000 description 4
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 4
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 4
- YYLHVUCSTXXKBS-IHRRRGAJSA-N Tyr-Pro-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YYLHVUCSTXXKBS-IHRRRGAJSA-N 0.000 description 4
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 4
- 230000003213 activating effect Effects 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 108010008355 arginyl-glutamine Proteins 0.000 description 4
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 4
- 108010077245 asparaginyl-proline Proteins 0.000 description 4
- 108010093581 aspartyl-proline Proteins 0.000 description 4
- 230000003115 biocidal effect Effects 0.000 description 4
- 230000003197 catalytic effect Effects 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 108010016616 cysteinylglycine Proteins 0.000 description 4
- 238000010790 dilution Methods 0.000 description 4
- 239000012895 dilution Substances 0.000 description 4
- 229940079593 drug Drugs 0.000 description 4
- 239000003480 eluent Substances 0.000 description 4
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 4
- 108010050343 histidyl-alanyl-glutamine Proteins 0.000 description 4
- 230000003053 immunization Effects 0.000 description 4
- 101150109249 lacI gene Proteins 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- 108010017391 lysylvaline Proteins 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 108010031719 prolyl-serine Proteins 0.000 description 4
- 108010070643 prolylglutamic acid Proteins 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 238000012163 sequencing technique Methods 0.000 description 4
- 108010005652 splenotritin Proteins 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- 210000001550 testis Anatomy 0.000 description 4
- 108010061238 threonyl-glycine Proteins 0.000 description 4
- 238000001890 transfection Methods 0.000 description 4
- 108010044292 tryptophyltyrosine Proteins 0.000 description 4
- 108010079202 tyrosyl-alanyl-cysteine Proteins 0.000 description 4
- 108010073969 valyllysine Proteins 0.000 description 4
- 230000003612 virological effect Effects 0.000 description 4
- 108090000749 Aurora kinase B Proteins 0.000 description 3
- 102100032306 Aurora kinase B Human genes 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- BVELAHPZLYLZDJ-HGNGGELXSA-N Gln-His-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O BVELAHPZLYLZDJ-HGNGGELXSA-N 0.000 description 3
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 3
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 3
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 3
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 108010013835 arginine glutamate Proteins 0.000 description 3
- 108010038633 aspartylglutamate Proteins 0.000 description 3
- 230000010261 cell growth Effects 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 238000004440 column chromatography Methods 0.000 description 3
- 102000037865 fusion proteins Human genes 0.000 description 3
- 108020001507 fusion proteins Proteins 0.000 description 3
- 230000013595 glycosylation Effects 0.000 description 3
- 238000006206 glycosylation reaction Methods 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 210000003958 hematopoietic stem cell Anatomy 0.000 description 3
- 210000004408 hybridoma Anatomy 0.000 description 3
- 230000002401 inhibitory effect Effects 0.000 description 3
- 108010064235 lysylglycine Proteins 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 238000002493 microarray Methods 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 230000001766 physiological effect Effects 0.000 description 3
- 102000005962 receptors Human genes 0.000 description 3
- 108020003175 receptors Proteins 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 2
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 2
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 2
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 2
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 2
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 2
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 2
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 2
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 2
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 2
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 2
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 2
- MSILNNHVVMMTHZ-UWVGGRQHSA-N Arg-His-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 MSILNNHVVMMTHZ-UWVGGRQHSA-N 0.000 description 2
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 2
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 2
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 2
- MTYLORHAQXVQOW-AVGNSLFASA-N Arg-Lys-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O MTYLORHAQXVQOW-AVGNSLFASA-N 0.000 description 2
- IGFJVXOATGZTHD-UHFFFAOYSA-N Arg-Phe-His Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccccc1)C(=O)NC(Cc2c[nH]cn2)C(=O)O IGFJVXOATGZTHD-UHFFFAOYSA-N 0.000 description 2
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 2
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 2
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 2
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 2
- DHVMIHWNDBFTHB-FXQIFTODSA-N Asn-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N DHVMIHWNDBFTHB-FXQIFTODSA-N 0.000 description 2
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 2
- BKDDABUWNKGZCK-XHNCKOQMSA-N Asn-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O BKDDABUWNKGZCK-XHNCKOQMSA-N 0.000 description 2
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 2
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 2
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 2
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 2
- TZBJAXGYGSIUHQ-XUXIUFHCSA-N Asp-Leu-Leu-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O TZBJAXGYGSIUHQ-XUXIUFHCSA-N 0.000 description 2
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 2
- WZUZGDANRQPCDD-SRVKXCTJSA-N Asp-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N WZUZGDANRQPCDD-SRVKXCTJSA-N 0.000 description 2
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 2
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 2
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 2
- 101100315624 Caenorhabditis elegans tyr-1 gene Proteins 0.000 description 2
- 241000699802 Cricetulus griseus Species 0.000 description 2
- POSRGGKLRWCUBE-CIUDSAMLSA-N Cys-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N POSRGGKLRWCUBE-CIUDSAMLSA-N 0.000 description 2
- MBRWOKXNHTUJMB-CIUDSAMLSA-N Cys-Pro-Glu Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O MBRWOKXNHTUJMB-CIUDSAMLSA-N 0.000 description 2
- 229920002271 DEAE-Sepharose Polymers 0.000 description 2
- 238000001712 DNA sequencing Methods 0.000 description 2
- 241000620209 Escherichia coli DH5[alpha] Species 0.000 description 2
- 108700024394 Exon Proteins 0.000 description 2
- 241001200922 Gagata Species 0.000 description 2
- MWERYIXRDZDXOA-QEWYBTABSA-N Gln-Ile-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MWERYIXRDZDXOA-QEWYBTABSA-N 0.000 description 2
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 2
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 2
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 2
- OREPWMPAUWIIAM-ZPFDUUQYSA-N Gln-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N OREPWMPAUWIIAM-ZPFDUUQYSA-N 0.000 description 2
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 2
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 2
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 2
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 2
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 2
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 2
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 2
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 2
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 2
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 2
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 2
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 2
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 2
- SNFUTDLOCQQRQD-ZKWXMUAHSA-N Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SNFUTDLOCQQRQD-ZKWXMUAHSA-N 0.000 description 2
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 2
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 2
- HQOGXFLBAKJUMH-CIUDSAMLSA-N Glu-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N HQOGXFLBAKJUMH-CIUDSAMLSA-N 0.000 description 2
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 2
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 2
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 2
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 2
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 2
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 2
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 2
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 2
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 2
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 2
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 2
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 2
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 2
- JNGHLWWFPGIJER-STQMWFEESA-N Gly-Pro-Tyr Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JNGHLWWFPGIJER-STQMWFEESA-N 0.000 description 2
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 2
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 2
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- SVHKVHBPTOMLTO-DCAQKATOSA-N His-Arg-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SVHKVHBPTOMLTO-DCAQKATOSA-N 0.000 description 2
- LMMPTUVWHCFTOT-GARJFASQSA-N His-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O LMMPTUVWHCFTOT-GARJFASQSA-N 0.000 description 2
- JFFAPRNXXLRINI-NHCYSSNCSA-N His-Asp-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JFFAPRNXXLRINI-NHCYSSNCSA-N 0.000 description 2
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 2
- DGLAHESNTJWGDO-SRVKXCTJSA-N His-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DGLAHESNTJWGDO-SRVKXCTJSA-N 0.000 description 2
- UPJODPVSKKWGDQ-KLHWPWHYSA-N His-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O UPJODPVSKKWGDQ-KLHWPWHYSA-N 0.000 description 2
- DLTCGJZBNFOWFL-LKTVYLICSA-N His-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N DLTCGJZBNFOWFL-LKTVYLICSA-N 0.000 description 2
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 2
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- HYXQKVOADYPQEA-CIUDSAMLSA-N Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HYXQKVOADYPQEA-CIUDSAMLSA-N 0.000 description 2
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 2
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 2
- IXEFKXAGHRQFAF-HVTMNAMFSA-N Ile-Glu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IXEFKXAGHRQFAF-HVTMNAMFSA-N 0.000 description 2
- TVSPLSZTKTUYLV-ZPFDUUQYSA-N Ile-Glu-Met Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O TVSPLSZTKTUYLV-ZPFDUUQYSA-N 0.000 description 2
- BCVIOZZGJNOEQS-XKNYDFJKSA-N Ile-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)[C@@H](C)CC BCVIOZZGJNOEQS-XKNYDFJKSA-N 0.000 description 2
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 2
- SVZFKLBRCYCIIY-CYDGBPFRSA-N Ile-Pro-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVZFKLBRCYCIIY-CYDGBPFRSA-N 0.000 description 2
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 2
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 2
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 2
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 2
- JCGMFFQQHJQASB-PYJNHQTQSA-N Ile-Val-His Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O JCGMFFQQHJQASB-PYJNHQTQSA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 2
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 2
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- YWYQSLOTVIRCFE-SRVKXCTJSA-N Leu-His-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O YWYQSLOTVIRCFE-SRVKXCTJSA-N 0.000 description 2
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 2
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 2
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 2
- IBSGMIPRBMPMHE-IHRRRGAJSA-N Leu-Met-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O IBSGMIPRBMPMHE-IHRRRGAJSA-N 0.000 description 2
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 2
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 2
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 2
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 2
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 2
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 2
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 2
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 2
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 2
- RLZDUFRBMQNYIJ-YUMQZZPRSA-N Lys-Cys-Gly Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N RLZDUFRBMQNYIJ-YUMQZZPRSA-N 0.000 description 2
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 2
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 2
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 2
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 2
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 2
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 2
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 2
- WWEWGPOLIJXGNX-XUXIUFHCSA-N Lys-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N WWEWGPOLIJXGNX-XUXIUFHCSA-N 0.000 description 2
- PIXVFCBYEGPZPA-JYJNAYRXSA-N Lys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N PIXVFCBYEGPZPA-JYJNAYRXSA-N 0.000 description 2
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 2
- DYJOORGDQIGZAS-DCAQKATOSA-N Lys-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N DYJOORGDQIGZAS-DCAQKATOSA-N 0.000 description 2
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 2
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 2
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 2
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 2
- WTHGNAAQXISJHP-AVGNSLFASA-N Met-Lys-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WTHGNAAQXISJHP-AVGNSLFASA-N 0.000 description 2
- ILKCLLLOGPDNIP-RCWTZXSCSA-N Met-Met-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ILKCLLLOGPDNIP-RCWTZXSCSA-N 0.000 description 2
- CAEZLMGDJMEBKP-AVGNSLFASA-N Met-Pro-His Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC=N1 CAEZLMGDJMEBKP-AVGNSLFASA-N 0.000 description 2
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 2
- 241000699670 Mus sp. Species 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 108091005461 Nucleic proteins Proteins 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- DPUOLKQSMYLRDR-UBHSHLNASA-N Phe-Arg-Ala Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 DPUOLKQSMYLRDR-UBHSHLNASA-N 0.000 description 2
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 2
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 2
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 2
- MFQXSDWKUXTOPZ-DZKIICNBSA-N Phe-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N MFQXSDWKUXTOPZ-DZKIICNBSA-N 0.000 description 2
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 2
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 2
- DMEYUTSDVRCWRS-ULQDDVLXSA-N Phe-Lys-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DMEYUTSDVRCWRS-ULQDDVLXSA-N 0.000 description 2
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 2
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 2
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 2
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 2
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 2
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 2
- KDIIENQUNVNWHR-JYJNAYRXSA-N Pro-Arg-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KDIIENQUNVNWHR-JYJNAYRXSA-N 0.000 description 2
- NGNNPLJHUFCOMZ-FXQIFTODSA-N Pro-Asp-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 NGNNPLJHUFCOMZ-FXQIFTODSA-N 0.000 description 2
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 2
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 2
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 2
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 2
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 2
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 2
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 2
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 2
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 2
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 2
- XNJVJEHDZPDPQL-BZSNNMDCSA-N Pro-Trp-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H]1CCCN1)C(O)=O XNJVJEHDZPDPQL-BZSNNMDCSA-N 0.000 description 2
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 2
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 102100032314 RAC-gamma serine/threonine-protein kinase Human genes 0.000 description 2
- 108020005091 Replication Origin Proteins 0.000 description 2
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 2
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 2
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 2
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 2
- SNNSYBWPPVAXQW-ZLUOBGJFSA-N Ser-Cys-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)O SNNSYBWPPVAXQW-ZLUOBGJFSA-N 0.000 description 2
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 2
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 2
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 2
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 2
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 2
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 2
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- JLPMFVAIQHCBDC-CIUDSAMLSA-N Ser-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N JLPMFVAIQHCBDC-CIUDSAMLSA-N 0.000 description 2
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 2
- QPPYAWVLAVXISR-DCAQKATOSA-N Ser-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QPPYAWVLAVXISR-DCAQKATOSA-N 0.000 description 2
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 2
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- QNBVFKZSSRYNFX-CUJWVEQBSA-N Ser-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N)O QNBVFKZSSRYNFX-CUJWVEQBSA-N 0.000 description 2
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 2
- VAIWUNAAPZZGRI-IHPCNDPISA-N Ser-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N VAIWUNAAPZZGRI-IHPCNDPISA-N 0.000 description 2
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 2
- DSLHSTIUAPKERR-XGEHTFHBSA-N Thr-Cys-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O DSLHSTIUAPKERR-XGEHTFHBSA-N 0.000 description 2
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 2
- BQBCIBCLXBKYHW-CSMHCCOUSA-N Thr-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])[C@@H](C)O BQBCIBCLXBKYHW-CSMHCCOUSA-N 0.000 description 2
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 2
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 2
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- IQIRAJGHFRVFEL-UBHSHLNASA-N Trp-Ser-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N IQIRAJGHFRVFEL-UBHSHLNASA-N 0.000 description 2
- QJBWZNTWJSZUOY-UWJYBYFXSA-N Tyr-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QJBWZNTWJSZUOY-UWJYBYFXSA-N 0.000 description 2
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 2
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 2
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 2
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 2
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 2
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 2
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 2
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 2
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 2
- MANXHLOVEUHVFD-DCAQKATOSA-N Val-His-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N MANXHLOVEUHVFD-DCAQKATOSA-N 0.000 description 2
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 2
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 2
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 2
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 2
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 2
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 2
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 2
- UXODSMTVPWXHBT-ULQDDVLXSA-N Val-Phe-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N UXODSMTVPWXHBT-ULQDDVLXSA-N 0.000 description 2
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 2
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 2
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 2
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 2
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 2
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 2
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- HMNZFMSWFCAGGW-XPWSMXQVSA-N [3-[hydroxy(2-hydroxyethoxy)phosphoryl]oxy-2-[(e)-octadec-9-enoyl]oxypropyl] (e)-octadec-9-enoate Chemical compound CCCCCCCC\C=C\CCCCCCCC(=O)OCC(COP(O)(=O)OCCO)OC(=O)CCCCCCC\C=C\CCCCCCCC HMNZFMSWFCAGGW-XPWSMXQVSA-N 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 2
- 239000002671 adjuvant Substances 0.000 description 2
- 108010084094 alanyl-alanyl-alanyl-alanine Proteins 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 239000005557 antagonist Substances 0.000 description 2
- 239000000427 antigen Substances 0.000 description 2
- 230000000890 antigenic effect Effects 0.000 description 2
- 102000036639 antigens Human genes 0.000 description 2
- 108091007433 antigens Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 230000008952 bacterial invasion Effects 0.000 description 2
- 230000008238 biochemical pathway Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 230000004663 cell proliferation Effects 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 239000013522 chelant Substances 0.000 description 2
- 238000005352 clarification Methods 0.000 description 2
- 210000001728 clone cell Anatomy 0.000 description 2
- 210000004748 cultured cell Anatomy 0.000 description 2
- 108010004073 cysteinylcysteine Proteins 0.000 description 2
- 230000009089 cytolysis Effects 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 238000003745 diagnosis Methods 0.000 description 2
- 238000000502 dialysis Methods 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 238000010828 elution Methods 0.000 description 2
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 2
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 2
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 210000002865 immune cell Anatomy 0.000 description 2
- 238000002649 immunization Methods 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 230000000415 inactivating effect Effects 0.000 description 2
- 210000003000 inclusion body Anatomy 0.000 description 2
- 230000028709 inflammatory response Effects 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 230000003834 intracellular effect Effects 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 229930027917 kanamycin Natural products 0.000 description 2
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 2
- 229960000318 kanamycin Drugs 0.000 description 2
- 229930182823 kanamycin A Natural products 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 238000001638 lipofection Methods 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 108010085203 methionylmethionine Proteins 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 108010084572 phenylalanyl-valine Proteins 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 230000026731 phosphorylation Effects 0.000 description 2
- 238000006366 phosphorylation reaction Methods 0.000 description 2
- 230000035790 physiological processes and functions Effects 0.000 description 2
- 239000002244 precipitate Substances 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 230000001737 promoting effect Effects 0.000 description 2
- 239000012460 protein solution Substances 0.000 description 2
- 230000005855 radiation Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 238000000527 sonication Methods 0.000 description 2
- 230000004936 stimulating effect Effects 0.000 description 2
- 239000011550 stock solution Substances 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 230000036964 tight binding Effects 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 230000008467 tissue growth Effects 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- GPRLSGONYQIRFK-MNYXATJNSA-N triton Chemical compound [3H+] GPRLSGONYQIRFK-MNYXATJNSA-N 0.000 description 2
- PEZMQPADLFXCJJ-ZETCQYMHSA-N 2-[[2-[[(2s)-1-(2-aminoacetyl)pyrrolidine-2-carbonyl]amino]acetyl]amino]acetic acid Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(=O)NCC(O)=O PEZMQPADLFXCJJ-ZETCQYMHSA-N 0.000 description 1
- QMOQBVOBWVNSNO-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QMOQBVOBWVNSNO-UHFFFAOYSA-N 0.000 description 1
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical group NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 1
- OKEWAFFWMHBGPT-XPUUQOCRSA-N Ala-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 OKEWAFFWMHBGPT-XPUUQOCRSA-N 0.000 description 1
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 1
- FUKFQILQFQKHLE-DCAQKATOSA-N Ala-Lys-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O FUKFQILQFQKHLE-DCAQKATOSA-N 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 150000008574 D-amino acids Chemical class 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 238000000018 DNA microarray Methods 0.000 description 1
- 102000054300 EC 2.7.11.- Human genes 0.000 description 1
- 108700035490 EC 2.7.11.- Proteins 0.000 description 1
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 1
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 1
- KXRORHJIRAOQPG-SOUVJXGZSA-N Glu-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KXRORHJIRAOQPG-SOUVJXGZSA-N 0.000 description 1
- SCCPDJAQCXWPTF-VKHMYHEASA-N Gly-Asp Chemical compound NCC(=O)N[C@H](C(O)=O)CC(O)=O SCCPDJAQCXWPTF-VKHMYHEASA-N 0.000 description 1
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 1
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 102000009465 Growth Factor Receptors Human genes 0.000 description 1
- 108010009202 Growth Factor Receptors Proteins 0.000 description 1
- QIVPRLJQQVXCIY-HGNGGELXSA-N His-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCC(N)=O)C(O)=O QIVPRLJQQVXCIY-HGNGGELXSA-N 0.000 description 1
- 101000798007 Homo sapiens RAC-gamma serine/threonine-protein kinase Proteins 0.000 description 1
- UWLHDGMRWXHFFY-HPCHECBXSA-N Ile-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N1CCC[C@@H]1C(=O)O)N UWLHDGMRWXHFFY-HPCHECBXSA-N 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 150000008575 L-amino acids Chemical class 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- MVBZBRKNZVJEKK-DTWKUNHWSA-N Met-Gly-Pro Chemical compound CSCC[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N MVBZBRKNZVJEKK-DTWKUNHWSA-N 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- BZQFBWGGLXLEPQ-UHFFFAOYSA-N O-phosphoryl-L-serine Natural products OC(=O)C(N)COP(O)(O)=O BZQFBWGGLXLEPQ-UHFFFAOYSA-N 0.000 description 1
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 1
- KAJLHCWRWDSROH-BZSNNMDCSA-N Phe-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 KAJLHCWRWDSROH-BZSNNMDCSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 1
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 1
- ZYJMLBCDFPIGNL-JYJNAYRXSA-N Pro-Tyr-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O ZYJMLBCDFPIGNL-JYJNAYRXSA-N 0.000 description 1
- 102000004022 Protein-Tyrosine Kinases Human genes 0.000 description 1
- 108090000412 Protein-Tyrosine Kinases Proteins 0.000 description 1
- 101710103995 RAC-gamma serine/threonine-protein kinase Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 1
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- LHUBVKCLOVALIA-HJGDQZAQSA-N Thr-Arg-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LHUBVKCLOVALIA-HJGDQZAQSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- 206010048038 Wound infection Diseases 0.000 description 1
- 230000005856 abnormality Effects 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 238000001261 affinity purification Methods 0.000 description 1
- 239000008365 aqueous carrier Substances 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 210000001185 bone marrow Anatomy 0.000 description 1
- 230000021523 carboxylation Effects 0.000 description 1
- 238000006473 carboxylation reaction Methods 0.000 description 1
- 230000005779 cell damage Effects 0.000 description 1
- 230000003915 cell function Effects 0.000 description 1
- 208000037887 cell injury Diseases 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 230000009087 cell motility Effects 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000003759 clinical diagnosis Methods 0.000 description 1
- 230000022811 deglycosylation Effects 0.000 description 1
- 230000030609 dephosphorylation Effects 0.000 description 1
- 238000006209 dephosphorylation reaction Methods 0.000 description 1
- 229950006137 dexfosfoserine Drugs 0.000 description 1
- 239000003937 drug carrier Substances 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 239000003862 glucocorticoid Substances 0.000 description 1
- 108091005608 glycosylated proteins Proteins 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010043293 glycyl-prolyl-glycyl-glycine Proteins 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- 230000003862 health status Effects 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 230000008105 immune reaction Effects 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 230000016784 immunoglobulin production Effects 0.000 description 1
- 229940072221 immunoglobulins Drugs 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 239000007928 intraperitoneal injection Substances 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 210000004925 microvascular endothelial cell Anatomy 0.000 description 1
- 239000003471 mutagenic agent Substances 0.000 description 1
- 231100000707 mutagenic chemical Toxicity 0.000 description 1
- 231100000252 nontoxic Toxicity 0.000 description 1
- 230000003000 nontoxic effect Effects 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- 125000003835 nucleoside group Chemical group 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 239000008194 pharmaceutical composition Substances 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 108091005981 phosphorylated proteins Proteins 0.000 description 1
- BZQFBWGGLXLEPQ-REOHCLBHSA-N phosphoserine Chemical compound OC(=O)[C@@H](N)COP(O)(O)=O BZQFBWGGLXLEPQ-REOHCLBHSA-N 0.000 description 1
- USRGIUJOYOXOQJ-GBXIJSLDSA-N phosphothreonine Chemical compound OP(=O)(O)O[C@H](C)[C@H](N)C(O)=O USRGIUJOYOXOQJ-GBXIJSLDSA-N 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- DCWXELXMIBXGTH-UHFFFAOYSA-N phosphotyrosine Chemical compound OC(=O)C(N)CC1=CC=C(OP(O)(O)=O)C=C1 DCWXELXMIBXGTH-UHFFFAOYSA-N 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 238000004393 prognosis Methods 0.000 description 1
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 1
- 230000009822 protein phosphorylation Effects 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 238000002271 resection Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 210000000130 stem cell Anatomy 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 229940037128 systemic glucocorticoids Drugs 0.000 description 1
- 238000011200 topical administration Methods 0.000 description 1
- 230000029663 wound healing Effects 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P20/00—Technologies relating to chemical industry
- Y02P20/50—Improvements relating to the production of bulk chemicals
- Y02P20/52—Improvements relating to the production of bulk chemicals using catalysts, e.g. selective catalysts
Landscapes
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
Description
技术领域technical field
本发明涉及基因工程领域,具体地,本发明涉及一种人基因核苷酸序列。更具体地说,本发明涉及一种人丝氨酸/苏氨酸激酶( human serine/ threonine kinase a,hSTKa)及其转录本hSTKc的cDNA序列。本发明还涉及由上述核苷酸序列编码的多肽,这些多核苷酸和多肽的应用及生产方法The present invention relates to the field of genetic engineering, in particular, the present invention relates to a human gene nucleotide sequence. More specifically, the present invention relates to a cDNA sequence of a human serine/threonine kinase ( human serine / threonine kinase a, hSTKa) and its transcript hSTKc. The present invention also relates to polypeptides encoded by the above nucleotide sequences, applications and production methods of these polynucleotides and polypeptides
背景技术Background technique
众所周知,生物中广泛存在着复杂的信号传递途径。每个细胞通过表面的受体从外界接收信号,然后经过一系列复杂的生物化学途径,将信号从质膜传递到胞内。而蛋白质的磷酸化和去磷酸化就是其中最重要的生物化学途径之一( Bioorg Med Chem.1997 Sep;5(9):1751-1773.)。It is well known that complex signaling pathways exist widely in organisms. Each cell receives signals from the outside world through receptors on the surface, and then transmits the signals from the plasma membrane to the inside of the cell through a series of complex biochemical pathways. Protein phosphorylation and dephosphorylation is one of the most important biochemical pathways ( Bioorg Med Chem. 1997 Sep; 5(9):1751-1773.).
蛋白激酶的主要作用是催化蛋白质的磷酸化,总体上包括蛋白质的丝氨酸/苏氨酸激酶(serine/threonine kinase,简称“STK”)和蛋白质的酪氨酸激酶两大类。蛋白激酶在生物体内发挥着重要的功能,其在生物体内主要是通过蛋白的磷酸化或者通过与起调节作用的小分子结合,在翻译后水平上对蛋白发挥正常功能进行调控的。丝氨酸/苏氨酸蛋白激酶的表达受到糖皮质激素、生长因子、细胞体积及细胞损伤等各种因素的影响。The main function of protein kinases is to catalyze the phosphorylation of proteins, which generally include protein serine/threonine kinases (serine/threonine kinase, referred to as "STK") and protein tyrosine kinases. Protein kinases play an important role in living organisms. In living organisms, they mainly regulate the normal function of proteins at the post-translational level through phosphorylation of proteins or binding to small molecules that play a regulatory role. The expression of serine/threonine protein kinase is affected by various factors such as glucocorticoids, growth factors, cell volume and cell damage.
STK在进化上非常保守,其结构上尤其在其催化功能语中有许多保守区域。如催化区的N端为甘氨酸富集段,该富集段附近还有一个赖氨酸,研究显示该区与ATP的结合有关。催化区中心是保守的天冬氨酸,它与激酶的催化活性密切相关。丝氨酸/苏氨酸蛋白激酶广泛存在于生物体的各个部分中,包括各种组织、器官和血清中,参与细胞发挥正常功能所必须的众多过程,如:基因的转录,翻译,蛋白的分泌及细胞的移动和细胞器的生物合成等。STK is very conservative in evolution, and there are many conserved regions in its structure, especially in its catalytic function. For example, the N-terminus of the catalytic region is a glycine-rich segment, and there is a lysine near the rich segment. Studies have shown that this region is related to the binding of ATP. The center of the catalytic domain is the conserved aspartic acid, which is closely related to the catalytic activity of the kinase. Serine/threonine protein kinase widely exists in various parts of organisms, including various tissues, organs and serum, and participates in many processes necessary for normal cell function, such as: gene transcription, translation, protein secretion and Cell movement and organelle biosynthesis, etc.
由于丝氨酸/苏氨酸蛋白激酶在细胞生理活动中所起的基础性作用,人们对其,尤其是该家族成员在人类中的表达作用情况的研究日渐广泛和深入。1994年,Small D等人克隆了人干细胞STK-1并研究了其在正常人骨髓中的表达情况,结果表明,STK-1可能是造血干细胞的生长因子受体(Proc Natl Acad Sci 1994,91(2):459-463)。1996年,Chen,H.,等人研究了人皮肤微脉管内皮细胞STK-1基因组第九和第十个外显子的和突变情况。1998年,Li X.等人报道克隆得到了STK-2。Brodbeck,D.,Nakatani,K.,Masure,S.,等人克隆并鉴定了了RAC-gamma serine/threonine protein kinase(Akt-3)(J.Biol.Chem.274(14),9133-9136(1999);Biochem.Biophys.Res.Commun.257(3),906-910(1999);Eur.J.Biochem.265(1),353-360(1999)。2001年Stanchi,F.等人在分离与酵母序列相似的人类基因时还得到了一个由603个氨基酸残基组成的丝氨酸/苏氨酸蛋白激酶,Genbank登录号为CAA07196,进一步证明丝氨酸/苏氨酸蛋白激酶对细胞生长的必需性(Yeast,18(1),69-80)。Due to the fundamental role played by serine/threonine protein kinases in the physiological activities of cells, the researches on them, especially the expression of the members of this family in humans, are becoming more and more extensive and in-depth. In 1994, Small D and others cloned human stem cell STK-1 and studied its expression in normal human bone marrow. The results showed that STK-1 may be a growth factor receptor for hematopoietic stem cells (Proc Natl Acad Sci 1994, 91 (2): 459-463). In 1996, Chen, H., et al. studied the mutations of the ninth and tenth exons of STK-1 genome in human skin microvascular endothelial cells. In 1998, Li X. et al. reported that STK-2 was cloned. Brodbeck, D., Nakatani, K., Masure, S., et al. cloned and identified RAC-gamma serine/threonine protein kinase (Akt-3) (J.Biol.Chem.274(14), 9133-9136 (1999); Biochem.Biophys.Res.Commun.257(3), 906-910(1999); Eur.J.Biochem.265(1), 353-360(1999). 2001 Stanchi, F. et al. A serine/threonine protein kinase composed of 603 amino acid residues was also obtained when the human gene similar to the yeast sequence was isolated, and the Genbank accession number is CAA07196, which further proves that the serine/threonine protein kinase is necessary for cell growth Sex (Yeast, 18(1), 69-80).
然而,在本发明之前,尚没有任何人公开过本发明所述的人丝氨酸/苏氨酸激酶hSTKa和hSTKc。However, prior to the present invention, no one has disclosed the human serine/threonine kinases hSTKa and hSTKc described in the present invention.
发明内容Contents of the invention
本发明的目的之一是提供一种多核苷酸序列,该多核苷酸序列编码一种人丝氨酸/苏氨酸蛋白激酶a( human serine/ threonine kinase a,hSTKa)。One of the objects of the present invention is to provide a polynucleotide sequence encoding a human serine/threonine kinase a ( human serine / threonine kinase a, hSTKa).
本发明的目的之二是提供一种多核苷酸序列,该多核苷酸序列编码一种人丝氨酸/苏氨酸蛋白激酶b( human serine/ threonine kinase c,hSTKc),它是上述hSTKa的转录剪切本。The second object of the present invention is to provide a polynucleotide sequence, which encodes a human serine/threonine kinase b ( human serine / threonine kinase c, hSTKc), which is the above-mentioned Transcript splicing of hSTKa.
本发明的目的之三是提供一种蛋白,该蛋白被命名为人丝氨酸/苏氨酸蛋白激酶a( human serine/ threonine kinase a,hSTKa)。The third object of the present invention is to provide a protein named human serine/threonine kinase a ( human serine / threonine kinase a, hSTKa).
本发明的目的之四是提供一种蛋白,该蛋白被命名为人丝氨酸/苏氨酸蛋白激酶c( human serine/ threonine kinase c,hSTKc)。The fourth object of the present invention is to provide a protein named human serine/threonine kinase c ( human serine / threonine kinase c, hSTKc).
本发明的目的之五是提供一种利用重组技术生产所述的hSTKa和hSTKc蛋白的方法。The fifth object of the present invention is to provide a method for producing the hSTKa and hSTKc proteins using recombinant technology.
本发明的目的之六是提供hSTKa和hSTKc核酸序列和蛋白的应用。The sixth object of the present invention is to provide the application of hSTKa and hSTKc nucleic acid sequences and proteins.
在本发明提供了一种分离出的DNA分子,它包括:编码具有人hSTK蛋白活性的多肽的核苷酸序列,所述的核苷酸序列与SEQ ID NO.1中的核苷酸序列有至少70%的同源性;或者所述的核苷酸序列能在中度严紧条件下与SEQ ID NO.1中核苷酸序列杂交。较佳地,所述的序列编码一多肽,该多肽具有SEQ ID NO.2或SEQ ID NO.4所示的序列。更佳地,该序列具有SEQ ID NO.1或SEQ ID NO.3中所示的核苷酸序列。The present invention provides an isolated DNA molecule, which includes: a nucleotide sequence encoding a polypeptide having human hSTK protein activity, and the nucleotide sequence has the same nucleotide sequence as that in SEQ ID NO.1 At least 70% homology; or the nucleotide sequence can hybridize with the nucleotide sequence in SEQ ID NO.1 under moderately stringent conditions. Preferably, said sequence encodes a polypeptide having the sequence shown in SEQ ID NO.2 or SEQ ID NO.4. More preferably, the sequence has the nucleotide sequence shown in SEQ ID NO.1 or SEQ ID NO.3.
在本发明的另一方面,提供了一种分离的hSTK蛋白多肽,它包括:具有SEQ ID NO.2或SEQ ID NO.4氨基酸序列的多肽、或其活性片段,或其活性衍生物。较佳地,该多肽是具有SEQ ID NO.2或SEQ ID NO.4序列的多肽。In another aspect of the present invention, an isolated hSTK protein polypeptide is provided, which includes: a polypeptide having an amino acid sequence of SEQ ID NO.2 or SEQ ID NO.4, or an active fragment thereof, or an active derivative thereof. Preferably, the polypeptide is a polypeptide having the sequence of SEQ ID NO.2 or SEQ ID NO.4.
在本发明的另一方面,提供了一种载体,它含有上述分离出的DNA。In another aspect of the present invention, there is provided a vector comprising the above isolated DNA.
在本发明的另一方面,提供了一种所述载体转化的宿主细胞。In another aspect of the present invention, a host cell transformed with the vector is provided.
在本发明的另一方面,提供了一种产生具有hSTK蛋白活性的多肽的方法,该方法包括:In another aspect of the present invention, a method for producing a polypeptide having hSTK protein activity is provided, the method comprising:
(a)将编码具有hSTK蛋白活性的多肽的核苷酸序列可操作地连于表达调控序列,形成hSTK蛋白表达载体,所述的核苷酸序列与SEQ ID NO.1或SEQ ID NO.3中核苷酸序列有至少70%的同源性;(a) The nucleotide sequence encoding the polypeptide having hSTK protein activity is operably connected to the expression control sequence to form an hSTK protein expression vector, and the nucleotide sequence is the same as SEQ ID NO.1 or SEQ ID NO.3 There is at least 70% homology of nucleotide sequences among them;
(b)将步骤(a)中的表达载体转入宿主细胞,形成hSTK蛋白的重组细胞;(b) transferring the expression vector in step (a) into a host cell to form a recombinant cell of the hSTK protein;
(c)在适合表达hSTK蛋白多肽的条件下,培养步骤(b)中的重组细胞;(c) cultivating the recombinant cells in step (b) under conditions suitable for expressing the hSTK protein polypeptide;
(d)分离出具有hSTK蛋白活性的多肽。(d) isolating a polypeptide having hSTK protein activity.
在本发明中,“分离的”、“纯化的”或“基本纯的”DNA是指,该DNA或片段已从天然状态下位于其两侧的序列中分离出来,还指该DNA或片段已经与天然状态下伴随核酸的组份分开,而且已经与在细胞中伴随其的蛋白质分开。In the present invention, "isolated", "purified" or "substantially pure" DNA means that the DNA or fragment has been separated from the sequences flanking it in its natural state, and also means that the DNA or fragment has been Separated from the components that naturally accompany nucleic acids and that have been separated from the proteins that accompany them in cells.
在本发明中,术语“hSTK蛋白(或多肽)编码序列”指编码具有hSTK蛋白活性的多肽的核苷酸序列,如SEQ ID NO.1中3-2009位核苷酸序列及其简并序列。该简并序列是指,位于SEQ ID NO.1序列的编码框3-2009位核苷酸中,有一个或多个密码子被编码相同氨基酸的简并密码子所取代后而产生的序列。由于密码子的简并性,所以与SEQ ID NO.1中3-2009位核苷酸序列同源性低至约70%的简并序列也能编码出SEQ ID NO.2所述的序列。该术语还包括能在中度严紧条件下,更佳地在高度严紧条件下与SEQ ID NO.1中从核苷酸3-2009位的核苷酸序列杂交的核苷酸序列。此外,该术语还包括与SEQ ID NO.1中从核苷酸3-2009位的核苷酸序列的同源性至少70%,较佳地至少80%,更佳地至少90%的核苷酸序列。In the present invention, the term "hSTK protein (or polypeptide) coding sequence" refers to a nucleotide sequence encoding a polypeptide having hSTK protein activity, such as the 3-2009 nucleotide sequence and its degenerate sequence in SEQ ID NO.1 . The degenerate sequence refers to the sequence generated after one or more codons are replaced by degenerate codons encoding the same amino acid in the 3-2009 nucleotides of the coding frame of the sequence of SEQ ID NO.1. Due to the degeneracy of codons, a degenerate sequence with a homology as low as about 70% of the 3-2009 nucleotide sequence in SEQ ID NO.1 can also encode the sequence described in SEQ ID NO.2. The term also includes a nucleotide sequence capable of hybridizing to the nucleotide sequence from nucleotide 3-2009 in SEQ ID NO.1 under moderately stringent conditions, preferably under highly stringent conditions. In addition, the term also includes at least 70%, preferably at least 80%, more preferably at least 90% of the nucleosides of the nucleotide sequence from nucleotide 3-2009 in SEQ ID NO.1 acid sequence.
该术语还包括能编码具有与人hSTK相同功能的蛋白的、SEQ ID NO.1序列的变异形式。这些变异形式包括(但并不限于):若干个(通常为1-90个,较佳地1-60个,更佳地1-20个,最佳地1-10个)核苷酸的缺失、插入和/或取代,以及在5’和/或3’端添加数个(通常为60个以内,较佳地为30个以内,更佳地为10个以内,最佳地为5个以内)核苷酸。The term also includes variants of the sequence of SEQ ID NO. 1 that encode a protein having the same function as human hSTK. These variations include (but are not limited to): the deletion of several (usually 1-90, preferably 1-60, more preferably 1-20, and most preferably 1-10) nucleotides , insertion and/or substitution, and addition of several (usually within 60, preferably within 30, more preferably within 10, and most preferably within 5) at the 5' and/or 3' end ) nucleotides.
在本发明中,“基本纯的”蛋白质或多肽是指其至少占样品总物质的至少20%,较佳地至少50%,更佳地至少80%,最佳地至少90%(按干重或湿重计)。纯度可以用任何合适的方法进行测量,如用柱层析、PAGE或HPLC法测量多肽的纯度。基本纯的多肽基本上不含天然状态下的伴随其的组分。In the present invention, "substantially pure" protein or polypeptide means that it accounts for at least 20%, preferably at least 50%, more preferably at least 80%, and most preferably at least 90% (by dry weight) of the total substance of the sample. or wet weight). Purity can be measured by any suitable method, such as measuring the purity of the polypeptide by column chromatography, PAGE or HPLC. A substantially pure polypeptide is substantially free of components that accompany it in its native state.
在本发明中,术语“hSTK蛋白多肽”指具有hSTK蛋白活性的SEQ IDNO.2或SEQ IDNO.4序列的多肽。该术语还包括具有与人hSTK相同功能的、SEQ ID NO.2或SEQ ID NO.4序列的变异形式。这些变异形式包括(但并不限于):若干个(通常为1-50个,较佳地1-30个,更佳地1-20个,最佳地1-10个)氨基酸的缺失、插入和/或取代,以及在C末端和/或N末端添加一个或数个(通常为20个以内,较佳地为10个以内,更佳地为5个以内)氨基酸。例如,在本领域中,用性能相近或相似的氨基酸进行取代时,通常不会改变蛋白质的功能。又比如,在C末端和/或N末端添加一个或数个氨基酸通常也不会改变蛋白质的功能。该术语还包括hSTK蛋白的活性片段和活性衍生物。In the present invention, the term "hSTK protein polypeptide" refers to a polypeptide of SEQ ID NO.2 or SEQ ID NO.4 sequence having hSTK protein activity. The term also includes variant forms of the sequence of SEQ ID NO. 2 or SEQ ID NO. 4 that have the same function as human hSTK. These variations include (but are not limited to): deletions and insertions of several (usually 1-50, preferably 1-30, more preferably 1-20, and most preferably 1-10) amino acids and/or substitution, and addition of one or several (usually within 20, preferably within 10, more preferably within 5) amino acids at the C-terminal and/or N-terminal. For example, in the art, substitutions with amino acids with similar or similar properties generally do not change the function of the protein. As another example, adding one or several amino acids at the C-terminus and/or N-terminus usually does not change the function of the protein. The term also includes active fragments and active derivatives of hSTK proteins.
该多肽的变异形式包括:同源序列、等位变异体、天然突变体、诱导突变体、在高或低的严谨度条件下能与本发明的hSTK DNA杂交的DNA所编码的蛋白、以及利用抗hSTK多肽的抗血清获得的多肽或蛋白。本发明还提供了其他多肽,如包含hSTK多肽或其片段的融合蛋白。除了几乎全长的多肽外,本发明还包括了hSTK多肽的可溶性片段。通常,该片段具有hSTK多肽序列的至少约10个连续氨基酸,通常至少约30个连续氨基酸,较佳地至少约50个连续氨基酸,更佳地至少约80个连续氨基酸,最佳地至少约100个连续氨基酸。Variants of the polypeptide include: homologous sequences, allelic variants, natural mutants, induced mutants, proteins encoded by DNA that can hybridize with the hSTK DNA of the present invention under high or low stringency conditions, and using Polypeptide or protein obtained by antiserum against hSTK polypeptide. The invention also provides other polypeptides, such as fusion proteins comprising hSTK polypeptides or fragments thereof. In addition to substantially full-length polypeptides, the present invention also includes soluble fragments of hSTK polypeptides. Typically, the fragment has at least about 10 contiguous amino acids, usually at least about 30 contiguous amino acids, preferably at least about 50 contiguous amino acids, more preferably at least about 80 contiguous amino acids, and most preferably at least about 100 contiguous amino acids of the hSTK polypeptide sequence. consecutive amino acids.
本发明所指的hSTK蛋白或多肽的类似物与天然hSTK多肽的差别可以是氨基酸序列上的差异,也可以是不影响序列的修饰形式上的差异,或者兼而有之。这些多肽包括天然或诱导的遗传变异体。诱导变异体可以通过各种技术得到,如通过辐射或暴露于诱变剂而产生随机诱变,还可通过定点诱变法或其他已知分子生物学的技术。类似物还包括具有不同于天然L-氨基酸的残基(如D-氨基酸)的类似物,以及具有非天然存在的或合成的氨基酸(如β、γ-氨基酸)的类似物。应理解,本发明的多肽并不限于上述例举的代表性的多肽。The difference between the analog of hSTK protein or polypeptide referred to in the present invention and the natural hSTK polypeptide may be the difference in amino acid sequence, or the difference in modified form that does not affect the sequence, or both. These polypeptides include natural or induced genetic variants. Induced variants can be obtained by various techniques, such as random mutagenesis by radiation or exposure to mutagens, but also by site-directed mutagenesis or other techniques known in molecular biology. Analogs also include analogs with residues other than natural L-amino acids (eg, D-amino acids), and analogs with non-naturally occurring or synthetic amino acids (eg, β, γ-amino acids). It should be understood that the polypeptides of the present invention are not limited to the representative polypeptides exemplified above.
修饰(通常不改变一级结构)形式包括:体内或体外的多肽的化学衍生形式如乙酰化或羧基化。修饰还包括糖基化,如那些在多肽的合成和加工中或进一步加工步骤中进行糖基化修饰而产生的多肽。这种修饰可以通过将多肽暴露于进行糖基化的酶(如哺乳动物的糖基化酶或去糖基化酶)而完成。修饰形式还包括具有磷酸化氨基酸残基(如磷酸酪氨酸,磷酸丝氨酸,磷酸苏氨酸)的序列。还包括被修饰从而提高了其抗蛋白水解性能或优化了溶解性能的多肽。Modified (usually without altering primary structure) forms include: chemically derivatized forms of polypeptides such as acetylation or carboxylation, in vivo or in vitro. Modifications also include glycosylation, such as those resulting from polypeptides that are modified by glycosylation during synthesis and processing of the polypeptide or during further processing steps. Such modification can be accomplished by exposing the polypeptide to an enzyme that performs glycosylation, such as a mammalian glycosylase or deglycosylation enzyme. Modified forms also include sequences with phosphorylated amino acid residues (eg, phosphotyrosine, phosphoserine, phosphothreonine). Also included are polypeptides that have been modified to increase their resistance to proteolysis or to optimize solubility.
本发明还包括hSTK多肽编码序列的反义序列。这种反义序列可用于抑制细胞内hSTK的表达。The invention also includes antisense sequences of hSTK polypeptide coding sequences. Such antisense sequences can be used to inhibit the expression of hSTK in cells.
本发明还包括一种探针分子,该分子通常具有hSTK多肽编码序列的8-100个,较佳地15-50个连续核苷酸。该探针可用于检测样品中是否存在编码hSTK的核酸分子。The present invention also includes a probe molecule, which generally has 8-100, preferably 15-50 contiguous nucleotides of the hSTK polypeptide coding sequence. The probe can be used to detect whether there is a nucleic acid molecule encoding hSTK in a sample.
本发明还包括检测hSTK核苷酸序列的方法,它包括用上述的探针与样品进行杂交,然后检测探针是否发生了结合。较佳地,该样品是PCR扩增后的产物,其中PCR扩增引物对应于hSTK多肽的编码序列,并可位于该编码序列的两侧或中间。引物长度一般为20-50个核苷酸。The present invention also includes a method for detecting hSTK nucleotide sequence, which comprises using the above-mentioned probe to hybridize with a sample, and then detecting whether the probe is combined. Preferably, the sample is a product of PCR amplification, wherein the PCR amplification primers correspond to the coding sequence of the hSTK polypeptide and can be located on both sides or in the middle of the coding sequence. Primers are generally 20-50 nucleotides in length.
在本发明中,可选用本领域已知的各种载体,如市售的载体。In the present invention, various vectors known in the art, such as commercially available vectors, can be used.
在本发明中,术语“宿主细胞”包括原核细胞和真核细胞。常用的原核宿主细胞的例子包括大肠杆菌、枯草杆菌等。常用的真核宿主细胞包括酵母细胞,昆虫细胞、和哺乳动物细胞。较佳地,该宿主细胞是真核细胞,如CHO细胞、COS细胞等。In the present invention, the term "host cell" includes prokaryotic cells and eukaryotic cells. Examples of commonly used prokaryotic host cells include Escherichia coli, Bacillus subtilis, and the like. Commonly used eukaryotic host cells include yeast cells, insect cells, and mammalian cells. Preferably, the host cells are eukaryotic cells, such as CHO cells, COS cells and the like.
另一方面,本发明还包括对hSTK DNA或是其片段编码的多肽具有特异性的多克隆抗体和单克隆抗体,尤其是单克隆抗体。这里,“特异性”是指抗体能结合于hSTK基因产物或片段。较佳地,指那些能与hSTK基因产物或片段结合但不识别和结合于其它非相关抗原分子的抗体。本发明中抗体包括那些能够结合并抑制hSTK蛋白的分子,也包括那些并不影响hSTK蛋白功能的抗体。本发明还包括那些能与修饰或未经修饰形式的hSTK基因产物结合的抗体。On the other hand, the present invention also includes polyclonal antibodies and monoclonal antibodies, especially monoclonal antibodies, specific for hSTK DNA or polypeptides encoded by its fragments. Here, "specificity" means that the antibody can bind to hSTK gene product or fragment. Preferably, it refers to those antibodies that can bind to hSTK gene products or fragments but do not recognize and bind to other irrelevant antigen molecules. Antibodies in the present invention include those molecules capable of binding and inhibiting hSTK protein, as well as those antibodies that do not affect the function of hSTK protein. The invention also includes antibodies that bind to modified or unmodified forms of the hSTK gene product.
本发明不仅包括完整的单克隆或多克隆抗体,而且还包括具有免疫活性的抗体片段,如Fab′或(Fab)2片段;抗体重链;抗体轻链;遗传工程改造的单链Fv分子(Ladner等人,美国专利No.4,946,778);或嵌合抗体,如具有鼠抗体结合特异性但仍保留来自人的抗体部分的抗体。The present invention includes not only complete monoclonal or polyclonal antibodies, but also immunologically active antibody fragments, such as Fab' or (Fab) 2 fragments; antibody heavy chains; antibody light chains; genetically engineered single-chain Fv molecules ( Ladner et al., US Patent No. 4,946,778); or chimeric antibodies, such as antibodies that have the binding specificity of a murine antibody but retain portions of the antibody from humans.
本发明的抗体可以通过本领域内技术人员已知的各种技术进行制备。例如,纯化的hSTK基因产物或者其具有抗原性的片段,可被施用于动物以诱导多克隆抗体的产生。与之相似的,表达hSTK或其具有抗原性的片段的细胞可用来免疫动物来生产抗体。本发明的抗体也可以是单克隆抗体。此类单克隆抗体可以利用杂交瘤技术来制备(见Kohler等人,Nature 256;495,1975;Kohler等人, Eur.J.Immunol.6:511,1976;Kohler等人, Eur.J.Immunol.6:292,1976;Hammerling等人. In Monoclonal Antibodies and T Cell Hybridomas,Elsevier,N.Y,1981)。本发明的抗体包括能阻断hSTK功能的抗体以及不影响hSTK功能的抗体。本发明的各类抗体可以利用hSTK基因产物的片段或功能区,通过常规免疫技术获得。这些片段或功能区可以利用重组方法制备或利用多肽合成仪合成。与hSTK基因产物的未修饰形式结合的抗体可以用原核细胞(例如E.Coli)中生产的基因产物来免疫动物而产生;与翻译后修饰形式结合的抗体(如糖基化或磷酸化的蛋白或多肽),可以用真核细胞(例如酵母或昆虫细胞)中产生的基因产物来免疫动物而获得。Antibodies of the present invention can be prepared by various techniques known to those skilled in the art. For example, purified hSTK gene products, or antigenic fragments thereof, can be administered to animals to induce polyclonal antibody production. Similarly, cells expressing hSTK or antigenic fragments thereof can be used to immunize animals to produce antibodies. Antibodies of the invention may also be monoclonal antibodies. Such monoclonal antibodies can be prepared using hybridoma technology (see Kohler et al., Nature 256; 495, 1975; Kohler et al., Eur.J. Immunol. 6:511, 1976; Kohler et al., Eur.J. Immunol . 6:292, 1976; Hammerling et al. In Monoclonal Antibodies and T Cell Hybridomas , Elsevier, NY, 1981). The antibodies of the present invention include antibodies capable of blocking the function of hSTK and antibodies that do not affect the function of hSTK. Various antibodies of the present invention can be obtained by conventional immunization techniques using fragments or functional regions of hSTK gene products. These fragments or functional regions can be prepared using recombinant methods or synthesized using a polypeptide synthesizer. Antibodies that bind to unmodified forms of hSTK gene products can be produced by immunizing animals with gene products produced in prokaryotic cells (e.g., E. coli); antibodies that bind to post-translationally modified forms (such as glycosylated or phosphorylated proteins or polypeptides), which can be obtained by immunizing animals with gene products produced in eukaryotic cells (eg, yeast or insect cells).
在本发明中,人hSTKa的cDNA核苷酸序列是如此获得的,以人睾丸λgt10cDNA文库(购自Clontech公司)为模板,用两对寡核苷酸为引物:A1 CGATGACATCGACGGGGAAGGAC;A2 CAACAA GCTGCTG CAGGACCTG为正向引物;B1:CAGGTCCTCATC CTCCTGGCTCG;B2 GATGGCCTC GTGGAGGT GACATG为反向引物,进行PCR。PCR条件为93℃4分钟,随之以93℃1分钟、64℃1分钟和72℃1分钟进行35个循环,最后72℃延伸5分钟。电泳检测得到的PCR片断,大小分别为约一千和一千一百个碱基对的片段及杂质。去除杂质,提高所需核苷酸片段的浓度,分别用PshAI酶切得到的片段,并连接,最后得到目的片段。In the present invention, the cDNA nucleotide sequence of human hSTKa is obtained in this way, using the human testis λgt10 cDNA library (purchased from Clontech Company) as a template, and using two pairs of oligonucleotides as primers: A1 CGATGACATCGACGGGGAAGGAC; A2 CAACAA GCTGCTG CAGGACCTG is Forward primer; B1: CAGGTCCTCATC CTCCTGGCTCG; B2 GATGGCCTC GTGGAGGT GACATG is the reverse primer for PCR. The PCR conditions were 93°C for 4 minutes, followed by 35 cycles of 93°C for 1 minute, 64°C for 1 minute and 72°C for 1 minute, and finally 72°C for 5 minutes. The PCR fragments detected by electrophoresis are fragments and impurities of about 1,000 and 1,100 base pairs, respectively. Remove impurities, increase the concentration of the required nucleotide fragments, digest the obtained fragments with PshAI respectively, and connect them, and finally obtain the target fragments.
以人睾丸λgt10cDNA文库(购自Clontech公司)为模板,用两对寡核苷酸为引物:A1CGATGACATCG ACGGGGAAGGAC;A2 CAACAA GCTGCTG CAGGACCTG为正向引物;B1:CA GGTCCTCATC CTCCTGGCTCG;B2 GCTAGTG TGTCTAAGGCTGCTCG为反向引物,进行PCR。PCR条件为93℃4分钟,随之以93℃1分钟、64℃1分钟和72℃1分钟进行35个循环,最后72℃延伸5分钟。电泳检测得到的PCR片断,大小分别为约一千和一千三百个碱基对的片段及杂质。去除杂质,提高所需核苷酸片段的浓度,分别用PshAI酶切得到的片段,并连接,最后得到目的片段。Using the human testis λgt10 cDNA library (purchased from Clontech) as a template, two pairs of oligonucleotides were used as primers: A1CGATGACATCG ACGGGGAAGGAC; A2 CAACAA GCTGCTG CAGGACCTG as forward primers; B1: CA GGTCCTCATC CTCCTGGCTCG; B2 GCTAGTG TGTCTAAGGCTGCTCG as reverse primers, Perform PCR. The PCR conditions were 93°C for 4 minutes, followed by 35 cycles of 93°C for 1 minute, 64°C for 1 minute and 72°C for 1 minute, and finally 72°C for 5 minutes. The PCR fragments detected by electrophoresis are fragments and impurities of about 1,000 and 1,300 base pairs, respectively. Remove impurities, increase the concentration of the required nucleotide fragments, digest the obtained fragments with PshAI respectively, and connect them, and finally obtain the target fragments.
本发明的hSTKa和hSTKc是由同一DNA转录翻译而来,只是在转录过程中由于转录产物的剪切拼接方式不同,故而产生了不同的蛋白,但其在序列和制备方法上有很大的相似性。The hSTKa and hSTKc of the present invention are transcribed and translated from the same DNA, but different proteins are produced due to the different cutting and splicing methods of the transcripts during the transcription process, but they are very similar in sequence and preparation method sex.
用本发明的hSTKa和hSTKc蛋白在Non-redundant GenBank+EMBL+DDBJ+PDB数据库及Non-redundant GenBank CDS ranslations+PDB+SwissProt+Spupdate+PIR数据库中用BLAST软件进行核酸和蛋白同源检索。结果发现它与某些已鉴定的丝氨酸/苏氨酸激酶蛋白家族成员(如CAA64382、JC1446和CAA07196)都有一定同源性,并且其结构中具有丝氨酸/苏氨酸激酶蛋白家族成员的特征功能域。Use the hSTKa and hSTKc proteins of the present invention in the Non-redundant GenBank+EMBL+DDBJ+PDB database and the Non-redundant GenBank CDS translations+PDB+SwissProt+Spupdate+PIR database to perform nucleic acid and protein homology searches using BLAST software. It was found that it has certain homology with some identified serine/threonine kinase protein family members (such as CAA64382, JC1446 and CAA07196), and its structure has the characteristic functions of serine/threonine kinase protein family members area.
综上所述,本发明的hSTKa和hSTKc属于丝氨酸/苏氨酸激酶家族成员,具有该家族成员的共同特征,即可通过胞外的免疫球蛋白样结构与细胞因子接合,并通过其胞内部分的磷酸酯酶活性将细胞因子携带的信号传递至胞内,作用于靶分子,从而对细胞乃至生物体的生理活动提到调控作用。这些生理作用包括促进组织生长,调控细胞的生长和增殖,激活、抑制免疫细胞,刺激造血细胞,激活炎症反应,防止病毒、细菌入侵等。In summary, hSTKa and hSTKc of the present invention belong to serine/threonine kinase family members, and have the common characteristics of the family members, that is, they can bind cytokines through extracellular immunoglobulin-like structures and pass through their intracellular Part of the phosphatase activity transmits the signal carried by cytokines into the cell and acts on the target molecule, thereby regulating the physiological activities of cells and organisms. These physiological functions include promoting tissue growth, regulating cell growth and proliferation, activating and inhibiting immune cells, stimulating hematopoietic cells, activating inflammatory responses, and preventing virus and bacterial invasion.
将本发明的hSTKa和hSTKc核苷酸或氨基酸序列与医药上可接受的载体相连可以制成试剂、药物和试剂盒,以预防、诊断和治疗由于hSTKa或hSTKc表达不正常而引起的疾病。Linking hSTKa and hSTKc nucleotide or amino acid sequences of the present invention with pharmaceutically acceptable carriers can be used to make reagents, medicines and kits to prevent, diagnose and treat diseases caused by abnormal expression of hSTKa or hSTKc.
表1为hSTKa、hSTKac和CAA07196的氨基酸序列同源比较。其中,“*”表示三个序列中相应位置上的氨基酸相同,“:”表示三个序列中相应位置上的氨基酸虽不相同但其理化性质大致相同,“.”表示三个序列中相应位置上的氨基酸理化性质有一定相同性,“-″表示三个序列中相应位置上没有对应的氨基酸。Table 1 is the amino acid sequence homology comparison of hSTKa, hSTKac and CAA07196. Among them, " * " indicates that the amino acids at the corresponding positions in the three sequences are the same, ":" indicates that the amino acids at the corresponding positions in the three sequences are different but have roughly the same physical and chemical properties, and "." indicates the corresponding positions in the three sequences The physicochemical properties of the amino acids above have a certain identity, and "-" indicates that there is no corresponding amino acid at the corresponding position in the three sequences.
具体实施方式Detailed ways
实施例1Example 1
hSTKa的cDNA序列的克隆和测定Cloning and Determination of cDNA Sequence of hSTKa
(1).引物扩增(1).Primer amplification
以人睾丸λgt10cDNA文库(购自Clontech公司)为模板,用两对寡核苷酸为引物:A1CGATGACATCG ACGGGGAAGGAC;A2 CAACAA GCTGCTG CAGGACCTG为正向引物;B1:CA GGTCCTCATC CTCCTGGCTCG;B2 GATGGCCTC GTGGAGGTGACATG为反向引物,进行PCR。PCR条件为93℃4分钟,随之以93℃1分钟、64℃1分钟和72℃C1分钟进行35个循环,最后72℃延伸5分钟。电泳检测得到的PCR片断,大小分别为约一千和一千一百个碱基对的片段及杂质。去除杂质,提高所需核苷酸片段的浓度,分别用PshAI酶切得到的片段,并连接,最后得到目的片段。Using the human testis λgt10 cDNA library (purchased from Clontech) as a template, two pairs of oligonucleotides were used as primers: A1CGATGACATCG ACGGGGAAGGAC; A2 CAACAA GCTGCTG CAGGACCTG as forward primers; B1: CA GGTCCTCATC CTCCTGGCTCG; B2 GATGGCCTC GTGGAGGTGACATG as reverse primers, Perform PCR. The PCR condition was 93°C for 4 minutes, followed by 35 cycles of 93°C for 1 minute, 64°C for 1 minute and 72°C for 1 minute, and finally 72°C for 5 minutes. The PCR fragments detected by electrophoresis are fragments and impurities of about 1,000 and 1,100 base pairs, respectively. Remove impurities, increase the concentration of the required nucleotide fragments, digest the obtained fragments with PshAI respectively, and connect them, and finally obtain the target fragments.
(2).PCR产物的测序(2).Sequencing of PCR products
将PCR扩增产物与pGEM-TTM载体(Promega)连接,转化大肠杆菌JM103,用QIAprepPlasmid试剂盒(QIAGEN)提取质粒,用双链嵌套式缺失试剂盒(Pharmacia)对插入片段进行定向系列缺失,然后用PCR对缺失子进行快速鉴定及排序。用SequiTherm EXCELTM DNA测序试剂盒(Epicentre Technologies)对依次截短的缺失子进行测序,最后获得全长cDNA序列,共2023bp,详细序列见SEQ ID NO.1,其中开放读框位于3-2009位核苷酸。The PCR amplified product was ligated with the pGEM-T TM vector (Promega), transformed into Escherichia coli JM103, the plasmid was extracted with the QIAprepPlasmid kit (QIAGEN), and the inserted fragment was directional and serially deleted with the double-stranded nested deletion kit (Pharmacia) , and then quickly identify and sort the deletions by PCR. The sequentially truncated deletions were sequenced with the SequiTherm EXCEL TM DNA Sequencing Kit (Epicentre Technologies), and finally the full-length cDNA sequence was obtained, with a total of 2023 bp. The detailed sequence is shown in SEQ ID NO.1, where the open reading frame is located at positions 3-2009 Nucleotides.
根据得到的全长cDNA序列推导出hSTKa的氨基酸序列,共668个氨基酸残基,其氨基酸序列详见SEQ ID NO.2。The amino acid sequence of hSTKa was deduced according to the obtained full-length cDNA sequence, with a total of 668 amino acid residues, and its amino acid sequence is detailed in SEQ ID NO.2.
实施例2Example 2
hSTKa在大肠杆菌中的表达Expression of hSTKa in Escherichia coli
在该实施例中,将编码hSTKa的cDNA序列,用对应于该DNA序列的5′和3′端的PCR寡核苷酸引物进行扩增,获得hSTK a cDNA作为插入片段。In this example, the cDNA sequence encoding hSTKa was amplified with PCR oligonucleotide primers corresponding to the 5' and 3' ends of the DNA sequence to obtain hSTKa cDNA as an insert fragment.
PCR反应中使用的5′寡核苷酸引物序列为:The 5' oligonucleotide primer sequence used in the PCR reaction is:
5’-CCCCGGATCCATGACA TCGACGGGGA AGG,该引物含有BamHI限制性内切酶的酶切位点,在该酶切位点之后是由起始密码子开始的hSTKa的部分编码序列;5'-CCCCGGATCCATGACA TCGACGGGGA AGG, the primer contains the restriction endonuclease restriction endonuclease site of BamHI, after the restriction site is the partial coding sequence of hSTKa beginning with the initiation codon;
3′端引物序列为:The 3' end primer sequence is:
5’TGACAAGCTTACTTTTCG GGATAATTC,该引物含有HindIII限制性内切酶的酶切位点、翻译终止子和hSTKa的部分编码序列。5'TGACAAGCTTACTTTTCG GGATAATTC, the primer contains the restriction endonuclease restriction site of HindIII, translation terminator and partial coding sequence of hSTKa.
引物上的限制性内切酶的酶切位点对应于细菌表达载体pQE-9(Qiagen Inc.,Chatsworth,CA)上的限制性内切酶酶切位点,该质粒载体编码抗生素抗性(Ampr)、一个细菌复制起点(ori)、一个IPTG-可调启动子/操纵子(P/O)、一个核糖体结合位点(RBS)、一个6-组氨酸标记物(6-His)以及限制性内切酶克隆位点。The restriction endonuclease cutting sites on the primers correspond to the restriction endonuclease cutting sites on the bacterial expression vector pQE-9 (Qiagen Inc., Chatsworth, CA), which encodes antibiotic resistance ( Amp r ), a bacterial origin of replication (ori), an IPTG-regulated promoter/operator (P/O), a ribosome binding site (RBS), a 6-histidine tag (6-His ) and restriction enzyme cloning sites.
用BamHI和HindIII消化pQE-9载体和插入片段,随后将插入片段连接到pQE-9载体并保持开放读框在细菌RBS起始。随后用连接混合物转化购自Qiagen,商品名为M15/rep4的E.coli菌株,M15/rep4含有多拷贝的质粒pREP4,其表达lacI阻遏物并携带卡那霉素抗性(Kanr)。在含有Amp和Kan的LB培养皿上筛选转化子,抽提质粒,用PshAI酶切鉴定插入片段大小及方向,并测序验证hSTK的cDNA片段已正确插入了载体。The pQE-9 vector and insert were digested with BamHI and HindIII, and the insert was subsequently ligated into the pQE-9 vector keeping the open reading frame at the beginning of the bacterial RBS. The ligation mixture was subsequently used to transform an E. coli strain purchased from Qiagen under the tradename M15/rep4 containing multiple copies of the plasmid pREP4 expressing the lacI repressor and carrying kanamycin resistance (Kan r ). The transformants were screened on the LB culture dish containing Amp and Kan, the plasmid was extracted, the size and direction of the inserted fragment were identified by digestion with PshAI, and the cDNA fragment of hSTK was correctly inserted into the vector by sequencing.
在补加Amp(100μg/ml)和Kan(25μg/ml)的LB液体培养基中过夜培养(O/N)含所需构建物的阳性转化子克隆。过夜(O/N)培养物以1∶100-1∶250的稀释率稀释,然后接种到大体积培养基中,培养细胞生长至600光密度(OD600)为0.4-0.6时,加入IPTG(“异丙基硫代-β-D-半乳糖苷”)至终浓度为1mM。通过使lacI阻遏物失活,IPTG诱导启动P/O导致基因表达水平提高。继续培养细胞3-4小时,随后离心(6000×g,20分钟)。超声裂解包涵体,收集细胞并将细胞沉淀溶于6M的盐酸胍中。澄清后,通过在能使含6-His标记物蛋白紧密结合的条件下,用镍-螯合柱层析从溶液中纯化溶解的hSTK。用6M盐酸胍(pH5.0)从柱中洗脱hSTK。可用几种方法从盐酸胍中变性沉淀蛋白。或者使用透析步骤除去盐酸胍,或者从镍-螯合柱中分离出纯化蛋白,纯化后的蛋白可以结合到第二个柱中,该柱中具有递减的线性盐酸胍梯度。在结合到该柱时蛋白质变性,随后用盐酸胍(pH5.0)洗脱。最后,将可溶的蛋白质对含PBS进行透析,然后将蛋白质保存在终浓度为10%(w/v)甘油的贮存液中。Positive transformant clones containing the desired constructs were cultured overnight (O/N) in liquid LB medium supplemented with Amp (100 μg/ml) and Kan (25 μg/ml). The overnight (O/N) culture was diluted at a dilution rate of 1:100-1:250, and then inoculated into a large volume of culture medium. When the cultured cells grew to 600 optical density (OD 600 ) of 0.4-0.6, IPTG ( "Isopropylthio-β-D-galactoside") to a final concentration of 1 mM. By inactivating the lacI repressor, IPTG-induced initiation of P/O leads to increased gene expression levels. Cells were incubated for an additional 3-4 hours followed by centrifugation (6000 xg, 20 minutes). Inclusion bodies were lysed by sonication, cells were collected and the cell pellet was dissolved in 6M guanidine hydrochloride. After clarification, the solubilized hSTK was purified from solution by nickel-chelate column chromatography under conditions that allow tight binding of the 6-His tag-containing protein. hSTK was eluted from the column with 6M guanidine hydrochloride (pH 5.0). Several methods can be used to denature and precipitate proteins from guanidine hydrochloride. Either use a dialysis step to remove the guanidine hydrochloride, or isolate the purified protein from the nickel-chelation column, and the purified protein can be bound to a second column with a decreasing linear guanidine hydrochloride gradient. Proteins were denatured upon binding to the column and subsequently eluted with guanidine hydrochloride (pH 5.0). Finally, the soluble protein was dialyzed against PBS, and the protein was then stored in a stock solution with a final concentration of 10% (w/v) glycerol.
此外,用常规方法对表达蛋白的N端和C端各10个氨基酸长度的氨基酸进行测序,发现与SEQ ID NO.2的序列一致。In addition, the N-terminal and C-terminal 10 amino acid lengths of the expressed protein were sequenced by conventional methods, and it was found to be consistent with the sequence of SEQ ID NO.2.
实施例3Example 3
hSTKa在真核细胞(CHO细胞株)中的表达Expression of hSTKa in eukaryotic cells (CHO cell lines)
在该实施例中,将编码hSTKa的cDNA序列下列寡核苷酸引物进行扩增,获得hSTKacDNA作为插入片段。In this example, the cDNA sequence encoding hSTKa was amplified with the following oligonucleotide primers to obtain hSTKacDNA as an insert.
PCR反应中使用的5′寡核苷酸引物序列为:The 5' oligonucleotide primer sequence used in the PCR reaction is:
5’-CCCCAAGCTTATGACA TCGACGGGGA AGG5’-CCCCAAGCTTATGACA TCGACGGGGA AGG
该引物含有HindIII限制性内切酶的酶切位点,在该酶切位点之后是由起始密码子开始的hSTKa的部分编码序列;The primer contains a HindIII restriction endonuclease enzyme cutting site, followed by the partial coding sequence of hSTKa starting from the start codon after the enzyme cutting site;
3′端引物序列为:The 3' end primer sequence is:
5’-TGACGGATCCACTTTTCG GGATAATTC5’-TGACGGATCCACTTTTCG GGATAATTC
该引物含有BamHI限制性内切酶的酶切位点、一个翻译终止子和hSTKa的部分编码序列。The primer contains the cutting site of BamHI restriction endonuclease, a translation terminator and the partial coding sequence of hSTKa.
引物上的限制性内切酶的酶切位点对应于CHO细胞表达载体pcDNA3上的限制性内切酶酶切位点,该质粒载体编码抗生素抗性(Ampr和Neor)、一个噬菌体复制起点(f1 ori)、一个病毒复制起点(SV40 ori)、一个T7启动子、一个病毒启动子(P-CMV)、一个Sp6启动子、一个SV40启动子、一个SV40加尾信号和相应的polyA顺序、一个BGH加尾信号和相应的polyA顺序。The restriction endonuclease cutting sites on the primers correspond to the restriction endonuclease cutting sites on the CHO cell expression vector pcDNA3, which encodes antibiotic resistance (Amp r and Neo r ), a phage replication Origin (f1 ori), a viral origin of replication (SV40 ori), a T7 promoter, a viral promoter (P-CMV), a Sp6 promoter, an SV40 promoter, an SV40 tailing signal and the corresponding polyA sequence , a BGH tailed signal and the corresponding polyA sequence.
用HindIII及BamHI消化pcDNA3载体和插入片段,随后将插入片段连接到pcDNA3载体。随后用连接混合物转化E.coli DH5α菌株。在含有Amp的LB培养皿上筛选转化子,在补加Amp(100μg/ml)的LB液体培养基中过夜培养(O/N)含所需构建物的克隆。抽提质粒,用PshAI酶切鉴定插入片段大小及方向,并测序验证hSTKa的cDNA片段已正确插入了载体。The pcDNA3 vector and insert were digested with HindIII and BamHI, and then the insert was ligated into the pcDNA3 vector. The E. coli DH5α strain was subsequently transformed with the ligation mixture. Transformants were screened on LB dishes containing Amp, and clones containing the desired construct were cultured overnight (O/N) in LB liquid medium supplemented with Amp (100 μg/ml). The plasmid was extracted, digested with PshAI to identify the size and direction of the inserted fragment, and sequenced to verify that the cDNA fragment of hSTKa had been correctly inserted into the vector.
质粒转染CHO细胞是用脂转染法,用Lipofectin(GiBco Life)进行,转染48小时后,经2-3周的持续G418加压筛选,收集细胞及细胞上清测定表达蛋白酶活力。去G418,连续传代培养;对混合克隆细胞极限稀释,选择具有较高蛋白活性的细胞亚克隆。按常规方法大量培养上述阳性亚克隆。48小时后,开始收集细胞及上清,用超声裂解方法破碎细胞。以含0.05%Triton的50mMTris·HCl(pH7.6)溶液为平衡液及洗脱液,用经预平衡的SuperdexG-75柱收集上述蛋白的活性峰。再用50mMTris·HCl(pH8.0)平衡的DEAE-Sepharose柱,以含0-1MNaCl的50mMTris·HCl(pH8.0)溶液为洗脱液进行梯度洗脱,收集上述蛋白的活性峰。然后以PBS(pH7.4)为透析液对表达蛋白溶液进行透析。最后冻干保存。Plasmid transfection into CHO cells was performed by lipofection with Lipofectin (GiBco Life). After 48 hours of transfection, after 2-3 weeks of continuous G418 pressurized selection, the cells were collected and the supernatant of the cells was assayed for protease activity. G418 was removed and subcultured continuously; for extreme dilution of mixed clone cells, select cell subclones with higher protein activity. The above-mentioned positive subclones were cultured in large quantities according to conventional methods. After 48 hours, the cells and supernatant were collected, and the cells were disrupted by ultrasonic lysis. Using 50 mM Tris·HCl (pH 7.6) solution containing 0.05% Triton as the balance and eluent, the activity peaks of the above proteins were collected with a pre-balanced SuperdexG-75 column. A DEAE-Sepharose column equilibrated with 50mM Tris HCl (pH8.0) was used to carry out gradient elution with 50mM Tris HCl (pH8.0) solution containing 0-1M NaCl as the eluent, and the activity peak of the above protein was collected. Then, the expressed protein solution was dialyzed with PBS (pH7.4) as the dialysate. Finally freeze-dried and stored.
此外,用常规方法对表达蛋白的N端和C端各10个氨基酸长度的氨基酸进行测序,发现与SEQ ID NO.2的序列一致。In addition, the N-terminal and C-terminal 10 amino acid lengths of the expressed protein were sequenced by conventional methods, and it was found to be consistent with the sequence of SEQ ID NO.2.
实施例4Example 4
hSTKc的cDNA序列的克隆和测定Cloning and Determination of cDNA Sequence of hSTKc
1.引物扩增1. Primer amplification
以人睾丸λgt10cDNA文库(购自Clontech公司)为模板,用两对寡核苷酸为引物:A1CGATGACATCG ACGGGGAAGGAC;A2 CAACAA GCTGCTG CAGGACCTG为正向引物;B1:CA GGTCCTCATC CTCCTGGCTCG;B2 GCTAGTG TGTCTAAGGCTGCTCG为反向引物,进行PCR。PCR条件为93℃4分钟,随之以93℃1分钟、64℃1分钟和72℃1分钟进行35个循环,最后72℃延伸5分钟。电泳检测得到的PCR片断,大小分别为约一千和一千三百个碱基对的片段及杂质。去除杂质,提高所需核苷酸片段的浓度,分别用PshAI酶切得到的片段,并连接,最后得到目的片段。Using the human testis λgt10 cDNA library (purchased from Clontech) as a template, two pairs of oligonucleotides were used as primers: A1CGATGACATCG ACGGGGAAGGAC; A2 CAACAA GCTGCTG CAGGACCTG as forward primers; B1: CA GGTCCTCATC CTCCTGGCTCG; B2 GCTAGTG TGTCTAAGGCTGCTCG as reverse primers, Perform PCR. The PCR conditions were 93°C for 4 minutes, followed by 35 cycles of 93°C for 1 minute, 64°C for 1 minute and 72°C for 1 minute, and finally 72°C for 5 minutes. The PCR fragments detected by electrophoresis are fragments and impurities of about 1,000 and 1,300 base pairs, respectively. Remove impurities, increase the concentration of the required nucleotide fragments, digest the obtained fragments with PshAI respectively, and connect them, and finally obtain the target fragments.
2.PCR产物的测序2. Sequencing of PCR products
将PCR扩增产物与pGEM-TTM载体(Promega)连接,转化大肠杆菌JM103,用QIAprepPlasmid试剂盒(QIAGEN)提取质粒,用双链嵌套式缺失试剂盒(Pharmacia)对插入片段进行定向系列缺失,然后用PCR对缺失子进行快速鉴定及排序。用SequiTherm EXCELTM DNA测序试剂盒(Epicentre Technologies)对依次截短的缺失子进行测序,最后获得全长cDNA序列,共2223bp,详细序列见SEQ ID NO.3,其中开放读框位于3-2009位核苷酸。The PCR amplified product was ligated with the pGEM-T TM vector (Promega), transformed into Escherichia coli JM103, the plasmid was extracted with the QIAprepPlasmid kit (QIAGEN), and the inserted fragment was directional and serially deleted with the double-stranded nested deletion kit (Pharmacia) , and then quickly identify and sort the deletions by PCR. The sequentially truncated deletions were sequenced with SequiTherm EXCEL TM DNA Sequencing Kit (Epicentre Technologies), and finally the full-length cDNA sequence was obtained, with a total of 2223 bp. The detailed sequence is shown in SEQ ID NO.3, where the open reading frame is located at position 3-2009 Nucleotides.
根据得到的全长cDNA序列推导出hSTKc的氨基酸序列,共736个氨基酸残基,其氨基酸序列详见SEQ ID NO.4。The amino acid sequence of hSTKc was deduced according to the obtained full-length cDNA sequence, with a total of 736 amino acid residues, and its amino acid sequence is shown in SEQ ID NO.4.
实施例5Example 5
hSTKc在大肠杆菌中的表达Expression of hSTKc in Escherichia coli
在该实施例中,将编码hSTKc的cDNA序列,用对应于该DNA序列的5′和3′端的PCR寡核苷酸引物进行扩增,获得hSTKc cDNA作为插入片段。In this example, the cDNA sequence encoding hSTKc was amplified with PCR oligonucleotide primers corresponding to the 5' and 3' ends of the DNA sequence to obtain hSTKc cDNA as an insert.
PCR反应中使用的5′寡核苷酸引物序列为:The 5' oligonucleotide primer sequence used in the PCR reaction is:
5’-CCCCGGATCCATGACA TCGACGGGGA AGG,该引物含有BamHI限制性内切酶的酶切位点,在该酶切位点之后是由起始密码子开始的hSTKc的部分编码序列;5'-CCCCGGATCCATGACA TCGACGGGGA AGG, the primer contains the restriction endonuclease restriction site of BamHI, after the restriction site is the partial coding sequence of hSTKc beginning with the initiation codon;
3′端引物序列为:The 3' end primer sequence is:
5’-AGTGAAGCTTAG GCTGCTCGCG GCGGGCG,该引物含有HindIII限制性内切酶的酶切位点、翻译终止子和hSTKc的部分编码序列。5'-AGTGAAGCTTAG GCTGCTCGCG GCGGGCG, the primer contains the restriction endonuclease restriction site of HindIII, translation terminator and partial coding sequence of hSTKc.
引物上的限制性内切酶的酶切位点对应于细菌表达载体pQE-9(Qiagen Inc.,Chatsworth,CA)上的限制性内切酶酶切位点,该质粒载体编码抗生素抗性(Ampr)、一个细菌复制起点(ori)、一个IPTG-可调启动子/操纵子(P/O)、一个核糖体结合位点(RBS)、一个6-组氨酸标记物(6-His)以及限制性内切酶克隆位点。The restriction endonuclease cutting sites on the primers correspond to the restriction endonuclease cutting sites on the bacterial expression vector pQE-9 (Qiagen Inc., Chatsworth, CA), which encodes antibiotic resistance ( Amp r ), a bacterial origin of replication (ori), an IPTG-regulated promoter/operator (P/O), a ribosome binding site (RBS), a 6-histidine tag (6-His ) and restriction enzyme cloning sites.
用BamHI和HindIII消化pQE-9载体和插入片段,随后将插入片段连接到pQE-9载体并保持开放读框在细菌RBS起始。随后用连接混合物转化购自Qiagen,商品名为M15/rep4的E.coli菌株,M15/rep4含有多拷贝的质粒pREP4,其表达lacI阻遏物并携带卡那霉素抗性(Kanr)。在含有Amp和Kan的LB培养皿上筛选转化子,抽提质粒,用PshAI酶切鉴定插入片段大小及方向,并测序验证hSTKc的cDNA片段已正确插入了载体。The pQE-9 vector and insert were digested with BamHI and HindIII, and the insert was subsequently ligated into the pQE-9 vector keeping the open reading frame at the beginning of the bacterial RBS. The ligation mixture was subsequently used to transform an E. coli strain purchased from Qiagen under the tradename M15/rep4 containing multiple copies of the plasmid pREP4 expressing the lacI repressor and carrying kanamycin resistance (Kan r ). The transformants were screened on the LB culture dish containing Amp and Kan, the plasmid was extracted, the size and direction of the inserted fragment were identified by digestion with PshAI, and the cDNA fragment of hSTKc was correctly inserted into the vector by sequencing.
在补加Amp(100μg/ml)和Kan(25μg/ml)的LB液体培养基中过夜培养(O/N)含所需构建物的阳性转化子克隆。过夜(O/N)培养物以1∶100-1∶250的稀释率稀释,然后接种到大体积培养基中,培养细胞生长至600光密度(OD600)为0.4-0.6时,加入IPTG(“异丙基硫代-β-D-半乳糖苷”)至终浓度为1mM。通过使lacI阻遏物失活,IPTG诱导启动P/O导致基因表达水平提高。继续培养细胞3-4小时,随后离心(6000×g,20分钟)。超声裂解包涵体,收集细胞并将细胞沉淀溶于6M的盐酸胍中。澄清后,通过在能使含6-His标记物蛋白紧密结合的条件下,用镍-螯合柱层析从溶液中纯化溶解的hSTK。用6M盐酸胍(pH5.0)从柱中洗脱hSTK。可用几种方法从盐酸胍中变性沉淀蛋白。或者使用透析步骤除去盐酸胍,或者从镍-螯合柱中分离出纯化蛋白,纯化后的蛋白可以结合到第二个柱中,该柱中具有递减的线性盐酸胍梯度。在结合到该柱时蛋白质变性,随后用盐酸胍(pH5.0)洗脱。最后,将可溶的蛋白质对含PBS进行透析,然后将蛋白质保存在终浓度为10%(w/v)甘油的贮存液中。Positive transformant clones containing the desired constructs were cultured overnight (O/N) in liquid LB medium supplemented with Amp (100 μg/ml) and Kan (25 μg/ml). The overnight (O/N) culture was diluted at a dilution rate of 1:100-1:250, and then inoculated into a large volume of culture medium. When the cultured cells grew to 600 optical density (OD 600 ) of 0.4-0.6, IPTG ( "Isopropylthio-β-D-galactoside") to a final concentration of 1 mM. By inactivating the lacI repressor, IPTG-induced initiation of P/O leads to increased gene expression levels. Cells were incubated for an additional 3-4 hours followed by centrifugation (6000 xg, 20 minutes). Inclusion bodies were lysed by sonication, cells were collected and the cell pellet was dissolved in 6M guanidine hydrochloride. After clarification, the solubilized hSTK was purified from solution by nickel-chelate column chromatography under conditions that allow tight binding of the 6-His tag-containing protein. hSTK was eluted from the column with 6M guanidine hydrochloride (pH 5.0). Several methods can be used to denature and precipitate proteins from guanidine hydrochloride. Either use a dialysis step to remove the guanidine hydrochloride, or isolate the purified protein from the nickel-chelation column, and the purified protein can be bound to a second column with a decreasing linear guanidine hydrochloride gradient. Proteins were denatured upon binding to the column and subsequently eluted with guanidine hydrochloride (pH 5.0). Finally, the soluble protein was dialyzed against PBS, and the protein was then stored in a stock solution with a final concentration of 10% (w/v) glycerol.
此外,用常规方法对表达蛋白的N端和C端各10个氨基酸长度的氨基酸进行测序,发现与SEQ ID NO.4的序列一致。In addition, the N-terminal and C-terminal 10 amino acid lengths of the expressed protein were sequenced by conventional methods, and it was found to be consistent with the sequence of SEQ ID NO.4.
实施例6Example 6
hSTKc在真核细胞(CHO细胞株)中的表达Expression of hSTKc in eukaryotic cells (CHO cell lines)
在该实施例中,将编码hSTKc的cDNA序列下列寡核苷酸引物进行扩增,获得hSTKccDNA作为插入片段。In this example, the cDNA sequence encoding hSTKc was amplified with the following oligonucleotide primers to obtain hSTKccDNA as an insert.
PCR反应中使用的5′寡核苷酸引物序列为:The 5' oligonucleotide primer sequence used in the PCR reaction is:
5’-CCCCAAGCTTATGACA TCGACGGGGA AGG5’-CCCCAAGCTTATGACA TCGACGGGGA AGG
该引物含有HindIII限制性内切酶的酶切位点,在该酶切位点之后是由起始密码子开始的hSTKc的部分编码序列;The primer contains a HindIII restriction endonuclease enzyme cutting site, followed by a partial coding sequence of hSTKc beginning with a start codon after the enzyme cutting site;
3′端引物序列为:The 3' end primer sequence is:
5’-AG TGGGATCCAG GCTGCTCGCG GCGGGCG5’-AG TGGGATCCAG GCTGCTCGCG GCGGGCG
该引物含有BamHI限制性内切酶的酶切位点、一个翻译终止子和hSTKc的部分编码序列。The primer contains the cutting site of BamHI restriction endonuclease, a translation terminator and the partial coding sequence of hSTKc.
引物上的限制性内切酶的酶切位点对应于CHO细胞表达载体pcDNA3上的限制性内切酶酶切位点,该质粒载体编码抗生素抗性(Ampr和Neor)、一个噬菌体复制起点(f1 ori)、一个病毒复制起点(SV40ori)、一个T7启动子、一个病毒启动子(P-CMV)、一个Sp6启动子、一个SV40启动子、一个SV40加尾信号和相应的polyA顺序、一个BGH加尾信号和相应的polyA顺序。The restriction endonuclease cutting sites on the primers correspond to the restriction endonuclease cutting sites on the CHO cell expression vector pcDNA3, which encodes antibiotic resistance (Amp r and Neo r ), a phage replication origin (f1 ori), a viral origin of replication (SV40ori), a T7 promoter, a viral promoter (P-CMV), a Sp6 promoter, an SV40 promoter, an SV40 tailing signal and the corresponding polyA sequence, A BGH tailing signal and corresponding polyA sequence.
用HindIII及BamHI消化pcDNA3载体和插入片段,随后将插入片段连接到pcDNA3载体。随后用连接混合物转化E.coli DH5α菌株。在含有Amp的LB培养皿上筛选转化子,在补加Amp(100μg/ml)的LB液体培养基中过夜培养(O/N)含所需构建物的克隆。抽提质粒,用PshAI酶切鉴定插入片段大小及方向,并测序验证hSTKc的cDNA片段已正确插入了载体。The pcDNA3 vector and insert were digested with HindIII and BamHI, and then the insert was ligated into the pcDNA3 vector. The E. coli DH5α strain was subsequently transformed with the ligation mix. Transformants were screened on LB dishes containing Amp, and clones containing the desired construct were cultured overnight (O/N) in LB liquid medium supplemented with Amp (100 μg/ml). The plasmid was extracted, digested with PshAI to identify the size and direction of the inserted fragment, and sequenced to verify that the cDNA fragment of hSTKc had been correctly inserted into the vector.
质粒转染CHO细胞是用脂转染法,用Lipofectin(GiBco Life)进行,转染48小时后,经2-3周的持续G418加压筛选,收集细胞及细胞上清测定表达蛋白酶活力。去G418,连续传代培养;对混合克隆细胞极限稀释,选择具有较高蛋白活性的细胞亚克隆。按常规方法大量培养上述阳性亚克隆。48小时后,开始收集细胞及上清,用超声裂解方法破碎细胞。以含0.05%Triton的50mMTris·HCl(pH7.6)溶液为平衡液及洗脱液,用经预平衡的SuperdexG-75柱收集上述蛋白的活性峰。再用50mMTris·HCl(pH8.0)平衡的DEAE-Sepharose柱,以含0-1M NaCl的50mMTris·HCl(pH8.0)溶液为洗脱液进行梯度洗脱,收集上述蛋白的活性峰。然后以PBS(pH7.4)为透析液对表达蛋白溶液进行透析。最后冻干保存。Plasmid transfection into CHO cells was performed by lipofection with Lipofectin (GiBco Life). After 48 hours of transfection, after 2-3 weeks of continuous G418 pressurized selection, the cells were collected and the supernatant of the cells was assayed for protease activity. G418 was removed and subcultured continuously; for extreme dilution of mixed clone cells, select cell subclones with higher protein activity. The above-mentioned positive subclones were cultured in large quantities according to conventional methods. After 48 hours, the cells and supernatant were collected, and the cells were disrupted by ultrasonic lysis. Using 50 mM Tris·HCl (pH 7.6) solution containing 0.05% Triton as the balance and eluent, the activity peaks of the above proteins were collected with a pre-balanced SuperdexG-75 column. Then use a DEAE-Sepharose column equilibrated with 50mM Tris HCl (pH8.0) to carry out gradient elution with 50mM Tris HCl (pH8.0) solution containing 0-1M NaCl as the eluent, and collect the activity peak of the above protein. Then, the expressed protein solution was dialyzed with PBS (pH7.4) as the dialysate. Finally freeze-dried and stored.
此外,用常规方法对表达蛋白的N端和C端各10个氨基酸长度的氨基酸进行测序,发现与SEQ ID NO.4的序列一致。In addition, the N-terminal and C-terminal 10 amino acid lengths of the expressed protein were sequenced by conventional methods, and it was found to be consistent with the sequence of SEQ ID NO.4.
实施例7Example 7
同源比较homologous comparison
用本发明的hSTKa和hSTKc蛋白(SEQ ID NO.2和SEQ ID NO.4)在Non-redundantGenBank+EMBL+DDBJ+PDB数据库及Non-redundant GenBank CDSranslations+PDB+SwissProt+Spupdate+PIR数据库中用BLAST软件进行核酸和蛋白同源检索。结果发现它与某些已鉴定的丝氨酸/苏氨酸激酶蛋白家族成员(如CAA64382、JC1446和CAA07196)都有一定同源性,并且其结构中具有丝氨酸/苏氨酸激酶蛋白家族成员的结构特征。Use hSTKa of the present invention and hSTKc albumen (SEQ ID NO.2 and SEQ ID NO.4) in Non-redundant GenBank+EMBL+DDBJ+PDB database and Non-redundant GenBank CDStranslations+PDB+SwissProt+Spupdate+PIR database with BLAST The software performs nucleic acid and protein homology searches. It was found that it has certain homology with some identified serine/threonine kinase protein family members (such as CAA64382, JC1446 and CAA07196), and its structure has the structural characteristics of serine/threonine kinase protein family members .
丝氨酸/苏氨酸激酶蛋白家族成员的结构特征为[LIVMFYC]-x-[HY]-x-D-[LIVMFY]-K-x(2)-N-[LIVMFYCT](3).其中,[LIVMFYC]是指从LIVMFYC其中氨基酸中任选一个,x为任意氨基酸,x(2)为任意2个氨基酸。本发明的蛋白属于该结构如下:The structural features of serine/threonine kinase protein family members are [LIVMFYC]-x-[HY]-x-D-[LIVMFY]-K-x(2)-N-[LIVMFYCT](3). Among them, [LIVMFYC] means Choose one of the amino acids from LIVMFYC, x is any amino acid, and x(2) is any 2 amino acids. The proteins of the present invention belong to this structure as follows:
hSTKa:RLEKTLGKGQTGLVKLGVHCVTCQKVAIKIVNREKLSESVLMKVEREIAILKLIEHPHVLKLHDVYENKKYLYLVLEHVSGGELFDYLVKKGRLTPKEARKFFRQIISALDFCHSHSICHRDLKPENLLLDEKNNIRIADFGMASLQVGDSLLETSCGSPHYACPEVIRGEKYDGRKADVWSCGVILFALLVGALPFDDDNLRQLLEKVKRGVFHMPHFIPPDCQSLLRGMSEVDAARRLTLEHIQKHIWY(SEQ ID NO.2中19-270);hSTKa:RLEKTLGKGQTGLVKLGVHCVTCQKVAIKIVNREKLSESVLMKVEREIAILKLIEHPHVLKLHDVYENKKYLYLVLEHVSGGELFDYLVKKGRLTPKEARKFFRQIISALDFCHSHSICHRDLKPENLLLDEKNNIRIADFGMASLQVGDSLLETSCGSPHYACPEVIRGEKYDGRKADVWSCGVILFALLVGALPFDDDNLRQLLEKVKRGVFHMPHFIPPDCQSLLRGMSEVDAARRLTLEHIQKHIWY(SEQ ID NO.2中19-270);
hSTKc:RLEKTLGKGQTGLVKLGVHCVTCQKVAIKIVNREKLSESVLMKVEREIAILKLIEHPHVLKLHDVYENKKYLYLVLEHVSGGELFDYLVKKGRLTPKEARKFFRQIISALDFCHSHSICHRDLKPENLLLDEKNNIRIADFGMASLQVGDSLLETSCGSPHYACPEVIRGEKYDGRKADVWSCGVILFALLVGALPFDDDNLRQLLEKVKRGVFHMPHFIPPDCQSLLRGMSEVDAARRLTLEHIQKHIWY(SEQ ID NO.4中19-270)( htttp://smart.embl-heidelberg.de/smart/show motifs.pl)hSTKc:RLEKTLGKGQTGLVKLGVHCVTCQKVAIKIVNREKLSESVLMKVEREIAILKLIEHPHVLKLHDVYENKKYLYLVLEHVSGGELFDYLVKKGRLTPKEARKFFRQIISALDFCHSHSICHRDLKPENLLLDEKNNIRIADFGMASLQVGDSLLETSCGSPHYACPEVIRGEKYDGRKADVWSCGVILFALLVGALPFDDDNLRQLLEKVKRGVFHMPHFIPPDCQSLLRGMSEVDAARRLTLEHIQKHIWY(SEQ ID NO.4中19-270)( htttp://smart.embl-heidelberg.de/smart/show motifs.pl )
综上所述,本发明的hSTKa和hSTKc属于丝氨酸/苏氨酸激酶家族成员,具有该家族成员的共同特征,即可通过胞外的免疫球蛋白样结构与细胞因子接合,并通过其胞内部分的磷酸酯酶活性将细胞因子携带的信号传递至胞内,作用于靶分子,从而对细胞乃至生物体的生理活动提到调控作用。这些生理作用包括促进组织生长,调控细胞的生长和增殖,激活、抑制免疫细胞,刺激造血细胞,激活炎症反应,防止病毒、细菌入侵等。In summary, hSTKa and hSTKc of the present invention belong to serine/threonine kinase family members, and have the common characteristics of the family members, that is, they can bind cytokines through extracellular immunoglobulin-like structures and pass through their intracellular Part of the phosphatase activity transmits the signal carried by cytokines into the cell and acts on the target molecule, thereby regulating the physiological activities of cells and organisms. These physiological functions include promoting tissue growth, regulating cell growth and proliferation, activating and inhibiting immune cells, stimulating hematopoietic cells, activating inflammatory responses, and preventing virus and bacterial invasion.
本发明的hSTKa和hSTKc除了可作为该家族一员用于进一步的功能研究,还可用于与其他蛋白一起产生融合蛋白,比如与免疫球蛋白一起产生融合蛋白。此外,本发明hSTKa和hSTKc还可以与该家族的其他成员进行融合或交换片段,以产生新的蛋白,例如可将本发明hSTKa和hSTKc的N端与鼠STK的N端进行交换,以产生新的活性更高或具有新特性的蛋白。The hSTKa and hSTKc of the present invention can be used as members of the family for further functional research, and can also be used to produce fusion proteins with other proteins, such as immunoglobulins to produce fusion proteins. In addition, hSTKa and hSTKc of the present invention can also be fused or exchanged fragments with other members of the family to produce new proteins, for example, the N-terminal of hSTKa and hSTKc of the present invention can be exchanged with the N-terminal of mouse STK to produce new proteins Proteins with higher activity or new properties.
针对本发明的hSTKa和hSTKc的抗体,用于筛选该家族的其他成员,或者用于亲和纯化相关蛋白(如该家族的其他成员)。Antibodies against hSTKa and hSTKc of the present invention are used for screening other members of the family, or for affinity purification of related proteins (such as other members of the family).
实施例8Example 8
制备抗体Antibody preparation
将本发明的hSTKa和hSTKc重组蛋白用来免疫动物以产生抗体,具体如下。重组分子用层析法进行分离后备用。也可用SDS-PAGE凝胶电泳法进行分离,将电泳条带从凝胶中切下,并用等体积的完全Freund’s佐剂乳化。用50-100μg/0.2ml乳化过的蛋白,对小鼠进行腹膜内注射。14天后,用非完全Freund’s佐剂乳化的同样抗原,对小鼠以50-100μg/0.2ml的剂量进行腹膜内注射以加强免疫。每隔14天进行一次加强免疫,至少进行三次。获得的抗血清的特异反应活性用它在体外沉淀hSTK基因翻译产物的能力加以评估。The hSTKa and hSTKc recombinant proteins of the present invention are used to immunize animals to produce antibodies, as follows. The recombinant molecules are separated by chromatography for further use. It can also be separated by SDS-PAGE gel electrophoresis, and the electrophoresis bands are excised from the gel and emulsified with an equal volume of complete Freund's adjuvant. Mice were injected intraperitoneally with 50-100 [mu]g/0.2 ml emulsified protein. Fourteen days later, mice were boosted by intraperitoneal injection of the same antigen emulsified with incomplete Freund's adjuvant at a dose of 50-100 µg/0.2 ml. Give booster immunizations at least three times every 14 days. The specific reactivity of the obtained antiserum was assessed by its ability to precipitate the hSTK gene translation product in vitro.
实施例9Example 9
作为微矩阵的底物as a substrate for microarrays
微矩阵,又称为DNA芯片。通过使用一些著名的生物软件(如LASERGENESOFTWARE(DNASTAR),来选择将hSTK全长cDNA、EST、或基因片段作为微矩阵的底物样品。然后通过使用喷墨技术及一系列化学方法如射线、化学、热力学、机械方法等把样品固定在载体上,如玻璃上,得到芯片(Schena,M.et al.(1995)Science 270:467-470;andShalon,D.et al.(1996)Genome Res.6:639-645.),形成的元件有点状、条形等等。一块典型的芯片通常含有一定数量的元件。杂交反应后,洗去未结合的探针,然后用扫描仪来检测发生杂交反应的元件及反应的程度。探针与每一个芯片元件的互补性和结合量都可通过对扫描出的图像的分析来判断。Microarray, also known as DNA chip. By using some well-known biological software (such as LASERGENESOFTWARE (DNASTAR), select hSTK full-length cDNA, EST, or gene fragments as the substrate sample of the microarray. Then by using inkjet technology and a series of chemical methods such as radiation, chemical Fix the sample on the carrier, such as glass, to obtain a chip (Schena, M. et al. (1995) Science 270: 467-470; and Shalon, D. et al. (1996) Genome Res. 6:639-645.), the formed elements are point-like, strip-shaped, etc. A typical chip usually contains a certain number of elements. After the hybridization reaction, unbound probes are washed away, and then a scanner is used to detect hybridization The components of the reaction and the degree of the reaction. The complementarity and binding amount of the probe to each chip component can be judged by analyzing the scanned images.
杂交结果可用于检测出hSTKa和hSTKc基因变异、突变及多态现象,理解由于hSTKa和hSTKc表达不正常引起的疾病的分子基础,诊断疾病,并可改进及监测相关药剂的活性。Hybridization results can be used to detect hSTKa and hSTKc gene variation, mutation and polymorphism, understand the molecular basis of diseases caused by abnormal expression of hSTKa and hSTKc, diagnose diseases, and improve and monitor the activity of related drugs.
实施例10:Example 10:
试剂盒的制备Kit preparation
试剂盒1:Kit 1:
它含有(1)特异性扩增hSTKa和hSTKc的引物对和使用说明。该试剂盒还可含有或不含有将mRNA逆转录成cDNA的所需试剂。其中,该特异性引物的序列衍生自SEQ IDNO:1的序列。对逆转录成的cDNA进行特异性扩增,以检测样品中是否含有hSTKa和hSTKc的核酸序列。该试剂盒还可含有进行PCR反应所需的其他试剂,例如缓冲液等。It contains (1) primer pairs and instructions for specific amplification of hSTKa and hSTKc. The kit may or may not contain the reagents required to reverse transcribe mRNA into cDNA. Wherein, the sequence of the specific primer is derived from the sequence of SEQ ID NO:1. The reverse transcribed cDNA is specifically amplified to detect whether the sample contains the nucleic acid sequences of hSTKa and hSTKc. The kit can also contain other reagents required for performing PCR reactions, such as buffers and the like.
此外,较佳地,至少一个所用的引物跨越了两个外显子,因为此时对应于hSTKa和hSTKc的基因组序列将不产生扩增产物。Furthermore, it is preferred that at least one of the primers used spans two exons, since then the genomic sequences corresponding to hSTKa and hSTKc will not give rise to amplification products.
试剂盒2:Kit 2:
该试剂盒含有针对hSTKa和hSTKc的特异性的抗体(例如实施例5中制备的多克隆抗体,或者用标准的杂交瘤技术产生的抗hSTKa和hSTKc的单克隆抗体),和使用说明。该试剂盒用于直接检测样品中是否存在或缺失hSTKa和hSTKc蛋白。先通过特异性免疫反应形成免疫复合物,然后用常规技术检测免疫复合物。The kit contains specific antibodies against hSTKa and hSTKc (such as polyclonal antibodies prepared in Example 5, or monoclonal antibodies against hSTKa and hSTKc produced by standard hybridoma technology), and instructions for use. This kit is used to directly detect the presence or absence of hSTKa and hSTKc proteins in samples. Immune complexes are first formed by specific immune reactions, and then detected by conventional techniques.
试剂盒3:Kit 3:
该试剂盒含有可特异性地与hSTKa和hSTKc的mRNA杂交的探针。它还可含有或不含有杂交缓冲液。该试剂盒通过核酸分子与hSTKa和hSTKc特异性探针之间的杂交反应来检测样品中是否存在hSTKa和hSTKc的核酸分子。The kit contains probes that specifically hybridize to hSTKa and hSTKc mRNA. It may or may not also contain a hybridization buffer. The kit detects whether there are nucleic acid molecules of hSTKa and hSTKc in the sample through the hybridization reaction between the nucleic acid molecules and hSTKa and hSTKc specific probes.
试剂盒也可用于检测出基因变异、突变及多态现象,并检测出一系列与本发明的hSTK相关的基因,从而对相关疾病的诊断、治疗起辅助作用。The kit can also be used to detect gene variation, mutation and polymorphism, and detect a series of genes related to the hSTK of the present invention, thereby playing an auxiliary role in the diagnosis and treatment of related diseases.
实例11:Example 11:
制成药物made into medicine
本发明的hSTKa和hSTKc蛋白及其抗体、抑制剂、拮抗剂或受体等可作为药物,可促进烫伤烧伤、组织切除乃至器官摘除后的伤口愈合,防止伤口感染。将本发明的hSTKa和hSTKc核酸反义链或hSTKa和hSTKc抗体制成试剂盒可用于减轻或辅助治疗由于hSTKa和hSTKc表达量过高而导致的疾病,也可辅助临床上肿瘤和癌症的诊断,预防及治疗。The hSTKa and hSTKc proteins of the present invention and their antibodies, inhibitors, antagonists or receptors can be used as medicines, which can promote wound healing after burns, tissue resection and even organ removal, and prevent wound infection. The hSTKa and hSTKc nucleic acid antisense chains or hSTKa and hSTKc antibodies of the present invention are made into kits, which can be used to alleviate or assist in the treatment of diseases caused by excessive expression of hSTKa and hSTKc, and can also assist in the clinical diagnosis of tumors and cancers. Prevention and treatment.
本发明蛋白及其抗体、抑制剂、拮抗剂或受体等,当在治疗上进行施用(给药)时,可提供不同的效果。通常,可将这些物质配制于无毒的、惰性的和药学上可接受的水性载体介质中,其中pH通常为约5-8,较佳地pH约为6-8,尽管pH值可随被配制物质的性质以及待治疗的病症而有所变化。配制好的药物组合物可以通过常规途径进行给药,其中包括(但并不限于):肌内、腹膜内、皮下、皮内、或局部给药。When the protein of the present invention and its antibody, inhibitor, antagonist, or receptor are administered (administered) therapeutically, various effects can be provided. Generally, these materials can be formulated in a non-toxic, inert and pharmaceutically acceptable aqueous carrier medium, wherein the pH is usually about 5-8, preferably about 6-8, although the pH value can be changed according to the Depending on the nature of the substance formulated and the condition to be treated. The formulated pharmaceutical composition can be administered by conventional routes, including (but not limited to): intramuscular, intraperitoneal, subcutaneous, intradermal, or topical administration.
此外,本发明hSTKa和hSTKc核酸(编码序列或反义序列)可以直接引入细胞,以提高hSTKa和hSTKc的表达水平或者抑制hSTKa和hSTKc的过度表达。本发明的hSTKa和hSTKc蛋白或其活性多肽片段可以施用于病人,以治疗或减轻因hSTKa和hSTKc缺失、无功能或异常而导致的有关病症。此外,还可以用基于本发明的核酸序列或抗体进行有关的诊断或预后判断。In addition, hSTKa and hSTKc nucleic acid (coding sequence or antisense sequence) of the present invention can be directly introduced into cells to increase the expression level of hSTKa and hSTKc or inhibit the overexpression of hSTKa and hSTKc. The hSTKa and hSTKc proteins or their active polypeptide fragments of the present invention can be administered to patients to treat or alleviate related diseases caused by hSTKa and hSTKc deletion, non-function or abnormality. In addition, the nucleic acid sequence or antibody based on the present invention can also be used for relevant diagnosis or prognosis.
当本发明的hSTKa和hSTKc蛋白多肽被用作药物时,治疗有效剂量通常至少约10微克/千克体重,而且在大多数情况下不超过约8毫克/千克体重,较佳地该剂量是约10微克/千克体重-约1毫克/千克体重。当然,具体剂量还应考虑给药途径、病人健康状况等因素,这些都是熟练医师技能范围之内的。When the hSTKa and hSTKc protein polypeptides of the present invention are used as medicines, the therapeutically effective dose is usually at least about 10 micrograms/kg body weight, and in most cases no more than about 8 mg/kg body weight, preferably the dose is about 10 mg/kg body weight. µg/kg body weight - about 1 mg/kg body weight. Of course, factors such as the route of administration and the health status of the patient should also be considered for the specific dosage, which are within the skill of skilled physicians.
序列表<210>1<211>2032<212>核苷酸<213>人类<220><221>编码序列<222>(3)..(2009)<223><400>1cg atg aca tcg acg ggg aag gac ggc ggc gcg cag cac gcg cag tat 47Met Thr Ser Thr Gly Lys Asp Gly Gly Ala Gln His Ala Gln Tyr1 5 10 15gtt ggg ccc tac cgg ctg gag aag acg ctg ggc aag ggg cag aca ggt 95Val Gly Pro Tyr Arg Leu Glu Lys Thr Leu Gly Lys Gly Gln Thr GlySEQUENCE LISTING <210>1<211>2032<212>nucleotide<213>human<220><221>coding sequence<222>(3)..(2009)<223><400>1cg atg aca tcg acg ggg aag gac ggc ggc gcg cag cac gcg cag tat 47Met Thr Ser Thr Gly Lys Asp Gly Gly Ala Gln His Ala Gln Tyr1 5 10 15gtt ggg ccc tac cgg ctg gag aag acg ctg ggc aag ggg cag aca ggt 95Val Gly Pro Tyr Arg Leu Glu Lys Thr Leu Gly Lys Gly Gln Thr Gly
20 25 30ctg gtg aag ctg ggg gtt cac tgc gtc acc tgc cag aag gtg gcc atc 143Leu Val Lys Leu Gly Val His Cys Val Thr Cys Gln Lys Val Ala Ile20 25 30CTG GTG AAG CTG GGG GGG GTT CAC GTC GTC ACC TGC CAG GCC ATC 143leu Val Lys Leu Gly Val Val THR CYS Val Ala Ile
35 40 45aag atc gtc aac cgt gag aag ctc agc gag tcg gtg ctg atg aag gtg 191Lys Ile Val Asn Arg Glu Lys Leu Ser Glu Ser Val Leu Met Lys Val35 40 45AAG ATC GAC CGT GAG AAG CTC GAG GAG GAG GAG GAG GAG GAG GAG GAG GAG GAG CTG AAG GTG 191lys Ile Val Asn ARG GLU LEU Ser Val Leu Met Lys Val Val Val
50 55 60gag cgg gag atc gcg atc ctg aag ctc att gag cac ccc cac gtc cta 239Glu Arg Glu Ile Ala Ile Leu Lys Leu Ile Glu His Pro His Val Leu50 55 60gag CGG GAG ATC GCG ATC CTG CTC CTC ATT GAG CAC CAC CAC CAC CAC CTC
65 70 75aag ctg cac gac gtt tat gaa aac aaa aaa tat ttg tac ctg gtg cta 287Lys Leu His Asp Val Tyr Glu Asn Lys Lys Tyr Leu Tyr Leu Val Leu80 85 90 95gaa cac gtg tca ggt ggt gag ctc ttc gac tac ctg gtg aag aag ggg 335Glu His Val Ser Gly Gly Glu Leu Phe Asp Tyr Leu Val Lys Lys Gly65 70 75aag ctg cac gac gtt tat gaa aac aaa aaa tat ttg tac ctg gtg cta 287Lys Leu His Asp Val Tyr Glu Asn Lys Lys Tyr Leu Tyr Leu Val Leu80 85 90 95gaa cac gtg tca ggt ggt gag ctc ttc gac tac ctg gtg aag aag ggg 335Glu His Val Ser Gly Gly Glu Leu Phe Asp Tyr Leu Val Lys Lys Gly
100 105 110agg ctg acg cct aag gag gct cgg aag ttc ttc cgg cag atc atc tct 383Arg Leu Thr Pro Lys Glu Ala Arg Lys Phe Phe Arg Gln Ile Ile Ser100 105 110AGG CTG ACG CCT AAG GAG GCT CGG AAG TTC TTC CGG CAG ATC ATC ATC TCT 383ARG Leu Thr Pro Lys Glu ARG LYS PHE PHE GLN ILE ILN iLe Ile iLn iLe PLN iLE PLN iLE PLU -
115 120 125gcg ctg gac ttc tgc cac agc cac tcc ata tgc cac agg gat ctg aaa 431Ala Leu Asp Phe Cys His Ser His Ser Ile Cys His Arg Asp Leu Lys115 120 125GCG CTG GAC TTC TGC CAC CAC CAC CAC ATA TGC CAC AGG GAT CTG AAA 431ALA Leu ASP PHE CYS Serle Cys
130 135 140cct gaa aac ctc ctg ctg gac gag aag aac aac atc cgc atc gca gac 479Pro Glu Asn Leu Leu Leu Asp Glu Lys Asn Asn Ile Arg Ile Ala Asp135 135 140cct GAA AAC CTC CTG CTG GAC GAG AAC AAC AAC ATC CGC ATC GCA GAC 479Pro Glu Leu Leu ASN Asn Ile ARG Ile Ala Ala Asp
145 150 155ttt ggc atg gcg tcc ctg cag gtt ggc gac agc ctg ttg gag acc agc 527Phe Gly Met Ala Ser Leu Gln Val Gly Asp Ser Leu Leu Glu Thr Ser160 165 170 175tgt ggg tcc ccc cac tac gcc tgc ccc gag gtg atc cgg ggg gag aag 575Cys Gly Ser Pro His Tyr Ala Cys Pro Glu Val Ile Arg Gly Glu Lys145 150 155ttt ggc atg gcg tcc ctg cag gtt ggc gac agc ctg ttg gag acc agc 527Phe Gly Met Ala Ser Leu Gln Val Gly Asp Ser Leu Leu Glu Thr Ser160 165 170 175tgt ggg tcc ccc cac tac gcc tgc ccc gag gtg atc cgg ggg gag aag 575Cys Gly Ser Pro His Tyr Ala Cys Pro Glu Val Ile Arg Gly Glu Lys
180 185 190tat gac ggc cgg aag gcg gac gtg tgg agc tgc ggc gtc atc ctg ttc 623Tyr Asp Gly Arg Lys Ala Asp Val Trp Ser Cys Gly Val Ile Leu Phe180 185 190TAT GAC GGC CGG AAG GCG GAC GAC GAC GTG AGC TGC GGC GGC GGC ATC CTG TTC 623TYR ARG LYS Ala Ala Ala's TrS Ge Leu Phes
195 200 205gcc ttg ctg gtg ggg gct ctg ccc ttc gac gat gac aac ttg cga cag 671Ala Leu Leu Val Gly Ala Leu Pro Phe Asp Asp Asp Asn Leu Arg Gln195 200 205GCC TTG CTG GGG GGG GGG GCT CTG CCC GAC GAC GAC GAC GAC AAC TTG CGA CGA 671ALA Leu Val Gly Ala PHE ASP ASN Leu ARG Gln Leu ARG Gln
210 215 220ctg ctg gag aag gtg aag cgg ggc gtg ttc cac atg ccg cac ttt atc 719Leu Leu Glu Lys Val Lys Arg Gly Val Phe His Met Pro His Phe Ile210 215 220CTG CTG GAG AAG GTG AAG CGG GGC GGC GTG TTC CAC CAC CCG CAC TTT ATC 719leu Leu Lys Val Val Phe His Met Pro His PHE Ile
225 230 235ccg ccc gac tgc cag agt ctg cta cgg ggc atg agc gag gtg gac gcc 767Pro Pro Asp Cys Gln Ser Leu Leu Arg Gly Met Ser Glu Val Asp Ala240 245 250 255gca cgc cgc ctc acg cta gag cac att cag aaa cac ata tgg tat ata 815Ala Arg Arg Leu Thr Leu Glu His Ile Gln Lys His Ile Trp Tyr Ile225 230 235ccg ccc gac tgc cag agt ctg cta cgg ggc atg agc gag gtg gac gcc 767Pro Pro Asp Cys Gln Ser Leu Leu Arg Gly Met Ser Glu Val Asp Ala240 245 250 255gca cgc cgc ctc acg cta gag cac att cag aaa cac ata tgg tat ata 815Ala Arg Arg Leu Thr Leu Glu His Ile Gln Lys His Ile Trp Tyr Ile
260 265 270ggg ggc aag aat gag ccc gaa cca gag cag ccc att cct cgc aag gtg 863Gly Gly Lys Asn Glu Pro Glu Pro Glu Gln Pro Ile Pro Arg Lys Val260 265 270GGG GGC AAG AAT GAG CCC GAA CCA GAG CAG CAG CAG CCC AAG GTG 863GLY GLY LYS ASN GLU PRO GLU Gln Pro Ile Pro Arg Lys Val
275 280 285cag atc cgc tcg ctg ccc agc ctg gag gac atc gac ccc gac gtg ctg 911Gln Ile Arg Ser Leu Pro Ser Leu Glu Asp Ile Asp Pro Asp Val Leu275 280 285CAG ATC CGC TCG CTG CCC AGC CTG GAC GAC GAC GAC CCC GAC GAC GAC GTG 911GLN Ile ARG Sero Seru Glo ASP PRO ASP Val Leuu
290 295 300gac agc atg cac tca ctg ggc tgc ttc cga gac cgc aac aag ctg ctg 959Asp Ser Met His Ser Leu Gly Cys Phe Arg Asp Arg Asn Lys Leu Leu290 295 300GAC AGC ATG CAC TCA CTG GGC TGC TGC CGA GAC CGC AAC AAG CTG 959asp Serou Gly Cys PHE ARG ARG Asn Lys Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu's Lu Le Lu Lu Lu Lu Lu -Aws that Cla's that
305 310 315cag gac ctg ctg tcc gag gag gag aac cag gag aag atg att tac ttc 1007Gln Asp Leu Leu Ser Glu Glu Glu Asn Gln Glu Lys Met Ile Tyr Phe320 325 330 335ctc ctc ctg gac cgg aaa gaa agg tac ccg agc cag gag gat gag gac 1055Leu Leu Leu Asp Arg Lys Glu Arg Tyr Pro Ser Gln Glu Asp Glu Asp305 310 315cag gac ctg ctg tcc gag gag gag aac cag gag aag atg att tac ttc 1007Gln Asp Leu Leu Ser Glu Glu Glu Asn Gln Glu Lys Met Ile Tyr Phe320 325 330 335ctc ctc ctg gac cgg aaa gaa agg tac ccg agc cag gag gat gag gac 1055Leu Leu Leu Asp Arg Lys Glu Arg Tyr Pro Ser Gln Glu Asp Glu Asp
340 345 350ctg ccc ccc cgg aac gag ata gac cct ccc cgg aag cgt gtg gac tcc 1103Leu Pro Pro Arg Asn Glu Ile Asp Pro Pro Arg Lys Arg Val Asp Ser340 345 350CTG CCC CCC CCC CCC CGG AAC GAG ATA GAC CCC CCC CGG AAG CGT GAC TCC 1103leu Pro Pro ARG Asn Glu Pro Pro ARG LYS ARG Val Val Val Val Val Val Val Val Val Val Val Val Val Val Val Val ASP Ser
355 360 365ccg atg ctg aac cgg cac ggc aag cgg cgg cca gaa cgc aaa tcc atg 1151Pro Met Leu Asn Arg His Gly Lys Arg Arg Pro Glu Arg Lys Ser Met355 360 365CCG ATG CTG AAC CGG CAC GGC AAG CGG CGG CCA GAA CGC AAA TCC ATG 1151PRO MET Leu Asn ARG HS ARG Pro Glu ARG LYS MET MET
370 375 380gag gtg ctc agc gtg acg gac ggc ggc tcc ccg gtg cct gcg cgg cgg 1199Glu Val Leu Ser Val Thr Asp Gly Gly Ser Pro Val Pro Ala Arg Arg370 375 380GAG GTG CTC AGC GTG GAC GGC GGC GGC GGC GGC CCG CCG CCG CGG CGG 1199Glu Val Val THR AS PRO Val Pro Ala ARA ARG ARG ARG ARG ARG ARG ARG ARG ARG ARG ARG ARG ARG ARG ARG ARG
385 390 395gcc att gag atg gcc cag cac ggc cag agg tct cgg tcc atc agc ggt 1247Ala Ile Glu Met Ala Gln His Gly Gln Arg Ser Arg Ser Ile Ser Gly400 405 410 415gcc tcc tca ggc ctt tcc acc agc cca ctc agc agc ccc cgg gtg acc 1295Ala Ser Ser Gly Leu Ser Thr Ser Pro Leu Ser Ser Pro Arg Val Thr385 390 395gcc att gag atg gcc cag cac ggc cag agg tct cgg tcc atc agc ggt 1247Ala Ile Glu Met Ala Gln His Gly Gln Arg Ser Arg Ser Ile Ser Gly400 405 410 415gcc tcc tca ggc ctt tcc acc agc cca ctc agc agc ccc cgg gtg acc 1295Ala Ser Ser Gly Leu Ser Thr Ser Pro Leu Ser Ser Pro Arg Val Thr
420 425 430cct cac ccc tca cca agg ggc agt ccc ctc ccc acc ccc aag ggg aca 1343Pro His Pro Ser Pro Arg Gly Ser Pro Leu Pro Thr Pro Lys Gly Thr420 425 430CCT CAC CCC TCA CCA AGG GGC AGT CCC CCC CCC CCC AAG GGG ACA 1343PRO HIS PRO ARG GLY GLY Serg
435 440 445cct gtc cac acg cca aag gag agc ccg gct ggc acg ccc aac ccc acg 1391Pro Val His Thr Pro Lys Glu Ser Pro Ala Gly Thr Pro Asn Pro Thr435 440 445CCT GTC CAC ACG CCA AAG GAG GAG AGC CCG GCC ACG CCC CCC CCC ACG 1391Pro Val His THR Pro LYS GLY THRY THR Pro ASN PRO Asn
450 455 460ccc ccg tcc agc ccc agc gtc gga ggg gtg ccc tgg agg gcg cgg ctc 1439Pro Pro Ser Ser Pro Ser Val Gly Gly Val Pro Trp Arg Ala Arg Leu450 455 460ccc ccg tcc agc ccc agc gtc gga ggg gtg ccc tgg agg gcg cgg ctc 1439Pro Ar Val u Pro Ser A la Ser p Tr g Pro Val Gly
465 470 475aac tcc atc aag aac agc ttt ctg ggc tca ccc cgc ttc cac cgc cga 1487Asn Ser Ile Lys Asn Ser Phe Leu Gly Ser Pro Arg Phe His Arg Arg480 485 490 495aaa ctg caa gtt ccg acg ccg gag gag atg tcc aac ctg aca cca gag 1535Lys Leu Gln Val Pro Thr Pro Glu Glu Met Ser Asn Leu Thr Pro Glu465 470 475aac tcc atc aag aac agc ttt ctg ggc tca ccc cgc ttc cac cgc cga 1487Asn Ser Ile Lys Asn Ser Phe Leu Gly Ser Pro Arg Phe His Arg Arg480 485 490 495aaa ctg caa gtt ccg acg ccg gag gag atg tcc aac ctg aca cca gag 1535Lys Leu Gln Val Pro Thr Pro Glu Glu Met Ser Asn Leu Thr Pro Glu
500 505 510tcg tcc cca gag ctg gcg aag aag tcc tgg ttt ggg aac ttc atc agc 1583Ser Ser Pro Glu Leu Ala Lys Lys Ser Trp Phe Gly Asn Phe Ile Ser500 505 510TCG TCA GAG CTG GCG AAG AAG AAG TCC TGG TGG AAC TTC AGC AGC 1583ser Seru Leu Ala Lys Sern Phe Gly Asn Phe Ile Ser
515 520 525ctg gag aag gag gag cag atc ttc gtg gtc atc aaa gac aaa cct ctg 1631Leu Glu Lys Glu Glu Gln Ile Phe Val Val Ile Lys Asp Lys Pro Leu515 520 525CTG Gag Aag Gag Gag Gag CAG ATC GTG GTC ATC AAA GAC AAA CCT CTG 1631leu LYS Glu Gln Ile Lysre Lysp Lysp Lou
530 535 540agc tcc atc aag gct gac atc gtg cac gcc ttc ctg tcg att ccc agt 1679Ser Ser Ile Lys Ala Asp Ile Val His Ala Phe Leu Ser Ile Pro Ser530 535 540AGC TCC AAG GCT GCT GAC ATC GCC GCC TTC CTG AGT CCC AGT 1679ser Serle Lysp Ile Val His Ala Phe Leu Serle Pro Ser
545 550 555ctc agc cac agc gtc atc tcc caa acg agc ttc cgg gcc gag tac aag 1727Leu Ser His Ser Val Ile Ser Gln Thr Ser Phe Arg Ala Glu Tyr Lys560 565 570 575gcc acg ggg ggg cca gcc gtg ttc cag aag ccg gtc aag ttc cag gtt 1775Ala Thr Gly Gly Pro Ala Val Phe Gln Lys Pro Val Lys Phe Gln Val545 550 555ctc agc cac agc gtc atc tcc caa acg agc ttc cgg gcc gag tac aag 1727Leu Ser His Ser Val Ile Ser Gln Thr Ser Phe Arg Ala Glu Tyr Lys560 565 570 575gcc acg ggg ggg cca gcc gtg ttc cag aag ccg gtc aag ttc cag gtt 1775Ala Thr Gly Gly Pro Ala Val Phe Gln Lys Pro Val Lys Phe Gln Val
580 585 590gat atc acc tac acg gag ggt ggg gag gcg cag aag gag aac ggc atc 1823Asp Ile Thr Tyr Thr Glu Gly Gly Glu Ala Gln Lys Glu Asn Gly Ile580 585 590GAT ATC ACC TAC ACG GGT GGT GGG GCG GCG GCG CAG AAG GAG AAC GGC ATC 1823asp Ile Thr Tyr Ty GLY GLY Gln Lysn Glys GLU Asn GLY ile
595 600 605tac tcc gtc acc ttc acc ctg ctc tca ggc ccc agc cgt cgc ttc aag 1871Tyr Ser Val Thr Phe Thr Leu Leu Ser Gly Pro Ser Arg Arg Phe Lys595 6005TAC TCC GTC ACC TTC ACC CTC CTC CTC CCC CCC CGC TTC TTC AAG 1871Tyr Seru Thr Leu Serg Phe Phe Lys
610 615 620agg gtg gtg gag acc atc cag gcc cag ctg ctg agc aca cac gac ccg 1919Arg Val Val Glu Thr Ile Gln Ala Gln Leu Leu Ser Thr His Asp Pro610 615 620AGG GAG GAG GAG ACC ATC CAG GCC CAG CTG CTG AGC ACA CAC GAC CCG 1919arg Val Val Gln Ala Gln Leu Leu Seru His ASP PRO Pro
625 630 635cct gcg gcc cag cac ttg tca gac acc act aac tgt atg gaa atg atg 1967Pro Ala Ala Gln His Leu Ser Asp Thr Thr Asn Cys Met Glu Met Met640 645 650 655acg ggg cgg ctt tcc aaa tgt gga att atc ccg aaa agt taa 2009Thr Gly Arg Leu Ser Lys Cys Gly Ile Ile Pro Lys Ser625 630 635cct gcg gcc cag cac ttg tca gac acc act aac tgt atg gaa atg atg 1967Pro Ala Ala Gln His Leu Ser Asp Thr Thr Asn Cys Met Glu Met Met640 645 650 655acg ggg cgg ctt tcc aaa tgt gga att atc ccg aaa agt taa 2009Thr Gly Arg Leu Ser Lys Cys Gly Ile Ile Pro Lys Ser
660 665catgtcacct ccacgaggcc atc 2032<210>2<211>668<212>氨基酸<213>人类<400>2Met Thr Ser Thr Gly Lys Asp Gly Gly Ala Gln His Ala Gln Tyr Val1 5 10 15Gly Pro Tyr Arg Leu Glu Lys Thr Leu Gly Lys Gly Gln Thr Gly Leu660 665CATGTCACCT CCACGAGCC ATC 2032 <210> 2 <211> 668 <212> Amino acid <213> Human <400> 2MET THR SER GLY LYS ASP GLY GLN HIS Ala Gln Tyr Valr Tyr ARG LEU GLRG LEU GLL ARG Leu Gl Leu Gly Lys Gly Gln Thr Gly Leu
20 25 30Val Lys Leu Gly Val His Cys Val Thr Cys Gln Lys Val Ala Ile Lys20 25 30Val Lys Leu Gly Val His Cys Val Thr Cys Gln Lys Val Ala Ile Lys
35 40 45Ile Val Asn Arg Glu Lys Leu Ser Glu Ser Val Leu Met Lys Val Glu35 40 45Ile Val Asn Arg Glu Lys Leu Ser Glu Ser Val Leu Met Lys Val Glu
50 55 60Arg Glu Ile Ala Ile Leu Lys Leu Ile Glu His Pro His Val Leu Lys65 70 75 80Leu His Asp Val Tyr Glu Asn Lys Lys Tyr Leu Tyr Leu Val Leu Glu50 55 60arg Glu iLe Ale Leu Lys Leu Ile Glu His Prou His Val Leu Lys65 70 80leu His ASP Val Tyr Glu Asn Lys tyr Leu Val Leu Glu Glu Glu Glu Glu Glu
85 90 95His Val Ser Gly Gly Glu Leu Phe Asp Tyr Leu Val Lys Lys Gly Arg85 90 95His Val Ser Gly Gly Glu Leu Phe Asp Tyr Leu Val Lys Lys Gly Arg
100 105 110Leu Thr Pro Lys Glu Ala Arg Lys Phe Phe Arg Gln Ile Ile Ser Ala100 105 110Leu Thr Pro Lys Glu Ala Arg Lys Phe Phe Arg Gln Ile Ile Ser Ala
115 120 125Leu Asp Phe Cys His Ser His Ser Ile Cys His Arg Asp Leu Lys Pro115 120 125Leu Asp Phe Cys His Ser His Ser Ile Cys His Arg Asp Leu Lys Pro
130 135 140Glu Asn Leu Leu Leu Asp Glu Lys Asn Asn Ile Arg Ile Ala Asp Phe145 150 155 160Gly Met Ala Ser Leu Gln Val Gly Asp Ser Leu Leu Glu Thr Ser Cys130 135 140GLU ASN Leu Leu Leu asp Glu Lysn Asn Ile Ala Ala ALA ALA ASP PHE145 155 160GLY MET ALA Seru Gln Val GLY ASER Leu Leu THR Ser CYS CYS CYS CYS CYS CYS CYS CYs
165 170 175Gly Ser Pro His Tyr Ala Cys Pro Glu Val Ile Arg Gly Glu Lys Tyr165 170 175Gly Ser Pro His Tyr Ala Cys Pro Glu Val Ile Arg Gly Glu Lys Tyr
180 185 190Asp Gly Arg Lys Ala Asp Val Trp Ser Cys Gly Val Ile Leu Phe Ala180 185 190Asp Gly Arg Lys Ala Asp Val Trp Ser Cys Gly Val Ile Leu Phe Ala
195 200 205Leu Leu Val Gly Ala Leu Pro Phe Asp Asp Asp Asn Leu Arg Gln Leu195 200 205 Leu Leu Val Gly Ala Leu Pro Phe Asp Asp Asn Leu Arg Gln Leu
210 215 220Leu Glu Lys Val Lys Arg Gly Val Phe His Met Pro His Phe Ile Pro225 230 235 240Pro Asp Cys Gln Ser Leu Leu Arg Gly Met Ser Glu Val Asp Ala Ala210 215 220leu Lys Val Val Lys ARG GLY VAL PHE HIS MET Pro His PHE Ile Pro2225 230 235 240pro ASP CYS GLN Seru Leu ARG GLY MET Serg Ala Ala Ala Ala Ala Ala Ala Ala Ala
245 250 255Arg Arg Leu Thr Leu Glu His Ile Gln Lys His Ile Trp Tyr Ile Gly245 250 255Arg Arg Leu Thr Leu Glu His Ile Gln Lys His Ile Trp Tyr Ile Gly
260 265 270Gly Lys Asn Glu Pro Glu Pro Glu Gln Pro Ile Pro Arg Lys Val Gln260 265 270Gly Lys Asn Glu Pro Glu Pro Glu Gln Pro Ile Pro Arg Lys Val Gln
275 280 285Ile Arg Ser Leu Pro Ser Leu Glu Asp Ile Asp Pro Asp Val Leu Asp275 280 285Ile Arg Ser Leu Pro Ser Leu Glu Asp Ile Asp Pro Asp Val Leu Asp
290 295 300Ser Met His Ser Leu Gly Cys Phe Arg Asp Arg Asn Lys Leu Leu Gln305 310 315 320Asp Leu Leu Ser Glu Glu Glu Asn Gln Glu Lys Met Ile Tyr Phe Leu290 295 300ser Met His Seru GLY CYS PHE ARG ARG Asn LEU Leu Leu GLN30 310 320ASP Leu Leu Leu Glu Gln Gln Gln Gln Met Ile Tyr Phe Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu's Met Ile Tyr Leu's Ges that theanity
325 330 335Leu Leu Asp Arg Lys Glu Arg Tyr Pro Ser Gln Glu Asp Glu Asp Leu325 330 335Leu Leu Asp Arg Lys Glu Arg Tyr Pro Ser Gln Glu Asp Glu Asp Leu
340 345 350Pro Pro Arg Asn Glu Ile Asp Pro Pro Arg Lys Arg Val Asp Ser Pro340 345 350Pro Pro Arg Asn Glu Ile Asp Pro Pro Arg Lys Arg Val Asp Ser Pro
355 360 365Met Leu Asn Arg His Gly Lys Arg Arg Pro Glu Arg Lys Ser Met Glu355 360 365 Met Leu Asn Arg His Gly Lys Arg Arg Pro Glu Arg Lys Ser Met Glu
370 375 380Val Leu Ser Val Thr Asp Gly Gly Ser Pro Val Pro Ala Arg Arg Ala385 390 395 400Ile Glu Met Ala Gln His Gly Gln Arg Ser Arg Ser Ile Ser Gly Ala370 375 380val Leu Ser Val THR ASP GLY GLY GLE VAL PRO ALA ARG ARG ALA385 395 400ILE GLU MET ALA GLN HIS GLN ARG Serg Serg Serg Serge Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala
405 410 415Ser Ser Gly Leu Ser Thr Ser Pro Leu Ser Ser Pro Arg Val Thr Pro405 410 415Ser Ser Gly Leu Ser Thr Ser Pro Leu Ser Ser Pro Arg Val Thr Pro
420 425 430His Pro Ser Pro Arg Gly Ser Pro Leu Pro Thr Pro Lys Gly Thr Pro420 425 430His Pro Ser Pro Arg Gly Ser Pro Leu Pro Thr Pro Lys Gly Thr Pro
435 440 445Val His Thr Pro Lys Glu Ser Pro Ala Gly Thr Pro Asn Pro Thr Pro435 440 445Val His Thr Pro Lys Glu Ser Pro Ala Gly Thr Pro Asn Pro Thr Pro
450 455 460Pro Ser Ser Pro Ser Val Gly Gly Val Pro Trp Arg Ala Arg Leu Asn465 470 475 480Ser Ile Lys Asn Ser Phe Leu Gly Ser Pro Arg Phe His Arg Arg Lys450 455 460Pro Ser Ser Val Gly Gly Val Pro TRP ARG ARG Leu asn465 475 480SER ILE LYS Asn Ser, Leu ARG PHE HIS ARG ARG LYS LYS
485 490 495Leu Gln Val Pro Thr Pro Glu Glu Met Ser Asn Leu Thr Pro Glu Ser485 490 495Leu Gln Val Pro Thr Pro Glu Glu Met Ser Asn Leu Thr Pro Glu Ser
500 505 510Ser Pro Glu Leu Ala Lys Lys Ser Trp Phe Gly Asn Phe Ile Ser Leu500 505 510Ser Pro Glu Leu Ala Lys Lys Ser Trp Phe Gly Asn Phe Ile Ser Leu
515 520 525Glu Lys Glu Glu Gln Ile Phe Val Val Ile Lys Asp Lys Pro Leu Ser515 520 525Glu Lys Glu Glu Gln Ile Phe Val Val Ile Lys Asp Lys Pro Leu Ser
530 535 540Ser Ile Lys Ala Asp Ile Val His Ala Phe Leu Ser Ile Pro Ser Leu545 550 555 560Ser His Ser Val Ile Ser Gln Thr Ser Phe Arg Ala Glu Tyr Lys Ala530 535 540Ser Ile La Ala asp Ile Val His Ala Phe Leu Serle Pro Seru545 550 560Ser His Sering Ile Serite ARG Ala Glu Tyr Lys Ala
565 570 575Thr Gly Gly Pro Ala Val Phe Gln Lys Pro Val Lys Phe Gln Val Asp565 570 575Thr Gly Gly Pro Ala Val Phe Gln Lys Pro Val Lys Phe Gln Val Asp
580 585 590Ile Thr Tyr Thr Glu Gly Gly Glu Ala Gln Lys Glu Asn Gly Ile Tyr580 585 590Ile Thr Tyr Thr Glu Gly Gly Glu Ala Gln Lys Glu Asn Gly Ile Tyr
595 600 605Ser Val Thr Phe Thr Leu Leu Ser Gly Pro Ser Arg Arg Phe Lys Arg595 600 605Ser Val Thr Phe Thr Leu Leu Ser Gly Pro Ser Arg Arg Phe Lys Arg
610 615 620Val Val Glu Thr Ile Gln Ala Gln Leu Leu Ser Thr His Asp Pro Pro625 630 635 640Ala Ala Gln His Leu Ser Asp Thr Thr Asn Cys Met Glu Met Met Thr610 615 620Val Val Glu Thr Ile Gln Ala Gln Leu Leu Ser Thr His Asp Pro Pro625 630 635 640Ala Ala Gln His Leu Ser Asp Thr Thr Asn Cys Met Glu Met Met Thr
645 650 655Gly Arg Leu Ser Lys Cys Gly Ile Ile Pro Lys Ser645 650 655Gly Arg Leu Ser Lys Cys Gly Ile Ile Pro Lys Ser
660 665<210>3<211>2223<212>核苷酸<213>人类<220><221>编码序列<222>(3)..(2213)<223><400>3cg atg aca tcg acg ggg aag gac ggc ggc gcg cag cac gcg cag tat 47Met Thr Ser Thr Gly Lys Asp Gly Gly Ala Gln His Ala Gln Tyr1 5 10 15gtt ggg ccc tac cgg ctg gag aag acg ctg ggc aag ggg cag aca ggt 95Val Gly Pro Tyr Arg Leu Glu Lys Thr Leu Gly Lys Gly Gln Thr Gly660 665<210>3<211>2223<212>nucleotide<213>human<220><221>coding sequence<222>(3)..(2213)<223><400>3cg atg aca tcg acg ggg aag gac ggc ggc gcg cag cac gcg cag tat 47Met Thr Ser Thr Gly Lys Asp Gly Gly Ala Gln His Ala Gln Tyr1 5 10 15gtt ggg ccc tac cgg ctg gag aag acg ctg ggc aag ggg cag aca ggt 95Val Gly Pro Tyr Arg Leu Glu Lys Thr Leu Gly Lys Gly Gln Thr Gly
20 25 30ctg gtg aag ctg ggg gtt cac tgc gtc acc tgc cag aag gtg gcc atc 143Leu Val Lys Leu Gly Val His Cys Val Thr Cys Gln Lys Val Ala Ile20 25 30CTG GTG AAG CTG GGG GGG GTT CAC GTC GTC ACC TGC CAG GCC ATC 143leu Val Lys Leu Gly Val Val THR CYS Val Ala Ile
35 40 45aag atc gtc aac cgt gag aag ctc agc gag tcg gtg ctg atg aag gtg 191Lys Ile Val Asn Arg Glu Lys Leu Ser Glu Ser Val Leu Met Lys Val35 40 45AAG ATC GAC CGT GAG AAG CTC GAG GAG GAG GAG GAG GAG GAG GAG GAG GAG GAG CTG AAG GTG 191lys Ile Val Asn ARG GLU LEU Ser Val Leu Met Lys Val Val Val
50 55 60gag cgg gag atc gcg atc ctg aag ctc att gag cac ccc cac gtc cta 239Glu Arg Glu Ile Ala Ile Leu Lys Leu Ile Glu His Pro His Val Leu50 55 60gag CGG GAG ATC GCG ATC CTG CTC CTC ATT GAG CAC CAC CAC CAC CAC CTC
65 70 75aag ctg cac gac gtt tat gaa aac aaa aaa tat ttg tac ctg gtg cta 287Lys Leu His Asp Val Tyr Glu Asn Lys Lys Tyr Leu Tyr Leu Val Leu80 85 90 95gaa cac gtg tca ggt ggt gag ctc ttc gac tac ctg gtg aag aag ggg 335Glu His Val Ser Gly Gly Glu Leu Phe Asp Tyr Leu Val Lys Lys Gly65 70 75aag ctg cac gac gtt tat gaa aac aaa aaa tat ttg tac ctg gtg cta 287Lys Leu His Asp Val Tyr Glu Asn Lys Lys Tyr Leu Tyr Leu Val Leu80 85 90 95gaa cac gtg tca ggt ggt gag ctc ttc gac tac ctg gtg aag aag ggg 335Glu His Val Ser Gly Gly Glu Leu Phe Asp Tyr Leu Val Lys Lys Gly
100 105 110agg ctg acg cct aag gag gct cgg aag ttc ttc cgg cag atc atc tct 383Arg Leu Thr Pro Lys Glu Ala Arg Lys Phe Phe Arg Gln Ile Ile Ser100 105 110AGG CTG ACG CCT AAG GAG GCT CGG AAG TTC TTC CGG CAG ATC ATC ATC TCT 383ARG Leu Thr Pro Lys Glu ARG LYS PHE PHE GLN ILE ILN iLe Ile iLn iLe PLN iLE PLN iLE PLU -
115 120 125gcg ctg gac ttc tgc cac agc cac tcc ata tgc cac agg gat ctg aaa 431Ala Leu Asp Phe Cys His Ser His Ser Ile Cys His Arg Asp Leu Lys115 120 125GCG CTG GAC TTC TGC CAC CAC CAC CAC ATA TGC CAC AGG GAT CTG AAA 431ALA Leu ASP PHE CYS Serle Cys
130 135 140cct gaa aac ctc ctg ctg gac gag aag aac aac atc cgc atc gca gac 479Pro Glu Asn Leu Leu Leu Asp Glu Lys Asn Asn Ile Arg Ile Ala Asp135 135 140cct GAA AAC CTC CTG CTG GAC GAG AAC AAC AAC ATC CGC ATC GCA GAC 479Pro Glu Leu Leu ASN Asn Ile ARG Ile Ala Ala Asp
145 150 155ttt ggc atg gcg tcc ctg cag gtt ggc gac agc ctg ttg gag acc agc 527Phe Gly Met Ala Ser Leu Gln Val Gly Asp Ser Leu Leu Glu Thr Ser160 165 170 175tgt ggg tcc ccc cac tac gcc tgc ccc gag gtg atc cgg ggg gag aag 575Cys Gly Ser Pro His Tyr Ala Cys Pro Glu Val Ile Arg Gly Glu Lys145 150 155ttt ggc atg gcg tcc ctg cag gtt ggc gac agc ctg ttg gag acc agc 527Phe Gly Met Ala Ser Leu Gln Val Gly Asp Ser Leu Leu Glu Thr Ser160 165 170 175tgt ggg tcc ccc cac tac gcc tgc ccc gag gtg atc cgg ggg gag aag 575Cys Gly Ser Pro His Tyr Ala Cys Pro Glu Val Ile Arg Gly Glu Lys
180 185 190tat gac ggc cgg aag gcg gac gtg tgg agc tgc ggc gtc atc ctg ttc 623Tyr Asp Gly Arg Lys Ala Asp Val Trp Ser Cys Gly Val Ile Leu Phe180 185 190TAT GAC GGC CGG AAG GCG GAC GAC GAC GTG AGC TGC GGC GGC GGC ATC CTG TTC 623TYR ARG LYS Ala Ala Ala's TrS Ge Leu Phes
195 200 205gcc ttg ctg gtg ggg gct ctg ccc ttc gac gat gac aac ttg cga cag 671Ala Leu Leu Val Gly Ala Leu Pro Phe Asp Asp Asp Asn Leu Arg Gln195 200 205GCC TTG CTG GGG GGG GGG GCT CTG CCC GAC GAC GAC GAC GAC AAC TTG CGA CGA 671ALA Leu Val Gly Ala PHE ASP ASN Leu ARG Gln Leu ARG Gln
210 215 220ctg ctg gag aag gtg aag cgg ggc gtg ttc cac atg ccg cac ttt atc 719Leu Leu Glu Lys Val Lys Arg Gly Val Phe His Met Pro His Phe Ile210 215 220CTG CTG GAG AAG GTG AAG CGG GGC GGC GTG TTC CAC CAC CCG CAC TTT ATC 719leu Leu Lys Val Val Phe His Met Pro His PHE Ile
225 230 235ccg ccc gac tgc cag agt ctg cta cgg ggc atg agc gag gtg gac gcc 767Pro Pro Asp Cys Gln Ser Leu Leu Arg Gly Met Ser Glu Val Asp Ala240 245 250 255gca cgc cgc ctc acg cta gag cac att cag aaa cac ata tgg tat ata 815Ala Arg Arg Leu Thr Leu Glu His Ile Gln Lys His Ile Trp Tyr Ile225 230 235ccg ccc gac tgc cag agt ctg cta cgg ggc atg agc gag gtg gac gcc 767Pro Pro Asp Cys Gln Ser Leu Leu Arg Gly Met Ser Glu Val Asp Ala240 245 250 255gca cgc cgc ctc acg cta gag cac att cag aaa cac ata tgg tat ata 815Ala Arg Arg Leu Thr Leu Glu His Ile Gln Lys His Ile Trp Tyr Ile
260 265 270ggg ggc aag aat gag ccc gaa cca gag cag ccc att cct cgc aag gtg 863Gly Gly Lys Asn Glu Pro Glu Pro Glu Gln Pro Ile Pro Arg Lys Val260 265 270GGG GGC AAG AAT GAG CCC GAA CCA GAG CAG CAG CAG CCC AAG GTG 863GLY GLY LYS ASN GLU PRO GLU Gln Pro Ile Pro Arg Lys Val
275 280 285cag atc cgc tcg ctg ccc agc ctg gag gac atc gac ccc gac gtg ctg 911Gln Ile Arg Ser Leu Pro Ser Leu Glu Asp Ile Asp Pro Asp Val Leu275 280 285CAG ATC CGC TCG CTG CCC AGC CTG GAC GAC GAC GAC CCC GAC GAC GAC GTG 911GLN Ile ARG Sero Seru Glo ASP PRO ASP Val Leuu
290 295 300gac agc atg cac tca ctg ggc tgc ttc cga gac cgc aac aag ctg ctg 959Asp Ser Met His Ser Leu Gly Cys Phe Arg Asp Arg Asn Lys Leu Leu290 295 300GAC AGC ATG CAC TCA CTG GGC TGC TGC CGA GAC CGC AAC AAG CTG 959asp Serou Gly Cys PHE ARG ARG Asn Lys Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu's Lu Le Lu Lu Lu Lu Lu -Aws that Cla's that
305 310 315cag gac ctg ctg tcc gag gag gag aac cag gag aag atg att tac ttc 1007Gln Asp Leu Leu Ser Glu Glu Glu Asn Gln Glu Lys Met Ile Tyr Phe320 325 330 335ctc ctc ctg gac cgg aaa gaa agg tac ccg agc cag gag gat gag gac 1055Leu Leu Leu Asp Arg Lys Glu Arg Tyr Pro Ser Gln Glu Asp Glu Asp305 310 315cag gac ctg ctg tcc gag gag gag aac cag gag aag atg att tac ttc 1007Gln Asp Leu Leu Ser Glu Glu Glu Asn Gln Glu Lys Met Ile Tyr Phe320 325 330 335ctc ctc ctg gac cgg aaa gaa agg tac ccg agc cag gag gat gag gac 1055Leu Leu Leu Asp Arg Lys Glu Arg Tyr Pro Ser Gln Glu Asp Glu Asp
340 345 350ctg ccc ccc cgg aac gag ata gac cct ccc cgg aag cgt gtg gac tcc 1103Leu Pro Pro Arg Asn Glu Ile Asp Pro Pro Arg Lys Arg Val Asp Ser340 345 350CTG CCC CCC CCC CCC CGG AAC GAG ATA GAC CCC CCC CGG AAG CGT GAC TCC 1103leu Pro Pro ARG Asn Glu Pro Pro ARG LYS ARG Val Val Val Val Val Val Val Val Val Val Val Val Val Val Val Val ASP Ser
355 360 365ccg atg ctg aac cgg cac ggc aag cgg cgg cca gaa cgc aaa tcc atg 1151Pro Met Leu Asn Arg His Gly Lys Arg Arg Pro Glu Arg Lys Ser Met355 360 365CCG ATG CTG AAC CGG CAC GGC AAG CGG CGG CCA GAA CGC AAA TCC ATG 1151PRO MET Leu Asn ARG HS ARG Pro Glu ARG LYS MET MET
370 375 380gag gtg ctc agc gtg acg gac ggc ggc tcc ccg gtg cct gcg cgg cgg 1199Glu Val Leu Ser Val Thr Asp Gly Gly Ser Pro Val Pro Ala Arg Arg370 375 380GAG GTG CTC AGC GTG GAC GGC GGC GGC GGC GGC CCG CCG CCG CGG CGG 1199Glu Val Val THR AS PRO Val Pro Ala ARA ARG ARG ARG ARG ARG ARG ARG ARG ARG ARG ARG ARG ARG ARG ARG ARG
385 390 395gcc att gag atg gcc cag cac ggc cag agg tct cgg tcc atc agc ggt 1247Ala Ile Glu Met Ala Gln His Gly Gln Arg Ser Arg Ser Ile Ser Gly400 405 410 415gcc tcc tca ggc ctt tcc acc agc cca ctc agc agc ccc cgg gtg acc 1295Ala Ser Ser Gly Leu Ser Thr Ser Pro Leu Ser Ser Pro Arg Val Thr385 390 395gcc att gag atg gcc cag cac ggc cag agg tct cgg tcc atc agc ggt 1247Ala Ile Glu Met Ala Gln His Gly Gln Arg Ser Arg Ser Ile Ser Gly400 405 410 415gcc tcc tca ggc ctt tcc acc agc cca ctc agc agc ccc cgg gtg acc 1295Ala Ser Ser Gly Leu Ser Thr Ser Pro Leu Ser Ser Pro Arg Val Thr
420 425 430cct cac ccc tca cca agg ggc agt ccc ctc ccc acc ccc aag ggg aca 1343Pro His Pro Ser Pro Arg Gly Ser Pro Leu Pro Thr Pro Lys Gly Thr420 425 430CCT CAC CCC TCA CCA AGG GGC AGT CCC CCC CCC CCC AAG GGG ACA 1343PRO HIS PRO ARG GLY GLY Serg
435 440 445cct gtc cac acg cca aag gag agc ccg gct ggc acg ccc aac ccc acg 1391Pro Val His Thr Pro Lys Glu Ser Pro Ala Gly Thr Pro Asn Pro Thr435 440 445CCT GTC CAC ACG CCA AAG GAG GAG AGC CCG GCC ACG CCC CCC CCC ACG 1391Pro Val His THR Pro LYS GLY THRY THR Pro ASN PRO Asn
450 455 460ccc ccg tcc agc ccc agc gtc gga ggg gtg ccc tgg agg gcg cgg ctc 1439Pro Pro Ser Ser Pro Ser Val Gly Gly Val Pro Trp Arg Ala Arg Leu450 455 460ccc ccg tcc agc ccc agc gtc gga ggg gtg ccc tgg agg gcg cgg ctc 1439Pro Ar Val u Pro Ser A la Ser p Tr g Pro Val Gly
465 470 475aac tcc atc aag aac agc ttt ctg ggc tca ccc cgc ttc cac cgc cga 1487Asn Ser Ile Lys Asn Ser Phe Leu Gly Ser Pro Arg Phe His Arg Arg480 485 490 495aaa ctg caa gtt ccg acg ccg gag gag atg tcc aac ctg aca cca gag 1535Lys Leu Gln Val Pro Thr Pro Glu Glu Met Ser Asn Leu Thr Pro Glu465 470 475aac tcc atc aag aac agc ttt ctg ggc tca ccc cgc ttc cac cgc cga 1487Asn Ser Ile Lys Asn Ser Phe Leu Gly Ser Pro Arg Phe His Arg Arg480 485 490 495aaa ctg caa gtt ccg acg ccg gag gag atg tcc aac ctg aca cca gag 1535Lys Leu Gln Val Pro Thr Pro Glu Glu Met Ser Asn Leu Thr Pro Glu
500 505 510tcg tcc cca gag ctg gcg aag aag tcc tgg ttt ggg aac ttc atc agc 1583Ser Ser Pro Glu Leu Ala Lys Lys Ser Trp Phe Gly Asn Phe Ile Ser500 505 510TCG TCA GAG CTG GCG AAG AAG AAG TCC TGG TGG AAC TTC AGC AGC 1583ser Seru Leu Ala Lys Sern Phe Gly Asn Phe Ile Ser
515 520 525ctg gag aag gag gag cag atc ttc gtg gtc atc aaa gac aaa cct ctg 1631Leu Glu Lys Glu Glu Gln Ile Phe Val Val Ile Lys Asp Lys Pro Leu515 520 525CTG Gag Aag Gag Gag Gag CAG ATC GTG GTC ATC AAA GAC AAA CCT CTG 1631leu LYS Glu Gln Ile Lysre Lysp Lysp Lou
530 535 540agc tcc atc aag gct gac atc gtg cac gcc ttc ctg tcg att ccc agt 1679Ser Ser Ile Lys Ala Asp Ile Val His Ala Phe Leu Ser Ile Pro Ser530 535 540AGC TCC AAG GCT GCT GAC ATC GCC GCC TTC CTG AGT CCC AGT 1679ser Serle Lysp Ile Val His Ala Phe Leu Serle Pro Ser
545 550 555ctc agc cac agc gtc atc tcc caa acg agc ttc cgg gcc gag tac aag 1727Leu Ser His Ser Val Ile Ser Gln Thr Ser Phe Arg Ala Glu Tyr Lys560 565 570 575gcc acg ggg ggg cca gcc gtg ttc cag aag ccg gtc aag ttc cag gtt 1775Ala Thr Gly Gly Pro Ala Val Phe Gln Lys Pro Val Lys Phe Gln Val545 550 555ctc agc cac agc gtc atc tcc caa acg agc ttc cgg gcc gag tac aag 1727Leu Ser His Ser Val Ile Ser Gln Thr Ser Phe Arg Ala Glu Tyr Lys560 565 570 575gcc acg ggg ggg cca gcc gtg ttc cag aag ccg gtc aag ttc cag gtt 1775Ala Thr Gly Gly Pro Ala Val Phe Gln Lys Pro Val Lys Phe Gln Val
580 585 590gat atc acc tac acg gag ggt ggg gag gcg cag aag gag aac ggc atc 1823Asp Ile Thr Tyr Thr Glu Gly Gly Glu Ala Gln Lys Glu Asn Gly Ile580 585 590GAT ATC ACC TAC ACG GGT GGT GGG GCG GCG GCG CAG AAG GAG AAC GGC ATC 1823asp Ile Thr Tyr Ty GLY GLY Gln Lysn Glys GLU Asn GLY ile
595 600 605tac tcc gtc acc ttc acc ctg ctc tca ggc ccc agc cgt cgc ttc aag 1871Tyr Ser Val Thr Phe Thr Leu Leu Ser Gly Pro Ser Arg Arg Phe Lys595 6005TAC TCC GTC ACC TTC ACC CTC CTC CTC CCC CCC CGC TTC TTC AAG 1871Tyr Seru Thr Leu Serg Phe Phe Lys
610 615 620agg gtg gtg gag acc atc cag gcc cag ctg ctg agc aca cac gac ccg 1919Arg Val Val Glu Thr Ile Gln Ala Gln Leu Leu Ser Thr His Asp Pro610 615 620AGG GAG GAG GAG ACC ATC CAG GCC CAG CTG CTG AGC ACA CAC GAC CCG 1919arg Val Val Gln Ala Gln Leu Leu Seru His ASP PRO Pro
625 630 635cct gcg gcc cag cac ttg tca gac acc act aac tgt atg gaa atg atg 1967Pro Ala Ala Gln His Leu Ser Asp Thr Thr Asn Cys Met Glu Met Met640 645 650 655acg ggg cgg ctt tcc aaa tgt ggc agc cca ttg agt aac ttc ttt gac 2015Thr Gly Arg Leu Ser Lys Cys Gly Ser Pro Leu Ser Asn Phe Phe Asp625 630 635cct gcg gcc cag cac ttg tca gac acc act aac tgt atg gaa atg atg 1967Pro Ala Ala Gln His Leu Ser Asp Thr Thr Asn Cys Met Glu Met Met640 645 650 655acg ggg cgg ctt tcc aaa tgt ggc agc cca ttg agt aac ttc ttt gac 2015Thr Gly Arg Leu Ser Lys Cys Gly Ser Pro Leu Ser Asn Phe Phe Asp
660 665 670gta att aaa caa ctt ttt tca gac gag aag aac ggg cag gcg gcc cag 2063Val Ile Lys Gln Leu Phe Ser Asp Glu Lys Asn Gly Gln Ala Ala Gln660 665 670GTA AAA CAA CAA CTT TTT TCA GAC GAG AAC GGG CAG GCG GCC CAG 2063VAL ILE LYS GLN Leu PHE SERU LYSN ALASLN AL -S AS AL -S's
675 680 685gcc ccc agc acg ccc gcc aag cgg agt gcc cac ggc cca ctc ggt gac 2111Ala Pro Ser Thr Pro Ala Lys Arg Ser Ala His Gly Pro Leu Gly Asp675 685GCC CCC AGC AGC ACG CCC GCC AAG CGG AGT GCC CAC CCA CCA CTC GGT GAC 211ALA PRO ALA LYS Ala His Gly Pro Gly Prou Gly ASP
690 695 700tcc gcg gcc gct ggc cct ggc ccc gga ggg gac gcc gag tac cca acg 2159Ser Ala Ala Ala Gly Pro Gly Pro Gly Gly Asp Ala Glu Tyr Pro Thr690 695 700TCC GCC GCC GCT GGC CCC CCC GGG GGG GCC GCC GCC GAG TAG TAG TAG TAG TAG CCA ALA Ala Gly Pro GLY GLY GLY GLY ALA GLU TYYR PRO GEA
705 710 715ggc aag gac acg gcc aag atg ggc ccg ccc acc gcc cgc cgc gag cag 2207Gly Lys Asp Thr Ala Lys Met Gly Pro Pro Thr Ala Arg Arg Glu Gln720 725 730 735cct tag acacactagc 2223Pro<210>4<211>736<212>氨基酸<213>人类<400>4Met Thr Ser Thr Gly Lys Asp Gly Gly Ala Gln His Ala Gln Tyr Val1 5 10 15Gly Pro Tyr Arg Leu Glu Lys Thr Leu Gly Lys Gly Gln Thr Gly Leu705 710 715ggc aag gac acg gcc aag atg ggc ccg ccc acc gcc cgc cgc gag cag 2207Gly Lys Asp Thr Ala Lys Met Gly Pro Pro Thr Ala Arg Arg Glu Gln720 725 730 735cct tag acacactagc 2223Pro<210>4<211>736<212 >氨基酸<213>人类<400>4Met Thr Ser Thr Gly Lys Asp Gly Gly Ala Gln His Ala Gln Tyr Val1 5 10 15Gly Pro Tyr Arg Leu Glu Lys Thr Leu Gly Lys Gly Gln Thr Gly Leu
20 25 30Val Lys Leu Gly Val His Cys Val Thr Cys Gln Lys Val Ala Ile Lys20 25 30Val Lys Leu Gly Val His Cys Val Thr Cys Gln Lys Val Ala Ile Lys
35 40 45Ile Val Asn Arg Glu Lys Leu Ser Glu Ser Val Leu Met Lys Val Glu35 40 45Ile Val Asn Arg Glu Lys Leu Ser Glu Ser Val Leu Met Lys Val Glu
50 55 60Arg Glu Ile Ala Ile Leu Lys Leu Ile Glu His Pro His Val Leu Lys65 70 75 80Leu His Asp Val Tyr Glu Asn Lys Lys Tyr Leu Tyr Leu Val Leu Glu50 55 60arg Glu iLe Ale Leu Lys Leu Ile Glu His Prou His Val Leu Lys65 70 80leu His ASP Val Tyr Glu Asn Lys tyr Leu Val Leu Glu Glu Glu Glu Glu Glu
85 90 95His Val Ser Gly Gly Glu Leu Phe Asp Tyr Leu Val Lys Lys Gly Arg85 90 95His Val Ser Gly Gly Glu Leu Phe Asp Tyr Leu Val Lys Lys Gly Arg
100 105 110Leu Thr Pro Lys Glu Ala Arg Lys Phe Phe Arg Gln Ile Ile Ser Ala100 105 110Leu Thr Pro Lys Glu Ala Arg Lys Phe Phe Arg Gln Ile Ile Ser Ala
115 120 125Leu Asp Phe Cys His Ser His Ser Ile Cys His Arg Asp Leu Lys Pro115 120 125Leu Asp Phe Cys His Ser His Ser Ile Cys His Arg Asp Leu Lys Pro
130 135 140Glu Asn Leu Leu Leu Asp Glu Lys Asn Asn Ile Arg Ile Ala Asp Phe145 150 155 160Gly Met Ala Ser Leu Gln Val Gly Asp Ser Leu Leu Glu Thr Ser Cys130 135 140GLU ASN Leu Leu Leu asp Glu Lysn Asn Ile Ala Ala ALA ALA ASP PHE145 155 160GLY MET ALA Seru Gln Val GLY ASER Leu Leu THR Ser CYS CYS CYS CYS CYS CYS CYS CYs
165 170 175Gly Ser Pro His Tyr Ala Cys Pro Glu Val Ile Arg Gly Glu Lys Tyr165 170 175Gly Ser Pro His Tyr Ala Cys Pro Glu Val Ile Arg Gly Glu Lys Tyr
180 185 190Asp Gly Arg Lys Ala Asp Val Trp Ser Cys Gly Val Ile Leu Phe Ala180 185 190Asp Gly Arg Lys Ala Asp Val Trp Ser Cys Gly Val Ile Leu Phe Ala
195 200 205Leu Leu Val Gly Ala Leu Pro Phe Asp Asp Asp Asn Leu Arg Gln Leu195 200 205 Leu Leu Val Gly Ala Leu Pro Phe Asp Asp Asn Leu Arg Gln Leu
210 215 220Leu Glu Lys Val Lys Arg Gly Val Phe His Met Pro His Phe Ile Pro225 230 235 240Pro Asp Cys Gln Ser Leu Leu Arg Gly Met Ser Glu Val Asp Ala Ala210 215 220leu Lys Val Val Lys ARG GLY VAL PHE HIS MET Pro His PHE Ile Pro2225 230 235 240pro ASP CYS GLN Seru Leu ARG GLY MET Serg Ala Ala Ala Ala Ala Ala Ala Ala Ala
245 250 255Arg Arg Leu Thr Leu Glu His Ile Gln Lys His Ile Trp Tyr Ile Gly245 250 255Arg Arg Leu Thr Leu Glu His Ile Gln Lys His Ile Trp Tyr Ile Gly
260 265 270Gly Lys Asn Glu Pro Glu Pro Glu Gln Pro Ile Pro Arg Lys Val Gln260 265 270Gly Lys Asn Glu Pro Glu Pro Glu Gln Pro Ile Pro Arg Lys Val Gln
275 280 285Ile Arg Ser Leu Pro Ser Leu Glu Asp Ile Asp Pro Asp Val Leu Asp275 280 285Ile Arg Ser Leu Pro Ser Leu Glu Asp Ile Asp Pro Asp Val Leu Asp
290 295 300Ser Met His Ser Leu Gly Cys Phe Arg Asp Arg Asn Lys Leu Leu Gln305 310 315 320Asp Leu Leu Ser Glu Glu Glu Asn Gln Glu Lys Met Ile Tyr Phe Leu290 295 300ser Met His Seru GLY CYS PHE ARG ARG Asn LEU Leu Leu GLN30 310 320ASP Leu Leu Leu Glu Gln Gln Gln Gln Met Ile Tyr Phe Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu's Met Ile Tyr Leu's Ges that theanity
325 330 335Leu Leu Asp Arg Lys Glu Arg Tyr Pro Ser Gln Glu Asp Glu Asp Leu325 330 335Leu Leu Asp Arg Lys Glu Arg Tyr Pro Ser Gln Glu Asp Glu Asp Leu
340 345 350Pro Pro Arg Asn Glu Ile Asp Pro Pro Arg Lys Arg Val Asp Ser Pro340 345 350Pro Pro Arg Asn Glu Ile Asp Pro Pro Arg Lys Arg Val Asp Ser Pro
355 360 365Met Leu Asn Arg His Gly Lys Arg Arg Pro Glu Arg Lys Ser Met Glu355 360 365 Met Leu Asn Arg His Gly Lys Arg Arg Pro Glu Arg Lys Ser Met Glu
370 375 380Val Leu Ser Val Thr Asp Gly Gly Ser Pro Val Pro Ala Arg Arg Ala385 390 395 400Ile Glu Met Ala Gln His Gly Gln Arg Ser Arg Ser Ile Ser Gly Ala370 375 380val Leu Ser Val THR ASP GLY GLY GLE VAL PRO ALA ARG ARG ALA385 395 400ILE GLU MET ALA GLN HIS GLN ARG Serg Serg Serg Serge Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala
405 410 415Ser Ser Gly Leu Ser Thr Ser Pro Leu Ser Ser Pro Arg Val Thr Pro405 410 415Ser Ser Gly Leu Ser Thr Ser Pro Leu Ser Ser Pro Arg Val Thr Pro
420 425 430His Pro Ser Pro Arg Gly Ser Pro Leu Pro Thr Pro Lys Gly Thr Pro420 425 430His Pro Ser Pro Arg Gly Ser Pro Leu Pro Thr Pro Lys Gly Thr Pro
435 440 445Val His Thr Pro Lys Glu Ser Pro Ala Gly Thr Pro Asn Pro Thr Pro435 440 445Val His Thr Pro Lys Glu Ser Pro Ala Gly Thr Pro Asn Pro Thr Pro
450 455 460Pro Ser Ser Pro Ser Val Gly Gly Val Pro Trp Arg Ala Arg Leu Asn465 470 475 480Ser Ile Lys Asn Ser Phe Leu Gly Ser Pro Arg Phe His Arg Arg Lys450 455 460Pro Ser Ser Val Gly Gly Val Pro TRP ARG ARG Leu asn465 475 480SER ILE LYS Asn Ser, Leu ARG PHE HIS ARG ARG LYS LYS
485 490 495Leu Gln Val Pro Thr Pro Glu Glu Met Ser Asn Leu Thr Pro Glu Ser485 490 495Leu Gln Val Pro Thr Pro Glu Glu Met Ser Asn Leu Thr Pro Glu Ser
500 505 510Ser Pro Glu Leu Ala Lys Lys Ser Trp Phe Gly Asn Phe Ile Ser Leu500 505 510Ser Pro Glu Leu Ala Lys Lys Ser Trp Phe Gly Asn Phe Ile Ser Leu
515 520 525Glu Lys Glu Glu Gln Ile Phe Val Val Ile Lys Asp Lys Pro Leu Ser515 520 525Glu Lys Glu Glu Gln Ile Phe Val Val Ile Lys Asp Lys Pro Leu Ser
530 535 540Ser Ile Lys Ala Asp Ile Val His Ala Phe Leu Ser Ile Pro Ser Leu545 550 555 560Ser His Ser Val Ile Ser Gln Thr Ser Phe Arg Ala Glu Tyr Lys Ala530 535 540Ser Ile La Ala asp Ile Val His Ala Phe Leu Serle Pro Seru545 550 560Ser His Sering Ile Serite ARG Ala Glu Tyr Lys Ala
565 570 575Thr Gly Gly Pro Ala Val Phe Gln Lys Pro Val Lys Phe Gln Val Asp565 570 575Thr Gly Gly Pro Ala Val Phe Gln Lys Pro Val Lys Phe Gln Val Asp
580 585 590Ile Thr Tyr Thr Glu Gly Gly Glu Ala Gln Lys Glu Asn Gly Ile Tyr580 585 590Ile Thr Tyr Thr Glu Gly Gly Glu Ala Gln Lys Glu Asn Gly Ile Tyr
595 600 605Ser Val Thr Phe Thr Leu Leu Ser Gly Pro Ser Arg Arg Phe Lys Arg595 600 605Ser Val Thr Phe Thr Leu Leu Ser Gly Pro Ser Arg Arg Phe Lys Arg
610 615 620Val Val Glu Thr Ile Gln Ala Gln Leu Leu Ser Thr His Asp Pro Pro625 630 635 640Ala Ala Gln His Leu Ser Asp Thr Thr Asn Cys Met Glu Met Met Thr610 615 620Val Val Glu Thr Ile Gln Ala Gln Leu Leu Ser Thr His Asp Pro Pro625 630 635 640Ala Ala Gln His Leu Ser Asp Thr Thr Asn Cys Met Glu Met Met Thr
645 650 655Gly Arg Leu Ser Lys Cys Gly Ser Pro Leu Ser Asn Phe Phe Asp Val645 650 655Gly Arg Leu Ser Lys Cys Gly Ser Pro Leu Ser Asn Phe Phe Asp Val
660 665 670Ile Lys Gln Leu Phe Ser Asp Glu Lys Asn Gly Gln Ala Ala Gln Ala660 665 670Ile Lys Gln Leu Phe Ser Asp Glu Lys Asn Gly Gln Ala Ala Gln Ala
675 680 685Pro Ser Thr Pro Ala Lys Arg Ser Ala His Gly Pro Leu Gly Asp Ser675 680 685Pro Ser Thr Pro Ala Lys Arg Ser Ala His Gly Pro Leu Gly Asp Ser
690 695 700Ala Ala Ala Gly Pro Gly Pro Gly Gly Asp Ala Glu Tyr Pro Thr Gly705 710 715 720Lys Asp Thr Ala Lys Met Gly Pro Pro Thr Ala Arg Arg Glu Gln Pro690 695 700ALA Ala Gly Pro Gly Pro Gly Gly Gly Gly Ala Glu Tyr Pro Thr Gly705 715 720lys Asp THR ALA LYS MET GLY Pro THR ARG Gln Pro Gln Pro
725 730 735725 730 735
表1hSTKa MTSTGKDGGAQHAQYVGPYRLEKTLGKGQTGLVKLGVHCVTCGKVAIKIVNREKLSESVLhSTKc MTSTGKDGGAQHAQYVGPYRLEKTLGKGQTGLVKLGVHCVTCQKVAIKIVNREKLSESVLCAA07196 ------------------------------------------------------------hSTKa MKVEREIAILKLIEHPHVLKLHDVYENKKYLYLVLEHVSGGELFDYLVKKGRLTPKEARKhSTKc MKVEREIAILKLIEHPHVLKLHDVYENKKYLYLVLEHVSGGELFDYLVKKGRLTPKEARKCAA07196 -----------LIEHPHVLKLHDVYENKKYLYLVLEHVSGGELFDYLVKKGRLTPKEARKTable 1hSTKa MTSTGKDGGAQHAQYVGPYRLEKTLGKGQTGLVKLGVHCVTCGKVAIKIVNREKLSESVLhSTKc MTSTGKDGGAQHAQYVGPYRLEKTLGKGQTGLVKLGVHCVTCQKVAIKIVNRE-------------------------19 --------------hSTKa MKVEREIAILKLIEHPHVLKLHDVYENKKYLYLVLEHVSGGELFDYLVKKGRLTPKEARKhSTKc MKVEREIAILKLIEHPHVLKLHDVYENKKYLYLVLEHVSGGELFDYLVKKGRLTPKEARKCAA07196 -----------LIEHPHVLKLHDVYENKKYLYLVLEHVSGGELFDYLVKKGRLTPKEARK
*************************************************hSTKa FFRQIISALDFCHSHSICHRDLKPENLLLDEKNNIRIADFGMASLQVGDSLLETSCGSPHhSTKc FFRQIISALDFCHSHSICHRDLKPENLLLDEKNNIRIADFGMASLQVGDSLLETSCGSPHCAA07196 FFRQIISALDFCHSHSICHRDLKPENLLLDEKNNIRIADFGMASLQVGDSLLETSCGSPH***************************************************hSTKa FFRQIISALDFCHSHSICHRDLKPENLLLDEKNNIRIADFGMASLQVGDSLLETSCGSPHhSTKc FFRQIISALDFCHSHSICHRDLKPENLLLDEKNNIRIADFGMASLQVGDSLLETSCGSPHCAA07196 GSLLETSCGSPHCAA07196 FFRQIISALDFCHSHSICHRDLKPENLLLDEKNNIRIADFG
************************************************************hSTKa YACPEVIRGEKYDGRKADVWSCGVILFALLVGALPFDDDNLRQLLEKVKRGVFHMPHFIPhSTKc YACPEVIRGEKYDGRKADVWSCGVILFALLVGALPFDDDNLRQLLEKVKRGVFHMPHFIPCAA07196 YACPEVIRGEKYDGRKADVWSCGVILFALLVGALPFDDDNLRQLLEKVKRGVFHMPHFIP***************************************************** **********hSTKa YACPEVIRGEKYDGRKADVWSCGVILFALLVGALPFDDDNLRQLLEKVKRGVFHMPHFIPhSTKc YACPEVIRGEKYDGRKADVWSCGVILFALLVGALPFDDDNLRQLLEKVKRGVFHMPHFIPCAA07196 YACPEVIRGEKYDGRKADVWSCGVILFALLVGALPFDDDNLRQLLEKVKRGVFHMPHFIP
************************************************************hSTKa PDCQSLLRGMSEVDAARRLTLEHIQKHIWYIGGKNEPEPEQPIPRKVQIRSLPSLEDIDPhSTKc PDCQSLLRGMSEVDAARRLTLEHIQKHIWYIGGKNEPEPEQPIPRKVQIRSLPSLEDIDPCAA07196 PDCQSLLRGMSEVDAARRLTLEHIQKHIWYIGGKNEPEPEQPIPRKVQIRSLPSLEDIDP***************************************************** **********hSTKa PDCQSLLRGMSEVDAARRLTLEHIQKHIWYIGGKNEPEPEQPIPRKVQIRSLPSLEDIDPhSTKc PDCQSLLRGMSEVDAARRLTLEHIQKHIWYIGGKNEPEPEQPIPRKVQIRSLPSLEDIDPCAA07196 PDCQSLLRGMSEVDAARRLTLEHIQKHIWYIGGKNEPEPEQPIPRKVQIRSLPSLEDIDP
************************************************************hSTKa DVLDSMHSLGCFRDRNKLLGDLLSEEENQEKMIYFLLLDRKERYPSQEDEDLPPRNEIDPhSTKc DVLDSMHSLGCFRDRNKLLQDLLSEEENQEKMIYFLLLDRKERYPSQEDEDLPPRNEIDPCAA07196 DVLDSMHSLGCFRDRNKLLQDLLSEEENQEKMIYFLLLDRKERYPSQEDEDLPPRNEIDP***************************************************** **********hSTKa DVLDSMHSLGCFRDRNKLLGDLLSEEENQEKMIYFLLLDRKERYPSQEDEDLPPRNEIDPhSTKc DVLDSMHSLGCFRDRNKLLQDLLSEEENQEKMIYFLLLDRKERYPSQEDEDLPPRNEIDPCAA07196 DVLDSMHSLGCFRDRNKLLQDLLSEEENQEKMIYFLLLDRKERYPSQEDEDLPPRNEIDP
************************************************************hSTKa PRKRVDSPMLNRHGKRRPERKSMEVLSVTDGGSPVPARRAIEMAQHGQRSRSISGASSGLhSTKc PRKRVDSPMLNRHGKRRPERKSMEVLSVTDGGSPVPARRAIEMAQHGQRSRSISGASSGLCAA07196 PRKRVDSPMLNRHGKRRPERKSMEVLSVTDGGSPVPARRAIEMAQHGQRSRSISGASSGL***************************************************** **********hSTKa PRKRVDSPMLNRHGKRRPERKSMEVLSVTDGGSPVPARRAIEMAQHGQRSRSISGASSGLhSTKc PRKRVDSPMLNRHGKRRPERKSMEVLSVTDGGSPVPARRAIEMAQHGQRSRSISGASSGLCAA07196 PRKRVDSPMLNRHGKRRPERKSMEVLSVTDGGSPVPARRAIEMAQHGQRSRSISGASSGL
************************************************************hSTKa STSPLSSPRVTPHPSPRGSPLPTPKGTPVHTPKESPAGTPNPTPPSSPSVGGVPWRARLNhSTKc STSPLSSPRVTPHPSPRGSPLPTPKGTPVHTPKESPAGTPNPTPPSSPSVGGVPWRARLNCAA07196 STSPLSSPRVTPHPSPRGSPLPTPKGTPVHTPKESPAGTPNPTPPSSPSVGGVPWRARLN***************************************************** **********hSTKa STSPLSSPRVTPHPSPRGSPLPTPKGTPVHTPKESPAGTPNPTPPSSPSVGGVPWRARLNhSTKc STSPLSSPRVTPHPSPRGSPLPTPKGTPVHTPKESPAGTPNPTPPSSPSVGGVPWRARLNCAA07196 STSPLSSPRVTPHPSPRGSPLPTPKGTPVHTPKESPAGTPNPTPPSSPSVGGVPWRARLN
************************************************************hSTKa SIKNSFLGSPRFHRRKLQVPTPEEMSNLTPESSPELAKKSWFGNFISLEKEEQIFVVIKDhSTKc SIKNSFLGSPRFHRRKLQVPTPEEMSNLTPESSPELAKKSWFGNFISLEKEEQIFVVIKDCAA07196 SIKNSFLGSPRFHRRKLQVPTPEEMSNLTPESSPELAKKSWFGNFISLEKEEQIFVVIKD***************************************************** **********hSTKa SIKNSFLGSPRFHRRKLQVPTPEEMSNLTPESSPELAKKSWFGNFISLEKEEQIFVVIKDhSTKc SIKNSFLGSPRFHRRKLQVPTPEEMSNLTPESSPELAKKSWFGNFISLEKEEQIFVVIKDCAA07196 SIKNSFLGSPRFHRRKLQVPTPEEMSNLTPESSPELAKKSWFGNFISLEKEEQIFVVIKD
************************************************************hSTKa KPLSSIKADIVHAFLSIPSLSHSVISQTSFRAEYKATGGPAVFQKPVKFQVDITYTEGGEhSTKc KPLSSIKADIVHAFLSIPSLSHSVISQTSFRAEYKATGGPAVFQKPVKFQVDITYTEGGECAA07196 KPLSSIKADIVHAFLSIPSLSHSVISQTSFRAEYKATGGPAVFQKPVKFQVDITYTEGGE***************************************************** **********hSTKa KPLSSIKADIVHAFLSIPSLSHSVISQTSFRAEYKATGGPAVFQKPVKFQVDITYTEGGEhSTKc KPLSSIKADIVHAFLSIPSLSHSVISQTSFRAEYKATGGPAVFQKPVKFQVDITYTEGGECAA07196 KPLSSIKADIVHAFLSIPSLSHSVISQTSFRAEYKATGGPAVFQKPVKFQVDITYTEGGE
************************************************************hSTKa AQKENGIYSVTFTLLSGPSRRFKRVVETIQAQLLSTHDPPAAQHLSDTTNCMEMMTGRLShSTKc AQKENGIYSVTFTLLSGPSRRFKRVVETIQAQLLSTHDPPAAQHLSDTTNCMEMMTGRLSCAA07196 AQKENGIYSVTFTLLSGPSRRFKRVVETIQAQLLSTHDPPAAQHLSEPPPPAPGLSWGAG***************************************************** **********hSTKa AQKENGIYSVTFTLLSGPSRRFKRVVETIQAQLLSTHDPPAAQHLSDTTNCMEMMTGRLShSTKc AQKENGIYSVTFTLLSGPSRRFKRVVETIQAQLLSTHDPPAAQHLSDTTNCMEMMTGRLSCAA07196 AQKENGIYSVTFTLLSGPSRRFKRVVETIQAQLLSTHDPPAAQHLSEPPPPAPGLSWGAG
**********************************************:.. ::hSTKa KCG--------IIPKS--------------------------------------------hSTKc KCGSPLSNFFDVIKQLFSDEKNGQAAQAPSTPAKRSAHGPLGDSAAAGPGPGGDAEYPTGCAA07196 LKG-------QKVATSYESSL---------------------------------------*************************************************:.. : :hSTKa KCG-------IIPKS------------------------------------- ------hSTKc KCGSPLSNFFDVIKQLFSDEKNGQAAQAPSTPAKRSAHGPLGDSAAAGPGPGGDAEYPTGCAA07196 LKG-------QKVATSYESSL--------------------------------- ------
* :hSTKa ----------------hSTKc KDTAKMGPPTARREQPCAA07196 ----------------* *:hSTKa ----------------hSTKc KDTAKMGPPTARREQPCAA07196 ----------------
Claims (18)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN02136497A CN1414103A (en) | 2002-08-14 | 2002-08-14 | Human serine/threonine protein kitase, its coding sequence, preparation method and use |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN02136497A CN1414103A (en) | 2002-08-14 | 2002-08-14 | Human serine/threonine protein kitase, its coding sequence, preparation method and use |
Publications (1)
Publication Number | Publication Date |
---|---|
CN1414103A true CN1414103A (en) | 2003-04-30 |
Family
ID=4748667
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN02136497A Pending CN1414103A (en) | 2002-08-14 | 2002-08-14 | Human serine/threonine protein kitase, its coding sequence, preparation method and use |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1414103A (en) |
-
2002
- 2002-08-14 CN CN02136497A patent/CN1414103A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1665839A (en) | Novel antagonists of MCP proteins | |
CN1170850C (en) | Human angiopoietin-like protein and coding sequence and use thereof | |
CN1289524C (en) | Human telomerase active inhibitor protein and use thereof | |
CN1244595C (en) | A kind of tumor suppressor protein and its application | |
CN1414103A (en) | Human serine/threonine protein kitase, its coding sequence, preparation method and use | |
CN1303102C (en) | Method for diagnosing and curing alopecia utilizing the Rhor gene of human and rat and the encoding products | |
CN1194012C (en) | Sperm generation-related protein, its coding sequence and use | |
CN1234728C (en) | Novel human lymphokine, its coding sequence and use | |
CN1151259C (en) | Human uridine-cytidylate kinase, its coding sequence, and its production method | |
CN1209369C (en) | Cell death inducing protein and its coding sequence and use | |
CN1164614C (en) | Memory clear protein and its application | |
CN1710069A (en) | Adenylosuccinate synthase-like molecule and its coding sequence and use | |
CN1155613C (en) | Human tumor associated gene in 1-zone 3-band 3-subband of short arm of human chromosome No.17 and its coding protein | |
CN1273485C (en) | Immunoglobulin sample receptor originated from human dendritic cell, its coding sequence and application | |
CN1570100A (en) | Nerval incretion specific protein analog, its coding sequence, making method and uses | |
CN1272540A (en) | Human immune factor related protein and its coded sequence | |
CN1209372C (en) | Human tumor relative gene CT120 in human 17p 13.3 area and coding protein thereof | |
CN1304568C (en) | Novel human complement C1r-like serine proteinase analogue, and its encoding sequence and use | |
CN1216991C (en) | Human guanosine monophophate reductase and its coding sequence and preparation method | |
CN1151251C (en) | New human-phosphoguanosine reductase, its coding sequence and application | |
CN1631898A (en) | Mitochondrial transport protein molecule derived from human bone marrow stromal cells and its coding sequence and use | |
CN1271007A (en) | New human chitinase protein and its code sequence | |
CN1302875A (en) | Human Charcot-leyden crystal 5, its coding sequence, its preparing process and its application | |
CN1544465A (en) | A protein that promotes neural differentiation and resists apoptosis and its coding gene | |
CN1920027A (en) | Human hPTP gene order, encode albumen and preparation method thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C12 | Rejection of a patent application after its publication | ||
RJ01 | Rejection of invention patent application after publication |