KR101016689B1 - Proteins involved in the transmission of peptidoglycan recognition signals, genes encoding them, and bacterial infection detection kits including the same
- Publication number
- KR101016689B1 (application KR1020070140064A)
- Authority
- KR
- South Korea
- Prior art keywords
- gly
- leu
- lys
- ser
- val
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 154
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 139
- 208000035143 Bacterial infection Diseases 0.000 title claims abstract description 18
- 208000022362 bacterial infectious disease Diseases 0.000 title claims abstract description 18
- 238000001514 detection method Methods 0.000 title claims abstract description 15
- 102000052544 Peptidoglycan recognition protein Human genes 0.000 title abstract description 25
- 230000005540 biological transmission Effects 0.000 title abstract 2
- 210000004369 blood Anatomy 0.000 claims abstract description 18
- 239000008280 blood Substances 0.000 claims abstract description 18
- 235000021329 brown rice Nutrition 0.000 claims abstract description 10
- 235000014101 wine Nutrition 0.000 claims abstract description 7
- 241000254105 Tenebrio Species 0.000 claims description 46
- 101100190227 Drosophila melanogaster PGRP-SA gene Proteins 0.000 claims description 10
- 102000016943 Muramidase Human genes 0.000 claims description 7
- 108010014251 Muramidase Proteins 0.000 claims description 7
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 claims description 7
- 229960000274 lysozyme Drugs 0.000 claims description 7
- 239000004325 lysozyme Substances 0.000 claims description 7
- 235000010335 lysozyme Nutrition 0.000 claims description 7
- 101000705294 Arabidopsis thaliana Oxygen-evolving enhancer protein 1-2, chloroplastic Proteins 0.000 claims description 6
- 108010057081 Merozoite Surface Protein 1 Proteins 0.000 claims description 6
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 5
- 101710205059 Beta-lytic metalloendopeptidase Proteins 0.000 claims description 5
- 241000124008 Mammalia Species 0.000 claims description 5
- 239000002773 nucleotide Substances 0.000 claims description 5
- 125000003729 nucleotide group Chemical group 0.000 claims description 5
- 235000013305 food Nutrition 0.000 claims description 4
- 239000003673 groundwater Substances 0.000 claims description 3
- 239000008176 lyophilized powder Substances 0.000 claims description 3
- 235000020679 tap water Nutrition 0.000 claims description 3
- 239000008399 tap water Substances 0.000 claims description 3
- 101100369769 Drosophila melanogaster blp gene Proteins 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 8
- 150000001875 compounds Chemical class 0.000 claims 1
- 101150061069 gnbp1 gene Proteins 0.000 claims 1
- MSFSPUZXLOGKHJ-UHFFFAOYSA-N Muraminsaeure Natural products OC(=O)C(C)OC1C(N)C(O)OC(CO)C1O MSFSPUZXLOGKHJ-UHFFFAOYSA-N 0.000 abstract description 20
- 108010013639 Peptidoglycan Proteins 0.000 abstract description 20
- 230000019491 signal transduction Effects 0.000 abstract description 3
- 108010069727 pro-phenoloxidase Proteins 0.000 abstract description 2
- 230000008054 signal transmission Effects 0.000 abstract description 2
- 150000001413 amino acids Chemical group 0.000 description 34
- 108010062466 Enzyme Precursors Proteins 0.000 description 15
- 102000010911 Enzyme Precursors Human genes 0.000 description 15
- 239000000872 buffer Substances 0.000 description 15
- 108020004414 DNA Proteins 0.000 description 12
- 239000000243 solution Substances 0.000 description 12
- 241000254109 Tenebrio molitor Species 0.000 description 11
- 101710187483 Gram-negative bacteria-binding protein 1 Proteins 0.000 description 10
- 108010009051 Peptidoglycan recognition protein Proteins 0.000 description 10
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 10
- 230000037361 pathway Effects 0.000 description 9
- 102000035195 Peptidases Human genes 0.000 description 8
- 108091005804 Peptidases Proteins 0.000 description 8
- 239000004365 Protease Substances 0.000 description 8
- 238000004440 column chromatography Methods 0.000 description 8
- 239000002299 complementary DNA Substances 0.000 description 8
- 108010015792 glycyllysine Proteins 0.000 description 8
- 238000000034 method Methods 0.000 description 8
- 101710136697 Modular serine protease Proteins 0.000 description 7
- 241000254086 Tribolium <beetle> Species 0.000 description 7
- 108010047857 aspartylglycine Proteins 0.000 description 7
- 238000003776 cleavage reaction Methods 0.000 description 7
- 230000007246 mechanism Effects 0.000 description 7
- 108010070643 prolylglutamic acid Proteins 0.000 description 7
- 230000007017 scission Effects 0.000 description 7
- 239000000758 substrate Substances 0.000 description 7
- 241000219122 Cucurbita Species 0.000 description 6
- 235000009852 Cucurbita pepo Nutrition 0.000 description 6
- 229960002897 heparin Drugs 0.000 description 6
- 229920000669 heparin Polymers 0.000 description 6
- 239000000203 mixture Substances 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 5
- 108700023418 Amidases Proteins 0.000 description 5
- 102000005922 amidase Human genes 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 210000001124 body fluid Anatomy 0.000 description 5
- 238000006243 chemical reaction Methods 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 238000010828 elution Methods 0.000 description 5
- 230000007717 exclusion Effects 0.000 description 5
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 5
- 108010034529 leucyl-lysine Proteins 0.000 description 5
- 150000007523 nucleic acids Chemical group 0.000 description 5
- 108010026333 seryl-proline Proteins 0.000 description 5
- 239000011780 sodium chloride Substances 0.000 description 5
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 4
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 4
- 108091028043 Nucleic acid sequence Proteins 0.000 description 4
- YMORXCKTSSGYIG-IHRRRGAJSA-N Phe-Arg-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N YMORXCKTSSGYIG-IHRRRGAJSA-N 0.000 description 4
- STGVYUTZKGPRCI-GUBZILKMSA-N Pro-Val-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 STGVYUTZKGPRCI-GUBZILKMSA-N 0.000 description 4
- 108010022999 Serine Proteases Proteins 0.000 description 4
- 102000012479 Serine Proteases Human genes 0.000 description 4
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 4
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 4
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 4
- 230000004913 activation Effects 0.000 description 4
- 108010005233 alanylglutamic acid Proteins 0.000 description 4
- 108010087924 alanylproline Proteins 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 108010060035 arginylproline Proteins 0.000 description 4
- 108010092854 aspartyllysine Proteins 0.000 description 4
- 108010078144 glutaminyl-glycine Proteins 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 4
- 108010087823 glycyltyrosine Proteins 0.000 description 4
- 238000004128 high performance liquid chromatography Methods 0.000 description 4
- 229910052588 hydroxylapatite Inorganic materials 0.000 description 4
- 238000000338 in vitro Methods 0.000 description 4
- 108010003700 lysyl aspartic acid Proteins 0.000 description 4
- 108010017391 lysylvaline Proteins 0.000 description 4
- 244000005700 microbiome Species 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- VIIIJFZJKFXOGG-UHFFFAOYSA-N 3-methylchromen-2-one Chemical compound C1=CC=C2OC(=O)C(C)=CC2=C1 VIIIJFZJKFXOGG-UHFFFAOYSA-N 0.000 description 3
- 241000590035 Achromobacter lyticus Species 0.000 description 3
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 3
- AMRANMVXQWXNAH-ZLUOBGJFSA-N Asp-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC(O)=O AMRANMVXQWXNAH-ZLUOBGJFSA-N 0.000 description 3
- 229920002498 Beta-glucan Polymers 0.000 description 3
- 102000014914 Carrier Proteins Human genes 0.000 description 3
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 3
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 3
- HTTJABKRGRZYRN-UHFFFAOYSA-N Heparin Chemical compound OC1C(NC(=O)C)C(O)OC(COS(O)(=O)=O)C1OC1C(OS(O)(=O)=O)C(O)C(OC2C(C(OS(O)(=O)=O)C(OC3C(C(O)C(O)C(O3)C(O)=O)OS(O)(=O)=O)C(CO)O2)NS(O)(=O)=O)C(C(O)=O)O1 HTTJABKRGRZYRN-UHFFFAOYSA-N 0.000 description 3
- VLPMGIJPAWENQB-SRVKXCTJSA-N His-Cys-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O VLPMGIJPAWENQB-SRVKXCTJSA-N 0.000 description 3
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 3
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 3
- 108010065920 Insulin Lispro Proteins 0.000 description 3
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 3
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 3
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 3
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 3
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 3
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 3
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 3
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 3
- 108010079364 N-glycylalanine Proteins 0.000 description 3
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 3
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 3
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 3
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 3
- 108090000190 Thrombin Proteins 0.000 description 3
- 108010044940 alanylglutamine Proteins 0.000 description 3
- 108010068380 arginylarginine Proteins 0.000 description 3
- 108010038633 aspartylglutamate Proteins 0.000 description 3
- 108010068265 aspartyltyrosine Proteins 0.000 description 3
- 239000010839 body fluid Substances 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 108010016616 cysteinylglycine Proteins 0.000 description 3
- 229940088598 enzyme Drugs 0.000 description 3
- 239000000499 gel Substances 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- 108010050848 glycylleucine Proteins 0.000 description 3
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 3
- XYJRXVWERLGGKC-UHFFFAOYSA-D pentacalcium;hydroxide;triphosphate Chemical compound [OH-].[Ca+2].[Ca+2].[Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O XYJRXVWERLGGKC-UHFFFAOYSA-D 0.000 description 3
- 108010051242 phenylalanylserine Proteins 0.000 description 3
- 108090000765 processed proteins & peptides Proteins 0.000 description 3
- 230000009257 reactivity Effects 0.000 description 3
- 230000011664 signaling Effects 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 2
- NJIFPLAJSVUQOZ-JBDRJPRFSA-N Ala-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C)N NJIFPLAJSVUQOZ-JBDRJPRFSA-N 0.000 description 2
- JPGBXANAQYHTLA-DRZSPHRISA-N Ala-Gln-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JPGBXANAQYHTLA-DRZSPHRISA-N 0.000 description 2
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 2
- FOHXUHGZZKETFI-JBDRJPRFSA-N Ala-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N FOHXUHGZZKETFI-JBDRJPRFSA-N 0.000 description 2
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 2
- QKHWNPQNOHEFST-VZFHVOOUSA-N Ala-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N)O QKHWNPQNOHEFST-VZFHVOOUSA-N 0.000 description 2
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 2
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 2
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 2
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 2
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 2
- YWENWUYXQUWRHQ-LPEHRKFASA-N Arg-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O YWENWUYXQUWRHQ-LPEHRKFASA-N 0.000 description 2
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 2
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 2
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 2
- NMTANZXPDAHUKU-ULQDDVLXSA-N Arg-Tyr-Lys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 NMTANZXPDAHUKU-ULQDDVLXSA-N 0.000 description 2
- CNBIWSCSSCAINS-UFYCRDLUSA-N Arg-Tyr-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNBIWSCSSCAINS-UFYCRDLUSA-N 0.000 description 2
- LLQIAIUAKGNOSE-NHCYSSNCSA-N Arg-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N LLQIAIUAKGNOSE-NHCYSSNCSA-N 0.000 description 2
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 2
- QRHYAUYXBVVDSB-LKXGYXEUSA-N Asn-Cys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QRHYAUYXBVVDSB-LKXGYXEUSA-N 0.000 description 2
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 2
- LTDGPJKGJDIBQD-LAEOZQHASA-N Asn-Val-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LTDGPJKGJDIBQD-LAEOZQHASA-N 0.000 description 2
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 2
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 2
- WXASLRQUSYWVNE-FXQIFTODSA-N Asp-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N WXASLRQUSYWVNE-FXQIFTODSA-N 0.000 description 2
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 2
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 2
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 2
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 2
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 2
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 2
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 2
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 2
- VZKXOWRNJDEGLZ-WHFBIAKZSA-N Cys-Asp-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O VZKXOWRNJDEGLZ-WHFBIAKZSA-N 0.000 description 2
- ANPADMNVVOOYKW-DCAQKATOSA-N Cys-His-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ANPADMNVVOOYKW-DCAQKATOSA-N 0.000 description 2
- KGIHMGPYGXBYJJ-SRVKXCTJSA-N Cys-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CS KGIHMGPYGXBYJJ-SRVKXCTJSA-N 0.000 description 2
- NIXHTNJAGGFBAW-CIUDSAMLSA-N Cys-Lys-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N NIXHTNJAGGFBAW-CIUDSAMLSA-N 0.000 description 2
- LHJDLVVQRJIURS-SRVKXCTJSA-N Cys-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N LHJDLVVQRJIURS-SRVKXCTJSA-N 0.000 description 2
- WTXCNOPZMQRTNN-BWBBJGPYSA-N Cys-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)O WTXCNOPZMQRTNN-BWBBJGPYSA-N 0.000 description 2
- LPBUBIHAVKXUOT-FXQIFTODSA-N Cys-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N LPBUBIHAVKXUOT-FXQIFTODSA-N 0.000 description 2
- 108700019996 Drosophila Gnbp1 Proteins 0.000 description 2
- 241000192125 Firmicutes Species 0.000 description 2
- 108010092526 GKPV peptide Proteins 0.000 description 2
- 102100022887 GTP-binding nuclear protein Ran Human genes 0.000 description 2
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 2
- VVWWRZZMPSPVQU-KBIXCLLPSA-N Gln-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N VVWWRZZMPSPVQU-KBIXCLLPSA-N 0.000 description 2
- MFLMFRZBAJSGHK-ACZMJKKPSA-N Gln-Cys-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N MFLMFRZBAJSGHK-ACZMJKKPSA-N 0.000 description 2
- JEFZIKRIDLHOIF-BYPYZUCNSA-N Gln-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(O)=O JEFZIKRIDLHOIF-BYPYZUCNSA-N 0.000 description 2
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 2
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 2
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 2
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 2
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 2
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 2
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 2
- KLJMRPIBBLTDGE-ACZMJKKPSA-N Glu-Cys-Asn Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O KLJMRPIBBLTDGE-ACZMJKKPSA-N 0.000 description 2
- OBIHEDRRSMRKLU-ACZMJKKPSA-N Glu-Cys-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OBIHEDRRSMRKLU-ACZMJKKPSA-N 0.000 description 2
- ISXJHXGYMJKXOI-GUBZILKMSA-N Glu-Cys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O ISXJHXGYMJKXOI-GUBZILKMSA-N 0.000 description 2
- UENPHLAAKDPZQY-XKBZYTNZSA-N Glu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O UENPHLAAKDPZQY-XKBZYTNZSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 2
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 2
- PEZZSFLFXXFUQD-XPUUQOCRSA-N Gly-Cys-Val Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O PEZZSFLFXXFUQD-XPUUQOCRSA-N 0.000 description 2
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 2
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 2
- IDOGEHIWMJMAHT-BYPYZUCNSA-N Gly-Gly-Cys Chemical compound NCC(=O)NCC(=O)N[C@@H](CS)C(O)=O IDOGEHIWMJMAHT-BYPYZUCNSA-N 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 2
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 2
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 2
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 2
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 2
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 2
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 2
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 2
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 2
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 2
- 101000774835 Heteractis crispa PI-stichotoxin-Hcr2o Proteins 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- BQYZXYCEKYJKAM-VGDYDELISA-N His-Cys-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQYZXYCEKYJKAM-VGDYDELISA-N 0.000 description 2
- KNNSUUOHFVVJOP-GUBZILKMSA-N His-Glu-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N KNNSUUOHFVVJOP-GUBZILKMSA-N 0.000 description 2
- BRZQWIIFIKTJDH-VGDYDELISA-N His-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N BRZQWIIFIKTJDH-VGDYDELISA-N 0.000 description 2
- LYDKQVYYCMYNMC-SRVKXCTJSA-N His-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LYDKQVYYCMYNMC-SRVKXCTJSA-N 0.000 description 2
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 2
- WCHONUZTYDQMBY-PYJNHQTQSA-N His-Pro-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WCHONUZTYDQMBY-PYJNHQTQSA-N 0.000 description 2
- 101000620756 Homo sapiens GTP-binding nuclear protein Ran Proteins 0.000 description 2
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 2
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 2
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 2
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 2
- PFTFEWHJSAXGED-ZKWXMUAHSA-N Ile-Cys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N PFTFEWHJSAXGED-ZKWXMUAHSA-N 0.000 description 2
- KTTMFLSBTNBAHL-MXAVVETBSA-N Ile-Phe-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N KTTMFLSBTNBAHL-MXAVVETBSA-N 0.000 description 2
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 2
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 2
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 2
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 2
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 2
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 2
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 2
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 2
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 2
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 2
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 2
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 2
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 2
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 2
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 2
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 2
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 2
- RDIILCRAWOSDOQ-CIUDSAMLSA-N Lys-Cys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RDIILCRAWOSDOQ-CIUDSAMLSA-N 0.000 description 2
- KSFQPRLZAUXXPT-GARJFASQSA-N Lys-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)C(=O)O KSFQPRLZAUXXPT-GARJFASQSA-N 0.000 description 2
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 2
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 2
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 2
- KKFVKBWCXXLKIK-AVGNSLFASA-N Lys-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCCN)N KKFVKBWCXXLKIK-AVGNSLFASA-N 0.000 description 2
- YXTKSLRSRXKXNV-IHRRRGAJSA-N Lys-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N YXTKSLRSRXKXNV-IHRRRGAJSA-N 0.000 description 2
- PINHPJWGVBKQII-SRVKXCTJSA-N Lys-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N PINHPJWGVBKQII-SRVKXCTJSA-N 0.000 description 2
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 2
- GVKINWYYLOLEFQ-XIRDDKMYSA-N Lys-Trp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O GVKINWYYLOLEFQ-XIRDDKMYSA-N 0.000 description 2
- 108010053229 Lysyl endopeptidase Proteins 0.000 description 2
- SDTSLIMYROCDNS-FXQIFTODSA-N Met-Cys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O SDTSLIMYROCDNS-FXQIFTODSA-N 0.000 description 2
- YGNUDKAPJARTEM-GUBZILKMSA-N Met-Val-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O YGNUDKAPJARTEM-GUBZILKMSA-N 0.000 description 2
- 101100068676 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) gln-1 gene Proteins 0.000 description 2
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 2
- 241000283973 Oryctolagus cuniculus Species 0.000 description 2
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 2
- ONORAGIFHNAADN-LLLHUVSDSA-N Phe-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N ONORAGIFHNAADN-LLLHUVSDSA-N 0.000 description 2
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 2
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 2
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 2
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 2
- QCARZLHECSFOGG-CIUDSAMLSA-N Pro-Glu-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O QCARZLHECSFOGG-CIUDSAMLSA-N 0.000 description 2
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 2
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 2
- VGVCNKSUVSZEIE-IHRRRGAJSA-N Pro-Phe-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O VGVCNKSUVSZEIE-IHRRRGAJSA-N 0.000 description 2
- HOTVCUAVDQHUDB-UFYCRDLUSA-N Pro-Phe-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 HOTVCUAVDQHUDB-UFYCRDLUSA-N 0.000 description 2
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 2
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 2
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 2
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 2
- 239000012564 Q sepharose fast flow resin Substances 0.000 description 2
- 229920002684 Sepharose Polymers 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 2
- GWMXFEMMBHOKDX-AVGNSLFASA-N Ser-Gln-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GWMXFEMMBHOKDX-AVGNSLFASA-N 0.000 description 2
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 2
- JLPMFVAIQHCBDC-CIUDSAMLSA-N Ser-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N JLPMFVAIQHCBDC-CIUDSAMLSA-N 0.000 description 2
- WGDYNRCOQRERLZ-KKUMJFAQSA-N Ser-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N WGDYNRCOQRERLZ-KKUMJFAQSA-N 0.000 description 2
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 2
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 2
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 2
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 2
- UYLKOSODXYSWMQ-XGEHTFHBSA-N Ser-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N)O UYLKOSODXYSWMQ-XGEHTFHBSA-N 0.000 description 2
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 2
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 2
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 2
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 2
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 2
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 2
- XZSJDSBPEJBEFZ-QRTARXTBSA-N Trp-Asn-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O XZSJDSBPEJBEFZ-QRTARXTBSA-N 0.000 description 2
- VKMOGXREKGVZAF-QEJZJMRPSA-N Trp-Asp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VKMOGXREKGVZAF-QEJZJMRPSA-N 0.000 description 2
- JLTQXEOXIJMCLZ-ZVZYQTTQSA-N Trp-Gln-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 JLTQXEOXIJMCLZ-ZVZYQTTQSA-N 0.000 description 2
- WVHUFSCKCBQKJW-HKUYNNGSSA-N Trp-Gly-Tyr Chemical compound C([C@H](NC(=O)CNC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 WVHUFSCKCBQKJW-HKUYNNGSSA-N 0.000 description 2
- ZNFPUOSTMUMUDR-JRQIVUDYSA-N Tyr-Asn-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZNFPUOSTMUMUDR-JRQIVUDYSA-N 0.000 description 2
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 2
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 2
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 2
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 2
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 description 2
- GPLTZEMVOCZVAV-UFYCRDLUSA-N Tyr-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 GPLTZEMVOCZVAV-UFYCRDLUSA-N 0.000 description 2
- IVXJODPZRWHCCR-JYJNAYRXSA-N Val-Arg-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IVXJODPZRWHCCR-JYJNAYRXSA-N 0.000 description 2
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 2
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 2
- KXUKIBHIVRYOIP-ZKWXMUAHSA-N Val-Asp-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KXUKIBHIVRYOIP-ZKWXMUAHSA-N 0.000 description 2
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 2
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 2
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 2
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 2
- PYPZMFDMCCWNST-NAKRPEOUSA-N Val-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N PYPZMFDMCCWNST-NAKRPEOUSA-N 0.000 description 2
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 2
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 2
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 2
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 2
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 2
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 2
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 2
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 2
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 2
- 230000003213 activating effect Effects 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 108091008324 binding proteins Proteins 0.000 description 2
- 230000003197 catalytic effect Effects 0.000 description 2
- 230000004154 complement system Effects 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- MUCZHBLJLSDCSD-UHFFFAOYSA-N diisopropyl fluorophosphate Chemical compound CC(C)OP(F)(=O)OC(C)C MUCZHBLJLSDCSD-UHFFFAOYSA-N 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 229960005051 fluostigmine Drugs 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 108010037389 glutamyl-cysteinyl-lysine Proteins 0.000 description 2
- 150000004676 glycans Chemical class 0.000 description 2
- 108010081985 glycyl-cystinyl-aspartic acid Proteins 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 108010081551 glycylphenylalanine Proteins 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 238000003119 immunoblot Methods 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 235000012054 meals Nutrition 0.000 description 2
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 239000001488 sodium phosphate Substances 0.000 description 2
- 229910000162 sodium phosphate Inorganic materials 0.000 description 2
- 229960003339 sodium phosphate Drugs 0.000 description 2
- 239000012064 sodium phosphate buffer Substances 0.000 description 2
- 235000011008 sodium phosphates Nutrition 0.000 description 2
- 108010066180 tertiary-butyloxycarbonyl-valyl-prolyl-arginyl-7-amino-4-methylcoumarin Proteins 0.000 description 2
- 108010061238 threonyl-glycine Proteins 0.000 description 2
- LWIHDJKSTIGBAC-UHFFFAOYSA-K tripotassium phosphate Chemical compound [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 2
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 2
- 108010078580 tyrosylleucine Proteins 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- ZXJZGWOMAFPSJH-DCAQKATOSA-N (2S)-1-[2-[[2-[[(2S)-2-[[(2S)-2-[(2-aminoacetyl)amino]-3-carboxypropanoyl]amino]-3-hydroxypropanoyl]amino]acetyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O ZXJZGWOMAFPSJH-DCAQKATOSA-N 0.000 description 1
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- WPANETAWYGDRLL-UHFFFAOYSA-N 4-aminobenzenecarboximidamide Chemical compound NC(=N)C1=CC=C(N)C=C1 WPANETAWYGDRLL-UHFFFAOYSA-N 0.000 description 1
- TYMLOMAKGOJONV-UHFFFAOYSA-N 4-nitroaniline Chemical compound NC1=CC=C([N+]([O-])=O)C=C1 TYMLOMAKGOJONV-UHFFFAOYSA-N 0.000 description 1
- BTJIUGUIPKRLHP-UHFFFAOYSA-N 4-nitrophenol Chemical compound OC1=CC=C([N+]([O-])=O)C=C1 BTJIUGUIPKRLHP-UHFFFAOYSA-N 0.000 description 1
- TVEXGJYMHHTVKP-UHFFFAOYSA-N 6-oxabicyclo[3.2.1]oct-3-en-7-one Chemical compound C1C2C(=O)OC1C=CC2 TVEXGJYMHHTVKP-UHFFFAOYSA-N 0.000 description 1
- 241000590020 Achromobacter Species 0.000 description 1
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 1
- WYPUMLRSQMKIJU-BPNCWPANSA-N Ala-Arg-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WYPUMLRSQMKIJU-BPNCWPANSA-N 0.000 description 1
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 1
- MBWYUTNBYSSUIQ-HERUPUMHSA-N Ala-Asn-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N MBWYUTNBYSSUIQ-HERUPUMHSA-N 0.000 description 1
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 1
- CXZFXHGJJPVUJE-CIUDSAMLSA-N Ala-Cys-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O)N CXZFXHGJJPVUJE-CIUDSAMLSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 1
- ANGAOPNEPIDLPO-XVYDVKMFSA-N Ala-His-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N ANGAOPNEPIDLPO-XVYDVKMFSA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 1
- AUFACLFHBAGZEN-ZLUOBGJFSA-N Ala-Ser-Cys Chemical compound N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O AUFACLFHBAGZEN-ZLUOBGJFSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 1
- MAISCYVJLBBRNU-DCAQKATOSA-N Arg-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N MAISCYVJLBBRNU-DCAQKATOSA-N 0.000 description 1
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 1
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 1
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 1
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- HCIUUZGFTDTEGM-NAKRPEOUSA-N Arg-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HCIUUZGFTDTEGM-NAKRPEOUSA-N 0.000 description 1
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 1
- FNXCAFKDGBROCU-STECZYCISA-N Arg-Ile-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FNXCAFKDGBROCU-STECZYCISA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- DIIGDGJKTMLQQW-IHRRRGAJSA-N Arg-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N DIIGDGJKTMLQQW-IHRRRGAJSA-N 0.000 description 1
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- UGJLILSJKSBVIR-ZFWWWQNUSA-N Arg-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)NCC(O)=O)=CNC2=C1 UGJLILSJKSBVIR-ZFWWWQNUSA-N 0.000 description 1
- FTMRPIVPSDVGCC-GUBZILKMSA-N Arg-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FTMRPIVPSDVGCC-GUBZILKMSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 1
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 1
- YJRORCOAFUZVKA-FXQIFTODSA-N Asn-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N YJRORCOAFUZVKA-FXQIFTODSA-N 0.000 description 1
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 1
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 1
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 1
- JZRLLSOWDYUKOK-SRVKXCTJSA-N Asn-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N JZRLLSOWDYUKOK-SRVKXCTJSA-N 0.000 description 1
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 1
- XXAOXVBAWLMTDR-ZLUOBGJFSA-N Asn-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N XXAOXVBAWLMTDR-ZLUOBGJFSA-N 0.000 description 1
- RRVBEKYEFMCDIF-WHFBIAKZSA-N Asn-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)C(=O)N RRVBEKYEFMCDIF-WHFBIAKZSA-N 0.000 description 1
- QGNXYDHVERJIAY-ACZMJKKPSA-N Asn-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGNXYDHVERJIAY-ACZMJKKPSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 1
- CDGHMJJJHYKMPA-DLOVCJGASA-N Asn-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)N)N CDGHMJJJHYKMPA-DLOVCJGASA-N 0.000 description 1
- BKZFBJYIVSBXCO-KKUMJFAQSA-N Asn-Phe-His Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O BKZFBJYIVSBXCO-KKUMJFAQSA-N 0.000 description 1
- FTNRWCPWDWRPAV-BZSNNMDCSA-N Asn-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTNRWCPWDWRPAV-BZSNNMDCSA-N 0.000 description 1
- XCBKBPRFACFFOO-AQZXSJQPSA-N Asn-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O XCBKBPRFACFFOO-AQZXSJQPSA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 1
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- WJHYGGVCWREQMO-GHCJXIJMSA-N Asp-Cys-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WJHYGGVCWREQMO-GHCJXIJMSA-N 0.000 description 1
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 1
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 1
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 1
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 1
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- MFTVXYMXSAQZNL-DJFWLOJKSA-N Asp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)O)N MFTVXYMXSAQZNL-DJFWLOJKSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- AITKTFCQOBRJTG-CIUDSAMLSA-N Asp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N AITKTFCQOBRJTG-CIUDSAMLSA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 1
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 1
- QOCFFCUFZGDHTP-NUMRIWBASA-N Asp-Thr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QOCFFCUFZGDHTP-NUMRIWBASA-N 0.000 description 1
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 1
- ZVYYMCXVPZEAPU-CWRNSKLLSA-N Asp-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZVYYMCXVPZEAPU-CWRNSKLLSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 101000802666 Bombyx mori Beta-1,3-glucan-binding protein Proteins 0.000 description 1
- 101100380241 Caenorhabditis elegans arx-2 gene Proteins 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- YFXFOZPXVFPBDH-VZFHVOOUSA-N Cys-Ala-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)CS)C(O)=O YFXFOZPXVFPBDH-VZFHVOOUSA-N 0.000 description 1
- YZFCGHIBLBDZDA-ZLUOBGJFSA-N Cys-Asp-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YZFCGHIBLBDZDA-ZLUOBGJFSA-N 0.000 description 1
- KABHAOSDMIYXTR-GUBZILKMSA-N Cys-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N KABHAOSDMIYXTR-GUBZILKMSA-N 0.000 description 1
- ZQHQTSONVIANQR-BQBZGAKWSA-N Cys-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N ZQHQTSONVIANQR-BQBZGAKWSA-N 0.000 description 1
- UXIYYUMGFNSGBK-XPUUQOCRSA-N Cys-Gly-Val Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O UXIYYUMGFNSGBK-XPUUQOCRSA-N 0.000 description 1
- SBDVXRYCOIEYNV-YUMQZZPRSA-N Cys-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N SBDVXRYCOIEYNV-YUMQZZPRSA-N 0.000 description 1
- LHMSYHSAAJOEBL-CIUDSAMLSA-N Cys-Lys-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O LHMSYHSAAJOEBL-CIUDSAMLSA-N 0.000 description 1
- GDNWBSFSHJVXKL-GUBZILKMSA-N Cys-Lys-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O GDNWBSFSHJVXKL-GUBZILKMSA-N 0.000 description 1
- TXCCRYAZQBUCOV-CIUDSAMLSA-N Cys-Pro-Gln Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O TXCCRYAZQBUCOV-CIUDSAMLSA-N 0.000 description 1
- NRVQLLDIJJEIIZ-VZFHVOOUSA-N Cys-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N)O NRVQLLDIJJEIIZ-VZFHVOOUSA-N 0.000 description 1
- LLUXQOVDMQZMPJ-KKUMJFAQSA-N Cys-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CS)CC1=CC=C(O)C=C1 LLUXQOVDMQZMPJ-KKUMJFAQSA-N 0.000 description 1
- 108700020359 Drosophila Tl Proteins 0.000 description 1
- 108700013637 Drosophila bw Proteins 0.000 description 1
- 101100519813 Drosophila melanogaster PGRP-LE gene Proteins 0.000 description 1
- ALUBSZXSNSPDQV-WDSKDSINSA-N Gln-Cys-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ALUBSZXSNSPDQV-WDSKDSINSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- JHPFPROFOAJRFN-IHRRRGAJSA-N Gln-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O JHPFPROFOAJRFN-IHRRRGAJSA-N 0.000 description 1
- MWERYIXRDZDXOA-QEWYBTABSA-N Gln-Ile-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MWERYIXRDZDXOA-QEWYBTABSA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- GQTNWYFWSUFFRA-KKUMJFAQSA-N Gln-Met-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GQTNWYFWSUFFRA-KKUMJFAQSA-N 0.000 description 1
- RWQCWSGOOOEGPB-FXQIFTODSA-N Gln-Ser-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O RWQCWSGOOOEGPB-FXQIFTODSA-N 0.000 description 1
- BJVBMSTUUWGZKX-JYJNAYRXSA-N Gln-Tyr-His Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BJVBMSTUUWGZKX-JYJNAYRXSA-N 0.000 description 1
- AKDOUBMVLRCHBD-SIUGBPQLSA-N Gln-Tyr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AKDOUBMVLRCHBD-SIUGBPQLSA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 1
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 1
- NKSGKPWXSWBRRX-ACZMJKKPSA-N Glu-Asn-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N NKSGKPWXSWBRRX-ACZMJKKPSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- ZZIFPJZQHRJERU-WDSKDSINSA-N Glu-Cys-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZZIFPJZQHRJERU-WDSKDSINSA-N 0.000 description 1
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 1
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- RXESHTOTINOODU-JYJNAYRXSA-N Glu-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N RXESHTOTINOODU-JYJNAYRXSA-N 0.000 description 1
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 1
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 1
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 1
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 1
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 1
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 1
- JPWIMMUNWUKOAD-STQMWFEESA-N Gly-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN JPWIMMUNWUKOAD-STQMWFEESA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 1
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- WTUSRDZLLWGYAT-KCTSRDHCSA-N Gly-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN WTUSRDZLLWGYAT-KCTSRDHCSA-N 0.000 description 1
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 1
- GWNIGUKSRJBIHX-STQMWFEESA-N Gly-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN)O GWNIGUKSRJBIHX-STQMWFEESA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 1
- 206010018612 Gonorrhoea Diseases 0.000 description 1
- ZPVJJPAIUZLSNE-DCAQKATOSA-N His-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O ZPVJJPAIUZLSNE-DCAQKATOSA-N 0.000 description 1
- VHHYJBSXXMPQGZ-AVGNSLFASA-N His-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N VHHYJBSXXMPQGZ-AVGNSLFASA-N 0.000 description 1
- PYNUBZSXKQKAHL-UWVGGRQHSA-N His-Gly-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O PYNUBZSXKQKAHL-UWVGGRQHSA-N 0.000 description 1
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 1
- ORERHHPZDDEMSC-VGDYDELISA-N His-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ORERHHPZDDEMSC-VGDYDELISA-N 0.000 description 1
- LDFWDDVELNOGII-MXAVVETBSA-N His-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N LDFWDDVELNOGII-MXAVVETBSA-N 0.000 description 1
- YVCGJPIKRMGNPA-LSJOCFKGSA-N His-Met-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O YVCGJPIKRMGNPA-LSJOCFKGSA-N 0.000 description 1
- JSQIXEHORHLQEE-MEYUZBJRSA-N His-Phe-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JSQIXEHORHLQEE-MEYUZBJRSA-N 0.000 description 1
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 1
- NBWATNYAUVSAEQ-ZEILLAHLSA-N His-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O NBWATNYAUVSAEQ-ZEILLAHLSA-N 0.000 description 1
- VXZZUXWAOMWWJH-QTKMDUPCSA-N His-Thr-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VXZZUXWAOMWWJH-QTKMDUPCSA-N 0.000 description 1
- QTMKFZAYZKBFRC-BZSNNMDCSA-N His-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N)O QTMKFZAYZKBFRC-BZSNNMDCSA-N 0.000 description 1
- 101000894534 Hyphantria cunea Beta-1,3-glucan-binding protein Proteins 0.000 description 1
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 1
- BALLIXFZYSECCF-QEWYBTABSA-N Ile-Gln-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N BALLIXFZYSECCF-QEWYBTABSA-N 0.000 description 1
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 1
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 1
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 1
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 1
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 1
- UWLHDGMRWXHFFY-HPCHECBXSA-N Ile-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N1CCC[C@@H]1C(=O)O)N UWLHDGMRWXHFFY-HPCHECBXSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 1
- CEPIAEUVRKGPGP-DSYPUSFNSA-N Ile-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 CEPIAEUVRKGPGP-DSYPUSFNSA-N 0.000 description 1
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 1
- XOZOSAUOGRPCES-STECZYCISA-N Ile-Pro-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XOZOSAUOGRPCES-STECZYCISA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- XVUAQNRNFMVWBR-BLMTYFJBSA-N Ile-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N XVUAQNRNFMVWBR-BLMTYFJBSA-N 0.000 description 1
- RMJWFINHACYKJI-SIUGBPQLSA-N Ile-Tyr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RMJWFINHACYKJI-SIUGBPQLSA-N 0.000 description 1
- DZMWFIRHFFVBHS-ZEWNOJEFSA-N Ile-Tyr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N DZMWFIRHFFVBHS-ZEWNOJEFSA-N 0.000 description 1
- NSPNUMNLZNOPAQ-SJWGOKEGSA-N Ile-Tyr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N NSPNUMNLZNOPAQ-SJWGOKEGSA-N 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 1
- IIKJNQWOQIWWMR-CIUDSAMLSA-N Leu-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N IIKJNQWOQIWWMR-CIUDSAMLSA-N 0.000 description 1
- IASQBRJGRVXNJI-YUMQZZPRSA-N Leu-Cys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)NCC(O)=O IASQBRJGRVXNJI-YUMQZZPRSA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- AUNMOHYWTAPQLA-XUXIUFHCSA-N Leu-Met-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AUNMOHYWTAPQLA-XUXIUFHCSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- VHFFQUSNFFIZBT-CIUDSAMLSA-N Lys-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N VHFFQUSNFFIZBT-CIUDSAMLSA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 1
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 1
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 1
- GGAPIOORBXHMNY-ULQDDVLXSA-N Lys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)O GGAPIOORBXHMNY-ULQDDVLXSA-N 0.000 description 1
- PXHCFKXNSBJSTQ-KKUMJFAQSA-N Lys-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)O PXHCFKXNSBJSTQ-KKUMJFAQSA-N 0.000 description 1
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- NDSNUWJPZKTFAR-DCAQKATOSA-N Lys-Cys-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCCN NDSNUWJPZKTFAR-DCAQKATOSA-N 0.000 description 1
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 1
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 1
- AHFOKDZWPPGJAZ-SRVKXCTJSA-N Lys-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N AHFOKDZWPPGJAZ-SRVKXCTJSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 description 1
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 1
- XFANQCRHTMOEAP-WDSOQIARSA-N Lys-Pro-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XFANQCRHTMOEAP-WDSOQIARSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 1
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 1
- SEZADXQOJJTXPG-VFAJRCTISA-N Lys-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N)O SEZADXQOJJTXPG-VFAJRCTISA-N 0.000 description 1
- SUZVLFWOCKHWET-CQDKDKBSSA-N Lys-Tyr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O SUZVLFWOCKHWET-CQDKDKBSSA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- 241001599018 Melanogaster Species 0.000 description 1
- FWTBMGAKKPSTBT-GUBZILKMSA-N Met-Gln-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FWTBMGAKKPSTBT-GUBZILKMSA-N 0.000 description 1
- QZPXMHVKPHJNTR-DCAQKATOSA-N Met-Leu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O QZPXMHVKPHJNTR-DCAQKATOSA-N 0.000 description 1
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 1
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 1
- MIAZEQZXAFTCCG-UBHSHLNASA-N Met-Phe-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 MIAZEQZXAFTCCG-UBHSHLNASA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- ALTHVGNGGZZSAC-SRVKXCTJSA-N Met-Val-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N ALTHVGNGGZZSAC-SRVKXCTJSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- LBSARGIQACMGDF-WBAXXEDZSA-N Phe-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 LBSARGIQACMGDF-WBAXXEDZSA-N 0.000 description 1
- DPUOLKQSMYLRDR-UBHSHLNASA-N Phe-Arg-Ala Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 DPUOLKQSMYLRDR-UBHSHLNASA-N 0.000 description 1
- GNUCSNWOCQFMMC-UFYCRDLUSA-N Phe-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 GNUCSNWOCQFMMC-UFYCRDLUSA-N 0.000 description 1
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 1
- VUYCNYVLKACHPA-KKUMJFAQSA-N Phe-Asp-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VUYCNYVLKACHPA-KKUMJFAQSA-N 0.000 description 1
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 1
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 1
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- CZQZSMJXFGGBHM-KKUMJFAQSA-N Phe-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O CZQZSMJXFGGBHM-KKUMJFAQSA-N 0.000 description 1
- YDUGVDGFKNXFPL-IXOXFDKPSA-N Phe-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YDUGVDGFKNXFPL-IXOXFDKPSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- CVAUVSOFHJKCHN-BZSNNMDCSA-N Phe-Tyr-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=CC=C1 CVAUVSOFHJKCHN-BZSNNMDCSA-N 0.000 description 1
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- BQMFWUKNOCJDNV-HJWJTTGWSA-N Phe-Val-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQMFWUKNOCJDNV-HJWJTTGWSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- 101000894541 Plodia interpunctella Beta-1,3-glucan-binding protein Proteins 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 1
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 1
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 1
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 1
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 1
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 1
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 1
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 1
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 1
- OFSZYRZOUMNCCU-BZSNNMDCSA-N Pro-Trp-Met Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCSC)C(O)=O)C(=O)[C@@H]1CCCN1 OFSZYRZOUMNCCU-BZSNNMDCSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 101100393821 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GSP2 gene Proteins 0.000 description 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 1
- BLPYXIXXCFVIIF-FXQIFTODSA-N Ser-Cys-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N)CN=C(N)N BLPYXIXXCFVIIF-FXQIFTODSA-N 0.000 description 1
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 1
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- ZFVFHHZBCVNLGD-GUBZILKMSA-N Ser-His-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFVFHHZBCVNLGD-GUBZILKMSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 1
- SRKMDKACHDVPMD-SRVKXCTJSA-N Ser-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N SRKMDKACHDVPMD-SRVKXCTJSA-N 0.000 description 1
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 1
- QJKPECIAWNNKIT-KKUMJFAQSA-N Ser-Lys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QJKPECIAWNNKIT-KKUMJFAQSA-N 0.000 description 1
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 1
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- 229940122055 Serine protease inhibitor Drugs 0.000 description 1
- 101710102218 Serine protease inhibitor Proteins 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 1
- PAOYNIKMYOGBMR-PBCZWWQYSA-N Thr-Asn-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PAOYNIKMYOGBMR-PBCZWWQYSA-N 0.000 description 1
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 1
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 1
- UTCFSBBXPWKLTG-XKBZYTNZSA-N Thr-Cys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O UTCFSBBXPWKLTG-XKBZYTNZSA-N 0.000 description 1
- GCXFWAZRHBRYEM-NUMRIWBASA-N Thr-Gln-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O GCXFWAZRHBRYEM-NUMRIWBASA-N 0.000 description 1
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 1
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- YSXYEJWDHBCTDJ-DVJZZOLTSA-N Thr-Gly-Trp Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O YSXYEJWDHBCTDJ-DVJZZOLTSA-N 0.000 description 1
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 1
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 1
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 1
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 1
- WLBZWXXGSOLJBA-HOCLYGCPSA-N Trp-Gly-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 WLBZWXXGSOLJBA-HOCLYGCPSA-N 0.000 description 1
- HABYQJRYDKEVOI-IHPCNDPISA-N Trp-His-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CCCCN)C(=O)O)N HABYQJRYDKEVOI-IHPCNDPISA-N 0.000 description 1
- WKCFCVBOFKEVKY-HSCHXYMDSA-N Trp-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WKCFCVBOFKEVKY-HSCHXYMDSA-N 0.000 description 1
- RQKMZXSRILVOQZ-GMVOTWDCSA-N Trp-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N RQKMZXSRILVOQZ-GMVOTWDCSA-N 0.000 description 1
- IEESWNWYUOETOT-BVSLBCMMSA-N Trp-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1ccccc1)C(O)=O IEESWNWYUOETOT-BVSLBCMMSA-N 0.000 description 1
- XMNDQSYABVWZRK-BZSNNMDCSA-N Tyr-Asn-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XMNDQSYABVWZRK-BZSNNMDCSA-N 0.000 description 1
- CRHFOYCJGVJPLE-AVGNSLFASA-N Tyr-Gln-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CRHFOYCJGVJPLE-AVGNSLFASA-N 0.000 description 1
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 1
- LHTGRUZSZOIAKM-SOUVJXGZSA-N Tyr-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O LHTGRUZSZOIAKM-SOUVJXGZSA-N 0.000 description 1
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 1
- HVPPEXXUDXAPOM-MGHWNKPDSA-N Tyr-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HVPPEXXUDXAPOM-MGHWNKPDSA-N 0.000 description 1
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 1
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 1
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 1
- BGFCXQXETBDEHP-BZSNNMDCSA-N Tyr-Phe-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O BGFCXQXETBDEHP-BZSNNMDCSA-N 0.000 description 1
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 1
- XKVXSCHXGJOQND-ZOBUZTSGSA-N Val-Asp-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N XKVXSCHXGJOQND-ZOBUZTSGSA-N 0.000 description 1
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 1
- ICFRWCLVYFKHJV-FXQIFTODSA-N Val-Cys-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N ICFRWCLVYFKHJV-FXQIFTODSA-N 0.000 description 1
- SRWWRLKBEJZFPW-IHRRRGAJSA-N Val-Cys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N SRWWRLKBEJZFPW-IHRRRGAJSA-N 0.000 description 1
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 1
- KNYHAWKHFQRYOX-PYJNHQTQSA-N Val-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N KNYHAWKHFQRYOX-PYJNHQTQSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- RFKJNTRMXGCKFE-FHWLQOOXSA-N Val-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC(C)C)C(O)=O)=CNC2=C1 RFKJNTRMXGCKFE-FHWLQOOXSA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 1
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 1
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- IRAUYEAFPFPVND-UVBJJODRSA-N Val-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 IRAUYEAFPFPVND-UVBJJODRSA-N 0.000 description 1
- LNWSJGJCLFUNTN-ZOBUZTSGSA-N Val-Trp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LNWSJGJCLFUNTN-ZOBUZTSGSA-N 0.000 description 1
- AYHNXCJKBLYVOA-KSZLIROESA-N Val-Trp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N AYHNXCJKBLYVOA-KSZLIROESA-N 0.000 description 1
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 1
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 101150092805 actc1 gene Proteins 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 108010039538 alanyl-glycyl-aspartyl-valine Proteins 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 230000002152 alkylating effect Effects 0.000 description 1
- 238000003277 amino acid sequence analysis Methods 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 108010089975 arginyl-glycyl-aspartyl-serine Proteins 0.000 description 1
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 238000002306 biochemical method Methods 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 230000009918 complex formation Effects 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 230000007123 defense Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 238000004090 dissolution Methods 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 210000002468 fat body Anatomy 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 238000010230 functional analysis Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 108010040856 glutamyl-cysteinyl-alanine Proteins 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 208000001786 gonorrhea Diseases 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 230000007124 immune defense Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 230000002427 irreversible effect Effects 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 235000013372 meat Nutrition 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 229910000160 potassium phosphate Inorganic materials 0.000 description 1
- 229940093916 potassium phosphate Drugs 0.000 description 1
- 235000011009 potassium phosphates Nutrition 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 238000000164 protein isolation Methods 0.000 description 1
- 230000002797 proteolythic effect Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 150000003354 serine derivatives Chemical class 0.000 description 1
- 239000003001 serine protease inhibitor Substances 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 108010005652 splenotritin Proteins 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 229960004072 thrombin Drugs 0.000 description 1
- HRXKRNGNAMMEHJ-UHFFFAOYSA-K trisodium citrate Chemical compound [Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O HRXKRNGNAMMEHJ-UHFFFAOYSA-K 0.000 description 1
- 229940038773 trisodium citrate Drugs 0.000 description 1
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010071635 tyrosyl-prolyl-arginine Proteins 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 235000013311 vegetables Nutrition 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Classifications
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/43504—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from invertebrates
- C07K14/43563—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from invertebrates from insects
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/569—Immunoassay; Biospecific binding assay; Materials therefor for microorganisms, e.g. protozoa, bacteria, viruses
- G01N33/56911—Bacteria
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Immunology (AREA)
- Molecular Biology (AREA)
- Tropical Medicine & Parasitology (AREA)
- Hematology (AREA)
- Biomedical Technology (AREA)
- Zoology (AREA)
- Biochemistry (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Urology & Nephrology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Toxicology (AREA)
- Genetics & Genomics (AREA)
- Biophysics (AREA)
- Gastroenterology & Hepatology (AREA)
- Biotechnology (AREA)
- Cell Biology (AREA)
- Virology (AREA)
- Microbiology (AREA)
- Insects & Arthropods (AREA)
- Food Science & Technology (AREA)
- Physics & Mathematics (AREA)
- Analytical Chemistry (AREA)
- General Physics & Mathematics (AREA)
- Pathology (AREA)
- Peptides Or Proteins (AREA)
Abstract
The present invention provides a novel protein involved in peptidoglycan recognition signal transduction and a gene encoding it. The present invention also provides a kit for detecting bacterial infection in a sample, comprising the protein involved in peptidoglycan recognition signal transduction. The protein newly identified by the present invention can be usefully employed in kits for detecting bacterial infection in samples such as blood.
Peptidoglycan, pro-phenoloxidase, mealworm
Description
The present invention relates to a novel protein involved in peptidoglycan recognition signal transduction in the mealworm (Tenebrio molitor), a gene encoding the protein, or a kit comprising the protein for detecting bacterial infection in a sample.
Recent genetic engineering studies have reported that Drosophila PGRP-SA and Drosophila PGRP-SD, peptidoglycan (PG) recognition proteins of the fruit fly (Drosophila melanogaster), activate the Toll pathway (Michel, T., Reichhart, J. M., Hoffmann, J. A. & Royet, J. (2001) Nature 414, 756-759; Bischoff, V., Vignal, C., Boneca, I. G., Michel, T., Hoffmann, J. A. & Royet, J. (2004) Nat Immunol 5, 1175-1180), and that Drosophila PGRP-LC and Drosophila PGRP-LE are receptors for the Imd pathway (Gottar, M., Gobert, V., Michel, T., Belvin, M., Duyk, G., Hoffmann, J. A., Ferrandon, D. & Royet, J. (2002) Nature 416, 640-644; Choe, K. M., Werner, T., Stoven, S., Hultmark, D. & Anderson, K. V. (2002) Science 296, 359-362; Takehana, A., Katsuyama, T., Yano, T., Oshima, Y., Takada, H., Aigaki, T. & Kurata, S. (2002) Proc Natl Acad Sci USA 99, 13705-13710). Meanwhile, the immune phenotype of loss-of-function mutants of Drosophila Gram-negative bacteria binding protein 1 (Drosophila GNBP1) is indistinguishable from that of Drosophila PGRP-SA mutants, which indicates that both proteins are required to activate the Toll pathway in response to infection by Gram-positive bacteria (Gobert, V., Gottar, M., Matskevich, A. A., Rutschmann, S., Royet, J., Belvin, M., Hoffmann, J. A. & Ferrandon, D. (2003) Science 302, 2126-2130; Pili-Floury, S., Leulier, F., Takahashi, K., Saigo, K., Samain, E., Ueda, R. & Lemaitre, B. (2004) J Biol Chem 279, 12848-12853; Wang, L., Weber, A. N., Atilano, M. L., Filipe, S. R., Gay, N. J. & Ligoxygakis, P. (2006) EMBO J 25, 5005-5014). However, the molecular mechanism of the upstream part of the Toll pathway in the recognition of Gram-positive bacteria has not yet been clearly elucidated.
The pro-phenoloxidase (pro-PO) activation cascade, which induces melanization of invading pathogens, is another important innate immune defense mechanism in invertebrates and is triggered by peptidoglycan (PG) and β-1,3-glucan (Cerenius, L. & Soderhall, K. (2004) Immunol Rev 198, 116-126; Kanost, M. R., Jiang, H. & Yu, X. Q. (2004) Immunol Rev 198, 97-105). Like the vertebrate complement system, the pro-PO cascade is a proteolytic cascade in plasma. The pro-PO system is therefore a good model system for biochemical studies of PG and β-1,3-glucan recognition and the subsequent signaling under cell-free conditions.
The present inventors previously identified a peptidoglycan recognition protein (PGRP) of the mealworm (Tenebrio molitor) that shows high sequence homology with Drosophila PGRP-SA, and showed that this PGRP, named Tenebrio PGRP-SA, activates the Lys-PG-dependent pro-PO system in Tenebrio (Park, J. W., Je, B. R., Piao, S., Inamura, S., Fujimoto, Y., Fukase, K., Kusumoto, S., Soderhall, K., Ha, N. C. & Lee, B. L. (2006) J Biol Chem 281, 7747-7755). The inventors also carried out various studies to elucidate how the Lys-PG recognition signal is transmitted downstream, using biochemical approaches based on the in vivo Drosophila Toll pathway, the in vitro pro-PO system, and recombinant PGRP-SA. In that work they isolated novel proteins involved in the pro-PO system and showed that these proteins can be used to detect bacterial infection in samples such as blood (Korean Patent Application No. 10-2007-0095196; Korean Patent Application No. 10-2007-0013231; Park JW, Kim CH, Kim JH, Je BR, Roh KB, Kim SJ, Lee HH, Ryu JH, Lim JH, Oh BH, Lee WJ, Ha NC, Lee BL, Proc Natl Acad Sci U S A 104, 6602-6607 (2007)).
In addition, the present inventors isolated a mealworm-derived protease of about 41 kDa (Tenebrio Tm-41), a new protein involved in the peptidoglycan recognition process. They confirmed that Tenebrio Tm-41 is activated by the mealworm modular serine protease (Tenebrio modular serine protease, Tenebrio-MSP), an upstream protease activated by the peptidoglycan recognition signal, and showed that the protein can be usefully employed to detect bacterial infection in samples such as blood (Korean Patent Application No. 10-2007-0095195, filed September 19, 2007).
Using proteins involved in the pro-PO system, the present inventors undertook various studies to elucidate, at the molecular level, the specific mechanism by which the peptidoglycan recognition signal is transmitted downstream. As a result, they isolated a new protein involved in the peptidoglycan recognition process and found that this protein is activated by the activated form of Tenebrio Tm-41, and that the activated protein in turn cleaves the downstream Spatzle protein to activate the Toll pathway. The protein can therefore be usefully employed to detect bacterial infection in samples such as blood.
Accordingly, an object of the present invention is to provide a novel protein involved in the Toll activation pathway and the pro-phenoloxidase (pro-PO) activation system, which are host defense responses of insects, and a gene encoding the protein.
Another object of the present invention is to provide a kit for detecting bacterial infection in a sample using the protein.
According to one aspect of the present invention, there is provided a mealworm-derived protein consisting of the amino acid sequence of SEQ ID NO: 1.
According to another aspect of the present invention, there is provided a gene encoding the mealworm-derived protein consisting of the amino acid sequence of SEQ ID NO: 1; preferably, the gene has the nucleotide sequence of SEQ ID NO: 2.
According to still another aspect of the present invention, there is provided a kit for detecting bacterial infection in a sample, comprising the mealworm-derived protein consisting of the amino acid sequence of SEQ ID NO: 1.
In the kit according to the present invention, the sample may be blood for transfusion, mammalian blood, food, tap water, groundwater, rainwater, or a sterile product, and is preferably blood for transfusion or mammalian blood. The kit may be in the form of a solution, a lyophilized powder, a frozen solution, or a strip.
The detection kit according to the present invention may further comprise Tenebrio PGRP-SA consisting of the amino acid sequence of SEQ ID NO: 3; and one or more proteins selected from the group consisting of Tenebrio GNBP1 consisting of the amino acid sequence of SEQ ID NO: 4, Tenebrio MSP-1 consisting of the amino acid sequence of SEQ ID NO: 5, Tenebrio MSP-2 consisting of the amino acid sequence of SEQ ID NO: 6, and Tenebrio Tm-41 consisting of the amino acid sequence of SEQ ID NO: 7. If necessary, the kit may additionally comprise β-lytic protease (blp), lysozyme, or both blp and lysozyme.
The present invention identifies a factor involved in peptidoglycan recognition signal transduction (namely, the protein consisting of the amino acid sequence of SEQ ID NO: 1), and this protein can be usefully employed in the manufacture of kits for detecting bacterial infection in samples such as blood. That is, the protein newly identified by the present invention can be made into a kit for detecting bacterial infection, either together with or separately from the peptidoglycan recognition proteins found to be involved in the pro-PO system in the inventors' earlier work (Korean Patent Application No. 10-2007-0095196; Korean Patent Application No. 10-2007-0013231; Proc Natl Acad Sci U S A 104, 6602-6607 (2007)), namely Tenebrio PGRP-SA consisting of the amino acid sequence of SEQ ID NO: 3; and one or more proteins selected from the group consisting of Tenebrio GNBP1 consisting of the amino acid sequence of SEQ ID NO: 4, Tenebrio MSP-1 consisting of the amino acid sequence of SEQ ID NO: 5, Tenebrio MSP-2 consisting of the amino acid sequence of SEQ ID NO: 6, and Tenebrio Tm-41 consisting of the amino acid sequence of SEQ ID NO: 7.
The present inventors previously found that a complex of the mealworm peptidoglycan recognition protein, Tenebrio PGRP-SA, with PG activates the pro-PO system and the Toll pathway by recruiting proteins involved in downstream signaling. They further found that the proteins involved in this downstream signaling are Tenebrio GNBP1, a protein similar to Gram-negative bacteria binding protein 1 (GNBP1), and the Tenebrio multi-domain containing modular SP (MSP), which has a low-density lipoprotein-like domain and a complement control protein-like domain at its N-terminus, and that Tenebrio MSP exists in two forms, Tenebrio MSP-1 and/or Tenebrio MSP-2 (Korean Patent Application No. 10-2007-0013231; Park JW, Kim CH, Kim JH, Je BR, Roh KB, Kim SJ, Lee HH, Ryu JH, Lim JH, Oh BH, Lee WJ, Ha NC, Lee BL, Proc Natl Acad Sci U S A 104, 6602-6607 (2007)). They also found Tenebrio Tm-41, a protein activated by the mealworm modular serine protease (Tenebrio modular serine protease, Tenebrio-MSP), an upstream protease activated by the peptidoglycan recognition signal (Korean Patent Application No. 10-2007-0095196).
Building on these earlier results, the present inventors studied the recognition mechanism at the molecular level and, surprisingly, isolated a new protein involved in the peptidoglycan recognition process. Sequence analysis showed that the protein has the amino acid sequence of SEQ ID NO: 1 and that the gene encoding it has the nucleotide sequence of SEQ ID NO: 2. Functional analysis of the protein having the amino acid sequence of SEQ ID NO: 1 newly revealed that it is activated by the mealworm serine protease (Tenebrio Tm-41), an upstream protease activated by the peptidoglycan recognition signal. Accordingly, in a kit for detecting bacterial infection that comprises the mealworm Gram-negative bacteria binding protein 1 (Tenebrio GNBP1), the mealworm modular serine protease (Tenebrio MSP; Tenebrio MSP-1 and/or Tenebrio MSP-2), the mealworm serine protease (Tenebrio Tm-41), and the like, the protein newly identified by the present invention can function as the enzyme that cleaves the final substrate. The protein can therefore be used to detect bacterial infection in samples such as blood.
Accordingly, the present invention provides a mealworm-derived protein consisting of the amino acid sequence of SEQ ID NO: 1.
The present invention also provides a gene encoding the mealworm-derived protein consisting of the amino acid sequence of SEQ ID NO: 1, preferably a gene consisting of the nucleotide sequence of SEQ ID NO: 2.
According to one embodiment of the present invention, there is provided a kit for detecting bacterial infection in a sample, comprising the mealworm-derived protein consisting of the amino acid sequence of SEQ ID NO: 1.
In the detection kit, the sample includes blood for transfusion; blood of mammals, including humans; foods such as vegetables, meat, and fruit; cooked or uncooked foods; water, including tap water, groundwater, and rainwater; sterile products; and any other sample in which microorganisms need to be detected. Preferably, the detection kit of the present invention is used to detect bacterial infection in blood for transfusion or in the blood of mammals, including humans.
The detection kit of the present invention may contain reagents for detecting reactivity, for example an amino acid or peptide conjugated to para-nitroaniline (p-nitroaniline), other pro-PO activating factor proteins, and a chromogenic substrate for the pro-PO enzyme. The detection kit may be in the form of a solution, a lyophilized powder, a frozen solution, or a strip, and each form can be formulated by methods conventional in the art. For example, a kit in solution form can be formulated by adding the protein(s), separately or as a mixture, to a buffer such as sodium phosphate, potassium phosphate, Tris-HCl, or one of various other buffers, and the solution may be frozen or lyophilized as needed.
The detection kit of the present invention may be prepared as a separate detection kit containing the mealworm-derived protein consisting of the amino acid sequence of SEQ ID NO: 1, or as a detection kit that contains this protein together with, or in a container separate from, the peptidoglycan recognition proteins found in the inventors' earlier work to be involved in the pro-PO system.
That is, in addition to the mealworm-derived protein consisting of the amino acid sequence of SEQ ID NO: 1, the detection kit of the present invention may further comprise the peptidoglycan recognition proteins found to be involved in the pro-PO system in the inventors' earlier work (Korean Patent Application No. 10-2007-0095196; Korean Patent Application No. 10-2007-0013231; Park JW, Kim CH, Kim JH, Je BR, Roh KB, Kim SJ, Lee HH, Ryu JH, Lim JH, Oh BH, Lee WJ, Ha NC, Lee BL, Proc Natl Acad Sci U S A 104, 6602-6607 (2007)), namely Tenebrio PGRP-SA consisting of the amino acid sequence of SEQ ID NO: 3; and one or more proteins selected from the group consisting of Tenebrio GNBP1 consisting of the amino acid sequence of SEQ ID NO: 4, Tenebrio MSP-1 consisting of the amino acid sequence of SEQ ID NO: 5, Tenebrio MSP-2 consisting of the amino acid sequence of SEQ ID NO: 6, and Tenebrio Tm-41 consisting of the amino acid sequence of SEQ ID NO: 7.
The detection kit of the present invention may further comprise β-lytic protease (blp) and/or lysozyme. The blp may be derived from various microorganisms, including soil microorganisms. For example, the blp may be derived from a microorganism of the genus Achromobacter, preferably Achromobacter lyticus, and more preferably Achromobacter lyticus ATCC 21456 or Achromobacter lyticus ATCC 21457. The blp may also be purified from a commercially available crude achromopeptidase preparation (Wako Pure Chemical Institute, 014-09661) according to a known method (Li, S., Norioka, S. & Sakiyama, F. (1998) J Biochem (Tokyo) 124, 332-339). The lysozyme may be any conventional, commercially available lysozyme.
Hereinafter, the present invention is described in more detail by way of examples. The following examples are intended to illustrate the invention and do not limit its scope.
Example
1. Purification of the protein
(1) Preparation of antibodies against proteins involved in the peptidoglycan recognition process
Antibodies were raised against each of the proteins shown in the inventors' earlier work to be involved in the peptidoglycan recognition process, namely Tm-PGRP-SA (Tenebrio PG recognition protein) (J Biol Chem. 2006;281:7747-55), Tm-GNBP3 (Tenebrio Gram-negative binding protein 3) (J Biol Chem. 2003;278:42072-9), Tm-GNBP1 (Tenebrio Gram-negative binding protein 1) (Proc Natl Acad Sci U S A 104, 6602-6607 (2007)), and Tm-MSP (Tenebrio modular serine protease) (Proc Natl Acad Sci U S A 104, 6602-6607 (2007)). For Tm-PGRP-SA, the recombinant protein was used. For Tm-GNBP1, Tm-GNBP3, and Tm-MSP, peptides were chemically synthesized from partial amino acid sequence information and conjugated to the carrier protein keyhole limpet hemocyanin (KLH): KLH-cys-LEAYEPKGFRAS-NH2 for Tm-GNBP1, KLH-cys-YFDGKNKLGYPNDDQKF-NH2 for Tm-GNBP3, and KLH-cys-VNGKPVKKGDYPWQ-NH2 for Tm-MSP. Each conjugate was mixed with adjuvant and injected subcutaneously into rabbits four times over four weeks, and antibodies against each protein were obtained from the rabbit serum.
(2) Preparation of eluate fractions from the body fluid of the mealworm (Tenebrio molitor)
Body fluid (4,000 ml, 43.7 g of protein) was collected from the mealworm (Tenebrio molitor), and non-specifically activated serine proteases were inactivated as follows: a 0.5 mM solution of diisopropyl fluorophosphate (DFP), an irreversible serine protease inhibitor, was added to the body fluid, which was then dialyzed against 15 liters of buffer A (50 mM Tris-HCl, 3 mM EDTA, pH 6.0) for 12 hours. The dialyzed sample was centrifuged to remove the residue, and the supernatant was recovered, loaded onto a Toyopearl AF-Heparin HC 650M column, and washed thoroughly with buffer A. Bound proteins were eluted with a linear gradient using buffer A containing 1 M NaCl, and the eluate was divided into three main fractions (hereinafter E1, E2, and E3; see FIG. 1).
(3) Immunoblot analysis and in vitro activity reconstitution assay
To determine how the proteins involved in the peptidoglycan recognition process are distributed among fractions E1, E2, and E3, immunoblot analysis was performed with each of the antibodies prepared in (1). The analysis showed that GNBP1 is present in E1, MSP and PGRP-SA in E2, and GNBP3 in E3 (see FIG. 2).
In addition, an in vitro activity reconstitution assay was performed as follows. A 5 μL aliquot of each of the E1, E2, and E3 fractions was mixed with 10 μL of decoagulation buffer (30 mM trisodium citrate, 26 mM citric acid, 20 mM EDTA, and 15 mM sodium chloride, pH 4.6), 10 μL of Lys-PGN (Korean Patent Application No. 10-2007-0013231; Proc Natl Acad Sci U S A 104, 6602-6607 (2007)) or β-1,3-glucan solution, and 15 μL of buffer A, and the mixture was pre-incubated for 5 minutes. Then 443 μL of 20 mM Tris-HCl (pH 8.0), 5 μL of 1 M CaCl2 solution (added or omitted), and 2 μL of 10 mM α-thrombin substrate (Boc-Val-Pro-Arg-MCA) were added, the reaction was run at 30 °C for 20 minutes, and the fluorescence intensity of the released methylcoumarin was measured. The results are shown in FIG. 3.
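As a quick arithmetic check on the assay mix described above, the listed volumes can be summed to give the final reaction volume and the final concentrations of CaCl2 and substrate. The sketch below assumes that all three fractions (5 μL each) are combined in one tube and that volumes are additive; the names `volumes_ul` and `final_conc_mm` are illustrative and do not appear in the source.

```python
# Volumes (in microliters) listed for the in vitro activity reconstitution assay.
# Assumption: E1, E2 and E3 (5 uL each) are combined in a single reaction.
volumes_ul = {
    "E1": 5.0,
    "E2": 5.0,
    "E3": 5.0,
    "decoagulation buffer": 10.0,
    "Lys-PGN or beta-1,3-glucan": 10.0,
    "buffer A": 15.0,
    "20 mM Tris-HCl, pH 8.0": 443.0,
    "1 M CaCl2": 5.0,
    "10 mM substrate": 2.0,
}


def final_conc_mm(stock_mm: float, added_ul: float, total_ul: float) -> float:
    """Dilution rule C1*V1 = C2*V2, solved for the final concentration C2 (mM)."""
    return stock_mm * added_ul / total_ul


total_ul = sum(volumes_ul.values())
# 1 M CaCl2 stock expressed as 1000 mM.
cacl2_mm = final_conc_mm(1000.0, volumes_ul["1 M CaCl2"], total_ul)
# Substrate converted from mM to uM for readability.
substrate_um = 1000.0 * final_conc_mm(10.0, volumes_ul["10 mM substrate"], total_ul)

print(f"total volume: {total_ul:.0f} uL")
print(f"final CaCl2: {cacl2_mm:.1f} mM")
print(f"final substrate: {substrate_um:.0f} uM")
```

Under these assumptions the volumes add up to 500 μL, which corresponds to about 10 mM CaCl2 and 40 μM substrate in the final reaction.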
As shown in FIG. 3, the E1, E2, and E3 fractions showed no peptidoglycan-specific activity in the absence of peptidoglycan (column 2) or in the absence of Ca2+ ions (column 3), and combinations lacking any one of the fractions showed no amidase activity even in the presence of peptidoglycan (columns 4, 5, and 6). In particular, no peptidoglycan-specific amidase activity was observed (column 6) even though the proteins involved in peptidoglycan recognition, Tm-PGRP-SA, GNBP1, and MSP, are all present in E1 and E2. In contrast, peptidoglycan-specific amidase activity was observed when peptidoglycan, Ca2+ ions, and all of the E1, E2, and E3 fractions were present (column 1).
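The pattern reported for FIG. 3 amounts to an all-or-nothing requirement, which can be written as a small truth table. This is only a sketch of the logic described in the text, not a model of the enzymology; the names `REQUIRED` and `amidase_active` are hypothetical, and column 6 is assumed to contain peptidoglycan and Ca2+ in addition to E1 and E2.

```python
# Components that, according to the text, must all be present for
# peptidoglycan-specific amidase activity (FIG. 3, column 1).
REQUIRED = frozenset({"PG", "Ca2+", "E1", "E2", "E3"})


def amidase_active(present) -> bool:
    """Return True only if every required component is in the reaction."""
    return REQUIRED <= set(present)


# Conditions explicitly described in the text. Columns 4 and 5 each lack one
# fraction, but the text does not say which, so they are not modelled here.
conditions = {
    "column 1 (complete)": {"PG", "Ca2+", "E1", "E2", "E3"},
    "column 2 (no PG)": {"Ca2+", "E1", "E2", "E3"},
    "column 3 (no Ca2+)": {"PG", "E1", "E2", "E3"},
    # Assumption: PG and Ca2+ were present in column 6.
    "column 6 (E1 + E2 only)": {"PG", "Ca2+", "E1", "E2"},
}

for name, present in conditions.items():
    state = "active" if amidase_active(present) else "inactive"
    print(f"{name}: {state}")
```

Only the complete mixture satisfies the requirement, which matches the observation that E1 and E2 together are not sufficient even though they contain the known recognition proteins.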
These results suggest that, once the proteins that recognize bacterial peptidoglycan have bound peptidoglycan, they activate downstream protease(s) present in inactive zymogen form, and that the fractions contain a protease that cleaves the α-thrombin substrate Boc-Val-Pro-Arg-MCA to release the fluorescent compound methylcoumarin. They also suggest that each fraction contains a new, as yet unidentified factor that regulates the peptidoglycan recognition process. In particular, they suggest that the E2 fraction contains a downstream protease in inactive zymogen form.
(4) Isolation of a novel protein from the E2 fraction
Step 1 - Q-Sepharose Fast Flow column chromatography: The E2 fraction obtained in (2) (total protein: 737 mg) was loaded onto a Q-Sepharose Fast Flow column. Each flow-through fraction was mixed with the activated form of Tenebrio Tm-41, a serine protease derived from the mealworm beetle (Korean Patent Application No. 10-2007-0095196), and the fractions showing amidase activity were pooled (total protein: 514 mg).
Step 2 - CM-Toyopearl 650M column chromatography: The pooled fraction obtained in Step 1 was ultrafiltered with buffer A and loaded onto a CM-Toyopearl 650M column (30 mm x 150 mm), and the protein fractions were collected and concentrated (total protein: 83 mg).
Step 3 - HiTrap Heparin FPLC column chromatography: The concentrate obtained in Step 2 was loaded onto a HiTrap Heparin FPLC column equilibrated with 20 mM Tris-HCl buffer (3 mM EDTA, pH 8.0). After washing with the same buffer, the remaining proteins were eluted with 400 mL of buffer at a flow rate of 4 mL per minute using a NaCl gradient of 0 to 1.0 M.
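The gradient parameters in Step 3 determine the run time and the salt concentration at any elution volume. The following is a rough sketch that assumes the gradient is linear over the full 400 mL, which the text does not state explicitly; the function name `nacl_molar` is illustrative.

```python
GRADIENT_ML = 400.0      # total gradient volume from Step 3
FLOW_ML_PER_MIN = 4.0    # flow rate from Step 3
START_M, END_M = 0.0, 1.0  # NaCl gradient limits from Step 3


def nacl_molar(volume_ml):
    """NaCl concentration (M) at a given elution volume, assuming a linear gradient."""
    fraction = min(max(volume_ml / GRADIENT_ML, 0.0), 1.0)
    return START_M + (END_M - START_M) * fraction


gradient_minutes = GRADIENT_ML / FLOW_ML_PER_MIN

print(f"gradient duration: {gradient_minutes:.0f} min")
print(f"NaCl at 200 mL: {nacl_molar(200):.2f} M")
```

The same arithmetic applies to the 25 mL phosphate gradient in Step 4 by changing the constants.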
Step 4 - Hydroxylapatite column chromatography: The eluate obtained in Step 3 was loaded onto a hydroxylapatite FPLC column (5 mm x 50 mm, Bio-Rad) equilibrated with 20 mM sodium phosphate buffer (pH 7.0) containing 3 mM EDTA. The column was washed with 4 mL of the same buffer and then eluted with a 25 mL gradient of 20 to 500 mM sodium phosphate, and the protein fractions were pooled (total protein: 1.1 mg).
Step 5 - TSKgel G2000SW size exclusion column chromatography: The pooled fraction obtained in Step 4 was loaded onto a TSKgel G2000SW column (4.6 mm x 30 cm) and eluted with 50 mM sodium phosphate buffer (containing 0.3 M NaCl, pH 7.0) at a flow rate of 0.3 mL per minute. The fractions containing pure protein were pooled (protein: 90 μg). The protein obtained had a molecular weight of about 44 kDa and was used as the zymogen form in the characterization tests below.
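The protein amounts reported for Steps 1 to 5 can be summarized as recoveries relative to the starting E2 fraction. This is a minimal sketch using only the masses stated in the text; Step 3 is omitted because no amount is given for it, and the figures are total-protein recoveries, not activity yields.

```python
# (step label, total protein in mg) as reported in the text.
steps = [
    ("E2 fraction (start)", 737.0),
    ("Step 1 Q-Sepharose FF", 514.0),
    ("Step 2 CM-Toyopearl 650M", 83.0),
    ("Step 4 hydroxylapatite", 1.1),
    ("Step 5 TSKgel G2000SW", 0.090),  # 90 ug expressed in mg
]

start_mg = steps[0][1]
recovery_pct = {name: 100.0 * mg / start_mg for name, mg in steps}
fold_reduction = start_mg / steps[-1][1]

for name, mg in steps:
    print(f"{name:28s} {mg:9.3f} mg  {recovery_pct[name]:8.4f} %")
print(f"overall reduction in total protein: about {fold_reduction:.0f}-fold")
```

The large drop between Step 2 and Step 4 is where most of the bulk protein is removed, which is consistent with a low-abundance zymogen being enriched.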
Step 6 - Benzamidine-Sepharose 6B adsorption column chromatography: To purify the activated form of the protein, the flow-through obtained in Step 1 (104 mg of protein) was reacted with Tenebrio Tm-41 (Korean Patent Application No. 10-2007-0095196). The solution containing the activated form of the protein was loaded onto a Benzamidine-Sepharose 6B (Pharmacia Biotech) column equilibrated with 50 mM Tris-HCl buffer (containing 0.5 M NaCl, pH 8.0). The column was washed with the same buffer, and the adsorbed protein was eluted with 20 mM 4-aminobenzamidine in the same buffer. The eluate containing the activated protein was further purified by HiTrap Heparin FPLC column chromatography and TSKgel G2000SW size exclusion column chromatography, performed sequentially as described above. The fractions containing the active form of the protein were pooled to give 60 μg of purified protein (molecular weight about 44 kDa), which was named Tm-44.
The Tm-44 obtained was analyzed by electrophoresis, and the result is shown in FIG. 4 (specifically, lane 5 of (A)). A pure protein band with a gel mobility of about 44 kDa can be seen on 12% SDS-PAGE under reducing conditions.
2. Protein and nucleic acid sequence analysis
(1) N-terminal sequence and partial amino acid sequence analysis of the Tm-44 protein
The Tm-44 protein was reduced and alkylated, treated with lysylendopeptidase, and separated by HPLC. The N-terminal sequence and partial amino acid sequences determined by Edman degradation are shown in Table 1 below.
[Table 1; surviving column header: Peak number]
(2) cDNA cloning and nucleotide sequence of Tm-44
Reverse transcription - Reverse transcription was performed using total RNA obtained from fat bodies extracted from Tenebrio larvae. Superscript II (Invitrogen) was used as the reverse transcriptase.
Partial cDNA sequence generation - Based on the amino acid sequence information of the Tm-44 protein obtained in (1), the forward primer was designed as 5'-GAYGARTTYCCNTGGATGGC-3' and the reverse primer as 5'-CCARTTNGCCATNCCRCANGG-3', and a partial cDNA fragment was obtained by degenerate PCR. The amplified DNA was subjected to agarose gel electrophoresis and cloned into the pCR2.1-TOPO (Invitrogen) vector for nucleic acid sequence analysis.
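The two degenerate primers can be checked against the cDNA of SEQ ID NO: 2 by expanding their IUPAC ambiguity codes. The following is an illustrative sketch, not the inventors' procedure: the template strings are excerpts copied from the sequence listing (nucleotides 361-420 and 1021-1140), and the helper names `degeneracy`, `revcomp`, and `matches` are hypothetical.

```python
import re
from math import prod

# IUPAC codes used by the two primers (R = A/G, Y = C/T, N = any base).
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "R": "AG", "Y": "CT", "N": "ACGT"}
COMPLEMENT = str.maketrans("ACGTRYN", "TGCAYRN")

FWD = "GAYGARTTYCCNTGGATGGC"
REV = "CCARTTNGCCATNCCRCANGG"

# Excerpts of SEQ ID NO: 2, upper-cased with spacing removed.
REGION_FWD = "TACGGCGGGGAGAAAACCGACCTGGATGAGTTCCCCTGGATGGCCCTGGTGGAATACGAG"
REGION_REV = ("GACAAGGACATACACTGGTACGCCGCGGGGGTGGTGTCTTTCGGGCCCTCGCCCTGCGGC"
              "ATGGCCAACTGGCCGGGAGTTTACACCAAAGTGTCCAAATACGTAGACTGGATCGTCGGC")


def degeneracy(primer):
    """Number of distinct sequences represented by a degenerate primer."""
    return prod(len(IUPAC[base]) for base in primer)


def revcomp(primer):
    """Reverse complement that keeps IUPAC ambiguity codes consistent."""
    return primer.translate(COMPLEMENT)[::-1]


def matches(primer, template):
    """True if any expansion of the primer occurs in the template strand."""
    pattern = "".join(f"[{IUPAC[base]}]" for base in primer)
    return re.search(pattern, template) is not None


print("forward degeneracy:", degeneracy(FWD))
print("reverse degeneracy:", degeneracy(REV))
print("forward primer found:", matches(FWD, REGION_FWD))
print("reverse primer site found:", matches(revcomp(REV), REGION_REV))
```

The reverse primer anneals to the coding strand, so it is its reverse complement that is searched for in the listed sequence.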
RACE (Rapid Amplification of cDNA Ends) PCR - 5'- and 3'-RACE were performed using the SMART RACE method (CLONTECH). The primers used for RACE PCR were designed based on the partial cDNA sequence information; their sequences and positions within the nucleic acid sequence are shown in Table 2 below.
Full-length cDNA sequence generation - To amplify the full-length cDNA, primers specific to each terminal region were designed. The full-length cDNA was obtained using 5'-ATGTTGGTCCGCTCCTTGTT-3' (SEQ ID NO: 15) and 5'-CTAGGGCTTCAGCTTGCCG-3' (SEQ ID NO: 16) as the forward and reverse primers, respectively. The amplified DNA was separated by electrophoresis and inserted into the pGEM T-vector (Promega) to determine the complete nucleic acid sequence. The full amino acid sequence obtained is shown in FIG. 5, and comparison with known proteins of other insects confirmed that it is a new sequence. In addition, a BLAST search (www.ncbi.nlm.nih.gov/BLST) of the Tm-44 gene (i.e., the gene of SEQ ID NO: 2) confirmed that it is entirely different from any previously known gene.
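A quick consistency check ties the end primers and the reported lengths together. The sketch below assumes the 5' and 3' excerpts are copied correctly from SEQ ID NO: 2 in the sequence listing; the variable and function names are illustrative.

```python
FWD_PRIMER = "ATGTTGGTCCGCTCCTTGTT"   # SEQ ID NO: 15
REV_PRIMER = "CTAGGGCTTCAGCTTGCCG"    # SEQ ID NO: 16

# First 30 and last 25 nucleotides of SEQ ID NO: 2 (upper-cased).
CDNA_5P = "ATGTTGGTCCGCTCCTTGTTCATCCTGGTA"
CDNA_3P = "GATCGTCGGCAAGCTGAAGCCCTAG"

CDNA_LENGTH = 1155     # <211> of SEQ ID NO: 2
PROTEIN_LENGTH = 384   # <211> of SEQ ID NO: 1

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


fwd_ok = CDNA_5P.startswith(FWD_PRIMER)
rev_ok = CDNA_3P.endswith(revcomp(REV_PRIMER))
codons = CDNA_LENGTH // 3
orf_ok = (CDNA_LENGTH % 3 == 0
          and codons - 1 == PROTEIN_LENGTH      # one codon is the stop
          and CDNA_3P.endswith("TAG"))

print("forward primer matches 5' end:", fwd_ok)
print("reverse primer matches 3' end:", rev_ok)
print("ORF length consistent with protein length:", orf_ok)
```

The forward primer begins at the ATG start codon and the reverse primer covers the TAG stop codon, so the amplified product spans exactly the open reading frame.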
(3) Biochemical characterization and functional analysis of the Tm-44 protein
To elucidate the molecular activation mechanism among the Tm-MSP, Tm-41, and Tm-44 proteins, the zymogen or active form of each protein was reacted with the others and the cleavage patterns were examined. For example, 1 μg of the activated form of Tm-41 and 1 μg of the zymogen form of Tm-44 were reacted in 20 mM Tris-HCl (pH 8.0) at 30 °C for 1 hour 30 minutes, and the reaction mixture was analyzed by 12% SDS-PAGE under reducing and non-reducing conditions to determine the cleavage pattern. The results are shown in FIG. 6.
FIG. 6(A) shows the reactivity of the active form of Tm-41 with the zymogen form of Tm-44. Lanes 1 and 4 are the active form of Tm-41, lanes 2 and 5 are the zymogen form of Tm-44, and lanes 3 and 6 are a mixture of the active form of Tm-41 and the zymogen form of Tm-44. Lanes 1-3 and 4-6 show the gel mobility of the proteins under reducing and non-reducing conditions, respectively. Bands (a) and (b) represent the catalytic domain and the clip domain derived from Tm-44 after cleavage, and band (c) represents the cleaved Tm-44 protein.
FIG. 6(B) shows the reactivity of the active form of Tm-MSP with the zymogen form of Tm-41. Lanes 1 and 4 are the active form of Tm-MSP, lanes 2 and 5 are the zymogen form of Tm-41, and lanes 3 and 6 are a mixture of the active form of Tm-MSP and the zymogen form of Tm-41. Lanes 1-3 and 4-6 show the gel mobility of the proteins under reducing and non-reducing conditions, respectively. Bands (d) and (e) represent the catalytic domain and the clip domain derived from Tm-41 after cleavage, and band (f) represents the cleaved Tm-41 protein.
In FIG. 6(C), lane 1 is the activated Tm-MSP protein, lane 2 is the zymogen form of Tm-44, and lane 3 is a mixture of the activated Tm-MSP protein and the zymogen form of Tm-44. The result in FIG. 6(C) shows that the Tm-44 protein is not cleaved by the Tm-MSP protein.
In FIG. 6(D), lane 1 is the active form of Tm-44, lane 2 is the zymogen form of Tm-41, and lane 3 is a mixture of the active form of Tm-44 and the zymogen form of Tm-41. The result in FIG. 6(D) shows that the active form of Tm-44 does not cleave the zymogen form of Tm-41.
The affinity between the Spatzle protein cleaved by the active form of Tm-44 and the Tribolium Toll ectodomain was analyzed by HPLC on a TSK SW3000 size exclusion column, and the results are shown in FIG. 7. In FIG. 7(A), curves (c) and (b) show the elution patterns after injection of the cleaved Spatzle protein (8 μg) and the purified Tribolium Toll ectodomain (18 μg), respectively. Curve (a) shows the elution pattern after injection of a mixture of the cleaved Spatzle protein (8 μg) and the purified Tribolium Toll ectodomain (18 μg). FIG. 7(B) shows the results of analyzing each peak from the HPLC analysis by 12% SDS-PAGE under reducing and non-reducing conditions. Peak a1 contains both the cleaved Spatzle protein and the purified Tribolium Toll ectodomain, indicating that the two proteins form a stable complex. Peak a2 represents the excess Spatzle protein remaining after complex formation. Peaks b and c represent the purified Tribolium Toll ectodomain and the cleaved Spatzle protein, respectively.
As can be seen from the results in FIGS. 6 and 7, the activated form of Tm-41 cleaves and thereby activates the downstream zymogen form of Tm-44, and the activated Tm-44 in turn cleaves the downstream Spatzle protein, which binds the Toll ectodomain and thereby activates the Toll pathway. This confirms that Tm-44 is a protease located upstream of the Spatzle protein in the peptidoglycan recognition signal pathway. Therefore, when a fluorescent substrate (for example, methylcoumarin) or a chromophore (p-nitrophenol) that it can cleave is used, the Tm-44 newly isolated by the present invention can selectively detect bacterial peptidoglycan. In other words, because it can function as the final substrate-cleaving enzyme, it is useful in a kit for detecting bacterial infection. The series of reactions of the proteins involved in the Tenebrio-derived pro-PO system is summarized in FIG. 8. In FIG. 8, SAE and SPE denote the Tm-41 and Tm-44 proteins, respectively.
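The cleavage relationships reported for FIGS. 6 and 7 can be written down as a small directed graph. This is a schematic sketch only: the edge list reflects the pairwise experiments described in the text (Tm-MSP cleaves Tm-41, Tm-41 cleaves Tm-44, Tm-44 cleaves Spatzle), and the names `CLEAVES`, `cleaves`, and `activation_order` are hypothetical.

```python
from collections import deque

# Directed edges: active protease -> substrates it was reported to cleave.
CLEAVES = {
    "Tm-MSP": ["Tm-41"],      # FIG. 6(B)
    "Tm-41": ["Tm-44"],       # FIG. 6(A)
    "Tm-44": ["Spatzle"],     # FIG. 7 uses Spatzle cleaved by active Tm-44
    "Spatzle": [],            # cleaved Spatzle binds the Toll ectodomain
}


def cleaves(protease, substrate):
    """True if the text reports direct cleavage of substrate by protease."""
    return substrate in CLEAVES.get(protease, [])


def activation_order(start):
    """Breadth-first order in which components become processed."""
    order, seen, queue = [], {start}, deque([start])
    while queue:
        current = queue.popleft()
        order.append(current)
        for target in CLEAVES.get(current, []):
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return order


print(" -> ".join(activation_order("Tm-MSP")))
print("Tm-MSP cleaves Tm-44 directly:", cleaves("Tm-MSP", "Tm-44"))   # FIG. 6(C)
print("Tm-44 cleaves Tm-41:", cleaves("Tm-44", "Tm-41"))              # FIG. 6(D)
```

The negative results in FIG. 6(C) and FIG. 6(D) are what fix the order of the cascade: Tm-44 sits strictly downstream of Tm-41 and is not reached directly from Tm-MSP.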
FIG. 1 is a chromatogram of a Toyopearl AF-Heparin HC 650M column loaded with the body fluid (hemolymph) of the mealworm beetle.
FIG. 2 shows the results of immunoblot analysis, using the respective antibodies, of the fractions obtained by separating the body fluid of the mealworm beetle on a Toyopearl AF-Heparin HC 650M column.
FIG. 3 shows the results of the in vitro activity reconstitution assay for amidase activity against the α-thrombin substrate.
FIG. 4 shows the 12% SDS-PAGE results for Tm-44 under reducing and non-reducing conditions.
FIG. 5 shows the amino acid sequence of Tm-44.
FIG. 6 shows the cleavage patterns measured after reacting the zymogen or active forms of the Tm-MSP, Tm-41, and Tm-44 proteins with one another in order to elucidate the molecular activation mechanism among them.
FIG. 7 shows the results of HPLC analysis, on a TSK SW3000 size exclusion column, of the affinity between the Spatzle protein cleaved by the active form of Tm-44 and the Tribolium Toll ectodomain.
FIG. 8 summarizes the reaction sequence of the proteins involved in the pro-PO system as elucidated by the present invention.
<110> YUHAN CORPORATION <120> Protein involving in peptidoglycan-recognition signal pathway, gene encoding the same, and kit for detecting bacterial infection comprising the same <130> PN0153 <160> 16 <170> KopatentIn 1.71 <210> 1 <211> 384 <212> PRT <213> Tenebrio molitor <400> 1 Met Leu Val Arg Ser Leu Phe Ile Leu Val Val Thr Ala Gln Val Leu 1 5 10 15 Asn Ala Asp Glu Asn Cys Arg Thr Pro Asp Asn Glu Glu Gly Asp Cys 20 25 30 Lys Pro Ile Asn Gln Cys Arg Pro Leu Tyr Ser Leu Leu Glu Arg Arg 35 40 45 Pro Ile Thr Ala Ser Thr Ala Glu Tyr Leu Arg Arg Ser Asn Cys Gly 50 55 60 Phe Asp Gly Ser Tyr Pro Arg Val Cys Cys Pro Gln Gly Ser Ile Glu 65 70 75 80 Pro Pro Thr Ile Lys Pro Pro Ile Val Asp Gly Pro Thr Glu Ser Asn 85 90 95 Asn Val Ser Pro Val Thr Ser Asp Leu Leu Pro Asp Gly Ser Ile Cys 100 105 110 Gly Pro Asn Thr Gln Asn Arg Ile Tyr Gly Gly Glu Lys Thr Asp Leu 115 120 125 Asp Glu Phe Pro Trp Met Ala Leu Val Glu Tyr Glu Lys Pro Gly Gly 130 135 140 Ser Arg Gly Phe Tyr Cys Gly Gly Val Leu Ile Ser Lys Arg Tyr Val 145 150 155 160 Leu Thr Ala Ala His Cys Val Lys Gly Lys Asp Leu Pro Lys Thr Trp 165 170 175 Lys Leu Val Ser Val Arg Leu Gly Glu Tyr Asn Thr Glu Thr Asp Thr 180 185 190 Asp Cys Ile Asn Asn Gly Phe Gly Glu Asp Cys Ala Pro Pro Pro Val 195 200 205 Asn Val Gln Val Glu Ala Arg Ile Ala His Glu Ser Tyr Glu Pro Asn 210 215 220 Asn Ile Asn Gln Tyr His Asp Ile Ala Leu Leu Arg Leu Arg Arg Glu 225 230 235 240 Val Lys Phe Ser Asp Tyr Ile Lys Pro Ile Cys Leu Pro Thr Thr Thr 245 250 255 Glu Glu Leu Ser Lys Ser Tyr Leu Gly Gln Lys Leu Phe Val Ala Gly 260 265 270 Trp Gly Lys Thr Glu Asn Arg Ser Glu Ser Asn Ile Lys Leu Lys Val 275 280 285 Gln Val Pro Val Lys Gln Met Ser Asp Cys Thr Ala Thr Tyr Ser Ser 290 295 300 Ala Asn Val Arg Leu Gly Ser Gly Gln Leu Cys Ala Gly Gly Glu Ser 305 310 315 320 Gly Lys Asp Ser Cys Arg Gly Asp Ser Gly Gly Pro Leu Met Ile Leu 325 330 335 Ser Leu Asp Lys Asp Lys Asp Ile His Trp Tyr Ala Ala Gly Val Val 340 345 350 Ser Phe Gly Pro Ser Pro Cys Gly Met Ala Asn Trp 
Pro Gly Val Tyr 355 360 365 Thr Lys Val Ser Lys Tyr Val Asp Trp Ile Val Gly Lys Leu Lys Pro 370 375 380 <210> 2 <211> 1155 <212> DNA <213> Tenebrio molitor <400> 2 atgttggtcc gctccttgtt catcctggta gtaacagcac aagtgctcaa tgccgacgag 60 aattgtcgta ctcctgataa tgaagaaggt gattgtaagc ctatcaatca atgccgcccc 120 ctctactccc tgttggagcg ccgccccatc accgccagca ccgccgagta tttgcgccga 180 tccaactgcg gcttcgacgg gagctaccct cgcgtctgct gcccccaagg ctcgatcgaa 240 cccccgacca tcaaaccccc aatagtggac gggcccaccg agtccaacaa tgtgtctccc 300 gtgacgagcg acctcctccc agacggctcc atctgcggtc ccaacaccca gaacaggatc 360 tacggcgggg agaaaaccga cctggatgag ttcccctgga tggccctggt ggaatacgag 420 aaacccggag gcagtcgagg gttctactgc ggcggagtgc tgatcagcaa gaggtacgtc 480 ctgacggcgg cgcactgcgt caaagggaag gatctgccca aaacgtggaa actcgtgagc 540 gtgcgtttgg gcgagtacaa caccgagact gacacggact gcatcaacaa cggcttcggg 600 gaggactgcg ccccaccccc cgtcaacgtc caggtggagg ccaggatcgc ccacgagagc 660 tacgaaccca acaacatcaa ccagtaccac gacatagctc tgttgaggtt gcgacgcgaa 720 gtcaaattct ctgactacat caaacccatt tgtctgccga ccaccaccga agagctgagc 780 aagtcgtacc tcggccagaa actcttcgtg gcgggctggg gcaagaccga gaaccggtcc 840 gagagcaaca tcaagctcaa agtgcaagtt cccgtcaagc aaatgtcaga ctgcaccgcc 900 acctacagca gcgccaatgt gaggttaggt tctggtcagc tgtgcgcagg aggcgaatcg 960 gggaaagatt cgtgtcgcgg agacagcgga gggcctttga tgatcctcag tttggacaaa 1020 gacaaggaca tacactggta cgccgcgggg gtggtgtctt tcgggccctc gccctgcggc 1080 atggccaact ggccgggagt ttacaccaaa gtgtccaaat acgtagactg gatcgtcggc 1140 aagctgaagc cctag 1155 <210> 3 <211> 193 <212> PRT <213> Tenebrio molitor <400> 3 Met Leu Leu Ala Thr Ile Ala Arg Gly Val Tyr Gln Ile Ser Ala Leu 1 5 10 15 Ser Gly Ser Thr Ile Pro Arg Ile Cys Pro Glu Ile Ile Ser Arg Thr 20 25 30 Arg Trp Gly Ala Arg Thr Pro Leu Glu Val Asp Tyr Ser Leu Ile Pro 35 40 45 Ile Glu Asn Val Val Val His His Thr Val Thr His Thr Cys Asp Ser 50 55 60 Glu Ser Glu Cys Ala Thr Leu Leu Arg Asn Val Gln Asn Phe His Met 65 70 75 80 Glu Asn Leu Glu Phe His Asp Ile Gly Tyr Asn Phe Leu 
Val Ala Gly 85 90 95 Asp Gly Gln Ile Tyr Glu Gly Ala Gly Trp His Lys Val Gly Ala His 100 105 110 Thr Arg Gly Tyr Asn Thr Arg Ser Leu Gly Leu Ala Phe Ile Gly Asn 115 120 125 Phe Thr Ser Gln Leu Pro Val Gln Lys Gln Leu Lys Val Ala Lys Asp 130 135 140 Phe Leu Gln Cys Gly Val Glu Leu Gly Glu Leu Ser Lys Asn Tyr Lys 145 150 155 160 Leu Phe Gly Ala Arg Gln Val Ser Ser Thr Ser Ser Pro Gly Leu Lys 165 170 175 Leu Tyr Arg Glu Leu Gln Asp Trp Pro His Phe Thr Arg Ser Pro Pro 180 185 190 Lys <210> 4 <211> 443 <212> PRT <213> Tenebrio molitor <400> 4 Met Phe Ala Lys Ala Ile Ile Leu Phe Leu Ile Leu Thr Thr Phe Gln 1 5 10 15 Cys His Gly Glu Phe Val Ile Pro Glu Val Thr Leu Glu Ala Tyr Glu 20 25 30 Pro Lys Gly Phe Arg Ala Ser Ile Pro Ala Leu Asn Gly Ile Gln Met 35 40 45 Phe Ala Phe His Gly Asn Ile Asn Lys Pro Ile Ser Gln Val Asp Pro 50 55 60 Gly Glu Tyr Ser Gln Asp Tyr Thr Ser Pro Thr Gly Asn Thr Trp Ser 65 70 75 80 Tyr Phe Asn Lys Asp Leu Lys Leu Lys Ala Gly Asp Val Ile His Tyr 85 90 95 Trp Val Phe Ile Gln Phe Leu Lys Leu Gly Tyr Arg Lys Asp Asn Gln 100 105 110 Val Trp Asn Val Thr Glu Leu Val Gln Leu Lys Asn Ser Ser Cys Glu 115 120 125 Thr Ser Pro Thr Thr Val Arg Gly Arg Ser Val Ile Cys Lys Asn Ser 130 135 140 Ile Ile Phe Glu Glu Asn Phe Asn Gly Glu Gly Ile Asp Thr Lys Lys 145 150 155 160 Trp Leu Ile Glu Gln Tyr Ile Pro Thr Tyr Thr Ser Leu Asp Tyr Glu 165 170 175 Phe Val Ser Tyr Gln Asn Asp Pro Thr Val Cys Phe Leu Asn Asp Asn 180 185 190 Lys Leu Phe Ile Lys Pro Lys Tyr Ala Gln Ser Glu Ala Glu Val Asn 195 200 205 Gly Glu Leu Asp Phe Arg Asn Arg Cys Thr Arg Lys Thr Asp Glu Glu 210 215 220 Cys Tyr Lys Lys Arg Glu Ile Tyr Phe Ile Ile Pro Pro Val Thr Ser 225 230 235 240 Gly Arg Leu Val Ser Asp Phe Arg Phe Lys Tyr Gly Lys Val Glu Ile 245 250 255 Arg Ala Lys Leu Pro Ala Gly Asp Trp Ile Tyr Pro Gln Met Tyr Leu 260 265 270 Glu Gln Val Asn Asp Pro Lys Lys Lys Ile Trp Ile Gly Tyr Ala Arg 275 280 285 Gly Asn Asn Lys Leu Leu Ala Asn Asn Gln Glu Asp Ile Gly Gly Asn 290 295 300 Leu Leu Phe Gly 
Gly Pro Val Leu Asp Pro Glu Glu Pro His Arg Ser 305 310 315 320 Gln Tyr Leu Lys Ser Thr Arg Asn Ser Lys Pro Phe Thr Ser Gln Met 325 330 335 His Thr Leu Val Val Leu Trp Asp Glu Asp His Ile Ser Leu Gln Leu 340 345 350 Asn Gly Ile Glu Tyr Gly Lys Ile Asp Lys Arg Thr Met Gln Glu Val 355 360 365 Asn Phe Ala Asp Asn Asp Met Val Arg Leu Val Leu Gly Val Gly Val 370 375 380 Gly Gly Val Asn Asp Phe Pro Asp Asp Phe Arg Ser Gly Thr Asn Val 385 390 395 400 Lys Pro Trp Arg Asn Lys Asp Asn Lys Gln Val Lys Asn Phe Phe Thr 405 410 415 Ala Arg Ser Glu Trp Gly Lys Thr Trp Ser Gly Asp Asn Cys Ala Leu 420 425 430 Gln Val Asp Tyr Ile Lys Val Trp Ala Leu *** 435 440 <210> 5 <211> 633 <212> PRT <213> Tenebrio molitor <400> 5 Met Cys Asn Val Arg Thr Leu Leu Gln Val Ile Cys Leu Ser Leu Ile 1 5 10 15 Val Ile Gln Thr Val Asp Ser Tyr Ser Phe Ala Leu Ser Lys Phe Thr 20 25 30 Arg Ile Arg Arg Gln Ala Arg Arg Thr Cys Thr Ser Thr Glu Phe Ala 35 40 45 Cys Lys Ser Gly Glu Cys Ile Asp Glu Asp Lys Glu Cys Asp Gly Ile 50 55 60 Val Asp Cys Thr Asp Ala Ser Asp Glu Thr Asn Ala Cys His Arg Ile 65 70 75 80 Lys Cys Pro Asn Tyr Leu Phe Arg Cys Lys Tyr Gly Ala Cys Ile Asn 85 90 95 Pro Asp Leu Glu Cys Asp Gly Lys Pro Asp Cys Met Asp Gly Ser Asp 100 105 110 Glu Lys Thr Ser Lys Cys Lys Pro Asp Asp Ser Ser Pro Glu Cys Lys 115 120 125 Ala Asn Glu Phe Arg Cys Ser Ser Gly Gln Cys Ile Pro Glu Asp Phe 130 135 140 Lys Cys Asp Gly Lys Ala Glu Cys Lys Asp Asn Ser Asp Glu Ile Arg 145 150 155 160 Ala Thr Cys Trp Asn Val Arg Cys Pro Gly Phe Thr His Lys Cys Lys 165 170 175 Tyr Gly Ala Cys Val Ser Gly Asn Ala Glu Cys Asn Gly Ile Val Glu 180 185 190 Cys Phe Asp Gly Ser Asp Glu Asp Pro Ala Ile Cys Lys Thr Lys Pro 195 200 205 Thr Pro Arg Pro Thr Pro Thr Pro Gly Thr Pro Gly Pro Gln Pro Thr 210 215 220 Gln Gly Gly Cys Val Leu Pro Asn His Pro Glu Phe Gly Glu Trp Gln 225 230 235 240 Val Tyr Gly Ile Pro Gly Gln Phe Ser Pro Gly Met Val Ile Arg Ala 245 250 255 Gly Ala Thr Leu Arg Ile Gln Cys Lys Lys Arg Tyr Lys Leu Glu Gly 260 265 270 
Lys Asn Ala Ile Phe Cys Glu Asn Gly Lys Trp Ser Asp Ala Val Gly 275 280 285 His Cys Leu Lys Leu Cys Pro Ser Ile Gln Ser Thr Ser Thr Met Arg 290 295 300 Val Thr Cys Ile Tyr Asn Lys His Glu Glu Thr Glu Asn Cys Thr Glu 305 310 315 320 Ala Val Glu Gly Thr Leu Val Arg Phe Asp Cys Ala Pro Phe Tyr Glu 325 330 335 Asp Leu Gly Leu Ser Arg His Pro Ile His Ile Cys Arg Asp Gly Ser 340 345 350 Trp Asp Gln Arg Arg Pro Glu Cys Thr Pro Val Cys Gly Gln Lys Ser 355 360 365 Val Asn Ala Gln Thr Leu Ile Val Asn Gly Lys Pro Val Lys Lys Gly 370 375 380 Asp Tyr Pro Trp Gln Val Ala Leu Tyr Thr Leu Asn Asp Lys Glu Leu 385 390 395 400 Ile Cys Gly Gly Ser Leu Leu Asn Gln Arg Val Val Leu Thr Ala Ala 405 410 415 His Cys Ile Thr Asp Asp Lys Gly Lys Leu Leu Ser Lys Glu Asn Tyr 420 425 430 Met Val Ala Val Gly Lys Tyr Tyr Arg Pro Phe Asn Asp Ser Arg Asp 435 440 445 Arg Asn Glu Ala Gln Phe Ser Glu Val Lys His Met Phe Ile Pro Glu 450 455 460 Leu Tyr Lys Gly Ser Thr Gln Asn Tyr Val Gly Asp Ile Ala Ile Leu 465 470 475 480 Val Thr Arg Val Thr Phe Thr Leu Ser Arg Arg Val Gln Pro Val Cys 485 490 495 Ile Asp Tyr Gly Leu Lys Tyr Thr Ser Tyr Thr Asn Glu Phe Gly Tyr 500 505 510 Val Thr Gly Trp Gly Tyr Thr Leu Gln Asn Asp Lys Pro Ser Asp Val 515 520 525 Leu Lys Glu Leu Lys Val Pro Ala Val Ser Thr Glu Gln Cys Ser Ser 530 535 540 Ala Ile Pro Glu Asp Tyr Asp Ile Tyr Leu Thr His Asp Lys Leu Cys 545 550 555 560 Ala Gly Tyr Leu Asp Asn Gly Thr Ser Val Cys Ser Gly Asp Ser Gly 565 570 575 Gly Gly Leu Val Phe Lys Phe Asp Gly Arg Tyr Tyr Val Thr Gly Ile 580 585 590 Val Ser Leu Ser Pro Gln Ala Ser Thr Gly Gly Cys Asp Thr Gln Gln 595 600 605 Tyr Gly Leu Tyr Thr Lys Val Gly Thr Tyr Ile Ser Asp Phe Ile Ile 610 615 620 Lys Thr Glu Ser Gln Phe Arg Pro *** 625 630 <210> 6 <211> 633 <212> PRT <213> Tenebrio molitor <400> 6 Met Cys Asn Val Arg Thr Leu Leu Gln Val Ile Cys Leu Ser Leu Ile 1 5 10 15 Val Ile Gln Thr Val Asp Ser Tyr Ser Phe Ala Leu Ser Lys Phe Thr 20 25 30 Arg Ile Arg Arg Pro Ala Arg Arg Thr Cys Thr Ser Thr Glu Phe 
Ala 35 40 45 Cys Lys Ser Gly Glu Cys Ile Asp Glu Asp Lys Glu Cys Asp Gly Ile 50 55 60 Val Asp Cys Thr Asp Ala Ser Asp Glu Thr Asn Ala Cys His Arg Ile 65 70 75 80 Lys Cys Pro Asn Tyr Leu Phe Arg Cys Lys Tyr Gly Ala Cys Ile Asn 85 90 95 Pro Asp Leu Glu Cys Asp Gly Lys Pro Asp Cys Met Asp Gly Ser Asp 100 105 110 Glu Lys Ala Ser Lys Cys Lys Pro Asp Asp Ser Ser Pro Glu Cys Lys 115 120 125 Ala Asn Glu Phe Arg Cys Ser Ser Gly Gln Cys Ile Pro Glu Asp Tyr 130 135 140 Lys Cys Asp Gly Lys Ala Glu Cys Lys Asp Asn Ser Asp Glu Ile Arg 145 150 155 160 Ala Thr Cys Trp Asn Val Arg Cys Pro Gly Phe Thr His Lys Cys Lys 165 170 175 Tyr Gly Ala Cys Val Ser Gly Asn Ala Glu Cys Asn Gly Ile Val Glu 180 185 190 Cys Phe Asp Gly Ser Asp Glu Asp Pro Ala Ile Cys Lys Thr Glu Pro 195 200 205 Thr Pro Lys Pro Thr Pro Thr Pro Gly Thr Pro Gly Pro Gln Pro Thr 210 215 220 Gln Gly Gly Cys Val Leu Pro Asn His Pro Glu Phe Gly Glu Trp Gln 225 230 235 240 Val Tyr Gly Ile Pro Gly Gln Phe Ser Pro Gly Met Ala Ile Arg Ala 245 250 255 Gly Ala Thr Leu Arg Ile Gln Cys Lys Lys Arg Tyr Lys Leu Glu Gly 260 265 270 Lys Asn Ala Ile Phe Cys Glu Asn Gly Lys Trp Ser Asp Ala Val Gly 275 280 285 His Cys Leu Lys Leu Cys Pro Ser Ile Gln Ser Thr Ser Thr Met Arg 290 295 300 Val Thr Cys Ile Tyr Asn Lys His Glu Glu Thr Glu Asn Cys Thr Glu 305 310 315 320 Ala Val Glu Gly Thr Leu Val Arg Phe Asp Cys Ala Pro Phe Tyr Glu 325 330 335 Asp Leu Gly Leu Ser Arg His Pro Ile His Ile Cys Arg Asp Gly Ser 340 345 350 Trp Asp Gln Arg Arg Pro Glu Cys Thr Pro Val Cys Gly Gln Lys Ser 355 360 365 Val Asn Ala Gln Thr Leu Ile Val Asn Gly Lys Pro Val Lys Lys Gly 370 375 380 Asp Tyr Pro Trp Gln Val Ala Leu Tyr Thr Leu Asn Asp Lys Glu Leu 385 390 395 400 Ile Cys Gly Gly Ser Leu Leu Asn Gln Arg Val Val Leu Thr Ala Ala 405 410 415 His Cys Ile Thr Asp Asp Lys Gly Lys Leu Leu Ser Lys Glu Asn Tyr 420 425 430 Met Val Ala Val Gly Lys Tyr Tyr Arg Pro Phe Asn Asp Ser Arg Asp 435 440 445 Arg Asn Glu Ala Gln Phe Ser Glu Val Lys His Met Phe Ile Pro Glu 450 455 460 
Leu Tyr Lys Gly Ser Thr Gln Asn Tyr Val Gly Asp Ile Ala Ile Leu 465 470 475 480 Val Thr Arg Val Thr Phe Thr Leu Ser Arg Arg Val Gln Pro Val Cys 485 490 495 Ile Asp Tyr Gly Leu Lys Tyr Thr Ser Tyr Thr Asn Glu Phe Gly Tyr 500 505 510 Val Thr Gly Trp Gly Tyr Thr Leu Gln Asn Asp Lys Pro Ser Asp Val 515 520 525 Leu Lys Glu Leu Lys Val Pro Ala Val Ser Thr Glu Gln Cys Ser Ser 530 535 540 Ala Ile Pro Glu Asp Tyr Asp Ile Tyr Leu Thr His Asp Lys Leu Cys 545 550 555 560 Ala Gly Tyr Leu Asp Asn Gly Thr Ser Val Cys Ser Gly Asp Ser Gly 565 570 575 Gly Gly Leu Val Phe Lys Phe Asp Gly Arg Tyr Tyr Val Thr Gly Ile 580 585 590 Val Ser Leu Ser Pro Gln Ala Ser Thr Gly Gly Cys Asp Thr Gln Gln 595 600 605 Tyr Gly Leu Tyr Thr Lys Val Gly Thr Tyr Ile Ser Asp Phe Ile Ile 610 615 620 Lys Thr Glu Ser Gln Phe Arg Pro *** 625 630 <210> 7 <211> 374 <212> PRT <213> Tenebrio molitor <400> 7 Met Leu Asn Leu Asn Tyr Phe Thr Cys Phe Val Ile Val Leu Ile Gln 1 5 10 15 Leu Val Ser Ser Gln Arg Phe Val Gly Asp Leu Cys Thr Leu Glu Ser 20 25 30 Ser Gly Ala Pro Gly Val Cys Glu Leu Phe Lys Glu Cys Lys Gln Ala 35 40 45 Arg Asp Asp Leu Gln Lys His Gln Leu Phe Pro Gln Gln Cys Gly Tyr 50 55 60 Gln Lys Asn Glu Pro Ile Val Cys Cys Leu Lys Lys Ser Lys Arg Lys 65 70 75 80 Pro Gly Glu Ile Ser Leu Lys Lys Cys Gln Glu Tyr Ser Arg Leu Val 85 90 95 Tyr Glu Val Asn Arg Ala Pro Val Leu Ile Ile Asn Ala Pro Asn Ile 100 105 110 Thr Lys Asn Glu Cys Gly His Lys Ile Ile Lys Leu Ile Val Gly Gly 115 120 125 Thr Asn Ala Thr Arg Lys Glu Phe Pro His Met Ala Val Ile Gly Phe 130 135 140 Glu Pro Gln Pro Gly Asp Ile Lys Trp Leu Cys Gly Gly Thr Val Leu 145 150 155 160 Ser Lys His Tyr Ile Leu Thr Ala Ala His Cys Leu Ser His Gln Glu 165 170 175 His Gly Arg Ala Arg Tyr Val Arg Ile Gly Val Thr Asp Leu Glu Asp 180 185 190 Thr Asn His Arg Gln Gln Leu Glu Val Glu Glu Leu Ile Pro Tyr Pro 195 200 205 Glu Tyr Lys Ser Ser Ser His Tyr His Asp Ile Gly Leu Leu Arg Leu 210 215 220 Lys Arg Ser Ala Lys Leu Asp Ser Phe Thr Val Pro Ala Cys Leu Tyr 225 230 
235 240 Arg Lys His Asp Ile Glu Ala Glu Lys Ala Ile Ala Thr Gly Trp Gly 245 250 255 His Thr Thr Trp Gly Gly Ser Gly Ser Asn Asn Leu Leu Lys Val Thr 260 265 270 Leu Asp Leu Phe Asp His Ala Ser Cys Asn Arg Ser Tyr Lys Asn Gln 275 280 285 Ile Ser Arg Arg Leu Lys Asp Gly Ile Ile Asp Asp Ile Gln Val Cys 290 295 300 Ala Gly Ser Leu Asp Asp Glu Lys Asp Thr Cys Gln Gly Asp Ser Gly 305 310 315 320 Gly Pro Leu Gln Ile Phe His Glu Ser Lys Asp Ile Lys Cys Met Tyr 325 330 335 Asp Ile Ile Gly Val Thr Ser Phe Gly Lys Ala Cys Ser Gly Ser Pro 340 345 350 Gly Val Tyr Val Arg Val Ser Gln Tyr Ile Gly Trp Ile Glu Asp Ile 355 360 365 Val Trp Pro Glu Asn Ser 370 <210> 8 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> Primer <400> 8 gaygarttyc cntggatggc 20 <210> 9 <211> 21 <212> DNA <213> Artificial Sequence <220> <223> Primer <400> 9 ccarttngcc atnccrcang g 21 <210> 10 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> Primer <400> 10 cgtcgcaacc tcaacagagc ta 22 <210> 11 <211> 24 <212> DNA <213> Artificial Sequence <220> <223> Primer <400> 11 gatgttgttg ggttcgtagc tctc 24 <210> 12 <211> 24 <212> DNA <213> Artificial Sequence <220> <223> Primer <400> 12 cgtgtcagtc tcggtgttgt actc 24 <210> 13 <211> 26 <212> DNA <213> Artificial Sequence <220> <223> Primer <400> 13 ctctgactac atcaaaccca tttgtc 26 <210> 14 <211> 24 <212> DNA <213> Artificial Sequence <220> <223> Primer <400> 14 gaagagctga gcaagtcgta cctc 24 <210> 15 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> Primer <400> 15 atgttggtcc gctccttgtt 20 <210> 16 <211> 19 <212> DNA <213> Artificial Sequence <220> <223> Primer <400> 16 ctagggcttc agcttgccg 19
Claims (9)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020070140064A KR101016689B1 (en) | 2007-12-28 | 2007-12-28 | Proteins involved in the transmission of peptidoglycan recognition signals, genes encoding them, and bacterial infection detection kits including the same |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020070140064A KR101016689B1 (en) | 2007-12-28 | 2007-12-28 | Proteins involved in the transmission of peptidoglycan recognition signals, genes encoding them, and bacterial infection detection kits including the same |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20090072078A (en) | 2009-07-02 |
KR101016689B1 (en) | 2011-02-25 |
Family
ID=41329344
Family Applications (1)
Application Number | Status | Priority Date | Filing Date | Title |
---|---|---|---|---|
KR1020070140064A / KR101016689B1 (en) | Expired - Fee Related | 2007-12-28 | 2007-12-28 | Proteins involved in the transmission of peptidoglycan recognition signals, genes encoding them, and bacterial infection detection kits including the same |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR101016689B1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101251023B1 (en) * | 2009-11-05 | 2013-04-03 | Yuhan Corporation | Kit for detecting bacterial infection comprising novel monoclonal antibodies |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2002101083A1 (en) | 2001-06-08 | 2002-12-19 | Samyang Genex Corporation | Composition for detecting peptidoglycan, and diagnostic kit detecting peptidoglycan |
KR20030018672A (en) * | 2001-08-30 | 2003-03-06 | Samyang Genex Corporation | Protein of phenoloxidase system and gene encoding the same |
KR20090029973A (en) * | 2007-09-19 | 2009-03-24 | Yuhan Corporation | Proteins involved in the transmission of peptidoglycan recognition signals, genes encoding them, and bacterial infection detection kits including the same |
Also Published As
Publication number | Publication date |
---|---|
KR20090072078A (en) | 2009-07-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Shapiro et al. | Isolation and characterization of a human colon carcinoma-secreted enzyme with pancreatic ribonuclease-like activity | |
ES2627334T3 (en) | Endoglucosidase from streptococcus pyogenes and methods to use it | |
AU735015B2 (en) | Use of substrate subtraction libraries to distinguish enzyme specificities | |
Romeis et al. | Penicillin‐binding protein 7/8 of Escherichia coli is a DD‐endopeptidase | |
CA2143621C (en) | Horseshoe crab amebocyte lysate factor g subunit a and dna encoding thereof | |
Healy et al. | The lysine-specific proteinase from Armillaria mellea is a member of a novel class of metalloendopeptidases located in Basidiomycetes | |
KR101016688B1 (en) | Proteins involved in the transmission of peptidoglycan recognition signals, genes encoding them, and bacterial infection detection kits including the same | |
KR101016689B1 (en) | Proteins involved in the transmission of peptidoglycan recognition signals, genes encoding them, and bacterial infection detection kits including the same | |
KR101072815B1 (en) | Proteins activating pro-phenoloxidase system and genes encoding the same | |
Rodenburg et al. | Arg-27, Arg-127 and Arg-155 in the β-trefoil protein barley α-amylase/subtilisin inhibitor are interface residues in the complex with barley α-amylase 2 | |
Kouzuma et al. | The tissue-type plasminogen activator inhibitor ETIa from Erythrina variegata: structural basis for the inhibitory activity by cloning, expression, and mutagenesis of the cDNA encoding ETIa | |
Sugimura et al. | Studies on algal cytochromes. III. Amino acid sequence of cytochrome c-553 from a brown alga, Petalonia fascia | |
EP0753304A1 (en) | Thrombolytic enzyme and method of obtaining same | |
Uchida et al. | Base specificity and primary structure of poly U-preferential ribonuclease from chicken liver | |
EP1418182A1 (en) | Method of preparing peptide fragment having cell death inhibitory activity | |
US6399759B1 (en) | Ant proteases and methods of inhibition | |
KR101092936B1 (en) | Proteins activating pro-phenoloxidase system and genes encoding the same | |
Chiou et al. | Purification and characterization of a novel phospholipase A2 from king cobra (Ophiophagus hannah) venom | |
Hrušková-Heidingsfeldová et al. | Enzymological characterization of secreted proteinases from Candida parapsilosis and Candida lusitaniae | |
SUGIHARA et al. | Purification and characterization of phosphodiesterase from the venom of Agkistrodon acutus (China) | |
KR101060098B1 (en) | Proteins activating pro-phenoloxidase system and genes encoding the same | |
KR101104631B1 (en) | Proteins activating the pro-phenoloxidase system and genes encoding them | |
KR100497123B1 (en) | Protein of phenoloxidase system and gene encoding the same | |
KR20100102284A (en) | Antifungi composition comprising the essential components for transferring the beta-1,3-glucan recognition signal to spin larvae of the mealworm, tenebrio molitor as effective component | |
PARK et al. | Cloning and Characterization of the cDNA Encoding the Masquerade‐like Serine Proteinase Homologue Gene of the Silkworm, Bombyx mori |
Legal Events
Code | Title | Description |
---|---|---|
PA0109 | Patent application | Patent event code: PA01091R01D; Comment text: Patent Application; Patent event date: 20071228 |
A201 | Request for examination | |
PA0201 | Request for examination | Patent event code: PA02012R01D; Patent event date: 20090121; Comment text: Request for Examination of Application. Patent event code: PA02011R01I; Patent event date: 20071228; Comment text: Patent Application |
PG1501 | Laying open of application | |
E701 | Decision to grant or registration of patent right | |
PE0701 | Decision of registration | Patent event code: PE07011S01D; Comment text: Decision to Grant Registration; Patent event date: 20110211 |
GRNT | Written decision to grant | |
PR0701 | Registration of establishment | Comment text: Registration of Establishment; Patent event date: 20110215; Patent event code: PR07011E01D |
PR1002 | Payment of registration fee | Payment date: 20110215; End annual number: 3; Start annual number: 1 |
PG1601 | Publication of registration | |
FPAY | Annual fee payment | Payment date: 20140110; Year of fee payment: 4 |
PR1001 | Payment of annual fee | Payment date: 20140110; Start annual number: 4; End annual number: 4 |
FPAY | Annual fee payment | Payment date: 20150127; Year of fee payment: 5 |
PR1001 | Payment of annual fee | Payment date: 20150127; Start annual number: 5; End annual number: 5 |
FPAY | Annual fee payment | Payment date: 20160108; Year of fee payment: 6 |
PR1001 | Payment of annual fee | Payment date: 20160108; Start annual number: 6; End annual number: 6 |
FPAY | Annual fee payment | Payment date: 20180109; Year of fee payment: 8 |
PR1001 | Payment of annual fee | Payment date: 20180109; Start annual number: 8; End annual number: 8 |
LAPS | Lapse due to unpaid annual fee | |
PC1903 | Unpaid annual fee | Termination category: Default of registration fee; Termination date: 20191126 |