US20030180885A1 - DNA molecules encoding imidazoline receptive polypeptides and polypeptides encoded thereby - Google Patents
DNA molecules encoding imidazoline receptive polypeptides and polypeptides encoded thereby Download PDFInfo
- Publication number
- US20030180885A1 US20030180885A1 US10/420,845 US42084503A US2003180885A1 US 20030180885 A1 US20030180885 A1 US 20030180885A1 US 42084503 A US42084503 A US 42084503A US 2003180885 A1 US2003180885 A1 US 2003180885A1
- Authority
- US
- United States
- Prior art keywords
- leu
- dna
- sequence
- ala
- polypeptide
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- MTNDZQHUAFNZQY-UHFFFAOYSA-N imidazoline Chemical compound C1CN=CN1 MTNDZQHUAFNZQY-UHFFFAOYSA-N 0.000 title claims abstract description 37
- 108090000765 processed proteins & peptides Proteins 0.000 title claims description 84
- 102000004196 processed proteins & peptides Human genes 0.000 title claims description 80
- 229920001184 polypeptide Polymers 0.000 title claims description 76
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 112
- 108020004414 DNA Proteins 0.000 claims abstract description 89
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 87
- 102000009032 Imidazoline Receptors Human genes 0.000 claims abstract description 76
- 108010049134 Imidazoline Receptors Proteins 0.000 claims abstract description 76
- 238000000034 method Methods 0.000 claims abstract description 69
- 241000282414 Homo sapiens Species 0.000 claims abstract description 68
- 239000002299 complementary DNA Substances 0.000 claims abstract description 62
- 239000012634 fragment Substances 0.000 claims abstract description 35
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract description 34
- 238000012216 screening Methods 0.000 claims abstract description 15
- 210000004027 cell Anatomy 0.000 claims description 66
- 239000003446 ligand Substances 0.000 claims description 36
- 102000053602 DNA Human genes 0.000 claims description 34
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 34
- 240000007594 Oryza sativa Species 0.000 claims description 28
- 239000013598 vector Substances 0.000 claims description 26
- 150000002462 imidazolines Chemical class 0.000 claims description 15
- 241001465754 Metazoa Species 0.000 claims description 9
- 238000009396 hybridization Methods 0.000 claims description 8
- -1 imidazoline compound Chemical class 0.000 claims description 8
- 210000000170 cell membrane Anatomy 0.000 claims description 7
- 238000012258 culturing Methods 0.000 claims description 7
- 239000001963 growth medium Substances 0.000 claims description 7
- 108020003215 DNA Probes Proteins 0.000 claims description 6
- 239000003298 DNA probe Substances 0.000 claims description 6
- 238000006073 displacement reaction Methods 0.000 claims description 6
- 238000002372 labelling Methods 0.000 claims description 5
- 230000000295 complement effect Effects 0.000 claims description 3
- 239000000463 material Substances 0.000 claims description 3
- 230000027455 binding Effects 0.000 abstract description 76
- 102000005962 receptors Human genes 0.000 abstract description 21
- 108020003175 receptors Proteins 0.000 abstract description 21
- 229940079593 drug Drugs 0.000 abstract description 10
- 239000003814 drug Substances 0.000 abstract description 10
- 238000010367 cloning Methods 0.000 abstract description 7
- 108020004635 Complementary DNA Proteins 0.000 abstract description 6
- 235000018102 proteins Nutrition 0.000 description 72
- 235000001014 amino acid Nutrition 0.000 description 24
- 239000012528 membrane Substances 0.000 description 23
- 229940024606 amino acid Drugs 0.000 description 22
- 150000001413 amino acids Chemical class 0.000 description 22
- HSRPTPAPMBHRRJ-UHFFFAOYSA-N n-(2,6-dichloro-4-iodophenyl)-4,5-dihydro-1h-imidazol-2-amine Chemical compound ClC1=CC(I)=CC(Cl)=C1NC1=NCCN1 HSRPTPAPMBHRRJ-UHFFFAOYSA-N 0.000 description 19
- 239000002773 nucleotide Substances 0.000 description 19
- 125000003729 nucleotide group Chemical group 0.000 description 19
- 108020004999 messenger RNA Proteins 0.000 description 17
- 238000012163 sequencing technique Methods 0.000 description 16
- WPNJAUFVNXKLIM-UHFFFAOYSA-N Moxonidine Chemical compound COC1=NC(C)=NC(Cl)=C1NC1=NCCN1 WPNJAUFVNXKLIM-UHFFFAOYSA-N 0.000 description 15
- 229960003938 moxonidine Drugs 0.000 description 15
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 14
- 238000001378 electrochemiluminescence detection Methods 0.000 description 14
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 13
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 12
- 125000000899 L-alpha-glutamyl group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C([H])([H])C([H])([H])C(O[H])=O 0.000 description 12
- 102000010909 Monoamine Oxidase Human genes 0.000 description 12
- 108010062431 Monoamine oxidase Proteins 0.000 description 12
- 108010087924 alanylproline Proteins 0.000 description 12
- 150000001875 compounds Chemical class 0.000 description 12
- 210000004556 brain Anatomy 0.000 description 11
- 238000010790 dilution Methods 0.000 description 11
- 239000012895 dilution Substances 0.000 description 11
- 210000001519 tissue Anatomy 0.000 description 11
- 108091060211 Expressed sequence tag Proteins 0.000 description 10
- 102000003923 Protein Kinase C Human genes 0.000 description 10
- 108090000315 Protein Kinase C Proteins 0.000 description 10
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 10
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 10
- GOOXRYWLNNXLFL-UHFFFAOYSA-H azane oxygen(2-) ruthenium(3+) ruthenium(4+) hexachloride Chemical compound N.N.N.N.N.N.N.N.N.N.N.N.N.N.[O--].[O--].[Cl-].[Cl-].[Cl-].[Cl-].[Cl-].[Cl-].[Ru+3].[Ru+3].[Ru+4] GOOXRYWLNNXLFL-UHFFFAOYSA-H 0.000 description 10
- 108010078144 glutaminyl-glycine Proteins 0.000 description 10
- 239000013612 plasmid Substances 0.000 description 10
- 230000000694 effects Effects 0.000 description 9
- 239000000499 gel Substances 0.000 description 9
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 9
- 150000007523 nucleic acids Chemical group 0.000 description 9
- 239000000523 sample Substances 0.000 description 9
- 238000011144 upstream manufacturing Methods 0.000 description 9
- 102000001424 Ryanodine receptors Human genes 0.000 description 8
- 210000000133 brain stem Anatomy 0.000 description 8
- 238000001514 detection method Methods 0.000 description 8
- 238000002474 experimental method Methods 0.000 description 8
- 210000001320 hippocampus Anatomy 0.000 description 8
- 229950001476 idazoxan Drugs 0.000 description 8
- 239000002287 radioligand Substances 0.000 description 8
- 108091052345 ryanodine receptor (TC 1.A.3.1) family Proteins 0.000 description 8
- 108010026333 seryl-proline Proteins 0.000 description 8
- 238000001890 transfection Methods 0.000 description 8
- 238000001262 western blot Methods 0.000 description 8
- 238000003556 assay Methods 0.000 description 7
- 108010070643 prolylglutamic acid Proteins 0.000 description 7
- 210000005262 rostral ventrolateral medulla Anatomy 0.000 description 7
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 6
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 6
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 6
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 6
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 6
- 229960002896 clonidine Drugs 0.000 description 6
- 238000009826 distribution Methods 0.000 description 6
- 230000000971 hippocampal effect Effects 0.000 description 6
- HPMRFMKYPGXPEP-UHFFFAOYSA-N idazoxan Chemical compound N1CCN=C1C1OC2=CC=CC=C2OC1 HPMRFMKYPGXPEP-UHFFFAOYSA-N 0.000 description 6
- 108010034529 leucyl-lysine Proteins 0.000 description 6
- 108010057821 leucylproline Proteins 0.000 description 6
- 239000002609 medium Substances 0.000 description 6
- 238000003127 radioimmunoassay Methods 0.000 description 6
- 102000014914 Carrier Proteins Human genes 0.000 description 5
- GJSURZIOUXUGAL-UHFFFAOYSA-N Clonidine Chemical compound ClC1=CC=CC(Cl)=C1NC1=NCCN1 GJSURZIOUXUGAL-UHFFFAOYSA-N 0.000 description 5
- 108091026890 Coding region Proteins 0.000 description 5
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 5
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 5
- 108091092195 Intron Proteins 0.000 description 5
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 5
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 5
- 108700026244 Open Reading Frames Proteins 0.000 description 5
- CMOIIANLNNYUTP-SRVKXCTJSA-N Pro-Gln-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CMOIIANLNNYUTP-SRVKXCTJSA-N 0.000 description 5
- 108010005233 alanylglutamic acid Proteins 0.000 description 5
- 108010047495 alanylglycine Proteins 0.000 description 5
- 108091008324 binding proteins Proteins 0.000 description 5
- 230000000875 corresponding effect Effects 0.000 description 5
- 108010049041 glutamylalanine Proteins 0.000 description 5
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 5
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 4
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 4
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 4
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 4
- AUIJUTGLPVHIRT-FXQIFTODSA-N Arg-Ser-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AUIJUTGLPVHIRT-FXQIFTODSA-N 0.000 description 4
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 4
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 4
- 108010090461 DFG peptide Proteins 0.000 description 4
- 238000001712 DNA sequencing Methods 0.000 description 4
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 4
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 4
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 4
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 4
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 4
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 4
- 101710181914 Neural retina-specific leucine zipper protein Proteins 0.000 description 4
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 4
- VPFGPKIWSDVTOY-SRVKXCTJSA-N Pro-Glu-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O VPFGPKIWSDVTOY-SRVKXCTJSA-N 0.000 description 4
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 4
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 4
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 4
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 4
- 230000004913 activation Effects 0.000 description 4
- 239000000556 agonist Substances 0.000 description 4
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 4
- 108010041407 alanylaspartic acid Proteins 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 108010008355 arginyl-glutamine Proteins 0.000 description 4
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 4
- 108010047857 aspartylglycine Proteins 0.000 description 4
- 229910052791 calcium Inorganic materials 0.000 description 4
- 239000011575 calcium Substances 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 4
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 4
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 230000002209 hydrophobic effect Effects 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 108010079317 prolyl-tyrosine Proteins 0.000 description 4
- 230000004044 response Effects 0.000 description 4
- 230000035945 sensitivity Effects 0.000 description 4
- 108010061238 threonyl-glycine Proteins 0.000 description 4
- 101710191936 70 kDa protein Proteins 0.000 description 3
- CUQUEHYSSFETRD-ACZMJKKPSA-N Asn-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N CUQUEHYSSFETRD-ACZMJKKPSA-N 0.000 description 3
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 3
- 241000283690 Bos taurus Species 0.000 description 3
- RESAHOSBQHMOKH-KKUMJFAQSA-N Cys-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N RESAHOSBQHMOKH-KKUMJFAQSA-N 0.000 description 3
- MWVDDZUTWXFYHL-XKBZYTNZSA-N Cys-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)O MWVDDZUTWXFYHL-XKBZYTNZSA-N 0.000 description 3
- 108700024394 Exon Proteins 0.000 description 3
- 241000282326 Felis catus Species 0.000 description 3
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 3
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 3
- IWUFOVSLWADEJC-AVGNSLFASA-N Gln-His-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IWUFOVSLWADEJC-AVGNSLFASA-N 0.000 description 3
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 3
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 3
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 3
- ZQNCUVODKOBSSO-XEGUGMAKSA-N Glu-Trp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O ZQNCUVODKOBSSO-XEGUGMAKSA-N 0.000 description 3
- 101000852815 Homo sapiens Insulin receptor Proteins 0.000 description 3
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 3
- 241000880493 Leptailurus serval Species 0.000 description 3
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 3
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 3
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 3
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 3
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 3
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 3
- VTKPSXWRUGCOAC-GUBZILKMSA-N Met-Ala-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCSC VTKPSXWRUGCOAC-GUBZILKMSA-N 0.000 description 3
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 3
- 108010079364 N-glycylalanine Proteins 0.000 description 3
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 3
- 239000000020 Nitrocellulose Substances 0.000 description 3
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 3
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 3
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 3
- 108010079005 RDV peptide Proteins 0.000 description 3
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 3
- 238000002105 Southern blotting Methods 0.000 description 3
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 3
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 3
- BXJQKVDPRMLGKN-PMVMPFDFSA-N Tyr-Trp-Leu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(O)=O)C1=CC=C(O)C=C1 BXJQKVDPRMLGKN-PMVMPFDFSA-N 0.000 description 3
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 3
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 3
- 235000009697 arginine Nutrition 0.000 description 3
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 3
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 3
- 108010068380 arginylarginine Proteins 0.000 description 3
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 3
- 150000001768 cations Chemical class 0.000 description 3
- 210000001638 cerebellum Anatomy 0.000 description 3
- 108010054813 diprotin B Proteins 0.000 description 3
- 229940042399 direct acting antivirals protease inhibitors Drugs 0.000 description 3
- 102000037865 fusion proteins Human genes 0.000 description 3
- 108020001507 fusion proteins Proteins 0.000 description 3
- 238000001502 gel electrophoresis Methods 0.000 description 3
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 3
- 108010087823 glycyltyrosine Proteins 0.000 description 3
- 108010092114 histidylphenylalanine Proteins 0.000 description 3
- 230000003053 immunization Effects 0.000 description 3
- 238000002649 immunization Methods 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 235000013336 milk Nutrition 0.000 description 3
- 239000008267 milk Substances 0.000 description 3
- 210000004080 milk Anatomy 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 229920001220 nitrocellulos Polymers 0.000 description 3
- 230000001129 nonadrenergic effect Effects 0.000 description 3
- 230000009871 nonspecific binding Effects 0.000 description 3
- 102000039446 nucleic acids Human genes 0.000 description 3
- 108020004707 nucleic acids Proteins 0.000 description 3
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 3
- 108010024607 phenylalanylalanine Proteins 0.000 description 3
- 238000003752 polymerase chain reaction Methods 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- 108010073969 valyllysine Proteins 0.000 description 3
- UCTWMZQNUQWSLP-VIFPVBQESA-N (R)-adrenaline Chemical compound CNC[C@H](O)C1=CC=C(O)C(O)=C1 UCTWMZQNUQWSLP-VIFPVBQESA-N 0.000 description 2
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- NKDFYOWSKOHCCO-YPVLXUMRSA-N 20-hydroxyecdysone Chemical compound C1[C@@H](O)[C@@H](O)C[C@]2(C)[C@@H](CC[C@@]3([C@@H]([C@@](C)(O)[C@H](O)CCC(C)(O)C)CC[C@]33O)C)C3=CC(=O)[C@@H]21 NKDFYOWSKOHCCO-YPVLXUMRSA-N 0.000 description 2
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 2
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 2
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 2
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 2
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 2
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 2
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 2
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 2
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 2
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 2
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 2
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 2
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 2
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 2
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 2
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 2
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 2
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 2
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 2
- LLZXKVAAEWBUPB-KKUMJFAQSA-N Arg-Gln-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLZXKVAAEWBUPB-KKUMJFAQSA-N 0.000 description 2
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 2
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 2
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 2
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 2
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 2
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 2
- WQSCVMQDZYTFQU-FXQIFTODSA-N Asn-Cys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WQSCVMQDZYTFQU-FXQIFTODSA-N 0.000 description 2
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 2
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 2
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 2
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 2
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 2
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 2
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 2
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 2
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 2
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 2
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 2
- 241000972773 Aulopiformes Species 0.000 description 2
- KXDAEFPNCMNJSK-UHFFFAOYSA-N Benzamide Chemical compound NC(=O)C1=CC=CC=C1 KXDAEFPNCMNJSK-UHFFFAOYSA-N 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- HYKFOHGZGLOCAY-ZLUOBGJFSA-N Cys-Cys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O HYKFOHGZGLOCAY-ZLUOBGJFSA-N 0.000 description 2
- UIKLEGZPIOXFHJ-DLOVCJGASA-N Cys-Phe-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O UIKLEGZPIOXFHJ-DLOVCJGASA-N 0.000 description 2
- 108010015742 Cytochrome P-450 Enzyme System Proteins 0.000 description 2
- 102000003849 Cytochrome P450 Human genes 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- 229920002307 Dextran Polymers 0.000 description 2
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 2
- 238000012286 ELISA Assay Methods 0.000 description 2
- LZRMPXRYLLTAJX-GUBZILKMSA-N Gln-Arg-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZRMPXRYLLTAJX-GUBZILKMSA-N 0.000 description 2
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 2
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 2
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 2
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 2
- OREPWMPAUWIIAM-ZPFDUUQYSA-N Gln-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N OREPWMPAUWIIAM-ZPFDUUQYSA-N 0.000 description 2
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 2
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 2
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 2
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 2
- LZMQSTPFYJLVJB-GUBZILKMSA-N Glu-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LZMQSTPFYJLVJB-GUBZILKMSA-N 0.000 description 2
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 2
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 2
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 2
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 2
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 2
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 2
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 2
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 2
- MQVNVZUEPUIAFA-WDSKDSINSA-N Gly-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN MQVNVZUEPUIAFA-WDSKDSINSA-N 0.000 description 2
- SABZDFAAOJATBR-QWRGUYRKSA-N Gly-Cys-Phe Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SABZDFAAOJATBR-QWRGUYRKSA-N 0.000 description 2
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 2
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 2
- HFXJIZNEXNIZIJ-BQBZGAKWSA-N Gly-Glu-Gln Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFXJIZNEXNIZIJ-BQBZGAKWSA-N 0.000 description 2
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 2
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 2
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 2
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 2
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 2
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 2
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 2
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 2
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 2
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 description 2
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 2
- BJECXJHLUJXPJQ-PYJNHQTQSA-N Ile-Pro-His Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N BJECXJHLUJXPJQ-PYJNHQTQSA-N 0.000 description 2
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 2
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 2
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 2
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 2
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 2
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 2
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 2
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 2
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 2
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 2
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 2
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 2
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 2
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 2
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 2
- CNFMPVYIVQUJOO-NHCYSSNCSA-N Met-Val-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O CNFMPVYIVQUJOO-NHCYSSNCSA-N 0.000 description 2
- 238000000636 Northern blotting Methods 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- HPECNYCQLSVCHH-BZSNNMDCSA-N Phe-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N HPECNYCQLSVCHH-BZSNNMDCSA-N 0.000 description 2
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 2
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 2
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 2
- PMKIMKUGCSVFSV-CQDKDKBSSA-N Phe-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PMKIMKUGCSVFSV-CQDKDKBSSA-N 0.000 description 2
- GXDPQJUBLBZKDY-IAVJCBSLSA-N Phe-Ile-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GXDPQJUBLBZKDY-IAVJCBSLSA-N 0.000 description 2
- GZGPMBKUJDRICD-ULQDDVLXSA-N Phe-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O GZGPMBKUJDRICD-ULQDDVLXSA-N 0.000 description 2
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 2
- 229920001213 Polysorbate 20 Polymers 0.000 description 2
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 2
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 2
- GQLOZEMWEBDEAY-NAKRPEOUSA-N Pro-Cys-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GQLOZEMWEBDEAY-NAKRPEOUSA-N 0.000 description 2
- UPJGUQPLYWTISV-GUBZILKMSA-N Pro-Gln-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UPJGUQPLYWTISV-GUBZILKMSA-N 0.000 description 2
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 2
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 2
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 2
- SSWJYJHXQOYTSP-SRVKXCTJSA-N Pro-His-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O SSWJYJHXQOYTSP-SRVKXCTJSA-N 0.000 description 2
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 2
- ANESFYPBAJPYNJ-SDDRHHMPSA-N Pro-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ANESFYPBAJPYNJ-SDDRHHMPSA-N 0.000 description 2
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 2
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 2
- PGSWNLRYYONGPE-JYJNAYRXSA-N Pro-Val-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PGSWNLRYYONGPE-JYJNAYRXSA-N 0.000 description 2
- KJTLSVCANCCWHF-UHFFFAOYSA-N Ruthenium Chemical compound [Ru] KJTLSVCANCCWHF-UHFFFAOYSA-N 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 2
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 2
- MPPHJZYXDVDGOF-BWBBJGPYSA-N Ser-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CO MPPHJZYXDVDGOF-BWBBJGPYSA-N 0.000 description 2
- DGHFNYXVIXNNMC-GUBZILKMSA-N Ser-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N DGHFNYXVIXNNMC-GUBZILKMSA-N 0.000 description 2
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 2
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 2
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 2
- JEHPKECJCALLRW-CUJWVEQBSA-N Ser-His-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEHPKECJCALLRW-CUJWVEQBSA-N 0.000 description 2
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 2
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 2
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 2
- JHBHMCMKSPXRHV-NUMRIWBASA-N Thr-Asn-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JHBHMCMKSPXRHV-NUMRIWBASA-N 0.000 description 2
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 2
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 2
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 2
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 2
- DTPARJBMONKGGC-IHPCNDPISA-N Trp-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N DTPARJBMONKGGC-IHPCNDPISA-N 0.000 description 2
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 2
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 2
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 2
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 2
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 2
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 2
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 2
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 2
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 2
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 2
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 2
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 2
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 2
- RFKJNTRMXGCKFE-FHWLQOOXSA-N Val-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC(C)C)C(O)=O)=CNC2=C1 RFKJNTRMXGCKFE-FHWLQOOXSA-N 0.000 description 2
- YQMILNREHKTFBS-IHRRRGAJSA-N Val-Phe-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YQMILNREHKTFBS-IHRRRGAJSA-N 0.000 description 2
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 2
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 2
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 2
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 239000002671 adjuvant Substances 0.000 description 2
- 102000030484 alpha-2 Adrenergic Receptor Human genes 0.000 description 2
- 108020004101 alpha-2 Adrenergic Receptor Proteins 0.000 description 2
- 239000005557 antagonist Substances 0.000 description 2
- 102000036639 antigens Human genes 0.000 description 2
- 108091007433 antigens Proteins 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- YZXBAPSDXZZRGB-DOFZRALJSA-N arachidonic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O YZXBAPSDXZZRGB-DOFZRALJSA-N 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 230000002860 competitive effect Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000007876 drug discovery Methods 0.000 description 2
- 238000007877 drug screening Methods 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 235000013922 glutamic acid Nutrition 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 229940125425 inverse agonist Drugs 0.000 description 2
- 230000003278 mimic effect Effects 0.000 description 2
- 230000036961 partial effect Effects 0.000 description 2
- 108010084572 phenylalanyl-valine Proteins 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- OJUGVDODNPJEEC-UHFFFAOYSA-N phenylglyoxal Chemical compound O=CC(=O)C1=CC=CC=C1 OJUGVDODNPJEEC-UHFFFAOYSA-N 0.000 description 2
- 230000026731 phosphorylation Effects 0.000 description 2
- 238000006366 phosphorylation reaction Methods 0.000 description 2
- 229920002401 polyacrylamide Polymers 0.000 description 2
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 2
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 2
- 230000004481 post-translational protein modification Effects 0.000 description 2
- 230000036515 potency Effects 0.000 description 2
- 230000003389 potentiating effect Effects 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 238000003653 radioligand binding assay Methods 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 229910052707 ruthenium Inorganic materials 0.000 description 2
- 235000019515 salmon Nutrition 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 238000003153 stable transfection Methods 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 230000009261 transgenic effect Effects 0.000 description 2
- 238000003146 transient transfection Methods 0.000 description 2
- 239000003656 tris buffered saline Substances 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- SFLSHLFXELFNJZ-QMMMGPOBSA-N (-)-norepinephrine Chemical compound NC[C@H](O)C1=CC=C(O)C(O)=C1 SFLSHLFXELFNJZ-QMMMGPOBSA-N 0.000 description 1
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 1
- WDVIDPRACNGFPP-QWRGUYRKSA-N (2s)-2-[[(2s)-6-amino-2-[[2-[(2-aminoacetyl)amino]acetyl]amino]hexanoyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound NCC(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WDVIDPRACNGFPP-QWRGUYRKSA-N 0.000 description 1
- 229930182837 (R)-adrenaline Natural products 0.000 description 1
- WHTVZRBIWZFKQO-AWEZNQCLSA-N (S)-chloroquine Chemical compound ClC1=CC=C2C(N[C@@H](C)CCCN(CC)CC)=CC=NC2=C1 WHTVZRBIWZFKQO-AWEZNQCLSA-N 0.000 description 1
- OZDAOHVKBFBBMZ-UHFFFAOYSA-N 2-aminopentanedioic acid;hydrate Chemical compound O.OC(=O)C(N)CCC(O)=O OZDAOHVKBFBBMZ-UHFFFAOYSA-N 0.000 description 1
- LIOLIMKSCNQPLV-UHFFFAOYSA-N 2-fluoro-n-methyl-4-[7-(quinolin-6-ylmethyl)imidazo[1,2-b][1,2,4]triazin-2-yl]benzamide Chemical compound C1=C(F)C(C(=O)NC)=CC=C1C1=NN2C(CC=3C=C4C=CC=NC4=CC=3)=CN=C2N=C1 LIOLIMKSCNQPLV-UHFFFAOYSA-N 0.000 description 1
- SLXKOJJOQWFEFD-UHFFFAOYSA-N 6-aminohexanoic acid Chemical compound NCCCCCC(O)=O SLXKOJJOQWFEFD-UHFFFAOYSA-N 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- QYPPJABKJHAVHS-UHFFFAOYSA-N Agmatine Natural products NCCCCNC(N)=N QYPPJABKJHAVHS-UHFFFAOYSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 1
- ZEXDYVGDZJBRMO-ACZMJKKPSA-N Ala-Asn-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZEXDYVGDZJBRMO-ACZMJKKPSA-N 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- AUFACLFHBAGZEN-ZLUOBGJFSA-N Ala-Ser-Cys Chemical compound N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O AUFACLFHBAGZEN-ZLUOBGJFSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- WZGZDOXCDLLTHE-SYWGBEHUSA-N Ala-Trp-Ile Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 WZGZDOXCDLLTHE-SYWGBEHUSA-N 0.000 description 1
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 1
- 108010087765 Antipain Proteins 0.000 description 1
- YUIGJDNAGKJLDO-JYJNAYRXSA-N Arg-Arg-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YUIGJDNAGKJLDO-JYJNAYRXSA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 1
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 1
- CRCCTGPNZUCAHE-DCAQKATOSA-N Arg-His-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 CRCCTGPNZUCAHE-DCAQKATOSA-N 0.000 description 1
- DGFXIWKPTDKBLF-AVGNSLFASA-N Arg-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N DGFXIWKPTDKBLF-AVGNSLFASA-N 0.000 description 1
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 1
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 1
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 1
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 1
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 description 1
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 1
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 1
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 1
- DNYRZPOWBTYFAF-IHRRRGAJSA-N Asn-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)O DNYRZPOWBTYFAF-IHRRRGAJSA-N 0.000 description 1
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 1
- VJTWLBMESLDOMK-WDSKDSINSA-N Asn-Gln-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VJTWLBMESLDOMK-WDSKDSINSA-N 0.000 description 1
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 1
- UWFOMGUWGPRVBW-GUBZILKMSA-N Asn-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N UWFOMGUWGPRVBW-GUBZILKMSA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 1
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 1
- PJERDVUTUDZPGX-ZKWXMUAHSA-N Asp-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC(O)=O PJERDVUTUDZPGX-ZKWXMUAHSA-N 0.000 description 1
- RSMIHCFQDCVVBR-CIUDSAMLSA-N Asp-Gln-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RSMIHCFQDCVVBR-CIUDSAMLSA-N 0.000 description 1
- YBMUFUWSMIKJQA-GUBZILKMSA-N Asp-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N YBMUFUWSMIKJQA-GUBZILKMSA-N 0.000 description 1
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- RWHHSFSWKFBTCF-KKUMJFAQSA-N Asp-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N RWHHSFSWKFBTCF-KKUMJFAQSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 1
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 1
- IOXWDLNHXZOXQP-FXQIFTODSA-N Asp-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N IOXWDLNHXZOXQP-FXQIFTODSA-N 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 1
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 1
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 229940121926 Calpain inhibitor Drugs 0.000 description 1
- 102100035037 Calpastatin Human genes 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 101100111281 Chlorobium chlorochromatii (strain CaD3) bchB gene Proteins 0.000 description 1
- 229940122644 Chymotrypsin inhibitor Drugs 0.000 description 1
- YAORIDZYZDUZCM-UHFFFAOYSA-N Cirazoline Chemical compound N=1CCNC=1COC1=CC=CC=C1C1CC1 YAORIDZYZDUZCM-UHFFFAOYSA-N 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- PKNIZMPLMSKROD-BIIVOSGPSA-N Cys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N PKNIZMPLMSKROD-BIIVOSGPSA-N 0.000 description 1
- LDIKUWLAMDFHPU-FXQIFTODSA-N Cys-Cys-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LDIKUWLAMDFHPU-FXQIFTODSA-N 0.000 description 1
- MBILEVLLOHJZMG-FXQIFTODSA-N Cys-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MBILEVLLOHJZMG-FXQIFTODSA-N 0.000 description 1
- HQZGVYJBRSISDT-BQBZGAKWSA-N Cys-Gly-Arg Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQZGVYJBRSISDT-BQBZGAKWSA-N 0.000 description 1
- UVZFZTWNHOQWNK-NAKRPEOUSA-N Cys-Ile-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UVZFZTWNHOQWNK-NAKRPEOUSA-N 0.000 description 1
- ZMWOJVAXTOUHAP-ZKWXMUAHSA-N Cys-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N ZMWOJVAXTOUHAP-ZKWXMUAHSA-N 0.000 description 1
- XZKJEOMFLDVXJG-KATARQTJSA-N Cys-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)N)O XZKJEOMFLDVXJG-KATARQTJSA-N 0.000 description 1
- RJPKQCFHEPPTGL-ZLUOBGJFSA-N Cys-Ser-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RJPKQCFHEPPTGL-ZLUOBGJFSA-N 0.000 description 1
- SAEVTQWAYDPXMU-KATARQTJSA-N Cys-Thr-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O SAEVTQWAYDPXMU-KATARQTJSA-N 0.000 description 1
- 102000018832 Cytochromes Human genes 0.000 description 1
- 108010052832 Cytochromes Proteins 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 102000013446 GTP Phosphohydrolases Human genes 0.000 description 1
- 108091006109 GTPases Proteins 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 1
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 1
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 1
- YNNXQZDEOCYJJL-CIUDSAMLSA-N Gln-Arg-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N YNNXQZDEOCYJJL-CIUDSAMLSA-N 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- JKPGHIQCHIIRMS-AVGNSLFASA-N Gln-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N JKPGHIQCHIIRMS-AVGNSLFASA-N 0.000 description 1
- BLOXULLYFRGYKZ-GUBZILKMSA-N Gln-Glu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BLOXULLYFRGYKZ-GUBZILKMSA-N 0.000 description 1
- LFIVHGMKWFGUGK-IHRRRGAJSA-N Gln-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LFIVHGMKWFGUGK-IHRRRGAJSA-N 0.000 description 1
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 1
- HVQCEQTUSWWFOS-WDSKDSINSA-N Gln-Gly-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N HVQCEQTUSWWFOS-WDSKDSINSA-N 0.000 description 1
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 1
- KQOPMGBHNQBCEL-HVTMNAMFSA-N Gln-His-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KQOPMGBHNQBCEL-HVTMNAMFSA-N 0.000 description 1
- GLAPJAHOPFSLKL-SRVKXCTJSA-N Gln-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N GLAPJAHOPFSLKL-SRVKXCTJSA-N 0.000 description 1
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 1
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 1
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 1
- WEAVZFWWIPIANL-SRVKXCTJSA-N Gln-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N WEAVZFWWIPIANL-SRVKXCTJSA-N 0.000 description 1
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 1
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 1
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 1
- ZJICFHQSPWFBKP-AVGNSLFASA-N Glu-Asn-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZJICFHQSPWFBKP-AVGNSLFASA-N 0.000 description 1
- FYYSIASRLDJUNP-WHFBIAKZSA-N Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O FYYSIASRLDJUNP-WHFBIAKZSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- XOIATPHFYVWFEU-DCAQKATOSA-N Glu-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XOIATPHFYVWFEU-DCAQKATOSA-N 0.000 description 1
- NJPQBTJSYCKCNS-HVTMNAMFSA-N Glu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N NJPQBTJSYCKCNS-HVTMNAMFSA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 1
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 1
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 1
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 1
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 description 1
- RXESHTOTINOODU-JYJNAYRXSA-N Glu-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N RXESHTOTINOODU-JYJNAYRXSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 1
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 1
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- IDOGEHIWMJMAHT-BYPYZUCNSA-N Gly-Gly-Cys Chemical compound NCC(=O)NCC(=O)N[C@@H](CS)C(O)=O IDOGEHIWMJMAHT-BYPYZUCNSA-N 0.000 description 1
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- VAXIVIPMCTYSHI-YUMQZZPRSA-N Gly-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN VAXIVIPMCTYSHI-YUMQZZPRSA-N 0.000 description 1
- ZOTGXWMKUFSKEU-QXEWZRGKSA-N Gly-Ile-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O ZOTGXWMKUFSKEU-QXEWZRGKSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- PCPOYRCAHPJXII-UWVGGRQHSA-N Gly-Lys-Met Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PCPOYRCAHPJXII-UWVGGRQHSA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- WJUYPBBCSSLVJE-CIUDSAMLSA-N His-Asn-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N WJUYPBBCSSLVJE-CIUDSAMLSA-N 0.000 description 1
- JCOSMKPAOYDKRO-AVGNSLFASA-N His-Glu-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N JCOSMKPAOYDKRO-AVGNSLFASA-N 0.000 description 1
- WEIYKCOEVBUJQC-JYJNAYRXSA-N His-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WEIYKCOEVBUJQC-JYJNAYRXSA-N 0.000 description 1
- OEROYDLRVAYIMQ-YUMQZZPRSA-N His-Gly-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O OEROYDLRVAYIMQ-YUMQZZPRSA-N 0.000 description 1
- JJHWJUYYTWYXPL-PYJNHQTQSA-N His-Ile-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CN=CN1 JJHWJUYYTWYXPL-PYJNHQTQSA-N 0.000 description 1
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 1
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 1
- SVVULKPWDBIPCO-BZSNNMDCSA-N His-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SVVULKPWDBIPCO-BZSNNMDCSA-N 0.000 description 1
- JMSONHOUHFDOJH-GUBZILKMSA-N His-Ser-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 JMSONHOUHFDOJH-GUBZILKMSA-N 0.000 description 1
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 1
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 1
- ISQOVWDWRUONJH-YESZJQIVSA-N His-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O ISQOVWDWRUONJH-YESZJQIVSA-N 0.000 description 1
- WSXNWASHQNSMRX-GVXVVHGQSA-N His-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WSXNWASHQNSMRX-GVXVVHGQSA-N 0.000 description 1
- 101100273831 Homo sapiens CDS1 gene Proteins 0.000 description 1
- 101000997017 Homo sapiens Neural retina-specific leucine zipper protein Proteins 0.000 description 1
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 1
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- ASCFJMSGKUIRDU-ZPFDUUQYSA-N Ile-Arg-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O ASCFJMSGKUIRDU-ZPFDUUQYSA-N 0.000 description 1
- ATXGFMOBVKSOMK-PEDHHIEDSA-N Ile-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N ATXGFMOBVKSOMK-PEDHHIEDSA-N 0.000 description 1
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 1
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 1
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 1
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 1
- LEHPJMKVGFPSSP-ZQINRCPSSA-N Ile-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 LEHPJMKVGFPSSP-ZQINRCPSSA-N 0.000 description 1
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 1
- APDIECQNNDGFPD-PYJNHQTQSA-N Ile-His-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N APDIECQNNDGFPD-PYJNHQTQSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 1
- DNKDIDZHXZAGRY-HJWJTTGWSA-N Ile-Met-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N DNKDIDZHXZAGRY-HJWJTTGWSA-N 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- 108010061833 Integrases Proteins 0.000 description 1
- 108020005351 Isochores Proteins 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 1
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 1
- VKOAHIRLIUESLU-ULQDDVLXSA-N Leu-Arg-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VKOAHIRLIUESLU-ULQDDVLXSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- FEHQLKKBVJHSEC-SZMVWBNQSA-N Leu-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FEHQLKKBVJHSEC-SZMVWBNQSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- UCNNZELZXFXXJQ-BZSNNMDCSA-N Leu-Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 1
- GSSMYQHXZNERFX-WDSOQIARSA-N Leu-Met-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N GSSMYQHXZNERFX-WDSOQIARSA-N 0.000 description 1
- KTOIECMYZZGVSI-BZSNNMDCSA-N Leu-Phe-His Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KTOIECMYZZGVSI-BZSNNMDCSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- GDBQQVLCIARPGH-UHFFFAOYSA-N Leupeptin Natural products CC(C)CC(NC(C)=O)C(=O)NC(CC(C)C)C(=O)NC(C=O)CCCN=C(N)N GDBQQVLCIARPGH-UHFFFAOYSA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- CKSXSQUVEYCDIW-AVGNSLFASA-N Lys-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N CKSXSQUVEYCDIW-AVGNSLFASA-N 0.000 description 1
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 1
- BPDXWKVZNCKUGG-BZSNNMDCSA-N Lys-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCCN)N BPDXWKVZNCKUGG-BZSNNMDCSA-N 0.000 description 1
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 1
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- MVBZBRKNZVJEKK-DTWKUNHWSA-N Met-Gly-Pro Chemical compound CSCC[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N MVBZBRKNZVJEKK-DTWKUNHWSA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- RBGLBUDVQVPTEG-DCAQKATOSA-N Met-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N RBGLBUDVQVPTEG-DCAQKATOSA-N 0.000 description 1
- HUURTRNKPBHHKZ-JYJNAYRXSA-N Met-Phe-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 HUURTRNKPBHHKZ-JYJNAYRXSA-N 0.000 description 1
- MPCKIRSXNKACRF-GUBZILKMSA-N Met-Pro-Asn Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O MPCKIRSXNKACRF-GUBZILKMSA-N 0.000 description 1
- XIGAHPDZLAYQOS-SRVKXCTJSA-N Met-Pro-Pro Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 XIGAHPDZLAYQOS-SRVKXCTJSA-N 0.000 description 1
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 1
- IQJMEDDVOGMTKT-SRVKXCTJSA-N Met-Val-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IQJMEDDVOGMTKT-SRVKXCTJSA-N 0.000 description 1
- 241000191938 Micrococcus luteus Species 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 1
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 1
- XXAOSEUPEMQJOF-KKUMJFAQSA-N Phe-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XXAOSEUPEMQJOF-KKUMJFAQSA-N 0.000 description 1
- ZKSLXIGKRJMALF-MGHWNKPDSA-N Phe-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N ZKSLXIGKRJMALF-MGHWNKPDSA-N 0.000 description 1
- FINLZXKJWTYYLC-ACRUOGEOSA-N Phe-His-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FINLZXKJWTYYLC-ACRUOGEOSA-N 0.000 description 1
- XMQSOOJRRVEHRO-ULQDDVLXSA-N Phe-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMQSOOJRRVEHRO-ULQDDVLXSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- KPEIBEPEUAZWNS-ULQDDVLXSA-N Phe-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KPEIBEPEUAZWNS-ULQDDVLXSA-N 0.000 description 1
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 1
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 1
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- YDUGVDGFKNXFPL-IXOXFDKPSA-N Phe-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YDUGVDGFKNXFPL-IXOXFDKPSA-N 0.000 description 1
- KQUMFXGQTSAEJE-PMVMPFDFSA-N Phe-Trp-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KQUMFXGQTSAEJE-PMVMPFDFSA-N 0.000 description 1
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 1
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 1
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 1
- 206010035226 Plasma cell myeloma Diseases 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- ALJGSKMBIUEJOB-FXQIFTODSA-N Pro-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 ALJGSKMBIUEJOB-FXQIFTODSA-N 0.000 description 1
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- ONPFOYPPPOHMNH-UVBJJODRSA-N Pro-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 ONPFOYPPPOHMNH-UVBJJODRSA-N 0.000 description 1
- VPVHXWGPALPDGP-GUBZILKMSA-N Pro-Asn-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPVHXWGPALPDGP-GUBZILKMSA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 1
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- AJCRQOHDLCBHFA-SRVKXCTJSA-N Pro-His-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AJCRQOHDLCBHFA-SRVKXCTJSA-N 0.000 description 1
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 1
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 1
- KLOQCCRTPHPIFN-DCAQKATOSA-N Pro-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 KLOQCCRTPHPIFN-DCAQKATOSA-N 0.000 description 1
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 1
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 1
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 1
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- SNNSYBWPPVAXQW-ZLUOBGJFSA-N Ser-Cys-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)O SNNSYBWPPVAXQW-ZLUOBGJFSA-N 0.000 description 1
- MAWSJXHRLWVJEZ-ACZMJKKPSA-N Ser-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N MAWSJXHRLWVJEZ-ACZMJKKPSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 1
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 1
- XVWDJUROVRQKAE-KKUMJFAQSA-N Ser-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=CC=C1 XVWDJUROVRQKAE-KKUMJFAQSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 241000906446 Theraps Species 0.000 description 1
- YRNBANYVJJBGDI-VZFHVOOUSA-N Thr-Ala-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N)O YRNBANYVJJBGDI-VZFHVOOUSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 1
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 1
- ZLNWJMRLHLGKFX-SVSWQMSJSA-N Thr-Cys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZLNWJMRLHLGKFX-SVSWQMSJSA-N 0.000 description 1
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 1
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- VUSAEKOXGNEYNE-PBCZWWQYSA-N Thr-His-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VUSAEKOXGNEYNE-PBCZWWQYSA-N 0.000 description 1
- ODXKUIGEPAGKKV-KATARQTJSA-N Thr-Leu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O ODXKUIGEPAGKKV-KATARQTJSA-N 0.000 description 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 1
- MEBDIIKMUUNBSB-RPTUDFQQSA-N Thr-Phe-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MEBDIIKMUUNBSB-RPTUDFQQSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- DIHPMRTXPYMDJZ-KAOXEZKKSA-N Thr-Tyr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N)O DIHPMRTXPYMDJZ-KAOXEZKKSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 1
- AVYVKJMBNLPWRX-WFBYXXMGSA-N Trp-Ala-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 AVYVKJMBNLPWRX-WFBYXXMGSA-N 0.000 description 1
- JLTQXEOXIJMCLZ-ZVZYQTTQSA-N Trp-Gln-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 JLTQXEOXIJMCLZ-ZVZYQTTQSA-N 0.000 description 1
- YTCNLMSUXPCFBW-SXNHZJKMSA-N Trp-Ile-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O YTCNLMSUXPCFBW-SXNHZJKMSA-N 0.000 description 1
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 1
- MPKPIWFFDWVJGC-IRIUXVKKSA-N Tyr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O MPKPIWFFDWVJGC-IRIUXVKKSA-N 0.000 description 1
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 1
- LRHBBGDMBLFYGL-FHWLQOOXSA-N Tyr-Phe-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LRHBBGDMBLFYGL-FHWLQOOXSA-N 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 1
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 1
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 1
- NYTKXWLZSNRILS-IFFSRLJSSA-N Val-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N)O NYTKXWLZSNRILS-IFFSRLJSSA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 1
- KVRLNEILGGVBJX-IHRRRGAJSA-N Val-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CN=CN1 KVRLNEILGGVBJX-IHRRRGAJSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 1
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 1
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- PGBMPFKFKXYROZ-UFYCRDLUSA-N Val-Tyr-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N PGBMPFKFKXYROZ-UFYCRDLUSA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- DGEZNRSVGBDHLK-UHFFFAOYSA-N [1,10]phenanthroline Chemical compound C1=CN=C2C3=NC=CC=C3C=CC2=C1 DGEZNRSVGBDHLK-UHFFFAOYSA-N 0.000 description 1
- HMNZFMSWFCAGGW-XPWSMXQVSA-N [3-[hydroxy(2-hydroxyethoxy)phosphoryl]oxy-2-[(e)-octadec-9-enoyl]oxypropyl] (e)-octadec-9-enoate Chemical compound CCCCCCCC\C=C\CCCCCCCC(=O)OCC(COP(O)(=O)OCCO)OC(=O)CCCCCCC\C=C\CCCCCCCC HMNZFMSWFCAGGW-XPWSMXQVSA-N 0.000 description 1
- 108091006088 activator proteins Proteins 0.000 description 1
- 230000001919 adrenal effect Effects 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- QYPPJABKJHAVHS-UHFFFAOYSA-P agmatinium(2+) Chemical compound NC(=[NH2+])NCCCC[NH3+] QYPPJABKJHAVHS-UHFFFAOYSA-P 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 229960002684 aminocaproic acid Drugs 0.000 description 1
- 230000000844 anti-bacterial effect Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- SDNYTAYICBFYFH-TUFLPTIASA-N antipain Chemical compound NC(N)=NCCC[C@@H](C=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCN=C(N)N)NC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SDNYTAYICBFYFH-TUFLPTIASA-N 0.000 description 1
- IEJXVRYNEISIKR-UHFFFAOYSA-N apraclonidine Chemical compound ClC1=CC(N)=CC(Cl)=C1NC1=NCCN1 IEJXVRYNEISIKR-UHFFFAOYSA-N 0.000 description 1
- 229940114079 arachidonic acid Drugs 0.000 description 1
- 235000021342 arachidonic acid Nutrition 0.000 description 1
- 150000001484 arginines Chemical class 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 238000003149 assay kit Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000001772 blood platelet Anatomy 0.000 description 1
- 229940098773 bovine serum albumin Drugs 0.000 description 1
- 108010079785 calpain inhibitors Proteins 0.000 description 1
- 108010044208 calpastatin Proteins 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 150000001718 carbodiimides Chemical class 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 210000003169 central nervous system Anatomy 0.000 description 1
- 210000003710 cerebral cortex Anatomy 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000001311 chemical methods and process Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- 229960003677 chloroquine Drugs 0.000 description 1
- WHTVZRBIWZFKQO-UHFFFAOYSA-N chloroquine Natural products ClC1=CC=C2C(NC(C)CCCN(CC)CC)=CC=NC2=C1 WHTVZRBIWZFKQO-UHFFFAOYSA-N 0.000 description 1
- 239000003541 chymotrypsin inhibitor Substances 0.000 description 1
- 229950008137 cirazoline Drugs 0.000 description 1
- 238000012875 competitive assay Methods 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000005094 computer simulation Methods 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 102000003675 cytokine receptors Human genes 0.000 description 1
- 108010057085 cytokine receptors Proteins 0.000 description 1
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 1
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 238000000326 densiometry Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 239000005546 dideoxynucleotide Substances 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 230000002526 effect on cardiovascular system Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 210000001671 embryonic stem cell Anatomy 0.000 description 1
- 210000002257 embryonic structure Anatomy 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 229960005139 epinephrine Drugs 0.000 description 1
- DEFVIWRASFVYLL-UHFFFAOYSA-N ethylene glycol bis(2-aminoethyl)tetraacetic acid Chemical compound OC(=O)CN(CC(O)=O)CCOCCOCCN(CC(O)=O)CC(O)=O DEFVIWRASFVYLL-UHFFFAOYSA-N 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 239000012894 fetal calf serum Substances 0.000 description 1
- 230000001605 fetal effect Effects 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 210000005153 frontal cortex Anatomy 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 229960002989 glutamic acid Drugs 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 150000002307 glutamic acids Chemical class 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010050848 glycylleucine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 102000045091 human NRL Human genes 0.000 description 1
- 210000004408 hybridoma Anatomy 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 238000003119 immunoblot Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- PGLTVOMIXTUURA-UHFFFAOYSA-N iodoacetamide Chemical compound NC(=O)CI PGLTVOMIXTUURA-UHFFFAOYSA-N 0.000 description 1
- XEEYBQQBJWHFJM-UHFFFAOYSA-N iron Substances [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 1
- 229910052742 iron Inorganic materials 0.000 description 1
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 1
- 229910052746 lanthanum Inorganic materials 0.000 description 1
- FZLIPJUXYLNCLC-UHFFFAOYSA-N lanthanum atom Chemical compound [La] FZLIPJUXYLNCLC-UHFFFAOYSA-N 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- GDBQQVLCIARPGH-ULQDDVLXSA-N leupeptin Chemical compound CC(C)C[C@H](NC(C)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C=O)CCCN=C(N)N GDBQQVLCIARPGH-ULQDDVLXSA-N 0.000 description 1
- 108010052968 leupeptin Proteins 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 210000001165 lymph node Anatomy 0.000 description 1
- 230000002934 lysing effect Effects 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 210000001767 medulla oblongata Anatomy 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 210000003470 mitochondria Anatomy 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 201000000050 myeloid neoplasm Diseases 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 229960002748 norepinephrine Drugs 0.000 description 1
- SFLSHLFXELFNJZ-UHFFFAOYSA-N norepinephrine Natural products NCC(O)C1=CC=C(O)C(O)=C1 SFLSHLFXELFNJZ-UHFFFAOYSA-N 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- 229940049954 penicillin Drugs 0.000 description 1
- 108010091212 pepstatin Proteins 0.000 description 1
- FAXGPCHRFPCXOO-LXTPJMTPSA-N pepstatin A Chemical compound OC(=O)C[C@H](O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)C[C@H](O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)NC(=O)CC(C)C FAXGPCHRFPCXOO-LXTPJMTPSA-N 0.000 description 1
- 230000003617 peroxidasic effect Effects 0.000 description 1
- 239000008177 pharmaceutical agent Substances 0.000 description 1
- 230000002974 pharmacogenomic effect Effects 0.000 description 1
- 230000000144 pharmacologic effect Effects 0.000 description 1
- MRBDMNSDAVCSSF-UHFFFAOYSA-N phentolamine Chemical compound C1=CC(C)=CC=C1N(C=1C=C(O)C=CC=1)CC1=NCCN1 MRBDMNSDAVCSSF-UHFFFAOYSA-N 0.000 description 1
- 229960001999 phentolamine Drugs 0.000 description 1
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 208000028591 pheochromocytoma Diseases 0.000 description 1
- 230000003169 placental effect Effects 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 238000002331 protein detection Methods 0.000 description 1
- 230000002797 proteolythic effect Effects 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 239000000700 radioactive tracer Substances 0.000 description 1
- 239000001044 red dye Substances 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 229920006298 saran Polymers 0.000 description 1
- 210000001908 sarcoplasmic reticulum Anatomy 0.000 description 1
- 238000011451 sequencing strategy Methods 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 210000004989 spleen cell Anatomy 0.000 description 1
- 101150038671 strat gene Proteins 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 230000017105 transposition Effects 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 125000001493 tyrosinyl group Chemical group [H]OC1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 108010003885 valyl-prolyl-glycyl-glycine Proteins 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
Definitions
- the present invention is directed to DNA molecules encoding imidazoline receptive polypeptides, preferably encoding human imidazoline receptive polypeptides, that can be used as an imidazoline receptor (abbreviated IR).
- transcript(s) and protein sequences are predicted from the DNA clones.
- the invention is also directed to a genomic DNA clone designated as JEP-1A.
- the CDNA clones according to the invention comprise cDNA homologous to portion(s) of this genomic clone; including 5A-1 cDNA, cloned by the inventors that established the open-reading frame for translation of mRNA from the gene, and established the immunoreactive properties of its polypeptide sequence in an expression systems.
- the invention relates to cDNA clone EST04033, which is another clone identified to contain cDNA sequences from the JEP-1A gene, and of which the 5A-1 is a part, that encodes an active fragment of the IR polypeptide in transfection assays, and the protein sequences thereof.
- the invention also relates to methods for producing such genomic and cDNA clones, methods for expressing the IR protein and fragments, and uses thereof.
- brainstem imidazoline receptors possess binding site(s) for therapeutically relevant imidazoline compounds, such as clonidine and idazoxan. These drugs represent the first generation of ligands discovered for the binding site(s) of imidazoline receptors.
- clonidine and idazoxan were developed based on their high affinity for ⁇ 2 -adrenergic receptors.
- Second generation ligands, such as moxonidine possess somewhat improved selectivity for IR over ⁇ 2 -adrenergic receptors, but more selective compounds for IR are needed.
- An imidazoline receptor clone is of particular interest because of its potential utility in identifying novel pharmaceutical agents having greater potency and/or more selectivity than currently available ligands have for imidazoline receptors.
- Recent technological advances permit pharmaceutical companies to use combinatorial chemistry techniques to rapidly screen a cloned receptor for ligands (drugs) binding thereto.
- a cloned imidazoline receptor would be of significant value to a drug discovery program.
- imidazoline receptors have been described in the literature by three different laboratories to visualize imidazoline-selective binding proteins (imidazoline receptor candidates) using gel electrophoresis. Some important consistencies have emerged from these results despite the diversity of the techniques employed. On the other hand, multiple protein bands have been identified, which suggests heterogeneity amongst imidazoline receptors. These reports are discussed below.
- phentolamine another imidazoline compound
- Reis antiserum The term “I-site” refers to the imidazoline binding site, presumably defined within the imidazoline receptor protein.
- Reis antiserum was prepared by injecting the purified protein into rabbits [Wang et al, 1992]. The first immunization was done subcutaneously with the protein antigen (10 ⁇ g) emulsified in an equal volume of complete Freund's adjuvant, and the next three booster shots were given at 15-day intervals with incomplete Freund's adjuvant.
- the polyclonal antiserum has been mostly characterized by immunoblotting, but radioimmunoassays (RIA) and/or conjugated assay procedures, i.e., ELISA assays, are also conceivable [see “Radioimmunoassay of Gut Regulatory Peptides: Methods in Laboratory Medicine,” Vol. 2, chapters 1 and 2, Praeger Scientific Press, 1982].
- the present inventors and others [Escriba et al., Neurosci. Lett. 178: 81-84 (1994)] have characterized the Reis antiserum in several respects. For instance, the present inventors have discovered that human platelet immunoreactivity with Reis antiserum is mainly confined to a single protein band of MW ⁇ 33 kDa, although a trace band at ⁇ 85 kDa was also observed. The ⁇ 33 and ⁇ 85 kDa bands were enriched in plasma membrane fractions as expected for an imidazoline receptor.
- the intensity of the ⁇ 33 kDa band was found to be positively correlated with non-adrenergic 125 PIC Bmax values at platelet IR 1 sites in samples from the same subjects, with an almost one-to-one slope factor.
- the nonadrenergic 125 PIC binding sites on platelets were discovered by the present inventors to have the same rank order of affinities as IR 1 binding sites in brainstem [Piletz and Sletten, J. Pharm. & Exper. Therap., 267: 1493-1502 (1993)].
- the platelet ⁇ 33 kDa band may also be a product of a larger protein, since in human megakaryoblastoma cells, which are capable of forming platelets in tissue cultures, an ⁇ 85 kDa immunoreactive band was found to predominate.
- the bands of MW ⁇ 41 and 44 kDa detected by Dontenwill antiserum may be derived from an ⁇ 85 kDa precursor protein, similar to that occurring in platelet precursor cells.
- An 85 kDa immunoreactive protein is obtained in fresh rat brain membranes only when a cocktail of 11 protease inhibitors is used.
- Reis antiserum detects the ⁇ 41 and 44 kDa bands in human brain when fewer protease inhibitors are used.
- the Dontenwill antiserum weakly detects a platelet ⁇ 33 kDa band.
- the present inventors have hypothesized that the ⁇ 41 and 44 kDa immunoreactive proteins may be alternative breakdown products of an ⁇ 85 kDa protein, as opposed to the platelet ⁇ 33 kDa breakdown product.
- a photoaffinity imidazoline ligand 125 AZIPI
- the 125 AZIPI photoaffinity ligand was used to visualize ⁇ 55 kDa and ⁇ 61 kDa binding proteins from rat liver and brain. It is believed that the ⁇ 61 kDa protein is probably MAO, in agreement with other findings [Tesson et al., J. Biol. Chem., 270: 9856-9861 (1995)] showing that MAO proteins bind certain imidazoline compounds.
- the different molecular weights between these bands and those detected immunologically by the present inventors is one of many pieces of evidence that distinguishes IR 1 from I 2 sites.
- the present invention involves various cDNA clones (ie., 5A-1 and EST04033) and a genomic clone (JEP-1A) which are directed to an isolated polypeptide(s) that is receptive to (bind to) imidazoline compound(s), and can be used to identify other compounds of interest.
- imidazoline compounds in this context are p-iodoclonidine and moxonidine.
- the inventors detected a polypeptide expressed by their cDNA clone (5A-1 isolated from a human hippocampus cDNA library) that immunoreacted with Reis antiserum and/or Dontenwill antiserum.
- the DNA sequence of the 5A-1 clone is encapsulated within a portion of the other clones (EST04033 and JEP-1A genomic clone).
- a polypeptide includes a 651 amino acid sequence as shown in SEQ ID No. 5.
- This polypeptide is predicted from non-plasmid cDNA in EST04033; a clone which the inventors showed possesses sequences inclusive of 5A-1.
- transfection of EST04033 into COS cells yielded imidazoline receptivity by radioligand binding assays (described in detail later).
- Other imidazoline receptive proteins homologous to this polypeptide are also contemplated.
- Such polypeptide(s) generally have a molecular weight of about 50 to 80 kDa. More particularly, one can have a molecular weight of about 70 kDa.
- a polypeptide in another aspect of this invention, includes a 390 amino acid sequence as shown in SEQ ID No. 6. This represents the polypeptide predicted from the non-plasmid DNA of the original 5A-1 clone.
- Such a polypeptide generally has a molecular weight of about 35 to 50 kDa. More particularly, it can have a molecular weight of about 43 kDa.
- DNA molecules encoding aforementioned imidazoline-receptive polypeptide(s) are also contemplated.
- a DNA molecule e.g., a cDNA derived from mRNA
- a DNA molecule containing the 1954 base pairs (b.p.) (1954 b.p. encodes 651 amino acids) nucleotide sequence shown in SEQ ID No. 2 is contemplated. This represents the coding sequence for the polypeptide predicted by EST04033 transfections.
- a DNA molecule includes the longer nucleotide sequence shown in SEQ ID No. 3. This represents the cDNA predicted to have been translated+not predicted to have been translated in transfections experiments of EST04033.
- a DNA molecule contains a nucleic acid sequence encoding the amino acid sequence shown in SEQ ID No. 6. In another aspect, it can include the 1171 b.p. nucleic acid sequence shown in SEQ ID No. 4. The 1171 b.p. nucleic acid sequence shown in SEQ ID No. 4 is the 5A-1 non-plasmid DNA.
- the nucleic acid sequence of the genomic clone encoding the imidazoline receptor is further shown in SEQ ID No. 21.
- the nucleic acid and amino acid sequence of the predicted transcript ie., cDNA
- the polypeptide encoded by the genomic DNA is shown in SEQ ID No. 22.
- Sequence similarity with the sequences indicated in SEQ ID protocols of the attached Sequence Listing is defined in connection with the present invention as a very close structural relationship of the relevant sequences with the sequences indicated in the respective SEQ ID protocols.
- sequence similarity in each case the structurally mutually corresponding sections of the sequence of the SEQ ID protocol and of the sequence to be compared therewith are superimposed in such a way that the structural correspondence between the sequences is a maximum, account being taken of differences caused by deletion or insertion of individual sequence members (DNA-codon or amino acid respectively), and being compensated by appropriate shifts in sections of the sequences.
- sequence similarity in % results from the number of sequence members which now correspond to one another in the sequences (“homologous positions”) relative to the total number of members contained in the sequences of the SEQ ID protocols. Differences in the sequences may be caused by variation, insertion or deletion of sequence members.
- DNA-codons encoding for the same amino acid are considered identical in the context of the present invention.
- conservative amino acid substitutions encoded by their corresponding DNA-codons, as well as naturally occurring homologs of the sequences, are considered within the context of sequence similarity.
- DNA molecules of substantial homology are an implicit aspect of this sort of invention.
- the inventors have already identified two possible splice variants in the amino acid coding sequence.
- artificially mutated receptor cDNA molecules can be routinely constructed by methods such as site-directed polymerase chain reaction-mediated mutagenesis [Nelson and Long, Anal. Biochem. 180: 147-151 (1989)]. It is commonly appreciated that highly homologous mutants frequently mimic their natural receptor.
- Kjelsberg et al. J. Biol. Chem. 267: 1430-1433 (1992)] showed that all 20 amino acid substitutions produce an active receptor at a single site in the ⁇ 1b -adrenergic receptor.
- RNA molecules of ⁇ 75 % complementarity to an instant DNA molecule e.g., an mRNA molecule (sense) or a complementary cRNA molecule (antisense) are a further aspect of the invention.
- a further aspect of the invention is for a recombinant vector, as well as a host cell transfected with the recombinant vector, wherein the recombinant vector contains at least one of the nucleotide sequences shown in SEQ ID Nos. 1-4, or sequences predicted by the genomic clone, or nucleotide sequences ⁇ 75% homologous thereto.
- a method of producing an imidazoline receptor protein is another aspect of the invention. Such a method entails transfecting a host cell with an aforementioned vector, and culturing the transfected host cell in a culture medium to generate the imidazoline receptor.
- a method for producing homologous imidazoline receptor proteins, and the proteins produced thereby, are also considered an aspect of this invention.
- a significant further aspect of the invention is a method of screening for a ligand that binds to an imidazoline receptor.
- Such a method can comprise culturing an above-mentioned transfected cell in a culture medium to express imidazoline receptor proteins, followed by contacting the proteins with a labelled ligand for the imidazoline receptor under conditions effective to bind the labelled ligand thereto.
- the imidazoline receptor proteins can then be contacted with a candidate ligand, and any displacement of the labelled ligand from the proteins can be detected.
- Displacement of labelled ligand signifies that the candidate ligand is a ligand for the imidazoline receptor.
- FIG. 1 depicts a comparison of Reis antiserum (lane 1, 1:2000 dilution) and Dontenwill antiserum (lane 2, 1:5000 dilution) immunoreactivities for human NRL (same as RVLM) and hippocampus, as discussed in Example 1.
- FIG. 2 depicts a comparison of Reis antiserum (1:15,000 dilution) and Dontenwill antiserum (1:20,000 dilution) immunoreactivities for plaques isolated from the human hippocampal cDNA library used in cloning as discussed in Example 2.
- the plaques contain the initial clone, designated herein as 5A-1, in a third stage of purification.
- FIG. 3 depicts the restriction map of the EST04033 cDNA clone.
- FIG. 4 depicts a competitive binding assay between 125 I-labelled p-iodoclonidine (PIC) and various ligands for the imidazoline receptor on membranes expressed in COS cells transfected with the EST04033 cDNA clone, as discussed in Example 4.
- PIC p-iodoclonidine
- FIG. 5 depicts the prediction of introns and exons of the genomic clone (as analyzed by the GENESCAN program and verified by the available CDNAS).
- FIG. 6 depicts the distribution of MRNA homologous to our CDNA in human adult tissues (bar graph) and the two species of MRNA (6 and 9.5 kb).
- the present invention is concerned with multiple aspects of an imidazoline receptor protein, and DNA molecules encoding the same, and fragments thereof, which have now been discovered.
- polypeptide having imidazoline binding activity contains the putative active site for binding, as discussed hereinafter.
- polypeptide(s) described herein has a binding affinity for an imidazoline compound, it may also have an enzymatic activity, such as do catalytic antibodies and ribozymes. In fact, one such domain within our protein predicts a cytochrome p450 activity (described later).
- Exemplary “binding” polypeptides are those containing either of the amino acid sequences shown in SEQ ID Nos. 5 or 6 (with the amino acid sequence predicted by EST04033 given in SEQ ID No. 5). Functionally equivalent polypeptides are also contemplated, such as those having a high degree of homology with such aforementioned polypeptides, particularly when they contain the Glu-Asp-rich region described hereinafter which we believe may define an active imidazoline binding site.
- a polypeptide of the invention can be formed by direct chemical synthesis on a solid support using the carbodiimide method [R. Merrifield, JACS, 85: 2143 (1963)].
- an instant polypeptide can be produced by a recombinant DNA technique as described herein and elsewhere [e.g., U.S. Pat. No. 4,740,470 (issued to Cohen and Boyer), the disclosure of which is incorporated herein by reference], followed by culturing transformants in a nutrient broth.
- a DNA molecule of the present invention encodes aforementioned polypeptide.
- any of the degenerate set of codons encoding an instant polypeptide is contemplated.
- a particularly preferred coding sequence is the 1954 b.p. sequence set forth in SEQ ID No. 2, which has now been discovered to be a nucleotide sequence that encodes a polypeptide capable of binding imidazoline compound(s).
- a DNA molecule includes the 3318 b.p. nucleotide sequence shown in SEQ ID No. 3. This latter sequence is the entire EST04033 insert. It includes the nucleotide sequence of SEQ ID No. 2 which was predicted to have been translated into protein in the transfection experiments.
- a DNA molecule contains a nucleic acid sequence encoding the amino acid sequence (390 residues) shown in SEQ ID No. 6. This amino acid sequence corresponds to that derived from direct sequencing of the 5A-1 clone and represents a fragment of the native protein.
- the 5A-1 DNA molecule is defined by the 1171 b.p. nucleic acid sequence shown in SEQ ID No. 4.
- a DNA molecule of the present invention can be synthesized according to the phosphotriester method [Matteucci et al., JACS, 103: 3185 (1988)]. This method is particularly suitable when it is desired to effect site-directed mutagenesis of an instant DNA sequence, whereby a desired nucleotide substitution can be readily made.
- Another method for making an instant DNA molecule is by simply growing cells transformed with plasmids containing the DNA sequence, lysing the cells, and isolating the plasmid DNA molecules.
- an isolated DNA molecule of the invention is made by employing the polymerase chain reaction (PCR) [e.g., U.S. Pat. No.
- a further aspect of the invention is for a vector, e.g., a plasmid, that contains at least one of the nucleotide sequences shown in SEQ ID Nos. 1-4 or those predicted by the genomic clone in SEQ ID No. 21.
- the vector encodes an IR polypeptide of the invention.
- fragments of the native IR protein are contemplated; as well as fusion proteins that incorporate an amino acid sequence as described herein.
- a vector containing a nucleotide sequence having a high degree of homology with any of SEQ ID Nos. 1-4 or 21 is contemplated within the invention, particularly when it encodes a protein having imidazoline binding activity.
- a recombinant vector of the invention can be formed by ligating an afore-mentioned DNA molecule to a preselected expression plasmid, e.g., with T4 DNA ligase.
- a preselected expression plasmid e.g., with T4 DNA ligase.
- the plasmid and DNA molecule are provided with cohesive (overlapping) terminii, with the plasmid and DNA molecule operatively linked (i.e., in the correct reading frame).
- Another aspect of the invention is a host cell transfected with a vector of the invention.
- a protein expressed by a host cell transfected with such a vector is contemplated, which protein may be bound to the cell membrane.
- Such a protein can be identical with an aforementioned polypeptide, or it can be a fragment thereof, such as when the polypeptide has been partially digested by a protease in the cell.
- the expressed protein can differ from an aforementioned polypeptide, as whenever it has been subjected to one or more post-translational modifications.
- it should exhibit imidazoline binding capacity.
- a method of producing an imidazoline receptor protein is another aspect of the invention, which entails transfecting a host cell with an aforementioned vector, and culturing the transfected host cell in a culture medium to generate the imidazoline receptor.
- the receptor molecule can undergo any post-translational modification(s), including proteolytic decomposition, whereby its structure is altered from the basic amino acid residue sequence encoded by the vector.
- a suitable transfection method is electroporation, and the like.
- a vector encoding an instant polypeptide can be transfected directly in animals.
- embryonic stem cells can be transfected, and the cells can be manipulated in embryos to produce transgenic animals. Methods for performing such an operation have been previously described [Bond et al., Nature, 374:272-276 (1995)]. These methods for expressing an instant CDNA molecule in either tissue culture cells or in animals can be especially useful for drug discovery.
- the most significant aspect of the present invention is in its potential for affording a method of screening for a ligand (drug) that binds to an imidazoline receptor.
- a method of screening for a ligand (drug) that binds to an imidazoline receptor comprises culturing an above-mentioned host cell in a culture medium to express an instant imidazoline receptive polypeptide, then contacting the polypeptides with a labelled ligand, e.g., radiolabelled p-iodoclonidine, for the imidazoline receptor under conditions effective to bind the labelled ligand thereto.
- the polypeptides are further contacted with a candidate ligand, and any displacement of the labelled ligand from the polypeptides is detected. Displacement signifies that the candidate ligand actually binds to the imidazoline receptor.
- a suitable drug screening protocol involves preparing cells (or possibly tissues from transgenic animals) that express an instant imidazoline receptive polypeptide.
- categories of chemical structure are systematically screened for binding affinity or activation of the receptor molecule encoded by the transfected CDNA. This process is currently referred to as combinatorial chemistry.
- the imidazoline receptor a number of commercially available radioligands, e.g., 125 PIC, can be used for competitive drug binding affinity screening.
- An alternative approach is to screen for drugs that elicit or block a second messenger effect known to be coupled to activation of the imidazoline receptor, e.g., moxonidine-stimulated arachidonic acid release.
- a second messenger effect known to be coupled to activation of the imidazoline receptor, e.g., moxonidine-stimulated arachidonic acid release.
- a preferred compound drug
- Identification of this compound would lead to animal testing and upwards to human trials.
- the initial rationale for drug discovery becomes vastly improved with an instant cloned imidazoline receptor.
- a drug screening method is contemplated in which a host cell of the invention is cultured in a culture medium to express an instant imidazoline receptive polypeptide. Intact cells are then exposed to an identified agent (ie., agonist, inverse agonist, or antagonist) under conditions effective to elicit a second messenger or other detectable responses upon interacting with the receptor molecule.
- the imidazoline receptive polypeptides are then contacted with one or more candidate chemical compounds (drugs), and any modification in a second messenger response is detected.
- Compounds that mimic an identified agonist would be agonist candidates, and those producing the opposite response would be inverse agonist candidates.
- Those compounds that block the effects of a known agonist would be antagonist candidates for an in vivo imidazoline receptor.
- the contacting step with a candidate compound is preferably conducted at a plurality of candidate compound concentrations.
- a method of probing for another gene encoding an imidazoline receptor or homologous protein is further contemplated.
- Such a method comprises providing a radiolabelled DNA molecule identical or complementary to one of the above-described CDNA molecules (probe).
- the probe is then placed in contact with genetic material suspected of containing a gene encoding an imidazoline receptor or encoding a homologous protein, under stringent hybridization conditions (e.g., a high stringency wash condition is 0.1 ⁇ SSC, 0.5% SDS at 65° C.), and identifying any portion of the genetic material that hybridizes to the DNA molecule.
- stringent hybridization conditions e.g., a high stringency wash condition is 0.1 ⁇ SSC, 0.5% SDS at 65° C.
- a method of selectively producing antibodies comprises injecting a mammal with an aforementioned polypeptide, and isolating the antibodies produced by the mammal. This aspect is discussed in more detail in an example presented hereinafter.
- the present inventors began their search for a human imidazoline receptor CDNA by screening a ⁇ gt11 phage human hippocampus CDNA expression library. Their research had indicated that both of the known antisera (Reis and Dontenwill) that are directed against human imidazoline receptors were immunoreactive with identical bands on SDS gels of membranes prepared from the human hippocampus (an in other tissues). By contrast, other brain regions either were commercially unavailable as cDNA expression libraries or yielded inconsistencies between the two antisera. Therefore, it was felt that a human hippocampal cDNA library held the best opportunity for obtaining a CDNA for an imidazoline receptor. Immunoexpression screening was chosen over other cloning strategies because of its sensitivity when coupled with the ECL detection system used by the present inventors, as discussed hereinbelow.
- the obtained Reis antiserum had been prepared against a purified imidazoline binding protein isolated from BAC cells, which protein runs in denaturing-SDS gels at 70 Kda [Wang et al., 1992, 1993].
- the Dontenwill antiserum is anti-idiotypic, and thus is believed to detect the molecular configuration of an imidazoline binding site domain in any species. Prior to being used for screening plaques, both antisera were cleaned by stripping out possible antibacterial antibodies.
- both antisera have been tested to ensure that they are in fact selective for a human imidazoline receptor.
- both of these antisera detected identical bands in human platelets and hippocampus, and in brainstem RVLM (NRL) by Western blotting (see FIG. 1).
- ECL Enhanced Chemiluminescence
- the linearity of response of the ECL system was demonstrated with a standard curve.
- ECL detection was demonstrated to be very quantifiable and about ten times more sensitive than other screening methods previously used with these antisera.
- Western blots with antiserum dilutions of 1:3000 revealed immunoreactivity with as little as 1 ng of protein from a human hippocampal homogenate by dot blot analysis.
- lane 1 shows the immunoreactive bands observed with the Reis antibody and lane 2 shows the bands detected with the Dontenwill antibody. Protein molecular weight standards are indicated to the left of each panel (in Kda).
- both of these antisera detected a similar 85 Kda protein in human brain and other tissues.
- a 33 Kda band was found in human platelets.
- the 33 Kda band is of smaller size than that reported for other tissues [Wang et al., 1993; Escriba et al., 1994; Greney et al., 1994]
- the fact that both antisera detected it suggests that both the 85 Kda and 33 Kda bands may be imidazoline binding polypeptides.
- the 85 and 33 Kda bands were enriched in plasma membrane fractions, as is known to be the case for IR 1 binding, but not I 2 binding [Piletz and Sletten, 1993].
- a commercially available human hippocampal cDNA ⁇ gt11 expression library (Clontech Inc., Palo Alto, Calif.) was screened for immunoreactivity sequentially using both the anti-idiotypic Dontenwill antiserum and the Reis antiserum. Standard techniques to induce protein and transference to a nitrocellulose overlay were employed. [See, for instance, Sambrook et al., 1989, “Molecular Cloning: A Laboratory Manual,” Cold Spring Harbor Laboratory Press]. After washing and blocking with 5% milk, the Dontenwill antiserum was added to the overlay at 1:20,000 dilution in Tris-buffered saline, 0.05% Tween20, and 5% milk.
- the Reis antiserum was employed similarly, but at 1:15,000 dilution. These high dilutions.of primary antiserum were chosen to avoid false positives.
- the secondary antibody was added, and positive plaques were identified using ECL. Representative results are shown in FIG. 2.
- DNA sequencing was performed using T7 DNA polymerase and the dideoxy nucleotide termination reaction.
- DNA sequences were analyzed by MacVector Version 5.0. and by various Internet-available programs, i.e., the BLAST program.
- HSA09H122 contained 250 b.p. with 7 unknown/incorrect base pairs (97% homology) versus 5A-1 over the same region.
- HSA09H122 was generated in France (Genethon, B.P. 60, 91002 Evry Cedex France) from a human lymphoblast cDNA library.
- the other EST designated EST04033, contained 155 b.p. with 12 unknown/incorrect base pairs (92% homology) versus 5A-1 over the same region.
- EST04033 was generated at the Institute for Genomic Research (Gaithersburg, Md.) from a human fetal brain cDNA clone (HFBDP28). Thus, both of these ESTs are short DNA sequences and contain a number of errors (typical of single-stranded sequencing procedures as used when randomly screening ESTs).
- HSA09H122 Based on the BLASTN search, the owner of HSA09H122 was contacted in an effort to obtain that clone.
- the current owner of the clone appears to be Dr. Charles Auffret (Paul Brousse Hospital, Genetique, B. P. 8, 94801 Villejuif Cedex, France).
- Dr. Auffret indicated by telephone that his clone came from a lot of clones believed to be contaminated with yeast DNA, and he did not trust it for release. Contamination with yeast DNA of that clone was later confirmed to have been reported within an Internet database. Thus, HSA09H122 was not reliable.
- EST04033 The other partial clone (EST04033) was purchased from American Type Culture Collection in Rockville, Md. (ATCC Catalog no. 82815). A telephone call to the Institute for Genomic Research revealed that it had been deposited at ATCC under [insert terms]. As far as can be determined, the present inventors were the first to completely sequence EST04033. The complete size of EST04033 was 3389 b.p. (SEQ ID No. 1), with a 3,318 b.p. nonplasmid insert (see SEQ ID No. 3).
- the nucleotide sequence of the entire clone is shown in SEQ ID No. 1. In this sequence, an identical overlap was observed for the sequence obtained previously for the 5A-1 clone and the sequence obtained for EST04033. The 5A-1 overlap began at EST04033 b.p. 2,181 (SEQ. No.1) and continued to the end of the molecule (b.p. 3,351).
- cDNA of the present invention encode a protein that is immunoreactive with both of the known selective antisera for an imidazoline receptor, i.e.,. Reis antiserum and Dontenwill antiserum.
- an instant cDNA molecule produces a protein immunologically related to a purified imidazoline receptor and has the antigenic specificity expected for an imidazoline binding site.
- These antisera have been documented in the scientific literature as being selective for an “imidazoline receptor”, which provides strong evidence that such an imidazoline receptor has indeed been cloned.
- our instant cDNA sequence contains open reading frame distinct from any previously described proteins. Therefore, the encoded protein is novel, and it is unrelated to ⁇ 2 -adrenoceptors or monoamine oxidases. Small hydrophobic domains in the predicted amino acid sequence suggest that the protein is probably membrane bound, as expected for an imidazoline receptor.
- a pre-made genomic library of human placental DNA was purchased from Stratagene (La Jolla, Calif.) to screen for an IR gene by hybridization.
- the genomic library was constructed in Stratagene's vector ⁇ FIX® II (catalog # 946206), and it was grown in XL1-Blue MRA (P2) host bacteria. It was titered to yield approximately 50,000 plaques per 137 mm plate. Lifts from six such plates were screened in duplicate by hybridization.
- the DNA probe used for screening was a 1.85 kb EcoR1 fragment from EST 04033 cDNA (uniquely related to our sequences based on the BLASTN).
- the 1.85 kb fragment was extracted from an agarose electrophoresis gel, cleaned according to the GENECLEAN® III kit manual (BIO 101, Inc., P.O. Box 2284, La Jolla, Calif.), and radiolabeled with [ ⁇ - 32 P]d-CTP according to Stratagene's Prime-It® II Random Primer Labeling Kit manual. Plaques were lifted onto 137 mm Duralon-UVTM membranes (Stratagene's catalog #420102), denatured, and cross-linked with Stratgene's UV-StratalinkerTM 1800.
- This hybridization procedure is essentially described in Stratagene's vector ⁇ FIX® II instruction manual. Positive plaques were localized by developing Kodak BioMax films. Two positive genomic clones of identical size were retained through three rounds of screening.
- One of the positive genomic clones (designated JEP 1-A) was selected for complete characterization. It was found to contain an ⁇ 17 kb insert. Large-scale preparations of this genomic clone DNA were performed using the ⁇ QUICK! SPIN kit (BIO101, La Jolla, Calif.). To verify that we had cloned a gene corresponding to 5A-1 and EST04033 cDNA, some restriction site positions in the genomic clone were determined using the FLASH Nonradioactive Gene Mapping Kit (Stratagene) and compared to Southern blots of human DNA.
- genomic sequences highly related to (or identical to) those of our cDNA clones was determined by high stringency hybridization (as above) with the following 32 P-labeled probe: a 1110 bp ApaI-EcoRI fragment from the cDNA clone 5A-1. This fragment was chosen as the probe because it lacks the GAG repeat (encoding glutamic acids), which might have complicated matters if it were found to be repeated elsewhere in the genome.
- genomic clone JEP1-A we detected a 14.1 kb EcoRI fragment and a 7.7 kb SacI fragment that hybridized with this probe.
- Genomic DNA sequencing was done by contract with Cadus Pharmaceutical Corporation (Tarrytown, N.Y.). The original lambda JEP1-A clone was subcloned into pzero (Invitrogen) as a convenient vector. The initial fragments for sequencing were derived from Sac I and Xba I restriction enzymes. The short Sac I fragments of 1.5, 3.0 and 3.5 kb were further digested with Hind III, Pst I, and Kpn I yielding 15 subclones of varying length. The procedure consisted of sequencing all these subclones and parent clones with vector forward and reverse primers. Subsequently, this initial round of sequencing was supplemented with primer walking using custom oligonucleotides.
- the Sac I fragments were joined together by primer walking using the 2 Xba I fragments of 3 and 10 Kb. Then, the largest Sac I fragment (8 kb) and the 10 kb Xba I fragment were used as templates for a transposon sequencing method.
- the method used was the Primer Island Transposition Kit (Perkin-Elmer Corp., Norwalk, Conn.; Applied Biosystems) (ABI).
- the kit consists of a synthetic transposon (Ty1) containing forward and reverse primers and the integrase enzyme which inserts the transposon randomly into the target plasmid DNA. Transposon insertion is an alternative to subcloning or primer walking when sequencing a large region of DNA (Devine and Boeke, Nucleic Acids Res.
- clone 5A-1 might encode an imidazoline binding site.
- this glu/asp-rich sequence is located within the longest stretch of homology that the clone has with any known protein, i.e., the ryanodine receptor (as determined by on BLASTN). Specifically, we have discovered four regions of homology between the imidazoline receptor and the ryanodine receptor, which are all Glu/Asp-rich.
- the total nucleic acid homology is 67% with the ryanodine receptor DNA over the stretches encompassing this region. However, this is not sufficient to indicate that the imidazoline receptor is a subtype of the ryanodine receptor, because this homologous stretch is still a minor portion of the overall transcript(s) identified in the gene. Instead, this significant homology may reflect a commonality in function between this region of the IR and the ryanodine receptor.
- the Glu/Asp-rich region within the ryanodine receptor has also been reported to define a calcium and ruthenium,red dye binding domain that modulates the ryanodine receptor/Ca ++ release channel located within the sarcoplasmic reticulum.
- the only other charged amino acids within the Glu/Asp-rich region of our clones are two arginines (the ryanodine receptor has uncharged amino acids at the corresponding positions).
- IL-2R ⁇ interleukin-2R ⁇ receptor
- IL-2R ⁇ possesses the following regions over a span of 286 amino acids: ser-rich region, followed by glu/asp-rich region, followed by proline-rich region.
- our predicted protein has the same three regions, in the same order, over a span of about 625 amino acids. This suggests that our protein might function similarly as cytokine receptors.
- the 6 kb band is weakly detectable in some non-CNS tissues, it is enriched in brain. An enrichment of the 6 kb mRNA is observed in brainstem, although not exclusively. The regional distribution of the mRNA is somewhat in keeping with the reported distribution of IR binding sites, when extrapolated across species (FIG. 6). Thus, the rank order of Bmax values for IR in rat brain has been reported to be frontal cortex>hippocampus>medulla oblongata>cerebellum [Kamisaki et al., Brain Res., 514: 15-21 (1990)]. Therefore, with the exception of human cerebellum, which showed two mRNA bands, the distribution of the mRNA for our the present cloned cDNA is consistent with it belonging to IR.
- IR binding sites are commonly considered to be low in cerebral cortex compared to brainstem, this is in fact a misinterpretation of the literature based only on comparisons to the alpha-2 adrenoceptor's Bmax, rather than on absolute values.
- IR Bmax values have actually been reported to be slightly higher in the cortex than the brainstem, but they only “appear” to be low in the cortex in comparison to the abundance of alpha-2 binding sites in cortex. Therefore, the distribution of the IR mRNA is reasonably in keeping with the actual Bmax values for radioligand binding to the receptor [Kamisaki et al., (1990)].
- the JEP-1A clone clearly contains most of the gene. Within it we have identified at least 3,776 nucleotides for transcript(s) (encoding 1,065 amino acids plus 587 b.p. of untranslated region down to the polyT + tail). This has been lengthened by at least 66 coding nucleotides upstream (22 amino acids) in comparison to overlapping ESTs. In addition to this, we are quite confident of the splice site for the two observed mRNA sizes. Most of the functional sequences are predicted to be encoded within our genomic clone.
- Reis-Ab activity correlates w platelet IR 1 Bmax ( 125 PIC binding) Segment homologous Weak to moderate Not sensitive to GTP to a GTPase-acti- sensitivity to GTP vator prot'n Predicts ⁇ 120,000 85,000 MW 59-61,000 MW MW protein immunoreactivity photoaffinity Predicts 1-4 Enriched in plasma Enriched in hydrophobic domains membranes mitochondria Encodes Glu/Asp-rich Binds (+)-charged Binds (+)-charged (negatively charged) imidazolines imidazolines domain consistent with Sensitive to Not sensitive to Ca ++ and ruthenium divalent cations divalent cations red binding Sensitive to Unknown ruthenium red sensitivity for Ruthen.
- COS-7 cells were transfected with a vector containing EST04033 cDNA, which was predicted based on sequence analysis to contain the glu/asp rich region thought to be important for ligand binding to the imidazoline receptor protein.
- the EST04033 cDNA was subcloned into pSVK3 (Pharmacia LKB Biotechnology, Piscataway, N.J.) using standard techniques [Sambrook, supra], and transfected via the DEAE-dextran technique as previously described [Choudhary et al., Mol. Pharmacol., 42: 627-633 (1992); Choudhary et al., Mol.
- a restriction map of the EST04033 cDNA is shown in FIG. 3.
- the restriction enzymes Sal I and Xba I were used for subcloning into pSVK3.
- COS-7 cells were seeded at 3 ⁇ 10 6 cells/100 mm plate, grown overnight and exposed to 2 ml of DEAE-dextran/plasmid mixture. After a 10-15 min. exposure, 20 ml of complete medium (10% fetal calf serum; 5 ⁇ g/ml streptomycin; 100 units/ml penicillin, high glucose, Dulbeccos' modified Eagle's medium) containing 80 ⁇ M chloroquine was added and the incubation continued for 2.5 hr. at 37° C. in a 5% CO 2 incubator. The mixture was then aspirated and 10 ml of complete medium containing 10% dimethyl sulfoxide was added with shaking for 150 seconds.
- complete medium 10% fetal calf serum; 5 ⁇ g/ml streptomycin; 100 units/ml penicillin, high glucose, Dulbeccos' modified Eagle's medium
- Transfected samples were also analyzed by Western blots.
- the protocol used for Western blot assay of transfected cells is as follows.
- Cell membranes were prepared in a special cocktail of protease inhibitors (1 mM EDTA, 0.1 mM EGTA, 1 mM phenylmethyl-sufonylfluoride, 10 mM ⁇ -aminocaproic acid, 0.1 mM benzamide, 0.1 mM benzamide-HCl, 0.1 mM phenanthroline, 10 ⁇ g/ml pepstatin A, 5 mM iodoacetamide, 10 ⁇ g/ml antipain, 10 ⁇ g/ml trypsin-chymotrypsin inhibitor, 10 ⁇ g/ml leupeptin, and 1.67 ⁇ g/ml calpain inhibitor) in 0.25 M sucrose, 1 mM MgCl 2 , 5 mM Tris, pH 7.4.
- the protocol to fully characterize radioligand binding in the transfected cells entails the following. First, the presence of IR and/or I 2 binding sites are scanned over a range of protein concentrations using a single concentration of [ 125 I]-p-iodoclonidine (1.0 nM) and 3 H-idazoxan (8 nM), respectively. Then, rate of association binding experiments (under a 10 ⁇ M mask of NE to remove ⁇ 2 AR interference) are performed to determine if the kinetic parameters are similar to those reported for native imidazoline receptors [Ernsberger et al. Annals NY Acad. Sci., 763: 163-168 (1995)].
- Stable transfections can be obtained by subcloning the imidazoline receptor cDNA into a suitable expression vector, e.g., pRc/CMV (Invitrogen, San Diego, Calif.), which can then be used to transform host cells, e.g. CHO and HEK-293 cells, using the Lipofectin reagent (Gibco/BRL, Gaithersburg, Md.) according to the manufacturer's instructions. These two host cell lines can be used to increase the permanence of expression of an instant clone. The inventors have previously ascertained that parent CHO cells lack both alpha 2 -adrenoceptor and IR binding sites [Piletz et al., J. Pharm . & Exper.
- Direct probing of other human genomic and cDNA libraries can be performed by preparing labelled cDNA probes from different subcloned regions of our clone.
- Commercially available human DNA libraries can be used.
- another genomic library is EMBL (Clontech), which integrates genomic fragments up to 22 kbp long. It is reasonable to expect that introns may exist within other human IR genes so that only by obtaining overlapping clones can the full-length genes be sequenced.
- a probe encompassing the 5′ end of an instant cDNA is generally useful to obtain the gene promoter region.
- Clontech's Human PromoterFinder DNA Walking procedure provides a method for “walking” upstream or downstream from cloned sequences such as cDNAs into adjacent genomic DNA.
- An instant imidazoline receptive polypeptide can also be used to prepare antibodies immunoreactive therewith.
- synthetic peptides (based on deduced amino acid sequences from the DNA) can be generated and used as immunogens.
- transfected cell lines or other manipulations of the DNA sequence of an instant imidazoline receptor can provide a source of purified imidazoline receptor peptides in sufficient quantities for immunization, which can lead to a source of selective antibodies having potential commercial value.
- kits for assaying imidazoline receptors can be developed that include either such antibodies or the purified imidazoline receptor protein.
- a purification protocol has already been published for the bovine imidazoline receptor in BAC cells [Wang et al, 1992] and an immunization protocol has also been published [Wang et al., 1993]. These same protocols can be utilized with little if any modification to afford purified human IR protein from transfected cells and to yield selective antibodies thereto.
- the peptide may be linked to a suitable soluble carrier to which antibodies are unlikely to be encountered in human serum.
- suitable soluble carriers include bovine serum albumin, keyhole limpet hemocyanin, and the like.
- the conjugated peptide is injected into a mouse, or other suitable animal, where an immune response is elicited.
- Monoclonal antibodies can be obtained from hybridomas formed by fusing spleen cells harvested from the animal and myeloma cells [see, e.g., Kohler and Milstein, Nature, 256: 495-497 (1975)].
- the present inventors also demonstrated that an imidazoline receptive site can be expressed in cells transfected with the EST 04033 cDNA clone, and this site has the proper potencies of an IR. We have deduced most of the complete cDNA encoding this protein.
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Biochemistry (AREA)
- Zoology (AREA)
- Gastroenterology & Hepatology (AREA)
- Toxicology (AREA)
- Immunology (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Cell Biology (AREA)
- Peptides Or Proteins (AREA)
Abstract
A genomic DNA encoding a human imidazoline receptor is described. cDNAs encoding the receptor and fragments thereof are also provided. An amino acid sequence predicted to be 120,000 MW for nearly the entire protein is identified, as well as a middle fragment believed to contain the imidazoline binding site of the receptor. The protein is highly unique in its sequence and may represent the first in a novel family of receptor proteins. Methods of cloning the cDNA and expressing the imidazoline receptor in a host cell are described. Methods of preparing antibodies against the transfected protein are also described. Also, a screening method for identifying additional subtypes of this receptor are identified. Also, screening methods for identifying drugs that interact with the imidazoline receptor are described.
Description
- The present application is a continuation-in-part of application Ser. No. 08/650,766 filed May 20, 1996, which is related to provisional application Serial No. 60/12,600, filed Mar. 1, 1996.
- 1. Field of the Invention
- The present invention is directed to DNA molecules encoding imidazoline receptive polypeptides, preferably encoding human imidazoline receptive polypeptides, that can be used as an imidazoline receptor (abbreviated IR). In addition, transcript(s) and protein sequences are predicted from the DNA clones. The invention is also directed to a genomic DNA clone designated as JEP-1A. The CDNA clones according to the invention comprise cDNA homologous to portion(s) of this genomic clone; including 5A-1 cDNA, cloned by the inventors that established the open-reading frame for translation of mRNA from the gene, and established the immunoreactive properties of its polypeptide sequence in an expression systems. Also, the invention relates to cDNA clone EST04033, which is another clone identified to contain cDNA sequences from the JEP-1A gene, and of which the 5A-1 is a part, that encodes an active fragment of the IR polypeptide in transfection assays, and the protein sequences thereof. The invention also relates to methods for producing such genomic and cDNA clones, methods for expressing the IR protein and fragments, and uses thereof.
- 2. Description of Related Art
- It is believed that brainstem imidazoline receptors possess binding site(s) for therapeutically relevant imidazoline compounds, such as clonidine and idazoxan. These drugs represent the first generation of ligands discovered for the binding site(s) of imidazoline receptors. However, clonidine and idazoxan were developed based on their high affinity for α2-adrenergic receptors. Second generation ligands, such as moxonidine, possess somewhat improved selectivity for IR over α2-adrenergic receptors, but more selective compounds for IR are needed.
- An imidazoline receptor clone is of particular interest because of its potential utility in identifying novel pharmaceutical agents having greater potency and/or more selectivity than currently available ligands have for imidazoline receptors. Recent technological advances permit pharmaceutical companies to use combinatorial chemistry techniques to rapidly screen a cloned receptor for ligands (drugs) binding thereto. Thus, a cloned imidazoline receptor would be of significant value to a drug discovery program.
- Until now, the molecular nature of imidazoline receptors remains unknown. For instance, no amino acid sequence data for a novel IR, e.g., by N-terminal sequencing, has been reported. Three different techniques have been described in the literature by three different laboratories to visualize imidazoline-selective binding proteins (imidazoline receptor candidates) using gel electrophoresis. Some important consistencies have emerged from these results despite the diversity of the techniques employed. On the other hand, multiple protein bands have been identified, which suggests heterogeneity amongst imidazoline receptors. These reports are discussed below.
- Some of the abbreviations used hereinbelow, have the following meanings:
α2AR Alpha-2 adrenoceptor BAC Bovine adrenal chromaffin ECL Enhanced chemiluminescence (protein detection procedure) EST Expressed Sequence Tag (a one-pass cDNA documentation without identification) I-site Any imidazoline-receptive binding site (e.g., encoded on IR) IR1 Imidazoline receptor subtype1 IR-Ab Imidazoline receptor antibody I2Site Imidazoline binding subtype2 kDa Kilodaltons (molecular size) MAO monoamine oxidase MW molecular weight NRL European abbreviation for RVLM (see below) PC-12 Phaeochromocytoma-12 cells 125PIC [125I]p-iodoclonidine PKC Protein Kinase C RVLM Rostral Ventrolateral Medulla in brainstem SDS sodium dodecyl sulfate gel electrophoresis - Reis et al. [Wang et al.,Mol. Pharm., 42: 792-801 (1992); Wang et al., Mol. Pharm., 43: 509-515 (1993)] were the first to characterize an imidazoline-selective binding protein and to demonstrate it as having MW=70 kDa. This was accomplished using bovine cells (BAC), which lack an α2AR [Powis & Baker, Mol. Pharm., 29:134-141 (1986)]. The 70 kDa imidazoline-selective protein in those studies had high affinities for both idazoxan and p-aminoclonidine affinity chromatography columns and was eluted by another imidazoline compound (phentolamine). Unfortunately, those investigators failed to isolate sufficient 70 kDa protein to determine its other biochemical properties. To date, no one has reported the complete purification of an imidazoline receptor protein. Likewise, no amino acid sequences have been reported for IR.
- Their 70 kDa protein was used by Reis and co-workers to raise “I-site binding antiserum”, designated herein as Reis antiserum. The term “I-site” refers to the imidazoline binding site, presumably defined within the imidazoline receptor protein. Reis antiserum was prepared by injecting the purified protein into rabbits [Wang et al, 1992]. The first immunization was done subcutaneously with the protein antigen (10 μg) emulsified in an equal volume of complete Freund's adjuvant, and the next three booster shots were given at 15-day intervals with incomplete Freund's adjuvant. The polyclonal antiserum has been mostly characterized by immunoblotting, but radioimmunoassays (RIA) and/or conjugated assay procedures, i.e., ELISA assays, are also conceivable [see “Radioimmunoassay of Gut Regulatory Peptides: Methods in Laboratory Medicine,” Vol. 2,
chapters - The present inventors and others [Escriba et al.,Neurosci. Lett. 178: 81-84 (1994)] have characterized the Reis antiserum in several respects. For instance, the present inventors have discovered that human platelet immunoreactivity with Reis antiserum is mainly confined to a single protein band of MW≈33 kDa, although a trace band at ≈85 kDa was also observed. The ≈33 and ≈85 kDa bands were enriched in plasma membrane fractions as expected for an imidazoline receptor. Furthermore, the intensity of the ≈33 kDa band was found to be positively correlated with non-adrenergic 125PIC Bmax values at platelet IR1 sites in samples from the same subjects, with an almost one-to-one slope factor. In addition, the nonadrenergic 125PIC binding sites on platelets were discovered by the present inventors to have the same rank order of affinities as IR1 binding sites in brainstem [Piletz and Sletten, J. Pharm. & Exper. Therap., 267: 1493-1502 (1993)]. The platelet ≈33 kDa band may also be a product of a larger protein, since in human megakaryoblastoma cells, which are capable of forming platelets in tissue cultures, an ≈85 kDa immunoreactive band was found to predominate.
- Immunoreactivity with Reis antiserum does not appear to be directed against human α2AR and/or MAO A/B. This is a significant point because α2AR and MAO A/B have previously been cloned and also bind to imidazolines. The present inventors have obtained selective antibodies and recombinant preparations for α2AR and MAO A/B, and these proteins do not correspond to the ≈33, 70, or 85 kDa putative IR1 bands. Thus, there is substantial evidence that, at least in human platelets, the Reis antiserum is IR1 selective.
- Another antiserum was raised by Drs. Dontenwill and Bousquet in France [Greney et al.,Europ. J. Pharmacol., 265: R1-R2 (1994); Greney et al., Neurochem. Int., 25: 183-191 (199.4); Bennai et al., Annals NY Acad. Sci., 763:140-148 (1995)] against polyclonal antibodies for idazoxan (designated Dontenwill antiserum). This anti-idiotypic antiserum inhibits 3H-clonidine but not 3H-rauwolscine (α2-selective) binding sites in the brainstem, suggesting it also interacts with IR1 [Bennai et al., 1995]. As shown in FIG. 1, human RVLM (same as NRL) membrane fractions displayed bands of ≈41 and 44 kDa, as detected by the present inventors using this anti-idiotypic antiserum.
- The present inventors have found that the bands of MW≈41 and 44 kDa detected by Dontenwill antiserum may be derived from an ≈85 kDa precursor protein, similar to that occurring in platelet precursor cells. An 85 kDa immunoreactive protein is obtained in fresh rat brain membranes only when a cocktail of 11 protease inhibitors is used. Also, as shown in FIG. 1, it is found that Reis antiserum detects the ≈41 and 44 kDa bands in human brain when fewer protease inhibitors are used. Additionally, the Dontenwill antiserum weakly detects a platelet ≈33 kDa band. Thus, the present inventors have hypothesized that the ≈41 and 44 kDa immunoreactive proteins may be alternative breakdown products of an ≈85 kDa protein, as opposed to the platelet ≈33 kDa breakdown product.
- In summary, the main conclusion from the above results is that, despite vastly different origins, the Reis and Dontenwill antisera both detect identical bands in human platelets, RVLM, and hippocampus.
- Using yet another technique, a photoaffinity imidazoline ligand,125AZIPI, has also been developed to preferentially label I2-imidazoline binding sites [Lanier et al., J. Biol. Chem., 268: 16047-16051 (1993)]. The 125AZIPI photoaffinity ligand was used to visualize ≈55 kDa and ≈61 kDa binding proteins from rat liver and brain. It is believed that the ≈61 kDa protein is probably MAO, in agreement with other findings [Tesson et al., J. Biol. Chem., 270: 9856-9861 (1995)] showing that MAO proteins bind certain imidazoline compounds. The different molecular weights between these bands and those detected immunologically by the present inventors is one of many pieces of evidence that distinguishes IR1 from I2 sites.
- To the inventors' knowledge and as described herein, we are first to clone the gene, cDNAs and fragments thereof encoding a protein with the immunological and ligand binding properties expected of an IR. On this basis, we are first to identify the nucleotide sequences of DNA molecules encoding an imidazoline receptor and active fragments thereof, and the first to determine the amino acid sequence of an imidazoline receptor and active fragments thereof. The polypeptides described herein are clearly distinct from α2AR or MAO A/B proteins.
- The present invention involves various cDNA clones (ie., 5A-1 and EST04033) and a genomic clone (JEP-1A) which are directed to an isolated polypeptide(s) that is receptive to (bind to) imidazoline compound(s), and can be used to identify other compounds of interest. Currently available imidazoline compounds in this context are p-iodoclonidine and moxonidine. Initially, the inventors detected a polypeptide expressed by their cDNA clone (5A-1 isolated from a human hippocampus cDNA library) that immunoreacted with Reis antiserum and/or Dontenwill antiserum. The DNA sequence of the 5A-1 clone is encapsulated within a portion of the other clones (EST04033 and JEP-1A genomic clone).
- In one aspect of the invention, a polypeptide includes a 651 amino acid sequence as shown in SEQ ID No. 5. This polypeptide is predicted from non-plasmid cDNA in EST04033; a clone which the inventors showed possesses sequences inclusive of 5A-1. Furthermore, transfection of EST04033 into COS cells yielded imidazoline receptivity by radioligand binding assays (described in detail later). Other imidazoline receptive proteins homologous to this polypeptide are also contemplated. Such polypeptide(s) generally have a molecular weight of about 50 to 80 kDa. More particularly, one can have a molecular weight of about 70 kDa.
- In another aspect of this invention, a polypeptide includes a 390 amino acid sequence as shown in SEQ ID No. 6. This represents the polypeptide predicted from the non-plasmid DNA of the original 5A-1 clone. Such a polypeptide generally has a molecular weight of about 35 to 50 kDa. More particularly, it can have a molecular weight of about 43 kDa.
- DNA molecules encoding aforementioned imidazoline-receptive polypeptide(s) are also contemplated. Such a DNA molecule, e.g., a cDNA derived from mRNA, can contain a nucleotide sequence encoding the 651 amino acid sequence shown in SEQ ID No. 5. Thus, a DNA molecule containing the 1954 base pairs (b.p.) (1954 b.p. encodes 651 amino acids) nucleotide sequence shown in SEQ ID No. 2 is contemplated. This represents the coding sequence for the polypeptide predicted by EST04033 transfections. In another embodiment, a DNA molecule includes the longer nucleotide sequence shown in SEQ ID No. 3. This represents the cDNA predicted to have been translated+not predicted to have been translated in transfections experiments of EST04033.
- In another embodiment of the invention, a DNA molecule contains a nucleic acid sequence encoding the amino acid sequence shown in SEQ ID No. 6. In another aspect, it can include the 1171 b.p. nucleic acid sequence shown in SEQ ID No. 4. The 1171 b.p. nucleic acid sequence shown in SEQ ID No. 4 is the 5A-1 non-plasmid DNA.
- The nucleic acid sequence of the genomic clone encoding the imidazoline receptor is further shown in SEQ ID No. 21. The nucleic acid and amino acid sequence of the predicted transcript (ie., cDNA) can be predicted from the description hereinbelow. The polypeptide encoded by the genomic DNA is shown in SEQ ID No. 22.
- Sequence similarity with the sequences indicated in SEQ ID protocols of the attached Sequence Listing is defined in connection with the present invention as a very close structural relationship of the relevant sequences with the sequences indicated in the respective SEQ ID protocols. To determine the sequence similarity, in each case the structurally mutually corresponding sections of the sequence of the SEQ ID protocol and of the sequence to be compared therewith are superimposed in such a way that the structural correspondence between the sequences is a maximum, account being taken of differences caused by deletion or insertion of individual sequence members (DNA-codon or amino acid respectively), and being compensated by appropriate shifts in sections of the sequences. The sequence similarity in % results from the number of sequence members which now correspond to one another in the sequences (“homologous positions”) relative to the total number of members contained in the sequences of the SEQ ID protocols. Differences in the sequences may be caused by variation, insertion or deletion of sequence members.
- Additionally in DNA sequences, different DNA-codons encoding for the same amino acid are considered identical in the context of the present invention. For amino acid sequences, conservative amino acid substitutions encoded by their corresponding DNA-codons, as well as naturally occurring homologs of the sequences, are considered within the context of sequence similarity.
- DNA molecules of substantial homology (≧75%) are an implicit aspect of this sort of invention. As will be discussed later, the inventors have already identified two possible splice variants in the amino acid coding sequence. In addition, artificially mutated receptor cDNA molecules can be routinely constructed by methods such as site-directed polymerase chain reaction-mediated mutagenesis [Nelson and Long,Anal. Biochem. 180: 147-151 (1989)]. It is commonly appreciated that highly homologous mutants frequently mimic their natural receptor. A study by Kjelsberg et al. [J. Biol. Chem. 267: 1430-1433 (1992)] showed that all 20 amino acid substitutions produce an active receptor at a single site in the α1b-adrenergic receptor. RNA molecules of ≧75 % complementarity to an instant DNA molecule, e.g., an mRNA molecule (sense) or a complementary cRNA molecule (antisense), are a further aspect of the invention.
- A further aspect of the invention is for a recombinant vector, as well as a host cell transfected with the recombinant vector, wherein the recombinant vector contains at least one of the nucleotide sequences shown in SEQ ID Nos. 1-4, or sequences predicted by the genomic clone, or nucleotide sequences ≧75% homologous thereto.
- A method of producing an imidazoline receptor protein is another aspect of the invention. Such a method entails transfecting a host cell with an aforementioned vector, and culturing the transfected host cell in a culture medium to generate the imidazoline receptor.
- A method for producing homologous imidazoline receptor proteins, and the proteins produced thereby, are also considered an aspect of this invention.
- A significant further aspect of the invention is a method of screening for a ligand that binds to an imidazoline receptor. Such a method can comprise culturing an above-mentioned transfected cell in a culture medium to express imidazoline receptor proteins, followed by contacting the proteins with a labelled ligand for the imidazoline receptor under conditions effective to bind the labelled ligand thereto. The imidazoline receptor proteins can then be contacted with a candidate ligand, and any displacement of the labelled ligand from the proteins can be detected. Displacement of labelled ligand signifies that the candidate ligand is a ligand for the imidazoline receptor. These steps could be performed on intact host cells, or on proteins isolated from the cell membranes of the host cells.
- The invention will now be described in more detail with reference to specific examples.
- FIG. 1 depicts a comparison of Reis antiserum (
lane 1, 1:2000 dilution) and Dontenwill antiserum (lane 2, 1:5000 dilution) immunoreactivities for human NRL (same as RVLM) and hippocampus, as discussed in Example 1. - FIG. 2 depicts a comparison of Reis antiserum (1:15,000 dilution) and Dontenwill antiserum (1:20,000 dilution) immunoreactivities for plaques isolated from the human hippocampal cDNA library used in cloning as discussed in Example 2. The plaques contain the initial clone, designated herein as 5A-1, in a third stage of purification.
- FIG. 3 depicts the restriction map of the EST04033 cDNA clone.
- FIG. 4 depicts a competitive binding assay between125I-labelled p-iodoclonidine (PIC) and various ligands for the imidazoline receptor on membranes expressed in COS cells transfected with the EST04033 cDNA clone, as discussed in Example 4.
- FIG. 5 depicts the prediction of introns and exons of the genomic clone (as analyzed by the GENESCAN program and verified by the available CDNAS).
- FIG. 6 depicts the distribution of MRNA homologous to our CDNA in human adult tissues (bar graph) and the two species of MRNA (6 and 9.5 kb).
- The present invention is concerned with multiple aspects of an imidazoline receptor protein, and DNA molecules encoding the same, and fragments thereof, which have now been discovered.
- First, a polypeptide having imidazoline binding activity has been identified, which contains the putative active site for binding, as discussed hereinafter. Although polypeptide(s) described herein has a binding affinity for an imidazoline compound, it may also have an enzymatic activity, such as do catalytic antibodies and ribozymes. In fact, one such domain within our protein predicts a cytochrome p450 activity (described later).
- Exemplary “binding” polypeptides are those containing either of the amino acid sequences shown in SEQ ID Nos. 5 or 6 (with the amino acid sequence predicted by EST04033 given in SEQ ID No. 5). Functionally equivalent polypeptides are also contemplated, such as those having a high degree of homology with such aforementioned polypeptides, particularly when they contain the Glu-Asp-rich region described hereinafter which we believe may define an active imidazoline binding site.
- A polypeptide of the invention can be formed by direct chemical synthesis on a solid support using the carbodiimide method [R. Merrifield,JACS, 85: 2143 (1963)]. Alternatively, and preferably, an instant polypeptide can be produced by a recombinant DNA technique as described herein and elsewhere [e.g., U.S. Pat. No. 4,740,470 (issued to Cohen and Boyer), the disclosure of which is incorporated herein by reference], followed by culturing transformants in a nutrient broth.
- Second, a DNA molecule of the present invention encodes aforementioned polypeptide. Thus, any of the degenerate set of codons encoding an instant polypeptide is contemplated. A particularly preferred coding sequence is the 1954 b.p. sequence set forth in SEQ ID No. 2, which has now been discovered to be a nucleotide sequence that encodes a polypeptide capable of binding imidazoline compound(s). In another embodiment, a DNA molecule includes the 3318 b.p. nucleotide sequence shown in SEQ ID No. 3. This latter sequence is the entire EST04033 insert. It includes the nucleotide sequence of SEQ ID No. 2 which was predicted to have been translated into protein in the transfection experiments.
- In another embodiment of the invention, a DNA molecule contains a nucleic acid sequence encoding the amino acid sequence (390 residues) shown in SEQ ID No. 6. This amino acid sequence corresponds to that derived from direct sequencing of the 5A-1 clone and represents a fragment of the native protein. The 5A-1 DNA molecule is defined by the 1171 b.p. nucleic acid sequence shown in SEQ ID No. 4.
- A DNA molecule of the present invention can be synthesized according to the phosphotriester method [Matteucci et al.,JACS, 103: 3185 (1988)]. This method is particularly suitable when it is desired to effect site-directed mutagenesis of an instant DNA sequence, whereby a desired nucleotide substitution can be readily made. Another method for making an instant DNA molecule is by simply growing cells transformed with plasmids containing the DNA sequence, lysing the cells, and isolating the plasmid DNA molecules. Preferably, an isolated DNA molecule of the invention is made by employing the polymerase chain reaction (PCR) [e.g., U.S. Pat. No. 4,683,202 (issued to Mullis)] using synthetic primers that anneal to the desired DNA sequence, whereby DNA molecules containing the desired nucleotide sequence are amplified. Also, a combination of the above methods can be employed, such as one in which synthetic DNA is ligated to CDNA to produce a quasi-synthetic gene [e.g., U.S. Pat. No. 4,601,980 (issued to Goeddel et al.)].
- A further aspect of the invention is for a vector, e.g., a plasmid, that contains at least one of the nucleotide sequences shown in SEQ ID Nos. 1-4 or those predicted by the genomic clone in SEQ ID No. 21. Whenever the reading frame of the vector is appropriately selected, the vector encodes an IR polypeptide of the invention. Hence, as well as full-length protein, fragments of the native IR protein are contemplated; as well as fusion proteins that incorporate an amino acid sequence as described herein. Also, a vector containing a nucleotide sequence having a high degree of homology with any of SEQ ID Nos. 1-4 or 21 is contemplated within the invention, particularly when it encodes a protein having imidazoline binding activity.
- A recombinant vector of the invention can be formed by ligating an afore-mentioned DNA molecule to a preselected expression plasmid, e.g., with T4 DNA ligase. Preferably, the plasmid and DNA molecule are provided with cohesive (overlapping) terminii, with the plasmid and DNA molecule operatively linked (i.e., in the correct reading frame).
- Another aspect of the invention is a host cell transfected with a vector of the invention. Relatedly, a protein expressed by a host cell transfected with such a vector is contemplated, which protein may be bound to the cell membrane. Such a protein can be identical with an aforementioned polypeptide, or it can be a fragment thereof, such as when the polypeptide has been partially digested by a protease in the cell. Also, the expressed protein can differ from an aforementioned polypeptide, as whenever it has been subjected to one or more post-translational modifications. For the protein to be useful within the context of the present invention, it should exhibit imidazoline binding capacity.
- A method of producing an imidazoline receptor protein is another aspect of the invention, which entails transfecting a host cell with an aforementioned vector, and culturing the transfected host cell in a culture medium to generate the imidazoline receptor. The receptor molecule can undergo any post-translational modification(s), including proteolytic decomposition, whereby its structure is altered from the basic amino acid residue sequence encoded by the vector. A suitable transfection method is electroporation, and the like.
- With respect to transfecting a host cell with a vector of the invention, it is contemplated that a vector encoding an instant polypeptide can be transfected directly in animals. For instance, embryonic stem cells can be transfected, and the cells can be manipulated in embryos to produce transgenic animals. Methods for performing such an operation have been previously described [Bond et al.,Nature, 374:272-276 (1995)]. These methods for expressing an instant CDNA molecule in either tissue culture cells or in animals can be especially useful for drug discovery.
- Possibly the most significant aspect of the present invention is in its potential for affording a method of screening for a ligand (drug) that binds to an imidazoline receptor. Such a method comprises culturing an above-mentioned host cell in a culture medium to express an instant imidazoline receptive polypeptide, then contacting the polypeptides with a labelled ligand, e.g., radiolabelled p-iodoclonidine, for the imidazoline receptor under conditions effective to bind the labelled ligand thereto. The polypeptides are further contacted with a candidate ligand, and any displacement of the labelled ligand from the polypeptides is detected. Displacement signifies that the candidate ligand actually binds to the imidazoline receptor. These steps could be performed on intact host cells, or on proteins isolated from the cell membranes of the host cells.
- Typically, a suitable drug screening protocol involves preparing cells (or possibly tissues from transgenic animals) that express an instant imidazoline receptive polypeptide. In this process, categories of chemical structure are systematically screened for binding affinity or activation of the receptor molecule encoded by the transfected CDNA. This process is currently referred to as combinatorial chemistry. With respect to the imidazoline receptor, a number of commercially available radioligands, e.g.,125PIC, can be used for competitive drug binding affinity screening.
- An alternative approach is to screen for drugs that elicit or block a second messenger effect known to be coupled to activation of the imidazoline receptor, e.g., moxonidine-stimulated arachidonic acid release. Even with a weak binding affinity or activation by one category of chemicals, systematic variations of that chemical structure can be studied and a preferred compound (drug) can be deduced as being a good pharmaceutical candidate. Identification of this compound would lead to animal testing and upwards to human trials. However, the initial rationale for drug discovery becomes vastly improved with an instant cloned imidazoline receptor.
- Along these lines, a drug screening method is contemplated in which a host cell of the invention is cultured in a culture medium to express an instant imidazoline receptive polypeptide. Intact cells are then exposed to an identified agent (ie., agonist, inverse agonist, or antagonist) under conditions effective to elicit a second messenger or other detectable responses upon interacting with the receptor molecule. The imidazoline receptive polypeptides are then contacted with one or more candidate chemical compounds (drugs), and any modification in a second messenger response is detected. Compounds that mimic an identified agonist would be agonist candidates, and those producing the opposite response would be inverse agonist candidates. Those compounds that block the effects of a known agonist would be antagonist candidates for an in vivo imidazoline receptor. For meaningful results, the contacting step with a candidate compound is preferably conducted at a plurality of candidate compound concentrations.
- A method of probing for another gene encoding an imidazoline receptor or homologous protein is further contemplated. Such a method comprises providing a radiolabelled DNA molecule identical or complementary to one of the above-described CDNA molecules (probe). The probe is then placed in contact with genetic material suspected of containing a gene encoding an imidazoline receptor or encoding a homologous protein, under stringent hybridization conditions (e.g., a high stringency wash condition is 0.1×SSC, 0.5% SDS at 65° C.), and identifying any portion of the genetic material that hybridizes to the DNA molecule.
- Still further, a method of selectively producing antibodies, (e.g., monoclonal antibodies, immunoreactive with an instant imidazoline-receptive protein) comprises injecting a mammal with an aforementioned polypeptide, and isolating the antibodies produced by the mammal. This aspect is discussed in more detail in an example presented hereinafter.
- The present inventors began their search for a human imidazoline receptor CDNA by screening a λgt11 phage human hippocampus CDNA expression library. Their research had indicated that both of the known antisera (Reis and Dontenwill) that are directed against human imidazoline receptors were immunoreactive with identical bands on SDS gels of membranes prepared from the human hippocampus (an in other tissues). By contrast, other brain regions either were commercially unavailable as cDNA expression libraries or yielded inconsistencies between the two antisera. Therefore, it was felt that a human hippocampal cDNA library held the best opportunity for obtaining a CDNA for an imidazoline receptor. Immunoexpression screening was chosen over other cloning strategies because of its sensitivity when coupled with the ECL detection system used by the present inventors, as discussed hereinbelow.
- A number of unique discoveries led to identifying the first 5A-1 clone as an imidazoline receptor CDNA. These included discoveries that led to the choice of a hippocampal CDNA library and adapting ECL to the antisera. Once the initial clone (5A-1) was identified and sequenced, a more complete clone (EST04033) was purchased without restriction from ATCC Inc. (Catalogue # 82815; American Type Culture Collection, Rockville, Md.). EST 04033 was the only EST clone available at the time of the discovery of 5A-1, that contained a segment of complete homology (the origination of EST04033 is discussed later on). The binding affinities of the expressed protein after transfection in COS cells were determined by radioligand binding procedures developed in the inventor's laboratory [Piletz and Sletten, 1993, ibid].
- To identify an instant CDNA clone encoding an imidazoline receptor it was preferable to employ both of the known antibodies to imidazoline receptors. These antibodies were obtained by contacting Dr. D. Reis (Cornell University Medical Center, New York City), and Drs. M. Dontenwill and P. Bousquet (Laboratoire de Pharmacologie Cardiovascular et Renale, CNRS, Strasbourg, France). These antisera were obtained free of charge and without confidentiality or restrictions on their use. The former antiserum (Reis antiserum) was derived from a published imidazoline receptor protein [Wang et al., (1992, 1993), the disclosures of which are incorporated herein by reference]. The method for raising the latter antiserum (Dontenwill antiserum) has also been published [Bennai et al., (1995), the disclosure of which is also incorporated herein by reference]. The latter antiserum was developed using an anti-idiotypic approach that identified the pharmacologically correct (clonidine and idazoxan selective) binding site structure.
- The obtained Reis antiserum had been prepared against a purified imidazoline binding protein isolated from BAC cells, which protein runs in denaturing-SDS gels at 70 Kda [Wang et al., 1992, 1993]. The Dontenwill antiserum is anti-idiotypic, and thus is believed to detect the molecular configuration of an imidazoline binding site domain in any species. Prior to being used for screening plaques, both antisera were cleaned by stripping out possible antibacterial antibodies.
- Both antisera have been tested to ensure that they are in fact selective for a human imidazoline receptor. In particular, we found that both of these antisera detected identical bands in human platelets and hippocampus, and in brainstem RVLM (NRL) by Western blotting (see FIG. 1). In these studies, in order to increase sensitivity over previously published detection methods, an ECL (Enhanced Chemiluminescence) system was employed. The linearity of response of the ECL system was demonstrated with a standard curve. ECL detection was demonstrated to be very quantifiable and about ten times more sensitive than other screening methods previously used with these antisera. Western blots with antiserum dilutions of 1:3000 revealed immunoreactivity with as little as 1 ng of protein from a human hippocampal homogenate by dot blot analysis.
- For the studies depicted in FIG. 1, human hippocampal homogenate (30 μg) and NRL membrane proteins (10 μg) were electrophoresed through a 12.5% SDS-polyacrylamide gel, electrotransfered to nitrocellulose and sequentially incubated with (1) the Reis antibody (1:2000 dilution) and (2) the Dontenwill antibody (1:5000 dilution). Immunoreactive bands were visualized with an Enhanced Chemiluminescence (ECL) detection kit (Amersham) using anti-rabbit Ig-HRP conjugated antibody at a dilution of 1:3000 and the ECL detection reagents. Following detection with the antibody, blots were stripped and reprocessed omitting the primary antibody to check for complete removal of this antibody. In panels A and B,
lane 1 shows the immunoreactive bands observed with the Reis antibody andlane 2 shows the bands detected with the Dontenwill antibody. Protein molecular weight standards are indicated to the left of each panel (in Kda). - Despite the diverse origins of the Reis and Dontenwill antisera, both of these antisera detected a similar 85 Kda protein in human brain and other tissues. But, a 33 Kda band was found in human platelets. Although the 33 Kda band is of smaller size than that reported for other tissues [Wang et al., 1993; Escriba et al., 1994; Greney et al., 1994], the fact that both antisera detected it, suggests that both the 85 Kda and 33 Kda bands may be imidazoline binding polypeptides. The 85 and 33 Kda bands were enriched in plasma membrane fractions, as is known to be the case for IR1 binding, but not I2 binding [Piletz and Sletten, 1993].
- A significant positive correlation was established for the 85 Kda band detected by the Dontenwill antiserum with IR1 Bmax values across nine rat tissues (r2=0.8736). A similar positive correlation was established amongst platelet samples from 15 healthy platelet donors between radioligand IR1 Bmax values (but not I2 or α2AR Bmax values), and the 33 Kda band (presumed IR1 immunoreactivity) on Western blots. This correlation exhibited a slope factor close to unity (results not shown). These correlations strongly suggested that an IR1 binding protein might be revealed in an imidazoline receptor-antibody Western blotting assay. Furthermore, the Reis antiserum failed to detect authentic α2AR, MAO A, or MAO B bands on gels, i.e., it was not immunoreactive with MAO at MW=61 Kda, or α2AR at MW=64 Kda. Additionally, no immunoreactive bands were observed using preimmune antiserum. Thus, after extensively characterizing these antisera with human and rat materials, it was concluded that these antisera are indeed selective for human imidazoline receptor protein.
- A commercially available human hippocampal cDNA λgt11 expression library (Clontech Inc., Palo Alto, Calif.) was screened for immunoreactivity sequentially using both the anti-idiotypic Dontenwill antiserum and the Reis antiserum. Standard techniques to induce protein and transference to a nitrocellulose overlay were employed. [See, for instance, Sambrook et al., 1989, “Molecular Cloning: A Laboratory Manual,” Cold Spring Harbor Laboratory Press]. After washing and blocking with 5% milk, the Dontenwill antiserum was added to the overlay at 1:20,000 dilution in Tris-buffered saline, 0.05% Tween20, and 5% milk. The Reis antiserum was employed similarly, but at 1:15,000 dilution. These high dilutions.of primary antiserum were chosen to avoid false positives. The secondary antibody was added, and positive plaques were identified using ECL. Representative results are shown in FIG. 2.
- Positive plaques were pulled and rescreened until tertiary screenings yielded only positive plaques. Four separate positive plaques were identified from more than 300,000 primary plaques in our library. Recombinant λgt11 DNA purified from each of the four plaques was subsequently subcloned intoE. coli pBluescript vector (Stratagene, La Jolla, Calif.). Sequencing of these four cDNA inserts in pBluescript demonstrated that they were identical, suggesting that only one cDNA had actually been identified four times. Thus, the screening had been verified as being highly reproducible and the frequency of occurrence was as expected for a single copy gene, i.e., one in 75,000 transcripts. As shown in FIG. 2, the protein produced by the first positive clone, designated 5A-1, tested positive with both the Reis antiserum and the Dontenwill antiserum. Clone 5A-1 has been deposited under the Budapest Treaty with the American Type Culture Collection (ATCC), 12301 Parklawn Drive, Rockville, Md., USA, 20852, on Aug. 28, 1997 and has been assigned deposit accession no. ATCC 209217. Tertiary-screened plaques of 5A-1 were all immuno-positive with either of the two known anti-imidazoline receptor antisera, but not with either preimmune antisera. These results suggested that clone 5A-1 encoded a fusion peptide similar to or identical with one of the predominant bands detected in human Western blots by both the Dontenwill and Reis antisera.
- Sequencing of the first four clones was performed by contracting with ACGT Company (Chicago, Ill.) after subcloning them into pBluescript vector SK (Stratagene). Both manual and automatic sequencing strategies were employed which are outlined as follows:
- Manual Sequencing
- 1. DNA sequencing was performed using T7 DNA polymerase and the dideoxy nucleotide termination reaction.
- 2. The primer walking method [Sambrook et al., ibid.] was used in both directions.
- 3. (35S)dATP was used for labelling.
- 4. The reactions were analyzed on 6% polyacrylamide wedge or non-wedge gels containing 8 M urea, with samples being loaded in the order of A C G T.
- 5. DNA sequences were analyzed by MacVector Version 5.0. and by various Internet-available programs, i.e., the BLAST program.
- Automatic Sequencing
- 1.DNA sequencing was performed by the fluorescent dye terminator labelling method using AmpliTaq DNA polymerase (Applied Biosystems Inc., Prizm DNA Sequencing Kit, Perkin-Elmer Corp., Foster City, Calif.).
- 2. The primer walking method was used. The primers actually used were a subset of those shown in SEQ ID Nos. 7-20.
- 3. Sequencing reactions were analyzed on an Applied Biosystems, Inc. (Foster City, Calif.) sequence analyzer.
- These results demonstrated that the initial clone (5A-1) contained a 1171 base pair insert (see SEQ ID No. 4). The entire 5A-1 cDNA was found to exist as extended open reading frame for translation into protein. Consequently, it was determined that the 5A-1 cDNA must be a fragment of a larger mRNA.
- cDNA Sequence Homologies
- Using programs and databases available on the Internet (retrieved from NCBI Blast E-mail Server address blast@ncbi.nlm.nih.gov), it was determined that the 5A-1 clone encodes a previously undefined unique molecule. The BLASTP program [1.4.8MP, Jun. 20, 1995 (build Nov. 11. 1995)] was used to compare all of the possible frames of amino acid sequences encoded by 5A-1 versus all known amino acid sequences available within multiple international databases [Altschul et al.,J. Mol. Biol., 215: 403-410 (1990)]. Only one protein, from Micrococcus luteus, possessed a marginally significant homology (p=0.04)(41%) over a short stretch of 75 of the 390 amino acids encoded by 5A-1. Otherwise, there were not any amino acid homologies (i.e., with p≦0.05) for any known proteins. Therefore, the protein encoded by 5A-1 is not significantly related to MAO A or B, α2AR, or any other known eukaryotic protein in the literature.
- In contrast to the amino acid search on BLASTP, two nearly homologous EST cDNA sequences of undefined nature covering 155 and 250 b.p. of the 5A-1 clone were reported to exist using BLASTN (reached from the same Internet server on Nov. 13, 1995). BLASTN is a program used to compare known DNA sequences from international databases, regardless of whether they encode a polypeptide. Neither of the two EST cDNA sequences having high homology to 5A-1, to our knowledge have been reported anywhere else except on the Internet. Both were derived as Expressed Sequence Tags (ESTs) in random attempts to sequence the human cDNA repertoire [as described in Adams et al.,Science, 252: 1651-1656 (1991)]. As far as can be determined, the people who generated these ESTs lack any knowledge of what protein(s) they encode. One cDNA, designated HSA09H122, contained 250 b.p. with 7 unknown/incorrect base pairs (97% homology) versus 5A-1 over the same region. HSA09H122 was generated in France (Genethon, B.P. 60, 91002 Evry Cedex France) from a human lymphoblast cDNA library. The other EST, designated EST04033, contained 155 b.p. with 12 unknown/incorrect base pairs (92% homology) versus 5A-1 over the same region. EST04033 was generated at the Institute for Genomic Research (Gaithersburg, Md.) from a human fetal brain cDNA clone (HFBDP28). Thus, both of these ESTs are short DNA sequences and contain a number of errors (typical of single-stranded sequencing procedures as used when randomly screening ESTs).
- Based on the BLASTN search, the owner of HSA09H122 was contacted in an effort to obtain that clone. The current owner of the clone appears to be Dr. Charles Auffret (Paul Brousse Hospital, Genetique, B. P. 8, 94801 Villejuif Cedex, France). Dr. Auffret indicated by telephone that his clone came from a lot of clones believed to be contaminated with yeast DNA, and he did not trust it for release. Contamination with yeast DNA of that clone was later confirmed to have been reported within an Internet database. Thus, HSA09H122 was not reliable.
- The other partial clone (EST04033) was purchased from American Type Culture Collection in Rockville, Md. (ATCC Catalog no. 82815). A telephone call to the Institute for Genomic Research revealed that it had been deposited at ATCC under [insert terms]. As far as can be determined, the present inventors were the first to completely sequence EST04033. The complete size of EST04033 was 3389 b.p. (SEQ ID No. 1), with a 3,318 b.p. nonplasmid insert (see SEQ ID No. 3). Within this sequence of EST04033 the remaining 783 base pairs of the coding sequence presumed for a 70 kDa imidazoline receptor were predicted at the 5′ side of 5A-1 (i.e., 783 coding nucleotides unique to EST04033+1171 coding nucleotides of 5A-1=1954 predicted total coding nucleotides; assuming b.p.# 1397-1400 in SEQ. No. 1 encodes the initiating methionine). The entire 1954 b.p. coding region for an 70 kDa protein is shown in SEQ ID No. 2. The nucleotide sequence of EST04033 was determined in the same manner as described previously for the 5A-1 clone. The nucleotide sequence of the entire clone is shown in SEQ ID No. 1. In this sequence, an identical overlap was observed for the sequence obtained previously for the 5A-1 clone and the sequence obtained for EST04033. The 5A-1 overlap began at EST04033 b.p. 2,181 (SEQ. No.1) and continued to the end of the molecule (b.p. 3,351).
- Conclusions About Our cDNA Clones
- cDNA of the present invention encode a protein that is immunoreactive with both of the known selective antisera for an imidazoline receptor, i.e.,. Reis antiserum and Dontenwill antiserum. Thus, an instant cDNA molecule produces a protein immunologically related to a purified imidazoline receptor and has the antigenic specificity expected for an imidazoline binding site. These antisera have been documented in the scientific literature as being selective for an “imidazoline receptor”, which provides strong evidence that such an imidazoline receptor has indeed been cloned.
- As mentioned, our instant cDNA sequence contains open reading frame distinct from any previously described proteins. Therefore, the encoded protein is novel, and it is unrelated to α2-adrenoceptors or monoamine oxidases. Small hydrophobic domains in the predicted amino acid sequence suggest that the protein is probably membrane bound, as expected for an imidazoline receptor.
- A pre-made genomic library of human placental DNA was purchased from Stratagene (La Jolla, Calif.) to screen for an IR gene by hybridization. The genomic library was constructed in Stratagene's vector λ FIX® II (catalog # 946206), and it was grown in XL1-Blue MRA (P2) host bacteria. It was titered to yield approximately 50,000 plaques per 137 mm plate. Lifts from six such plates were screened in duplicate by hybridization. The DNA probe used for screening was a 1.85 kb EcoR1 fragment from EST 04033 cDNA (uniquely related to our sequences based on the BLASTN). After the restriction digestion of EST 04033 DNA, the 1.85 kb fragment was extracted from an agarose electrophoresis gel, cleaned according to the GENECLEAN® III kit manual (BIO 101, Inc., P.O. Box 2284, La Jolla, Calif.), and radiolabeled with [α-32P]d-CTP according to Stratagene's Prime-It® II Random Primer Labeling Kit manual. Plaques were lifted onto 137 mm Duralon-UV™ membranes (Stratagene's catalog #420102), denatured, and cross-linked with Stratgene's UV-Stratalinker™ 1800. Hybridization was conducted under high stringency conditions: prehybridization=6×SSC, 1% SDS, 50% formamide, and 100 μg/ml of sheared, denatured salmon sperm DNA at 42° C. for 2 hrs; hybridization=6×SSC, 1% SDS, 50% formamide, and 100 μg/ml of sheared, denatured salmon sperm DNA at 45° C. overnight; wash=2 washes of 1×SSC, 0.1% SDS at 65° C. and 3 washes of 0.2×SSC, 0.2% SDS at 65° C. This hybridization procedure is essentially described in Stratagene's vector λ FIX® II instruction manual. Positive plaques were localized by developing Kodak BioMax films. Two positive genomic clones of identical size were retained through three rounds of screening.
- One of the positive genomic clones (designated JEP 1-A) was selected for complete characterization. It was found to contain an ≈17 kb insert. Large-scale preparations of this genomic clone DNA were performed using the λ QUICK! SPIN kit (BIO101, La Jolla, Calif.). To verify that we had cloned a gene corresponding to 5A-1 and EST04033 cDNA, some restriction site positions in the genomic clone were determined using the FLASH Nonradioactive Gene Mapping Kit (Stratagene) and compared to Southern blots of human DNA. The location of genomic sequences highly related to (or identical to) those of our cDNA clones was determined by high stringency hybridization (as above) with the following32P-labeled probe: a 1110 bp ApaI-EcoRI fragment from the cDNA clone 5A-1. This fragment was chosen as the probe because it lacks the GAG repeat (encoding glutamic acids), which might have complicated matters if it were found to be repeated elsewhere in the genome. With genomic clone JEP1-A, we detected a 14.1 kb EcoRI fragment and a 7.7 kb SacI fragment that hybridized with this probe. Southern blots containing EcoRI- or SacI-digested human genomic DNA (from human blood) with the 1110 bp ApaI-EcoRI cDNA probe also resulted in the detection of a 14.1 kb EcoRI fragment and a 7.7 kb SacI fragment. No additional restriction fragments of human genomic DNA appeared to hybridize with this probe under lower stringency conditions. These results strongly suggested that this gene (JEP-1A) encodes transcript(s) giving rise to the 5A-1 and EST04033 cDNA clones. Clone JEP-1A has been deposited under the Budapest Treaty with the American Type Culture Collection (ATCC), 12301 Parklawn Drive, Rockville, Md., USA, 20852, on Aug. 28, 1997 and has been assigned deposit accession no. ATCC 209216.
- Genomic DNA sequencing was done by contract with Cadus Pharmaceutical Corporation (Tarrytown, N.Y.). The original lambda JEP1-A clone was subcloned into pzero (Invitrogen) as a convenient vector. The initial fragments for sequencing were derived from Sac I and Xba I restriction enzymes. The short Sac I fragments of 1.5, 3.0 and 3.5 kb were further digested with Hind III, Pst I, and Kpn I yielding 15 subclones of varying length. The procedure consisted of sequencing all these subclones and parent clones with vector forward and reverse primers. Subsequently, this initial round of sequencing was supplemented with primer walking using custom oligonucleotides. The Sac I fragments were joined together by primer walking using the 2 Xba I fragments of 3 and 10 Kb. Then, the largest Sac I fragment (8 kb) and the 10 kb Xba I fragment were used as templates for a transposon sequencing method. The method used was the Primer Island Transposition Kit (Perkin-Elmer Corp., Norwalk, Conn.; Applied Biosystems) (ABI). The kit consists of a synthetic transposon (Ty1) containing forward and reverse primers and the integrase enzyme which inserts the transposon randomly into the target plasmid DNA. Transposon insertion is an alternative to subcloning or primer walking when sequencing a large region of DNA (Devine and Boeke, Nucleic Acids Res. 22: 3765-3772 (1994); Devine et al., Genome Res., in press, (1997); Kimmel et al., In Genome Analysis, a Laboratory Manual, Cold Spring Harbor Press, NY, N.Y., in press (1997). A total of over 250 individual sequencing reactions were performed. Sequencing was done on ABI model 373 and 377 automated sequencers using ABI dye-terminator sequencing kits. Primers were designed using Gene Runner software (Hastings Software, Hastings On Hudson, N.Y.). Oligonucleotides were purchased from Gibco-BRL (Gaithersburg, Md.). Sequence assembly was performed using Sequencer Software (Gene Codes Corp., Ann Arbor, Mich.) from 4-fold redundancy of sequences.
- The entire sequence of our JEP-1A genomic clone is shown in SEQ. 21. The computer program, GENSCAN 1.0, was able to identify splice sites of known topology. As expected, this gene contained a number of introns. See Table 1 hereinbelow. Only one continuous open reading frame was identified within our genomic clone. This open reading frame was interrupted by a number of introns (which is typical of eukaryotic transcripts) as shown in FIG. 5. The predicted polypeptide is encoded by the genomic DNA beginning at b.p. # 971 of SEQ ID No. 21. The predicted amino acid sequence of the polypeptide encoded thereby is shown in SEQ ID No. 22. As can be seen, the entire 5A-1 DNA and polypeptide sequence was encapsulated within this predicted genomic transcript. Therefore, there is no question that this is the gene encoding 5A-1 and EST04033 cDNA. In addition, JEP-1A has more nearly defined the full-length transcript (by at least 102 more coding nucleotides than the cDNAs alone).
TABLE 1 Position of Predicted Introns and Exons GENSCAN 1.0 Date run: Aug. 26, 1997 Time: 12:35:39 Sequence gs_seqfile : 15202 bp : 58.36% C + G : Isochore 4 (57.00- 100.00 C + G %) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin . . . End .Len Fr Ph I/Ac Do/T CodRg P . . . Tscr . . . 1.01 Intr + 971 1084 114 1 0 69 98 200 0.836 20.91 1.02 Intr + 4096 4177 82 0 1 37 53 81 0.358 −0.13 1.03 Intr + 5732 5856 125 0 2 117 95 84 0.953 13.48 1.04 Intr + 6997 7046 50 0 2 95 116 44 0.998 6.52 1.05 Intr + 8416 9825 1410 1 0 96 94 2914 0.970 283.09 1.06 Intr + 10489 10897 409 1 1 15 59 318 0.517 17.19 1.07 Intr + 11293 11449 157 0 1 57 61 236 0.998 18.57 1.08 Intr + 11923 12051 129 2 0 84 63 224 0.997 21.34 1.09 Intr + 12570 12731 162 1 0 95 80 229 0.996 23.94 1.10 Term + 13090 13700 611 2 2 59 41 1012 0.942 89.44 1.11 PlyA + 14257 14262 6 1.05 - A BLASTN analysis of the entire genomic sequence (on Aug. 26, 1997) demonstrated again that this gene has not been previously defined in the literature.
- As with the cDNA clones, some EST sequences of identity were found (listed below and later). Of particular interest was a variance in the first intron splice site predicted by the computer. Upstream of that site (ie., upstream of amino acids PEKKGGE=amino acids predicted after first splice site) we have identified two types of transcripts. Genomic clone JEP-1A predicted 34 amino acids upstream of that sequence before entering another intron upstream. In an identical manner, three ESTs (H61282, AA428790 and AA428250) overlapped that entire region in our clones and they contained the identical nucleotides for those 34 amino acids, plus an additional 22 more amino acids further upstream. By comparison, however, our EST04033 varied from all of these ESTs upstream of that site. This means, the first 1,532 nucleotides of EST04033 (thought to encode translation of amino acids 1-56 of EST04033 beginning at b.p. 1,398 in SEQ. 1) are completely at variance with the other ESTs down to that splice site, but from there on they are identical. This provides strong evidence that this site can generate two alternatively spliced transcripts which can produce at least one functional protein (ie., the transfections with EST04033). For the reader's information, this splice site is upstream of b.p. # 1,565 in SEQ.1, b.p. # 168 in SEQ.2, b.p. # 1,532 in SEQ.3, amino acid # 57 in SEQ.5, and b.p. # 971 in the genomic SEQ.21.
- Genomic Sequence Analysis
- Of interest is a unique glutamic- and aspartic acid-rich region within our predicted protein. This region of the IR protein delineates a highly unique span of 59 amino acids, 36 of which are Glu or Asp residues (61%). This region was largely discovered within clone 5A-1 and it is present within all discovered and predicted transcripts from the gene (EST04033 included). This sequence lies between two potential transmembrane loops (hydrophobic domains). The identification of this unique Glu/Asp-rich domain within our clones is consistent with an expected negatively charged pocket capable of binding clonidine and agmatine, both of which are highly positively charged ligands. Also, since the Dontenwill antiserum was specifically developed against an idazoxan/clonidine binding site, and its immunoreactivity is directed against the clone 5A-l/λgt11 fusion protein, this suggests that clone 5A-1 might encode an imidazoline binding site. Furthermore, this glu/asp-rich sequence is located within the longest stretch of homology that the clone has with any known protein, i.e., the ryanodine receptor (as determined by on BLASTN). Specifically, we have discovered four regions of homology between the imidazoline receptor and the ryanodine receptor, which are all Glu/Asp-rich. The total nucleic acid homology is 67% with the ryanodine receptor DNA over the stretches encompassing this region. However, this is not sufficient to indicate that the imidazoline receptor is a subtype of the ryanodine receptor, because this homologous stretch is still a minor portion of the overall transcript(s) identified in the gene. Instead, this significant homology may reflect a commonality in function between this region of the IR and the ryanodine receptor.
- The Glu/Asp-rich region within the ryanodine receptor has also been reported to define a calcium and ruthenium,red dye binding domain that modulates the ryanodine receptor/Ca++ release channel located within the sarcoplasmic reticulum. The only other charged amino acids within the Glu/Asp-rich region of our clones are two arginines (the ryanodine receptor has uncharged amino acids at the corresponding positions).
- Based on this identification of Arg residues within the Glu/Asp-rich region of the predicted imidazoline binding site, the assistance of Dr. Paul Ernsberger (Case Western Reserve University, Cleveland, Ohio) was enlisted. Dr. Ernsberger performed phenylglyoxal attack of arginine on native PC-12 membranes. Dr. Ernsberger was able to demonstrate that this treatment completely eliminated imidazoline binding sites in these membranes. This provides some indirect evidence that the native imidazoline binding site also contains an Arg residue. On the other hand, attempts to chemically modify cysteine and tyrosine residues, which are not located near the Glu/Asp-rich region did not affect PC-12 membrane binding of imidazolines.
- As a further test of the sequence, it was determined whether native IR binding sites in PC-12 cells would be sensitive to ruthenium red. From the structure of the cloned sequence, it was reasoned that native IR should bind ruthenium red. Accordingly, a competition of ruthenium red with125PIC at native IR sites on PC-12 membranes was studied. In these studies it was observed that ruthenium red competed for 125PIC binding to the same extent as did the potent imidazoline compound, moxonidine, i.e., 100% competition. Furthermore, the IC50 for competition of ruthenium red at IR was slightly more robust than reported for ruthenium red on the activation of calcium-dependent cyclic nucleotide phosphodiesterase—the previous most potent effect of ruthenium red on any biological site—indicating possible pharmacological importance. It is also noteworthy that calcium failed to compete for 125PIC binding at PC-12 IR sites (as did a calcium substitute, lanthanum). We and others have previously reported that a number of other cations robustly interfere with IR binding [Ernsberger et al., Annals NY Acad.Sci., 763: 22-42 (1995); Ernsberger et al., Annals NY Acad.Sci., 763: 163-168 (1995)]. Attempts were also made to directly stain the proteins in SDS gels with ruthenium red [Chen and MacLennan, J. Biol. Chem., 269: 22698-22704 (1994)]. It was found that ruthenium red stains the same platelet (33 kDa) and brain (85 kDa) bands that Reis antiserum detects. (Remember, the same 33 kDa band was verified to directly correlate with 125PIC Bmax values for IR.) Thus, these results linked the attributes predicted from the cloned sequence to a native IR binding site.
- Two other facets of the predicted polypeptide from JEP-1A suggest that we have identified most of the functional sequences. First, our predicted protein is comparable in regard to both the order and size of three regions of importance to the function of the interleukin-2Rβ receptor (IL-2Rβ). Specifically, IL-2Rβ possesses the following regions over a span of 286 amino acids: ser-rich region, followed by glu/asp-rich region, followed by proline-rich region. Likewise, our predicted protein has the same three regions, in the same order, over a span of about 625 amino acids. This suggests that our protein might function similarly as cytokine receptors. Secondly, our predicted protein possesses a cytochrome p450 heme-iron ligand signature sequence [Nelson et al., Pharmacogenetics 6: 1-42 (1996)]. This suggests that our protein might also function as do cytochrome p450s in oxidative, peroxidative and reductive metabolism of endogenous compounds.
- Some additional findings about the amino acid sequence of our instant IR polypeptide are: (1) The glu/asp-rich region also bears similarity to an amino acid sequence within a GTPase activator protein. (2) There appear to be four small hydrophobic domains indicative of transmembrane domain receptors. (3) A number of potential protein kinase C (PKC) phosphorylation sites appear near to the carboxy side of the protein, and we have previously found that treatment of membranes with PKC leads to an enhancement of native IR binding. Thus, these observations are all consistent with other observations expected for native IR.
- RNA Studies
- Northern blotting has also been performed on polyA+ mRNA from human tissues in order to ascertain the regional expression of the mRNA corresponding to our cDNA. The same 1110 b.p. ApaI-EcoRI fragment from cDNA clone 5A-1 used in Southern blots was used for these studies. This probe region was not found within any other known sequences on the BLASTN database. The results revealed a 6 kb mRNA band, which predominated over a much fainter 9.5 kb mRNA in most regions (FIG. 6). Some exceptions to this pattern were in lymph nodes and cerebellum (FIG. 6), where the 9.5 kb band was equally or more intense. Although the 6 kb band is weakly detectable in some non-CNS tissues, it is enriched in brain. An enrichment of the 6 kb mRNA is observed in brainstem, although not exclusively. The regional distribution of the mRNA is somewhat in keeping with the reported distribution of IR binding sites, when extrapolated across species (FIG. 6). Thus, the rank order of Bmax values for IR in rat brain has been reported to be frontal cortex>hippocampus>medulla oblongata>cerebellum [Kamisaki et al., Brain Res., 514: 15-21 (1990)]. Therefore, with the exception of human cerebellum, which showed two mRNA bands, the distribution of the mRNA for our the present cloned cDNA is consistent with it belonging to IR.
- [It should be noted that while IR binding sites are commonly considered to be low in cerebral cortex compared to brainstem, this is in fact a misinterpretation of the literature based only on comparisons to the alpha-2 adrenoceptor's Bmax, rather than on absolute values. Thus, IR Bmax values have actually been reported to be slightly higher in the cortex than the brainstem, but they only “appear” to be low in the cortex in comparison to the abundance of alpha-2 binding sites in cortex. Therefore, the distribution of the IR mRNA is reasonably in keeping with the actual Bmax values for radioligand binding to the receptor [Kamisaki et al., (1990)].
- A final point to emphasize about the Northern blots is that they clearly demonstrate two high-stringency transcripts (FIG. 6). This is in keeping with the alternatively spliced EST cDNAs mentioned earlier. Thus, we suggest this may be the basis for both the 6 and 9.5 kb transcripts.
- Summary of Genomic Sequence Results
- The JEP-1A clone clearly contains most of the gene. Within it we have identified at least 3,776 nucleotides for transcript(s) (encoding 1,065 amino acids plus 587 b.p. of untranslated region down to the polyT+ tail). This has been lengthened by at least 66 coding nucleotides upstream (22 amino acids) in comparison to overlapping ESTs. In addition to this, we are quite confident of the splice site for the two observed mRNA sizes. Most of the functional sequences are predicted to be encoded within our genomic clone.
- A summary of the evidence that a gene encoding an imidazoline receptor protein has been cloned is summarized in Table 2 hereinbelow.
TABLE 2 Comparison of Protein Predicted From Our Clones with Properties of Native IR1 and I2 Sites Imidazoline Receptor- like Clone Authentic IR1 Authentic I2 Original λ phage Dontenwill-Ab activity Dontenwill & Reis Abs fusion protein (from (a)inhibits RVLM IR1 both inhibit brain I2 5A-1) is immunoreac- binding (3H-Cloni- sites (3H-IDX). tive with Dontenwill dine), & (b) correlates and Reis antibodies with 85 kDa Western band. Reis-Ab activity correlates w platelet IR1 Bmax (125PIC binding) Segment homologous Weak to moderate Not sensitive to GTP to a GTPase-acti- sensitivity to GTP vator prot'n Predicts ≧ 120,000 85,000 MW 59-61,000 MW MW protein immunoreactivity photoaffinity Predicts 1-4 Enriched in plasma Enriched in hydrophobic domains membranes mitochondria Encodes Glu/Asp-rich Binds (+)-charged Binds (+)-charged (negatively charged) imidazolines imidazolines domain consistent with Sensitive to Not sensitive to Ca++ and ruthenium divalent cations divalent cations red binding Sensitive to Unknown ruthenium red sensitivity for Ruthen. red Arginine is only Arg attack Unknown positively charged eliminates amino acid near Glu/ binding Asp domain Cys & Tyr attack w/o effect on binding Encodes PKC sites PKC treatment Unknown enhances binding Human mRNA Rat IR1 Bmax Rat I2 Bmax (3H-IDX): Distribution; (125PIC): F. Cortex > Medulla > F. Cortex F. Cortex > hippo- hippocampus > campus > medulla medulla Transfected COS-7 High affinity for Low affinity for cells expressed moxonidine and PIC moxonidine and PIC high affinity for moxonidine & p-iodoclonidine (PIC) - COS-7 cells were transfected with a vector containing EST04033 cDNA, which was predicted based on sequence analysis to contain the glu/asp rich region thought to be important for ligand binding to the imidazoline receptor protein. The EST04033 cDNA was subcloned into pSVK3 (Pharmacia LKB Biotechnology, Piscataway, N.J.) using standard techniques [Sambrook, supra], and transfected via the DEAE-dextran technique as previously described [Choudhary et al.,Mol. Pharmacol., 42: 627-633 (1992); Choudhary et al., Mol. Pharmacol., 43: 557-561 (1993); Kohen et al., J. Neurochem., 66: 47-56 (1996)]. A restriction map of the EST04033 cDNA is shown in FIG. 3. The restriction enzymes Sal I and Xba I were used for subcloning into pSVK3.
- Briefly stated, COS-7 cells were seeded at 3×106 cells/100 mm plate, grown overnight and exposed to 2 ml of DEAE-dextran/plasmid mixture. After a 10-15 min. exposure, 20 ml of complete medium (10% fetal calf serum; 5 μg/ml streptomycin; 100 units/ml penicillin, high glucose, Dulbeccos' modified Eagle's medium) containing 80 μM chloroquine was added and the incubation continued for 2.5 hr. at 37° C. in a 5% CO2 incubator. The mixture was then aspirated and 10 ml of complete medium containing 10% dimethyl sulfoxide was added with shaking for 150 seconds.
- Following aspiration, 15 ml of complete medium with dialyzed serum was added and the incubation continued for an additional 65 hours. After this time period, the cells from 6 plates were harvested and membranes were prepared as previously described [Ernsberger et al.,Annals NY Acad. Sci., 763: 22-42 (1995), the disclosure of which is incorporated herein by reference]. Parent, untransfected COS-7 cells were prepared as a negative control. Some membranes were treated with and without PKC for 2 hrs prior to analysis, since previous studies had indicated that receptor phosphorylation could be beneficial to detect IR binding.
- Transfected samples were also analyzed by Western blots. The protocol used for Western blot assay of transfected cells is as follows. Cell membranes were prepared in a special cocktail of protease inhibitors (1 mM EDTA, 0.1 mM EGTA, 1 mM phenylmethyl-sufonylfluoride, 10 mM ε-aminocaproic acid, 0.1 mM benzamide, 0.1 mM benzamide-HCl, 0.1 mM phenanthroline, 10 μg/ml pepstatin A, 5 mM iodoacetamide, 10 μg/ml antipain, 10 μg/ml trypsin-chymotrypsin inhibitor, 10 μg/ml leupeptin, and 1.67 μg/ml calpain inhibitor) in 0.25 M sucrose, 1 mM MgCl2, 5 mM Tris, pH 7.4. Fifteen μg of total protein were denatured and separated by SDS gel electrophoresis. Gels were equilibrated and electrotransferred to nitrocellulose membranes. Blots were then blocked with 10% milk in Tris-buffered saline with 0.1% Tween-20 (TBST) during 60 min. of gentle rocking. Afterwards, blots were incubated in anti-imidazoline receptor antiserum (1:3000 dil.) for 2 hours. Following the primary antibody, blots were washed and incubated with horseradish peroxidase-conjugated anti-rabbit goat IgG (1:3000 dil.) for 1 hr. Blots were extensively washed and incubated for 1 min. in a 1:1 mix of Amersham ECL detection solution. The blots were wrapped in cling-film (SARAN WRAP) and exposed to Hyperfilm-ECL (Amersham) for 2 minutes. Quantitation was based on densitometry using a standard curve of known amounts of protein containing BAC membranes or platelet membranes run in each gel.
- One nM [125I]p-iodoclonidine was employed in the radioligand binding competition assays, since at this low concentration this radioligand is selective for the IR site much more than for I2 binding sites. The critical processes of membrane preparation of tissue culture cells and the radioligand binding assays of IR and I2 have been reviewed by Piletz and colleagues [Ernsberger et al., Annals NY Acad Sci., 763: 510-519 (1995)]. Total binding (n=12 per experiment) was determined in the absence of added competitive ligands and nonspecific binding was determined in the presence of 10−4 M moxonidine (n=6 per experiment). Log normal competition curves were generated against unlabeled moxonidine, p-iodoclonidine, and (−) epinephrine. Each concentration of the competitors was determined in triplicate and the experiment was repeated thrice.
- The protocol to fully characterize radioligand binding in the transfected cells entails the following. First, the presence of IR and/or I2 binding sites are scanned over a range of protein concentrations using a single concentration of [125I]-p-iodoclonidine (1.0 nM) and 3H-idazoxan (8 nM), respectively. Then, rate of association binding experiments (under a 10 μM mask of NE to remove α2AR interference) are performed to determine if the kinetic parameters are similar to those reported for native imidazoline receptors [Ernsberger et al. Annals NY Acad. Sci., 763: 163-168 (1995)]. Then, full Scatchard plots of [125I]-p-iodoclonidine (2-20 nM if like IR) and 3H-idazoxan (5-60 nM if like I2) binding are conducted under a 10 μM mask of NE. Total NE (10 μM)-displaceable binding is ascertained as a control to rule out α2-adrenergic binding. The Bmax and KD parameters for the transfected cells are ascertained by computer modeling using the LIGAND program [McPherson, G., J. Pharmacol. Meth., 14: 213-228 (1985)] using 20 μM moxonidine to define IR nonspecific binding, or 20 μM cirazoline to define I2 nonspecific binding.
- The results of the transient transfection experiments of the imidazoline receptor vector into COS-7 cells are shown in FIG. 4. Competition binding experiments were performed using membrane preparations from these cells and125PIC was used to radiolabel IR sites. A mask of 10 μM norepinephrine was used to rule out any possible α2AR binding in each assay even though parent COS-7 cells lacked any α2AR sites. Moxonidine and p-iodoclondine (PIC) were the compounds tested for their affinity to the membranes of transfected cells. As can be seen, the affinities of these compounds in competition with 125PIC were well within the high affinity (nM) range.
- The following IC50 values and Hill slopes were obtained in this study: moxonidine, IC50=45.1 nM (Hill slope=0.35±0.04); p-iodoclonidine without PKC pretreatment of the membranes, IC50=2.3 nM (Hill slope=0.42±0.06); p-iodoclonidine with PKC pretreatment of the membranes, IC50=19.0 nM (Hill slope=0.48±0.08). Shallow Hill slopes for [125I]p-iodoclonidine have been reported before in studies of the interaction of moxonidine and p-iodoclonidine with the human platelet IR1 binding site [Piletz and Sletten, (1993)]. Epinephrine failed to displace any of the [125I]p-iodoclonidine binding in the transfected cells, as expected since this is a nonadrenergic imidazoline receptor. Furthermore, in untransfected cells less than 5% of the amount of displaceable binding was observed as for the transfected cells—and this “noise” in the parent cells all appeared to be low affinity (data not shown). These results thus demonstrate the high affinities of two imidazoline compounds, p-iodoclonidine and moxonidine, for the portion of our cloned receptor encoded within EST04033. PKC pretreatment of the membranes had no effect in the transfected COS cells.
- It was also observed that the level of the expressed protein, as measured by Western blotting of the transfected cells, was consistent with the level of IR binding that was detected. In other words, a protein band was uniquely detected in the transfected cells, and it was of a density consistent with the amount of radioligand binding. Hence, the present results are in keeping with those expected for an imidazoline receptor. In summary, these data provide direct evidence that the EST04033 clone encodes an imidazoline binding site having high affinities for moxonidine and p-iodoclonidine, which is expected for an IR protein.
- Stable transfections can be obtained by subcloning the imidazoline receptor cDNA into a suitable expression vector, e.g., pRc/CMV (Invitrogen, San Diego, Calif.), which can then be used to transform host cells, e.g. CHO and HEK-293 cells, using the Lipofectin reagent (Gibco/BRL, Gaithersburg, Md.) according to the manufacturer's instructions. These two host cell lines can be used to increase the permanence of expression of an instant clone. The inventors have previously ascertained that parent CHO cells lack both alpha2-adrenoceptor and IR binding sites [Piletz et al., J. Pharm. & Exper. Ther., 272: 581-587 (1995)], making them useful for these studies. Twenty-four hours after transfection, cells are split into culture dishes and grown in the presence of 600 μg/ml G418-supplemented complete medium (Gibco/BRL). The medium is changed every 3 days and clones surviving in G418 are isolated and expanded for further investigation.
- Direct probing of other human genomic and cDNA libraries can be performed by preparing labelled cDNA probes from different subcloned regions of our clone. Commercially available human DNA libraries can be used. Besides the cDNA and genomic libraries we have already screened, another genomic library is EMBL (Clontech), which integrates genomic fragments up to 22 kbp long. It is reasonable to expect that introns may exist within other human IR genes so that only by obtaining overlapping clones can the full-length genes be sequenced. A probe encompassing the 5′ end of an instant cDNA is generally useful to obtain the gene promoter region. Clontech's Human PromoterFinder DNA Walking procedure provides a method for “walking” upstream or downstream from cloned sequences such as cDNAs into adjacent genomic DNA.
- An instant imidazoline receptive polypeptide can also be used to prepare antibodies immunoreactive therewith. Thus, synthetic peptides (based on deduced amino acid sequences from the DNA) can be generated and used as immunogens. Additionally, transfected cell lines or other manipulations of the DNA sequence of an instant imidazoline receptor can provide a source of purified imidazoline receptor peptides in sufficient quantities for immunization, which can lead to a source of selective antibodies having potential commercial value.
- In addition, various kits for assaying imidazoline receptors can be developed that include either such antibodies or the purified imidazoline receptor protein. A purification protocol has already been published for the bovine imidazoline receptor in BAC cells [Wang et al, 1992] and an immunization protocol has also been published [Wang et al., 1993]. These same protocols can be utilized with little if any modification to afford purified human IR protein from transfected cells and to yield selective antibodies thereto.
- In order to obtain antibodies to a subject peptide, the peptide may be linked to a suitable soluble carrier to which antibodies are unlikely to be encountered in human serum. Illustrative carriers include bovine serum albumin, keyhole limpet hemocyanin, and the like. The conjugated peptide is injected into a mouse, or other suitable animal, where an immune response is elicited. Monoclonal antibodies can be obtained from hybridomas formed by fusing spleen cells harvested from the animal and myeloma cells [see, e.g., Kohler and Milstein,Nature, 256: 495-497 (1975)].
- Once an antibody is prepared (either polyclonal or monoclonal), procedures are well established in the literature, using other proteins, to develop either RIA or ELISA assays [see, e.g., “Radioimmunoassay of Gut Regulatory Peptides; Methods in Laboratory Medicine,” Vol. 2,
chapters - Currently available methods to assay imidazoline receptors are unsuitable for routine clinical use, and therefore the development of an assay kit in this manner could have significant market appeal. Suitable assay techniques can employ polyclonal or monoclonal antibodies, as has been previously described [U.S. Pat. No. 4,376,110 (issued to David et al.), the disclosure of which is incorporated herein by reference].
- In summary, we have identified unique DNA sequences that have properties expected of a gene and the cDNA transcript(s) of an imidazoline receptor. Prior to our first cloning the cDNA, only two sequences of EST cDNA were identified within public databases having similar nature. But, these were both partial and imprecise sequences—not identified at all with respect to any encoded protein. Indeed, one of them (HSA09H122) was reported to be contaminated. In our hands, the other EST 04033 clone was correctly sequenced for the first time (in its entirety=3318 bp). Prior to this, even the size of EST 04033 was unknown. The present inventors also demonstrated that an imidazoline receptive site can be expressed in cells transfected with the EST 04033 cDNA clone, and this site has the proper potencies of an IR. We have deduced most of the complete cDNA encoding this protein.
- The present invention has been described with reference to specific examples for purposes of clarity and explanation. Certain obvious modifications of the invention readily apparent to one skilled in the art can be practiced within the scope of the appended claims.
-
1 22 1 3385 DNA Homo sapiens CDS (1398)..(3383) 1 gctctagaac tagtggatcc cccgggctgc aggaattcca gtttaatact aaccctaatg 60 tgtgactgcg gtttacaaag agctctgtat cacctgggat agctttcagt agcaattcac 120 tacaactggt cctaaaaaat aataacaata ataataataa ttagagaatt aaaacccaac 180 agcatgttga atggttaaaa tcacgtaaga actgaaattt ggggtggggg tgtcctcaac 240 agctgagctt gtcctagcag tgaaaatgct cgcctccaag cagggctcag aaaggtctgg 300 agccctccag gcagagggct gagctcaggg ggctcttgga ggacactcac cccatggtcc 360 atgggatgct tctggcttcc ttaaaaacag ttgggcatcc gcattgtata agtaggtgga 420 gaccctagtg tggttctttt gaaggatatg ggaagggagg atgacgaact agagaagtgg 480 gaggggacca aaatcactga ggtcccagaa tatcatagat ttgggtatag gattggggtc 540 actaagaatt gagcaccagg aattccagct tcttcccatt aaagaaactg ggactggttt 600 tgccttggag gcctatgtag tgttttctgc ccctgtccca taccaagtct cattgatatt 660 tctgcagaat atcagatgaa aatctatttc taaagaccat tgggagaatg ggtggtggag 720 aaggagttgg agtggggttg gggggcagtt aaaaatgaat aaaaatctct cagctacaga 780 acccaaacat cacttccctc cgcattcaca gcatttccca gcagtcccca gatggttgtt 840 tccgtgggga cacagcagct gcctcatttc ccttcaggcc ccatgggctg ctggtcaacc 900 tcaggatcta ctaaagatga cgcaaatgcc gactgaacaa tctgaaaccc aaaggactcg 960 aggagagaca tgttctgctg aggagagaaa ggtgagccaa gggcagggcc caggtccccc 1020 agggggcccc cgagagcccg gacatgcacc ttctggatgt gtttgttcaa gtaggactta 1080 gagcggaaga agctcccaca ttcagggcat gggtacttct tctccccatc agactccatt 1140 ttgtttttgg ggactgccat gtcgcaggag aaagagccat tggcactctg cttctctggc 1200 gtcttcaggt cgctggcatc tgagaggtca ccataggagt cagagctctc aatcggatcc 1260 tgatgtgagc atttctggcc ttctcggtta cagatactgc agaagttgct gggcccctcg 1320 ctgtgcttct tcaggtggtc tgccatgtat gctgcccgca agtacttccc acacacctgg 1380 cagggcacct tgtcttc atg aca ggc cag gtg gga gcg cag acg gtc tcg 1430 Met Thr Gly Gln Val Gly Ala Gln Thr Val Ser 1 5 10 ggt ggc aaa aga agc att gca ggt ctg aca ctt gtg agg ccg ctc aga 1478 Gly Gly Lys Arg Ser Ile Ala Gly Leu Thr Leu Val Arg Pro Leu Arg 15 20 25 agt gtg cac ctg ctt gat atg tcc gtt caa gtg atc agg cct gga gaa 1526 Ser Val His Leu Leu Asp Met Ser Val Gln Val Ile Arg Pro Gly Glu 30 35 40 gcc ttt ccc aca gct ctg gca gat gta agg cgg aat tcc cca gag aag 1574 Ala Phe Pro Thr Ala Leu Ala Asp Val Arg Arg Asn Ser Pro Glu Lys 45 50 55 aag ggt ggt gaa gac tcc cgg ctc tca gct gcc ccc tgc atc aga ccc 1622 Lys Gly Gly Glu Asp Ser Arg Leu Ser Ala Ala Pro Cys Ile Arg Pro 60 65 70 75 agc agc tcc cct ccc act gtg gct ccc gca tct gcc tcc ctg ccc cag 1670 Ser Ser Ser Pro Pro Thr Val Ala Pro Ala Ser Ala Ser Leu Pro Gln 80 85 90 ccc atc ctc tct aac caa gga atc atg ttc gtt cag gag gag gcc ctg 1718 Pro Ile Leu Ser Asn Gln Gly Ile Met Phe Val Gln Glu Glu Ala Leu 95 100 105 gcc agc agc ctc tcg tcc act gac agt ctg act ccc gag cac cag ccc 1766 Ala Ser Ser Leu Ser Ser Thr Asp Ser Leu Thr Pro Glu His Gln Pro 110 115 120 att gcc cag gga tgt tct gat tcc ttg gag tcc atc cct gcg gga cag 1814 Ile Ala Gln Gly Cys Ser Asp Ser Leu Glu Ser Ile Pro Ala Gly Gln 125 130 135 gca gct tcc gat gat tta agg gac gtg cca gga gct gtt ggt ggt gca 1862 Ala Ala Ser Asp Asp Leu Arg Asp Val Pro Gly Ala Val Gly Gly Ala 140 145 150 155 agc cca gaa cat gcc gag ccg gag gtc cag gtg gtg ccg ggg tct ggc 1910 Ser Pro Glu His Ala Glu Pro Glu Val Gln Val Val Pro Gly Ser Gly 160 165 170 cag atc atc ttc ctg ccc ttc acc tgc att ggc tac acg gcc acc aat 1958 Gln Ile Ile Phe Leu Pro Phe Thr Cys Ile Gly Tyr Thr Ala Thr Asn 175 180 185 cag gac ttc atc cag cgc ctg agc aca ctg atc cgg cag gcc atc gag 2006 Gln Asp Phe Ile Gln Arg Leu Ser Thr Leu Ile Arg Gln Ala Ile Glu 190 195 200 cgg cag ctg cct gcc tgg atc gag gct gcc aac cag cgg gag gag ggc 2054 Arg Gln Leu Pro Ala Trp Ile Glu Ala Ala Asn Gln Arg Glu Glu Gly 205 210 215 cag ggt gaa cag ggc gag gag gag gat gag gag gag gaa gaa gag gag 2102 Gln Gly Glu Gln Gly Glu Glu Glu Asp Glu Glu Glu Glu Glu Glu Glu 220 225 230 235 gac gtg gct gag aac cgc tac ttt gaa atg ggg ccc cca gac gtg gag 2150 Asp Val Ala Glu Asn Arg Tyr Phe Glu Met Gly Pro Pro Asp Val Glu 240 245 250 gag gag gag gga gga ggc cag ggg gag gaa gag gag gag gaa gag gag 2198 Glu Glu Glu Gly Gly Gly Gln Gly Glu Glu Glu Glu Glu Glu Glu Glu 255 260 265 gat gaa gag gcc gag gag gag cgc ctg gct ctg gaa tgg gcc ctg ggc 2246 Asp Glu Glu Ala Glu Glu Glu Arg Leu Ala Leu Glu Trp Ala Leu Gly 270 275 280 gcg gac gag gac ttc ctg ctg gag cac atc cgc atc ctc aag gtg ctg 2294 Ala Asp Glu Asp Phe Leu Leu Glu His Ile Arg Ile Leu Lys Val Leu 285 290 295 tgg tgc ttc ctg atc cat gtg cag ggc agt atc cgc cag ttc gcc gcc 2342 Trp Cys Phe Leu Ile His Val Gln Gly Ser Ile Arg Gln Phe Ala Ala 300 305 310 315 tgc ctt gtg ctc acc gac ttc ggc atc gca gtc ttc gag atc ccg cac 2390 Cys Leu Val Leu Thr Asp Phe Gly Ile Ala Val Phe Glu Ile Pro His 320 325 330 cag gag tct cgg ggc agc agc cag cac atc ctc tcc tcc ctg cgc ttt 2438 Gln Glu Ser Arg Gly Ser Ser Gln His Ile Leu Ser Ser Leu Arg Phe 335 340 345 gtc ttt tgc ttc ccg cat ggc gac ctc acc gag ttt ggc ttc ctc atg 2486 Val Phe Cys Phe Pro His Gly Asp Leu Thr Glu Phe Gly Phe Leu Met 350 355 360 ccg gag ctg tgt ctg gtg ctc aag gta cgg cac agt gag aac acg ctc 2534 Pro Glu Leu Cys Leu Val Leu Lys Val Arg His Ser Glu Asn Thr Leu 365 370 375 ttc att atc tcg gac gcc gcc aac ctg cac gag ttc cac gcg gac ctg 2582 Phe Ile Ile Ser Asp Ala Ala Asn Leu His Glu Phe His Ala Asp Leu 380 385 390 395 cgc tca tgc ttt gca ccc cag cac atg gcc atg ctg tgt agc ccc atc 2630 Arg Ser Cys Phe Ala Pro Gln His Met Ala Met Leu Cys Ser Pro Ile 400 405 410 ctc tac ggc agc cac acc agc ctg cag gag ttc ctg cgc cag ctg ctc 2678 Leu Tyr Gly Ser His Thr Ser Leu Gln Glu Phe Leu Arg Gln Leu Leu 415 420 425 acc ttc tac aag gtg gct ggc ggc tgc cag gag cgc agc cag ggc tgc 2726 Thr Phe Tyr Lys Val Ala Gly Gly Cys Gln Glu Arg Ser Gln Gly Cys 430 435 440 ttc ccc gtc tac ctg gtc tac agt gac aag cgc atg gtg cag acg gcc 2774 Phe Pro Val Tyr Leu Val Tyr Ser Asp Lys Arg Met Val Gln Thr Ala 445 450 455 gcc ggg gac tac tca ggc aac atc gag tgg gcc agc tgc aca ctc tgt 2822 Ala Gly Asp Tyr Ser Gly Asn Ile Glu Trp Ala Ser Cys Thr Leu Cys 460 465 470 475 tca gcc gtg cgg cgc tcc tgc tgc gcg ccc tct gag gcc gtc aag tcc 2870 Ser Ala Val Arg Arg Ser Cys Cys Ala Pro Ser Glu Ala Val Lys Ser 480 485 490 gcc gcc atc ccc tac tgg ctg ttg ctc acg ccc cag cac ctc aac gtc 2918 Ala Ala Ile Pro Tyr Trp Leu Leu Leu Thr Pro Gln His Leu Asn Val 495 500 505 atc aag gcc gac ttc aac ccc atg ccc aac cgt ggc acc cac aac tgt 2966 Ile Lys Ala Asp Phe Asn Pro Met Pro Asn Arg Gly Thr His Asn Cys 510 515 520 cgc aac cgc aac agc ttc aag ctc agc cgt gtg ccg ctc tcc acc gtg 3014 Arg Asn Arg Asn Ser Phe Lys Leu Ser Arg Val Pro Leu Ser Thr Val 525 530 535 ctg ctg gac ccc aca cgc agc tgt acc cag cct cgg ggc gcc ttt gct 3062 Leu Leu Asp Pro Thr Arg Ser Cys Thr Gln Pro Arg Gly Ala Phe Ala 540 545 550 555 gat ggc cac gtg cta gag ctg ctc gtg ggg tac cgc ttt gtc act gcc 3110 Asp Gly His Val Leu Glu Leu Leu Val Gly Tyr Arg Phe Val Thr Ala 560 565 570 atc ttc gtg ctg ccc cac gag aag ttc cac ttc ctg cgc gtc tac aac 3158 Ile Phe Val Leu Pro His Glu Lys Phe His Phe Leu Arg Val Tyr Asn 575 580 585 cag ctg cgg gcc tcg ctg cag gac ctg aag act gtg gtc atc gcc aag 3206 Gln Leu Arg Ala Ser Leu Gln Asp Leu Lys Thr Val Val Ile Ala Lys 590 595 600 acc ccc ggg acg gga ggc agc ccc cag ggc tcc ttt gcg gat ggc cag 3254 Thr Pro Gly Thr Gly Gly Ser Pro Gln Gly Ser Phe Ala Asp Gly Gln 605 610 615 cct gcc gag cgc agg gcc agc aat gac cag cgt ccc cag gag gtc cca 3302 Pro Ala Glu Arg Arg Ala Ser Asn Asp Gln Arg Pro Gln Glu Val Pro 620 625 630 635 gca gag gct ctg gcc ccg gcc cca gtg gaa gtc cca gct cca gcc ccg 3350 Ala Glu Ala Leu Ala Pro Ala Pro Val Glu Val Pro Ala Pro Ala Pro 640 645 650 gaa ttc gat atc aag ctt atc gat acc gtc gac ct 3385 Glu Phe Asp Ile Lys Leu Ile Asp Thr Val Asp 655 660 2 1954 DNA Homo sapiens 2 atgacaggcc aggtgggagc gcagacggtc tcgggtggca aaagaagcat tgcaggtctg 60 acacttgtga ggccgctcag aagtgtgcac ctgcttgata tgtccgttca agtgatcagg 120 cctggagaag cctttcccac agctctggca gatgtaaggc ggaattcccc agagaagaag 180 ggtggtgaag actcccggct ctcagctgcc ccctgcatca gacccagcag ctcccctccc 240 actgtggctc ccgcatctgc ctccctgccc cagcccatcc tctctaacca aggaatcatg 300 ttcgttcagg aggaggccct ggccagcagc ctctcgtcca ctgacagtct gactcccgag 360 caccagccca ttgcccaggg atgttctgat tccttggagt ccatccctgc gggacaggca 420 gcttccgatg atttaaggga cgtgccagga gctgttggtg gtgcaagccc agaacatgcc 480 gagccggagg tccaggtggt gccggggtct ggccagatca tcttcctgcc cttcacctgc 540 attggctaca cggccaccaa tcaggacttc atccagcgcc tgagcacact gatccggcag 600 gccatcgagc ggcagctgcc tgcctggatc gaggctgcca accagcggga ggagggccag 660 ggtgaacagg gcgaggagga ggatgaggag gaggaagaag aggaggacgt ggctgagaac 720 cgctactttg aaatggggcc cccagacgtg gaggaggagg agggaggagg ccagggggag 780 gaagaggagg aggaagagga ggatgaagag gccgaggagg agcgcctggc tctggaatgg 840 gccctgggcg cggacgagga cttcctgctg gagcacatcc gcatcctcaa ggtgctgtgg 900 tgcttcctga tccatgtgca gggcagtatc cgccagttcg ccgcctgcct tgtgctcacc 960 gacttcggca tcgcagtctt cgagatcccg caccaggagt ctcggggcag cagccagcac 1020 atcctctcct ccctgcgctt tgtcttttgc ttcccgcatg gcgacctcac cgagtttggc 1080 ttcctcatgc cggagctgtg tctggtgctc aaggtacggc acagtgagaa cacgctcttc 1140 attatctcgg acgccgccaa cctgcacgag ttccacgcgg acctgcgctc atgctttgca 1200 ccccagcaca tggccatgct gtgtagcccc atcctctacg gcagccacac cagcctgcag 1260 gagttcctgc gccagctgct caccttctac aaggtggctg gcggctgcca ggagcgcagc 1320 cagggctgct tccccgtcta cctggtctac agtgacaagc gcatggtgca gacggccgcc 1380 ggggactact caggcaacat cgagtgggcc agctgcacac tctgttcagc cgtgcggcgc 1440 tcctgctgcg cgccctctga ggccgtcaag tccgccgcca tcccctactg gctgttgctc 1500 acgccccagc acctcaacgt catcaaggcc gacttcaacc ccatgcccaa ccgtggcacc 1560 cacaactgtc gcaaccgcaa cagcttcaag ctcagccgtg tgccgctctc caccgtgctg 1620 ctggacccca cacgcagctg tacccagcct cggggcgcct ttgctgatgg ccacgtgcta 1680 gagctgctcg tggggtaccg ctttgtcact gccatcttcg tgctgcccca cgagaagttc 1740 cacttcctgc gcgtctacaa ccagctgcgg gcctcgctgc aggacctgaa gactgtggtc 1800 atcgccaaga cccccgggac gggaggcagc ccccagggct cctttgcgga tggccagcct 1860 gccgagcgca gggccagcaa tgaccagcgt ccccaggagg tcccagcaga ggctctggcc 1920 ccggccccag tggaagtccc agctccagcc ccgg 1954 3 3318 DNA Homo sapiens 3 aattccagtt taatactaac cctaatgtgt gactgcggtt tacaaagagc tctgtatcac 60 ctgggatagc tttcagtagc aattcactac aactggtcct aaaaaataat aacaataata 120 ataataatta gagaattaaa acccaacagc atgttgaatg gttaaaatca cgtaagaact 180 gaaatttggg gtgggggtgt cctcaacagc tgagcttgtc ctagcagtga aaatgctcgc 240 ctccaagcag ggctcagaaa ggtctggagc cctccaggca gagggctgag ctcagggggc 300 tcttggagga cactcacccc atggtccatg ggatgcttct ggcttcctta aaaacagttg 360 ggcatccgca ttgtataagt aggtggagac cctagtgtgg ttcttttgaa ggatatggga 420 agggaggatg acgaactaga gaagtgggag gggaccaaaa tcactgaggt cccagaatat 480 catagatttg ggtataggat tggggtcact aagaattgag caccaggaat tccagcttct 540 tcccattaaa gaaactggga ctggttttgc cttggaggcc tatgtagtgt tttctgcccc 600 tgtcccatac caagtctcat tgatatttct gcagaatatc agatgaaaat ctatttctaa 660 agaccattgg gagaatgggt ggtggagaag gagttggagt ggggttgggg ggcagttaaa 720 aatgaataaa aatctctcag ctacagaacc caaacatcac ttccctccgc attcacagca 780 tttcccagca gtccccagat ggttgtttcc gtggggacac agcagctgcc tcatttccct 840 tcaggcccca tgggctgctg gtcaacctca ggatctacta aagatgacgc aaatgccgac 900 tgaacaatct gaaacccaaa ggactcgagg agagacatgt tctgctgagg agagaaaggt 960 gagccaaggg cagggcccag gtcccccagg gggcccccga gagcccggac atgcaccttc 1020 tggatgtgtt tgttcaagta ggacttagag cggaagaagc tcccacattc agggcatggg 1080 tacttcttct ccccatcaga ctccattttg tttttgggga ctgccatgtc gcaggagaaa 1140 gagccattgg cactctgctt ctctggcgtc ttcaggtcgc tggcatctga gaggtcacca 1200 taggagtcag agctctcaat cggatcctga tgtgagcatt tctggccttc tcggttacag 1260 atactgcaga agttgctggg cccctcgctg tgcttcttca ggtggtctgc catgtatgct 1320 gcccgcaagt acttcccaca cacctggcag ggcaccttgt cttcatgaca ggccaggtgg 1380 gagcgcagac ggtctcgggt ggcaaaagaa gcattgcagg tctgacactt gtgaggccgc 1440 tcagaagtgt gcacctgctt gatatgtccg ttcaagtgat caggcctgga gaagcctttc 1500 ccacagctct ggcagatgta aggcggaatt ccccagagaa gaagggtggt gaagactccc 1560 ggctctcagc tgccccctgc atcagaccca gcagctcccc tcccactgtg gctcccgcat 1620 ctgcctccct gccccagccc atcctctcta accaaggaat catgttcgtt caggaggagg 1680 ccctggccag cagcctctcg tccactgaca gtctgactcc cgagcaccag cccattgccc 1740 agggatgttc tgattccttg gagtccatcc ctgcgggaca ggcagcttcc gatgatttaa 1800 gggacgtgcc aggagctgtt ggtggtgcaa gcccagaaca tgccgagccg gaggtccagg 1860 tggtgccggg gtctggccag atcatcttcc tgcccttcac ctgcattggc tacacggcca 1920 ccaatcagga cttcatccag cgcctgagca cactgatccg gcaggccatc gagcggcagc 1980 tgcctgcctg gatcgaggct gccaaccagc gggaggaggg ccagggtgaa cagggcgagg 2040 aggaggatga ggaggaggaa gaagaggagg acgtggctga gaaccgctac tttgaaatgg 2100 ggcccccaga cgtggaggag gaggagggag gaggccaggg ggaggaagag gaggaggaag 2160 aggaggatga agaggccgag gaggagcgcc tggctctgga atgggccctg ggcgcggacg 2220 aggacttcct gctggagcac atccgcatcc tcaaggtgct gtggtgcttc ctgatccatg 2280 tgcagggcag tatccgccag ttcgccgcct gccttgtgct caccgacttc ggcatcgcag 2340 tcttcgagat cccgcaccag gagtctcggg gcagcagcca gcacatcctc tcctccctgc 2400 gctttgtctt ttgcttcccg catggcgacc tcaccgagtt tggcttcctc atgccggagc 2460 tgtgtctggt gctcaaggta cggcacagtg agaacacgct cttcattatc tcggacgccg 2520 ccaacctgca cgagttccac gcggacctgc gctcatgctt tgcaccccag cacatggcca 2580 tgctgtgtag ccccatcctc tacggcagcc acaccagcct gcaggagttc ctgcgccagc 2640 tgctcacctt ctacaaggtg gctggcggct gccaggagcg cagccagggc tgcttccccg 2700 tctacctggt ctacagtgac aagcgcatgg tgcagacggc cgccggggac tactcaggca 2760 acatcgagtg ggccagctgc acactctgtt cagccgtgcg gcgctcctgc tgcgcgccct 2820 ctgaggccgt caagtccgcc gccatcccct actggctgtt gctcacgccc cagcacctca 2880 acgtcatcaa ggccgacttc aaccccatgc ccaaccgtgg cacccacaac tgtcgcaacc 2940 gcaacagctt caagctcagc cgtgtgccgc tctccaccgt gctgctggac cccacacgca 3000 gctgtaccca gcctcggggc gcctttgctg atggccacgt gctagagctg ctcgtggggt 3060 accgctttgt cactgccatc ttcgtgctgc cccacgagaa gttccacttc ctgcgcgtct 3120 acaaccagct gcgggcctcg ctgcaggacc tgaagactgt ggtcatcgcc aagacccccg 3180 ggacgggagg cagcccccag ggctcctttg cggatggcca gcctgccgag cgcagggcca 3240 gcaatgacca gcgtccccag gaggtcccag cagaggctct ggccccggcc ccagtggaag 3300 tcccagctcc agccccgg 3318 4 1171 DNA Homo sapiens 4 gaggaggagg aagaggagga tgaagaggcc gaggaggagc gcctggctct ggaatgggcc 60 ctgggcgcgg acgaggactt cctgctggag cacatccgca tcctcaaggt gctgtggtgc 120 ttcctgatcc atgtgcaggg cagtatccgc cagttcgccg cctgccttgt gctcaccgac 180 ttcggcatcg cagtcttcga gatcccgcac caggagtctc ggggcagcag ccagcacatc 240 ctctcctccc tgcgctttgt cttttgcttc ccgcatggcg acctcaccga gtttggcttc 300 ctcatgccgg agctgtgtct ggtgctcaag gtacggcaca gtgagaacac gctcttcatt 360 atctcggacg ccgccaacct gcacgagttc cacgcggacc tgcgctcatg ctttgcaccc 420 cagcacatgg ccatgctgtg tagccccatc ctctacggca gccacaccag cctgcaggag 480 ttcctgcgcc agctgctcac cttctacaag gtggctggcg gctgccagga gcgcagccag 540 ggctgcttcc ccgtctacct ggtctacagt gacaagcgca tggtgcagac ggccgccggg 600 gactactcag gcaacatcga gtgggccagc tgcacactct gttcagccgt gcggcgctcc 660 tgctgcgcgc cctctgaggc cgtcaagtcc gccgccatcc cctactggct gttgctcacg 720 ccccagcacc tcaacgtcat caaggccgac ttcaacccca tgcccaaccg tggcacccac 780 aactgtcgca accgcaacag cttcaagctc agccgtgtgc cgctctccac cgtgctgctg 840 gaccccacac gcagctgtac ccagcctcgg ggcgcctttg ctgatggcca cgtgctagag 900 ctgctcgtgg ggtaccgctt tgtcactgcc atcttcgtgc tgccccacga gaagttccac 960 ttcctgcgcg tctacaacca gctgcgggcc tcgctgcagg acctgaagac tgtggtcatc 1020 gccaagaccc ccgggacggg aggcagcccc cagggctcct ttgcggatgg ccagcctgcc 1080 gagcgcaggg ccagcaatga ccagcgtccc caggaggtcc cagcagaggc tctggccccg 1140 gccccagtgg aagtcccagc tccagccccg g 1171 5 651 PRT Homo sapiens 5 Met Thr Gly Gln Val Gly Ala Gln Thr Val Ser Gly Gly Lys Arg Ser 1 5 10 15 Ile Ala Gly Leu Thr Leu Val Arg Pro Leu Arg Ser Val His Leu Leu 20 25 30 Asp Met Ser Val Gln Val Ile Arg Pro Gly Glu Ala Phe Pro Thr Ala 35 40 45 Leu Ala Asp Val Arg Arg Asn Ser Pro Glu Lys Lys Gly Gly Glu Asp 50 55 60 Ser Arg Leu Ser Ala Ala Pro Cys Ile Arg Pro Ser Ser Ser Pro Pro 65 70 75 80 Thr Val Ala Pro Ala Ser Ala Ser Leu Pro Gln Pro Ile Leu Ser Asn 85 90 95 Gln Gly Ile Met Phe Val Gln Glu Glu Ala Leu Ala Ser Ser Leu Ser 100 105 110 Ser Thr Asp Ser Leu Thr Pro Glu His Gln Pro Ile Ala Gln Gly Cys 115 120 125 Ser Asp Ser Leu Glu Ser Ile Pro Ala Gly Gln Ala Ala Ser Asp Asp 130 135 140 Leu Arg Asp Val Pro Gly Ala Val Gly Gly Ala Ser Pro Glu His Ala 145 150 155 160 Glu Pro Glu Val Gln Val Val Pro Gly Ser Gly Gln Ile Ile Phe Leu 165 170 175 Pro Phe Thr Cys Ile Gly Tyr Thr Ala Thr Asn Gln Asp Phe Ile Gln 180 185 190 Arg Leu Ser Thr Leu Ile Arg Gln Ala Ile Glu Arg Gln Leu Pro Ala 195 200 205 Trp Ile Glu Ala Ala Asn Gln Arg Glu Glu Gly Gln Gly Glu Gln Gly 210 215 220 Glu Glu Glu Asp Glu Glu Glu Glu Glu Glu Glu Asp Val Ala Glu Asn 225 230 235 240 Arg Tyr Phe Glu Met Gly Pro Pro Asp Val Glu Glu Glu Glu Gly Gly 245 250 255 Gly Gln Gly Glu Glu Glu Glu Glu Glu Glu Glu Asp Glu Glu Ala Glu 260 265 270 Glu Glu Arg Leu Ala Leu Glu Trp Ala Leu Gly Ala Asp Glu Asp Phe 275 280 285 Leu Leu Glu His Ile Arg Ile Leu Lys Val Leu Trp Cys Phe Leu Ile 290 295 300 His Val Gln Gly Ser Ile Arg Gln Phe Ala Ala Cys Leu Val Leu Thr 305 310 315 320 Asp Phe Gly Ile Ala Val Phe Glu Ile Pro His Gln Glu Ser Arg Gly 325 330 335 Ser Ser Gln His Ile Leu Ser Ser Leu Arg Phe Val Phe Cys Phe Pro 340 345 350 His Gly Asp Leu Thr Glu Phe Gly Phe Leu Met Pro Glu Leu Cys Leu 355 360 365 Val Leu Lys Val Arg His Ser Glu Asn Thr Leu Phe Ile Ile Ser Asp 370 375 380 Ala Ala Asn Leu His Glu Phe His Ala Asp Leu Arg Ser Cys Phe Ala 385 390 395 400 Pro Gln His Met Ala Met Leu Cys Ser Pro Ile Leu Tyr Gly Ser His 405 410 415 Thr Ser Leu Gln Glu Phe Leu Arg Gln Leu Leu Thr Phe Tyr Lys Val 420 425 430 Ala Gly Gly Cys Gln Glu Arg Ser Gln Gly Cys Phe Pro Val Tyr Leu 435 440 445 Val Tyr Ser Asp Lys Arg Met Val Gln Thr Ala Ala Gly Asp Tyr Ser 450 455 460 Gly Asn Ile Glu Trp Ala Ser Cys Thr Leu Cys Ser Ala Val Arg Arg 465 470 475 480 Ser Cys Cys Ala Pro Ser Glu Ala Val Lys Ser Ala Ala Ile Pro Tyr 485 490 495 Trp Leu Leu Leu Thr Pro Gln His Leu Asn Val Ile Lys Ala Asp Phe 500 505 510 Asn Pro Met Pro Asn Arg Gly Thr His Asn Cys Arg Asn Arg Asn Ser 515 520 525 Phe Lys Leu Ser Arg Val Pro Leu Ser Thr Val Leu Leu Asp Pro Thr 530 535 540 Arg Ser Cys Thr Gln Pro Arg Gly Ala Phe Ala Asp Gly His Val Leu 545 550 555 560 Glu Leu Leu Val Gly Tyr Arg Phe Val Thr Ala Ile Phe Val Leu Pro 565 570 575 His Glu Lys Phe His Phe Leu Arg Val Tyr Asn Gln Leu Arg Ala Ser 580 585 590 Leu Gln Asp Leu Lys Thr Val Val Ile Ala Lys Thr Pro Gly Thr Gly 595 600 605 Gly Ser Pro Gln Gly Ser Phe Ala Asp Gly Gln Pro Ala Glu Arg Arg 610 615 620 Ala Ser Asn Asp Gln Arg Pro Gln Glu Val Pro Ala Glu Ala Leu Ala 625 630 635 640 Pro Ala Pro Val Glu Val Pro Ala Pro Ala Pro 645 650 6 390 PRT Homo sapiens 6 Glu Glu Glu Glu Glu Glu Asp Glu Glu Ala Glu Glu Glu Arg Leu Ala 1 5 10 15 Leu Glu Trp Ala Leu Gly Ala Asp Glu Asp Phe Leu Leu Glu His Ile 20 25 30 Arg Ile Leu Lys Val Leu Trp Cys Phe Leu Ile His Val Gln Gly Ser 35 40 45 Ile Arg Gln Phe Ala Ala Cys Leu Val Leu Thr Asp Phe Gly Ile Ala 50 55 60 Val Phe Glu Ile Pro His Gln Glu Ser Arg Gly Ser Ser Gln His Ile 65 70 75 80 Leu Ser Ser Leu Arg Phe Val Phe Cys Phe Pro His Gly Asp Leu Thr 85 90 95 Glu Phe Gly Phe Leu Met Pro Glu Leu Cys Leu Val Leu Lys Val Arg 100 105 110 His Ser Glu Asn Thr Leu Phe Ile Ile Ser Asp Ala Ala Asn Leu His 115 120 125 Glu Phe His Ala Asp Leu Arg Ser Cys Phe Ala Pro Gln His Met Ala 130 135 140 Met Leu Cys Ser Pro Ile Leu Tyr Gly Ser His Thr Ser Leu Gln Glu 145 150 155 160 Phe Leu Arg Gln Leu Leu Thr Phe Tyr Lys Val Ala Gly Gly Cys Gln 165 170 175 Glu Arg Ser Gln Gly Cys Phe Pro Val Tyr Leu Val Tyr Ser Asp Lys 180 185 190 Arg Met Val Gln Thr Ala Ala Gly Asp Tyr Ser Gly Asn Ile Glu Trp 195 200 205 Ala Ser Cys Thr Leu Cys Ser Ala Val Arg Arg Ser Cys Cys Ala Pro 210 215 220 Ser Glu Ala Val Lys Ser Ala Ala Ile Pro Tyr Trp Leu Leu Leu Thr 225 230 235 240 Pro Gln His Leu Asn Val Ile Lys Ala Asp Phe Asn Pro Met Pro Asn 245 250 255 Arg Gly Thr His Asn Cys Arg Asn Arg Asn Ser Phe Lys Leu Ser Arg 260 265 270 Val Pro Leu Ser Thr Val Leu Leu Asp Pro Thr Arg Ser Cys Thr Gln 275 280 285 Pro Arg Gly Ala Phe Ala Asp Gly His Val Leu Glu Leu Leu Val Gly 290 295 300 Tyr Arg Phe Val Thr Ala Ile Phe Val Leu Pro His Glu Lys Phe His 305 310 315 320 Phe Leu Arg Val Tyr Asn Gln Leu Arg Ala Ser Leu Gln Asp Leu Lys 325 330 335 Thr Val Val Ile Ala Lys Thr Pro Gly Thr Gly Gly Ser Pro Gln Gly 340 345 350 Ser Phe Ala Asp Gly Gln Pro Ala Glu Arg Arg Ala Ser Asn Asp Gln 355 360 365 Arg Pro Gln Glu Val Pro Ala Glu Ala Leu Ala Pro Ala Pro Val Glu 370 375 380 Val Pro Ala Pro Ala Pro 385 390 7 20 DNA Homo sapiens 7 cttgaggatg cggatgtgct 20 8 18 DNA Homo sapiens 8 ccatggggtg agtgtcct 18 9 18 DNA Homo sapiens 9 aggacactca ccccatgg 18 10 20 DNA Homo sapiens 10 gtatgggaca ggggcagaaa 20 11 20 DNA Homo sapiens 11 tttctaaaga ccattgggag 20 12 20 DNA Homo sapiens 12 ccattttaaa gtagcggttc 20 13 20 DNA Homo sapiens 13 aggagagaaa ggtgagccaa 20 14 20 DNA Homo sapiens 14 gtagatcctg aggttgacca 20 15 20 DNA Homo sapiens 15 tgtgagcatt tctggccttc 20 16 20 DNA Homo sapiens 16 tgaagacgcc agagaagcag 20 17 20 DNA Homo sapiens 17 gcctcacaag tgtcagacct 20 18 18 DNA Homo sapiens 18 agaagggtgg tgaagact 18 19 20 DNA Homo sapiens 19 cttggttaga gaggatgggc 20 20 20 DNA Homo sapiens 20 gcccatcctc tctaaccaag 20 21 15202 DNA Homo sapiens 21 gatccgagct caattaaccc tcactaaagg gagtcgactc gatccttaaa atattcatat 60 ctcctggaca acctgtggcc atagtgcctg actgtaaacc caaagggttt gcctttgcca 120 gtgtagccca gcctggtgtc tgctgcccct cgcggtgtct gtgcacctgc cacgatgctg 180 accagacacc cttaaccagg ttcacccatc gcctgggcct ggagcagtcc ccctgatgct 240 ctgattggtc cttggacctt ctgttctccc aaaatcccag gtcagaaaat acctggaagt 300 ctatttgtgt cccacctccc tctttgtggc cgcaagtgcc ccttcctcca cacagtcaca 360 agaccatgag atgccatctc ctcccctcct gggctgcaga ctttgggaag ctcccaggcc 420 acagaggtgt cagctcctgt ccaggccctt gggaccttcc ctcattcaac caccctaccc 480 aaccccccac tgcctgccag ccaccactcc ctcccacatt tgcaggcggg ggccctgccc 540 tctcctgccg ctggttcccc tacccaggag gctctcccat cgctcttttg agagtctgcc 600 tcccacctct aactgggggc ttagttcaag ttgccccctt accctagtcc cagctgccca 660 agagcttgct gcctcctgtt cttggtgagg gactccagag acagatgtga gacctccctg 720 gacccctcca aggcattccc aggtcacttc catgagtagt gaagaaccgc ctctgagcag 780 gctgagcctc cctcagccta tggtgtcctc acgtggcttg gcccacagca ggtgctcacg 840 cctcctcctc agcagagcct accatcctcc tgccatgctc accagtcccc atgctgatag 900 ccatcaccag tccccatgct gatagccatc accagtcccc atgctgatag ccactttctg 960 gatgctctag gtctgtctgg atgacacagt gaccacagag aaggagctgg acactgtgga 1020 agtgctgaaa gcaattcaga aagccaagga ggtcaagtcc aaactgagca acccagagaa 1080 gaaggtgggt ttgtgtggca ggtgggaggg cagtggtgca gagccagccg ggataggagc 1140 cagttcgggg ggcttgggcc atgggactgc tcagggctgc cgagtcccag ctgcgcccct 1200 ccctggctgc atgacctcgg gcaagtcgcg gcctctctgt tctctgtggg gtggggacag 1260 tggtagttcc tgctctaagg atatgatgag accatcttta ccacccagtt ggtgggaacc 1320 gttgcgctcc ctcctcacac ccctggcctt ggggagctct gtgcttcctc ttctctcccg 1380 ggctgactca agcactcgtc ctcagggtgg tgaagactcc cggctctcag ctgccccctg 1440 catcagaccc agcagctccc ctcccactgt ggctcccgca tctgcctccc tgccccagcc 1500 catcctctct aaccaaggta atcgtgtatg tatcttgctt ctagtggagc cacacagccc 1560 tgcctgggcc ccctggctgg gctggggttg ggggagaggt gccagcacct gcttccaaca 1620 gggtcagaca cagggagggc agtgccttct gcaggctggt cctcgcgggg ggacacatgg 1680 caggggtgcc tggcctgatg ccagctgttg cttgcttggt gaggactccc aattgctctg 1740 atgcccacat ccagctcctc taggagaccg cagggtgtct gacaggccct gaggctgccc 1800 tctgaacagg ctcggggctg ttggctcatg ggacccattc cctcaccggc agcacaagca 1860 ggttggctcc tggttacagg aagccgggct tgtgacttta ctgtctggag cccgaatccc 1920 tgtgcaggga aaagcttgct tttatcactg cctcatctct gtggggtgac ccagccccag 1980 aacaccatgt ttgtggggcc aagatgggcc atctctgtcc ctgtggaccc atggaagacc 2040 aggcccattc gtctgcccac tatcttagcg ttttcaaagg gctttcacct ctgaacccag 2100 gcatcctcgg agatgagtga gtgaagcagg tctcatgagc gtgtctgctg gcccggcccc 2160 cacggaagag gggagggtgt gccgtcccga gtggagccga ggctcgggac acgcaggaaa 2220 ggacgccgcc tgcccgggct cctggagacg cagaacttgg tgtgaggtct tgggaaaaca 2280 gttcaacccg atgttttaag agccagaaaa acattcccac cccttgacct ggtaacccca 2340 ctggtgggga ttttctctta gagggataag ataccgggaa ggggaggtga aatgctcacc 2400 actgccaaaa cacgggctgc aactgcaaca tcggaggatg agagggagag tcggctgtgg 2460 tgcagaatgc tcagcagccc tcccagcagg gacaggaaga ctgggcagga agaggggaga 2520 agcattcaag ttaaggcaaa aggcccaacg cagagcagca cactgaggtc acacctgtga 2580 gatgtggaag agaattcctg agcgtggagc gatggggtta ggtgccagga tgattgccca 2640 ttttgcttct gtcagactct tgactaagga tttctggttg cattttatta cataaaagcc 2700 agggaggtta tatcacggtg agaaagcttc cctgacgccg cctcctgtag cgcagccaag 2760 cgagcctgtg gaggtaccat atgactgtag gcctctgggg acagggagct gcatctgctt 2820 ctcaaggcca gggacacagc catttctgcc agcatctgtt gatcagtgag tgagtgagtg 2880 ggcaggtaga gcaggagcca gtgaagagca ggccctggat gggtggggat gcaccatgtc 2940 cccaggctgc agctgcaggc agccccccac attgtcggag aagcctctgc accagctcag 3000 ccccctcctc actccccttg tgccctgggg acactctgca gaggggcact ctgcagtctg 3060 tccccgccat cgctggactt ctggacatgg cctccagatt tgcacctctt aaataaatct 3120 gcagtggatg tctttgtgtg cacctctctt tccttttggt gagaaacagc aaagatcgga 3180 cccctaagga ctctcctgat gtctccgctc tatccgctga gtgccctttc tgaccacttg 3240 tttgtacagg ccacggtcca ggacgggagc agatagactg tccctgtccc tgtccacatt 3300 tccttggtcc aaacagggct tgtgggaggt agtggcaaaa ggtgttggtc tttttctcac 3360 tgatttggag gcctccccgt gtgttttttc agccgcgtgt tcctgggtct tgcctggatg 3420 gacagggttt tttagcgcgt gggagcagct ttgctgacca tgcctgttgc ttccagcctg 3480 attcccgaga agggagcgtg cttgcgaagg aactggcact cgggcctgcc tgaagggggc 3540 gctgtccaga cacacccagc ctcccgtcgt ggcaggcgct gtcggagcca tggatgattg 3600 tgaccaatag gggtggtcgc cagagttgat tgtccagcca ggcccagggg ctgagaggag 3660 gctgtgtgga gaggtggtta ggagccaggg ctcggtcagc tgagttcgca tgccagcttc 3720 ctagctgtgg gacctcaagc aacttgtagc ccctctgaag ctgttttctc aactgtgaag 3780 tggacgcacc ctacttcatt gattctaaga ggcacgcatt tccaccttgt gacttctctg 3840 aaactgaggt gcgtctttca gtcagtggcg tctcatagtc gctgtcagcc agctggtatt 3900 cgagatggag tcgtggaaaa cccgtggaca ccttccgcta ggaccaagat ggcgccacct 3960 gccgcatctt agatttgatg aaatgtggta aataacgaga ggcatgcatg agcgaatgct 4020 ggggaggcgc ttggcactac ccagagctcc acagaggtgg tcgatgaggg ctgccctttc 4080 ccacatcctt agtagggggt tcaacatgac ccagactgtg cccctgggga gcttggagcc 4140 atgcgggagg atgagccatg tgctggagga gaacagggta ggatggtgtg gggcttttgt 4200 agactgtcta gagcagagaa ggtctgcagt ggaggtggtg tctgaggtga atctcgaagg 4260 tgaataggag ttgaacgtta gcaggcagag ggtggattgc aggagagcag cggcctgggc 4320 aggtgcccag cgtggcccat cagggtgctt catgcatggc tgtgtgcttg ccatccttcc 4380 tgcctgccta ccccctgctg cttcgcttca tgggggcgtt tgagcttggg cccacctgcc 4440 tgcctcgctt gtgggcagag gacccaggct gtgtgagttg tcctgtcccg gggagcagct 4500 gagcttgtcc gggttcctcg acctgtgggg cttcagagga cttcgggtca tttcaatggg 4560 ctgtggcgat gctggctgtg gaggtagcct agggctcctg tagccttcag tgagactggc 4620 ggcccgatgc ccagtgttca ccctgctggc ggcagtcagg aacatgttca caaagcttta 4680 cttcaagtgg tctagaggtg atctgaggtg gagtaacagg tccagatagg ctacgttcat 4740 aaaacagctt cagcggggtt taggaacact gtgcatttac gggacgcagt gggtcagagt 4800 gctgctgtcc gtgggaggtg gccccagggc aggtcagtgg gcacgtcctg tggtaagtgg 4860 gactgtggat gtgggctcag gctggactca gcagccctgc tggataccaa ggcctgcaag 4920 ggctggcccc ctggtgaatt gtcccgtgcc ctgtgtatct atgagtcctg cagagatgac 4980 aaatcagggg acggggtcat gtctagtcac cgtctgggaa aatgctccag gagtgaacac 5040 atttcaggct cttgatggat gtacctccaa actcttctct ggatgggtgg gccagcttgc 5100 atgcctgtgc cggcctctgc ccagcgaggt cagggccagg ccacacagtc agtctgactt 5160 tggcagaagt tgagaggcaa cacttgtctc ttgtttcagc ttgcctttct ttgtgtactt 5220 ctgagagcga gcattctttt catgttctat ccgctggccg ttcttctgcg gaatgtctgt 5280 tcacgtcctt tgcagtctgt taatgaggtt tccaaccttc cctcattttt gtaatctgta 5340 agaacttttt ccagactagc gatataaatc cttgtcaaat attgcaaaca cttttctcat 5400 ttcatctggt tttaatctat cctggttttt aaaaaatgtg tctgtggaag tttaattttt 5460 atgtagtcac atctcagttt ttttccattg catttattct cagaatgctt ctccctgccc 5520 tgagattaga taagcagtca tttgttcttt cttgagttat tttgagattt cagttttaac 5580 attttcttct ataatccatg tggctgggtt ttgggatctg gctaaccccc gccatgccag 5640 tagcctgagg ggcccagccc cacttgttga acagccgctc tccccgcccc acccaccctg 5700 cctgcctgcc cacccgccct ggtctctcca ggaatcatgt tcgttcagga ggaggccctg 5760 gccagcagcc tctcgtccac tgacagtctg actcccgagc accagcccat tgcccaggga 5820 tgttctgatt ccttggagtc catccctgcg ggacaggtaa tgccctcttc ccgcttctgg 5880 ggaccataca tctgtgggtg gactcttctg cttggggttg tgtgcagtag gaagtggcct 5940 agctggagct gaggcagatg cttccagggt ttggcgtcct ctgctttgcg ccacggtctt 6000 tctcttggac ctgtctctgg ttgagtgtct tcctgacaaa cacagtggtt aagggtttat 6060 tttcagcctc cctccttccc ttccccaccc accttggttg atgggaacag gcagttctct 6120 gtcactgggc ccagggcacg aggggggcag gtggagaggg tggcccttga ccctgtgagc 6180 aggcttccct ggggaaggca tttcaaaaga ccctcgtgca ggggcttgtt tgggtttctt 6240 ctctgtttcc tggcacccct ggagccactc ggcgcctttc cgcatgtcac cctggtggtc 6300 tgggaaacag tctcactctg gcgcctcctc tgtggttgtt actgagagtt ctggggcccc 6360 ttcctttgtc ctgaggaaag acaggaggaa agcaagggtg cttgctgtgt gcttcgcaaa 6420 tgtgcttggt gcctgggcct ccctccagcc ccatctctgc agcagcacaa ggttatggcc 6480 ttgtgacact gggacagttt gcagagtcct tgtctgtcct cagtactcca cagtattctg 6540 ccatcaccct ttccagggtc acacagcaag agattcccaa gccctaggta ttccccagtg 6600 cacagagacc attgggaggg acttgccagg gctgtgtcca ctgctggcca gttagggtcg 6660 gaccaaattt gtagactgtc tacctggacc cttgcgtggc acaaggagca gtcagatgct 6720 ggatccctgg agagtggcga gaggctctgg ccttaggttg cgagtgggaa tcccagccct 6780 gctgtgtgct ggtgggataa ccaagtgggt ctctgccctt gggtcccaga gtgggcccca 6840 gggtcccaga gtgggctcca gggtacagcg tggggatggg gagcctcctc agggcggtga 6900 tggagggcag aatgcccagc tcagggtctg gcaaccagta aatggctggg gctggctgca 6960 gtaggtgggg actgactgtg tttctttctc catcaggcag cttccgatga tttaagggac 7020 gtgccaggag ctgttggtgg tgcaaggtaa ggaagaggtt ggaaagggac ctgggcctgg 7080 ccacacagcc ttatgcacac acactgctgt gggccagggg tggccagtca ggttttttta 7140 aaaatccgtt cacagaaggc ctatagaact atttcttcct ctaaagagac acagatgaga 7200 tggacttttc aatctgtttc caaattctaa tacctaaact ctgctcagca catgttgccc 7260 tacaccaggg gttggcaaat caaggcctgt gtgtggccca cagcctggga gctaagaatg 7320 acagttacat tcttttttct ttttttgaga ctgagtctcg ctctgtcgcc caggctggag 7380 tgcagtggcg tgttcttggc tcactgcaac ccccgcctcc cagattaatg caattttcct 7440 gtctcagcct cagccttctg agtagcccgg accacaggcg cacgccacca cgcccaacta 7500 attttttata tttttagtag agacagagat tcaccatgtg gcctagctgg tctcgaactc 7560 ctgaactcca gtgatccacc aacctcggct tcctaaagta ctggaattac aggcatgagc 7620 caccgcgcct ggctagaata acagttactt tttttttctt tgagactgag tcttgctttg 7680 tcacccaggc tggagtgcag tggcacgatc tcagctcgct gcaacctccg cctcccgggt 7740 tcaagcgatt cttctgcctc agccacccaa ggtgcccgcc accacacctg gctaattttt 7800 ctgtttttag tagggacagg atttcgccat gttggacagt tacattctta aagggctgct 7860 gaagatcgta tggacatggt agcccataaa tcccaaaatg tgtactctga ccctttacag 7920 aagcttacta actcccactc tacatgtgag ggctgcggtg gccaagaaga gctggaattt 7980 aagtgtgaag gtcctaagac ctgccccagc ccacttccct gccccggagg ccaccagggg 8040 tgacaagtag attcatgccc tggagtgttc cttctctccg gggcttatgg cagcaactga 8100 atgacttaga agtccatggg agtgctttct gttgtgggaa ctcgtgtggt ctgggcatag 8160 ctgtgccagg cacctatggt ccaagcccct agaagcatag actctgacca aactggcgac 8220 ccagccttcc agcaggcagc actggctccc accagggccc tcatcctggg aactgacttg 8280 gccatgtggg aggcttggga gacccatggg ttggtttctc agggtcaggg tgtagcagtg 8340 ggctccagat gtggcaggtg ggaggtggga ggggcccctc ccagcatgcc actgacctgg 8400 cctctccctg cacagcccag aacatgccga gccggaggtc caggtggtgc cggggtctgg 8460 ccagatcatc ttcctgccct tcacctgcat tggctacacg gccaccaatc aggacttcat 8520 ccagcgcctg agcacactga tccggcaggc catcgagcgg cagctgcctg cctggatcga 8580 ggctgccaac cagcgggagg agggccaggg tgaacagggc gaggaggagg atgaggagga 8640 ggaagaagag gaggacgtgg ctgagaaccg ctactttgaa atggggcccc cagacgtgga 8700 ggaggaggag ggaggaggcc agggggagga agaggaggag gaagaggagg atgaagaggc 8760 cgaggaggag cgcctggctc tggaatgggc cctgggcgcg gacgaggact tcctgctgga 8820 gcacatccgc atcctcaagg tgctgtggtg cttcctgatc catgtgcagg gcagtatccg 8880 ccagttcgcc gcctgccttg tgctcaccga cttcggcatc gcagtcttcg agatcccgca 8940 ccaggagtct cggggcagca gccagcacat cctctcctcc ctgcgctttg tcttttgctt 9000 cccgcatggc gacctcaccg agtttggctt cctcatgccg gagctgtgtc tggtgctcaa 9060 ggtacggcac agtgagaaca cgctcttcat tatctcggac gccgccaacc tgcacgagtt 9120 ccacgcggac ctgcgctcat gctttgcacc ccagcacatg gccatgctgt gtagccccat 9180 cctctacggc agccacacca gcctgcagga gttcctgcgc cagctgctca ccttctacaa 9240 ggtggctggc ggctgccagg agcgcagcca gggctgcttc cccgtctacc tggtctacag 9300 tgacaagcgc atggtgcaga cggccgccgg ggactactca ggcaacatcg agtgggccag 9360 ctgcacactc tgttcagccg tgcggcgctc ctgctgcgcg ccctctgagg ccgtcaagtc 9420 cgccgccatc ccctactggc tgttgctcac gccccagcac ctcaacgtca tcaaggccga 9480 cttcaacccc atgcccaacc gtggcaccca caactgtcgc aaccgcaaca gcttcaagct 9540 cagccgtgtg ccgctctcca ccgtgctgct ggaccccaca cgcagctgta cccagcctcg 9600 gggcgccttt gctgatggcc acgtgctaga gctgctcgtg gggtaccgct ttgtcactgc 9660 catcttcgtg ctgccccacg agaagttcca cttcctgcgc gtctacaacc agctgcgggc 9720 ctcgctgcag gacctgaaga ctgtggtcat cgccaagacc cccgggacgg gaggcagccc 9780 ccagggctcc tttgcggatg gccagcctgc cgagcgcagg gccaggtgag atcaagcaca 9840 gctctcaggg gccccggggg cacgggtctg gcatgtgtgt gatctcagca tctgcggcta 9900 gtgtgggctg ggagttgctg cgagagctgg gccccctccc ccctgcccct cgcccccccc 9960 gggcctccct ctacatcacc accccaggtt tggtgccagg ctgctcctta tctcagtgct 10020 gtagaagaag cccaggaaag ctgtcctctc acaaaatggg ttggcccagc ctcttgccac 10080 ccatgaaggg caggccaagg gggctgcccc acctttgcct gcccagtggg agagcaacag 10140 gctgcagcac accgaggcca ggagagctgt caccctggct gctgtgctcc tctgggccca 10200 agcatggcct ctgggcacta cctcctccag ggtcacagtc ccacggatgg ctctgtgggc 10260 caggatctgc cttaggcttc acccacctca acatcttgct gtgttgttca ggctggtctc 10320 aaactttggg ctcaaacaat cctccgcctc agcctcccaa agtgctggga ttacagacat 10380 gagccaccgt gcccggccgt gctgttctgt tctccaatag agaagctggt ggaagtcccc 10440 agtaacccag aggtgatgtg tgatgcacac agtctcctca ctctgaagct gcacatgcga 10500 tgtgaatctt catttggggt ccgctgttaa tatggtgttt ttcgggggat acagcaatga 10560 ccagcgtccc caggaggtcc cagcagaggc tctggccccg gccccagtgg aagtcccagc 10620 tccagcccct gcagcagcct cagcctcagg cccagcgaag actccggccc cagcagaggc 10680 ctcaacttca gctttggtcc cagaggagac gccagtggaa gctccagccc cacccccagc 10740 cgaggcccct gcccagtacc cgagtgagca cctcatccag gccacctcgg aggagaatca 10800 gatcccctcg cacttgcctg cctgcccgtc gctccggcac gtcgccagcc tgcggggcag 10860 cgccatcatc gagctcttcc acagcagcat tgctgaggta gcggcccggg tgtgggtgcc 10920 agctatggca cggccagtcc tgagggcgag gccaagcttg gcttcaggtc agcctcaggt 10980 ccctggactt ccctgatgtc ggagtcctca gctgagctgc tcacagcttt gaggacctgg 11040 gcagtgaggt cctgagttgc cctccctggc catttgtgct gtgtcaccac ctcctgtgcc 11100 acttccagcc ccaggtagac ctcccaccaa cagccatctc ccacccctct cttcctctct 11160 gccttgaagc atacggattc attggtgagc caagaggggc ttcccatgtc tccttgtgga 11220 agctgtgggc atgtccctgg tatgtgcagg ttgctagggt ggtggagctg acaggaggcc 11280 ccccgtcttc aggttgaaaa cgaggagctg aggcacctca tgtggtcctc ggtggtgttc 11340 taccagaccc cagggctgga ggtgactgcc tgcgtgctgc tctccaccaa ggctgtgtac 11400 tttgtgctcc acgacggcct ccgccgctac ttctcagagc cactgcaggg taggcacagg 11460 gcctgctggg gctcaggagc ttggagtgtg tggttggggc aggcctgggg ggtcattctc 11520 tggagccagc tgtgtggctt caggcagcag tcagcgactt ggctgcagtg ggctgagagt 11580 tccttgtctg aggaagggag ctgtcatgag ggaggggtcc atggccagat gtgaacgcag 11640 aatgcactga gccagggcct ggtgactgct tgggaacagc ctgtgatgag aaggggttag 11700 gcagcctttg cccctggggc tgcacaggaa gccctagcca gcgacctggt gactcccctg 11760 agctggaaga ggctcagact ccagagggca ttgcctatgg ggctttgcac gggtggaagc 11820 caggccagcc aagaggacct gttcctgctg gatgtgctgc acacctagga accttgtgct 11880 tgcctgccac cgcctccctc tgtccctttc tccatcacac agatttctgg catcagaaaa 11940 acaccgacta caacaacagc cctttccaca tctcccagtg cttcgtgcta aagcttagtg 12000 acctgcagtc agtcaatgtg gggcttttcg accagcattt ccggctgacg cgtgggtgac 12060 cctctgtgct ttgtcctatt tcgggtgaag gccagcatca ccagtgggct tccaccttcc 12120 gtacgtgggt gggttatcat agacagttat ctctgtgctc aagagccact tcttacccgg 12180 ggtgggagga agcagcttca ggaactgctg agagagcaga actcacgctc cagggctcag 12240 agcaggaggt agggtgtgcg gcaagcgctg gcccggacag aagcagagtg ggccctggtc 12300 tcgggcagga tgtttctgac tcacatttcc tgaggagaga aagctaagct ctttgcctaa 12360 tgtctctgtc tccccttcca gaaaaatgcc tcagctcttc cggcctgaag gaatggcctc 12420 ctcccgggcc ccatgattct ttcctgtgtg ggccctcctg gccctggcct ctgggctgag 12480 gcttgctagg gactcggggt ggctctaagg ggcagggata gggctgggga gcgccggcct 12540 gtggccctga ccagcccctt ctcgtgcagg ttccaccccg atgcaggtgg tcacgtgctt 12600 gacgcgggac agctacctga cgcactgctt cctccagcac ctcatggtcg tgctgtcctc 12660 tctggaacgc acgccctcgc cggagcctgt tgacaaggac ttctactccg agtttgggaa 12720 caagaccaca ggtacccctg tctagctcag gctgcagaca ggctgcctgg acagacgtca 12780 tgggccccag ggtggctctc tgtgccccag aaccctctct gcctctatgt ctctcttttc 12840 tcacttagct ggccagggtt ttatgtgggg cttttcgatg gcagagtctc cactccagca 12900 gtccctcaac catctggcag acacatctcc agtgcctgct ttgggctcct ggcctgtggg 12960 ccccacactt ggagcatcct ctcctgcctg tctcatgccg gggtctctcg gttggcttgg 13020 ggcccttggt gctcccagcc ccaccagggg ccggttccag gctatagccc aggtggcatc 13080 tctctgcagg gaagatggag aactacgagc tgatccactc tagtcgcgtc aagtttacct 13140 accccagtga ggaggagatt ggggacctga cgttcactgt ggcccaaaag atggctgagc 13200 cagagaaggc cccagccctc agcatcctgc tgtacgtgca ggccttccag gtgggcatgc 13260 caccccctgg gtgctgcagg ggccccctgc gccccaagac actcctgctc accagctccg 13320 agatcttcct cctggatgag gactgtgtcc actacccact gcccgagttt gccaaagagc 13380 cgccgcagag agacaggtac cggctggacg atggccgccg cgtccgggac ctggaccgag 13440 tgctcatggg ctaccagacc tacccgcagg ccctcaccct cgtcttcgat gacgtgcaag 13500 gtcatgacct catgggcagt gtcaccctgg accactttgg ggaggtgcca ggtggcccgg 13560 ctagagccag ccagggccgt gaagtccagt ggcaggtgtt tgtccccagt gctgagagca 13620 gagagaagct catctcgctg ttggctcgcc agtgggaggc cctgtgtggc cgtgagctgc 13680 ctgtcgagct caccggctag cccaggccac agccagcctg tcgtgtccag cctgacgcct 13740 actggggcag ggcagcaggc ttttgtgttc tctaaaaatg ttttatcctc cctttggtac 13800 cttaatttga ctgtcctcgc agagaatgtg aacatgtgtg tgtgttgtgt taattctttc 13860 tcatgttggg agtgagaatg ccgggcccct cagggctgtc ggtgtgctgt cagcctccca 13920 caggtggtac agccgtgcac accagtgtcg tgtctgctgt tgtgggaccg ttgttaacac 13980 gtgacactgt gggtctgact ttctcttcta cacgtccttt cctgaagtgt cgagtccagt 14040 cctttgttgc tgttgctgtt gctgttgctg ttgctgttgg catcttgctg ctaatcctga 14100 ggctggtagc agaatgcaca ttggaagctc ccaccccata ttgttcttca aagtggaggt 14160 ctcccctgat ccagacaagt gggagagccc gtgggggcag gggacctgga gctgccagca 14220 ccaagcgtga ttcctgctgc ctgtattctc tattccaata aagcagagtt tgacaccgtc 14280 tgcatcttct aaaccaaggg tcactgggat cgagtcgacg gccctatagt gagtcgtatt 14340 agagctcgcg gccgcgagct ctagatgcat gctcgagcgg ccgccagtgt gatggatatc 14400 tgcagaattc cagcacactg gcggccgtta ctagtggatc cgagctccac agaggtggtc 14460 gatgagggct gccctttccc acatccttag tagggggttc aagatgaccc agactgtgcc 14520 cctggggagc ttggagccat gcgggaggat gagccatgtg ctggaggaga acagggtagg 14580 atggtgtggg gcttttgtag actgtctaga agcaaagaag gtctgcagtg gaggtggtgt 14640 ctgaggtgaa tctcgaaggt gaataggagt tgaacgttag caggcagagg gtggattgca 14700 ggagagcagc ggcctgggca ggtgcccagc gtggcccatc agggtgcttc atgcatggct 14760 gtgtgcttgc catccttcct gcctgcctac cccctgctgc ttcgcttcat gggggcgttt 14820 gagcttgggc ccacctgcct gcctcgcttg tgggcagagg acccaagctg tgtgagttgt 14880 cctgtcccgg ggagcagctg aactggtccg gggtctcgaa ctgtggggct caaaaggact 14940 ccggggtcat ttcactgggg ctgtgccgat tcctgggggc tgttnggaan gtaaaggcct 15000 aaaggggctc cctggttang gccctcaant ttaanaacct ggggccgggg cccggaattg 15060 cccccaantt tgtttcaacn ccccttggcc ttnggcnggg gcaaatttcc anggggaacc 15120 aatggntttc ccccaaaaan ggggccnttt taacccnttt ccaaantttg ggncctaaaa 15180 aagggtggan ttcctgaang gg 15202 22 1070 PRT Homo sapiens 22 Val Cys Leu Asp Asp Thr Val Thr Thr Glu Lys Glu Leu Asp Thr Val 1 5 10 15 Glu Val Leu Lys Ala Ile Gln Lys Ala Lys Glu Val Lys Ser Lys Leu 20 25 30 Ser Asn Pro Glu Lys Lys Gly Gly Glu Asp Ser Arg Leu Ser Ala Ala 35 40 45 Pro Cys Ile Arg Pro Ser Ser Ser Pro Pro Thr Val Ala Pro Ala Ser 50 55 60 Ala Ser Leu Pro Gln Pro Ile Leu Ser Asn Gln Gly Ile Met Phe Val 65 70 75 80 Gln Glu Glu Ala Leu Ala Ser Ser Leu Ser Ser Thr Asp Ser Leu Thr 85 90 95 Pro Glu His Gln Pro Ile Ala Gln Gly Cys Ser Asp Ser Leu Glu Ser 100 105 110 Ile Pro Ala Gly Gln Ala Ala Ser Asp Asp Leu Arg Asp Val Pro Gly 115 120 125 Ala Val Gly Gly Ala Ser Pro Glu His Ala Glu Pro Glu Val Gln Val 130 135 140 Val Pro Gly Ser Gly Gln Ile Ile Phe Leu Pro Phe Thr Cys Ile Gly 145 150 155 160 Tyr Thr Ala Thr Asn Gln Asp Phe Ile Gln Arg Leu Ser Thr Leu Ile 165 170 175 Arg Gln Ala Ile Glu Arg Gln Leu Pro Ala Trp Ile Glu Ala Ala Asn 180 185 190 Gln Arg Glu Glu Gly Gln Gly Glu Gln Gly Glu Glu Glu Asp Glu Glu 195 200 205 Glu Glu Glu Glu Glu Asp Val Ala Glu Asn Arg Tyr Phe Glu Met Gly 210 215 220 Pro Pro Asp Val Glu Glu Glu Glu Gly Gly Gly Gln Gly Glu Glu Glu 225 230 235 240 Glu Glu Glu Glu Glu Asp Glu Glu Ala Glu Glu Glu Arg Leu Ala Leu 245 250 255 Glu Trp Ala Leu Gly Ala Asp Glu Asp Phe Leu Leu Glu His Ile Arg 260 265 270 Ile Leu Lys Val Leu Trp Cys Phe Leu Ile His Val Gln Gly Ser Ile 275 280 285 Arg Gln Phe Ala Ala Cys Leu Val Leu Thr Asp Phe Gly Ile Ala Val 290 295 300 Phe Glu Ile Pro His Gln Glu Ser Arg Gly Ser Ser Gln His Ile Leu 305 310 315 320 Ser Ser Leu Arg Phe Val Phe Cys Phe Pro His Gly Asp Leu Thr Glu 325 330 335 Phe Gly Phe Leu Met Pro Glu Leu Cys Leu Val Leu Lys Val Arg His 340 345 350 Ser Glu Asn Thr Leu Phe Ile Ile Ser Asp Ala Ala Asn Leu His Glu 355 360 365 Phe His Ala Asp Leu Arg Ser Cys Phe Ala Pro Gln His Met Ala Met 370 375 380 Leu Cys Ser Pro Ile Leu Tyr Gly Ser His Thr Ser Leu Gln Glu Phe 385 390 395 400 Leu Arg Gln Leu Leu Thr Phe Tyr Lys Val Ala Gly Gly Cys Gln Glu 405 410 415 Arg Ser Gln Gly Cys Phe Pro Val Tyr Leu Val Tyr Ser Asp Lys Arg 420 425 430 Met Val Gln Thr Ala Ala Gly Asp Tyr Ser Gly Asn Ile Glu Trp Ala 435 440 445 Ser Cys Thr Leu Cys Ser Ala Val Arg Arg Ser Cys Cys Ala Pro Ser 450 455 460 Glu Ala Val Lys Ser Ala Ala Ile Pro Tyr Trp Leu Leu Leu Thr Pro 465 470 475 480 Gln His Leu Asn Val Ile Lys Ala Asp Phe Asn Pro Met Pro Asn Arg 485 490 495 Gly Thr His Asn Cys Arg Asn Arg Asn Ser Phe Lys Leu Ser Arg Val 500 505 510 Pro Leu Ser Thr Val Leu Leu Asp Pro Thr Arg Ser Cys Thr Gln Pro 515 520 525 Arg Gly Ala Phe Ala Asp Gly His Val Leu Glu Leu Leu Val Gly Tyr 530 535 540 Arg Phe Val Thr Ala Ile Phe Val Leu Pro His Glu Lys Phe His Phe 545 550 555 560 Leu Arg Val Tyr Asn Gln Leu Arg Ala Ser Leu Gln Asp Leu Lys Thr 565 570 575 Val Val Ile Ala Lys Thr Pro Gly Thr Gly Gly Ser Pro Gln Gly Ser 580 585 590 Phe Ala Asp Gly Gln Pro Ala Glu Arg Arg Ala Ser Asn Asp Gln Arg 595 600 605 Pro Gln Glu Val Pro Ala Glu Ala Leu Ala Pro Ala Pro Val Glu Val 610 615 620 Pro Ala Pro Ala Pro Ala Ala Ala Ser Ala Ser Gly Pro Ala Lys Thr 625 630 635 640 Pro Ala Pro Ala Glu Ala Ser Thr Ser Ala Leu Val Pro Glu Glu Thr 645 650 655 Pro Val Glu Ala Pro Ala Pro Pro Pro Ala Glu Ala Pro Ala Gln Tyr 660 665 670 Pro Ser Glu His Leu Ile Gln Ala Thr Ser Glu Glu Asn Gln Ile Pro 675 680 685 Ser His Leu Pro Ala Cys Pro Ser Leu Arg His Val Ala Ser Leu Arg 690 695 700 Gly Ser Ala Ile Ile Glu Leu Phe His Ser Ser Ile Ala Glu Val Glu 705 710 715 720 Asn Glu Glu Leu Arg His Leu Met Trp Ser Ser Val Val Phe Tyr Gln 725 730 735 Thr Pro Gly Leu Glu Val Thr Ala Cys Val Leu Leu Ser Thr Lys Ala 740 745 750 Val Tyr Phe Val Leu His Asp Gly Leu Arg Arg Tyr Phe Ser Glu Pro 755 760 765 Leu Gln Asp Phe Trp His Gln Lys Asn Thr Asp Tyr Asn Asn Ser Pro 770 775 780 Phe His Ile Ser Gln Cys Phe Val Leu Lys Leu Ser Asp Leu Gln Ser 785 790 795 800 Val Asn Val Gly Leu Phe Asp Gln His Phe Arg Leu Thr Gly Ser Thr 805 810 815 Pro Met Gln Val Val Thr Cys Leu Thr Arg Asp Ser Tyr Leu Thr His 820 825 830 Cys Phe Leu Gln His Leu Met Val Val Leu Ser Ser Leu Glu Arg Thr 835 840 845 Pro Ser Pro Glu Pro Val Asp Lys Asp Phe Tyr Ser Glu Phe Gly Asn 850 855 860 Lys Thr Thr Gly Lys Met Glu Asn Tyr Glu Leu Ile His Ser Ser Arg 865 870 875 880 Val Lys Phe Thr Tyr Pro Ser Glu Glu Glu Ile Gly Asp Leu Thr Phe 885 890 895 Thr Val Ala Gln Lys Met Ala Glu Pro Glu Lys Ala Pro Ala Leu Ser 900 905 910 Ile Leu Leu Tyr Val Gln Ala Phe Gln Val Gly Met Pro Pro Pro Gly 915 920 925 Cys Cys Arg Gly Pro Leu Arg Pro Lys Thr Leu Leu Leu Thr Ser Ser 930 935 940 Glu Ile Phe Leu Leu Asp Glu Asp Cys Val His Tyr Pro Leu Pro Glu 945 950 955 960 Phe Ala Lys Glu Pro Pro Gln Arg Asp Arg Tyr Arg Leu Asp Asp Gly 965 970 975 Arg Arg Val Arg Asp Leu Asp Arg Val Leu Met Gly Tyr Gln Thr Tyr 980 985 990 Pro Gln Ala Leu Thr Leu Val Phe Asp Asp Val Gln Gly His Asp Leu 995 1000 1005 Met Gly Ser Val Thr Leu Asp His Phe Gly Glu Val Pro Gly Gly Pro 1010 1015 1020 Ala Arg Ala Ser Gln Gly Arg Glu Val Gln Trp Gln Val Phe Val Pro 1025 1030 1035 1040 Ser Ala Glu Ser Arg Glu Lys Leu Ile Ser Leu Leu Ala Arg Gln Trp 1045 1050 1055 Glu Ala Leu Cys Gly Arg Glu Leu Pro Val Glu Leu Thr Gly 1060 1065 1070
Claims (36)
1. A DNA molecule encoding for a polypeptide including an amino acid sequence which is receptive to imidazoline compounds, said DNA molecule containing a DNA sequence with at least 75% sequence similarity with the DNA sequence shown in SEQ ID No. 4.
2. A DNA molecule according to claim 1 , containing a DNA sequence with at least 75% sequence similarity with the DNA sequence shown in SEQ ID No. 2.
3. A DNA molecule according to claim 2 , containing a DNA sequence with at least 75% sequence similarity with the DNA sequence of SEQ ID No. 3.
4. A DNA molecule according to claim 3 , containing a DNA sequence with at least 75% sequence similarity with the DNA sequence of SEQ ID No. 1.
5. A DNA molecule according to any one of claims 1 to 4 , containing a DNA sequence with at least 80% sequence similarity with the sequence of said SEQ ID No.
6. A DNA molecule according to any one of claims 1 to 4 , containing a DNA sequence with at least 85% sequence similarity with the sequence of said SEQ ID No.
7. A DNA molecule according to any one of claims 1 to 4 , containing a DNA sequence with at least 90% sequence similarity with the sequence of said SEQ ID No.
8. A DNA molecule according to any one of claims 1 to 4 , containing a DNA sequence with at least 95% sequence similarity with the sequence of said SEQ ID No.
9. A DNA molecule according to claim 1 , which is deposited with the ATCC under deposit accession no. ATCC 209217.
10. A genomic DNA molecule encoding for a polypeptide including an amino acid sequence which is receptive to imidazoline compounds, and wherein exon portions of said genomic DNA molecule include the DNA sequence as defined in claim 1 .
11. A genomic DNA molecule according to claim 10 , which is deposited with the ATCC under deposit accession no. ATCC 209216.
12. A 1110 bp ApaI-EcoRI restriction fragment of the DNA molecule according to claim 1 .
13. A 1.85 kb EcoRI restriction fragment of the DNA molecule according to claim 4 .
14. A vector containing a DNA sequence as defined in any one of claims 1-13.
15. A host cell transfected with a vector as defined in claim 14 .
16. An isolated polypeptide including a site which is receptive to imidazoline compounds, said polypeptide containing an amino acid sequence with at least 80% sequence similarity with the amino acid sequence shown in SEQ ID No. 6.
17. A polypeptide as defined in claim 16 , having a molecular weight of about 35 to 45 kDa.
18. A polypeptide as defined in claim 17 , having a molecular weight of about 37 kDa.
19. An isolated polypeptide including a site which is receptive to imidazoline compounds, said polypeptide containing an amino acid sequence with at least 80% sequence similarity with the amino acid sequence shown in SEQ ID No. 5.
20. A polypeptide as defined in claim 19 , having a molecular weight of about 60 to 85 kDa.
21. A polypeptide as defined in claim 20 , having a molecular weight of about 70 kDa.
22. A fragment of the amino acid sequence shown in SEQ ID No. 5 or 6, which fragment is receptive to imidazoline compounds.
23. A polypeptide according to any one of claims 16 to 22 , which is immunoreactive with at least one of Reis antiserum and Dontenwill antiserum.
24. A polypeptide according to any one of claims 16 to 23 , which is a human polypeptide.
25. A method of producing an isolated polypeptide including an amino acid sequence which is receptive to imidazoline compounds, said method comprising:
transfecting a host cell with a vector as defined in claim 14; and
culturing the transfected host cell in a culture medium to express the polypeptide.
26. An isolated polypeptide including an amino acid sequence which is receptive to imidazoline compounds, which polypeptide is expressed by the method of claim 25 .
27. A method of screening for a ligand of an imidazoline receptor, which method comprises:
culturing a host cell as defined in claim 15 in a culture medium to express a polypeptide including an amino acid sequence which is receptive to imidazoline compounds;
contacting said polypeptide with a labelled ligand for the imidazoline receptor under conditions effective to bind the labelled ligand thereto;
contacting said polypeptide with a candidate ligand; and
detecting any displacement of the labelled ligand from said polypeptide, wherein displacement signifies that the candidate ligand is a ligand for the imidazoline receptor.
28. The method of claim 27 , wherein said contacting steps are performed in an intact cultured host cell.
29. The method of claim 27 , further comprising isolating the cell membrane of said cultured host cell prior to performing said contacting steps.
30. The method of claim 27 , wherein said contacting of said imidazoline receptive polypeptide with said candidate ligand is conducted at a plurality of candidate ligand concentrations.
31. The method of claim 27 , wherein the labelled ligand is radiolabelled.
32. A method of obtaining a DNA material encoding a polypeptide which is receptive to imidazoline compounds, said method comprising:
providing a labelled DNA probe by labelling a DNA molecule identical or complementary to a DNA molecule as defined in any one of claims 1 to 9 or a restriction fragment thereof;
contacting said DNA probe with genetic material suspected of encoding said imidazoline receptive polypeptide;
hybridizing said DNA probe and said genetic material under stringent hybridization conditions;
identifying any portion of the genetic material which hybridizes to said DNA probe; and
isolating said identified material.
33. A method according to claim 32 , wherein the genetic material is derived from a library selected from the group consisting of RNA library, cDNA library and genomic DNA library.
34. A method according to claim 33 , wherein said library is a human library.
35. A method according to claim 32 , wherein the labelled DNA probe is provided by labelling a restriction fragment according to claim 12 or 13.
36. A method of raising antibodies immunoreactive with a polypeptide which is receptive to an imidazoline compound, which method comprises:
injecting an animal with a polypeptide as defined in any one of claims 16 to 24 and 26; and
isolating antibodies produced by the animal.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/420,845 US20030180885A1 (en) | 1996-03-01 | 2003-04-23 | DNA molecules encoding imidazoline receptive polypeptides and polypeptides encoded thereby |
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US1260096P | 1996-03-01 | 1996-03-01 | |
US08/650,766 US6015690A (en) | 1996-05-20 | 1996-05-20 | DNA sequence encoding a human imidazoline receptor and method for cloning the same |
US09/414,643 US6881826B1 (en) | 1996-03-01 | 1999-10-08 | Imidazoline receptive polypeptides |
US09/922,635 US6515149B2 (en) | 2000-08-08 | 2001-08-07 | Acetal compound, polymer, resist composition and patterning response |
US10/420,845 US20030180885A1 (en) | 1996-03-01 | 2003-04-23 | DNA molecules encoding imidazoline receptive polypeptides and polypeptides encoded thereby |
Related Parent Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/414,643 Division US6881826B1 (en) | 1996-03-01 | 1999-10-08 | Imidazoline receptive polypeptides |
US09/922,635 Division US6515149B2 (en) | 1996-03-01 | 2001-08-07 | Acetal compound, polymer, resist composition and patterning response |
Publications (1)
Publication Number | Publication Date |
---|---|
US20030180885A1 true US20030180885A1 (en) | 2003-09-25 |
Family
ID=26683763
Family Applications (5)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US08/922,635 Expired - Fee Related US6033871A (en) | 1996-03-01 | 1997-09-03 | DNA molecules encoding imidazoline receptive polypeptides and polypeptides encoded thereby |
US09/389,487 Expired - Fee Related US6576742B1 (en) | 1996-03-01 | 1999-09-03 | DNA sequence encoding a human imidazoline receptor and method for cloning the same |
US10/420,845 Abandoned US20030180885A1 (en) | 1996-03-01 | 2003-04-23 | DNA molecules encoding imidazoline receptive polypeptides and polypeptides encoded thereby |
US10/421,763 Expired - Fee Related US7384752B2 (en) | 1996-03-01 | 2003-04-24 | DNA encoding a human imidazoline receptor and ligand binding assay employing same |
US10/947,444 Abandoned US20050084911A1 (en) | 1996-03-01 | 2004-09-23 | DNA molecules encoding imidazoline receptive polypeptides and polypeptides encoded thereby |
Family Applications Before (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US08/922,635 Expired - Fee Related US6033871A (en) | 1996-03-01 | 1997-09-03 | DNA molecules encoding imidazoline receptive polypeptides and polypeptides encoded thereby |
US09/389,487 Expired - Fee Related US6576742B1 (en) | 1996-03-01 | 1999-09-03 | DNA sequence encoding a human imidazoline receptor and method for cloning the same |
Family Applications After (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/421,763 Expired - Fee Related US7384752B2 (en) | 1996-03-01 | 2003-04-24 | DNA encoding a human imidazoline receptor and ligand binding assay employing same |
US10/947,444 Abandoned US20050084911A1 (en) | 1996-03-01 | 2004-09-23 | DNA molecules encoding imidazoline receptive polypeptides and polypeptides encoded thereby |
Country Status (2)
Country | Link |
---|---|
US (5) | US6033871A (en) |
WO (1) | WO1997031945A1 (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1997031945A1 (en) * | 1996-03-01 | 1997-09-04 | The University Of Mississippi Medical Center | Dna encoding a human imidazoline receptor |
EP1025128A1 (en) * | 1997-09-03 | 2000-08-09 | The University of Mississippi Medical Center | Dna molecules encoding imidazoline receptive polypeptides and polypeptides encoded thereby |
US20090047989A1 (en) * | 2007-08-16 | 2009-02-19 | Questox Corporation | Cellular notebook |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5574059A (en) * | 1995-10-27 | 1996-11-12 | Cornell Research Foundation, Inc. | Treating disorders mediated by vascular smooth muscle cell proliferation |
US5726197A (en) * | 1992-11-02 | 1998-03-10 | Syntex (U.S.A.) Inc. | Isoindolinyl derivatives |
US6475752B1 (en) * | 1999-07-30 | 2002-11-05 | Incyte Genomics, Inc. | Mammalian imidazoline receptor |
US6538107B1 (en) * | 1994-09-30 | 2003-03-25 | Takeda Chemical Industries, Ltd. | G protein coupled receptor protein production, and use thereof |
US6635668B1 (en) * | 1998-07-22 | 2003-10-21 | The University Of North Carolina At Chapel Hill | Imidazoline receptor binding compounds |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH08505044A (en) * | 1992-09-25 | 1996-06-04 | シナプティック・ファーマスーティカル・コーポレーション | DNA encoding human α-adrenergic receptor and use thereof |
WO1997031945A1 (en) * | 1996-03-01 | 1997-09-04 | The University Of Mississippi Medical Center | Dna encoding a human imidazoline receptor |
-
1997
- 1997-02-28 WO PCT/US1997/003156 patent/WO1997031945A1/en active Application Filing
- 1997-09-03 US US08/922,635 patent/US6033871A/en not_active Expired - Fee Related
-
1999
- 1999-09-03 US US09/389,487 patent/US6576742B1/en not_active Expired - Fee Related
-
2003
- 2003-04-23 US US10/420,845 patent/US20030180885A1/en not_active Abandoned
- 2003-04-24 US US10/421,763 patent/US7384752B2/en not_active Expired - Fee Related
-
2004
- 2004-09-23 US US10/947,444 patent/US20050084911A1/en not_active Abandoned
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5726197A (en) * | 1992-11-02 | 1998-03-10 | Syntex (U.S.A.) Inc. | Isoindolinyl derivatives |
US6538107B1 (en) * | 1994-09-30 | 2003-03-25 | Takeda Chemical Industries, Ltd. | G protein coupled receptor protein production, and use thereof |
US5574059A (en) * | 1995-10-27 | 1996-11-12 | Cornell Research Foundation, Inc. | Treating disorders mediated by vascular smooth muscle cell proliferation |
US6635668B1 (en) * | 1998-07-22 | 2003-10-21 | The University Of North Carolina At Chapel Hill | Imidazoline receptor binding compounds |
US6475752B1 (en) * | 1999-07-30 | 2002-11-05 | Incyte Genomics, Inc. | Mammalian imidazoline receptor |
Also Published As
Publication number | Publication date |
---|---|
US6576742B1 (en) | 2003-06-10 |
WO1997031945A1 (en) | 1997-09-04 |
US7384752B2 (en) | 2008-06-10 |
US20050084911A1 (en) | 2005-04-21 |
US20030224429A1 (en) | 2003-12-04 |
US6033871A (en) | 2000-03-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Seulberger et al. | The inducible blood–brain barrier specific molecule HT7 is a novel immunoglobulin‐like cell surface glycoprotein. | |
AU685076B2 (en) | DNA encoding 5-HT4 serotonin receptors and uses thereof | |
US5595880A (en) | DNA encoding an α2B adrenergic receptor and uses thereof | |
CA2316403A1 (en) | Mammalian edg-5 receptor homologs | |
JP2002531091A5 (en) | ||
CA2116489A1 (en) | Pacap receptor protein, method for preparing said protein and use thereof | |
EP0656007B1 (en) | Delta opioid receptor genes | |
JPH1084976A (en) | New human g-protein coupled receptor | |
CA2304828A1 (en) | G-protein coupled glycoprotein hormone receptor hg38 | |
WO1994010311A1 (en) | The pct-65 serotonin receptor | |
JP2002536989A (en) | G protein-coupled receptor similar to galanin receptor | |
US20030219874A1 (en) | EDG8 receptor, its preparation and use | |
AU668106B2 (en) | cDNA encoding a dopamine transporter and protein encoded thereby | |
US20030180885A1 (en) | DNA molecules encoding imidazoline receptive polypeptides and polypeptides encoded thereby | |
CA2341351A1 (en) | Alpha-2/delta gene | |
WO2000029571A1 (en) | Gene encoding novel transmembrane protein | |
US6432652B1 (en) | Methods of screening modulators of opioid receptor activity | |
WO1999011668A1 (en) | Dna molecules encoding imidazoline receptive polypeptides and polypeptides encoded thereby | |
US7276576B1 (en) | Mammalian ICYP (iodocyanopindolol) receptor and its applications | |
US20080219925A1 (en) | Delta opioid receptor protein | |
US6881826B1 (en) | Imidazoline receptive polypeptides | |
US6015690A (en) | DNA sequence encoding a human imidazoline receptor and method for cloning the same | |
US20030113317A1 (en) | Molecules associated with apoptosis | |
JPH10117791A (en) | Human g protein bond receptor hlyaz61 | |
WO1994010310A1 (en) | THE St-B17 SEROTONIN RECEPTOR |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |