CA2402147A1 - Melanin concentrating hormone receptor chimeric and fusion proteins - Google Patents
Melanin concentrating hormone receptor chimeric and fusion proteins Download PDFInfo
- Publication number
- CA2402147A1 CA2402147A1 CA002402147A CA2402147A CA2402147A1 CA 2402147 A1 CA2402147 A1 CA 2402147A1 CA 002402147 A CA002402147 A CA 002402147A CA 2402147 A CA2402147 A CA 2402147A CA 2402147 A1 CA2402147 A1 CA 2402147A1
- Authority
- CA
- Canada
- Prior art keywords
- leu
- thr
- gly
- val
- ile
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 102000029828 Melanin-concentrating hormone receptor Human genes 0.000 title claims abstract description 110
- 108010047068 Melanin-concentrating hormone receptor Proteins 0.000 title claims abstract description 107
- 102000037865 fusion proteins Human genes 0.000 title claims abstract description 64
- 108020001507 fusion proteins Proteins 0.000 title claims abstract description 64
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 46
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 46
- 229920001184 polypeptide Polymers 0.000 claims abstract description 45
- 150000007523 nucleic acids Chemical class 0.000 claims description 47
- 108020004707 nucleic acids Proteins 0.000 claims description 36
- 102000039446 nucleic acids Human genes 0.000 claims description 36
- 108090000623 proteins and genes Proteins 0.000 claims description 36
- 102000004169 proteins and genes Human genes 0.000 claims description 32
- 101800002739 Melanin-concentrating hormone Proteins 0.000 claims description 28
- 230000000694 effects Effects 0.000 claims description 28
- 102400001132 Melanin-concentrating hormone Human genes 0.000 claims description 27
- 150000001413 amino acids Chemical group 0.000 claims description 26
- ORRDHOMWDPJSNL-UHFFFAOYSA-N melanin concentrating hormone Chemical compound N1C(=O)C(C(C)C)NC(=O)C(CCCNC(N)=N)NC(=O)CNC(=O)C(C(C)C)NC(=O)C(CCSC)NC(=O)C(NC(=O)C(CCCNC(N)=N)NC(=O)C(NC(=O)C(NC(=O)C(N)CC(O)=O)C(C)O)CCSC)CSSCC(C(=O)NC(CC=2C3=CC=CC=C3NC=2)C(=O)NC(CCC(O)=O)C(=O)NC(C(C)C)C(O)=O)NC(=O)C2CCCN2C(=O)C(CCCNC(N)=N)NC(=O)C1CC1=CC=C(O)C=C1 ORRDHOMWDPJSNL-UHFFFAOYSA-N 0.000 claims description 24
- 238000000034 method Methods 0.000 claims description 21
- 239000013604 expression vector Substances 0.000 claims description 18
- 239000002773 nucleotide Substances 0.000 claims description 16
- 125000003729 nucleotide group Chemical group 0.000 claims description 16
- 150000001875 compounds Chemical class 0.000 claims description 13
- 230000003834 intracellular effect Effects 0.000 claims description 13
- 230000027455 binding Effects 0.000 claims description 12
- 101500028288 Homo sapiens Melanin-concentrating hormone Proteins 0.000 claims description 8
- 238000002360 preparation method Methods 0.000 claims description 8
- 238000012360 testing method Methods 0.000 claims description 8
- 241000894007 species Species 0.000 claims description 7
- 241001465754 Metazoa Species 0.000 claims description 4
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 claims description 4
- 230000009261 transgenic effect Effects 0.000 claims description 4
- 102000034287 fluorescent proteins Human genes 0.000 abstract description 12
- 108091006047 fluorescent proteins Proteins 0.000 abstract description 12
- 241000699666 Mus <mouse, genus> Species 0.000 description 60
- 210000004027 cell Anatomy 0.000 description 43
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 37
- 241000880493 Leptailurus serval Species 0.000 description 35
- 108020004705 Codon Proteins 0.000 description 33
- 239000005090 green fluorescent protein Substances 0.000 description 29
- 108020004414 DNA Proteins 0.000 description 26
- 239000010976 emerald Substances 0.000 description 24
- 229910052876 emerald Inorganic materials 0.000 description 24
- 229940024606 amino acid Drugs 0.000 description 23
- 235000001014 amino acid Nutrition 0.000 description 22
- 230000004927 fusion Effects 0.000 description 21
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 20
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 20
- 101000581404 Mus musculus Melanin-concentrating hormone receptor 1 Proteins 0.000 description 20
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 19
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 18
- 241000579895 Chlorostilbon Species 0.000 description 18
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 18
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 18
- GFDUZZACIWNMPE-KZVJFYERSA-N Thr-Ala-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O GFDUZZACIWNMPE-KZVJFYERSA-N 0.000 description 16
- 108010062796 arginyllysine Proteins 0.000 description 16
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 16
- 108010061238 threonyl-glycine Proteins 0.000 description 16
- 239000012634 fragment Substances 0.000 description 15
- 108010037850 glycylvaline Proteins 0.000 description 15
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 14
- 101100183183 Mus musculus Mchr1 gene Proteins 0.000 description 14
- 235000018102 proteins Nutrition 0.000 description 14
- 108010047857 aspartylglycine Proteins 0.000 description 13
- 239000013598 vector Substances 0.000 description 13
- 108010054576 Deoxyribonuclease EcoRI Proteins 0.000 description 12
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 12
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 12
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 12
- 108010015796 prolylisoleucine Proteins 0.000 description 12
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 12
- 239000011031 topaz Substances 0.000 description 12
- 229910052853 topaz Inorganic materials 0.000 description 12
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 11
- VBVKSAFJPVXMFJ-CIUDSAMLSA-N Asp-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N VBVKSAFJPVXMFJ-CIUDSAMLSA-N 0.000 description 11
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 11
- 101000581402 Homo sapiens Melanin-concentrating hormone receptor 1 Proteins 0.000 description 11
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 11
- 108091028043 Nucleic acid sequence Proteins 0.000 description 11
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 11
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 11
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 11
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 11
- 108010012988 lysyl-glutamyl-aspartyl-glycine Proteins 0.000 description 11
- 108700004896 tripeptide FEG Proteins 0.000 description 11
- 108010009962 valyltyrosine Proteins 0.000 description 11
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 10
- YFBGNGASPGRWEM-DCAQKATOSA-N Arg-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFBGNGASPGRWEM-DCAQKATOSA-N 0.000 description 10
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 10
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 10
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 10
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 10
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 10
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 10
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 10
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 10
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 10
- 108010038983 glycyl-histidyl-lysine Proteins 0.000 description 10
- 108010053725 prolylvaline Proteins 0.000 description 10
- 108010073969 valyllysine Proteins 0.000 description 10
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 9
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 9
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 9
- FOHXUHGZZKETFI-JBDRJPRFSA-N Ala-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N FOHXUHGZZKETFI-JBDRJPRFSA-N 0.000 description 9
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 9
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 9
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 9
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 9
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 9
- KJRXLVZYJJLUCV-DCAQKATOSA-N Gln-Arg-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KJRXLVZYJJLUCV-DCAQKATOSA-N 0.000 description 9
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 9
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 9
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 9
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 9
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 9
- DXWNFNOPBYAFRM-IHRRRGAJSA-N Phe-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N DXWNFNOPBYAFRM-IHRRRGAJSA-N 0.000 description 9
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 9
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 9
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 9
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 9
- TZJSEJOXAIWOST-RHYQMDGZSA-N Thr-Lys-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N TZJSEJOXAIWOST-RHYQMDGZSA-N 0.000 description 9
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 9
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 9
- VZBWRZGNEPBRDE-HZUKXOBISA-N Trp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N VZBWRZGNEPBRDE-HZUKXOBISA-N 0.000 description 9
- UUJHRSTVQCFDPA-UFYCRDLUSA-N Tyr-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 UUJHRSTVQCFDPA-UFYCRDLUSA-N 0.000 description 9
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 9
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 9
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 9
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 9
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 9
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 9
- 108010060455 des-Tyr- beta-casomorphin Proteins 0.000 description 9
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 9
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 9
- 108010034529 leucyl-lysine Proteins 0.000 description 9
- 102000005962 receptors Human genes 0.000 description 9
- 108020003175 receptors Proteins 0.000 description 9
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 9
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 9
- 108010000239 Aequorin Proteins 0.000 description 8
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 8
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 8
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 8
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 8
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 8
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 8
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 8
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 8
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 8
- SSNJZBGOMNLSLA-CIUDSAMLSA-N Cys-Leu-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O SSNJZBGOMNLSLA-CIUDSAMLSA-N 0.000 description 8
- KHNJVFYHIKLUPD-SRVKXCTJSA-N Gln-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHNJVFYHIKLUPD-SRVKXCTJSA-N 0.000 description 8
- FTTHLXOMDMLKKW-FHWLQOOXSA-N Gln-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTTHLXOMDMLKKW-FHWLQOOXSA-N 0.000 description 8
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 8
- WCHONUZTYDQMBY-PYJNHQTQSA-N His-Pro-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WCHONUZTYDQMBY-PYJNHQTQSA-N 0.000 description 8
- KWURTLAFFDOTEQ-GUBZILKMSA-N Leu-Cys-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KWURTLAFFDOTEQ-GUBZILKMSA-N 0.000 description 8
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 8
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 8
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 8
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 8
- FOBUGKUBUJOWAD-IHPCNDPISA-N Leu-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FOBUGKUBUJOWAD-IHPCNDPISA-N 0.000 description 8
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 8
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 8
- XFOAWKDQMRMCDN-ULQDDVLXSA-N Lys-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)CC1=CC=CC=C1 XFOAWKDQMRMCDN-ULQDDVLXSA-N 0.000 description 8
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 8
- NLHSFJQUHGCWSD-PYJNHQTQSA-N Met-Ile-His Chemical compound N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O NLHSFJQUHGCWSD-PYJNHQTQSA-N 0.000 description 8
- MQASRXPTQJJNFM-JYJNAYRXSA-N Met-Pro-Phe Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MQASRXPTQJJNFM-JYJNAYRXSA-N 0.000 description 8
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 8
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 8
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 8
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 8
- GWMXFEMMBHOKDX-AVGNSLFASA-N Ser-Gln-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GWMXFEMMBHOKDX-AVGNSLFASA-N 0.000 description 8
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 8
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 8
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 8
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 8
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 8
- OWQKBXKXZFRRQL-XGEHTFHBSA-N Thr-Met-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N)O OWQKBXKXZFRRQL-XGEHTFHBSA-N 0.000 description 8
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 8
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 8
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 8
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 description 8
- HVPPEXXUDXAPOM-MGHWNKPDSA-N Tyr-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HVPPEXXUDXAPOM-MGHWNKPDSA-N 0.000 description 8
- FJBCEFPCVPHPPM-STECZYCISA-N Tyr-Ile-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O FJBCEFPCVPHPPM-STECZYCISA-N 0.000 description 8
- ABZWHLRQBSBPTO-RNXOBYDBSA-N Tyr-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CC=C(C=C4)O)N ABZWHLRQBSBPTO-RNXOBYDBSA-N 0.000 description 8
- PYPZMFDMCCWNST-NAKRPEOUSA-N Val-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N PYPZMFDMCCWNST-NAKRPEOUSA-N 0.000 description 8
- POFQRHFHYPSCOI-FHWLQOOXSA-N Val-Trp-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N POFQRHFHYPSCOI-FHWLQOOXSA-N 0.000 description 8
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 8
- 238000003556 assay Methods 0.000 description 8
- 108010005942 methionylglycine Proteins 0.000 description 8
- WKBPZYKAUNRMKP-UHFFFAOYSA-N 1-[2-(2,4-dichlorophenyl)pentyl]1,2,4-triazole Chemical compound C=1C=C(Cl)C=C(Cl)C=1C(CCC)CN1C=NC=N1 WKBPZYKAUNRMKP-UHFFFAOYSA-N 0.000 description 7
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 7
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 7
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 7
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 7
- LNENWJXDHCFVOF-DCAQKATOSA-N Asp-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LNENWJXDHCFVOF-DCAQKATOSA-N 0.000 description 7
- UPURLDIGQGTUPJ-ZKWXMUAHSA-N Cys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N UPURLDIGQGTUPJ-ZKWXMUAHSA-N 0.000 description 7
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 7
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 7
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 7
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 7
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 7
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 7
- WYUHAXJAMDTOAU-IAVJCBSLSA-N Ile-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WYUHAXJAMDTOAU-IAVJCBSLSA-N 0.000 description 7
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 7
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 7
- LKXANTUNFMVCNF-IHPCNDPISA-N Leu-His-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LKXANTUNFMVCNF-IHPCNDPISA-N 0.000 description 7
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 7
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 7
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 7
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 7
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 7
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 7
- NIHNMOSRSAYZIT-BPNCWPANSA-N Tyr-Ala-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NIHNMOSRSAYZIT-BPNCWPANSA-N 0.000 description 7
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 7
- ZLMFVXMJFIWIRE-FHWLQOOXSA-N Val-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N ZLMFVXMJFIWIRE-FHWLQOOXSA-N 0.000 description 7
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 7
- 108010044940 alanylglutamine Proteins 0.000 description 7
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 7
- 238000002825 functional assay Methods 0.000 description 7
- 108010024607 phenylalanylalanine Proteins 0.000 description 7
- 208000016261 weight loss Diseases 0.000 description 7
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 6
- QCTOLCVIGRLMQS-HRCADAONSA-N Arg-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O QCTOLCVIGRLMQS-HRCADAONSA-N 0.000 description 6
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 6
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 6
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 6
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 6
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 6
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 6
- 108091005973 GFP derivatives Proteins 0.000 description 6
- BLOXULLYFRGYKZ-GUBZILKMSA-N Gln-Glu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BLOXULLYFRGYKZ-GUBZILKMSA-N 0.000 description 6
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 6
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 6
- MVORZMQFXBLMHM-QWRGUYRKSA-N Gly-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 MVORZMQFXBLMHM-QWRGUYRKSA-N 0.000 description 6
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 6
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 6
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 6
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 6
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 6
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 6
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 6
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 6
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 6
- BQHLZUMZOXUWNU-DCAQKATOSA-N Met-Pro-Glu Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BQHLZUMZOXUWNU-DCAQKATOSA-N 0.000 description 6
- LBSWWNKMVPAXOI-GUBZILKMSA-N Met-Val-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O LBSWWNKMVPAXOI-GUBZILKMSA-N 0.000 description 6
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 6
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 6
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 6
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 6
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 6
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 6
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 6
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 6
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 6
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 6
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 6
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 6
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 6
- 210000004899 c-terminal region Anatomy 0.000 description 6
- 239000000499 gel Substances 0.000 description 6
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 6
- 108010027338 isoleucylcysteine Proteins 0.000 description 6
- 108010057821 leucylproline Proteins 0.000 description 6
- 230000004580 weight loss Effects 0.000 description 6
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 5
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 5
- QNIACYURSSCLRP-GUBZILKMSA-N Asp-Lys-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O QNIACYURSSCLRP-GUBZILKMSA-N 0.000 description 5
- KIQKJXYVGSYDFS-ZLUOBGJFSA-N Cys-Asn-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KIQKJXYVGSYDFS-ZLUOBGJFSA-N 0.000 description 5
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 5
- LPJVZYMINRLCQA-AVGNSLFASA-N Gln-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N LPJVZYMINRLCQA-AVGNSLFASA-N 0.000 description 5
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 5
- UCZXXMREFIETQW-AVGNSLFASA-N Glu-Tyr-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O UCZXXMREFIETQW-AVGNSLFASA-N 0.000 description 5
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 5
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 5
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 5
- UXSATKFPUVZVDK-KKUMJFAQSA-N His-Lys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N UXSATKFPUVZVDK-KKUMJFAQSA-N 0.000 description 5
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 5
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 5
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 5
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 5
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 5
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 5
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 5
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 5
- 101150106280 Mchr1 gene Proteins 0.000 description 5
- 108700036626 Melanin-concentrating hormone receptor 1 Proteins 0.000 description 5
- 102100027375 Melanin-concentrating hormone receptor 1 Human genes 0.000 description 5
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 5
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 5
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 5
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 5
- IKUMWSDCGQVGHC-UMPQAUOISA-N Trp-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O IKUMWSDCGQVGHC-UMPQAUOISA-N 0.000 description 5
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 5
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 5
- JKUZFODWJGEQAP-KBPBESRZSA-N Tyr-Gly-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O JKUZFODWJGEQAP-KBPBESRZSA-N 0.000 description 5
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 5
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 5
- 108010047495 alanylglycine Proteins 0.000 description 5
- 201000010099 disease Diseases 0.000 description 5
- 230000005714 functional activity Effects 0.000 description 5
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 5
- 108010050848 glycylleucine Proteins 0.000 description 5
- 108010015792 glycyllysine Proteins 0.000 description 5
- 210000003016 hypothalamus Anatomy 0.000 description 5
- 108010017391 lysylvaline Proteins 0.000 description 5
- 108020004999 messenger RNA Proteins 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- 108010048818 seryl-histidine Proteins 0.000 description 5
- 238000003146 transient transfection Methods 0.000 description 5
- 108010051110 tyrosyl-lysine Proteins 0.000 description 5
- 238000000625 variable-angle polarised absorption spectroscopy Methods 0.000 description 5
- 239000004475 Arginine Substances 0.000 description 4
- GWNMUVANAWDZTI-YUMQZZPRSA-N Asn-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GWNMUVANAWDZTI-YUMQZZPRSA-N 0.000 description 4
- YYSYDIYQTUPNQQ-SXTJYALSSA-N Asn-Ile-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YYSYDIYQTUPNQQ-SXTJYALSSA-N 0.000 description 4
- DOURAOODTFJRIC-CIUDSAMLSA-N Asn-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N DOURAOODTFJRIC-CIUDSAMLSA-N 0.000 description 4
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 4
- HBUJSDCLZCXXCW-YDHLFZDLSA-N Asn-Val-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HBUJSDCLZCXXCW-YDHLFZDLSA-N 0.000 description 4
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 4
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 4
- 239000000055 Corticotropin-Releasing Hormone Substances 0.000 description 4
- OHCQJHSOBUTRHG-KGGHGJDLSA-N FORSKOLIN Chemical compound O=C([C@@]12O)C[C@](C)(C=C)O[C@]1(C)[C@@H](OC(=O)C)[C@@H](O)[C@@H]1[C@]2(C)[C@@H](O)CCC1(C)C OHCQJHSOBUTRHG-KGGHGJDLSA-N 0.000 description 4
- LTXLIIZACMCQTO-GUBZILKMSA-N Gln-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LTXLIIZACMCQTO-GUBZILKMSA-N 0.000 description 4
- SRZLHYPAOXBBSB-HJGDQZAQSA-N Glu-Arg-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SRZLHYPAOXBBSB-HJGDQZAQSA-N 0.000 description 4
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 4
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 4
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 4
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 4
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 4
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 4
- UZZXGLOJRZKYEL-DJFWLOJKSA-N His-Asn-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UZZXGLOJRZKYEL-DJFWLOJKSA-N 0.000 description 4
- YYOCMTFVGKDNQP-IHRRRGAJSA-N His-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N YYOCMTFVGKDNQP-IHRRRGAJSA-N 0.000 description 4
- LVQDUPQUJZWKSU-PYJNHQTQSA-N Ile-Arg-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LVQDUPQUJZWKSU-PYJNHQTQSA-N 0.000 description 4
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 4
- VQUCKIAECLVLAD-SVSWQMSJSA-N Ile-Cys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VQUCKIAECLVLAD-SVSWQMSJSA-N 0.000 description 4
- MTONDYJJCIBZTK-PEDHHIEDSA-N Ile-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(=O)O)N MTONDYJJCIBZTK-PEDHHIEDSA-N 0.000 description 4
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 4
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 4
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 4
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 4
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 4
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 4
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 4
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 4
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 4
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 4
- LXNPMPIQDNSMTA-AVGNSLFASA-N Lys-Gln-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 LXNPMPIQDNSMTA-AVGNSLFASA-N 0.000 description 4
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 4
- ZJSZPXISKMDJKQ-JYJNAYRXSA-N Lys-Phe-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=CC=C1 ZJSZPXISKMDJKQ-JYJNAYRXSA-N 0.000 description 4
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 4
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 4
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 4
- NHXXGBXJTLRGJI-GUBZILKMSA-N Met-Pro-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O NHXXGBXJTLRGJI-GUBZILKMSA-N 0.000 description 4
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 4
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 4
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 4
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 4
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 4
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 4
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 4
- SNSYSBUTTJBPDG-OKZBNKHCSA-N Pro-Trp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N4CCC[C@@H]4C(=O)O SNSYSBUTTJBPDG-OKZBNKHCSA-N 0.000 description 4
- 102000001708 Protein Isoforms Human genes 0.000 description 4
- 108010029485 Protein Isoforms Proteins 0.000 description 4
- IYCBDVBJWDXQRR-FXQIFTODSA-N Ser-Ala-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IYCBDVBJWDXQRR-FXQIFTODSA-N 0.000 description 4
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 4
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 4
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 4
- 108091081024 Start codon Proteins 0.000 description 4
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 4
- DDDLIMCZFKOERC-SVSWQMSJSA-N Thr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N DDDLIMCZFKOERC-SVSWQMSJSA-N 0.000 description 4
- GUHLYMZJVXUIPO-RCWTZXSCSA-N Thr-Met-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GUHLYMZJVXUIPO-RCWTZXSCSA-N 0.000 description 4
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 4
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 4
- HFJJDMOFTCQGEI-STECZYCISA-N Tyr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HFJJDMOFTCQGEI-STECZYCISA-N 0.000 description 4
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 4
- KLOZTPOXVVRVAQ-DZKIICNBSA-N Tyr-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KLOZTPOXVVRVAQ-DZKIICNBSA-N 0.000 description 4
- LMSBRIVOCYOKMU-NRPADANISA-N Val-Gln-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N LMSBRIVOCYOKMU-NRPADANISA-N 0.000 description 4
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 4
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 4
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 4
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 4
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 4
- 239000000556 agonist Substances 0.000 description 4
- 239000005557 antagonist Substances 0.000 description 4
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 210000000170 cell membrane Anatomy 0.000 description 4
- 239000002299 complementary DNA Substances 0.000 description 4
- 206010012601 diabetes mellitus Diseases 0.000 description 4
- 239000000284 extract Substances 0.000 description 4
- 230000014509 gene expression Effects 0.000 description 4
- 108010089804 glycyl-threonine Proteins 0.000 description 4
- 108010087823 glycyltyrosine Proteins 0.000 description 4
- 108010051242 phenylalanylserine Proteins 0.000 description 4
- 230000001105 regulatory effect Effects 0.000 description 4
- 238000001890 transfection Methods 0.000 description 4
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 3
- XWTNPSHCJMZAHQ-QMMMGPOBSA-N 2-[[2-[[2-[[(2s)-2-amino-4-methylpentanoyl]amino]acetyl]amino]acetyl]amino]acetic acid Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(=O)NCC(O)=O XWTNPSHCJMZAHQ-QMMMGPOBSA-N 0.000 description 3
- 241000243290 Aequorea Species 0.000 description 3
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 3
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 3
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 3
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 3
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 3
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 3
- OGSQONVYSTZIJB-WDSOQIARSA-N Arg-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OGSQONVYSTZIJB-WDSOQIARSA-N 0.000 description 3
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 3
- QUBKBPZGMZWOKQ-SZMVWBNQSA-N Arg-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QUBKBPZGMZWOKQ-SZMVWBNQSA-N 0.000 description 3
- RRVBEKYEFMCDIF-WHFBIAKZSA-N Asn-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)C(=O)N RRVBEKYEFMCDIF-WHFBIAKZSA-N 0.000 description 3
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 3
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 3
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 3
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 3
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 3
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 3
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 3
- SWTQDYFZVOJVLL-KKUMJFAQSA-N Asp-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)O SWTQDYFZVOJVLL-KKUMJFAQSA-N 0.000 description 3
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 3
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 3
- 241000699802 Cricetulus griseus Species 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- 108091006027 G proteins Proteins 0.000 description 3
- 102000003688 G-Protein-Coupled Receptors Human genes 0.000 description 3
- 108090000045 G-Protein-Coupled Receptors Proteins 0.000 description 3
- 102000030782 GTP binding Human genes 0.000 description 3
- 108091000058 GTP-Binding Proteins 0.000 description 3
- 102000034354 Gi proteins Human genes 0.000 description 3
- 108091006101 Gi proteins Proteins 0.000 description 3
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 3
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 3
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 3
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 3
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 3
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 3
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 3
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 3
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 3
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 3
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 3
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 3
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 3
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 3
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 3
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 3
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 3
- RJVZMGQMJOQIAX-GJZGRUSLSA-N Gly-Trp-Met Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(O)=O RJVZMGQMJOQIAX-GJZGRUSLSA-N 0.000 description 3
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 3
- MDBYBTWRMOAJAY-NHCYSSNCSA-N His-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MDBYBTWRMOAJAY-NHCYSSNCSA-N 0.000 description 3
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 3
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 3
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 3
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 3
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 3
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 3
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 3
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 3
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 3
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 3
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 3
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 3
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 3
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 3
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 3
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 3
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 3
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 3
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 3
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 3
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 3
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 3
- 239000004472 Lysine Substances 0.000 description 3
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 3
- 241000699670 Mus sp. Species 0.000 description 3
- 206010028980 Neoplasm Diseases 0.000 description 3
- 108090000189 Neuropeptides Proteins 0.000 description 3
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 3
- RGZYXNFHYRFNNS-MXAVVETBSA-N Phe-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGZYXNFHYRFNNS-MXAVVETBSA-N 0.000 description 3
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 3
- ONPFOYPPPOHMNH-UVBJJODRSA-N Pro-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 ONPFOYPPPOHMNH-UVBJJODRSA-N 0.000 description 3
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 3
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 3
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 3
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 3
- SVWQEIRZHHNBIO-WHFBIAKZSA-N Ser-Gly-Cys Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CS)C(O)=O SVWQEIRZHHNBIO-WHFBIAKZSA-N 0.000 description 3
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 3
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 3
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 3
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 3
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 3
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 3
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 3
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 3
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 3
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 3
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 3
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 3
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 3
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 3
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 3
- NSUUANXHLKKHQB-BZSNNMDCSA-N Val-Pro-Trp Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC2=CC=CC=C12 NSUUANXHLKKHQB-BZSNNMDCSA-N 0.000 description 3
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 3
- GTACFKZDQFTVAI-STECZYCISA-N Val-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 GTACFKZDQFTVAI-STECZYCISA-N 0.000 description 3
- 239000011543 agarose gel Substances 0.000 description 3
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 230000003281 allosteric effect Effects 0.000 description 3
- 108010068380 arginylarginine Proteins 0.000 description 3
- 235000009582 asparagine Nutrition 0.000 description 3
- 229960001230 asparagine Drugs 0.000 description 3
- 108010093581 aspartyl-proline Proteins 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 230000037396 body weight Effects 0.000 description 3
- 230000003491 cAMP production Effects 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 230000037406 food intake Effects 0.000 description 3
- 235000012631 food intake Nutrition 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 3
- 235000004554 glutamine Nutrition 0.000 description 3
- 108010078144 glutaminyl-glycine Proteins 0.000 description 3
- 108010049041 glutamylalanine Proteins 0.000 description 3
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 3
- 108010054666 glycyl-leucyl-glycyl-glycine Proteins 0.000 description 3
- 230000001965 increasing effect Effects 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 210000004962 mammalian cell Anatomy 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 108010077112 prolyl-proline Proteins 0.000 description 3
- 238000011282 treatment Methods 0.000 description 3
- 108010014563 tryptophyl-cysteinyl-serine Proteins 0.000 description 3
- 108010084932 tryptophyl-proline Proteins 0.000 description 3
- 239000004474 valine Substances 0.000 description 3
- 241000242764 Aequorea victoria Species 0.000 description 2
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 2
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- RYRQZJVFDVWURI-SRVKXCTJSA-N Arg-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N RYRQZJVFDVWURI-SRVKXCTJSA-N 0.000 description 2
- QEHMMRSQJMOYNO-DCAQKATOSA-N Arg-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QEHMMRSQJMOYNO-DCAQKATOSA-N 0.000 description 2
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 2
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 2
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 2
- HOQGTAIGQSDCHR-SRVKXCTJSA-N Asp-Asn-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HOQGTAIGQSDCHR-SRVKXCTJSA-N 0.000 description 2
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 2
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 2
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 2
- 102100025878 C1q-related factor Human genes 0.000 description 2
- 108700010070 Codon Usage Proteins 0.000 description 2
- CHRCKSPMGYDLIA-SRVKXCTJSA-N Cys-Phe-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O CHRCKSPMGYDLIA-SRVKXCTJSA-N 0.000 description 2
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 2
- SUZLHDUTVMZSEV-UHFFFAOYSA-N Deoxycoleonol Natural products C12C(=O)CC(C)(C=C)OC2(C)C(OC(=O)C)C(O)C2C1(C)C(O)CCC2(C)C SUZLHDUTVMZSEV-UHFFFAOYSA-N 0.000 description 2
- GMGKDVVBSVVKCT-NUMRIWBASA-N Gln-Asn-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GMGKDVVBSVVKCT-NUMRIWBASA-N 0.000 description 2
- CITDWMLWXNUQKD-FXQIFTODSA-N Gln-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CITDWMLWXNUQKD-FXQIFTODSA-N 0.000 description 2
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 2
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 2
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 2
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 2
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 2
- TVDHVLGFJSHPAX-UWVGGRQHSA-N Gly-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 TVDHVLGFJSHPAX-UWVGGRQHSA-N 0.000 description 2
- OJNZVYSGVYLQIN-BQBZGAKWSA-N Gly-Met-Asp Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O OJNZVYSGVYLQIN-BQBZGAKWSA-N 0.000 description 2
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 2
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- WGVPDSNCHDEDBP-KKUMJFAQSA-N His-Asp-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WGVPDSNCHDEDBP-KKUMJFAQSA-N 0.000 description 2
- IGBBXBFSLKRHJB-BZSNNMDCSA-N His-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 IGBBXBFSLKRHJB-BZSNNMDCSA-N 0.000 description 2
- ZFDKSLBEWYCOCS-BZSNNMDCSA-N His-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CC=CC=C1 ZFDKSLBEWYCOCS-BZSNNMDCSA-N 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 2
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 2
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 2
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 2
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 2
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 2
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 2
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 2
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 2
- XUMBMVFBXHLACL-UHFFFAOYSA-N Melanin Chemical compound O=C1C(=O)C(C2=CNC3=C(C(C(=O)C4=C32)=O)C)=C2C4=CNC2=C1C XUMBMVFBXHLACL-UHFFFAOYSA-N 0.000 description 2
- MSSJHBAKDDIRMJ-SRVKXCTJSA-N Met-Lys-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MSSJHBAKDDIRMJ-SRVKXCTJSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 2
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 2
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 2
- KIGGUSRFHJCIEJ-DCAQKATOSA-N Pro-Asp-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O KIGGUSRFHJCIEJ-DCAQKATOSA-N 0.000 description 2
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 2
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 2
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 2
- RJHJPZQOMKCSTP-CIUDSAMLSA-N Ser-His-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O RJHJPZQOMKCSTP-CIUDSAMLSA-N 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- 201000001880 Sexual dysfunction Diseases 0.000 description 2
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 2
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 2
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- WSGPBCAGEGHKQJ-BBRMVZONSA-N Trp-Gly-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WSGPBCAGEGHKQJ-BBRMVZONSA-N 0.000 description 2
- MEZCXKYMMQJRDE-PMVMPFDFSA-N Trp-Leu-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CC(C)C)C(O)=O)C1=CC=C(O)C=C1 MEZCXKYMMQJRDE-PMVMPFDFSA-N 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 2
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 2
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 2
- 230000009471 action Effects 0.000 description 2
- 235000004279 alanine Nutrition 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 235000003704 aspartic acid Nutrition 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 239000005441 aurora Substances 0.000 description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 2
- 230000029918 bioluminescence Effects 0.000 description 2
- 238000005415 bioluminescence Methods 0.000 description 2
- 210000004556 brain Anatomy 0.000 description 2
- 210000000481 breast Anatomy 0.000 description 2
- 201000011510 cancer Diseases 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- OHCQJHSOBUTRHG-UHFFFAOYSA-N colforsin Natural products OC12C(=O)CC(C)(C=C)OC1(C)C(OC(=O)C)C(O)C1C2(C)C(O)CCC1(C)C OHCQJHSOBUTRHG-UHFFFAOYSA-N 0.000 description 2
- 210000001072 colon Anatomy 0.000 description 2
- IDLFZVILOHSSID-OVLDLUHVSA-N corticotropin Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)NC(=O)[C@@H](N)CO)C1=CC=C(O)C=C1 IDLFZVILOHSSID-OVLDLUHVSA-N 0.000 description 2
- 229960000258 corticotropin Drugs 0.000 description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 2
- 235000018417 cysteine Nutrition 0.000 description 2
- 208000035475 disorder Diseases 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 230000000763 evoking effect Effects 0.000 description 2
- 230000002538 fungal effect Effects 0.000 description 2
- 235000013922 glutamic acid Nutrition 0.000 description 2
- 239000004220 glutamic acid Substances 0.000 description 2
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 2
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 230000001639 hypophagic effect Effects 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- 230000004807 localization Effects 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 230000037323 metabolic rate Effects 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 230000007935 neutral effect Effects 0.000 description 2
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 2
- 230000001817 pituitary effect Effects 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 230000004952 protein activity Effects 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 235000004400 serine Nutrition 0.000 description 2
- 231100000872 sexual dysfunction Toxicity 0.000 description 2
- 230000009870 specific binding Effects 0.000 description 2
- 239000013589 supplement Substances 0.000 description 2
- 235000008521 threonine Nutrition 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 235000002374 tyrosine Nutrition 0.000 description 2
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 230000004584 weight gain Effects 0.000 description 2
- 235000019786 weight gain Nutrition 0.000 description 2
- SKPQXOSVPKPXML-ULQDDVLXSA-N 2-[[(2s)-1-[(2s)-3-phenyl-2-[[(2s)-pyrrolidine-2-carbonyl]amino]propanoyl]pyrrolidine-2-carbonyl]amino]acetic acid Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H]1NCCC1)CC1=CC=CC=C1 SKPQXOSVPKPXML-ULQDDVLXSA-N 0.000 description 1
- 208000030507 AIDS Diseases 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 101000997963 Aequorea victoria Green fluorescent protein Proteins 0.000 description 1
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 1
- VHEVVUZDDUCAKU-FXQIFTODSA-N Ala-Met-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O VHEVVUZDDUCAKU-FXQIFTODSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- 241000252073 Anguilliformes Species 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 1
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 1
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 1
- WQLJRNRLHWJIRW-KKUMJFAQSA-N Asn-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)O WQLJRNRLHWJIRW-KKUMJFAQSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 1
- RTFXPCYMDYBZNQ-SRVKXCTJSA-N Asn-Tyr-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O RTFXPCYMDYBZNQ-SRVKXCTJSA-N 0.000 description 1
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 1
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 1
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 1
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 1
- 241000972773 Aulopiformes Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 241000255789 Bombyx mori Species 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 206010006895 Cachexia Diseases 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 208000024172 Cardiovascular disease Diseases 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 101100007328 Cocos nucifera COS-1 gene Proteins 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- UIKLEGZPIOXFHJ-DLOVCJGASA-N Cys-Phe-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O UIKLEGZPIOXFHJ-DLOVCJGASA-N 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 208000032928 Dyslipidaemia Diseases 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 206010056465 Food craving Diseases 0.000 description 1
- DRNMNLKUUKKPIA-HTUGSXCWSA-N Gln-Phe-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)CCC(N)=O)C(O)=O DRNMNLKUUKKPIA-HTUGSXCWSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 1
- JMQFHZWESBGPFC-WDSKDSINSA-N Gly-Gln-Asp Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JMQFHZWESBGPFC-WDSKDSINSA-N 0.000 description 1
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 1
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 1
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 1
- WYSJPCTWSBJFCO-AVGNSLFASA-N His-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CN=CN1)N WYSJPCTWSBJFCO-AVGNSLFASA-N 0.000 description 1
- DGVYSZUCRYXKOJ-XIRDDKMYSA-N His-Trp-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N DGVYSZUCRYXKOJ-XIRDDKMYSA-N 0.000 description 1
- 101000740205 Homo sapiens Sal-like protein 1 Proteins 0.000 description 1
- 206010020772 Hypertension Diseases 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
- HYLIOBDWPQNLKI-HVTMNAMFSA-N Ile-His-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HYLIOBDWPQNLKI-HVTMNAMFSA-N 0.000 description 1
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 1
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 1
- FTUZWJVSNZMLPI-RVMXOQNASA-N Ile-Met-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N FTUZWJVSNZMLPI-RVMXOQNASA-N 0.000 description 1
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 1
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- 206010022489 Insulin Resistance Diseases 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- QPXBPQUGXHURGP-UWVGGRQHSA-N Leu-Gly-Met Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N QPXBPQUGXHURGP-UWVGGRQHSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 1
- CNWDWAMPKVYJJB-NUTKFTJISA-N Leu-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CNWDWAMPKVYJJB-NUTKFTJISA-N 0.000 description 1
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 1
- SEOXPEFQEOYURL-PMVMPFDFSA-N Leu-Tyr-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SEOXPEFQEOYURL-PMVMPFDFSA-N 0.000 description 1
- 239000012097 Lipofectamine 2000 Substances 0.000 description 1
- GQUDMNDPQTXZRV-DCAQKATOSA-N Lys-Arg-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GQUDMNDPQTXZRV-DCAQKATOSA-N 0.000 description 1
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 1
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 1
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 1
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 239000000637 Melanocyte-Stimulating Hormone Substances 0.000 description 1
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 1
- CEGVMWAVGBRVFS-XGEHTFHBSA-N Met-Cys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CEGVMWAVGBRVFS-XGEHTFHBSA-N 0.000 description 1
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 241000699660 Mus musculus Species 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- 102000003797 Neuropeptides Human genes 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 1
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 1
- WFHRXJOZEXUKLV-IRXDYDNUSA-N Phe-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 WFHRXJOZEXUKLV-IRXDYDNUSA-N 0.000 description 1
- GXDPQJUBLBZKDY-IAVJCBSLSA-N Phe-Ile-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GXDPQJUBLBZKDY-IAVJCBSLSA-N 0.000 description 1
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- GAMLAXHLYGLQBJ-UFYCRDLUSA-N Phe-Val-Tyr Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC1=CC=C(C=C1)O)C(C)C)CC1=CC=CC=C1 GAMLAXHLYGLQBJ-UFYCRDLUSA-N 0.000 description 1
- 241000235648 Pichia Species 0.000 description 1
- 208000012641 Pigmentation disease Diseases 0.000 description 1
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 1
- DSGSTPRKNYHGCL-JYJNAYRXSA-N Pro-Phe-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DSGSTPRKNYHGCL-JYJNAYRXSA-N 0.000 description 1
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 1
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 102100037204 Sal-like protein 1 Human genes 0.000 description 1
- 241000277331 Salmonidae Species 0.000 description 1
- 241000242583 Scyphozoa Species 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- DSSOYPJWSWFOLK-CIUDSAMLSA-N Ser-Cys-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O DSSOYPJWSWFOLK-CIUDSAMLSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 1
- PKXHGEXFMIZSER-QTKMDUPCSA-N Thr-Arg-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PKXHGEXFMIZSER-QTKMDUPCSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 1
- CURFABYITJVKEW-QTKMDUPCSA-N Thr-Val-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O CURFABYITJVKEW-QTKMDUPCSA-N 0.000 description 1
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 1
- 241000209140 Triticum Species 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- HDQJVXVRGJUDML-UBHSHLNASA-N Trp-Cys-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HDQJVXVRGJUDML-UBHSHLNASA-N 0.000 description 1
- QTQNGBOKNQNQLS-PMVMPFDFSA-N Trp-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N QTQNGBOKNQNQLS-PMVMPFDFSA-N 0.000 description 1
- JONPRIHUYSPIMA-UWJYBYFXSA-N Tyr-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JONPRIHUYSPIMA-UWJYBYFXSA-N 0.000 description 1
- FXYOYUMPUJONGW-FHWLQOOXSA-N Tyr-Gln-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 FXYOYUMPUJONGW-FHWLQOOXSA-N 0.000 description 1
- HZZKQZDUIKVFDZ-AVGNSLFASA-N Tyr-Gln-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)O HZZKQZDUIKVFDZ-AVGNSLFASA-N 0.000 description 1
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 102000030621 adenylate cyclase Human genes 0.000 description 1
- 108060000200 adenylate cyclase Proteins 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- 208000022531 anorexia Diseases 0.000 description 1
- 230000036528 appetite Effects 0.000 description 1
- 235000019789 appetite Nutrition 0.000 description 1
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000001593 cAMP accumulation Effects 0.000 description 1
- 230000001275 ca(2+)-mobilization Effects 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 210000004671 cell-free system Anatomy 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000002512 chemotherapy Methods 0.000 description 1
- 108700010039 chimeric receptor Proteins 0.000 description 1
- 201000001883 cholelithiasis Diseases 0.000 description 1
- 238000004624 confocal microscopy Methods 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 206010061428 decreased appetite Diseases 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 238000010494 dissociation reaction Methods 0.000 description 1
- 230000005593 dissociations Effects 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 210000001671 embryonic stem cell Anatomy 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 235000019688 fish Nutrition 0.000 description 1
- YFHXZQPUBCBNIP-UHFFFAOYSA-N fura-2 Chemical compound CC1=CC=C(N(CC(O)=O)CC(O)=O)C(OCCOC=2C(=CC=3OC(=CC=3C=2)C=2OC(=CN=2)C(O)=O)N(CC(O)=O)CC(O)=O)=C1 YFHXZQPUBCBNIP-UHFFFAOYSA-N 0.000 description 1
- 208000001130 gallstones Diseases 0.000 description 1
- 238000010363 gene targeting Methods 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 229930195712 glutamate Natural products 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 230000008774 maternal effect Effects 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 210000004897 n-terminal region Anatomy 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 210000000287 oocyte Anatomy 0.000 description 1
- 230000001956 orexigenic effect Effects 0.000 description 1
- 201000008482 osteoarthritis Diseases 0.000 description 1
- 230000000144 pharmacologic effect Effects 0.000 description 1
- -1 physical size Chemical class 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 238000002731 protein assay Methods 0.000 description 1
- 238000000159 protein binding assay Methods 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- 238000003127 radioimmunoassay Methods 0.000 description 1
- 238000001959 radiotherapy Methods 0.000 description 1
- 239000000018 receptor agonist Substances 0.000 description 1
- 229940044601 receptor agonist Drugs 0.000 description 1
- 239000002464 receptor antagonist Substances 0.000 description 1
- 229940044551 receptor antagonist Drugs 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 230000001850 reproductive effect Effects 0.000 description 1
- 230000003938 response to stress Effects 0.000 description 1
- 230000004043 responsiveness Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 210000001995 reticulocyte Anatomy 0.000 description 1
- 235000019515 salmon Nutrition 0.000 description 1
- 230000009919 sequestration Effects 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 230000004960 subcellular localization Effects 0.000 description 1
- 208000001608 teratocarcinoma Diseases 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000011830 transgenic mouse model Methods 0.000 description 1
- 230000010474 transient expression Effects 0.000 description 1
- 208000001072 type 2 diabetes mellitus Diseases 0.000 description 1
- 125000002987 valine group Chemical group [H]N([H])C([H])(C(*)=O)C([H])(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 239000013585 weight reducing agent Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
- C07K14/72—Receptors; Cell surface antigens; Cell surface determinants for hormones
- C07K14/723—G protein coupled receptor, e.g. TSHR-thyrotropin-receptor, LH/hCG receptor, FSH receptor
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Zoology (AREA)
- Gastroenterology & Hepatology (AREA)
- Immunology (AREA)
- Biochemistry (AREA)
- Cell Biology (AREA)
- Toxicology (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Endocrinology (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The present invention features melanin concentrating hormone receptor (MCH-R ) chimeric and fusion proteins. MCH-R chimeric proteins comprise an MCH-R polypeptide region made up of at least two or more polypeptide regions characteristic of MCH-R found in different species. MCH-R fusion proteins comprise an MCH-R polypeptide region and a fluorescent protein region.</SDOA B>
Description
TITLE OF THE INVENTION
MELANIN CONCENTRATING-HORMONE RECEPTOR CHIMERIC AND
FUSION PROTEINS
CROSS-REFERENCE TO RELATED APPLICATIONS
The present application claims priority to provisional application U.S.
Serial No. 601189,698, filed March 15, 2000, hereby incorporated by reference herein.
BACKGROUND OF THE INVENTION
The references cited herein are not admitted to be prior art to the claimed invention.
Neuropeptides present in the hypothalamus play a major role in mediating the control of body weight. (Flier et al., 1998. Cell, 92, 437-440.) Melanin-concentrating hormone (MCH) is a cyclic 19-amino acid neuropeptide synthesized as part of a larger pre-prohormone precursor in the hypothalamus which also encodes neuropeptides NEI and NGE. (Nation et al., 1990. Mol. Endocrifzol. 4, 632-637.) MCH was first identified in salmon pituitary, and in fish MCH affects melanin aggregation thus affecting skin pigmentation. In trout and in eels MCH has also been shown to be involved in stress induced or CRF-stimulated ACTH release.
(Kawauchi et al., 1983. NatuYe 305, 321-323.) In humans two genes encoding MCH have been identified that are expressed in the brain. (Breton et al., 1993. Mol. Brain Res. 18, 297-310.) In mammals MCH has been localized primarily to neuronal cell bodies of the hypothalamus which are implicated in the control of food intake, including perikarya of the lateral hypothalamus and zona inertia. (Knigge et al., 1996. Peptides 17, 1063-1073.) Pharmacological and genetic evidence suggest that the primary mode of MCH action is to promote feeding (orexigenic). MCH mRNA is up regulated in fasted mice and rats and in the oblob mouse. (Qu et al., 1996. Nature 380, 243-247.) Injection of MCH centrally (ICV) stimulates food intake and MCH antagonizes the hypophagic effects seen with oc-melanocyte stimulating hormone (aMSH). (Qu et al., 1996. Nature 380, 243-247.) MCH-deficient mice are lean, hypophagic, and have increased metabolic rate. (Shimada et al., 1998. Nature 396, 670-673.) MCH action is not limited to modulation of food intake as effects on the hypothalamic-pituitary-axis have been reported. (Nation 1994. Critical Rev. ifz Neurobiol. 8, 221-262.) MCH may be involved in the body response to stress as MCH can modulate the stress-induced release of CRF from the hypothalamus and ACTH from the pituitary. In addition, MCH neuronal systems may be involved in reproductive or maternal function.
Several references describe a receptor that is indicated to bind MCH.
(Chambers et al., 1999. Nature 400, 261-265; Saito et al., 1999. Nature 400, 265-269;
Bachner et al., 1999. FEBS Letters 457:522-524; Shimomura et al., 1999.
Biochemical and Biophysical Researcla Comnzunicatiofzs 261, 622-626; and Lembo et al., 1999. Nat. Cell Biol. 1, 267-271.) SUMMARY OF THE INVENTION
The present invention features melanin concentrating hormone receptor (MCH-R) chimeric and fusion proteins. MCH-R chimeric proteins comprise an MCH-R polypeptide region made up of at least two or more polypeptide regions characteristic of MCH-R found in different species. MCH-R fusion proteins comprise an MCH-R polypeptide region and a fluorescent protein region.
An MCH-R polypeptide region provides a functional G-protein coupled receptor region able to bind MCH and transduce an intracellular signal.
Examples of MCH-R polypeptide regions include naturally occurring MCH-R, chimeric MCH-R containing two or more regions from naturally occurring MCH-R, and functional derivatives thereof.
Reference to the terms "characteristic" and "derivatives thereof describe a relationship to a reference sequence. In both cases, there is at least about 75°70 sequence similarity to the reference sequence.
Thus, a first aspect of the present invention describes a fusion protein comprising (a) an MCH-R polypeptide region and (b) a fluorescent polypeptide region. The fluorescent polypeptide region is joined directly, or though a polypeptide linker, to the carboxy side of the MCH-R polypeptide region.
Another aspect of the present invention describes an MCH-R chimeric protein. The protein comprises: (a) an MCH-R binding region characteristic of a human MCH-R, (b) a transmembrane domain characteristic of a non-human MCH-R, and (c) an intracellular domain characteristic of a non-human MCH-R.
Another aspect of the present invention describes a nucleic acid encoding for an MCH-R fusion protein or an MCH-R chimeric protein described herein. Such nucleic acid comprises either a contiguous nucleotide sequence that _2_ codes for the protein or a sequence that is processed by a host cell to produce a contiguous nucleotide sequence encoding for the protein. Processing of a nucleic acid sequence to produce a contiguous nucleotide sequence encoding for a protein can occur by the splicing together of exons resulting in intron removal.
Another aspect of the present invention describes an expression vector comprising a nucleic acid encoding for an MCH-R fusion protein or an MCH-R
chimeric protein described herein.
Another aspect of the present invention describes a recombinant cell comprising nucleic acid encoding for an MCH-R fusion protein or an MCH-R
chimeric protein described herein. The nucleic acid may be part of the host genome or may exist independently of the host genome.
Another aspect of the present invention describes a non-human transgenic animal comprising nucleic acid encoding for an MCH-R fusion protein or an MCH-R chimeric protein described herein.
Another aspect of the present invention describes a method for assaying for MCH-R active compounds by measuring the effect of a test preparation on one or more MCH-R activities. The method is performed using either an MCH-R
fusion protein or an MCH-R chimeric protein described herein.
Other features and advantages of the present invention are apparent from the additional descriptions provided herein including the different examples.
The provided examples illustrate different components and methodology useful in practicing the present invention. The examples do not limit the claimed invention.
Based on the present disclosure the skilled artisan can identify and employ other components and methodology useful for practicing the present invention.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 illustrates aequorin assay results comparing a mouse MCH-R
fusion with a human wild type MCH-R and a CMV-EGFP control.
Figure 2 illustrates a cAMP flashplate assay of CHO cell clones stably expressing mMCH-1R-EGFP. Cells from individual clones were dissociated in enzyme free media and stimulated for 15 minutes at 37°C with human MCH
at the indicated concentrations in the presence of 10 ~M forskolin. Cells were then lysed and assayed for bound ~l2sl]CAMP. Mouse MCH-1R-EGFP clones exhibited EC50 values (0.1111, 0.1255, 0.1291, or 0.2304 nM) indistinguishable from that of a CHO
cell clone expressing the wild-type human short isoform of MCH-1R ( 0.1282 nM).
Figure 3 illustrates a cAMP flashplate assay of CHO cell clones stably expressing human short/mouse species chimeric MCH-1R-EGFP. Cells from individual clones were dissociated in enzyme free media and stimulated for 15 minutes at 37°C with human MCH at the indicated concentrations in the presence of ~M forskolin. Cells were then lysed and assayed for bound [12~I]CAMP. Human short/mouse species chimeric MCH-1R-EGFP clones exhibited EC50 values (0.0366, 0.0462, 0.2117, or 0.2499 nM) indistinguishable from that of a CHO cell clone expressing the wild-type human short isoform of MCH-1R (0.1137 nM).
The present invention features MCH-R chimeric and fusion proteins.
Such proteins have a variety of different uses including being used as a research tool to study MCH-R function and dynamics, and being used to screen for MCH-R
agonists and antagonists.
The MCH-R provides a target to achieve different beneficial effects in a patient. Preferably, MCH-R activity is modulated to achieve one or more of the following: weight loss, weight gain, treat cancer (e.g., colon or breast), reduce pain, treat diabetes, reduce stress, or teat sexual dysfunction.
Modulation of MCH-R activity can be achieved by evoking a response at the MCH receptor or by altering a response evoked by an MCH receptor agonist or antagonist. Compounds modulating MCH-R receptor activity include agonists, antagonists, and allosteric modulators. Generally, MCH-R antagonists and allosteric modulators negatively affecting activity will be used to achieve weight loss, treat cancer (e.g., colon or breast), reduce pain, reduce stress, or teat sexual dysfunction;
and MCH-R agonists and allosteric modulators positively affecting activity will be used to produce a weight gain.
Preferably, MCH-R activity is modulated to achieve a weight loss or to treat diabetes in a patient. Diabetes mellitus can be treated by modulating MCH-R
activity to achieve, for example, one or both of the following: enhancing glucose tolerance or decreasing insulin resistance.
Excessive body weight is a contributing factor to different diseases, including hypertension, diabetes, dyslipidemias, cardiovascular disease, gall stones, osteoarthritis, and certain forms of cancers. Bringing about a weight loss can be used, for example, to reduce the likelihood of such diseases and as part of a treatment for such diseases. Weight reduction can be achieved by modulating MCH-R activity to obtain, for example, one or more of the following effects: reducing appetite, increasing metabolic rate, reducing fat intake, or reducing carbohydrate craving.
Increasing body weight is particularly useful for a patient having a disease or disorder, or under going a treatment, accompanied by weight loss.
Examples of diseases or disorders accompanied by weight loss include anorexia, AIDS, wasting, cachexia, and frail elderly. Examples of treatments accompanied by weight Ioss include chemotherapy and radiation therapy.
MCH-R Chimeric Proteins MCH-R chimeric proteins contain an MCH-R polypeptide region made up by at least two or more polypeptide regions characteristic of MCH-R found in different species. The different polypeptide regions that are present provide for an N-terminal extracellular domain; a transmembrane domain made up of transmembrane regions, extracellular loop regions, and intracellular loop regions; and an intracellular carboxy terminus domain. Examples of MCH-R amino acid sequences include the following: SEQ. ID. NO. 1 (human MCH1R long form), SEQ. ID. NO. 2 (human MCH1R short form), and SEQ. ID. NO. 3 (mouse MCH1R).
Preferably, the MCH-R chimeric protein comprises an MCH-R binding region characteristic of a human MCH-R along with transmembrane and intracellular domains characteristic of a non-human MCH-R. There are substantial amino acid differences between the N-terminus of the MCH-R present in humans and that present in other species such as mice. Such differences could result in, for example, the mouse MCH-R having different intrinsic properties and responsiveness to agonists and/or antagonists than the human MCH-R. The presence of a human MCH-R
binding region provides for a "humanized" MCH-R chimeric receptor.
The transmembrane and intracellular domains characteristic of a non-human MCH-R can be used in conjunction with a non-human host to provide a more naturally occurring environment for these regions. For example, an MCH-R
chimeric having mouse transmembrane and intracellular domains are preferably used in murine cells lines or in transgenic mice.
MCH-R chimeric proteins may contain regions other than extracellular, transmembrane, and intracellular domains that do not substantially decrease the activity of the protein. Preferably, additional regions do not cause a decrease of more than about 25°70 of MCH-R activity as measured using one or more of the assays described in the examples provided below. Examples of additional regions that may be present include fluorescent protein regions and linker regions.
In an embodiment of the present invention, the MCH-R chimeric protein comprises: (a) an MCH binding region characteristic of a first species and (b) a transmembrane and intracellular domain region characteristic of a second species joined directly, or though a linker, to the carboxy side of the MCH binding region.
Preferably, the protein comprises, consists, or consists essentially of an MCH-R
polypeptide having a sequence similarity of at least about 75%, at least 85%, or at least 95% with either SEQ.1D. NO. 4 (human short form/mouse species chimeric MCH1R) or SEQ. ID. NO. 5 (human Long form/mouse species chimeric). Even more preferably, the protein comprises, consists essentially of, or consists of, SEQ. )D. NO.
4 or SEQ. ID. NO. 5.
Sequence similarity for polypeptides can be determined by BLAST.
(Altschul et al., 1997. Nucleic Acids Res. 25, 3389-3402, hereby incorporated by reference herein.) In an embodiment of the present invention, sequence similarity is determined using tBLASTn search program with the following parameters:
MATRIX:BLOSLTM62, PER RESIDUE GAP COST: 11, and Lambda ratio: 1.
Differences in naturally occurring amino acids are due to different R
groups. An R group effects different properties of the amino acid such as physical size, charge, and hydrophobicity. Amino acids can be divided into different groups as follows: neutral and hydrophobic (alanine, valine, leucine, isoleucine, proline, tryptophan, phenylalanine, and methionine); neutral and polar (glycine, serine, threonine, tyrosine, cysteine, asparagine, and glutamine); basic (lysine, arginine, and histidine); and acidic (aspartic acid and glutamic acid).
Generally, in substituting different amino acids it is preferable to exchange amino acids having similar properties. Substituting different amino acids within a particular group, such as substituting valine for leucine, arginine for lysine, and asparagine for glutamine are good candidates for not causing a change in polypeptide functioning.
Changes outside of different amino acids groups can also be made.
Preferably, such changes are made taking into account the position of the amino acid to be substituted in the polypeptide. For example, arginine can substitute more freely for nonpolor amino acids in the interior of a polypeptide then glutamate because of its long aliphatic side chain. (See, Ausubel, Current Protocols in Molecular Biology, John Wiley, 1987-1998, Supplement 33 Appendix 1C.) MCH-R Fusion Proteins MCH-R fusion proteins contain an MCH-R polypeptide region and a fluorescent protein region either directly joined together or joined together through a linker. These regions provide MCH-R activity and a marker for evaluating MCH-R
dynamics.
An MCH-R polypeptide region provides functional MCH-R activity and includes naturally occurring MCH-R, chimeric MCH-R, and derivatives thereof.
Preferred derivatives thereof have a sequence similarity of at least about 75%, at least about 85%, or at least about 95% to a naturally occurring MCH-R or a chimeric MCH-R described herein.
A fluorescent protein region contains a chromophore that fluoresces.
Preferably, the fluorescent protein region is the green fluorescent protein of the jellyfish Aequorea victories or a derivative thereof. Preferred derivatives have a sequence similarity of at least about 75%, at least about 85%, or at least about 95% to the Aequorea victories green fluorescent protein (GFP). The Aequorea victories green fluorescent protein and examples of derivatives thereof are described by Cormack et al., 1996. Gene 17, 33-38; Yang et al., 1996. Nucleic Acids Research 24, 4592-4593;
Tsien et al., U.S. Patent No. 5,625,048; Tsien et al., U.S. Patent No.
5,777,079; and Cormack et al., U.S. Patent No. 5,804,387 (each of which are hereby incorporated by reference herein).
In different embodiments the MCH-R polypeptide region comprises, consists essentially of, or consists of, a sequence selected from the group consisting of: SEQ. m. NO. 1, SEQ. m. NO. 2, SEQ.1D. NO. 3, SEQ. m. NO. 4, and SEQ. m.
NO. 5; and the fluorescent polypeptide region comprises, consists essentially of, or consists of, an amino acid sequence selected from the group consisting of SEQ.
m.
NO. 6 (GFP), SEQ. ID. NO. 7 (EGFP), SEQ. m. NO. 8 (Emerald), SEQ. m. NO. 9 (Topaz), and SEQ. )D. NO. 10 (Wlb). EGFP, Emerald, Topaz, and Wlb are derivatives of GFP.
The optionally present linker is a polypeptide region that is preferably from 1 to about 100 amino acids in length. In different embodiments the linker is up to 75, 50 or 25 amino acids in length.
Preferably, the MCH-R fusion protein comprises, consists essentially of, or consists of, the MCH-R polypeptide region and the fluorescent polypeptide region. More preferably, the protein comprises, consists essentially of, or consists of, _7_ an amino acid sequence selected from the group consisting of: SEQ. ID. NO. 11 (mouse MCH1R-linker-EGFP), SEQ. ID. NO. 12 (mouse MCH1R/EGFP direct fusion), SEQ. ID. NO. 13 (human short form/mouse species chimeric MCH1R-linker-EGFP), or SEQ. ID. NO. 14 (human long form/mouse species chimeric MCH1R-linker-EGFP).
MCH-R Chimeric and Fusion Proteins Nucleic Acid and Expression MCH-R chimeric and fusion proteins can be produced using techniques well known in the art. Preferably, such proteins are produced by recombinant expression inside a host cell by way of an expression vector or by way of nucleic acid integrated into the host genome. Examples of nucleic acid sequences encoding for MCH-R polypeptide regions, fluorescent protein regions, MCH-R
chimeric proteins, and MCH-R fusion proteins are provided for by SEQ. ID. NOs.
29 (see Example 1, infra).
Starting with a particular amino acid sequence and the known degeneracy of the genetic code, a large number of different encoding nucleic acid sequences can be obtained. The degeneracy of the genetic code arises because almost all amino acids are encoded for by different combinations of nucleotide triplets or codons. The translation of a particular codon into a particular amino acid is well known in the art (see, e.g., Lewin GENES IV, p. 119, Oxford University Press, 1990).
Amino acids are encoded for by codons as follows:
A=Ala=Alanine: codons GCA, GCC, GCG, GCU
C=Cys=Cysteine: codons UGC, UGU
D=Asp=Aspartic acid: codons GAC, GAU
E=Glu=Glutamic acid: codons GAA, GAG
F=Phe=Phenylalanine: codons UUC, UUU
G=Gly=Glycine: codons GGA, GGC, GGG, GGU
H=His=Histidine: codons CAC, CAU
I=Ile=Isoleucine: codons AUA, AUC, AUU
K=Lys=Lysine: codons AAA, AAG
L=Leu=Leucine: codons UUA, UUG, CUA, CUC, CUG, CUU
M=Met=Methionine: codon AUG
N=Asn=Asparagine: codons AAC, AAU
P=Pro=Proline: codons CCA, CCC, CCG, CCU
Q=Gln=Glutamine: codons CAA, CAG
_g_ R=Arg=Arginine: codons AGA, AGG, CGA, CGC, CGG, CGU
S=Ser=Serine: codons AGC, AGU, UCA, UCC, UCG, UCU
T=Thr=Threonine: codons ACA, ACC, ACG, ACU
V=Val=Valine: codons GUA, GUC, GUG, GUU
W=Trp=Tryptophan: codon UGG
Y=Tyr=Tyrosine: codons UAC, UAU
Examples of techniques for introducing nucleic acid into a cell and expressing the nucleic acid to produce protein are provided in references such as Ausubel, Current Protocols ifa Molecular Biology, John Wiley, 1987-1998, and Sambrook, et al., in Molecular Cloni~eg, A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory Press, 1989.
An expression vector contains recombinant nucleic acid encoding for a polypeptide along with regulatory elements for proper transcription and processing.
The recombinant nucleic acid contains two or more nucleic acid regions not naturally associated with each other. Exogenous regulatory elements such as an exogenous promoter can be useful for expressing recombinant nucleic acid in a particular host.
Examples of expression vectors are cloning vectors, modified cloning vectors, specifically designed plasmids, and viruses.
Generally, the regulatory elements that are present in an expression vector include a transcriptional promoter, a ribosome binding site, a terminator, and an optionally present operator. Another preferred element is a polyadenylation signal providing for processing in eukaryotic cells. Preferably, an expression vector also contains an origin of replication for autonomous replication in a host cell, a selectable marker, a limited number of useful restriction enzyme sites, and a potential for high copy number.
Expression vectors providing suitable levels of polypeptide expression in different hosts are well known in the art. Mammalian expression vectors well known in the art include pcDNA3 (Invitrogen), pMClneo (Stratagene), pXT1 (Stratagene), pSGS (Stratagene), EBO-pSV2-neo (ATCC 37593), pBPV-1(8-2) (ATCC 37110), pdBPV-MMTneo(342-12) (ATCC 37224), pRSVgpt (ATCC 37199), pRSVneo (ATCC 37198), pSV2-dhfr (ATCC 37146), pUCTag (ATCC 37460), pCI-neo (Promega) and .lambda.ZD35 (ATCC 37565). Bacterial expression vectors well known in the art include pETlla (Novagen), lambda gtll (Invitrogen), pcDNAII
(Invitrogen), and pKK223-3 (Pharmacia). Fungal cell expression vectors well known _g_ in the art include pYES2 (Invitrogen) and Pichia expression vector (Invitrogen).
Insect cell expression vectors well known in the art include Blue Bac III
(Invitrogen).
Recombinant host cells may be prokaryotic or eukaryotic. Examples of recombinant host cells include the following: bacteria such as E. coli;
fungal cells such as yeast; mammalian cells such as human, bovine, porcine, monkey, hampster, and rodent; and insect cells such as Drosophila and silkworm derived cell lines.
Commercially available mammalian cell lines include L cells L-M(TI~-) (ATCC
CCL 1.3), L cells L-M (ATCC CCL 1.2), 293 (ATCC CRL 1573), Raji (ATCC CCL
86), CV-1 (ATCC CCL 70),,COS-1 (ATCC CRL 1650), COS-7 (ATCC CRL 1651), CHO-K1 (ATCC CCL 61), 3T3 (ATCC CCL 92), NIH/3T3 (ATCC CRL 1658), HeLa (ATCC CCL 2), C127I (ATCC CRL 1616), BS-C-1 (ATCC CCL 26) and MRC-5 (ATCC CCL 171).
To enhance expression in a particular host it may be useful to modify the sequence to take into account codon usage of the host. Codon usage of different organisms are well known in the art. (See, Ausubel, Current Protocols ih Molecular Biology, John Wiley, 1987-1998, Supplement 33 Appendix 1C.) Expression vectors may be introduced into host cells using standard techniques. Examples of such techniques include transformation, transfection, lipofection, protoplast fusion, and electroporation.
Nucleic acid encoding for a polypeptide can be expressed in a cell without the use of an expression vector employing, for example, synthetic mRNA
or native mRNA. Additionally, mRNA can be translated in various cell-free systems such as wheat germ extracts and reticulocyte extracts, as well as in cell based systems, such as frog oocytes. Introduction of mRNA into cell based systems can be achieved, for example, by microinjection.
Techniques for producing transgenic animals are well known in the art.
Examples of such techniques are provided for by Teratocarcinomas and embryonic stem cells: a practical approach. Ed. By E. J. Robertson, IRL Press Limited, Oxford, England (1987); and Gene Targeting: a practical approach. Ed. By A. L. Joyner, Oxford University Press Inc. New York, NY (1993).
G-Protein Coupled Receptor Assaxs MCH-R is G-protein coupled receptor. Techniques for measuring different G-protein activities, such as Gi/o, Gs, and Gq are well known in the art.
MCH-R activity is preferably assayed for by measuring either Gi/o or Gq.
Gi/o and Gs activity can be measured using techniques such as a melonaphore assay, measuring cAMP production, measuring inhibition of cAMP
accumulation, and measuring binding of 35S-GTP. cAMP can be measured using different techniques such as radioimmunoassay and indirectly by cAMP
responsive gene reporter proteins.
Gq activity can be measured using techniques such as those measuring intracellular Ca2+. Examples of techniques well known in the art that can be employed to measure Ca2+ include the use of dyes such as Fura-2 and the use of Ca2+-bioluminescent sensitive reporter proteins such as aequorin. An example of a cell line employing aequorin to measure G-protein activity is HEK293/aeql7. (Button et al.,1993. Cell Calcium 14, 663-671, and Feighner et al., 1999. Scief2ce 284, 2188, both of which are hereby incorporated by reference herein.) Functional assays can be performed using individual compounds or preparations containing different compounds. A preparation containing different compounds where one or more compounds affect MCH-R chimeric or fusion protein activity can be divided into smaller groups of compounds to identify the compounds) affecting MCH-R chimeric or fusion protein activity. In an embodiment of the present invention a test preparation containing at least 10 compounds is used in a functional assay.
Functional assays can be performed using recombinantly produced MCH-R chimeric or fusion protein present in different environments. Such environments include, for example, cell extracts and purified cell extracts containing the MCH-R chimeric or fusion protein expressed from recombinant nucleic acid and an appropriate membrane for the polypeptide; and the use of a purified MCH-R
chimeric or fusion protein produced by recombinant means that is introduced into a different environment suitable for measuring G-protein activity.
Fluorescent Protein Assays Fluorescent protein joined to an MCH receptor can be employed to study different aspects of receptor dynamics including receptor sequestration, receptor densitization, and receptor localization. The fluorescent protein can be used in ira vitro or in vivo systems.
ha vitro applications of fluorescent proteins can be performed using techniques well known in the art. Examples of such techniques are provided by Barak et al., 1997. Mol PlaaYm. S, 177-184; Tarasova et al., 1997. J. Biol. Chem.
272, 14817-14824; Lin et al., 1998. Mol. Cell. Endo. 146, 27-37; Tarasova et al., 1998. J.
Biol. Chenz. 273, 15883-15886; Kallal et al., 1998. J. Biol. Chem. 273, 322-328;
Groake et al., 1999. J. Biol. Chem. 274, 23263-23269; Doherty et al., 1999.
Biochem. J. 341, 415-422; Brock et al., 1999. Proc. Natl. Acad. Sci. USA 96, 10128; Cornea et al., 1999. Endocri~zology 140, 4272-4280; and Lembo et al., 1999.
Nat. Cell Biol. 1, 267-271 (these references are not admitted to be prior art to the claimed invention).
IfZ vivo applications of fluorescent proteins can be performed using techniques well known in the art. Examples of such techniques are provided by Mombaerts et al., 1996. Cell 87, 675-686; Rodriquez et al., 1999. Cell 97, 199-208;
Spergel et al., 1999. J. Neurosci. l, 2037-2050; and Zuo et al., 1999. Proc.
Natl.
Acad. Sci. USA 96, 14100-14105 (these references are not admitted to be prior art to the claimed invention).
EXAMPLES
Examples are provided below to further illustrate different features and advantages of the present invention. The examples also illustrate useful methodology for practicing the invention. These examples do not limit the claimed invention.
Example 1:, Amino acid and nucleic acid sequence information for SEQ..ID. NOs.
1-29 are provided below. SEQ. ~. NOs. 1-29 include examples of polypeptide and encoding nucleic acid sequences for MCH-R polypeptide regions, fluorescent polypeptide regions, fusion proteins and chimeric proteins. In some cases the encoding nucleic acid is shown with additional nucleic acid upsteam or downstream from an open reading frame.
SEQ. ID. N0.1: Human long form MCH1R
MS VGAMKKGVGRAVGLGGGSGCQATEEDPLPNCGACAPGQGGRRWRLPQP
AWVEGSSARLWEQATGTGWMDLEASLLPTGPNASNTSDGPDNLTSAGSPPR
LLFLLGMPFMIHQLMGNGV WHFGETMCTLITAMDANS QFTSTYILTAMAmR
YLATVHPISSTKFRKPSVATLVICLLWALSFISITPVWLYARLIPFPGGAVGCGI
RLPNPDTDLYWFTLYQFFLAFALPFV VITAAYVRILQRMTS S VAPAS QRSIRLR
TKRVTRTAIAICLVFFVCWAPYYVLQLTQLSISRPTLTFVYLYNAAISLGYANS
CLNPFVYIVLCETFRKRLVLS VKPAAQGQLRAVSNAQTADEERTESKGT
SEQ. ID. NO. 2: Human short form MCH1R
MDLEASLLPTGPNASNTSDGPDNLTSAGSPPRTGSISYINmVIPSVFGTICLLGIIG
NSTVIFAV VKKSKLHW CNNVPDIFBNLS V VDLLFLLGMPFMIHQLMGNGV WH
FGETMCTLITAMDANSQFTSTYILTAMAlDRYLATVHPISSTKFRKPSVATLVI
CLLWALSFISITPVWLYARLIPFPGGAVGCGIRLPNPDTDLYWFTLYQFFLAFA
LPFV VITAAYVRILQRMTS S VAPAS QRSIRLRTKRVTRTAIAICLVFFV C WAPY
YVLQLTQLSISRPTLTFVYLYNAAISLGYANSCLNPFVYIVLCETFRKRLVLSV
KPAAQGQLRAVSNAQTADEERTESKGT
SEQ. ID. NO. 3: Mouse MCH1R
MDLQASLLS TGPNASNISDGQDNFTLAGPPPRTRS VS YIIVIIMPS VFGTICLLGI
VGNSTVIFAVVKKSKLHWCSNVPDIFIINLSVVDLLFLLGMPFMIHQLMGNGV
WHFGETMCTLITAMDANSQFTSTYILTAMAIDRYLATVHPISSTKFRKPSMAT
LVICLLWALSFISITPVWLYARLIPFPGGAVGCGIRLPNPDTDLYWF TLYQFFLA
FALPFV VITAAYVKILQRMTS S VAPAS QRSIRLRTKRVTRTAIAICLVFFV C WA
PYYVLQLTQLSISRPTLTFVYLYNAAISLGYANSCLNPFVYIVLCETFRKRLVLS
VKPAAQGQLRTVSNAQTADEERTESKGT
SEQ. ID. NO. 4: Human short form/mouse species chimeric MCH1R
MDLEASLLPTGPNASNTSDGPDNLTSAGSPPRTGSISYIIVI>MPS VFGTICLLGIIG
FGETMCTLITAMDANSQFTSTYILTAMAIDRYLATVHPISSTKFRKPSMATLVI
CLLWALSFISITPVWLYARLIPFPGGAVGCGIRLPNPDTDLYWFTLYQFFLAFA
YVLQLTQLSISRPTLTFVYLYNAAISLGYANSCLNPFVYIVLCETFRKRLVLSV
KPAAQGQLRTVSNAQTADEERTESKGT
SEQ. TD. NO. 5: Human long form/mouse species chimeric MCH1R
MSVGAMKKGVGRAVGLGGGSGCQATEEDPLPNCGACAPGQGGRRWRLPQP
AWVEGSSARLWEQATGTGWMDLEASLLPTGPNASNTSDGPDNLTSAGSPPR
TGSISYINIIMPSVFGTICLLGIIGNSTVIFAVVKKSKLHWCNNVPDIFIINLSVVD
T.T.FT-T.GMPFM113QLMGNGVWHFGETMCTLITAMDANSQFTSTYILTAMAIDR
YLATVHPIS STKFRKPSMATLVICLLWALSFISITPV WLYARLIPFPGGAV GCGI
RLPNPDTDLYWFTLYQFFLAFALPFV VITAAYVKILQRMTS S VAPAS QRSIRLR
TKRVTRTAIAICLVFFVCWAPYYVLQLTQLSISRPTLTFVYLYNAAISLGYANS
CLNPFVYIVLCETFRKRLVLS VKPAAQGQLRTVSNAQTADEERTESKGT
SEQ. ID. NO. 6: GFP
MSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKL
PVPWPTLVTTFSYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGN
KNGIKVNFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKD
PNEKRDHMVLLEFVTAAGITHGMDELYK
SEQ. ID. NO. 7: EGFP
MVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGK
LPVPWPTLVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDG
NYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSHNVYIMADK
QKNGIKVNFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSK
DPNEKRDHMVLLEFVTAAGITLGMDELYK
SEQ. ID. NO. 8: Emerald Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu VaI Lys Phe Glu GIy Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn lle Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Lys Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro lle Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys SEQ. ID. NO. 9: Topaz Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe Gly Tyr Gly Val Gln Cys Phe Ala Arg Tyr Pro Asp His Met Arg Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Va1 Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr GIn Gln Asn Thr Pro Ile Gly Asp Gly Pro Va1 Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gln Sex Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys SEQ. ID. N0.10: W1B
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Arg Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Trp Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr lle Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Ile Ser His Asn Val Tyr Ile Thr Ala Asp Lys GIn Lys Asn Gly Ile Lys Ala His Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu AIa Asp His Tyr GIn Gln Asn Thr Pro Ile GIy Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys SEQ. ID. N0.11: Mouse MCH1R-linker-EGFP
MDLQASLLSTGPNASNISDGQDNFTLAGPPPRTRSVSYINmVIPSVFGTICLLGI
VGNSTVIFAVVKKSKLHWCSNVPDIFaNLSVVDLLFLLGMPFMIHQLMGNGV
WHFGETMCTLITAMDANS QFTSTYILTAMAIDRYLATVHPIS STI~FRRKPSMAT
LVICLLWALSFISITPVWLYARLIPFPGGAVGCGIRLPNPDTDLYWF'TLYQFFLA
FALPFVVITAAYVKILQRMTSSVAPASQRSIRLRTKRVTRTAIAICLVFFVCWA
PYYVLQLTQLSISRPTLTFVYLYNAAISLGYANSCLNPFVYIVLCETFRKRLVLS
VKPAAQGQLRTVSNAQTADEERTESKGTVDGTAGPGSIATMVSKGEELFTGV
VPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTL
TYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFE
HNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVL
LEFVTAAGITLGMDELYK
SEQ. ID. N0.12: Mouse MCH1R/EGFP direct fusion MDLQASLLSTGPNASNISDGQDNFTLAGPPPRTRS V S YIIVIIMPS VFGTICLLGI
WHFGETMCTLITAMDANSQFTSTYILTAMA>DRYLATVHPISSTKFRKPSMAT
LVICLLWALSFISITPVWLYARLIPFPGGAVGCGIRLPNPDTDLYWFTLYQFFLA
FALPFVVITAAYVKILQRMTSSVAPASQRSIRLRTKRVTRTAIAICLVFFVCWA
PYYVLQLTQLSISRPTLTFVYLYNAAISLGYANSCLNPFVYIVLCETFRKRLVLS
VKPAAQGQLRTVSNAQTADEERTESKGTMVSKGEELFTGVVPILVELDGDVN
GHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTLTYGVQCFSRYPD
HYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITLG
MDELYK
SEQ. ID. N0.13: Human short form/mouse species chimeric MCH1R-linker-EGFP
MDLEASLLPTGPNASNTSDGPDNLTSAGSPPRTGSISYINIIn~IPSVFGTICLLGIIG
FGETMCTLITAMDANSQFTSTYII,TAMAIDRYLATVHPISSTKFRKPSMATLVI
CLLWALSFISITPVWLYARLIPFPGGAVGCGIRLPNPDTDLYWF'TLYQFFLAFA
LPFVVITAAYVKILQRMTSS VAPASQRSIRLRTKRVTRTAIAICLVFFVCWAPY
YVLQLTQLSISRPTLTFVYLYNAAISLGYANSCLNPFVYIVLCETFRKRLVLS V
KPAAQGQLRTVSNAQTADEERTESKGTVDGTAGPGSIATMVSKGEELFTGVV
PILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTLT
NIEDGS VQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLL
EFVTAAGITLGMDELYK
SEQ. ID. N0.14: Human Iong form/mouse species chimeric MCHIR-linker-EGFP
MS VGAMKKGVGRAVGLGGGSGCQATEEDPLPNCGACAPGQGGRRWRLPQP
AWVEGSSARLWEQATGTGWMDLEASLLPTGPNASNTSDGPDNLTSAGSPPR
TGSISYINInVIPSVFGTICLLGIIGNSTVIFAVVKKSKLHWCNNVPDIFaNLSVVD
LLFLLGMPFMIHQLMGNGVWHFGETMCTLITAMDANSQFTSTYILTAMAIDR
YLATVHPISSTKFRKPSMATLVICLLWALSFISITPVWLYARLIPFPGGAVGCGI
RLPNPDTDLYWFTLYQFFLAFALPFV VITAAYVKILQRMTS S VAPAS QRSIRLR
TKRVTRTAIAICLVFFVCWAPYYVLQLTQLSISRPTLTFVYLYNAAISLGYANS
CLNPFVYIVLCETFRKRLVLSVKPAAQGQLRTVSNAQTADEERTESKGTVDGT
AGPGSIATMVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLT
LKFICTTGKLPVPWPTLVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQE
RTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSH
LSTQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYK
SEQ. ID. N0.15: Human long form MCH1R cDNA
ATGTCAGTGGGAGCCATGAAGAAGGGAGTGGGGAGGGCAGTTGGGCTTG
GAGGCGGCAGCGGCTGCCAGGCTACGGAGGAAGACCCCCTTCCCAACTGC
GGGGCTTGCGCTCCGGGACAAGGTGGCAGGCGCTGGAGGCTGCCGCAGC
CTGCGTGGGTGGAGGGGAGCTCAGCTCGGTTGTGGGAGCAGGCGACCGG
CACTGGCTGGATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAACG
CCAGCAACACCTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCT
CCTCGCACGGGGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTC
GGCACCATCTGCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCG
GTCGTGAAGAAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTT
CATCATCAACCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTT
CATGATCCACCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCA
TGTGCACCCTCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACC
TACATCCTGACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCC
ATCTCTTCCACGAAGTTCCGGAAGCCCTCTGTGGCCACCCTGGTGATCTGC
CTCCTGTGGGCCCTCTCCTTCATCAGCATCACCCCTGTGTGGCTGTATGCC
AGACTCATCCCCTTCCCAGGAGGTGCAGTGGGCTGCGGCATACGCCTGCC
CAACCCAGACACTGACCTCTACTGGTTCACCCTGTACCAGTTTTTCCTGGC
CTTTGCCCTGCCT'TTTGTGGTCATCACAGCCGCATACGTGAGGATCCTGCA
GCGCATGACGTCCTCAGTGGCCCCCGCCTCCCAGCGCAGCATCCGGCTGC
GGACAAAGAGGGTGACCCGCACAGCCATCGCCATCTGTCTGGTCTTCTTT
GTGTGCTGGGCACCCTACTATGTGCTACAGCTGACCCAGTTGTCCATCAGC
CGCCCGACCCTCACCTTTGTCTACTTATACAATGCGGCCATCAGCTTGGGC
TATGCCAACAGCTGCCTCAACCCCTTTGTGTACATCGTGCTCTGTGAGACG
TTCCGCAAACGCTTGGTCCTGTCGGTGAAGCCTGCAGCCCAGGGGCAGCT
TCGCGCTGTCAGCAACGCTCAGACGGCTGACGAGGAGAGGACAGAAAGC
AAAGGCACCTGA
SEQ. ID. N0.16: Human short form MCH1R cDNA
ATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAATGCCAGCAACAC
CTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCTCCTCGCACGG
GGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTCGGCACCATCT
GCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCGGTCGTGAAG
AAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTTCATCATCAA
CCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTTCATGATCCA
CCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCATGTGCACCC
TCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACCTACATCCTG
ACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCCATCTCTTCC
ACGAAGTTCCGGAAGCCCTCTGTGGCCACCCTGGTGATCTGCCTCCTGTGG
GCCCTCTCCTTCATCAGCATCACCCCTGTGTGGCTGTATGCCAGACTCATC
CCCTTCCCAGGAGGTGCAGTGGGCTGCGGCATACGCCTGCCCAACCCAGA
CACTGACCTCTACTGGTTCACCCTGTACCAGTTTTTCCTGGCCTTTGCCCTG
CCTTTTGTGGTCATCACAGCCGCATACGTGAGGATCCTGCAGCGCATGAC
GTCCTCAGTGGCCCCCGCCTCCCAGCGCAGCATCCGGCTGCGGACAAAGA
GGGTGACCCGCACAGCCATCGCCATCTGTCTGGTCTTCTTTGTGTGCTGGG
CACCCTACTATGTGCTACAGCTGACCCAGTTGTCCATCAGCCGCCCGACCC
TCACCTTTGTCTACTTATACAATGCGGCCATCAGCTTGGGCTATGCCAACA
GCTGCCTCAACCCCTTTGTGTACATCGTGCTCTGTGAGACGTTCCGCAAAC
GCTTGGTCCTGTCGGTGAAGCCTGCAGCCCAGGGGCAGCTTCGCGCTGTC
AGCAACGCTCAGACGGCTGACGAGGAGAGGACAGAAAGCAAAGGCACCT
GA
_18_ SEQ. ID. N0.17: Mouse MCH1R cDNA
Nucleic acid sequence start and stop codons are highlighted:
GGCGGTAGAGGAAGACCCTTTTCTGGACTGCGGGGCTCAAGCTCCGGACA
AGGCGGTGGAGGGCGCTGGAGGCTGCCGCAGCCTGCGTGGGTGGACGGG
CGCTCCACTCCAGGGAGCAGGCGACCTGCACCGGCTGCATGGATCTGCAA
GCCTCGTTGCTGTCCACTGGCCCCAATGCCAGCAACATCTCCGATGGCCA
GGATAATTTCACATTGGCGGGGCCACCTCCTCGCACAAGGAGTGTCTCCT
ACATCAACATCATCATGCCTTCAGTGTTTGGTACCATCTGTCTCCTGGGCA
TTGTGGGAAACTCCACAGTCATTTTTGCCGTGGTGAAGAAATCCAAGCTG
CACTGGTGCAGCAACGTCCCTGACATCTTCATCATCAACCTCTCTGTGGTG
GATCTGCTTTTCCTGCTGGGCATGCCTTTCATGATCCACCAGCTCATGGGT
AATGGTGTCTGGCACTTTGGGGAAACCATGTGCACCCTCATCACAGCCAT
GGACGCCAACAGTCAGTTCACCAGCACCTACATCCTGACTGCTATGGCCA
TTGACCGCTACTTGGCCACCGTCCATCCCATCTCCTCCACCAAGTTCCGGA
AGCCCTCCATGGCCACCCTGGTGATCTGCCTCCTGTGGGCTCTCTCGTTCA
TTAGCATCACTCCTGTGTGGCTCTATGCCAGGCTTATCCCCTTCCCAGGGG
GTGCTGTGGGCTGTGGCATCCGCCTACCAAACCCAGATACTGATCTTTACT
GGTTCACTCTGTATCAGTTTTTCCTGGCCTTCGCCCTTCCGTTTGTGGTCAT
CACTGCTGCGTACGTGAAAATACTACAGCGCATGACGTCTTCGGTGGCCC
CAGCCTCTCAACGCAGCATCCGGCTTCGGACAAAGAGGGTGACCCGCACA
GCCATTGCCATCTGTCTGGTCTTCTTTGTGTGCTGGGCGCCCTACTACGTG
CTGCAGCTGACCCAGTTGTCCATCAGCCGCCCGACCCTCACATTCGTCTAC
CTGTACAATGCGGCCATCAGCTTGGGCTATGCCAACAGCTGCCTCAATCC
CTTTGTGTACATAGTACTCTGTGAGACCTTTCGAAAACGCTTGGTGCTGTC
GGTGAAGCCCGCGGCCCAGGGGCAGCTTCGCACGGTCAGCAATGCTCAGA
CAGCTGACGAGGAGAGGACAGAAAGCAAAGGCACCTGACAATCCCCCCC
GGTCACCTCCAAGTCAGGTCACCGCATCAAACCATGGGGAGAGATACTGA
GATAAACCCGGGGCTACCCTGGGAGGATGCAGAAGCTGGAGGCTGGGGG
CTTGTAGCAAACCACATTCCACGGGGCCCACAAATTGCTAGGGAGGCTTG
CAGCCTGGTTTGGGGGGGAAGCCTCAGACTGCAGGGATCCCCTTGACAGA
ATAGAAGCGGAGCAAGAAGGAAAGGGTGGTTTGACTGGTTCTCGGGGTCT
GTATCTGTTGGCTCGCATATATCTTTCTCTCAAGGGAAGAAGGCGGAGGT
GCCTAGCTGGGTTCCTTTAAAACTAGGCAGGGCTAGGATCTGAGCAGCTA
GGGCTCTACTGTGAGACTGGGCAAGCCGAGCGTTCCCTCCCATCTCTCATT
GGTGTTGATAGAAGGCAGTCTTTCTCCCAAGCTGGTGGATCTCCTGAAGC
ACGCTGCCTGGGCTCCAGCATCCTGTGCGGATTTCACGTTCTCTTTAGGGG
ATGCATGTTGACACTGGGGTGTGGGCTCTGAGCCCACAGGAGTTTAAAAA
ACCAAAAGAGCTCAGAGTGTCGAGAGAGACCCAATCACCGAGAATGACA
AGGCAACCTGGGGTGGATGTGGATCTTGAAACTAATAAAAAGGGGTTTTC
ACAGTGACAGCGACATTCTCTTCATAGGGCACAGCTGTCAGTCTATGGCT
GATCCAGAGCGAGCATCCATGAATTCTGCATGTGCAGGGGTCACTCTAAT
ACCTGATATGTTGGCATCATCTTTGTGCTTGAGCCTTCCNCTCCCAAATGG
GAATGAAATAAAGGCAAATTCCCNCCCCCCCCAAAAAAGGGGNAAAAAA
AAAAAAAAAAAAAAAAAAAA
SEQ. ID. N0.18: Mouse MCH1R genomic DNA
Nucleic acid sequence start and stop codons, as well as intron borders, are highlighted:
GGCGGTAGAGGAAGACCCTTTTCTGGACTGCGGGGCTCAAGCTCCGGACA
AGGCGGTGGAGGGCGCTGGAGGCTGCCGCAGCCTGCGTGGGTGGACGGG
CGCTCCACTCCAGGGAGCAGGCGACCTGCACCGGCTGCATGGATCTGCAA
GCCTCGTTGCTGTCCACTGGCCCCAATGCCAGCAACATCTCCGATGGCCA
GGATAATTTCACATTGGCGGGTGAGTCGAGTTGGAGTCCTCCCTCCTCCG
GGATGGGTGTGGAAAATGGGAAGGTTTCACCTCCCAAGCCAAACTGCCTG
GGAAACTTTATCTTACAGTTCTTGGTGATAAGATCTGCAGTCGGCTTTGCC
TGAAGAGGAAGAGGAGAGGAGGGGACACCAGCTAGGACAGAAGGGGCA
GGGAGGAATAGAGATGGGGCAGAGGCACATTTAGAAACAACAAGGGTTG
GTGACAAGACGTGAGGCAGGCTTGAGGGGAAAGCTTGCTGATGAGTCCCA
AATATGCTTTGCAGGGGGGGGGGGGGGGGAATCAAGGCTGGAGAAGCAA
GCAAGCAAGACAGCAAGACAGCGGGCGGGTAGTATGTGGGAGCCAGCAG
AAGCGCTTTGATTCACCGCTATCCTGGGCTCAATCCTCTGGCCTCGCACTG
GGGAAATGGGGTCTGAGTGGTCCTTGCTGTCTTCTGGCAAAGGCTGCTGG
GAGCAAAAGACTTCACAGGGCGTGAGAGGATTAACTTTTCTGGTGAATTA
AGCTTCTTGACATTTGCAGAACGTCAATGCCTTAAAATTCTAGCTCTGAAG
GAGAAGGGAATGAAGGGGAAAGAGGGAAGGTTGGTGTGGAGAAATTCCC
AAGCTTCTGGGGTGTAACACAGCTCCAGTCCCTACCCTATTGGGAAAGCC
CAGACTCAGGAGACATGGTCCAAGGAAATCCCTGACAGAAAACCGGGAG
AGGGCAGGGCTGTGGAGCCTGAAACACACCCCACACCCATGGTGACAGTC
ACTTCTCACATATGCCTAGGAACCTATCTGAAACCTTTGGCCATCTCTCTC
TGAAAAGATGAGGCTGCAAATACACACACACACACACACACACACACAC
ACACACACACACACACACACACACACACACACACAAATGTCCTTCAAGCC
TTTT'TGACAAGGTTTTCTGGTGGATCCCGGGGATATGAAGTTGTTCTCAGC
AGATATCTGGGAGTCTTGACTCCTGGCCCTCTGAGTAAATGGATGAAGCG
AAGAAGAATGGGGTCCTCTGAGTAACAGGTGGATCTAGAAAATCCTATAG
GAGTCACCAGGGCACGGTGGAGGAGGGTAAGGTACAGAACTAACAATAG
CCCGAGAAGGGGAAACAGCAGGAGATGATTCCAGAGACGTAGTGACCCC
AAGCTGCAAGGGAAAGCATGAGGGGCCAGCAGGAAGGCCGACATGGCAG
GTTGTCAGCTTCTAGATCGGAAGGCGGGTCACACTTGCTCTTTCTATCCTC
AGGGCCACCTCCTCGCACAAGGAGTGTCTCCTACATCAACATCATCATGC
CTTCAGTGTTTGGTACCATCTGTCTCCTGGGCATTGTGGGAAACTCCACAG
TCATTTTTGCCGTGGTGAAGAAATCCAAGCTGCACTGGTGCAGCAACGTC
CCTGACATCTTCATCATCAACCTCTCTGTGGTGGATCTGCTTTTCCTGCTGG
GCATGCCTTTCATGATCCACCAGCTCATGGGTAATGGTGTCTGGCACTTTG
GGGAAACCATGTGCACCCTCATCACAGCCATGGACGCCAACAGTCAGTTC
ACCAGCACCTACATCCTGACTGCTATGGCCATTGACCGCTACTTGGCCACC
GTCCATCCCATCTCCTCCACCAAGTTCCGGAAGCCCTCCATGGCCACCCTG
GTGATCTGCCTCCTGTGGGCTCTCTCGTTCATTAGCATCACTCCTGTGTGG
CTCTATGCCAGGCTTATCCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATC
CGCCTACCAAACCCAGATACTGATCTTTACTGGTTCACTCTGTATCAGTTT
TTCCTGGCCTTCGCCCTTCCGTTTGTGGTCATCACTGCTGCGTACGTGAAA
ATACTACAGCGCATGACGTCTTCGGTGGCCCCAGCCTCTCAACGCAGCAT
CCGGCTTCGGACAAAGAGGGTGACCCGCACAGCCATTGCCATCTGTCTGG
TCTTCTTTGTGTGCTGGGCGCCCTACTACGTGCTGCAGCTGACCCAGTTGT
CCATCAGCCGCCCGACCCTCACATTCGTCTACCTGTACAATGCGGCCATCA
GCTTGGGCTATGCCAACAGCTGCCTCAATCCCTTTGTGTACATAGTACTCT
GTGAGACCTTTCGAAAACGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAG
GGGCAGCTTCGCACGGTCAGCAATGCTCAGACAGCTGACGAGGAGAGGA
CAGAAAGCAAAGGCACCTGACAATCCCCCCCGGTCACCTCCAAGTCAGGT
CACCGCATCAAACCATGGGGAGAGATACTGAGATAAACCCGGGGCTACC
CTGGGAGGATGCAGAAGCTGGAGGCTGGGGGCTTGTAGCAAACCACATTC
CACGGGGCCCACAAATTGCTAGGGAGGCTTGCAGCCTGGTTTGGGGGGGA
AGCCTCAGACTGCAGGGATCCCCTTGACAGAATAGAAGCGGAGCAAGAA
GGAAAGGGTGGTTTGACTGGTTCTCGGGGTCTGTATCTGTTGGCTCGCATA
TATCTTTCTCTCAAGGGAAGAAGGCGGAGGTGCCTAGCTGGGTTCCTTTA
AAACTAGGCAGGGCTAGGATCTGAGCAGCTAGGGCTCTACTGTGAGACTG
GGCAAGCCGAGCGTTCCCTCCCATCTCTCATTGGTGTTGATAGAAGGCAG
TCTTTCTCCCAAGCTGGTGGATCTCCTGAAGCACGCTGCCTGGGCTCCAGC
ATCCTGTGCGGATTTCACGTTCTCTTTAGGGGATGCATGTTGACACTGGGG
TGTGGGCTCTGAGCCCACAGGAGTTTAAAAAACCAAAAGAGCTCAGAGTG
TCGAGAGAGACCCAATCACCGAGAATGACAAGGCAACCTGGGGTGGATG
TGGATCTTGAAACTAATAAAAAGGGGTTTTCACAGTGACAGCGACATTCT
CTTCATAGGGCACAGCTGTCAGTCTATGGCTGATCCAGAGCGAGCATCCA
TGAATTCTGCATGTGCAGGGGTCACTCTAATACCTGATATGTTGGCATCAT
CTTTGTGCTTGAGCCTTCCNCTCCCAAATGGGAATGAAATAAAGGCAAAT
TCCCNCCCCCCCCAAAAAAGGGGNAAAAAAAAAAAAAAAAAAAAAAAA
AA
SEQ. ID. N0.19: Human short form/mouse species chimeric MCH1R
ATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAATGCCAGCAACAC
CTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCTCCTCGCACGG
GGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTCGGCACCATCT
GCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCGGTCGTGAAG
AAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTTCATCATCAA
CCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTTCATGATCCA
CCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCATGTGCACCC
TCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACCTACATCCTG
ACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCCATCTCTTCC
ACGAAGTTCCGGAAGCCCTCCATGGCCACCCTGGTGATCTGCCTCCTGTG
GGCTCTCTCGTTCATTAGCATCACTCCTGTGTGGCTCTATGCCAGGCTTAT
CCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATCCGCCTACCAAACCCAG
ATACTGATCTTTACTGGTTCACTCTGTATCAGTTTTTCCTGGCCTTCGCCCT
TCCGTTTGTGGTCATCACTGCTGCGTACGTGAAAATACTACAGCGCATGAC
GTCTTCGGTGGCCCCAGCCTCTCAACGCAGCATCCGGCTTCGGACAAAGA
GGGTGACCCGCACAGCCATTGCCATCTGTCTGGTCTTCTTTGTGTGCTGGG
CGCCCTACTACGTGCTGCAGCTGACCCAGTTGTCCATCAGCCGCCCGACC
CTCACATTCGTCTACCTGTACAATGCGGCCATCAGCTTGGGCTATGCCAAC
AGCTGCCTCAATCCCTTTGTGTACATAGTACTCTGTGAGACCTTTCGAAAA
CGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAGGGGCAGCTTCGCACGGT
CAGCAATGCTCAGACAGCTGACGAGGAGAGGACAGAAAGCAAAGGCACC
TGA
SEQ. ID. NO. 20: Human long form/mouse species chimeric MCH1R
ATGTCAGTGGGAGCCATGAAGAAGGGAGTGGGGAGGGCAGTTGGGCTTG
GAGGCGGCAGCGGCTGCCAGGCTACGGAGGAAGACCCCCTTCCCAACTGC
GGGGCTTGCGCTCCGGGACAAGGTGGCAGGCGCTGGAGGCTGCCGCAGC
CTGCGTGGGTGGAGGGGAGCTCAGCTCGGTTGTGGGAGCAGGCGACCGG
CACTGGCTGGATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAACG
CCAGCAACACCTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCT
CCTCGCACGGGGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTC
GGCACCATCTGCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCG
GTCGTGAAGAAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTT
CATCATCAACCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTT
CATGATCCACCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCA
TGTGCACCCTCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACC
TACATCCTGACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCC
ATCTCTTCCACGAAGTTCCGGAAGCCCTCCATGGCCACCCTGGTGATCTGC
CTCCTGTGGGCTCTCTCGTTCATTAGCATCACTCCTGTGTGGCTCTATGCC
AGGCTTATCCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATCCGCCTACCA
AACCCAGATACTGATCTTTACTGGTTCACTCTGTATCAGTTTTTCCTGGCCT
TCGCCCTTCCGTTTGTGGTCATCACTGCTGCGTACGTGAAAATACTACAGC
GCATGACGTCTTCGGTGGCCCCAGCCTCTCAACGCAGCATCCGGCTTCGG
ACAAAGAGGGTGACCCGCACAGCCATTGCCATCTGTCTGGTCTTCTTTGTG
TGCTGGGCGCCCTACTACGTGCTGCAGCTGACCCAGTTGTCCATCAGCCGC
CCGACCCTCACATTCGTCTACCTGTACAATGCGGCCATCAGCTTGGGCTAT
GCCAACAGCTGCCTCAATCCCTTTGTGTACATAGTACTCTGTGAGACCTTT
CGAAAACGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAGGGGCAGCTTCG
CACGGTCAGCAATGCTCAGACAGCTGACGAGGAGAGGACAGAAAGCAAA
GGCACCTGA
SEQ. ID. NO. 21: Aequorea victoria Green Fluorescent Protein (GFP) cDNA
Nucleic acid sequence start and stop codons are highlighted:
TACACACGAATAAAAGATAACAAAGATGAGTAAAGGAGAAGAACTTTTC
ACTGGAGTTGTCCCAATTCTTGTTGAATTAGATGGTGATGTTAATGGGCAC
AAATTTTCTGTCAGTGGAGAGGGTGAAGGTGATGCAACATACGGAAAACT
TACCCTTAAATTTATTTGCACTACTGGAAAACTACCTGTTCCATGGCCAAC
ACTTGTCACTACTTTCTCTTATGGTGTTCAATGCTTTTCAAGATACCCAGAT
CATATGAAACAGCATGACTTTTTCAAGAGTGCCATGCCCGAAGGTTATGT
ACAGGAAAGAACTATATT'TTTCAAAGATGACGGGAACTACAAGACACGTG
CTGAAGTCAAGTTTGAAGGTGATACCCTTGTTAATAGAATCGAGTTAAAA
GGTATTGATTTTAAAGAAGATGGAAACATTCTTGGACACAAATTGGAATA
CAACTATAACTCACACAATGTATACATCATGGCAGACAAACAAAAGAATG
GAATCAAAGTTAACTTCAAAATTAGACACAACATTGAAGATGGAAGCGTT
CAACTAGCAGACCATTATCAACAAAATACTCCAATTGGCGATGGCCCTGT
CCTTTTACCAGACAACCATTACCTGTCCACACAATCTGCCCTTTCGAAAGA
TCCCAACGAAAAGAGAGACCACATGGTCCTTCTTGAGTTTGTAACAGCTG
CTGGGATTACACATGGCATGGATGAACTATACAAATAAATGTCCAGACTT
CCAATTGACACTAAAGTGTCCGAACAATTACTAAAATCTCAGGGTTCCTG
GTTAAATTCAGGCTGAGATATTATTTATATATTTATAGATTCATTAAAATT
GTATGAATAATTTATTGATGTTATTGATAGAGGTTATTTTCTTATTAAACA
GGCTACTTGGAGTGTATTCTTAATTCTATATTAATTACAATTTGATTTGACT
TGCTCAAA
SEQ. TD. NO. 22: EGFP + Linker Nucleic acid sequence start and stop codons are highlighted and a 12 amino acid linker sequence is denoted in lower case:
gtcgacggtaccgcgggcccgggatccatcgccaccATGGTGAGCAAGGGCGAGGAGCTGTT
CACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCC
ACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAA
GCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGC
CCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTAC
CCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGG
CTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGA
CCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAG
CTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGC
TGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAG
AAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACG
GCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGAC
GGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCT
GAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCG
TGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAAAGC
GGCCGC
SEQ. ID. NO. 23: Emerald ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT
CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG
GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC
CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCTTGACCT
ACGGCGTGCAGTGCTTCGCCCGCTACCCCGACCACATGAAGCAGCACGAC
TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC
TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG
GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG
GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACA
AGGTCTATATCACCGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTC
AAGACCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACT
ACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAAC
CACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCG
CGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCG
GCATGGACGAGCTGTACAAGTAA
SEQ. ID. NO. 24: Topaz ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT
CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG
GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC
CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCTTCGGCT
ACGGCGTGCAGTGCTTCGCCCGCTACCCCGACCACATGCGCCAGCACGAC
TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC
TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG
GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG
GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACA
ACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTC
AAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTA
CCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACC
ACTACCTGAGCTACCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGC
GATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGG
CATGGACGAGCTGTACAAGTAA
SEQ. ID. NO. 25: W1B
ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT
CGAGCTGGACGGCGACGTAAACGGCCACAGGTTCAGCGTGTCCGGCGAG
GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC
CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT
GGGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC
TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGTACCATCTTC
TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG
GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG
GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACATCAGCCACA
ACGTCTATATCACCGCCGACAAGCAGAAGAACGGCATCAAGGCCCACTTC
AAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTA
CCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACC
ACTACC'TGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGC
GATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGG
CATGGACGAGCTGTACAAGTAA
SEQ. ID. NO. 26: Mouse MCH1R-linker-EGFP
Nucleic acid sequence start codon and start and stop codons for mouse MCH1R
and EGFP, respectively, as well as intron borders, are highlighted and a 12 amino acid linker sequence is denoted in lower case:
ATGGATCTGCAAGCCTCGTTGCTGTCCACTGGCCCCAATGCCAGCAACAT
CTCCGATGGCCAGGATAATTTCACATTGGCGGGTGAGTCGAGTTGGAGTC
CTCCCTCCTCCGGGATGGGTGTGGAAAATGGGAAGGTTTCACCTCCCAAG
CCAAACTGCCTGGGAAACTTTATCTTACAGTTCTTGGTGATAAGATCTGCA
GTCGGCTTTGCCTGAAGAGGAAGAGGAGAGGAGGGGACACCAGCTAGGA
CAGAAGGGGCAGGGAGGAATAGAGATGGGGCAGAGGCACATTTAGAAAC
AACAAGGGTTGGTGACAAGACGTGAGGCAGGCTTGAGGGGAAAGCTTGC
TGATGAGTCCCAAATATGCTTTGCAGGGGGGGGGGGGGGGGAATCAAGG
CTGGAGAAGCAAGCAAGCAAGACAGCAAGACAGCGGGCGGGTAGTATGT
GGGAGCCAGCAGAAGCGCTTTGATTCACCGCTATCCTGGGCTCAATCCTC
TGGCCTCGCACTGGGGAAATGGGGTCTGAGTGGTCCTTGCTGTCTTCTGGC
AAAGGCTGCTGGGAGCAAAAGACTTCACAGGGCGTGAGAGGATTAACTTT
TCTGGTGAATTAAGCTTCTTGACATTTGCAGAACGTCAATGCCTTAAAATT
CTAGCTCTGAAGGAGAAGGGAATGAAGGGGAAAGAGGGAAGGTTGGTGT
GGAGAAATTCCCAAGCTTCTGGGGTGTAACACAGCTCCAGTCCCTACCCT
ATTGGGAAAGCCCAGACTCAGGAGACATGGTCCAAGGAAATCCCTGACA
GAAAACCGGGAGAGGGCAGGGCTGTGGAGCCTGAAACACACCCCACACC
CATGGTGACAGTCACTTCTCACATATGCCTAGGAACCTATCTGAAACCTTT
GGCCATCTCTCTCTGAAAAGATGAGGCTGCAAATACACACACACACACAC
ACACACACACACACACACACACACACACACACACACACACACACACAAA
TGTCCTTCAAGCCTTTTTGACAAGGTTTTCTGGTGGATCCCGGGGATATGA
AGTTGTTCTCAGCAGATATCTGGGAGTCTTGACTCCTGGCCCTCTGAGTAA
ATGGATGAAGCGAAGAAGAATGGGGTCCTCTGAGTAACAGGTGGATCTA
GAAAATCCTATAGGAGTCACCAGGGCACGGTGGAGGAGGGTAAGGTACA
GAACTAACAATAGCCCGAGAAGGGGAAACAGCAGGAGATGATTCCAGAG
ACGTAGTGACCCCAAGCTGCAAGGGAAAGCATGAGGGGCCAGCAGGAAG
GCCGACATGGCAGGTTGTCAGCTTCTAGATCGGAAGGCGGGTCACACTTG
CTCTTTCTATCCTCAGGGCCACCTCCTCGCACAAGGAGTGTCTCCTACATC
AACATCATCATGCCTTCAGTGTTTGGTACCATCTGTCTCCTGGGCATTGTG
GGAAACTCCACAGTCATTTTTGCCGTGGTGAAGAAATCCAAGCTGCACTG
GTGCAGCAACGTCCCTGACATCTTCATCATCAACCTCTCTGTGGTGGATCT
GCTTTTCCTGCTGGGCATGCCTTTCATGATCCACCAGCTCATGGGTAATGG
TGTCTGGCACTTTGGGGAAACCATGTGCACCCTCATCACAGCCATGGACG
CCAACAGTCAGTTCACCAGCACCTACATCCTGACTGCTATGGCCATTGACC
GCTACTTGGCCACCGTCCATCCCATCTCCTCCACCAAGTTCCGGAAGCCCT
CCATGGCCACCCTGGTGATCTGCCTCCTGTGGGCTCTCTCGTTCATTAGCA
TCACTCCTGTGTGGCTCTATGCCAGGCTTATCCCCTTCCCAGGGGGTGCTG
TGGGCTGTGGCATCCGCCTACCAAACCCAGATACTGATCTTTACTGGTTCA
CTCTGTATCAGTTTTTCCTGGCCTTCGCCCTTCCGTTTGTGGTCATCACTGC
TGCGTACGTGAAAATACTACAGCGCATGACGTCTTCGGTGGCCCCAGCCT
CTCAACGCAGCATCCGGCTTCGGACAAAGAGGGTGACCCGCACAGCCATT
GCCATCTGTCTGGTCTTCTTTGTGTGCTGGGCGCCCTACTACGTGCTGCAG
CTGACCCAGTTGTCCATCAGCCGCCCGACCCTCACATTCGTCTACCTGTAC
AATGCGGCCATCAGCTTGGGCTATGCCAACAGCTGCCTCAATCCCTTTGTG
TACATAGTACTCTGTGAGACCTTTCGAAAACGCTTGGTGCTGTCGGTGAA
GCCCGCGGCCCAGGGGCAGCTTCGCACGGTCAGCAATGCTCAGACAGCTG
ACGAGGAGAGGACAGAAAGCAAAGGCACCgtcgacggtaccgcgggcccgggatccatcg ccaccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCC
TGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGC
GAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTG
CACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGA
CCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCAC
GACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCAT
CTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCG
AGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAG
GAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCC
ACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAA
CTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACC
ACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGAC
AACCACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAA
GCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTC
TCGGCATGGACGAGCTGTACAAGTAA
SEQ. ID. NO. 27: Mouse MCH1R/EGFP direct fusion Nucleic acid sequence start codon and start and stop codons for mouse MCH1R
and ~ EGFP, respectively, as well as intron borders, are highlighted:
ATGGATCTGCAAGCCTCGTTGCTGTCCACTGGCCCCAATGCCAGCAACAT
CTCCGATGGCCAGGATAATTTCACATTGGCGGGTGAGTCGAGTTGGAGTC
CTCCCTCCTCCGGGATGGGTGTGGAAAATGGGAAGGTTTCACCTCCCAAG
CCAAACTGCCTGGGAAACTTTATCTTACAGTTCTTGGTGATAAGATCTGCA
GTCGGCTTTGCCTGAAGAGGAAGAGGAGAGGAGGGGACACCAGCTAGGA
CAGAAGGGGCAGGGAGGAATAGAGATGGGGCAGAGGCACATTTAGAAAC
AACAAGGGTTGGTGACAAGACGTGAGGCAGGCTTGAGGGGAAAGCTTGC
TGATGAGTCCCAAATATGCTTTGCAGGGGGGGGGGGGGGGGAATCAAGG
CTGGAGAAGCAAGCAAGCAAGACAGCAAGACAGCGGGCGGGTAGTATGT
GGGAGCCAGCAGAAGCGCTTTGATTCACCGCTATCCTGGGCTCAATCCTC
TGGCCTCGCACTGGGGAAATGGGGTCTGAGTGGTCCTTGCTGTCTTCTGGC
AAAGGCTGCTGGGAGCAAAAGACTTCACAGGGCGTGAGAGGATTAACTTT
TCTGGTGAATTAAGCTTCTTGACATTTGCAGAACGTCAATGCCTTAAAATT
CTAGCTCTGAAGGAGAAGGGAATGAAGGGGAAAGAGGGAAGGTTGGTGT
GGAGAAATTCCCAAGCTTCTGGGGTGTAACACAGCTCCAGTCCCTACCCT
ATTGGGAAAGCCCAGACTCAGGAGACATGGTCCAAGGAAATCCCTGACA
GAAAACCGGGAGAGGGCAGGGCTGTGGAGCCTGAAACACACCCCACACC
CATGGTGACAGTCACTTCTCACATATGCCTAGGAACCTATCTGAAACCTTT
GGCCATCTCTCTCTGAAAAGATGAGGCTGCAAATACACACACACACACAC
ACACACACACACACACACACACACACACACACACACACACACACACAAA
TGTCCTTCAAGCCTTTTTGACAAGGTTTTCTGGTGGATCCCGGGGATATGA
AGTTGTTCTCAGCAGATATCTGGGAGTCTTGACTCCTGGCCCTCTGAGTAA
ATGGATGAAGCGAAGAAGAATGGGGTCCTCTGAGTAACAGGTGGATCTA
GAAAATCCTATAGGAGTCACCAGGGCACGGTGGAGGAGGGTAAGGTACA
GAACTAACAATAGCCCGAGAAGGGGAAACAGCAGGAGATGATTCCAGAG
ACGTAGTGACCCCAAGCTGCAAGGGAAAGCATGAGGGGCCAGCAGGAAG
GCCGACATGGCAGGTTGTCAGCTTCTAGATCGGAAGGCGGGTCACACTTG
CTCTTTCTATCCTCAGGGCCACCTCCTCGCACAAGGAGTGTCTCCTACATC
AACATCATCATGCCTTCAGTGTTTGGTACCATCTGTCTCCTGGGCATTGTG
GGAAACTCCACAGTCATTTTTGCCGTGGTGAAGAAATCCAAGCTGCACTG
GTGCAGCAACGTCCCTGACATCTTCATCATCAACCTCTCTGTGGTGGATCT
GCTTTTCCTGCTGGGCATGCCTTTCATGATCCACCAGCTCATGGGTAATGG
TGTCTGGCACTTTGGGGAAACCATGTGCACCCTCATCACAGCCATGGACG
CCAACAGTCAGTTCACCAGCACCTACATCCTGACTGCTATGGCCATTGACC
GCTACTTGGCCACCGTCCATCCCATCTCCTCCACCAAGTTCCGGAAGCCCT
CCATGGCCACCCTGGTGATCTGCCTCCTGTGGGCTCTCTCGTTCATTAGCA
TCACTCCTGTGTGGCTCTATGCCAGGCTTATCCCCTTCCCAGGGGGTGCTG
TGGGCTGTGGCATCCGCCTACCAAACCCAGATACTGATCTTTACTGGTTCA
CTCTGTATCAGTTTTTCCTGGCCTTCGCCCTTCCGTTTGTGGTCATCACTGC
TGCGTACGTGAAAATACTACAGCGCATGACGTCTTCGGTGGCCCCAGCCT
CTCAACGCAGCATCCGGCTTCGGACAAAGAGGGTGACCCGCACAGCCATT
GCCATCTGTCTGGTCTTCTTTGTGTGCTGGGCGCCCTACTACGTGCTGCAG
CTGACCCAGTTGTCCATCAGCCGCCCGACCCTCACATTCGTCTACCTGTAC
AATGCGGCCATCAGCTTGGGCTATGCCAACAGCTGCCTCAATCCCTTTGTG
TACATAGTACTCTGTGAGACCTTTCGAAAACGCTTGGTGCTGTCGGTGAA
GCCCGCGGCCCAGGGGCAGCTTCGCACGGTCAGCAATGCTCAGACAGCTG
ACGAGGAGAGGACAGAAAGCAAAGGCACCATGGTGAGCAAGGGCGAGG
AGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTA
AACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCT
ACGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTG
CCCTGGCCCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAG
CCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGC
CCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAAC
TACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACC
GCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGG
GCACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCG
ACAAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACAT
CGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCC
ATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCA
GTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGC
TGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTAC
AAGTAA
SEQ. ID. NO. 28: Human short form/mouse species chimeric MCH1R-linker-EGFP
Nucleic acid sequence start codon and start and stop codons for mouse MCH1R
and EGFP, respectively, are highlighted and a 12 amino acid linker sequence is denoted in lower case:
ATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAATGCCAGCAACAC
CTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCTCCTCGCACGG
GGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTCGGCACCATCT
GCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCGGTCGTGAAG
AAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTTCATCATCAA
CCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTTCATGATCCA
CCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCATGTGCACCC
TCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACCTACATCCTG
ACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCCATCTCTTCC
ACGAAGTTCCGGAAGCCCTCCATGGCCACCCTGGTGATCTGCCTCCTGTG
GGCTCTCTCGTTCATTAGCATCACTCCTGTGTGGCTCTATGCCAGGCTTAT
CCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATCCGCCTACCAAACCCAG
ATACTGATCTTTACTGGTTCACTCTGTATCAGTTTTTCCTGGCCTTCGCCCT
TCCGTTTGTGGTCATCACTGCTGCGTACGTGAAAATACTACAGCGCATGAC
GTCTTCGGTGGCCCCAGCCTCTCAACGCAGCATCCGGCTTCGGACAAAGA
GGGTGACCCGCACAGCCATTGCCATCTGTCTGGTCTTCTTTGTGTGCTGGG
CGCCCTACTACGTGCTGCAGCTGACCCAGTTGTCCATCAGCCGCCCGACC
CTCACATTCGTCTACCTGTACAATGCGGCCATCAGCTTGGGCTATGCCAAC
AGCTGCCTCAATCCCTTTGTGTACATAGTACTCTGTGAGACCTTTCGAAAA
CGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAGGGGCAGCTTCGCACGGT
CAGCAATGCTCAGACAGCTGACGAGGAGAGGACAGAAAGCAAAGGCACC
gtcgacggtaccgcgggcccgggatccatcgccaccATGGTGAGCAAGGGCGAGGAGCTGTT
CACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCC
ACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAA
GCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGC
CCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTAC
CCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGG
CTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGA
CCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAG
CTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGC
TGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAG
AAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACG
GCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGAC
GGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCT
GAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCG
TGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAA
SEQ. ID. NO. 29: Human long form/mouse species chimeric MCH1R-linker-EGFP
Nucleic acid sequence start codon and start and stop codons for mouse MCH1R
and EGFP, respectively, are highlighted and a 12 amino acid linker sequence is denoted in lower case:
ATGTCAGTGGGAGCCATGAAGAAGGGAGTGGGGAGGGCAGTTGGGCTTG
GAGGCGGCAGCGGCTGCCAGGCTACGGAGGAAGACCCCCTTCCCAACTGC
GGGGCTTGCGCTCCGGGACAAGGTGGCAGGCGCTGGAGGCTGCCGCAGC
CTGCGTGGGTGGAGGGGAGCTCAGCTCGGTTGTGGGAGCAGGCGACCGG
CACTGGCTGGATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAACG
CCAGCAACACCTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCT
CCTCGCACGGGGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTC
GGCACCATCTGCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCG
GTCGTGAAGAAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTT
CATCATCAACCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTT
CATGATCCACCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCA
TGTGCACCCTCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACC
TACATCCTGACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCC
ATCTCTTCCACGAAGTTCCGGAAGCCCTCCATGGCCACCCTGGTGATCTGC
CTCCTGTGGGCTCTCTCGTTCATTAGCATCACTCCTGTGTGGCTCTATGCC
AGGCTTATCCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATCCGCCTACCA
AACCCAGATACTGATCTT'TACTGGTTCACTCTGTATCAGTTTTTCCTGGCCT
TCGCCCTTCCGTTTGTGGTCATCACTGCTGCGTACGTGAAAATACTACAGC
GCATGACGTCTTCGGTGGCCCCAGCCTCTCAACGCAGCATCCGGCTTCGG
ACAAAGAGGGTGACCCGCACAGCCATTGCCATCTGTCTGGTCTTCTTTGTG
TGCTGGGCGCCCTACTACGTGCTGCAGCTGACCCAGTTGTCCATCAGCCGC
CCGACCCTCACATTCGTCTACCTGTACAATGCGGCCATCAGCTTGGGCTAT
GCCAACAGCTGCCTCAATCCCTTTGTGTACATAGTACTCTGTGAGACCTTT
CGAAAACGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAGGGGCAGCTTCG
CACGGTCAGCAATGCTCAGACAGCTGACGAGGAGAGGACAGAAAGCAAA
GGCACCgtcgacggtaccgcgggcccgggatccatcgccaccATGGTGAGCAAGGGCGAGGA
GCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAA
ACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTAC
GGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCC
CTGGCCCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCC
GCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCC
GAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTA
CAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGC
ATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGC
ACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGAC
AAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCG
AGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATC
GGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTC
CGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGG
AGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAG
TAA
Example 2: Generation of Chimeric and Fusion Proteins DNA vectors encoding fusion proteins between a MCH-R receptor (MCH1R) and several different superbright variants of Green Fluorescent Protein (GFP) were generated. GFP variants were fused either via a 12 amino acid linker:
TCGACGGTACCGCGGGCCCGGGATCCATCGCCACC (SEQ. ID. NO. 30), amino acid sequence: VDGTAGPGSIAT (SEQ. ID. NO. 31) (linker fusions) or directly to the C-terminus of MCH1R (direct fusions).
Mouse MCH1R-linker-GFP Variant Fusion Constructs Initially, mouse MCH1R was fused in frame via the linker to Enhanced Green Fluorescent Protein (EGFP). MCH1R was PCR-amplified (95°C
for 5 minutes; 95°C for 30 seconds, 60°C for 45 seconds, 68°C
for 3.5 minutes, for 15 cycles; 68°C for 7 minutes) from a full-length mouse MCH1R genomic DNA
lambda clone utilizing a high fidelity polymerase mix (Expand High Fidelity PCR
System from Boehringer Mannheim) and PCR primers [MCH1R (Eco RI) 5':
GCGAATTCACCATGGATCTGCAAGCCTCG (SEQ. ID. NO. 32), MCH1R (Sal I) 3': GCGTCGACGGTGCCTTTGCTTTCTGTCC (SEQ. m. NO. 33)] that generated Eco RI and Sal I enzymatic restriction sites at the N- and C-terminus, respectively.
The MCH1R N-terminal PCR primer was also designed to introduce a Kozak consensus sequence for translation which contained an Nco I site (5'-ACCATGG-3'), and the MCH1R C-terminal PCR primer was also designed to eliminate the endogenous stop codon present in the mouse MCH1R gene. The resulting PCR
product was phenol/chloroform extracted, restriction digested with Eco RI and Sal I, gel purified, and subcloned in frame into the multicloning site of Clontech's pEGFP-N3 vector between Eco RI and Sal I sites. Several resulting clones for this construct were sequenced to identify a clone with an entirely correct nucleotide sequence. This clone was named mMCHIR-1-EGFP for mouse MCH1R-linker-EGFP.
An approximately 760 by Sal I to Not I fragment of mMCHIR-1-EGFP
was excised, gel purified, and subcloned into the multicloning site of pBluescript (SK+) (Stratagene) between Sal1 and Not I sites. An approximately 710 by Nco I
to Bsr G1 fragment of EGFP was excised from the resulting pBluescript-EGFP vector and replaced with the corresponding Nco I to Bsr G1 fragment of either Emerald, Topaz, or W1B (other superbright GFP variants), which were excised from vectors pRSET-Emerald, pRSET-Topaz, and pRSET-W1B, respectively. pRSET-Emerald, pRSET-Topaz, and pRSET-W 1B were obtained from Aurora Biosciences Co. Sal I to Not I fragments containing either Emerald, Topaz, or W 1B were excised from the resulting pBluescript-Emerald, pBluescript-Topaz, and pBluescript-W1B vectors, respectively. Appropriate fragments were gel purified and subcloned into mMCHIR-1-EGFP digested with Sal I and Not I, replacing the Sal I to Not I EGFP
fragment with the corresponding Sal I to Not I fragment from either Emerald, Topaz, or W1B.
Several clones for each construct were sequenced to confirm the presence of the appropriate GFP variant. The resulting vectors were named mMCHIR-1-Emerald, mMCHIR-1-Topaz, and mMCHIR-1-W 1B for mouse MCH1R-linker-Emerald, mouse MCH1R-linker-Topaz, and mouse MCH1R-linker-W1B, respectively.
Mouse MCH1R/GFP Variant Direct Fusion Constructs A two step PCR strategy was employed to generate the direct fusion constructs. First, mouse MCH1R, EGFP, and Emerald were PCR-amplified from a full-length mouse MCH1R genomic DNA lambda clone, Clontech's pEGFP-N3 vector, and Aurora's pRSET-Emerald vector, respectively. Mouse MCH1R was PCR-amplified according to the previously mentioned conditions utilizing the same N-terminal PCR primer [MCH1R (Eco RI) 5': GCGAATTCACCATGGATCTGCA
AGCCTCG (SEQ. ID. NO. 32)], but in this case a different C-terminal PCR primer was employed. The C-terminal PCR primer [MCH1R (EGFP/Emerald) 3' CCTTGCTCACCATGGTGCCTTTGCTTTCTGTCC (SEQ. ID. NO. 34)] eliminated the endogenous stop codon of mouse MCH1R as before and introduced a region of nucleotide sequence complementary to the nucleotide sequence of the N-terminus of EGFP.
EGFP and Emerald were PCR-amplified (95°C for 5 minutes;
95°C for seconds, 60°C for 45 seconds, 68°C for 1.5 minutes, for 15 cycles; 68°C for 7 minutes) separately with a high fidelity polymerase mix (Advantage HF-2 from Clontech) from their respective templates utilizing a common N-terminal PCR
primer 25 [EGFP/Emerald (MCH1R) 5': CAGAAAGCAAAGGCACCATGGTGAGCAA
GGGCGAGGAGC (SEQ. ID. NO. 35)] that generated a region of nucleotide sequence complementary to the C-terminus of mouse MCH1R and C-terminal PCR
primers [EGFP 3' : GGCGGATCCTCTAGAGTCGCGGCC (SEQ. ID. NO. 36), or Emerald (EGFP) 3' : GCTCTAGAGTCGCGGCCGCTTACTTGTACAGCTCGTCC
30 (SEQ. ID. NO. 37)] that generated a Not I site at the C-terminus. The resulting PCR
products were electrophoresed on an agarose gel and the appropriate fragments were gel purified.
In a second PCR step, PCR reactions were set up between the previously generated mouse MCH1R and EGFP, or mouse MCH1R and Emerald PCR products. Following an initial 5 minute denaturation step at 95°C, two rounds of thermocycling (95°C for 30 seconds, 60°C for 45 seconds, 68°C for 4 minutes) were performed in the absence of PCR primers. This allowed the mouse MCH1R and GFP
variants to anneal at their complementary regions and to be filled in by the high fidelity polymerase mix (Expand High Fidelity PCR System from Boehringer Mannheim), yielding double stranded template DNA.
Subsequently, the common N-terminal mouse MCH1R [MCH1R (Eco RI) 5' : GCGAATTCACCATGGATCTGCAAGCCTCG (SEQ. ID. NO. 32)] and appropriate C-terminal PCR primers [EGFP 3': GGCGGATCCTCTAGAGTC
GCGGCC (SEQ. ll~. NO. 36) or Emerald (EGFP) 3': GCTCTAGAGTCGCGG
CCGCTTACTTGTACAGCTCGTCC (SEQ. ID. NO. 37)] were added to the reactions and thermocycling was continued for an additional fifteen cycles followed by a final extension at 68°C for 7 minutes. The resulting PCR products were phenol/chloroform extracted, restriction digested with Eco RI and Not I, electrophoresed on an agarose gel, and appropriate fragments were gel purified.
These Eco RI to Not I fragments represent direct fusions between either mouse MCH1R and EGFP, or mouse MCH1R and Emerald. Clontech's pEGFP-N3 vector was restriction digested with Eco RI and Not I liberating an approximately 780 by Eco RI to Not I EGFP fragment. This restriction digest was electrophoresed on an agarose gel and the approximately 3.9 Kb pEGFP-N3 vector backbone was gel purified. Eco RI to Not I mouse MCH1R/EGFP or mouse MCH1R/Emerald direct fusion fragments were subcloned into the pEGFP-N3 vector backbone between Eco RI and Not I sites. Several resulting clones for each of these two constructs were sequenced to identify clones with correct nucleotide sequence;
however, no clones with entirely correct nucleotide sequences were identified.
Fortunately, several clones for each of the two constructs only had nucleotide mismatches in the intron region of mouse MCH1R, and therefore, were not expected to effect the functionality of the resulting fusion proteins. These clones were named mMCHIR/EGFP and mMCHIR/Emerald for mouse MCHIRIEGFP direct fusion and mouse MCH1R/Emerald direct fusion, respectively.
Human Short and Long Form/Mouse Species Chimeric MCH1R-linker-GFP Variant Fusion Constructs The initial mouse MCH1R-linker-GFP variant fusion constructs were modified to generate both human short form and human long form/mouse species chimeric MCH1R-linker-GFP variant fusion constructs. An approximately 1.7 kb Hind III to Bsp EI fragment of the mouse MCH1R gene containing exon l, the intron, and 127 amino acids of exon 2 was excised from the various mouse MCH1R-linker-GFP variant fusion constructs and replaced by either an approximately 470 by Hind III to Bsp EI fragment from the wild-type human MCH1R short form or an approximately 670 by Hind III to Bsp EI fragment from the wild-type human long form.
Several clones for each construct were sequenced to confirm the presence of the N-terminal region of either the human MCH1R short or long forms.
These clones were named hshort/mMCHIR-1-GFP variant or hlong/mMCHIR-1-GFP
variant for human short form/mouse species chimeric MCH1R-linker-GFP variant and human long form/mouse species chimeric MCH1R-linker-GFP variant, respectively.
Example 3: Functional Evaluation of MCH1R/GFP Variant Fusion Proteins Both HEK293 Aequorin (National Institutes of Health) and CHO
mammalian cell lines were transiently transfected with the various MCH1R/GFP
variant fusion constructs, as well as the appropriate control constructs.
Transfection was performed using Lipofectamine 2000 (Gibco BRL) per the manufacturer recommended protocol. Approximately 48 hours after transfection cells were harvested, stimulated with various concentrations of human MCH, and assayed for either aequorin bioluminescence (HEK293 Aequorin cells) or cAMP production (CHO cells). Aequorin bioluminescence is a representative measure of intracellular Ca2+mobilization. cAMP production was measured with the Adenylyl Cyclase Activation FlashPlate Assay (NEN Life Science Products, Inc.).
Following transient transfection of the mMCHIR-linker-EGFP
construct (MCH-R-1-EGFP) into HEK293 Aequorin cells, the resulting fusion protein exhibited functional activity comparable to that of the wild-type human MCH1R
short form (MCH-R wt). By this functional assay, the EC50 value for mMCHIR-1-EGFP
was nearly identical to that of the wild-type human short form receptor (Figure 1).
Following transient transfections of the mMCHIR-1-EGFP and mMCHIR/EGFP fusion constructs into CHO cells, the resulting fusion proteins exhibited functional activity comparable to that of the wild-type human MCH1R
short form. By this functional assay, the EC50 values for mMCHIR-1-EGFP and mMCHIR/EGFP were comparable to that of the wild-type human receptor (Table 1).
Transient transfections with the corresponding Emerald constructs yielded similar results (data not shown).
Table 1 Rece for EC50 (nM) Wild-t a Human MCH1R Short 2.166 Form Mouse MCH1R/EGFP 0.819 Mouse MCH1R-1-EGFP 3.199 Following transient transfections of the human short form/mouse species chimeric MCH1R-1-EGFP (HuShort/mMCHIR-1-EGFP) and human long form/mouse species chimeric MCH1R-1-EGFP (HuLong/mMCHIR-1-EGFP) constructs into HEK293 cells, the resulting fusion proteins exhibited functional activity comparable to that of the wild-type human MHC1R short and long forms, respectively. By this functional assay, the EC50 value for each fusion proteins was nearly identical to that of the corresponding wild-type human receptor (Table 2).
Table 2 Rece for ECSp (nM) Wild-t a Human MCH1R Short 22.27 Form HuShort/mMCHIR-1-EGFP 19.54 Wild-t a Human MCH1R Lon 196.7 Form HuLon mMCHIR-1-EGFP Form 217.5 Following transient transfections of the human short form/mouse species chimeric MCH1R-1-EGFP (HuShort/mMCHIR-1-EGFP) and human long form/mouse species chimeric MCH1R-1-EGFP (HuLong/mMCHIR-1-EGFP) constructs into CHO cells, the resulting fusion proteins exhibited functional activity comparable to or less than that of the wild-type human MHC1R short and long forms, respectively (Table 3). By this functional assay, the EC50 value for the human short form/mouse species chimeric MCH1R-1-EGFP fusion protein was comparable to that of the corresponding wild-type human receptor, whereas, the human long form/mouse species chimeric MCH1R-1-EGFP fusion protein had an EC50 value approximately 7.5-fold higher than that of its corresponding wild-type control.
Table 3 Rece for EC50 (nM) Wild-t a Human MCH1R Short 1.029 Form Wild-t a Human MCH1R Lon 1.515 Form HuShort/mMCHIR-1-EFGP 1.565 HuLon mMCHIR-1-EGFP 11.580 Transient expression of all the MCH1R/GFP variant fusion proteins that underwent functional evaluation resulted in fluorescence primarily associated with the plasma membrane in both HEK293 and CHO cells (data not shown). This pattern of fluorescence is consistent with a predominant membrane associated localization.
Example 4: Generation of Stable Cell Lines Wild-type CHO cells were transfected using SuperFect (Qiagen) and either mouse MCH-1R-EGFP or human short/mouse species chimeric MCH-1R-EGFP. Forty-eight hours after transfection, transfected cells were subjected to positive selection for approximately ten days in media containing 6418.
Following selection, MCH-1R-EGFP expressing CHO cells were bulk sorted by Fluorescence Assisted Cell Sorting (FACS) for one or two rounds on the basis of fluorescence intensity to increase the population of cells expressing EGFP. Following bulk sorts, individual clones of varying fluorescence intensities were isolated by FAGS
and expanded.
Fluorometric Microvolume Assay Technology (FMAT) was initially employed to screen a large number of stable clones by whole cell binding with a fluorescently labeled MCH derivative (SymJz-MCH, PE Biosystems) to identify those clones with good specific binding windows. Several clones exhibiting specific binding windows greater than 3-fold were further evaluated for MCH binding with the SPA-based Binding Assay. Cells from individual clones were dissociated in enzyme free dissociation media and cell membranes were prepared and subsequently tested for their ability to bind [lzsl]Phel3Tyr19-MCH in the presence of human MCH. CHO
cell lines expressing either mouse MCH-1R-EGFP or human short/mouse species chimeric MCH-1R-EGFP (Figure 4) displayed IC50 values with MCH that were indistinguishable from the corresponding IC50 values obtained with a CHO cell line expressing the wild-type human short isoform of MCH-1R.
The functional activity of these clones was evaluated with the cAMP
Flashplate Assay (Figures 2 and 3). CHO cell lines expressing either mouse MCH-1R-EGFP (Figure 2) or human short/mouse species chimeric MCH-1R-EGFP (Figure 3) displayed EC50 values with human MCH that were indistinguishable from the EC50 value obtained with a CHO cell line expressing the wild-type human short isoform of MCH-1R.
The subcellular localization of the MCH-1R-EGFP fusion proteins were determined by confocal microscopy utilizing EGFP fluorescence as a marker for MCH-1R expression. CHO cell lines stably expressing either mouse MCH-1R-EGFP
or human short/mouse species chimeric MCH-1R-EGFP displayed EGFP
fluorescence primarily associated with the plasma membrane, demonstrating that these MCH-1R-EGFP fusion proteins are primarily associated with the plasma membrane.
Other embodiments are within the following claims. While several embodiments have been shown and described, various modifications may be made without departing from the spirit and scope of the present invention.
SEQUENCE LISTING
<110> Merck & Co., Inc.
<120> MELANIN CONCENTRATING HORMONE RECEPTOR
CHIMERIC AND FUSTON PROTEINS
<130> 20652 PCT
<150> 60/189,698 <151> 2000-03-15 <160> 37 <170> FastSEQ for Windows Version 4.0 <210> 1 <211> 422 <212> PRT
<213> Human <400> 1 Met Ser Val Gly Ala Met Lys Lys Gly Val Gly Arg Ala Val Gly Leu Gly Gly Gly Ser Gly Cys Gln Ala Thr Glu Glu Asp Pro Leu Pro Asn Cys Gly Ala Cys Ala Pro Gly Gln Gly Gly Arg Arg Trp Arg Leu Pro Gln Pro Ala Trp Val Glu Gly Ser Ser Ala Arg Leu Trp Glu Gln Ala Thr Gly Thr Gly Trp Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly Pro Asn Ala Ser Asn Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala Gly Ser Pro Pro Arg Thr Gly Ser Ile Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe Gly Thr Ile Cys Leu Leu Gly Ile Ile Gly Asn Ser Thr Val Ile Phe Ala Val Va1 Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln Phe Thr Ser Thr Tyr Ile Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu Ala Thr Val His Pro Ile Ser Ser Thr Lys Phe Arg Lys Pro Ser Val Ala Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser Ile Thr Pro Val Trp Leu Tyr Ala Arg Leu Ile Pro Phe Pro Gly Gly Ala Val Gly Cys Gly Ile Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val Ile Thr Ala Ala Tyr Val Arg Ile Leu Gln Arg Met Thr Ser Ser Val Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Thr Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala Ile Ser Leu Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg Ala Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly Thr <210> 2 <211> 353 <212> PRT
<213> Human <400> 2 Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly Pro Asn Ala Ser Asn Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala Gly Ser Pro Pro Arg Thr Gly Ser Ile Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe Gly Thr Ile Cys Leu Leu Gly Ile Ile Gly Asn Ser Thr Val Ile Phe Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln Phe Thr Ser Thr Tyr Ile Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu Ala Thr Val His Pro Ile Ser Ser Thr Lys Phe Arg Lys Pro Ser Val A1a Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser Ile Thr Pro Val Trp Leu Tyr Ala Arg Leu Ile Pro Phe Pro Gly Gly Ala Val Gly Cys Gly Ile Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val Ile Thr Ala Ala Tyr Val Arg Ile Leu Gln Arg Met Thr Ser Ser Val Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Thr Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala Ile Ser Leu Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg Ala Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly Thr <210> 3 <211> 353 <212> PRT
<213> Mouse <400> 3 Met Asp Leu Gln Ala Ser Leu Leu Ser Thr Gly Pro Asn Ala Ser Asn Ile Ser Asp Gly Gln Asp Asn Phe Thr Leu Ala Gly Pro Pro Pro Arg Thr Arg Ser Val Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe Gly Thr Ile Cys Leu Leu Gly Ile Val Gly Asn Ser Thr Val Ile Phe Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Ser Asn Val Pro Asp Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln Phe Thr Ser Thr Tyr Ile Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu Ala Thr Val His Pro Ile Ser Ser Thr Lys Phe Arg Lys Pro Ser Met Ala Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser Ile Thr Pro Val Trp Leu Tyr Ala Arg Leu Tle Pro Phe Pro Gly Gly Ala Val Gly Cys Gly Ile Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val Ile Thr Ala Ala Tyr Val Lys Ile Leu Gln Arg Met Thr Ser Ser Val Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Thr Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala Ile Ser Leu Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg Thr Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly Thr <210> 4 <211> 353 <212> PRT
<213> Artificial Sequence <220>
<223> Human short form/mouse species chimeric MCH1R
<400> 4 Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly Pro Asn Ala Ser Asn Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala Gly Ser Pro Pro Arg Thr Gly Ser Ile Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe Gly Thr Ile Cys Leu Leu Gly Ile Ile Gly Asn Ser Thr Val Ile Phe Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln Phe Thr Ser Thr Tyr Ile Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu Ala Thr Val His Pro Ile Ser Ser Thr Lys Phe Arg Lys Pro Ser Met Ala Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser IIe Thr Pro Val Trp Leu Tyr Ala Arg Leu Ile Pro Phe Pro Gly Gly Ala Val Gly Cys Gly Ile Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val Ile Thr Ala Ala Tyr Val Lys Ile Leu Gln Arg Met Thr Ser Ser Val Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Thr Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala Tle Ser Leu Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg Thr Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly Thr <210> 5 <211> 422 <212> PRT
<213> Artificial Sequence <220>
MELANIN CONCENTRATING-HORMONE RECEPTOR CHIMERIC AND
FUSION PROTEINS
CROSS-REFERENCE TO RELATED APPLICATIONS
The present application claims priority to provisional application U.S.
Serial No. 601189,698, filed March 15, 2000, hereby incorporated by reference herein.
BACKGROUND OF THE INVENTION
The references cited herein are not admitted to be prior art to the claimed invention.
Neuropeptides present in the hypothalamus play a major role in mediating the control of body weight. (Flier et al., 1998. Cell, 92, 437-440.) Melanin-concentrating hormone (MCH) is a cyclic 19-amino acid neuropeptide synthesized as part of a larger pre-prohormone precursor in the hypothalamus which also encodes neuropeptides NEI and NGE. (Nation et al., 1990. Mol. Endocrifzol. 4, 632-637.) MCH was first identified in salmon pituitary, and in fish MCH affects melanin aggregation thus affecting skin pigmentation. In trout and in eels MCH has also been shown to be involved in stress induced or CRF-stimulated ACTH release.
(Kawauchi et al., 1983. NatuYe 305, 321-323.) In humans two genes encoding MCH have been identified that are expressed in the brain. (Breton et al., 1993. Mol. Brain Res. 18, 297-310.) In mammals MCH has been localized primarily to neuronal cell bodies of the hypothalamus which are implicated in the control of food intake, including perikarya of the lateral hypothalamus and zona inertia. (Knigge et al., 1996. Peptides 17, 1063-1073.) Pharmacological and genetic evidence suggest that the primary mode of MCH action is to promote feeding (orexigenic). MCH mRNA is up regulated in fasted mice and rats and in the oblob mouse. (Qu et al., 1996. Nature 380, 243-247.) Injection of MCH centrally (ICV) stimulates food intake and MCH antagonizes the hypophagic effects seen with oc-melanocyte stimulating hormone (aMSH). (Qu et al., 1996. Nature 380, 243-247.) MCH-deficient mice are lean, hypophagic, and have increased metabolic rate. (Shimada et al., 1998. Nature 396, 670-673.) MCH action is not limited to modulation of food intake as effects on the hypothalamic-pituitary-axis have been reported. (Nation 1994. Critical Rev. ifz Neurobiol. 8, 221-262.) MCH may be involved in the body response to stress as MCH can modulate the stress-induced release of CRF from the hypothalamus and ACTH from the pituitary. In addition, MCH neuronal systems may be involved in reproductive or maternal function.
Several references describe a receptor that is indicated to bind MCH.
(Chambers et al., 1999. Nature 400, 261-265; Saito et al., 1999. Nature 400, 265-269;
Bachner et al., 1999. FEBS Letters 457:522-524; Shimomura et al., 1999.
Biochemical and Biophysical Researcla Comnzunicatiofzs 261, 622-626; and Lembo et al., 1999. Nat. Cell Biol. 1, 267-271.) SUMMARY OF THE INVENTION
The present invention features melanin concentrating hormone receptor (MCH-R) chimeric and fusion proteins. MCH-R chimeric proteins comprise an MCH-R polypeptide region made up of at least two or more polypeptide regions characteristic of MCH-R found in different species. MCH-R fusion proteins comprise an MCH-R polypeptide region and a fluorescent protein region.
An MCH-R polypeptide region provides a functional G-protein coupled receptor region able to bind MCH and transduce an intracellular signal.
Examples of MCH-R polypeptide regions include naturally occurring MCH-R, chimeric MCH-R containing two or more regions from naturally occurring MCH-R, and functional derivatives thereof.
Reference to the terms "characteristic" and "derivatives thereof describe a relationship to a reference sequence. In both cases, there is at least about 75°70 sequence similarity to the reference sequence.
Thus, a first aspect of the present invention describes a fusion protein comprising (a) an MCH-R polypeptide region and (b) a fluorescent polypeptide region. The fluorescent polypeptide region is joined directly, or though a polypeptide linker, to the carboxy side of the MCH-R polypeptide region.
Another aspect of the present invention describes an MCH-R chimeric protein. The protein comprises: (a) an MCH-R binding region characteristic of a human MCH-R, (b) a transmembrane domain characteristic of a non-human MCH-R, and (c) an intracellular domain characteristic of a non-human MCH-R.
Another aspect of the present invention describes a nucleic acid encoding for an MCH-R fusion protein or an MCH-R chimeric protein described herein. Such nucleic acid comprises either a contiguous nucleotide sequence that _2_ codes for the protein or a sequence that is processed by a host cell to produce a contiguous nucleotide sequence encoding for the protein. Processing of a nucleic acid sequence to produce a contiguous nucleotide sequence encoding for a protein can occur by the splicing together of exons resulting in intron removal.
Another aspect of the present invention describes an expression vector comprising a nucleic acid encoding for an MCH-R fusion protein or an MCH-R
chimeric protein described herein.
Another aspect of the present invention describes a recombinant cell comprising nucleic acid encoding for an MCH-R fusion protein or an MCH-R
chimeric protein described herein. The nucleic acid may be part of the host genome or may exist independently of the host genome.
Another aspect of the present invention describes a non-human transgenic animal comprising nucleic acid encoding for an MCH-R fusion protein or an MCH-R chimeric protein described herein.
Another aspect of the present invention describes a method for assaying for MCH-R active compounds by measuring the effect of a test preparation on one or more MCH-R activities. The method is performed using either an MCH-R
fusion protein or an MCH-R chimeric protein described herein.
Other features and advantages of the present invention are apparent from the additional descriptions provided herein including the different examples.
The provided examples illustrate different components and methodology useful in practicing the present invention. The examples do not limit the claimed invention.
Based on the present disclosure the skilled artisan can identify and employ other components and methodology useful for practicing the present invention.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 illustrates aequorin assay results comparing a mouse MCH-R
fusion with a human wild type MCH-R and a CMV-EGFP control.
Figure 2 illustrates a cAMP flashplate assay of CHO cell clones stably expressing mMCH-1R-EGFP. Cells from individual clones were dissociated in enzyme free media and stimulated for 15 minutes at 37°C with human MCH
at the indicated concentrations in the presence of 10 ~M forskolin. Cells were then lysed and assayed for bound ~l2sl]CAMP. Mouse MCH-1R-EGFP clones exhibited EC50 values (0.1111, 0.1255, 0.1291, or 0.2304 nM) indistinguishable from that of a CHO
cell clone expressing the wild-type human short isoform of MCH-1R ( 0.1282 nM).
Figure 3 illustrates a cAMP flashplate assay of CHO cell clones stably expressing human short/mouse species chimeric MCH-1R-EGFP. Cells from individual clones were dissociated in enzyme free media and stimulated for 15 minutes at 37°C with human MCH at the indicated concentrations in the presence of ~M forskolin. Cells were then lysed and assayed for bound [12~I]CAMP. Human short/mouse species chimeric MCH-1R-EGFP clones exhibited EC50 values (0.0366, 0.0462, 0.2117, or 0.2499 nM) indistinguishable from that of a CHO cell clone expressing the wild-type human short isoform of MCH-1R (0.1137 nM).
The present invention features MCH-R chimeric and fusion proteins.
Such proteins have a variety of different uses including being used as a research tool to study MCH-R function and dynamics, and being used to screen for MCH-R
agonists and antagonists.
The MCH-R provides a target to achieve different beneficial effects in a patient. Preferably, MCH-R activity is modulated to achieve one or more of the following: weight loss, weight gain, treat cancer (e.g., colon or breast), reduce pain, treat diabetes, reduce stress, or teat sexual dysfunction.
Modulation of MCH-R activity can be achieved by evoking a response at the MCH receptor or by altering a response evoked by an MCH receptor agonist or antagonist. Compounds modulating MCH-R receptor activity include agonists, antagonists, and allosteric modulators. Generally, MCH-R antagonists and allosteric modulators negatively affecting activity will be used to achieve weight loss, treat cancer (e.g., colon or breast), reduce pain, reduce stress, or teat sexual dysfunction;
and MCH-R agonists and allosteric modulators positively affecting activity will be used to produce a weight gain.
Preferably, MCH-R activity is modulated to achieve a weight loss or to treat diabetes in a patient. Diabetes mellitus can be treated by modulating MCH-R
activity to achieve, for example, one or both of the following: enhancing glucose tolerance or decreasing insulin resistance.
Excessive body weight is a contributing factor to different diseases, including hypertension, diabetes, dyslipidemias, cardiovascular disease, gall stones, osteoarthritis, and certain forms of cancers. Bringing about a weight loss can be used, for example, to reduce the likelihood of such diseases and as part of a treatment for such diseases. Weight reduction can be achieved by modulating MCH-R activity to obtain, for example, one or more of the following effects: reducing appetite, increasing metabolic rate, reducing fat intake, or reducing carbohydrate craving.
Increasing body weight is particularly useful for a patient having a disease or disorder, or under going a treatment, accompanied by weight loss.
Examples of diseases or disorders accompanied by weight loss include anorexia, AIDS, wasting, cachexia, and frail elderly. Examples of treatments accompanied by weight Ioss include chemotherapy and radiation therapy.
MCH-R Chimeric Proteins MCH-R chimeric proteins contain an MCH-R polypeptide region made up by at least two or more polypeptide regions characteristic of MCH-R found in different species. The different polypeptide regions that are present provide for an N-terminal extracellular domain; a transmembrane domain made up of transmembrane regions, extracellular loop regions, and intracellular loop regions; and an intracellular carboxy terminus domain. Examples of MCH-R amino acid sequences include the following: SEQ. ID. NO. 1 (human MCH1R long form), SEQ. ID. NO. 2 (human MCH1R short form), and SEQ. ID. NO. 3 (mouse MCH1R).
Preferably, the MCH-R chimeric protein comprises an MCH-R binding region characteristic of a human MCH-R along with transmembrane and intracellular domains characteristic of a non-human MCH-R. There are substantial amino acid differences between the N-terminus of the MCH-R present in humans and that present in other species such as mice. Such differences could result in, for example, the mouse MCH-R having different intrinsic properties and responsiveness to agonists and/or antagonists than the human MCH-R. The presence of a human MCH-R
binding region provides for a "humanized" MCH-R chimeric receptor.
The transmembrane and intracellular domains characteristic of a non-human MCH-R can be used in conjunction with a non-human host to provide a more naturally occurring environment for these regions. For example, an MCH-R
chimeric having mouse transmembrane and intracellular domains are preferably used in murine cells lines or in transgenic mice.
MCH-R chimeric proteins may contain regions other than extracellular, transmembrane, and intracellular domains that do not substantially decrease the activity of the protein. Preferably, additional regions do not cause a decrease of more than about 25°70 of MCH-R activity as measured using one or more of the assays described in the examples provided below. Examples of additional regions that may be present include fluorescent protein regions and linker regions.
In an embodiment of the present invention, the MCH-R chimeric protein comprises: (a) an MCH binding region characteristic of a first species and (b) a transmembrane and intracellular domain region characteristic of a second species joined directly, or though a linker, to the carboxy side of the MCH binding region.
Preferably, the protein comprises, consists, or consists essentially of an MCH-R
polypeptide having a sequence similarity of at least about 75%, at least 85%, or at least 95% with either SEQ.1D. NO. 4 (human short form/mouse species chimeric MCH1R) or SEQ. ID. NO. 5 (human Long form/mouse species chimeric). Even more preferably, the protein comprises, consists essentially of, or consists of, SEQ. )D. NO.
4 or SEQ. ID. NO. 5.
Sequence similarity for polypeptides can be determined by BLAST.
(Altschul et al., 1997. Nucleic Acids Res. 25, 3389-3402, hereby incorporated by reference herein.) In an embodiment of the present invention, sequence similarity is determined using tBLASTn search program with the following parameters:
MATRIX:BLOSLTM62, PER RESIDUE GAP COST: 11, and Lambda ratio: 1.
Differences in naturally occurring amino acids are due to different R
groups. An R group effects different properties of the amino acid such as physical size, charge, and hydrophobicity. Amino acids can be divided into different groups as follows: neutral and hydrophobic (alanine, valine, leucine, isoleucine, proline, tryptophan, phenylalanine, and methionine); neutral and polar (glycine, serine, threonine, tyrosine, cysteine, asparagine, and glutamine); basic (lysine, arginine, and histidine); and acidic (aspartic acid and glutamic acid).
Generally, in substituting different amino acids it is preferable to exchange amino acids having similar properties. Substituting different amino acids within a particular group, such as substituting valine for leucine, arginine for lysine, and asparagine for glutamine are good candidates for not causing a change in polypeptide functioning.
Changes outside of different amino acids groups can also be made.
Preferably, such changes are made taking into account the position of the amino acid to be substituted in the polypeptide. For example, arginine can substitute more freely for nonpolor amino acids in the interior of a polypeptide then glutamate because of its long aliphatic side chain. (See, Ausubel, Current Protocols in Molecular Biology, John Wiley, 1987-1998, Supplement 33 Appendix 1C.) MCH-R Fusion Proteins MCH-R fusion proteins contain an MCH-R polypeptide region and a fluorescent protein region either directly joined together or joined together through a linker. These regions provide MCH-R activity and a marker for evaluating MCH-R
dynamics.
An MCH-R polypeptide region provides functional MCH-R activity and includes naturally occurring MCH-R, chimeric MCH-R, and derivatives thereof.
Preferred derivatives thereof have a sequence similarity of at least about 75%, at least about 85%, or at least about 95% to a naturally occurring MCH-R or a chimeric MCH-R described herein.
A fluorescent protein region contains a chromophore that fluoresces.
Preferably, the fluorescent protein region is the green fluorescent protein of the jellyfish Aequorea victories or a derivative thereof. Preferred derivatives have a sequence similarity of at least about 75%, at least about 85%, or at least about 95% to the Aequorea victories green fluorescent protein (GFP). The Aequorea victories green fluorescent protein and examples of derivatives thereof are described by Cormack et al., 1996. Gene 17, 33-38; Yang et al., 1996. Nucleic Acids Research 24, 4592-4593;
Tsien et al., U.S. Patent No. 5,625,048; Tsien et al., U.S. Patent No.
5,777,079; and Cormack et al., U.S. Patent No. 5,804,387 (each of which are hereby incorporated by reference herein).
In different embodiments the MCH-R polypeptide region comprises, consists essentially of, or consists of, a sequence selected from the group consisting of: SEQ. m. NO. 1, SEQ. m. NO. 2, SEQ.1D. NO. 3, SEQ. m. NO. 4, and SEQ. m.
NO. 5; and the fluorescent polypeptide region comprises, consists essentially of, or consists of, an amino acid sequence selected from the group consisting of SEQ.
m.
NO. 6 (GFP), SEQ. ID. NO. 7 (EGFP), SEQ. m. NO. 8 (Emerald), SEQ. m. NO. 9 (Topaz), and SEQ. )D. NO. 10 (Wlb). EGFP, Emerald, Topaz, and Wlb are derivatives of GFP.
The optionally present linker is a polypeptide region that is preferably from 1 to about 100 amino acids in length. In different embodiments the linker is up to 75, 50 or 25 amino acids in length.
Preferably, the MCH-R fusion protein comprises, consists essentially of, or consists of, the MCH-R polypeptide region and the fluorescent polypeptide region. More preferably, the protein comprises, consists essentially of, or consists of, _7_ an amino acid sequence selected from the group consisting of: SEQ. ID. NO. 11 (mouse MCH1R-linker-EGFP), SEQ. ID. NO. 12 (mouse MCH1R/EGFP direct fusion), SEQ. ID. NO. 13 (human short form/mouse species chimeric MCH1R-linker-EGFP), or SEQ. ID. NO. 14 (human long form/mouse species chimeric MCH1R-linker-EGFP).
MCH-R Chimeric and Fusion Proteins Nucleic Acid and Expression MCH-R chimeric and fusion proteins can be produced using techniques well known in the art. Preferably, such proteins are produced by recombinant expression inside a host cell by way of an expression vector or by way of nucleic acid integrated into the host genome. Examples of nucleic acid sequences encoding for MCH-R polypeptide regions, fluorescent protein regions, MCH-R
chimeric proteins, and MCH-R fusion proteins are provided for by SEQ. ID. NOs.
29 (see Example 1, infra).
Starting with a particular amino acid sequence and the known degeneracy of the genetic code, a large number of different encoding nucleic acid sequences can be obtained. The degeneracy of the genetic code arises because almost all amino acids are encoded for by different combinations of nucleotide triplets or codons. The translation of a particular codon into a particular amino acid is well known in the art (see, e.g., Lewin GENES IV, p. 119, Oxford University Press, 1990).
Amino acids are encoded for by codons as follows:
A=Ala=Alanine: codons GCA, GCC, GCG, GCU
C=Cys=Cysteine: codons UGC, UGU
D=Asp=Aspartic acid: codons GAC, GAU
E=Glu=Glutamic acid: codons GAA, GAG
F=Phe=Phenylalanine: codons UUC, UUU
G=Gly=Glycine: codons GGA, GGC, GGG, GGU
H=His=Histidine: codons CAC, CAU
I=Ile=Isoleucine: codons AUA, AUC, AUU
K=Lys=Lysine: codons AAA, AAG
L=Leu=Leucine: codons UUA, UUG, CUA, CUC, CUG, CUU
M=Met=Methionine: codon AUG
N=Asn=Asparagine: codons AAC, AAU
P=Pro=Proline: codons CCA, CCC, CCG, CCU
Q=Gln=Glutamine: codons CAA, CAG
_g_ R=Arg=Arginine: codons AGA, AGG, CGA, CGC, CGG, CGU
S=Ser=Serine: codons AGC, AGU, UCA, UCC, UCG, UCU
T=Thr=Threonine: codons ACA, ACC, ACG, ACU
V=Val=Valine: codons GUA, GUC, GUG, GUU
W=Trp=Tryptophan: codon UGG
Y=Tyr=Tyrosine: codons UAC, UAU
Examples of techniques for introducing nucleic acid into a cell and expressing the nucleic acid to produce protein are provided in references such as Ausubel, Current Protocols ifa Molecular Biology, John Wiley, 1987-1998, and Sambrook, et al., in Molecular Cloni~eg, A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory Press, 1989.
An expression vector contains recombinant nucleic acid encoding for a polypeptide along with regulatory elements for proper transcription and processing.
The recombinant nucleic acid contains two or more nucleic acid regions not naturally associated with each other. Exogenous regulatory elements such as an exogenous promoter can be useful for expressing recombinant nucleic acid in a particular host.
Examples of expression vectors are cloning vectors, modified cloning vectors, specifically designed plasmids, and viruses.
Generally, the regulatory elements that are present in an expression vector include a transcriptional promoter, a ribosome binding site, a terminator, and an optionally present operator. Another preferred element is a polyadenylation signal providing for processing in eukaryotic cells. Preferably, an expression vector also contains an origin of replication for autonomous replication in a host cell, a selectable marker, a limited number of useful restriction enzyme sites, and a potential for high copy number.
Expression vectors providing suitable levels of polypeptide expression in different hosts are well known in the art. Mammalian expression vectors well known in the art include pcDNA3 (Invitrogen), pMClneo (Stratagene), pXT1 (Stratagene), pSGS (Stratagene), EBO-pSV2-neo (ATCC 37593), pBPV-1(8-2) (ATCC 37110), pdBPV-MMTneo(342-12) (ATCC 37224), pRSVgpt (ATCC 37199), pRSVneo (ATCC 37198), pSV2-dhfr (ATCC 37146), pUCTag (ATCC 37460), pCI-neo (Promega) and .lambda.ZD35 (ATCC 37565). Bacterial expression vectors well known in the art include pETlla (Novagen), lambda gtll (Invitrogen), pcDNAII
(Invitrogen), and pKK223-3 (Pharmacia). Fungal cell expression vectors well known _g_ in the art include pYES2 (Invitrogen) and Pichia expression vector (Invitrogen).
Insect cell expression vectors well known in the art include Blue Bac III
(Invitrogen).
Recombinant host cells may be prokaryotic or eukaryotic. Examples of recombinant host cells include the following: bacteria such as E. coli;
fungal cells such as yeast; mammalian cells such as human, bovine, porcine, monkey, hampster, and rodent; and insect cells such as Drosophila and silkworm derived cell lines.
Commercially available mammalian cell lines include L cells L-M(TI~-) (ATCC
CCL 1.3), L cells L-M (ATCC CCL 1.2), 293 (ATCC CRL 1573), Raji (ATCC CCL
86), CV-1 (ATCC CCL 70),,COS-1 (ATCC CRL 1650), COS-7 (ATCC CRL 1651), CHO-K1 (ATCC CCL 61), 3T3 (ATCC CCL 92), NIH/3T3 (ATCC CRL 1658), HeLa (ATCC CCL 2), C127I (ATCC CRL 1616), BS-C-1 (ATCC CCL 26) and MRC-5 (ATCC CCL 171).
To enhance expression in a particular host it may be useful to modify the sequence to take into account codon usage of the host. Codon usage of different organisms are well known in the art. (See, Ausubel, Current Protocols ih Molecular Biology, John Wiley, 1987-1998, Supplement 33 Appendix 1C.) Expression vectors may be introduced into host cells using standard techniques. Examples of such techniques include transformation, transfection, lipofection, protoplast fusion, and electroporation.
Nucleic acid encoding for a polypeptide can be expressed in a cell without the use of an expression vector employing, for example, synthetic mRNA
or native mRNA. Additionally, mRNA can be translated in various cell-free systems such as wheat germ extracts and reticulocyte extracts, as well as in cell based systems, such as frog oocytes. Introduction of mRNA into cell based systems can be achieved, for example, by microinjection.
Techniques for producing transgenic animals are well known in the art.
Examples of such techniques are provided for by Teratocarcinomas and embryonic stem cells: a practical approach. Ed. By E. J. Robertson, IRL Press Limited, Oxford, England (1987); and Gene Targeting: a practical approach. Ed. By A. L. Joyner, Oxford University Press Inc. New York, NY (1993).
G-Protein Coupled Receptor Assaxs MCH-R is G-protein coupled receptor. Techniques for measuring different G-protein activities, such as Gi/o, Gs, and Gq are well known in the art.
MCH-R activity is preferably assayed for by measuring either Gi/o or Gq.
Gi/o and Gs activity can be measured using techniques such as a melonaphore assay, measuring cAMP production, measuring inhibition of cAMP
accumulation, and measuring binding of 35S-GTP. cAMP can be measured using different techniques such as radioimmunoassay and indirectly by cAMP
responsive gene reporter proteins.
Gq activity can be measured using techniques such as those measuring intracellular Ca2+. Examples of techniques well known in the art that can be employed to measure Ca2+ include the use of dyes such as Fura-2 and the use of Ca2+-bioluminescent sensitive reporter proteins such as aequorin. An example of a cell line employing aequorin to measure G-protein activity is HEK293/aeql7. (Button et al.,1993. Cell Calcium 14, 663-671, and Feighner et al., 1999. Scief2ce 284, 2188, both of which are hereby incorporated by reference herein.) Functional assays can be performed using individual compounds or preparations containing different compounds. A preparation containing different compounds where one or more compounds affect MCH-R chimeric or fusion protein activity can be divided into smaller groups of compounds to identify the compounds) affecting MCH-R chimeric or fusion protein activity. In an embodiment of the present invention a test preparation containing at least 10 compounds is used in a functional assay.
Functional assays can be performed using recombinantly produced MCH-R chimeric or fusion protein present in different environments. Such environments include, for example, cell extracts and purified cell extracts containing the MCH-R chimeric or fusion protein expressed from recombinant nucleic acid and an appropriate membrane for the polypeptide; and the use of a purified MCH-R
chimeric or fusion protein produced by recombinant means that is introduced into a different environment suitable for measuring G-protein activity.
Fluorescent Protein Assays Fluorescent protein joined to an MCH receptor can be employed to study different aspects of receptor dynamics including receptor sequestration, receptor densitization, and receptor localization. The fluorescent protein can be used in ira vitro or in vivo systems.
ha vitro applications of fluorescent proteins can be performed using techniques well known in the art. Examples of such techniques are provided by Barak et al., 1997. Mol PlaaYm. S, 177-184; Tarasova et al., 1997. J. Biol. Chem.
272, 14817-14824; Lin et al., 1998. Mol. Cell. Endo. 146, 27-37; Tarasova et al., 1998. J.
Biol. Chenz. 273, 15883-15886; Kallal et al., 1998. J. Biol. Chem. 273, 322-328;
Groake et al., 1999. J. Biol. Chem. 274, 23263-23269; Doherty et al., 1999.
Biochem. J. 341, 415-422; Brock et al., 1999. Proc. Natl. Acad. Sci. USA 96, 10128; Cornea et al., 1999. Endocri~zology 140, 4272-4280; and Lembo et al., 1999.
Nat. Cell Biol. 1, 267-271 (these references are not admitted to be prior art to the claimed invention).
IfZ vivo applications of fluorescent proteins can be performed using techniques well known in the art. Examples of such techniques are provided by Mombaerts et al., 1996. Cell 87, 675-686; Rodriquez et al., 1999. Cell 97, 199-208;
Spergel et al., 1999. J. Neurosci. l, 2037-2050; and Zuo et al., 1999. Proc.
Natl.
Acad. Sci. USA 96, 14100-14105 (these references are not admitted to be prior art to the claimed invention).
EXAMPLES
Examples are provided below to further illustrate different features and advantages of the present invention. The examples also illustrate useful methodology for practicing the invention. These examples do not limit the claimed invention.
Example 1:, Amino acid and nucleic acid sequence information for SEQ..ID. NOs.
1-29 are provided below. SEQ. ~. NOs. 1-29 include examples of polypeptide and encoding nucleic acid sequences for MCH-R polypeptide regions, fluorescent polypeptide regions, fusion proteins and chimeric proteins. In some cases the encoding nucleic acid is shown with additional nucleic acid upsteam or downstream from an open reading frame.
SEQ. ID. N0.1: Human long form MCH1R
MS VGAMKKGVGRAVGLGGGSGCQATEEDPLPNCGACAPGQGGRRWRLPQP
AWVEGSSARLWEQATGTGWMDLEASLLPTGPNASNTSDGPDNLTSAGSPPR
LLFLLGMPFMIHQLMGNGV WHFGETMCTLITAMDANS QFTSTYILTAMAmR
YLATVHPISSTKFRKPSVATLVICLLWALSFISITPVWLYARLIPFPGGAVGCGI
RLPNPDTDLYWFTLYQFFLAFALPFV VITAAYVRILQRMTS S VAPAS QRSIRLR
TKRVTRTAIAICLVFFVCWAPYYVLQLTQLSISRPTLTFVYLYNAAISLGYANS
CLNPFVYIVLCETFRKRLVLS VKPAAQGQLRAVSNAQTADEERTESKGT
SEQ. ID. NO. 2: Human short form MCH1R
MDLEASLLPTGPNASNTSDGPDNLTSAGSPPRTGSISYINmVIPSVFGTICLLGIIG
NSTVIFAV VKKSKLHW CNNVPDIFBNLS V VDLLFLLGMPFMIHQLMGNGV WH
FGETMCTLITAMDANSQFTSTYILTAMAlDRYLATVHPISSTKFRKPSVATLVI
CLLWALSFISITPVWLYARLIPFPGGAVGCGIRLPNPDTDLYWFTLYQFFLAFA
LPFV VITAAYVRILQRMTS S VAPAS QRSIRLRTKRVTRTAIAICLVFFV C WAPY
YVLQLTQLSISRPTLTFVYLYNAAISLGYANSCLNPFVYIVLCETFRKRLVLSV
KPAAQGQLRAVSNAQTADEERTESKGT
SEQ. ID. NO. 3: Mouse MCH1R
MDLQASLLS TGPNASNISDGQDNFTLAGPPPRTRS VS YIIVIIMPS VFGTICLLGI
VGNSTVIFAVVKKSKLHWCSNVPDIFIINLSVVDLLFLLGMPFMIHQLMGNGV
WHFGETMCTLITAMDANSQFTSTYILTAMAIDRYLATVHPISSTKFRKPSMAT
LVICLLWALSFISITPVWLYARLIPFPGGAVGCGIRLPNPDTDLYWF TLYQFFLA
FALPFV VITAAYVKILQRMTS S VAPAS QRSIRLRTKRVTRTAIAICLVFFV C WA
PYYVLQLTQLSISRPTLTFVYLYNAAISLGYANSCLNPFVYIVLCETFRKRLVLS
VKPAAQGQLRTVSNAQTADEERTESKGT
SEQ. ID. NO. 4: Human short form/mouse species chimeric MCH1R
MDLEASLLPTGPNASNTSDGPDNLTSAGSPPRTGSISYIIVI>MPS VFGTICLLGIIG
FGETMCTLITAMDANSQFTSTYILTAMAIDRYLATVHPISSTKFRKPSMATLVI
CLLWALSFISITPVWLYARLIPFPGGAVGCGIRLPNPDTDLYWFTLYQFFLAFA
YVLQLTQLSISRPTLTFVYLYNAAISLGYANSCLNPFVYIVLCETFRKRLVLSV
KPAAQGQLRTVSNAQTADEERTESKGT
SEQ. TD. NO. 5: Human long form/mouse species chimeric MCH1R
MSVGAMKKGVGRAVGLGGGSGCQATEEDPLPNCGACAPGQGGRRWRLPQP
AWVEGSSARLWEQATGTGWMDLEASLLPTGPNASNTSDGPDNLTSAGSPPR
TGSISYINIIMPSVFGTICLLGIIGNSTVIFAVVKKSKLHWCNNVPDIFIINLSVVD
T.T.FT-T.GMPFM113QLMGNGVWHFGETMCTLITAMDANSQFTSTYILTAMAIDR
YLATVHPIS STKFRKPSMATLVICLLWALSFISITPV WLYARLIPFPGGAV GCGI
RLPNPDTDLYWFTLYQFFLAFALPFV VITAAYVKILQRMTS S VAPAS QRSIRLR
TKRVTRTAIAICLVFFVCWAPYYVLQLTQLSISRPTLTFVYLYNAAISLGYANS
CLNPFVYIVLCETFRKRLVLS VKPAAQGQLRTVSNAQTADEERTESKGT
SEQ. ID. NO. 6: GFP
MSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKL
PVPWPTLVTTFSYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGN
KNGIKVNFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKD
PNEKRDHMVLLEFVTAAGITHGMDELYK
SEQ. ID. NO. 7: EGFP
MVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGK
LPVPWPTLVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDG
NYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSHNVYIMADK
QKNGIKVNFKIRHNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSK
DPNEKRDHMVLLEFVTAAGITLGMDELYK
SEQ. ID. NO. 8: Emerald Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu VaI Lys Phe Glu GIy Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn lle Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Lys Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro lle Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys SEQ. ID. NO. 9: Topaz Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe Gly Tyr Gly Val Gln Cys Phe Ala Arg Tyr Pro Asp His Met Arg Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Va1 Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr GIn Gln Asn Thr Pro Ile Gly Asp Gly Pro Va1 Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gln Sex Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys SEQ. ID. N0.10: W1B
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Arg Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Trp Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr lle Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Ile Ser His Asn Val Tyr Ile Thr Ala Asp Lys GIn Lys Asn Gly Ile Lys Ala His Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu AIa Asp His Tyr GIn Gln Asn Thr Pro Ile GIy Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys SEQ. ID. N0.11: Mouse MCH1R-linker-EGFP
MDLQASLLSTGPNASNISDGQDNFTLAGPPPRTRSVSYINmVIPSVFGTICLLGI
VGNSTVIFAVVKKSKLHWCSNVPDIFaNLSVVDLLFLLGMPFMIHQLMGNGV
WHFGETMCTLITAMDANS QFTSTYILTAMAIDRYLATVHPIS STI~FRRKPSMAT
LVICLLWALSFISITPVWLYARLIPFPGGAVGCGIRLPNPDTDLYWF'TLYQFFLA
FALPFVVITAAYVKILQRMTSSVAPASQRSIRLRTKRVTRTAIAICLVFFVCWA
PYYVLQLTQLSISRPTLTFVYLYNAAISLGYANSCLNPFVYIVLCETFRKRLVLS
VKPAAQGQLRTVSNAQTADEERTESKGTVDGTAGPGSIATMVSKGEELFTGV
VPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTL
TYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFE
HNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVL
LEFVTAAGITLGMDELYK
SEQ. ID. N0.12: Mouse MCH1R/EGFP direct fusion MDLQASLLSTGPNASNISDGQDNFTLAGPPPRTRS V S YIIVIIMPS VFGTICLLGI
WHFGETMCTLITAMDANSQFTSTYILTAMA>DRYLATVHPISSTKFRKPSMAT
LVICLLWALSFISITPVWLYARLIPFPGGAVGCGIRLPNPDTDLYWFTLYQFFLA
FALPFVVITAAYVKILQRMTSSVAPASQRSIRLRTKRVTRTAIAICLVFFVCWA
PYYVLQLTQLSISRPTLTFVYLYNAAISLGYANSCLNPFVYIVLCETFRKRLVLS
VKPAAQGQLRTVSNAQTADEERTESKGTMVSKGEELFTGVVPILVELDGDVN
GHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTLTYGVQCFSRYPD
HYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITLG
MDELYK
SEQ. ID. N0.13: Human short form/mouse species chimeric MCH1R-linker-EGFP
MDLEASLLPTGPNASNTSDGPDNLTSAGSPPRTGSISYINIIn~IPSVFGTICLLGIIG
FGETMCTLITAMDANSQFTSTYII,TAMAIDRYLATVHPISSTKFRKPSMATLVI
CLLWALSFISITPVWLYARLIPFPGGAVGCGIRLPNPDTDLYWF'TLYQFFLAFA
LPFVVITAAYVKILQRMTSS VAPASQRSIRLRTKRVTRTAIAICLVFFVCWAPY
YVLQLTQLSISRPTLTFVYLYNAAISLGYANSCLNPFVYIVLCETFRKRLVLS V
KPAAQGQLRTVSNAQTADEERTESKGTVDGTAGPGSIATMVSKGEELFTGVV
PILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPTLVTTLT
NIEDGS VQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLL
EFVTAAGITLGMDELYK
SEQ. ID. N0.14: Human Iong form/mouse species chimeric MCHIR-linker-EGFP
MS VGAMKKGVGRAVGLGGGSGCQATEEDPLPNCGACAPGQGGRRWRLPQP
AWVEGSSARLWEQATGTGWMDLEASLLPTGPNASNTSDGPDNLTSAGSPPR
TGSISYINInVIPSVFGTICLLGIIGNSTVIFAVVKKSKLHWCNNVPDIFaNLSVVD
LLFLLGMPFMIHQLMGNGVWHFGETMCTLITAMDANSQFTSTYILTAMAIDR
YLATVHPISSTKFRKPSMATLVICLLWALSFISITPVWLYARLIPFPGGAVGCGI
RLPNPDTDLYWFTLYQFFLAFALPFV VITAAYVKILQRMTS S VAPAS QRSIRLR
TKRVTRTAIAICLVFFVCWAPYYVLQLTQLSISRPTLTFVYLYNAAISLGYANS
CLNPFVYIVLCETFRKRLVLSVKPAAQGQLRTVSNAQTADEERTESKGTVDGT
AGPGSIATMVSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLT
LKFICTTGKLPVPWPTLVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQE
RTIFFKDDGNYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNYNSH
LSTQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYK
SEQ. ID. N0.15: Human long form MCH1R cDNA
ATGTCAGTGGGAGCCATGAAGAAGGGAGTGGGGAGGGCAGTTGGGCTTG
GAGGCGGCAGCGGCTGCCAGGCTACGGAGGAAGACCCCCTTCCCAACTGC
GGGGCTTGCGCTCCGGGACAAGGTGGCAGGCGCTGGAGGCTGCCGCAGC
CTGCGTGGGTGGAGGGGAGCTCAGCTCGGTTGTGGGAGCAGGCGACCGG
CACTGGCTGGATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAACG
CCAGCAACACCTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCT
CCTCGCACGGGGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTC
GGCACCATCTGCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCG
GTCGTGAAGAAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTT
CATCATCAACCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTT
CATGATCCACCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCA
TGTGCACCCTCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACC
TACATCCTGACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCC
ATCTCTTCCACGAAGTTCCGGAAGCCCTCTGTGGCCACCCTGGTGATCTGC
CTCCTGTGGGCCCTCTCCTTCATCAGCATCACCCCTGTGTGGCTGTATGCC
AGACTCATCCCCTTCCCAGGAGGTGCAGTGGGCTGCGGCATACGCCTGCC
CAACCCAGACACTGACCTCTACTGGTTCACCCTGTACCAGTTTTTCCTGGC
CTTTGCCCTGCCT'TTTGTGGTCATCACAGCCGCATACGTGAGGATCCTGCA
GCGCATGACGTCCTCAGTGGCCCCCGCCTCCCAGCGCAGCATCCGGCTGC
GGACAAAGAGGGTGACCCGCACAGCCATCGCCATCTGTCTGGTCTTCTTT
GTGTGCTGGGCACCCTACTATGTGCTACAGCTGACCCAGTTGTCCATCAGC
CGCCCGACCCTCACCTTTGTCTACTTATACAATGCGGCCATCAGCTTGGGC
TATGCCAACAGCTGCCTCAACCCCTTTGTGTACATCGTGCTCTGTGAGACG
TTCCGCAAACGCTTGGTCCTGTCGGTGAAGCCTGCAGCCCAGGGGCAGCT
TCGCGCTGTCAGCAACGCTCAGACGGCTGACGAGGAGAGGACAGAAAGC
AAAGGCACCTGA
SEQ. ID. N0.16: Human short form MCH1R cDNA
ATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAATGCCAGCAACAC
CTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCTCCTCGCACGG
GGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTCGGCACCATCT
GCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCGGTCGTGAAG
AAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTTCATCATCAA
CCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTTCATGATCCA
CCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCATGTGCACCC
TCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACCTACATCCTG
ACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCCATCTCTTCC
ACGAAGTTCCGGAAGCCCTCTGTGGCCACCCTGGTGATCTGCCTCCTGTGG
GCCCTCTCCTTCATCAGCATCACCCCTGTGTGGCTGTATGCCAGACTCATC
CCCTTCCCAGGAGGTGCAGTGGGCTGCGGCATACGCCTGCCCAACCCAGA
CACTGACCTCTACTGGTTCACCCTGTACCAGTTTTTCCTGGCCTTTGCCCTG
CCTTTTGTGGTCATCACAGCCGCATACGTGAGGATCCTGCAGCGCATGAC
GTCCTCAGTGGCCCCCGCCTCCCAGCGCAGCATCCGGCTGCGGACAAAGA
GGGTGACCCGCACAGCCATCGCCATCTGTCTGGTCTTCTTTGTGTGCTGGG
CACCCTACTATGTGCTACAGCTGACCCAGTTGTCCATCAGCCGCCCGACCC
TCACCTTTGTCTACTTATACAATGCGGCCATCAGCTTGGGCTATGCCAACA
GCTGCCTCAACCCCTTTGTGTACATCGTGCTCTGTGAGACGTTCCGCAAAC
GCTTGGTCCTGTCGGTGAAGCCTGCAGCCCAGGGGCAGCTTCGCGCTGTC
AGCAACGCTCAGACGGCTGACGAGGAGAGGACAGAAAGCAAAGGCACCT
GA
_18_ SEQ. ID. N0.17: Mouse MCH1R cDNA
Nucleic acid sequence start and stop codons are highlighted:
GGCGGTAGAGGAAGACCCTTTTCTGGACTGCGGGGCTCAAGCTCCGGACA
AGGCGGTGGAGGGCGCTGGAGGCTGCCGCAGCCTGCGTGGGTGGACGGG
CGCTCCACTCCAGGGAGCAGGCGACCTGCACCGGCTGCATGGATCTGCAA
GCCTCGTTGCTGTCCACTGGCCCCAATGCCAGCAACATCTCCGATGGCCA
GGATAATTTCACATTGGCGGGGCCACCTCCTCGCACAAGGAGTGTCTCCT
ACATCAACATCATCATGCCTTCAGTGTTTGGTACCATCTGTCTCCTGGGCA
TTGTGGGAAACTCCACAGTCATTTTTGCCGTGGTGAAGAAATCCAAGCTG
CACTGGTGCAGCAACGTCCCTGACATCTTCATCATCAACCTCTCTGTGGTG
GATCTGCTTTTCCTGCTGGGCATGCCTTTCATGATCCACCAGCTCATGGGT
AATGGTGTCTGGCACTTTGGGGAAACCATGTGCACCCTCATCACAGCCAT
GGACGCCAACAGTCAGTTCACCAGCACCTACATCCTGACTGCTATGGCCA
TTGACCGCTACTTGGCCACCGTCCATCCCATCTCCTCCACCAAGTTCCGGA
AGCCCTCCATGGCCACCCTGGTGATCTGCCTCCTGTGGGCTCTCTCGTTCA
TTAGCATCACTCCTGTGTGGCTCTATGCCAGGCTTATCCCCTTCCCAGGGG
GTGCTGTGGGCTGTGGCATCCGCCTACCAAACCCAGATACTGATCTTTACT
GGTTCACTCTGTATCAGTTTTTCCTGGCCTTCGCCCTTCCGTTTGTGGTCAT
CACTGCTGCGTACGTGAAAATACTACAGCGCATGACGTCTTCGGTGGCCC
CAGCCTCTCAACGCAGCATCCGGCTTCGGACAAAGAGGGTGACCCGCACA
GCCATTGCCATCTGTCTGGTCTTCTTTGTGTGCTGGGCGCCCTACTACGTG
CTGCAGCTGACCCAGTTGTCCATCAGCCGCCCGACCCTCACATTCGTCTAC
CTGTACAATGCGGCCATCAGCTTGGGCTATGCCAACAGCTGCCTCAATCC
CTTTGTGTACATAGTACTCTGTGAGACCTTTCGAAAACGCTTGGTGCTGTC
GGTGAAGCCCGCGGCCCAGGGGCAGCTTCGCACGGTCAGCAATGCTCAGA
CAGCTGACGAGGAGAGGACAGAAAGCAAAGGCACCTGACAATCCCCCCC
GGTCACCTCCAAGTCAGGTCACCGCATCAAACCATGGGGAGAGATACTGA
GATAAACCCGGGGCTACCCTGGGAGGATGCAGAAGCTGGAGGCTGGGGG
CTTGTAGCAAACCACATTCCACGGGGCCCACAAATTGCTAGGGAGGCTTG
CAGCCTGGTTTGGGGGGGAAGCCTCAGACTGCAGGGATCCCCTTGACAGA
ATAGAAGCGGAGCAAGAAGGAAAGGGTGGTTTGACTGGTTCTCGGGGTCT
GTATCTGTTGGCTCGCATATATCTTTCTCTCAAGGGAAGAAGGCGGAGGT
GCCTAGCTGGGTTCCTTTAAAACTAGGCAGGGCTAGGATCTGAGCAGCTA
GGGCTCTACTGTGAGACTGGGCAAGCCGAGCGTTCCCTCCCATCTCTCATT
GGTGTTGATAGAAGGCAGTCTTTCTCCCAAGCTGGTGGATCTCCTGAAGC
ACGCTGCCTGGGCTCCAGCATCCTGTGCGGATTTCACGTTCTCTTTAGGGG
ATGCATGTTGACACTGGGGTGTGGGCTCTGAGCCCACAGGAGTTTAAAAA
ACCAAAAGAGCTCAGAGTGTCGAGAGAGACCCAATCACCGAGAATGACA
AGGCAACCTGGGGTGGATGTGGATCTTGAAACTAATAAAAAGGGGTTTTC
ACAGTGACAGCGACATTCTCTTCATAGGGCACAGCTGTCAGTCTATGGCT
GATCCAGAGCGAGCATCCATGAATTCTGCATGTGCAGGGGTCACTCTAAT
ACCTGATATGTTGGCATCATCTTTGTGCTTGAGCCTTCCNCTCCCAAATGG
GAATGAAATAAAGGCAAATTCCCNCCCCCCCCAAAAAAGGGGNAAAAAA
AAAAAAAAAAAAAAAAAAAA
SEQ. ID. N0.18: Mouse MCH1R genomic DNA
Nucleic acid sequence start and stop codons, as well as intron borders, are highlighted:
GGCGGTAGAGGAAGACCCTTTTCTGGACTGCGGGGCTCAAGCTCCGGACA
AGGCGGTGGAGGGCGCTGGAGGCTGCCGCAGCCTGCGTGGGTGGACGGG
CGCTCCACTCCAGGGAGCAGGCGACCTGCACCGGCTGCATGGATCTGCAA
GCCTCGTTGCTGTCCACTGGCCCCAATGCCAGCAACATCTCCGATGGCCA
GGATAATTTCACATTGGCGGGTGAGTCGAGTTGGAGTCCTCCCTCCTCCG
GGATGGGTGTGGAAAATGGGAAGGTTTCACCTCCCAAGCCAAACTGCCTG
GGAAACTTTATCTTACAGTTCTTGGTGATAAGATCTGCAGTCGGCTTTGCC
TGAAGAGGAAGAGGAGAGGAGGGGACACCAGCTAGGACAGAAGGGGCA
GGGAGGAATAGAGATGGGGCAGAGGCACATTTAGAAACAACAAGGGTTG
GTGACAAGACGTGAGGCAGGCTTGAGGGGAAAGCTTGCTGATGAGTCCCA
AATATGCTTTGCAGGGGGGGGGGGGGGGGAATCAAGGCTGGAGAAGCAA
GCAAGCAAGACAGCAAGACAGCGGGCGGGTAGTATGTGGGAGCCAGCAG
AAGCGCTTTGATTCACCGCTATCCTGGGCTCAATCCTCTGGCCTCGCACTG
GGGAAATGGGGTCTGAGTGGTCCTTGCTGTCTTCTGGCAAAGGCTGCTGG
GAGCAAAAGACTTCACAGGGCGTGAGAGGATTAACTTTTCTGGTGAATTA
AGCTTCTTGACATTTGCAGAACGTCAATGCCTTAAAATTCTAGCTCTGAAG
GAGAAGGGAATGAAGGGGAAAGAGGGAAGGTTGGTGTGGAGAAATTCCC
AAGCTTCTGGGGTGTAACACAGCTCCAGTCCCTACCCTATTGGGAAAGCC
CAGACTCAGGAGACATGGTCCAAGGAAATCCCTGACAGAAAACCGGGAG
AGGGCAGGGCTGTGGAGCCTGAAACACACCCCACACCCATGGTGACAGTC
ACTTCTCACATATGCCTAGGAACCTATCTGAAACCTTTGGCCATCTCTCTC
TGAAAAGATGAGGCTGCAAATACACACACACACACACACACACACACAC
ACACACACACACACACACACACACACACACACACAAATGTCCTTCAAGCC
TTTT'TGACAAGGTTTTCTGGTGGATCCCGGGGATATGAAGTTGTTCTCAGC
AGATATCTGGGAGTCTTGACTCCTGGCCCTCTGAGTAAATGGATGAAGCG
AAGAAGAATGGGGTCCTCTGAGTAACAGGTGGATCTAGAAAATCCTATAG
GAGTCACCAGGGCACGGTGGAGGAGGGTAAGGTACAGAACTAACAATAG
CCCGAGAAGGGGAAACAGCAGGAGATGATTCCAGAGACGTAGTGACCCC
AAGCTGCAAGGGAAAGCATGAGGGGCCAGCAGGAAGGCCGACATGGCAG
GTTGTCAGCTTCTAGATCGGAAGGCGGGTCACACTTGCTCTTTCTATCCTC
AGGGCCACCTCCTCGCACAAGGAGTGTCTCCTACATCAACATCATCATGC
CTTCAGTGTTTGGTACCATCTGTCTCCTGGGCATTGTGGGAAACTCCACAG
TCATTTTTGCCGTGGTGAAGAAATCCAAGCTGCACTGGTGCAGCAACGTC
CCTGACATCTTCATCATCAACCTCTCTGTGGTGGATCTGCTTTTCCTGCTGG
GCATGCCTTTCATGATCCACCAGCTCATGGGTAATGGTGTCTGGCACTTTG
GGGAAACCATGTGCACCCTCATCACAGCCATGGACGCCAACAGTCAGTTC
ACCAGCACCTACATCCTGACTGCTATGGCCATTGACCGCTACTTGGCCACC
GTCCATCCCATCTCCTCCACCAAGTTCCGGAAGCCCTCCATGGCCACCCTG
GTGATCTGCCTCCTGTGGGCTCTCTCGTTCATTAGCATCACTCCTGTGTGG
CTCTATGCCAGGCTTATCCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATC
CGCCTACCAAACCCAGATACTGATCTTTACTGGTTCACTCTGTATCAGTTT
TTCCTGGCCTTCGCCCTTCCGTTTGTGGTCATCACTGCTGCGTACGTGAAA
ATACTACAGCGCATGACGTCTTCGGTGGCCCCAGCCTCTCAACGCAGCAT
CCGGCTTCGGACAAAGAGGGTGACCCGCACAGCCATTGCCATCTGTCTGG
TCTTCTTTGTGTGCTGGGCGCCCTACTACGTGCTGCAGCTGACCCAGTTGT
CCATCAGCCGCCCGACCCTCACATTCGTCTACCTGTACAATGCGGCCATCA
GCTTGGGCTATGCCAACAGCTGCCTCAATCCCTTTGTGTACATAGTACTCT
GTGAGACCTTTCGAAAACGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAG
GGGCAGCTTCGCACGGTCAGCAATGCTCAGACAGCTGACGAGGAGAGGA
CAGAAAGCAAAGGCACCTGACAATCCCCCCCGGTCACCTCCAAGTCAGGT
CACCGCATCAAACCATGGGGAGAGATACTGAGATAAACCCGGGGCTACC
CTGGGAGGATGCAGAAGCTGGAGGCTGGGGGCTTGTAGCAAACCACATTC
CACGGGGCCCACAAATTGCTAGGGAGGCTTGCAGCCTGGTTTGGGGGGGA
AGCCTCAGACTGCAGGGATCCCCTTGACAGAATAGAAGCGGAGCAAGAA
GGAAAGGGTGGTTTGACTGGTTCTCGGGGTCTGTATCTGTTGGCTCGCATA
TATCTTTCTCTCAAGGGAAGAAGGCGGAGGTGCCTAGCTGGGTTCCTTTA
AAACTAGGCAGGGCTAGGATCTGAGCAGCTAGGGCTCTACTGTGAGACTG
GGCAAGCCGAGCGTTCCCTCCCATCTCTCATTGGTGTTGATAGAAGGCAG
TCTTTCTCCCAAGCTGGTGGATCTCCTGAAGCACGCTGCCTGGGCTCCAGC
ATCCTGTGCGGATTTCACGTTCTCTTTAGGGGATGCATGTTGACACTGGGG
TGTGGGCTCTGAGCCCACAGGAGTTTAAAAAACCAAAAGAGCTCAGAGTG
TCGAGAGAGACCCAATCACCGAGAATGACAAGGCAACCTGGGGTGGATG
TGGATCTTGAAACTAATAAAAAGGGGTTTTCACAGTGACAGCGACATTCT
CTTCATAGGGCACAGCTGTCAGTCTATGGCTGATCCAGAGCGAGCATCCA
TGAATTCTGCATGTGCAGGGGTCACTCTAATACCTGATATGTTGGCATCAT
CTTTGTGCTTGAGCCTTCCNCTCCCAAATGGGAATGAAATAAAGGCAAAT
TCCCNCCCCCCCCAAAAAAGGGGNAAAAAAAAAAAAAAAAAAAAAAAA
AA
SEQ. ID. N0.19: Human short form/mouse species chimeric MCH1R
ATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAATGCCAGCAACAC
CTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCTCCTCGCACGG
GGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTCGGCACCATCT
GCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCGGTCGTGAAG
AAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTTCATCATCAA
CCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTTCATGATCCA
CCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCATGTGCACCC
TCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACCTACATCCTG
ACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCCATCTCTTCC
ACGAAGTTCCGGAAGCCCTCCATGGCCACCCTGGTGATCTGCCTCCTGTG
GGCTCTCTCGTTCATTAGCATCACTCCTGTGTGGCTCTATGCCAGGCTTAT
CCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATCCGCCTACCAAACCCAG
ATACTGATCTTTACTGGTTCACTCTGTATCAGTTTTTCCTGGCCTTCGCCCT
TCCGTTTGTGGTCATCACTGCTGCGTACGTGAAAATACTACAGCGCATGAC
GTCTTCGGTGGCCCCAGCCTCTCAACGCAGCATCCGGCTTCGGACAAAGA
GGGTGACCCGCACAGCCATTGCCATCTGTCTGGTCTTCTTTGTGTGCTGGG
CGCCCTACTACGTGCTGCAGCTGACCCAGTTGTCCATCAGCCGCCCGACC
CTCACATTCGTCTACCTGTACAATGCGGCCATCAGCTTGGGCTATGCCAAC
AGCTGCCTCAATCCCTTTGTGTACATAGTACTCTGTGAGACCTTTCGAAAA
CGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAGGGGCAGCTTCGCACGGT
CAGCAATGCTCAGACAGCTGACGAGGAGAGGACAGAAAGCAAAGGCACC
TGA
SEQ. ID. NO. 20: Human long form/mouse species chimeric MCH1R
ATGTCAGTGGGAGCCATGAAGAAGGGAGTGGGGAGGGCAGTTGGGCTTG
GAGGCGGCAGCGGCTGCCAGGCTACGGAGGAAGACCCCCTTCCCAACTGC
GGGGCTTGCGCTCCGGGACAAGGTGGCAGGCGCTGGAGGCTGCCGCAGC
CTGCGTGGGTGGAGGGGAGCTCAGCTCGGTTGTGGGAGCAGGCGACCGG
CACTGGCTGGATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAACG
CCAGCAACACCTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCT
CCTCGCACGGGGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTC
GGCACCATCTGCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCG
GTCGTGAAGAAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTT
CATCATCAACCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTT
CATGATCCACCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCA
TGTGCACCCTCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACC
TACATCCTGACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCC
ATCTCTTCCACGAAGTTCCGGAAGCCCTCCATGGCCACCCTGGTGATCTGC
CTCCTGTGGGCTCTCTCGTTCATTAGCATCACTCCTGTGTGGCTCTATGCC
AGGCTTATCCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATCCGCCTACCA
AACCCAGATACTGATCTTTACTGGTTCACTCTGTATCAGTTTTTCCTGGCCT
TCGCCCTTCCGTTTGTGGTCATCACTGCTGCGTACGTGAAAATACTACAGC
GCATGACGTCTTCGGTGGCCCCAGCCTCTCAACGCAGCATCCGGCTTCGG
ACAAAGAGGGTGACCCGCACAGCCATTGCCATCTGTCTGGTCTTCTTTGTG
TGCTGGGCGCCCTACTACGTGCTGCAGCTGACCCAGTTGTCCATCAGCCGC
CCGACCCTCACATTCGTCTACCTGTACAATGCGGCCATCAGCTTGGGCTAT
GCCAACAGCTGCCTCAATCCCTTTGTGTACATAGTACTCTGTGAGACCTTT
CGAAAACGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAGGGGCAGCTTCG
CACGGTCAGCAATGCTCAGACAGCTGACGAGGAGAGGACAGAAAGCAAA
GGCACCTGA
SEQ. ID. NO. 21: Aequorea victoria Green Fluorescent Protein (GFP) cDNA
Nucleic acid sequence start and stop codons are highlighted:
TACACACGAATAAAAGATAACAAAGATGAGTAAAGGAGAAGAACTTTTC
ACTGGAGTTGTCCCAATTCTTGTTGAATTAGATGGTGATGTTAATGGGCAC
AAATTTTCTGTCAGTGGAGAGGGTGAAGGTGATGCAACATACGGAAAACT
TACCCTTAAATTTATTTGCACTACTGGAAAACTACCTGTTCCATGGCCAAC
ACTTGTCACTACTTTCTCTTATGGTGTTCAATGCTTTTCAAGATACCCAGAT
CATATGAAACAGCATGACTTTTTCAAGAGTGCCATGCCCGAAGGTTATGT
ACAGGAAAGAACTATATT'TTTCAAAGATGACGGGAACTACAAGACACGTG
CTGAAGTCAAGTTTGAAGGTGATACCCTTGTTAATAGAATCGAGTTAAAA
GGTATTGATTTTAAAGAAGATGGAAACATTCTTGGACACAAATTGGAATA
CAACTATAACTCACACAATGTATACATCATGGCAGACAAACAAAAGAATG
GAATCAAAGTTAACTTCAAAATTAGACACAACATTGAAGATGGAAGCGTT
CAACTAGCAGACCATTATCAACAAAATACTCCAATTGGCGATGGCCCTGT
CCTTTTACCAGACAACCATTACCTGTCCACACAATCTGCCCTTTCGAAAGA
TCCCAACGAAAAGAGAGACCACATGGTCCTTCTTGAGTTTGTAACAGCTG
CTGGGATTACACATGGCATGGATGAACTATACAAATAAATGTCCAGACTT
CCAATTGACACTAAAGTGTCCGAACAATTACTAAAATCTCAGGGTTCCTG
GTTAAATTCAGGCTGAGATATTATTTATATATTTATAGATTCATTAAAATT
GTATGAATAATTTATTGATGTTATTGATAGAGGTTATTTTCTTATTAAACA
GGCTACTTGGAGTGTATTCTTAATTCTATATTAATTACAATTTGATTTGACT
TGCTCAAA
SEQ. TD. NO. 22: EGFP + Linker Nucleic acid sequence start and stop codons are highlighted and a 12 amino acid linker sequence is denoted in lower case:
gtcgacggtaccgcgggcccgggatccatcgccaccATGGTGAGCAAGGGCGAGGAGCTGTT
CACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCC
ACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAA
GCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGC
CCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTAC
CCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGG
CTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGA
CCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAG
CTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGC
TGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAG
AAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACG
GCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGAC
GGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCT
GAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCG
TGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAAAGC
GGCCGC
SEQ. ID. NO. 23: Emerald ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT
CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG
GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC
CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCTTGACCT
ACGGCGTGCAGTGCTTCGCCCGCTACCCCGACCACATGAAGCAGCACGAC
TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC
TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG
GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG
GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACA
AGGTCTATATCACCGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTC
AAGACCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACT
ACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAAC
CACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCG
CGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCG
GCATGGACGAGCTGTACAAGTAA
SEQ. ID. NO. 24: Topaz ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT
CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAG
GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC
CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCTTCGGCT
ACGGCGTGCAGTGCTTCGCCCGCTACCCCGACCACATGCGCCAGCACGAC
TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTC
TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG
GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG
GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCCACA
ACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAACTTC
AAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTA
CCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACC
ACTACCTGAGCTACCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGC
GATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGG
CATGGACGAGCTGTACAAGTAA
SEQ. ID. NO. 25: W1B
ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT
CGAGCTGGACGGCGACGTAAACGGCCACAGGTTCAGCGTGTCCGGCGAG
GGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCAC
CACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGACCT
GGGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCACGAC
TTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGTACCATCTTC
TTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGG
GCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAG
GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACATCAGCCACA
ACGTCTATATCACCGCCGACAAGCAGAAGAACGGCATCAAGGCCCACTTC
AAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCACTA
CCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACC
ACTACC'TGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGC
GATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGG
CATGGACGAGCTGTACAAGTAA
SEQ. ID. NO. 26: Mouse MCH1R-linker-EGFP
Nucleic acid sequence start codon and start and stop codons for mouse MCH1R
and EGFP, respectively, as well as intron borders, are highlighted and a 12 amino acid linker sequence is denoted in lower case:
ATGGATCTGCAAGCCTCGTTGCTGTCCACTGGCCCCAATGCCAGCAACAT
CTCCGATGGCCAGGATAATTTCACATTGGCGGGTGAGTCGAGTTGGAGTC
CTCCCTCCTCCGGGATGGGTGTGGAAAATGGGAAGGTTTCACCTCCCAAG
CCAAACTGCCTGGGAAACTTTATCTTACAGTTCTTGGTGATAAGATCTGCA
GTCGGCTTTGCCTGAAGAGGAAGAGGAGAGGAGGGGACACCAGCTAGGA
CAGAAGGGGCAGGGAGGAATAGAGATGGGGCAGAGGCACATTTAGAAAC
AACAAGGGTTGGTGACAAGACGTGAGGCAGGCTTGAGGGGAAAGCTTGC
TGATGAGTCCCAAATATGCTTTGCAGGGGGGGGGGGGGGGGAATCAAGG
CTGGAGAAGCAAGCAAGCAAGACAGCAAGACAGCGGGCGGGTAGTATGT
GGGAGCCAGCAGAAGCGCTTTGATTCACCGCTATCCTGGGCTCAATCCTC
TGGCCTCGCACTGGGGAAATGGGGTCTGAGTGGTCCTTGCTGTCTTCTGGC
AAAGGCTGCTGGGAGCAAAAGACTTCACAGGGCGTGAGAGGATTAACTTT
TCTGGTGAATTAAGCTTCTTGACATTTGCAGAACGTCAATGCCTTAAAATT
CTAGCTCTGAAGGAGAAGGGAATGAAGGGGAAAGAGGGAAGGTTGGTGT
GGAGAAATTCCCAAGCTTCTGGGGTGTAACACAGCTCCAGTCCCTACCCT
ATTGGGAAAGCCCAGACTCAGGAGACATGGTCCAAGGAAATCCCTGACA
GAAAACCGGGAGAGGGCAGGGCTGTGGAGCCTGAAACACACCCCACACC
CATGGTGACAGTCACTTCTCACATATGCCTAGGAACCTATCTGAAACCTTT
GGCCATCTCTCTCTGAAAAGATGAGGCTGCAAATACACACACACACACAC
ACACACACACACACACACACACACACACACACACACACACACACACAAA
TGTCCTTCAAGCCTTTTTGACAAGGTTTTCTGGTGGATCCCGGGGATATGA
AGTTGTTCTCAGCAGATATCTGGGAGTCTTGACTCCTGGCCCTCTGAGTAA
ATGGATGAAGCGAAGAAGAATGGGGTCCTCTGAGTAACAGGTGGATCTA
GAAAATCCTATAGGAGTCACCAGGGCACGGTGGAGGAGGGTAAGGTACA
GAACTAACAATAGCCCGAGAAGGGGAAACAGCAGGAGATGATTCCAGAG
ACGTAGTGACCCCAAGCTGCAAGGGAAAGCATGAGGGGCCAGCAGGAAG
GCCGACATGGCAGGTTGTCAGCTTCTAGATCGGAAGGCGGGTCACACTTG
CTCTTTCTATCCTCAGGGCCACCTCCTCGCACAAGGAGTGTCTCCTACATC
AACATCATCATGCCTTCAGTGTTTGGTACCATCTGTCTCCTGGGCATTGTG
GGAAACTCCACAGTCATTTTTGCCGTGGTGAAGAAATCCAAGCTGCACTG
GTGCAGCAACGTCCCTGACATCTTCATCATCAACCTCTCTGTGGTGGATCT
GCTTTTCCTGCTGGGCATGCCTTTCATGATCCACCAGCTCATGGGTAATGG
TGTCTGGCACTTTGGGGAAACCATGTGCACCCTCATCACAGCCATGGACG
CCAACAGTCAGTTCACCAGCACCTACATCCTGACTGCTATGGCCATTGACC
GCTACTTGGCCACCGTCCATCCCATCTCCTCCACCAAGTTCCGGAAGCCCT
CCATGGCCACCCTGGTGATCTGCCTCCTGTGGGCTCTCTCGTTCATTAGCA
TCACTCCTGTGTGGCTCTATGCCAGGCTTATCCCCTTCCCAGGGGGTGCTG
TGGGCTGTGGCATCCGCCTACCAAACCCAGATACTGATCTTTACTGGTTCA
CTCTGTATCAGTTTTTCCTGGCCTTCGCCCTTCCGTTTGTGGTCATCACTGC
TGCGTACGTGAAAATACTACAGCGCATGACGTCTTCGGTGGCCCCAGCCT
CTCAACGCAGCATCCGGCTTCGGACAAAGAGGGTGACCCGCACAGCCATT
GCCATCTGTCTGGTCTTCTTTGTGTGCTGGGCGCCCTACTACGTGCTGCAG
CTGACCCAGTTGTCCATCAGCCGCCCGACCCTCACATTCGTCTACCTGTAC
AATGCGGCCATCAGCTTGGGCTATGCCAACAGCTGCCTCAATCCCTTTGTG
TACATAGTACTCTGTGAGACCTTTCGAAAACGCTTGGTGCTGTCGGTGAA
GCCCGCGGCCCAGGGGCAGCTTCGCACGGTCAGCAATGCTCAGACAGCTG
ACGAGGAGAGGACAGAAAGCAAAGGCACCgtcgacggtaccgcgggcccgggatccatcg ccaccATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCC
TGGTCGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGC
GAGGGCGAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTG
CACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCCTGA
CCTACGGCGTGCAGTGCTTCAGCCGCTACCCCGACCACATGAAGCAGCAC
GACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCAT
CTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGTGAAGTTCG
AGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGACTTCAAG
GAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACAGCC
ACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAA
CTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACC
ACTACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGAC
AACCACTACCTGAGCACCCAGTCCGCCCTGAGCAAAGACCCCAACGAGAA
GCGCGATCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTC
TCGGCATGGACGAGCTGTACAAGTAA
SEQ. ID. NO. 27: Mouse MCH1R/EGFP direct fusion Nucleic acid sequence start codon and start and stop codons for mouse MCH1R
and ~ EGFP, respectively, as well as intron borders, are highlighted:
ATGGATCTGCAAGCCTCGTTGCTGTCCACTGGCCCCAATGCCAGCAACAT
CTCCGATGGCCAGGATAATTTCACATTGGCGGGTGAGTCGAGTTGGAGTC
CTCCCTCCTCCGGGATGGGTGTGGAAAATGGGAAGGTTTCACCTCCCAAG
CCAAACTGCCTGGGAAACTTTATCTTACAGTTCTTGGTGATAAGATCTGCA
GTCGGCTTTGCCTGAAGAGGAAGAGGAGAGGAGGGGACACCAGCTAGGA
CAGAAGGGGCAGGGAGGAATAGAGATGGGGCAGAGGCACATTTAGAAAC
AACAAGGGTTGGTGACAAGACGTGAGGCAGGCTTGAGGGGAAAGCTTGC
TGATGAGTCCCAAATATGCTTTGCAGGGGGGGGGGGGGGGGAATCAAGG
CTGGAGAAGCAAGCAAGCAAGACAGCAAGACAGCGGGCGGGTAGTATGT
GGGAGCCAGCAGAAGCGCTTTGATTCACCGCTATCCTGGGCTCAATCCTC
TGGCCTCGCACTGGGGAAATGGGGTCTGAGTGGTCCTTGCTGTCTTCTGGC
AAAGGCTGCTGGGAGCAAAAGACTTCACAGGGCGTGAGAGGATTAACTTT
TCTGGTGAATTAAGCTTCTTGACATTTGCAGAACGTCAATGCCTTAAAATT
CTAGCTCTGAAGGAGAAGGGAATGAAGGGGAAAGAGGGAAGGTTGGTGT
GGAGAAATTCCCAAGCTTCTGGGGTGTAACACAGCTCCAGTCCCTACCCT
ATTGGGAAAGCCCAGACTCAGGAGACATGGTCCAAGGAAATCCCTGACA
GAAAACCGGGAGAGGGCAGGGCTGTGGAGCCTGAAACACACCCCACACC
CATGGTGACAGTCACTTCTCACATATGCCTAGGAACCTATCTGAAACCTTT
GGCCATCTCTCTCTGAAAAGATGAGGCTGCAAATACACACACACACACAC
ACACACACACACACACACACACACACACACACACACACACACACACAAA
TGTCCTTCAAGCCTTTTTGACAAGGTTTTCTGGTGGATCCCGGGGATATGA
AGTTGTTCTCAGCAGATATCTGGGAGTCTTGACTCCTGGCCCTCTGAGTAA
ATGGATGAAGCGAAGAAGAATGGGGTCCTCTGAGTAACAGGTGGATCTA
GAAAATCCTATAGGAGTCACCAGGGCACGGTGGAGGAGGGTAAGGTACA
GAACTAACAATAGCCCGAGAAGGGGAAACAGCAGGAGATGATTCCAGAG
ACGTAGTGACCCCAAGCTGCAAGGGAAAGCATGAGGGGCCAGCAGGAAG
GCCGACATGGCAGGTTGTCAGCTTCTAGATCGGAAGGCGGGTCACACTTG
CTCTTTCTATCCTCAGGGCCACCTCCTCGCACAAGGAGTGTCTCCTACATC
AACATCATCATGCCTTCAGTGTTTGGTACCATCTGTCTCCTGGGCATTGTG
GGAAACTCCACAGTCATTTTTGCCGTGGTGAAGAAATCCAAGCTGCACTG
GTGCAGCAACGTCCCTGACATCTTCATCATCAACCTCTCTGTGGTGGATCT
GCTTTTCCTGCTGGGCATGCCTTTCATGATCCACCAGCTCATGGGTAATGG
TGTCTGGCACTTTGGGGAAACCATGTGCACCCTCATCACAGCCATGGACG
CCAACAGTCAGTTCACCAGCACCTACATCCTGACTGCTATGGCCATTGACC
GCTACTTGGCCACCGTCCATCCCATCTCCTCCACCAAGTTCCGGAAGCCCT
CCATGGCCACCCTGGTGATCTGCCTCCTGTGGGCTCTCTCGTTCATTAGCA
TCACTCCTGTGTGGCTCTATGCCAGGCTTATCCCCTTCCCAGGGGGTGCTG
TGGGCTGTGGCATCCGCCTACCAAACCCAGATACTGATCTTTACTGGTTCA
CTCTGTATCAGTTTTTCCTGGCCTTCGCCCTTCCGTTTGTGGTCATCACTGC
TGCGTACGTGAAAATACTACAGCGCATGACGTCTTCGGTGGCCCCAGCCT
CTCAACGCAGCATCCGGCTTCGGACAAAGAGGGTGACCCGCACAGCCATT
GCCATCTGTCTGGTCTTCTTTGTGTGCTGGGCGCCCTACTACGTGCTGCAG
CTGACCCAGTTGTCCATCAGCCGCCCGACCCTCACATTCGTCTACCTGTAC
AATGCGGCCATCAGCTTGGGCTATGCCAACAGCTGCCTCAATCCCTTTGTG
TACATAGTACTCTGTGAGACCTTTCGAAAACGCTTGGTGCTGTCGGTGAA
GCCCGCGGCCCAGGGGCAGCTTCGCACGGTCAGCAATGCTCAGACAGCTG
ACGAGGAGAGGACAGAAAGCAAAGGCACCATGGTGAGCAAGGGCGAGG
AGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTA
AACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCT
ACGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTG
CCCTGGCCCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAG
CCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGC
CCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAAC
TACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACC
GCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGG
GCACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCG
ACAAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACAT
CGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCC
ATCGGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCA
GTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGC
TGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTAC
AAGTAA
SEQ. ID. NO. 28: Human short form/mouse species chimeric MCH1R-linker-EGFP
Nucleic acid sequence start codon and start and stop codons for mouse MCH1R
and EGFP, respectively, are highlighted and a 12 amino acid linker sequence is denoted in lower case:
ATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAATGCCAGCAACAC
CTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCTCCTCGCACGG
GGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTCGGCACCATCT
GCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCGGTCGTGAAG
AAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTTCATCATCAA
CCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTTCATGATCCA
CCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCATGTGCACCC
TCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACCTACATCCTG
ACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCCATCTCTTCC
ACGAAGTTCCGGAAGCCCTCCATGGCCACCCTGGTGATCTGCCTCCTGTG
GGCTCTCTCGTTCATTAGCATCACTCCTGTGTGGCTCTATGCCAGGCTTAT
CCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATCCGCCTACCAAACCCAG
ATACTGATCTTTACTGGTTCACTCTGTATCAGTTTTTCCTGGCCTTCGCCCT
TCCGTTTGTGGTCATCACTGCTGCGTACGTGAAAATACTACAGCGCATGAC
GTCTTCGGTGGCCCCAGCCTCTCAACGCAGCATCCGGCTTCGGACAAAGA
GGGTGACCCGCACAGCCATTGCCATCTGTCTGGTCTTCTTTGTGTGCTGGG
CGCCCTACTACGTGCTGCAGCTGACCCAGTTGTCCATCAGCCGCCCGACC
CTCACATTCGTCTACCTGTACAATGCGGCCATCAGCTTGGGCTATGCCAAC
AGCTGCCTCAATCCCTTTGTGTACATAGTACTCTGTGAGACCTTTCGAAAA
CGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAGGGGCAGCTTCGCACGGT
CAGCAATGCTCAGACAGCTGACGAGGAGAGGACAGAAAGCAAAGGCACC
gtcgacggtaccgcgggcccgggatccatcgccaccATGGTGAGCAAGGGCGAGGAGCTGTT
CACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCC
ACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAA
GCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGC
CCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCCGCTAC
CCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGG
CTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGA
CCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAG
CTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGCACAAGC
TGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGACAAGCAG
AAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAGGACG
GCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGCGAC
GGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTCCGCCCT
GAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCG
TGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGTAA
SEQ. ID. NO. 29: Human long form/mouse species chimeric MCH1R-linker-EGFP
Nucleic acid sequence start codon and start and stop codons for mouse MCH1R
and EGFP, respectively, are highlighted and a 12 amino acid linker sequence is denoted in lower case:
ATGTCAGTGGGAGCCATGAAGAAGGGAGTGGGGAGGGCAGTTGGGCTTG
GAGGCGGCAGCGGCTGCCAGGCTACGGAGGAAGACCCCCTTCCCAACTGC
GGGGCTTGCGCTCCGGGACAAGGTGGCAGGCGCTGGAGGCTGCCGCAGC
CTGCGTGGGTGGAGGGGAGCTCAGCTCGGTTGTGGGAGCAGGCGACCGG
CACTGGCTGGATGGACCTGGAAGCCTCGCTGCTGCCCACTGGTCCCAACG
CCAGCAACACCTCTGATGGCCCCGATAACCTCACTTCGGCAGGATCACCT
CCTCGCACGGGGAGCATCTCCTACATCAACATCATCATGCCTTCGGTGTTC
GGCACCATCTGCCTCCTGGGCATCATCGGGAACTCCACGGTCATCTTCGCG
GTCGTGAAGAAGTCCAAGCTGCACTGGTGCAACAACGTCCCCGACATCTT
CATCATCAACCTCTCGGTAGTAGATCTCCTCTTTCTCCTGGGCATGCCCTT
CATGATCCACCAGCTCATGGGCAATGGGGTGTGGCACTTTGGGGAGACCA
TGTGCACCCTCATCACGGCCATGGATGCCAATAGTCAGTTCACCAGCACC
TACATCCTGACCGCCATGGCCATTGACCGCTACCTGGCCACTGTCCACCCC
ATCTCTTCCACGAAGTTCCGGAAGCCCTCCATGGCCACCCTGGTGATCTGC
CTCCTGTGGGCTCTCTCGTTCATTAGCATCACTCCTGTGTGGCTCTATGCC
AGGCTTATCCCCTTCCCAGGGGGTGCTGTGGGCTGTGGCATCCGCCTACCA
AACCCAGATACTGATCTT'TACTGGTTCACTCTGTATCAGTTTTTCCTGGCCT
TCGCCCTTCCGTTTGTGGTCATCACTGCTGCGTACGTGAAAATACTACAGC
GCATGACGTCTTCGGTGGCCCCAGCCTCTCAACGCAGCATCCGGCTTCGG
ACAAAGAGGGTGACCCGCACAGCCATTGCCATCTGTCTGGTCTTCTTTGTG
TGCTGGGCGCCCTACTACGTGCTGCAGCTGACCCAGTTGTCCATCAGCCGC
CCGACCCTCACATTCGTCTACCTGTACAATGCGGCCATCAGCTTGGGCTAT
GCCAACAGCTGCCTCAATCCCTTTGTGTACATAGTACTCTGTGAGACCTTT
CGAAAACGCTTGGTGCTGTCGGTGAAGCCCGCGGCCCAGGGGCAGCTTCG
CACGGTCAGCAATGCTCAGACAGCTGACGAGGAGAGGACAGAAAGCAAA
GGCACCgtcgacggtaccgcgggcccgggatccatcgccaccATGGTGAGCAAGGGCGAGGA
GCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAA
ACGGCCACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTAC
GGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCC
CTGGCCCACCCTCGTGACCACCCTGACCTACGGCGTGCAGTGCTTCAGCC
GCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCC
GAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGACGGCAACTA
CAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGAACCGC
ATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGGGC
ACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGAC
AAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCG
AGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATC
GGCGACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCACCCAGTC
CGCCCTGAGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGG
AGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAG
TAA
Example 2: Generation of Chimeric and Fusion Proteins DNA vectors encoding fusion proteins between a MCH-R receptor (MCH1R) and several different superbright variants of Green Fluorescent Protein (GFP) were generated. GFP variants were fused either via a 12 amino acid linker:
TCGACGGTACCGCGGGCCCGGGATCCATCGCCACC (SEQ. ID. NO. 30), amino acid sequence: VDGTAGPGSIAT (SEQ. ID. NO. 31) (linker fusions) or directly to the C-terminus of MCH1R (direct fusions).
Mouse MCH1R-linker-GFP Variant Fusion Constructs Initially, mouse MCH1R was fused in frame via the linker to Enhanced Green Fluorescent Protein (EGFP). MCH1R was PCR-amplified (95°C
for 5 minutes; 95°C for 30 seconds, 60°C for 45 seconds, 68°C
for 3.5 minutes, for 15 cycles; 68°C for 7 minutes) from a full-length mouse MCH1R genomic DNA
lambda clone utilizing a high fidelity polymerase mix (Expand High Fidelity PCR
System from Boehringer Mannheim) and PCR primers [MCH1R (Eco RI) 5':
GCGAATTCACCATGGATCTGCAAGCCTCG (SEQ. ID. NO. 32), MCH1R (Sal I) 3': GCGTCGACGGTGCCTTTGCTTTCTGTCC (SEQ. m. NO. 33)] that generated Eco RI and Sal I enzymatic restriction sites at the N- and C-terminus, respectively.
The MCH1R N-terminal PCR primer was also designed to introduce a Kozak consensus sequence for translation which contained an Nco I site (5'-ACCATGG-3'), and the MCH1R C-terminal PCR primer was also designed to eliminate the endogenous stop codon present in the mouse MCH1R gene. The resulting PCR
product was phenol/chloroform extracted, restriction digested with Eco RI and Sal I, gel purified, and subcloned in frame into the multicloning site of Clontech's pEGFP-N3 vector between Eco RI and Sal I sites. Several resulting clones for this construct were sequenced to identify a clone with an entirely correct nucleotide sequence. This clone was named mMCHIR-1-EGFP for mouse MCH1R-linker-EGFP.
An approximately 760 by Sal I to Not I fragment of mMCHIR-1-EGFP
was excised, gel purified, and subcloned into the multicloning site of pBluescript (SK+) (Stratagene) between Sal1 and Not I sites. An approximately 710 by Nco I
to Bsr G1 fragment of EGFP was excised from the resulting pBluescript-EGFP vector and replaced with the corresponding Nco I to Bsr G1 fragment of either Emerald, Topaz, or W1B (other superbright GFP variants), which were excised from vectors pRSET-Emerald, pRSET-Topaz, and pRSET-W1B, respectively. pRSET-Emerald, pRSET-Topaz, and pRSET-W 1B were obtained from Aurora Biosciences Co. Sal I to Not I fragments containing either Emerald, Topaz, or W 1B were excised from the resulting pBluescript-Emerald, pBluescript-Topaz, and pBluescript-W1B vectors, respectively. Appropriate fragments were gel purified and subcloned into mMCHIR-1-EGFP digested with Sal I and Not I, replacing the Sal I to Not I EGFP
fragment with the corresponding Sal I to Not I fragment from either Emerald, Topaz, or W1B.
Several clones for each construct were sequenced to confirm the presence of the appropriate GFP variant. The resulting vectors were named mMCHIR-1-Emerald, mMCHIR-1-Topaz, and mMCHIR-1-W 1B for mouse MCH1R-linker-Emerald, mouse MCH1R-linker-Topaz, and mouse MCH1R-linker-W1B, respectively.
Mouse MCH1R/GFP Variant Direct Fusion Constructs A two step PCR strategy was employed to generate the direct fusion constructs. First, mouse MCH1R, EGFP, and Emerald were PCR-amplified from a full-length mouse MCH1R genomic DNA lambda clone, Clontech's pEGFP-N3 vector, and Aurora's pRSET-Emerald vector, respectively. Mouse MCH1R was PCR-amplified according to the previously mentioned conditions utilizing the same N-terminal PCR primer [MCH1R (Eco RI) 5': GCGAATTCACCATGGATCTGCA
AGCCTCG (SEQ. ID. NO. 32)], but in this case a different C-terminal PCR primer was employed. The C-terminal PCR primer [MCH1R (EGFP/Emerald) 3' CCTTGCTCACCATGGTGCCTTTGCTTTCTGTCC (SEQ. ID. NO. 34)] eliminated the endogenous stop codon of mouse MCH1R as before and introduced a region of nucleotide sequence complementary to the nucleotide sequence of the N-terminus of EGFP.
EGFP and Emerald were PCR-amplified (95°C for 5 minutes;
95°C for seconds, 60°C for 45 seconds, 68°C for 1.5 minutes, for 15 cycles; 68°C for 7 minutes) separately with a high fidelity polymerase mix (Advantage HF-2 from Clontech) from their respective templates utilizing a common N-terminal PCR
primer 25 [EGFP/Emerald (MCH1R) 5': CAGAAAGCAAAGGCACCATGGTGAGCAA
GGGCGAGGAGC (SEQ. ID. NO. 35)] that generated a region of nucleotide sequence complementary to the C-terminus of mouse MCH1R and C-terminal PCR
primers [EGFP 3' : GGCGGATCCTCTAGAGTCGCGGCC (SEQ. ID. NO. 36), or Emerald (EGFP) 3' : GCTCTAGAGTCGCGGCCGCTTACTTGTACAGCTCGTCC
30 (SEQ. ID. NO. 37)] that generated a Not I site at the C-terminus. The resulting PCR
products were electrophoresed on an agarose gel and the appropriate fragments were gel purified.
In a second PCR step, PCR reactions were set up between the previously generated mouse MCH1R and EGFP, or mouse MCH1R and Emerald PCR products. Following an initial 5 minute denaturation step at 95°C, two rounds of thermocycling (95°C for 30 seconds, 60°C for 45 seconds, 68°C for 4 minutes) were performed in the absence of PCR primers. This allowed the mouse MCH1R and GFP
variants to anneal at their complementary regions and to be filled in by the high fidelity polymerase mix (Expand High Fidelity PCR System from Boehringer Mannheim), yielding double stranded template DNA.
Subsequently, the common N-terminal mouse MCH1R [MCH1R (Eco RI) 5' : GCGAATTCACCATGGATCTGCAAGCCTCG (SEQ. ID. NO. 32)] and appropriate C-terminal PCR primers [EGFP 3': GGCGGATCCTCTAGAGTC
GCGGCC (SEQ. ll~. NO. 36) or Emerald (EGFP) 3': GCTCTAGAGTCGCGG
CCGCTTACTTGTACAGCTCGTCC (SEQ. ID. NO. 37)] were added to the reactions and thermocycling was continued for an additional fifteen cycles followed by a final extension at 68°C for 7 minutes. The resulting PCR products were phenol/chloroform extracted, restriction digested with Eco RI and Not I, electrophoresed on an agarose gel, and appropriate fragments were gel purified.
These Eco RI to Not I fragments represent direct fusions between either mouse MCH1R and EGFP, or mouse MCH1R and Emerald. Clontech's pEGFP-N3 vector was restriction digested with Eco RI and Not I liberating an approximately 780 by Eco RI to Not I EGFP fragment. This restriction digest was electrophoresed on an agarose gel and the approximately 3.9 Kb pEGFP-N3 vector backbone was gel purified. Eco RI to Not I mouse MCH1R/EGFP or mouse MCH1R/Emerald direct fusion fragments were subcloned into the pEGFP-N3 vector backbone between Eco RI and Not I sites. Several resulting clones for each of these two constructs were sequenced to identify clones with correct nucleotide sequence;
however, no clones with entirely correct nucleotide sequences were identified.
Fortunately, several clones for each of the two constructs only had nucleotide mismatches in the intron region of mouse MCH1R, and therefore, were not expected to effect the functionality of the resulting fusion proteins. These clones were named mMCHIR/EGFP and mMCHIR/Emerald for mouse MCHIRIEGFP direct fusion and mouse MCH1R/Emerald direct fusion, respectively.
Human Short and Long Form/Mouse Species Chimeric MCH1R-linker-GFP Variant Fusion Constructs The initial mouse MCH1R-linker-GFP variant fusion constructs were modified to generate both human short form and human long form/mouse species chimeric MCH1R-linker-GFP variant fusion constructs. An approximately 1.7 kb Hind III to Bsp EI fragment of the mouse MCH1R gene containing exon l, the intron, and 127 amino acids of exon 2 was excised from the various mouse MCH1R-linker-GFP variant fusion constructs and replaced by either an approximately 470 by Hind III to Bsp EI fragment from the wild-type human MCH1R short form or an approximately 670 by Hind III to Bsp EI fragment from the wild-type human long form.
Several clones for each construct were sequenced to confirm the presence of the N-terminal region of either the human MCH1R short or long forms.
These clones were named hshort/mMCHIR-1-GFP variant or hlong/mMCHIR-1-GFP
variant for human short form/mouse species chimeric MCH1R-linker-GFP variant and human long form/mouse species chimeric MCH1R-linker-GFP variant, respectively.
Example 3: Functional Evaluation of MCH1R/GFP Variant Fusion Proteins Both HEK293 Aequorin (National Institutes of Health) and CHO
mammalian cell lines were transiently transfected with the various MCH1R/GFP
variant fusion constructs, as well as the appropriate control constructs.
Transfection was performed using Lipofectamine 2000 (Gibco BRL) per the manufacturer recommended protocol. Approximately 48 hours after transfection cells were harvested, stimulated with various concentrations of human MCH, and assayed for either aequorin bioluminescence (HEK293 Aequorin cells) or cAMP production (CHO cells). Aequorin bioluminescence is a representative measure of intracellular Ca2+mobilization. cAMP production was measured with the Adenylyl Cyclase Activation FlashPlate Assay (NEN Life Science Products, Inc.).
Following transient transfection of the mMCHIR-linker-EGFP
construct (MCH-R-1-EGFP) into HEK293 Aequorin cells, the resulting fusion protein exhibited functional activity comparable to that of the wild-type human MCH1R
short form (MCH-R wt). By this functional assay, the EC50 value for mMCHIR-1-EGFP
was nearly identical to that of the wild-type human short form receptor (Figure 1).
Following transient transfections of the mMCHIR-1-EGFP and mMCHIR/EGFP fusion constructs into CHO cells, the resulting fusion proteins exhibited functional activity comparable to that of the wild-type human MCH1R
short form. By this functional assay, the EC50 values for mMCHIR-1-EGFP and mMCHIR/EGFP were comparable to that of the wild-type human receptor (Table 1).
Transient transfections with the corresponding Emerald constructs yielded similar results (data not shown).
Table 1 Rece for EC50 (nM) Wild-t a Human MCH1R Short 2.166 Form Mouse MCH1R/EGFP 0.819 Mouse MCH1R-1-EGFP 3.199 Following transient transfections of the human short form/mouse species chimeric MCH1R-1-EGFP (HuShort/mMCHIR-1-EGFP) and human long form/mouse species chimeric MCH1R-1-EGFP (HuLong/mMCHIR-1-EGFP) constructs into HEK293 cells, the resulting fusion proteins exhibited functional activity comparable to that of the wild-type human MHC1R short and long forms, respectively. By this functional assay, the EC50 value for each fusion proteins was nearly identical to that of the corresponding wild-type human receptor (Table 2).
Table 2 Rece for ECSp (nM) Wild-t a Human MCH1R Short 22.27 Form HuShort/mMCHIR-1-EGFP 19.54 Wild-t a Human MCH1R Lon 196.7 Form HuLon mMCHIR-1-EGFP Form 217.5 Following transient transfections of the human short form/mouse species chimeric MCH1R-1-EGFP (HuShort/mMCHIR-1-EGFP) and human long form/mouse species chimeric MCH1R-1-EGFP (HuLong/mMCHIR-1-EGFP) constructs into CHO cells, the resulting fusion proteins exhibited functional activity comparable to or less than that of the wild-type human MHC1R short and long forms, respectively (Table 3). By this functional assay, the EC50 value for the human short form/mouse species chimeric MCH1R-1-EGFP fusion protein was comparable to that of the corresponding wild-type human receptor, whereas, the human long form/mouse species chimeric MCH1R-1-EGFP fusion protein had an EC50 value approximately 7.5-fold higher than that of its corresponding wild-type control.
Table 3 Rece for EC50 (nM) Wild-t a Human MCH1R Short 1.029 Form Wild-t a Human MCH1R Lon 1.515 Form HuShort/mMCHIR-1-EFGP 1.565 HuLon mMCHIR-1-EGFP 11.580 Transient expression of all the MCH1R/GFP variant fusion proteins that underwent functional evaluation resulted in fluorescence primarily associated with the plasma membrane in both HEK293 and CHO cells (data not shown). This pattern of fluorescence is consistent with a predominant membrane associated localization.
Example 4: Generation of Stable Cell Lines Wild-type CHO cells were transfected using SuperFect (Qiagen) and either mouse MCH-1R-EGFP or human short/mouse species chimeric MCH-1R-EGFP. Forty-eight hours after transfection, transfected cells were subjected to positive selection for approximately ten days in media containing 6418.
Following selection, MCH-1R-EGFP expressing CHO cells were bulk sorted by Fluorescence Assisted Cell Sorting (FACS) for one or two rounds on the basis of fluorescence intensity to increase the population of cells expressing EGFP. Following bulk sorts, individual clones of varying fluorescence intensities were isolated by FAGS
and expanded.
Fluorometric Microvolume Assay Technology (FMAT) was initially employed to screen a large number of stable clones by whole cell binding with a fluorescently labeled MCH derivative (SymJz-MCH, PE Biosystems) to identify those clones with good specific binding windows. Several clones exhibiting specific binding windows greater than 3-fold were further evaluated for MCH binding with the SPA-based Binding Assay. Cells from individual clones were dissociated in enzyme free dissociation media and cell membranes were prepared and subsequently tested for their ability to bind [lzsl]Phel3Tyr19-MCH in the presence of human MCH. CHO
cell lines expressing either mouse MCH-1R-EGFP or human short/mouse species chimeric MCH-1R-EGFP (Figure 4) displayed IC50 values with MCH that were indistinguishable from the corresponding IC50 values obtained with a CHO cell line expressing the wild-type human short isoform of MCH-1R.
The functional activity of these clones was evaluated with the cAMP
Flashplate Assay (Figures 2 and 3). CHO cell lines expressing either mouse MCH-1R-EGFP (Figure 2) or human short/mouse species chimeric MCH-1R-EGFP (Figure 3) displayed EC50 values with human MCH that were indistinguishable from the EC50 value obtained with a CHO cell line expressing the wild-type human short isoform of MCH-1R.
The subcellular localization of the MCH-1R-EGFP fusion proteins were determined by confocal microscopy utilizing EGFP fluorescence as a marker for MCH-1R expression. CHO cell lines stably expressing either mouse MCH-1R-EGFP
or human short/mouse species chimeric MCH-1R-EGFP displayed EGFP
fluorescence primarily associated with the plasma membrane, demonstrating that these MCH-1R-EGFP fusion proteins are primarily associated with the plasma membrane.
Other embodiments are within the following claims. While several embodiments have been shown and described, various modifications may be made without departing from the spirit and scope of the present invention.
SEQUENCE LISTING
<110> Merck & Co., Inc.
<120> MELANIN CONCENTRATING HORMONE RECEPTOR
CHIMERIC AND FUSTON PROTEINS
<130> 20652 PCT
<150> 60/189,698 <151> 2000-03-15 <160> 37 <170> FastSEQ for Windows Version 4.0 <210> 1 <211> 422 <212> PRT
<213> Human <400> 1 Met Ser Val Gly Ala Met Lys Lys Gly Val Gly Arg Ala Val Gly Leu Gly Gly Gly Ser Gly Cys Gln Ala Thr Glu Glu Asp Pro Leu Pro Asn Cys Gly Ala Cys Ala Pro Gly Gln Gly Gly Arg Arg Trp Arg Leu Pro Gln Pro Ala Trp Val Glu Gly Ser Ser Ala Arg Leu Trp Glu Gln Ala Thr Gly Thr Gly Trp Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly Pro Asn Ala Ser Asn Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala Gly Ser Pro Pro Arg Thr Gly Ser Ile Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe Gly Thr Ile Cys Leu Leu Gly Ile Ile Gly Asn Ser Thr Val Ile Phe Ala Val Va1 Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln Phe Thr Ser Thr Tyr Ile Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu Ala Thr Val His Pro Ile Ser Ser Thr Lys Phe Arg Lys Pro Ser Val Ala Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser Ile Thr Pro Val Trp Leu Tyr Ala Arg Leu Ile Pro Phe Pro Gly Gly Ala Val Gly Cys Gly Ile Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val Ile Thr Ala Ala Tyr Val Arg Ile Leu Gln Arg Met Thr Ser Ser Val Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Thr Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala Ile Ser Leu Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg Ala Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly Thr <210> 2 <211> 353 <212> PRT
<213> Human <400> 2 Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly Pro Asn Ala Ser Asn Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala Gly Ser Pro Pro Arg Thr Gly Ser Ile Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe Gly Thr Ile Cys Leu Leu Gly Ile Ile Gly Asn Ser Thr Val Ile Phe Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln Phe Thr Ser Thr Tyr Ile Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu Ala Thr Val His Pro Ile Ser Ser Thr Lys Phe Arg Lys Pro Ser Val A1a Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser Ile Thr Pro Val Trp Leu Tyr Ala Arg Leu Ile Pro Phe Pro Gly Gly Ala Val Gly Cys Gly Ile Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val Ile Thr Ala Ala Tyr Val Arg Ile Leu Gln Arg Met Thr Ser Ser Val Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Thr Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala Ile Ser Leu Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg Ala Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly Thr <210> 3 <211> 353 <212> PRT
<213> Mouse <400> 3 Met Asp Leu Gln Ala Ser Leu Leu Ser Thr Gly Pro Asn Ala Ser Asn Ile Ser Asp Gly Gln Asp Asn Phe Thr Leu Ala Gly Pro Pro Pro Arg Thr Arg Ser Val Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe Gly Thr Ile Cys Leu Leu Gly Ile Val Gly Asn Ser Thr Val Ile Phe Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Ser Asn Val Pro Asp Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln Phe Thr Ser Thr Tyr Ile Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu Ala Thr Val His Pro Ile Ser Ser Thr Lys Phe Arg Lys Pro Ser Met Ala Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser Ile Thr Pro Val Trp Leu Tyr Ala Arg Leu Tle Pro Phe Pro Gly Gly Ala Val Gly Cys Gly Ile Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val Ile Thr Ala Ala Tyr Val Lys Ile Leu Gln Arg Met Thr Ser Ser Val Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Thr Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala Ile Ser Leu Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg Thr Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly Thr <210> 4 <211> 353 <212> PRT
<213> Artificial Sequence <220>
<223> Human short form/mouse species chimeric MCH1R
<400> 4 Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly Pro Asn Ala Ser Asn Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala Gly Ser Pro Pro Arg Thr Gly Ser Ile Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe Gly Thr Ile Cys Leu Leu Gly Ile Ile Gly Asn Ser Thr Val Ile Phe Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln Phe Thr Ser Thr Tyr Ile Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu Ala Thr Val His Pro Ile Ser Ser Thr Lys Phe Arg Lys Pro Ser Met Ala Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser IIe Thr Pro Val Trp Leu Tyr Ala Arg Leu Ile Pro Phe Pro Gly Gly Ala Val Gly Cys Gly Ile Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val Ile Thr Ala Ala Tyr Val Lys Ile Leu Gln Arg Met Thr Ser Ser Val Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Thr Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala Tle Ser Leu Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg Thr Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly Thr <210> 5 <211> 422 <212> PRT
<213> Artificial Sequence <220>
<223> Human long form/mouse species chimeric MCH1R
<400> 5 Met Ser Val Gly Ala Met Lys Lys Gly Val Gly Arg Ala Val Gly Leu Gly Gly Gly Ser Gly Cys Gln Ala Thr Glu Glu Asp Pro Leu Pro Asn Cys Gly Ala Cys Ala Pro Gly Gln Gly Gly Arg Arg Trp Arg Leu Pro Gln Pro Ala Trp Val Glu Gly Ser Ser Ala Arg Leu Trp Glu Gln Ala Thr Gly Thr Gly Trp Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly Pro Asn Ala Ser Asn Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala Gly Ser Pro Pro Arg Thr Gly Ser Ile Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe Gly Thr Ile Cys Leu Leu Gly Ile Ile Gly Asn Ser Thr Val Ile Phe Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln Phe Thr Ser Thr Tyr Ile Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu Ala Thr Val His Pro Ile Ser Ser Thr Lys Phe Arg Lys Pro Ser Met Ala Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser Ile Thr Pro Val Trp Leu Tyr Ala Arg Leu Ile Pro Phe Pro Gly Gly Ala Val Gly Cys Gly Ile Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val Ile Thr Ala Ala Tyr Val Lys Ile Leu Gln Arg Met Thr Ser Ser Val Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Thr Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala Ile Ser Leu Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg Thr Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly Thr <210> 6 <211> 238 <212> PRT
<213> Aequorea Victoria <400> 6 Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe Ser Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr His G1y Met Asp Glu Leu Tyr Lys <210> 7 <211> 239 <212> PRT
<213> Artificial Sequence <220>
<223> GFP derivative <400> 7 Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly G1u Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu 195 200 . 205 Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys <210> 8 <211> 239 <212> PRT
<213> Artificial Sequence <220>
<223> GFP derivative <400> 8 Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Lys Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys <210> 9 <211> 239 <212> PRT
<213> Artificial Sequence <220>
<223> GFP derivative <400> 9 Met.Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe Gly Tyr Gly Val Gln Cys Phe Ala Arg Tyr Pro Asp His Met Arg Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys <210> 10 <211> 239 <212> PRT
<213> Artificial Sequence <220>
<223> GFP derivative <400> 10 Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Arg Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Trp Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly l15 120 125 Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Ile Ser His Asn Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly Ile Lys Ala His Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly _g_ Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys <210> 11 <211> 604 <212> PRT
<213> Artificial Sequence <220>
<223> Mouse MCH1R-linker-EGFP
<400> 11 Met Asp Leu Gln Ala Ser Leu Leu Ser Thr Gly Pro Asn Ala Ser Asn Ile Ser Asp Gly Gln Asp Asn Phe Thr Leu Ala Gly Pro Pro Pro Arg Thr Arg Ser Val Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe Gly Thr Ile Cys Leu Leu Gly Ile Val Gly Asn Ser Thr Val Ile Phe Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Ser Asn Val Pro Asp Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln Phe Thr Ser Thr Tyr Ile Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu Ala Thr Val His Pro Ile Ser Ser Thr Lys Phe Arg Lys Pro Ser Met Ala Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser Ile Thr Pro Val Trp Leu Tyr Ala Arg Leu Ile Pro Phe Pro Gly Gly Ala Val Gly Cys Gly Ile Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val Ile Thr Ala Ala Tyr Val Lys Ile Leu Gln Arg Met Thr Ser Ser Val Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Thr Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala Ile Ser Leu Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg Thr Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly Thr Val Asp Gly Thr Ala Gly Pro Gly Ser Ile Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys <210> 12 <211> 592 <212> PRT
<213> Artificial Sequence <220>
<223> Mouse MCH1R/EGFP
<400> 12 Met Asp Leu Gln Ala Ser Leu Leu Ser Thr Gly Pro Asn Ala Ser Asn Ile Ser Asp Gly Gln Asp Asn Phe Thr Leu Ala Gly Pro Pro Pro Arg Thr Arg Ser Val Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe Gly Thr Ile Cys Leu Leu Gly Ile Val Gly Asn Ser Thr Val Ile Phe Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Ser Asn Val Pro Asp Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln Phe Thr Ser Thr Tyr Ile Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu Ala Thr Val His Pro Ile Ser Ser Thr Lys Phe Arg Lys Pro Ser Met Ala Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser Ile Thr Pro Val Trp Leu Tyr Ala Arg Leu Ile Pro Phe Pro Gly Gly Ala Val Gly Cys Gly I1e Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val Ile Thr Ala Ala Tyr Val Lys Ile Leu Gln Arg Met Thr Ser Ser Val Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Thr Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala Ile Ser Leu Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg Thr Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Va1 Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys <210> 13 <211> 604 <212> PRT
<213> Artificial Sequence <220>
<223> MCH1R-linker-EGFP
<400> 13 Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly Pro Asn Ala Ser Asn Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala Gly Ser Pro Pro Arg Thr Gly Ser Ile Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe Gly Thr Ile Cys Leu Leu Gly Ile Ile Gly Asn Ser Thr Val Ile Phe Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln Phe Thr Ser Thr Tyr Ile Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu Ala Thr Val His Pro Ile Ser Ser Thr Lys Phe Arg Lys Pro Ser Met Ala Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser Ile Thr Pro Val Trp Leu Tyr Ala Arg Leu Ile Pro Phe Pro Gly Gly Ala Val Gly Cys Gly Ile Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val Ile Thr Ala Ala Tyr Val Lys Ile Leu Gln Arg Met Thr Ser Ser Val Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Thr Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala Ile Ser Leu Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg Thr Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly Thr Val Asp Gly Thr Ala Gly Pro Gly Ser Ile Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr G1y Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Tle Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Va1 Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr-Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys <210> 14 <211> 673 <212> PRT
<213> Artificial Sequence <220>
<223> MCH1R-linker-EGFP
<400> 14 Met Ser Val Gly Ala Met Lys Lys Gly Val Gly Arg Ala Val Gly Leu Gly Gly Gly Ser Gly Cys Gln Ala Thr Glu Glu Asp Pro Leu Pro Asn Cys Gly Ala Cys Ala Pro Gly Gln Gly Gly Arg Arg Trp Arg Leu Pro Gln Pro Ala Trp Val Glu Gly Ser Ser Ala Arg Leu Trp Glu Gln Ala Thr Gly Thr Gly Trp Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly Pro Asn Ala Ser Asn Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala Gly Ser Pro Pro Arg Thr Gly Ser Ile Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe Gly Thr Ile Cys Leu Leu Gly Ile Ile Gly Asn Ser Thr Val Ile Phe Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln Phe Thr Ser Thr Tyr Ile Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu Ala Thr Val His Pro Ile Ser Ser Thr Lys Phe Arg Lys Pro Ser Met Ala Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser Ile Thr Pro Val Trp Leu Tyr Ala Arg Leu Ile Pro Phe Pro Gly Gly Ala Val Gly Cys Gly Ile Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val Ile Thr Ala Ala Tyr Val Lys Ile Leu Gln Arg Met Thr Ser Ser Val Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Thr Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala Ile Ser Leu Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg Thr Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly Thr Val Asp Gly Thr Ala Gly Pro Gly Ser Ile Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys 465 470 475 ' 480 Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp G1y Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys <210> 15 <211> 1269 <212> DNA
<213> Human <400>
atgtcagtgggagccatgaagaagggagtggggagggcagttgggcttggaggcggcagc 60 ggctgccaggctacggaggaagacccccttcccaactgcggggcttgcgctccgggacaa 120 ggtggcaggcgctggaggctgccgcagcctgcgtgggtggaggggagctcagctcggttg 180 tgggagcaggcgaccggcactggctggatggacctggaagcctcgctgctgcccactggt 240 cccaacgccagcaacacctctgatggccccgataacctcacttcggcaggatcacctcct 300 cgcacggggagcatctcctacatcaacatcatcatgccttcggtgttcggcaccatctgc 360 ctcctgggcatcatcgggaactccacggtcatcttcgcggtcgtgaagaagtccaagctg 420 cactggtgcaacaacgtccccgacatcttcatcatcaacctctcggtagtagatctcctc480 tttctcctgggcatgcccttcatgatccaccagctcatgggcaatggggtgtggcacttt540 ggggagaccatgtgcaccctcatcacggccatggatgccaatagtcagttcaccagcacc600 tacatcctgaccgccatggccattgaccgctacctggccactgtccaccccatctcttcc660 acgaagttccggaagccctctgtggccaccctggtgatctgcctcctgtgggccctctcc720 ttcatcagcatcacccctgtgtggctgtatgccagactcatccccttcccaggaggtgca780 gtgggctgcggcatacgcctgcccaacccagacactgacctctactggttcaccctgtac840 cagtttttcctggcctttgccctgccttttgtggtcatcacagccgcatacgtgaggatc900 ctgcagcgcatgacgtcctcagtggcccccgcctcccagcgcagcatccggctgcggaca960 aagagggtgacccgcacagccatcgccatctgtctggtcttctttgtgtgctgggcaccc1020 tactatgtgctacagctgacccagttgtccatcagccgcccgaccctcacctttgtctac1080 ttatacaatgcggccatcagcttgggctatgccaacagctgcctcaacccctttgtgtac1140 atcgtgctctgtgagacgttccgcaaacgcttggtcctgtcggtgaagcctgcagcccag1200 gggcagcttcgcgctgtcagcaacgctcagacggctgacgaggagaggacagaaagcaaa1260 ggcacctga <210>
<211>
<212>
DNA
<213>
Human <400>
atggacctggaagcctcgctgctgcccactggtcccaatgccagcaacacctctgatggc60 cccgataacctcacttcggcaggatcacctcctcgcacggggagcatctcctacatcaac120 atcatcatgccttcggtgttcggcaccatctgcctcctgggcatcatcgggaactccacg180 gtcatcttcgcggtcgtgaagaagtccaagctgcactggtgcaacaacgtccccgacatc240 ttcatcatcaacctctcggtagtagatctcctctttctcctgggcatgcccttcatgatc300 caccagctcatgggcaatggggtgtggcactttggggagaccatgtgcaccctcatcacg360 gccatggatgccaatagtcagttcaccagcacctacatcctgaccgccatggccattgac420 cgctacctggccactgtccaccccatctcttccacgaagttccggaagccctctgtggcc480 accctggtgatctgcctcctgtgggccctctccttcatcagcatcacccctgtgtggctg540 tatgccagactcatccccttcccaggaggtgcagtgggctgcggcatacgcctgcccaac600 ccagacactgacctctactggttcaccctgtaccagtttttcctggcctttgccctgcct660 tttgtggtcatcacagccgcatacgtgaggatcctgcagcgcatgacgtcctcagtggcc720 cccgcctcccagcgcagcatccggctgcggacaaagagggtgacccgcacagccatcgcc780 atctgtctggtcttctttgtgtgctgggcaccctactatgtgctacagctgacccagttg840 tccatcagccgcccgaccctcacctttgtctacttatacaatgcggccatcagcttgggc900 tatgccaacagctgcctcaacccctttgtgtacatcgtgctctgtgagacgttccgcaaa960 cgcttggtcctgtcggtgaagcctgcagcccaggggcagcttcgcgctgtcagcaacgct1020 cagacggctgacgaggagaggacagaaagcaaaggcacctga 1062 <210>
<211>
<212>
DNA
<213>
Mouse <220>
<221> feature misc_ <222>
(1)...(2080) <223>
n = A,T,C
or G
<400>
ggcggtagaggaagacccttttctggactgcggggctcaagctccggacaaggcggtgga 60 gggcgctggaggctgccgcagcctgcgtgggtggacgggcgctccactccagggagcagg 120 cgacctgcaccggctgcatggatctgcaagcctcgttgctgtccactggccccaatgcca 180 gcaacatctccgatggccaggataatttcacattggcggggccacctcctcgcacaagga 240 gtgtctcctacatcaacatcatcatgccttcagtgtttggtaccatctgtctcctgggca 300 ttgtgggaaactccacagtcatttttgccgtggtgaagaaatccaagctgcactggtgca 360 gcaacgtccctgacatcttcatcatcaacctctctgtggtggatctgcttttcctgctgg 420 gcatgcctttcatgatccaccagctcatgggtaatggtgtctggcactttggggaaacca 480 tgtgcaccctcatcacagccatggacgccaacagtcagttcaccagcacctacatcctga 540 ctgctatggccattgaccgctacttggccaccgtccatcccatctcctccaccaagttcc 600 ggaagccctccatggccaccctggtgatctgcctcctgtgggctctctcgttcattagca660 tcactcctgtgtggctctatgccaggcttatccccttcccagggggtgctgtgggctgtg720 gcatccgcctaccaaacccagatactgatctttactggttcactctgtatcagtttttcc780 tggccttcgcccttccgtttgtggtcatcactgctgcgtacgtgaaaatactacagcgca840 tgacgtcttcggtggccccagcctctcaacgcagcatccggcttcggacaaagagggtga900 cccgcacagccattgccatctgtctggtcttctttgtgtgctgggcgccctactacgtgc960 tgcagctgacccagttgtccatcagccgcccgaccctcacattcgtctacctgtacaatg1020 cggccatcagcttgggctatgccaacagctgcctcaatccctttgtgtacatagtactct1080 gtgagacctttcgaaaacgcttggtgctgtcggtgaagcccgcggcccaggggcagcttc1140 gcacggtcagcaatgctcagacagctgacgaggagaggacagaaagcaaaggcacctgac1200 aatcccccccggtcacctccaagtcaggtcaccgcatcaaaccatggggagagatactga1260 gataaacccggggctaccctgggaggatgcagaagctggaggctgggggcttgtagcaaa1320 ccacattccacggggcccacaaattgctagggaggcttgcagcctggtttgggggggaag1380 cctcagactgcagggatccccttgacagaatagaagcggagcaagaaggaaagggtggtt1440 tgactggttctcggggtctgtatctgttggctcgcatatatctttctctcaagggaagaa1500 ggcggaggtgcctagctgggttcctttaaaactaggcagggctaggatctgagcagctag1560 ggctctactgtgagactgggcaagccgagcgttccctcccatctctcattggtgttgata1620 gaaggcagtctttctcccaagctggtggatctcctgaagcacgctgcctgggctccagca1680 tcctgtgcggatttcacgttctctttaggggatgcatgttgacactggggtgtgggctct1740 gagcccacaggagtttaaaaaaccaaaagagctcagagtgtcgagagagacccaatcacc1800 gagaatgacaaggcaacctggggtggatgtggatcttgaaactaataaaaaggggttttc1860 acagtgacagcgacattctcttcatagggcacagctgtcagtctatggctgatccagagc1920 gagcatccatgaattctgcatgtgcaggggtcactctaatacctgatatgttggcatcat1980 ctttgtgcttgagccttccnctcccaaatgggaatgaaataaaggcaaattcccnccccc2040 cccaaaaaaggggnaaaaaaaaaaaaaaaaaaaaaaaaaa 2080 <210>
<211>
<212>
DNA
<213>
Mouse <220>
<221> feature misc _ <222>
(1).
.(3357) <223>
n = A,T,C
or G
<400>
ggcggtagaggaagacccttttctggactgcggggctcaagctccggacaaggcggtgga60 gggcgctggaggctgccgcagcctgcgtgggtggacgggcgctccactccagggagcagg120 cgacctgcaccggctgcatggatctgcaagCCtCgttgCtgtCCaCtggCCCCaatgCCa180 gcaacatctccgatggccaggataatttcacattggcgggtgagtcgagttggagtcctc240 cctcctccgggatgggtgtggaaaatgggaaggtttcacctcccaagccaaactgcctgg300 gaaactttatcttacagttcttggtgataagatctgcagtcggctttgcctgaagaggaa360 gaggagaggaggggacaccagctaggacagaaggggcagggaggaatagagatggggcag420 aggcacatttagaaacaacaagggttggtgacaagacgtgaggcaggcttgaggggaaag480 cttgctgatgagtcccaaatatgctttgcaggggggggggggggggaatcaaggctggag540 aagcaagcaagcaagacagcaagacagcgggcgggtagtatgtgggagccagcagaagcg600 ctttgattcaccgctatcctgggctcaatcctctggcctcgcactggggaaatggggtct660 gagtggtccttgctgtcttctggcaaaggctgctgggagcaaaagacttcacagggcgtg720 agaggattaacttttctggtgaattaagcttcttgacatttgcagaacgtcaatgcctta780 aaattctagctctgaaggagaagggaatgaaggggaaagagggaaggttggtgtggagaa840 attcccaagcttctggggtgtaacacagctccagtccctaccctattgggaaagcccaga900 ctcaggagacatggtccaaggaaatccctgacagaaaaccgggagagggcagggctgtgg960 agcctgaaacacaccccacacccatggtgacagtcacttctcacatatgcctaggaacct1020 atctgaaacctttggccatctctctctgaaaagatgaggctgcaaatacacacacacaca1080 cacacacacacacacacacacacacacacacacacacacacacacacacaaatgtccttc1140 aagcctttttgacaaggttttctggtggatcccggggatatgaagttgttctcagcagat1200 atctgggagtcttgactcctggccctctgagtaaatggatgaagcgaagaagaatggggt1260 cctctgagtaacaggtggatctagaaaatcctataggagtcaccagggcacggtggagga1320 gggtaaggtacagaactaacaatagcccgagaaggggaaacagcaggagatgattccaga1380 gacgtagtgaccccaagctgcaagggaaagcatgaggggccagcaggaaggccgacatgg1440 caggttgtcagcttctagatcggaaggcgggtcacacttgctctttctatcctcagggcc1500 acctcctcgcacaaggagtgtctcctacatcaacatcatcatgccttcagtgtttggtac1560 catctgtctcctgggcattgtgggaaactccacagtcatttttgccgtggtgaagaaatc1620 caagctgcactggtgcagcaacgtccctgacatcttcatcatcaacctctctgtggtgga1680 tctgcttttcctgctgggcatg'cctttcatgatccaccagctcatgggtaatggtgtctg1740 gcactttggggaaaccatgtgcaccctcatcacagccatggacgccaacagtcagttcac1800 cagcacctacatcctgactgctatggccattgaccgctacttggccaccgtccatcccat1860 ctcctccaccaagttccggaagccctccatggccaccctggtgatctgcctcctgtgggc1920 tctctcgttcattagcatcactcctgtgtggctctatgccaggcttatccccttcccagg1980 gggtgctgtgggctgtggcatccgcctaccaaacccagatactgatctttactggttcac2040 tctgtatcagtttttcctggccttcgcccttccgtttgtggtcatcactgctgcgtacgt2100 gaaaatactacagcgcatgacgtcttcggtggccccagcctctcaacgcagcatccggct2160 tcggacaaagagggtgacccgcacagccattgccatctgtctggtcttctttgtgtgctg2220 ggcgccctactacgtgctgcagctgacccagttgtccatcagccgcccgaccctcacatt2280 cgtctacctgtacaatgcggccatcagcttgggctatgccaacagctgcctcaatccctt2340 tgtgtacatagtactctgtgagacctttcgaaaacgcttggtgctgtcggtgaagcccgc2400 ggcccaggggcagcttcgcacggtcagcaatgctcagacagctgacgaggagaggacaga2460 aagcaaaggcacctgacaatcccccccggtcacctccaagtcaggtcaccgcatcaaacc2520 atggggagagatactgagataaacccggggctaccctgggaggatgcagaagctggaggc2580 tgggggcttgtagcaaaccacattccacggggcccacaaattgctagggaggcttgcagc2640 ctggtttgggggggaagcctcagactgcagggatccccttgacagaatagaagcggagca2700 agaaggaaagggtggtttgactggttctcggggtctgtatctgttggctcgcatatatct2760 ttctctcaagggaagaaggcggaggtgcctagctgggttcctttaaaactaggcagggct2820 aggatctgagcagctagggctctactgtgagactgggcaagccgagcgttCCCtCCCatC2880 tctcattggtgttgatagaaggcagtctttctcccaagctggtggatctcctgaagcacg2940 ctgcctgggctccagcatcctgtgcggatttcacgttctctttaggggatgcatgttgac3000 actggggtgtgggctctgagcccacaggagtttaaaaaaccaaaagagctcagagtgtcg3060 agagagacccaatcaccgagaatgacaaggcaacctggggtggatgtggatcttgaaact3120 aataaaaaggggttttcacagtgacagcgacattctcttcatagggcacagctgtcagtc3180 tatggctgatccagagcgagcatccatgaattctgcatgtgcaggggtcactctaatacc3240 tgatatgttggcatcatctttgtgcttgagccttccnctcccaaatgggaatgaaataaa3300 ggcaaattcccnccccccccaaaaaaggggnaaaaaaaaaaaaaaaaaaaaaaaaaa 3357 <210>
<211>
<212>
DNA
<213> icial Artif Sequence <220>
<223> short Human form/mouse species chimeric <400>
atggacctggaagcctcgctgctgcccactggtcccaatgccagcaacacctctgatggc60 cccgataacctcacttcggcaggatcacctcctcgcacggggagcatctcctacatcaac120 atcatcatgccttcggtgttcggcaccatctgcctcctgggcatcatcgggaactccacg180 gtcatcttcgcggtcgtgaagaagtccaagctgcactggtgcaacaacgtccccgacatc240 ttcatcatcaacctctcggtagtagatctcctctttctcctgggcatgcccttcatgatc300 caccagctcatgggcaatggggtgtggcactttggggagaccatgtgcaccctcatcacg360 gccatggatgccaatagtcagttcaccagcacctacatcctgaccgccatggccattgac420 cgctacctggccactgtccaccccatctcttccacgaagttccggaagccctccatggcc480 accctggtgatctgcctcctgtgggctctctcgttcattagcatcactcctgtgtggctc540 tatgccaggcttatccccttcccagggggtgctgtgggctgtggcatccgcctaccaaac600 ccagatactgatctttactggttcactctgtatcagtttttcctggccttcgcccttccg660 tttgtggtcatcactgctgcgtacgtgaaaatactacagcgcatgacgtcttcggtggcc720 ccagcctctcaacgcagcatccggcttcggacaaagagggtgacccgcacagccattgcc780 atctgtctggtcttctttgtgtgctgggcgccctactacgtgctgcagctgacccagttg840 tccatcagccgcccgaccctcacattcgtctacctgtacaatgcggccatcagcttgggc900 tatgccaacagctgcctcaatccctttgtgtacatagtactctgtgagacctttcgaaaa960 cgcttggtgctgtcggtgaagcccgcggcccaggggcagcttcgcacggtcagcaatgct1020 cagacagctgacgaggagaggacagaaagcaaaggcacctga 1062 <210>
<211>
<212>
DNA
<213>
Artificial Sequence <220>
<223> MCH1R
Human long form/mouse species chimeric <400>
atgtcagtgggagccatgaagaagggagtggggagggcagttgggcttggaggcggcagc60 ggctgccaggctacggaggaagacccccttcccaactgcggggcttgcgctccgggacaa120 ggtggcaggcgctggaggctgccgcagcctgcgtgggtggaggggagctcagctcggttg180 tgggagcaggcgaccggcactggctggatggacctggaagcctcgctgctgcccactggt240 cccaacgccagcaacacctctgatggccccgataacctcacttcggcaggatcacctcct300 cgcacggggagcatctcctacatcaacatcatcatgccttcggtgttcggcaccatctgc360 ctcctgggcatcatcgggaactccacggtcatcttcgcggtcgtgaagaagtccaagctg420 cactggtgcaacaacgtccccgacatcttcatcatcaacctctcggtagtagatctcctc480 tttctcctgggcatgcccttcatgatccaccagctcatgggcaatggggtgtggcacttt540 ggggagaccatgtgcaccctcatcacggccatggatgccaatagtcagttcaccagcacc600 tacatcctgaccgccatggccattgaccgctacctggccactgtccaccccatctcttcc660 acgaagttccggaagccctccatggccaccctggtgatctgcctcctgtgggctctctcg720 ttcattagcatcactcctgtgtggctctatgccaggcttatccccttcccagggggtgct780 gtgggctgtggcatccgcctaccaaacccagatactgatctttactggttcactctgtat840 cagtttttcctggccttcgcccttccgtttgtggtcatcactgctgcgtacgtgaaaata900 ctacagcgcatgacgtcttcggtggccccagcctctcaacgcagcatccggcttcggaca960 aagagggtgacccgcacagccattgccatctgtctggtcttctttgtgtgctgggcgccc1020 tactacgtgctgcagctgacccagttgtccatcagccgcccgaccctcacattcgtctac1080 ctgtacaatgcggccatcagcttgggctatgccaacagctgcctcaatccctttgtgtac1140 atagtactctgtgagacctttcgaaaacgcttggtgctgtcggtgaagcccgcggcccag1200 gggcagcttcgcacggtcagcaatgctcagacagctgacgaggagaggacagaaagcaaa1260 ggcacctga 1269 <210>
<211>
<212>
DNA
<213>
Aequorea Victoria <400>
tacacacgaataaaagataacaaagatgagtaaaggagaagaacttttcactggagttgt60 cccaattcttgttgaattagatggtgatgttaatgggcacaaattttctgtcagtggaga120 gggtgaaggtgatgcaacatacggaaaacttacccttaaatttatttgcactactggaaa180 actacctgttccatggccaacacttgtcactactttctcttatggtgttcaatgcttttc240 aagatacccagatcatatgaaacagcatgactttttcaagagtgccatgcccgaaggtta300 tgtacaggaaagaactatatttttcaaagatgacgggaactacaagacacgtgctgaagt360 caagtttgaaggtgatacccttgttaatagaatcgagttaaaaggtattgattttaaaga420 agatggaaacattcttggacacaaattggaatacaactataactcacacaatgtatacat480 catggcagacaaacaaaagaatggaatcaaagttaacttcaaaattagacacaacattga540 agatggaagcgttcaactagcagaccattatcaacaaaatactccaattggcgatggccc600 tgtccttttaccagacaaccattacctgtccacacaatctgccctttcgaaagatcccaa660 cgaaaagagagaccacatggtccttcttgagtttgtaacagctgctgggattacacatgg720 catggatgaactatacaaataaatgtccagacttccaattgacactaaagtgtccgaaca780 attactaaaatctcagggttcctggttaaattcaggctgagatattatttatatatttat840 agattcattaaaattgtatgaataatttattgatgttattgatagaggttattttcttat900 taaacaggctacttggagtgtattcttaattctatattaattacaatttgatttgacttg960 ctcaaa 966 <210> 22 <211> 765 <212> DNA
<213> Artificial Sequence <220>
<223> GFP derivative <400> 22 gtcgacggtaccgcgggcccgggatccatcgccaccatggtgagcaagggcgaggagctg 60 ttcaccggggtggtgcccatcctggtcgagctggacggcgacgtaaacggccacaagttc 120 agcgtgtccggcgagggcgagggcgatgccacctacggcaagctgaccctgaagttcatc 180 tgcaccaccggcaagctgcccgtgccctggcccaccctcgtgaccaccctgacctacggc 240 gtgcagtgcttcagccgctaccccgaccacatgaagcagcacgacttcttcaagtccgcc 300 atgcccgaaggctacgtccaggagcgcaccatcttcttcaaggacgacggcaactacaag 360 acccgcgccgaggtgaagttcgagggcgacaccctggtgaaccgcatcgagctgaagggc 420 atcgacttcaaggaggacggcaacatcctggggcacaagctggagtacaactacaacagc 480 cacaacgtctatatcatggccgacaagcagaagaacggcatcaaggtgaacttcaagatc 540 cgccacaacatcgaggacggcagcgtgcagctcgccgaccactaccagcagaacaccccc 600 atcggcgacggccccgtgctgctgcccgacaaccactacctgagcacccagtccgccctg 660 agcaaagaccccaacgagaagcgcgatcacatggtcctgctggagttcgtgaccgccgcc 720 gggatcactctcggcatggacgagctgtacaagtaaagcggccgc 765 <210>
<211>
<212>
DNA
<213> ficial Arti Sequence <220>
<223> derivative GFP
<400>
atggtgagcaagggcgaggagctgttcaccggggtggtgcccatcctggtcgagctggac 60 ggcgacgtaaacggccacaagttcagcgtgtccggcgagggcgagggcgatgccacctac 120 ggcaagctgaccctgaagttcatctgcaccaccggcaagctgcccgtgccctggcccacc 180 ctcgtgaccaccttgacctacggcgtgcagtgcttcgcccgctaccccgaccacatgaag 240 cagcacgacttcttcaagtccgccatgcccgaaggctacgtccaggagcgcaccatcttc 300 ttcaaggacgacggcaactacaagacccgcgccgaggtgaagttcgagggcgacaccctg 360 gtgaaccgcatcgagctgaagggcatcgacttcaaggaggacggcaacatcctggggcac 420 aagctggagtacaactacaacagccacaaggtctatatcaccgccgacaagcagaagaac 480 ggcatcaaggtgaacttcaagacccgccacaacatcgaggacggcagcgtgcagctcgcc 540 gaccactaccagcagaacacccccatcggcgacggccccgtgctgctgcccgacaaccac 600 tacctgagcacccagtccgccctgagcaaagaccccaacgagaagcgcgatcacatggtc 660 ctgctggagttcgtgaccgccgccgggatcactctcggcatggacgagctgtacaagtaa 720 <210>
<211>
<212>
DNA
<213> ficial Arti Sequence <220>
<223> derivative GFP
<400>
atggtgagcaagggcgaggagctgttcaccggggtggtgcccatcctggtcgagctggac60 ggcgacgtaaacggccacaagttcagcgtgtccggcgagggcgagggcgatgccacctac120 ggcaagctgaccctgaagttcatctgcaccaccggcaagctgcccgtgccctggcccacc180 ctcgtgaccaccttcggctacggcgtgcagtgcttcgcccgctaccccgaccacatgcgc240 cagcacgacttcttcaagtccgccatgcccgaaggctacgtccaggagcgcaccatcttc300 ttcaaggacgacggcaactacaagacccgcgccgaggtgaagttcgagggcgacaccctg360 gtgaaccgcatcgagctgaagggcatcgacttcaaggaggacggcaacatcctggggcac420 aagctggagtacaactacaacagccacaacgtctatatcatggccgacaagcagaagaac480 ggcatcaaggtgaacttcaagatccgccacaacatcgaggacggcagcgtgcagctcgcc540 gaccactaccagcagaacacccccatcggcgacggccccgtgctgctgcccgacaaccac600 tacctgagctaccagtccgccctgagcaaagaccccaacgagaagcgcgatcacatggtc660 ctgctggagttcgtgaccgccgccgggatcactctcggcatggacgagctgtacaagtaa720 <210>
<211>
<212>
DNA
<213> Artificial Sequence <220>
<223> GFP derivative <400> 25 atggtgagcaagggcgaggagctgttcaccggggtggtgcccatcctggtcgagctggac60 ggcgacgtaaacggccacaggttcagcgtgtccggcgagggcgagggcgatgccacctac120 ggcaagctgaccctgaagttcatctgcaccaccggcaagctgcccgtgccctggcccacc180 ctcgtgaccaccctgacctggggcgtgcagtgcttcagccgctaccccgaccacatgaag240 cagcacgacttcttcaagtccgccatgcccgaaggctacgtccaggagcgtaccatcttc300 ttcaaggacgacggcaactacaagacccgcgccgaggtgaagttcgagggcgacaccctg360 gtgaaccgcatcgagctgaagggcatcgacttcaaggaggacggcaacatcctggggcac420 aagctggagtacaactacatcagccacaacgtctatatcaccgccgacaagcagaagaac480 ggcatcaaggcccacttcaagatccgccacaacatcgaggacggcagcgtgcagctcgcc540 gaccactaccagcagaacacccccatcggcgacggccccgtgctgctgcccgacaaccac600 tacctgagcacccagtccgccctgagcaaagaccccaacgagaagcgcgatcacatggtc660 ctgctggagttcgtgaccgccgccgggatcactctcggcatggacgagctgtacaagtaa720 <210>
<211>
<212>
DNA
<213>
Artificial Sequence <220>
<223> MCH1R-linker-EGFP
Mouse <400> 26 atggatctgcaagcctcgttgctgtccactggccccaatgccagcaacatctccgatggc 60 caggataatttcacattggcgggtgagtcgagttggagtcctccctcctccgggatgggt 120 gtggaaaatgggaaggtttcacctcccaagccaaactgcctgggaaactttatcttacag 180 ttcttggtgataagatctgcagtcggctttgcctgaagaggaagaggagaggaggggaca 240 ccagctaggacagaaggggcagggaggaatagagatggggcagaggcacatttagaaaca 300 acaagggttggtgacaagacgtgaggcaggcttgaggggaaagcttgctgatgagtccca 360 aatatgctttgcaggggggggggggggggaatcaaggctggagaagcaagcaagcaagac 420 agcaagacagcgggcgggtagtatgtgggagccagcagaagcgctttgattcaccgctat 480 cctgggctcaatcctctggcctcgcactggggaaatggggtctgagtggtccttgctgtc 540 ttctggcaaaggctgctgggagcaaaagacttcacagggcgtgagaggattaacttttct 600 ggtgaattaagcttcttgacatttgcagaacgtcaatgccttaaaattctagctctgaag 660 gagaagggaatgaaggggaaagagggaaggttggtgtggagaaattcccaagcttctggg 720 gtgtaacacagctccagtccctaccctattgggaaagcccagactcaggagacatggtcc 780 aaggaaatccctgacagaaaaccgggagagggcagggctgtggagcctgaaacacacccc 840 acacccatggtgacagtcacttctcacatatgcctaggaacctatctgaaacctttggcc 900 atctctctctgaaaagatgaggctgcaaatacacacacacacacacacacacacacacac 960 acacacacacacacacacacacacacacacacaaatgtccttcaagcctttttgacaagg 1020 ttttctggtggatcccggggatatgaagttgttctcagcagatatctgggagtcttgact 1080 cctggccctctgagtaaatggatgaagcgaagaagaatggggtcctctgagtaacaggtg 1140 gatctagaaaatcctataggagtcaccagggcacggtggaggagggtaaggtacagaact 1200 aacaatagcccgagaaggggaaacagcaggagatgattccagagacgtagtgaccccaag 1260 ctgcaagggaaagcatgaggggccagcaggaaggccgacatggcaggttgtcagcttcta 1320 gatcggaaggcgggtcacacttgctctttctatcctcagggccacctcctcgcacaagga 1380 gtgtctcctacatcaacatcatcatgccttcagtgtttggtaccatctgtctcctgggca 1440 ttgtgggaaactccacagtcatttttgccgtggtgaagaaatccaagctgcactggtgca 1500 gcaacgtccctgacatcttcatcatcaacctctctgtggtggatctgcttttcctgctgg 1560 gcatgccttcatgatccact gtaatggtgtctggcactttggggaaacca 1620 cagctcatgg tgtgcaccctcatcacagccatggacgccaacagtcagttcaccagcacctacatcctga 1680 ctgctatggccattgaccgctacttggccaccgtccatcccatctcctccaccaagttcc 1740 ggaagccctccatggccaccctggtgatctgcctcctgtgggctctctcgttcattagca 1800 tcactcctgtgtggctctatgccaggcttatccccttcccagggggtgctgtgggctgtg 1860 gcatccgcctaccaaacccagatactgatctttactggttcactctgtatcagtttttcc 1920 tggccttcgcccttccgtttgtggtcatcactgctgcgtacgtgaaaatactacagcgca 1980 tgacgtcttcggtggccccagcctctcaacgcagcatccggcttcggacaaagagggtga 2040 cccgcacagccattgccatctgtctggtcttctttgtgtgctgggcgccctactacgtgc2100 tgcagctgacccagttgtccatcagccgcccgaccctcacattcgtctacctgtacaatg2160 cggccatcagcttgggctatgccaacagctgcctcaatccctttgtgtacatagtactct2220 gtgagacctttcgaaaacgcttggtgctgtcggtgaagcccgcggcccaggggcagcttc2280 gcacggtcagcaatgctcagacagctgacgaggagaggacagaaagcaaaggcaccgtcg2340 acggtaccgcgggcccgggatccatcgccaccatggtgagcaagggcgaggagctgttca2400 ccggggtggtgcccatcctggtcgagctggacggcgacgtaaacggccacaagttcagcg2460 tgtccggcgagggcgagggcgatgccacctacggcaagctgaccctgaagttcatctgca2520 ccaccggcaagctgcccgtgccctggcccaccctcgtgaccaccctgacctacggcgtgc2580 agtgcttcagccgctaccccgaccacatgaagcagcacgacttcttcaagtccgccatgc2640 ccgaaggctacgtccaggagcgcaccatcttcttcaaggacgacggcaactacaagaccc2700 gcgccgaggtgaagttcgagggcgacaccctggtgaaccgcatcgagctgaagggcatcg2760 acttcaaggaggacggcaacatcctggggcacaagctggagtacaactacaacagccaca2820 acgtctatatcatggccgacaagcagaagaacggcatcaaggtgaacttcaagatccgcc2880 acaacatcgaggacggcagcgtgcagctcgccgaccactaccagcagaacacccccatcg2940 gcgacggccccgtgctgctgcccgacaaccactacctgagcacccagtccgccctgagca3000 aagaccccaacgagaagcgcgatcacatggtcctgctggagttcgtgaccgccgccggga3060 tcactctcggcatggacgagctgtacaagtas 3092 <210> 27 <211> 3056 <212> DNA
<213> Artificial Sequence <220>
<223> Mouse MCH1R/EGFP direct fusion <400>
atggatctgcaagcctcgttgctgtccactggccccaatgccagcaacatctccgatggc60 caggataatttcacattggcgggtgagtcgagttggagtcctccctcctccgggatgggt120 gtggaaaatgggaaggtttcacctcccaagccaaactgcctgggaaactttatcttacag180 ttcttggtgataagatctgcagtcggctttgcctgaagaggaagaggagaggaggggaca240 ccagctaggacagaaggggcagggaggaatagagatggggcagaggcacatttagaaaca300 acaagggttggtgacaagacgtgaggcaggcttgaggggaaagcttgctgatgagtccca360 aatatgctttgcaggggggggggggggggaatcaaggctggagaagcaagcaagcaagac420 agcaagacagcgggcgggtagtatgtgggagccagcagaagcgctttgattcaccgctat480 cctgggctcaatcctctggcctcgcactggggaaatggggtctgagtggtccttgctgtc540 ttctggcaaaggctgctgggagcaaaagacttcacagggcgtgagaggattaacttttct600 ggtgaattaagcttcttgacatttgcagaacgtcaatgccttaaaattctagctctgaag660 gagaagggaatgaaggggaaagagggaaggttggtgtggagaaattcccaagcttctggg720 gtgtaacacagctccagtccctaccctattgggaaagcccagactcaggagacatggtcc780 aaggaaatccctgacagaaaaccgggagagggcagggctgtggagcctgaaacacacccc840 acacccatggtgacagtcacttctcacatatgcctaggaacctatctgaaacctttggcc900 atctctctctgaaaagatgaggctgcaaatacacacacacacacacacacacacacacac960 acacacacacacacacacacacacacacacacaaatgtccttcaagcctttttgacaagg1020 ttttctggtggatcccggggatatgaagttgttctcagcagatatctgggagtcttgact1080 cctggccctctgagtaaatggatgaagcgaagaagaatggggtcctctgagtaacaggtg1140 gatctagaaaatcctataggagtcaccagggcacggtggaggagggtaaggtacagaact1200 aacaatagcccgagaaggggaaacagcaggagatgattccagagacgtagtgaccccaag1260 ctgcaagggaaagcatgaggggccagcaggaaggccgacatggcaggttgtcagcttcta1320 gatcggaaggcgggtcacacttgctctttctatcctcagggccacctcctcgcacaagga1380 gtgtctcctacatcaacatcatcatgccttcagtgtttggtaccatctgtctcctgggca1440 ttgtgggaaactccacagtcatttttgccgtggtgaagaaatccaagctgcactggtgca1500 gcaacgtccctgacatcttcatcatcaacctctctgtggtggatctgcttttcctgctgg1560 gcatgcctttcatgatccaccagctcatgggtaatggtgtctggcactttggggaaacca1620 tgtgcaccctcatcacagccatggacgccaacagtcagttcaccagcacctacatcctga1680 ctgctatggccattgaccgctacttggccaccgtccatcccatctcctccaccaagttcc1740 ggaagccctccatggccaccctggtgatctgcctcctgtgggctctctcgttcattagca1800 tcactcctgtgtggctctatgccaggcttatccccttcccagggggtgctgtgggctgtg1860 gcatccgcctaccaaacccagatactgatctttactggttcactctgtatcagtttttcc1920 tggccttcgcccttccgtttgtggtcatcactgctgcgtacgtgaaaatactacagcgca1980 tgacgtcttcggtggccccagcctctcaacgcagcatccggcttcggacaaagagggtga2040 cccgcacagccattgccatctgtctggtcttctttgtgtgctgggcgccctactacgtgc2100 tgcagctgacccagttgtccatcagccgcccgaccctcacattcgtctacctgtacaatg2160 cggccatcagcttgggctatgccaacagctgcctcaatccctttgtgtacatagtactct2220 gtgagacctttcgaaaacgcttggtgctgtcggtgaagcccgcggcccaggggcagcttc2280 gcacggtcagcaatgctcagacagctgacgaggagaggacagaaagcaaaggcaccatgg2340 tgagcaagggcgaggagctgttcaccggggtggtgcccatcctggtcgagctggacggcg2400 acgtaaacggccacaagttcagcgtgtccggcgagggcgagggcgatgccacctacggca2460 agctgaccctgaagttcatctgcaccaccggcaagctgcccgtgccctggcccaccctcg2520 tgaccaccctgacctacggcgtgcagtgcttcagccgctaccccgaccacatgaagcagc2580 acgacttcttcaagtccgccatgcccgaaggctacgtccaggagcgcaccatcttcttca2640 aggacgacggcaactacaagacccgcgccgaggtgaagttcgagggcgacaccctggtga2700 accgcatcgagctgaagggcatcgacttcaaggaggacggcaacatcctggggcacaagc2760 tggagtacaactacaacagccacaacgtctatatcatggccgacaagcagaagaacggca2820 tcaaggtgaacttcaagatccgccacaacatcgaggacggcagcgtgcagctcgccgacc2880 actaccagcagaacacccccatcggcgacggccccgtgctgctgcccgacaaccactacc2940 tgagcacccagtccgccctgagcaaagaccccaacgagaagcgcgatcacatggtcctgc3000 tggagttcgtgaccgccgccgggatcactctcggcatggacgagctgtacaagtaa 3056 <210> 28 <211> 1815 <212> DNA
<213> Artificial Sequence <220>
<223> Human short form/mouse species chimeric MCH1R-linker-EGFP
<400> 28 atggacctggaagcctcgctgctgcccactggtcccaatgccagcaacacctctgatggc60 cccgataacctcacttcggcaggatcacctcctcgcacggggagcatctcctacatcaac120 atcatcatgccttcggtgttcggcaccatctgcctcctgggcatcatcgggaactccacg180 gtcatcttcgcggtcgtgaagaagtccaagctgcactggtgcaacaacgtccccgacatc240 ttcatcatcaacctctcggtagtagatctcctctttctcctgggcatgcccttcatgatc300 caccagctcatgggcaatggggtgtggcactttggggagaccatgtgcaccctcatcacg360 gccatggatgccaatagtcagttcaccagcacctacatcctgaccgccatggccattgac420 cgctacctggccactgtccaccccatctcttccacgaagttccggaagccctccatggcc480 accctggtgatctgcctcctgtgggctctctcgttcattagcatcactcctgtgtggctc540 tatgccaggcttatccccttcccagggggtgctgtgggctgtggcatccgcctaccaaac600 ccagatactgatctttactggttcactctgtatcagtttttcctggccttcgcccttccg660 tttgtggtcatcactgctgcgtacgtgaaaatactacagcgcatgacgtcttcggtggcc720 ccagcctctcaacgcagcatccggcttcggacaaagagggtgacccgcacagccattgcc780 atctgtctggtcttctttgtgtgctgggcgccctactacgtgctgcagctgacccagttg840 tccatcagccgcccgaccctcacattcgtctacctgtacaatgcggccatcagcttgggc900 tatgccaacagctgcctcaatccctttgtgtacatagtactctgtgagacctttcgaaaa960 cgcttggtgctgtcggtgaagcccgcggcccaggggcagcttcgcacggtcagcaatgct1020 cagacagctgacgaggagaggacagaaagcaaaggcaccgtcgacggtaccgcgggcccg1080 ggatccatcgccaccatggtgagcaagggcgaggagctgttcaccggggtggtgcccatc1140 ctggtcgagctggacggcgacgtaaacggccacaagttcagcgtgtccggcgagggcgag1200 ggcgatgccacctacggcaagctgaccctgaagttcatctgcaccaccggcaagctgccc1260 gtgccctggcccaccctcgtgaccaccctgacctacggcgtgcagtgcttcagccgctac1320 cccgaccacatgaagcagcacgacttcttcaagtccgccatgcccgaaggctacgtccag1380 gagcgcaccatcttcttcaaggacgacggcaactacaagacccgcgccgaggtgaagttc1440 gagggcgacaccctggtgaaccgcatcgagctgaagggcatcgacttcaaggaggacggc1500 aacatcctggggcacaagctggagtacaactacaacagccacaacgtctatatcatggcc1560 gacaagcagaagaacggcatcaaggtgaacttcaagatccgccacaacatcgaggacggc1620 agcgtgcagctcgccgaccactaccagcagaacacccccatcggcgacggccccgtgctg1680 ctgcccgacaaccactacctgagcacccagtccgccctgagcaaagaccccaacgagaag1740 cgcgatcacatggtcctgctggagttcgtgaccgccgccgggatcactctcggcatggac1800 gagctgtacaagtaa 1815 <210> 29 <211> 2022 <212> DNA
<213> Artificial Sequence <220>
<223> Human long form/mouse species chimeric MCH1R-linker-EGFP
<400>
atgtcagtgggagccatgaagaagggagtggggagggcagttgggcttggaggcggcagc60 ggctgccaggctacggaggaagacccccttcccaactgcggggcttgcgctccgggacaa120 ggtggcaggcgctggaggctgccgcagcctgcgtgggtggaggggagctcagctcggttg180 tgggagcaggcgaccggcactggctggatggacctggaagcctcgctgctgcccactggt240 cccaacgccagcaacacctctgatggccccgataacctcacttcggcaggatcacctcct300 cgcacggggagcatctcctacatcaacatcatcatgccttcggtgttcggcaccatctgc360 ctcctgggcatcatcgggaactccacggtcatcttcgcggtcgtgaagaagtccaagctg420 cactggtgcaacaacgtccccgacatcttcatcatcaacctctcggtagtagatctcctc480 tttctcctgggcatgcccttcatgatccaccagctcatgggcaatggggtgtggcacttt540 ggggagaccatgtgcaccctcatcacggccatggatgccaatagtcagttcaccagcacc600 tacatcctgaccgccatggccattgaccgctaCCtggCCaCtgtCCaCCCCatCtCttCC660 acgaagttccggaagccctccatggccaccctggtgatctgcctcctgtgggctctctcg720 ttcattagcatcactcctgtgtggctctatgccaggcttatccccttcccagggggtgct780 gtgggctgtggcatccgcctaccaaacccagatactgatctttactggttcactctgtat840 cagtttttcctggccttcgcccttccgtttgtggtcatcactgctgcgtacgtgaaaata900 ctacagcgcatgacgtcttcggtggccccagcctctcaacgcagcatccggcttcggaca960 aagagggtgacccgcacagccattgccatctgtctggtcttctttgtgtgctgggcgccc1020 tactacgtgctgCagCtgaCCCagttgtCCatCagCCgCCCgaCCCtC2.CattCgtCtaC1080 ctgtacaatgcggccatcagcttgggctatgccaacagctgcctcaatccctttgtgtac1140 atagtactctgtgagacctttcgaaaacgcttggtgctgtcggtgaagcccgcggcccag1200 gggcagcttcgcacggtcagcaatgctcagacagctgacgaggagaggacagaaagcaaa1260 ggcaccgtcgacggtaccgcgggcccgggatccatcgccaccatggtgagcaagggcgag1320 gagctgttcaccggggtggtgcccatcctggtcgagctggacggcgacgtaaacggccac1380 aagttcagcgtgtccggcgagggcgagggcgatgccacctacggcaagctgaccctgaag1440 ttcatctgcaccaccggcaagctgcccgtgccctggcccaccctcgtgaccaccctgacc1500 tacggcgtgcagtgcttcagccgctaccccgaccacatgaagcagcacgacttcttcaag1560 tccgccatgcccgaaggctacgtccaggagcgcaccatcttcttcaaggacgacggcaac1620 tacaagacccgcgccgaggtgaagttcgagggcgacaccctggtgaaccgcatcgagctg1680 aagggcatcgacttcaaggaggacggcaacatcctggggcacaagctggagtacaactac1740 aacagccacaacgtctatatcatggccgacaagcagaagaacggcatcaaggtgaacttc1800 aagatccgccacaacatcgaggacggcagcgtgcagctcgccgaccactaccagcagaac1860 acccccatcggcgacggccccgtgctgctgcccgacaaccactacctgagcacccagtcc1920 gccctgagcaaagaccccaacgagaagcgcgatcacatggtcctgctggagttcgtgacc1980 gccgccgggatcactctcggcatggacgagctgtacaagtas 2022 <210> 30 <211> 35 <212> DNA
<213> Artificial Sequence <220>
<223> Linker <400> 30 tcgacggtac cgcgggcccg ggatccatcg ccacc 35 <210> 31 <211> 12 <212> PRT
<213> Artificial Sequence <220>
<223> Linker <400> 31 Val Asp Gly Thr Ala Gly Pro Gly Ser Ile Ala Thr <210> 32 <211> 29 <212> DNA
<213> Artificial Sequence <220>
<223> PCR primer <400> 32 gcgaattcac catggatctg caagcctcg 29 <210> 33 <211> 28 <212> DNA
<213> Artificial Sequence <220>
<223> PCR primer <400> 33 gcgtcgacgg tgcctttgct ttctgtcc 28 <210> 34 <211> 33 <212> DNA
<213> Artificial Sequence <220>
<223> PCR primer <400> 34 ccttgctcac catggtgcct ttgctttctg tcc 33 <210> 35 <211> 39 <212> DNA
<213> Artificial Sequence <220>
<223> PCR primer <400> 35 cagaaagcaa aggcaccatg gtgagcaagg gcgaggagc 39 <210> 36 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> PCR primer <400> 36 ggcggatcct ctagagtcgc ggcc 24 <210> 37 <211> 38 <212> DNA
<213> Artificial Sequence <220>
<223> PCR primer <400> 37 gctctagagt cgcggccgct tacttgtaca gctcgtcc 38
<400> 5 Met Ser Val Gly Ala Met Lys Lys Gly Val Gly Arg Ala Val Gly Leu Gly Gly Gly Ser Gly Cys Gln Ala Thr Glu Glu Asp Pro Leu Pro Asn Cys Gly Ala Cys Ala Pro Gly Gln Gly Gly Arg Arg Trp Arg Leu Pro Gln Pro Ala Trp Val Glu Gly Ser Ser Ala Arg Leu Trp Glu Gln Ala Thr Gly Thr Gly Trp Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly Pro Asn Ala Ser Asn Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala Gly Ser Pro Pro Arg Thr Gly Ser Ile Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe Gly Thr Ile Cys Leu Leu Gly Ile Ile Gly Asn Ser Thr Val Ile Phe Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln Phe Thr Ser Thr Tyr Ile Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu Ala Thr Val His Pro Ile Ser Ser Thr Lys Phe Arg Lys Pro Ser Met Ala Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser Ile Thr Pro Val Trp Leu Tyr Ala Arg Leu Ile Pro Phe Pro Gly Gly Ala Val Gly Cys Gly Ile Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val Ile Thr Ala Ala Tyr Val Lys Ile Leu Gln Arg Met Thr Ser Ser Val Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Thr Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala Ile Ser Leu Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg Thr Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly Thr <210> 6 <211> 238 <212> PRT
<213> Aequorea Victoria <400> 6 Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe Ser Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr His G1y Met Asp Glu Leu Tyr Lys <210> 7 <211> 239 <212> PRT
<213> Artificial Sequence <220>
<223> GFP derivative <400> 7 Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly G1u Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu 195 200 . 205 Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys <210> 8 <211> 239 <212> PRT
<213> Artificial Sequence <220>
<223> GFP derivative <400> 8 Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gln Cys Phe Ala Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Lys Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Thr Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys <210> 9 <211> 239 <212> PRT
<213> Artificial Sequence <220>
<223> GFP derivative <400> 9 Met.Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe Gly Tyr Gly Val Gln Cys Phe Ala Arg Tyr Pro Asp His Met Arg Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys <210> 10 <211> 239 <212> PRT
<213> Artificial Sequence <220>
<223> GFP derivative <400> 10 Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Arg Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Trp Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly l15 120 125 Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Ile Ser His Asn Val Tyr Ile Thr Ala Asp Lys Gln Lys Asn Gly Ile Lys Ala His Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly _g_ Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys <210> 11 <211> 604 <212> PRT
<213> Artificial Sequence <220>
<223> Mouse MCH1R-linker-EGFP
<400> 11 Met Asp Leu Gln Ala Ser Leu Leu Ser Thr Gly Pro Asn Ala Ser Asn Ile Ser Asp Gly Gln Asp Asn Phe Thr Leu Ala Gly Pro Pro Pro Arg Thr Arg Ser Val Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe Gly Thr Ile Cys Leu Leu Gly Ile Val Gly Asn Ser Thr Val Ile Phe Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Ser Asn Val Pro Asp Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln Phe Thr Ser Thr Tyr Ile Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu Ala Thr Val His Pro Ile Ser Ser Thr Lys Phe Arg Lys Pro Ser Met Ala Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser Ile Thr Pro Val Trp Leu Tyr Ala Arg Leu Ile Pro Phe Pro Gly Gly Ala Val Gly Cys Gly Ile Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val Ile Thr Ala Ala Tyr Val Lys Ile Leu Gln Arg Met Thr Ser Ser Val Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Thr Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala Ile Ser Leu Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg Thr Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly Thr Val Asp Gly Thr Ala Gly Pro Gly Ser Ile Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys <210> 12 <211> 592 <212> PRT
<213> Artificial Sequence <220>
<223> Mouse MCH1R/EGFP
<400> 12 Met Asp Leu Gln Ala Ser Leu Leu Ser Thr Gly Pro Asn Ala Ser Asn Ile Ser Asp Gly Gln Asp Asn Phe Thr Leu Ala Gly Pro Pro Pro Arg Thr Arg Ser Val Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe Gly Thr Ile Cys Leu Leu Gly Ile Val Gly Asn Ser Thr Val Ile Phe Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Ser Asn Val Pro Asp Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln Phe Thr Ser Thr Tyr Ile Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu Ala Thr Val His Pro Ile Ser Ser Thr Lys Phe Arg Lys Pro Ser Met Ala Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser Ile Thr Pro Val Trp Leu Tyr Ala Arg Leu Ile Pro Phe Pro Gly Gly Ala Val Gly Cys Gly I1e Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val Ile Thr Ala Ala Tyr Val Lys Ile Leu Gln Arg Met Thr Ser Ser Val Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Thr Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala Ile Ser Leu Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg Thr Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Va1 Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys <210> 13 <211> 604 <212> PRT
<213> Artificial Sequence <220>
<223> MCH1R-linker-EGFP
<400> 13 Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly Pro Asn Ala Ser Asn Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala Gly Ser Pro Pro Arg Thr Gly Ser Ile Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe Gly Thr Ile Cys Leu Leu Gly Ile Ile Gly Asn Ser Thr Val Ile Phe Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln Phe Thr Ser Thr Tyr Ile Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu Ala Thr Val His Pro Ile Ser Ser Thr Lys Phe Arg Lys Pro Ser Met Ala Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser Ile Thr Pro Val Trp Leu Tyr Ala Arg Leu Ile Pro Phe Pro Gly Gly Ala Val Gly Cys Gly Ile Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val Ile Thr Ala Ala Tyr Val Lys Ile Leu Gln Arg Met Thr Ser Ser Val Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Thr Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala Ile Ser Leu Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg Thr Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly Thr Val Asp Gly Thr Ala Gly Pro Gly Ser Ile Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr G1y Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Tle Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Va1 Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr-Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys <210> 14 <211> 673 <212> PRT
<213> Artificial Sequence <220>
<223> MCH1R-linker-EGFP
<400> 14 Met Ser Val Gly Ala Met Lys Lys Gly Val Gly Arg Ala Val Gly Leu Gly Gly Gly Ser Gly Cys Gln Ala Thr Glu Glu Asp Pro Leu Pro Asn Cys Gly Ala Cys Ala Pro Gly Gln Gly Gly Arg Arg Trp Arg Leu Pro Gln Pro Ala Trp Val Glu Gly Ser Ser Ala Arg Leu Trp Glu Gln Ala Thr Gly Thr Gly Trp Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly Pro Asn Ala Ser Asn Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala Gly Ser Pro Pro Arg Thr Gly Ser Ile Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe Gly Thr Ile Cys Leu Leu Gly Ile Ile Gly Asn Ser Thr Val Ile Phe Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln Phe Thr Ser Thr Tyr Ile Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu Ala Thr Val His Pro Ile Ser Ser Thr Lys Phe Arg Lys Pro Ser Met Ala Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser Ile Thr Pro Val Trp Leu Tyr Ala Arg Leu Ile Pro Phe Pro Gly Gly Ala Val Gly Cys Gly Ile Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val Ile Thr Ala Ala Tyr Val Lys Ile Leu Gln Arg Met Thr Ser Ser Val Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Thr Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu Thr Phe Val Tyr Leu Tyr Asn Ala Ala Ile Ser Leu Gly Tyr Ala Asn Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg Thr Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys Gly Thr Val Asp Gly Thr Ala Gly Pro Gly Ser Ile Ala Thr Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys 465 470 475 ' 480 Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp G1y Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys <210> 15 <211> 1269 <212> DNA
<213> Human <400>
atgtcagtgggagccatgaagaagggagtggggagggcagttgggcttggaggcggcagc 60 ggctgccaggctacggaggaagacccccttcccaactgcggggcttgcgctccgggacaa 120 ggtggcaggcgctggaggctgccgcagcctgcgtgggtggaggggagctcagctcggttg 180 tgggagcaggcgaccggcactggctggatggacctggaagcctcgctgctgcccactggt 240 cccaacgccagcaacacctctgatggccccgataacctcacttcggcaggatcacctcct 300 cgcacggggagcatctcctacatcaacatcatcatgccttcggtgttcggcaccatctgc 360 ctcctgggcatcatcgggaactccacggtcatcttcgcggtcgtgaagaagtccaagctg 420 cactggtgcaacaacgtccccgacatcttcatcatcaacctctcggtagtagatctcctc480 tttctcctgggcatgcccttcatgatccaccagctcatgggcaatggggtgtggcacttt540 ggggagaccatgtgcaccctcatcacggccatggatgccaatagtcagttcaccagcacc600 tacatcctgaccgccatggccattgaccgctacctggccactgtccaccccatctcttcc660 acgaagttccggaagccctctgtggccaccctggtgatctgcctcctgtgggccctctcc720 ttcatcagcatcacccctgtgtggctgtatgccagactcatccccttcccaggaggtgca780 gtgggctgcggcatacgcctgcccaacccagacactgacctctactggttcaccctgtac840 cagtttttcctggcctttgccctgccttttgtggtcatcacagccgcatacgtgaggatc900 ctgcagcgcatgacgtcctcagtggcccccgcctcccagcgcagcatccggctgcggaca960 aagagggtgacccgcacagccatcgccatctgtctggtcttctttgtgtgctgggcaccc1020 tactatgtgctacagctgacccagttgtccatcagccgcccgaccctcacctttgtctac1080 ttatacaatgcggccatcagcttgggctatgccaacagctgcctcaacccctttgtgtac1140 atcgtgctctgtgagacgttccgcaaacgcttggtcctgtcggtgaagcctgcagcccag1200 gggcagcttcgcgctgtcagcaacgctcagacggctgacgaggagaggacagaaagcaaa1260 ggcacctga <210>
<211>
<212>
DNA
<213>
Human <400>
atggacctggaagcctcgctgctgcccactggtcccaatgccagcaacacctctgatggc60 cccgataacctcacttcggcaggatcacctcctcgcacggggagcatctcctacatcaac120 atcatcatgccttcggtgttcggcaccatctgcctcctgggcatcatcgggaactccacg180 gtcatcttcgcggtcgtgaagaagtccaagctgcactggtgcaacaacgtccccgacatc240 ttcatcatcaacctctcggtagtagatctcctctttctcctgggcatgcccttcatgatc300 caccagctcatgggcaatggggtgtggcactttggggagaccatgtgcaccctcatcacg360 gccatggatgccaatagtcagttcaccagcacctacatcctgaccgccatggccattgac420 cgctacctggccactgtccaccccatctcttccacgaagttccggaagccctctgtggcc480 accctggtgatctgcctcctgtgggccctctccttcatcagcatcacccctgtgtggctg540 tatgccagactcatccccttcccaggaggtgcagtgggctgcggcatacgcctgcccaac600 ccagacactgacctctactggttcaccctgtaccagtttttcctggcctttgccctgcct660 tttgtggtcatcacagccgcatacgtgaggatcctgcagcgcatgacgtcctcagtggcc720 cccgcctcccagcgcagcatccggctgcggacaaagagggtgacccgcacagccatcgcc780 atctgtctggtcttctttgtgtgctgggcaccctactatgtgctacagctgacccagttg840 tccatcagccgcccgaccctcacctttgtctacttatacaatgcggccatcagcttgggc900 tatgccaacagctgcctcaacccctttgtgtacatcgtgctctgtgagacgttccgcaaa960 cgcttggtcctgtcggtgaagcctgcagcccaggggcagcttcgcgctgtcagcaacgct1020 cagacggctgacgaggagaggacagaaagcaaaggcacctga 1062 <210>
<211>
<212>
DNA
<213>
Mouse <220>
<221> feature misc_ <222>
(1)...(2080) <223>
n = A,T,C
or G
<400>
ggcggtagaggaagacccttttctggactgcggggctcaagctccggacaaggcggtgga 60 gggcgctggaggctgccgcagcctgcgtgggtggacgggcgctccactccagggagcagg 120 cgacctgcaccggctgcatggatctgcaagcctcgttgctgtccactggccccaatgcca 180 gcaacatctccgatggccaggataatttcacattggcggggccacctcctcgcacaagga 240 gtgtctcctacatcaacatcatcatgccttcagtgtttggtaccatctgtctcctgggca 300 ttgtgggaaactccacagtcatttttgccgtggtgaagaaatccaagctgcactggtgca 360 gcaacgtccctgacatcttcatcatcaacctctctgtggtggatctgcttttcctgctgg 420 gcatgcctttcatgatccaccagctcatgggtaatggtgtctggcactttggggaaacca 480 tgtgcaccctcatcacagccatggacgccaacagtcagttcaccagcacctacatcctga 540 ctgctatggccattgaccgctacttggccaccgtccatcccatctcctccaccaagttcc 600 ggaagccctccatggccaccctggtgatctgcctcctgtgggctctctcgttcattagca660 tcactcctgtgtggctctatgccaggcttatccccttcccagggggtgctgtgggctgtg720 gcatccgcctaccaaacccagatactgatctttactggttcactctgtatcagtttttcc780 tggccttcgcccttccgtttgtggtcatcactgctgcgtacgtgaaaatactacagcgca840 tgacgtcttcggtggccccagcctctcaacgcagcatccggcttcggacaaagagggtga900 cccgcacagccattgccatctgtctggtcttctttgtgtgctgggcgccctactacgtgc960 tgcagctgacccagttgtccatcagccgcccgaccctcacattcgtctacctgtacaatg1020 cggccatcagcttgggctatgccaacagctgcctcaatccctttgtgtacatagtactct1080 gtgagacctttcgaaaacgcttggtgctgtcggtgaagcccgcggcccaggggcagcttc1140 gcacggtcagcaatgctcagacagctgacgaggagaggacagaaagcaaaggcacctgac1200 aatcccccccggtcacctccaagtcaggtcaccgcatcaaaccatggggagagatactga1260 gataaacccggggctaccctgggaggatgcagaagctggaggctgggggcttgtagcaaa1320 ccacattccacggggcccacaaattgctagggaggcttgcagcctggtttgggggggaag1380 cctcagactgcagggatccccttgacagaatagaagcggagcaagaaggaaagggtggtt1440 tgactggttctcggggtctgtatctgttggctcgcatatatctttctctcaagggaagaa1500 ggcggaggtgcctagctgggttcctttaaaactaggcagggctaggatctgagcagctag1560 ggctctactgtgagactgggcaagccgagcgttccctcccatctctcattggtgttgata1620 gaaggcagtctttctcccaagctggtggatctcctgaagcacgctgcctgggctccagca1680 tcctgtgcggatttcacgttctctttaggggatgcatgttgacactggggtgtgggctct1740 gagcccacaggagtttaaaaaaccaaaagagctcagagtgtcgagagagacccaatcacc1800 gagaatgacaaggcaacctggggtggatgtggatcttgaaactaataaaaaggggttttc1860 acagtgacagcgacattctcttcatagggcacagctgtcagtctatggctgatccagagc1920 gagcatccatgaattctgcatgtgcaggggtcactctaatacctgatatgttggcatcat1980 ctttgtgcttgagccttccnctcccaaatgggaatgaaataaaggcaaattcccnccccc2040 cccaaaaaaggggnaaaaaaaaaaaaaaaaaaaaaaaaaa 2080 <210>
<211>
<212>
DNA
<213>
Mouse <220>
<221> feature misc _ <222>
(1).
.(3357) <223>
n = A,T,C
or G
<400>
ggcggtagaggaagacccttttctggactgcggggctcaagctccggacaaggcggtgga60 gggcgctggaggctgccgcagcctgcgtgggtggacgggcgctccactccagggagcagg120 cgacctgcaccggctgcatggatctgcaagCCtCgttgCtgtCCaCtggCCCCaatgCCa180 gcaacatctccgatggccaggataatttcacattggcgggtgagtcgagttggagtcctc240 cctcctccgggatgggtgtggaaaatgggaaggtttcacctcccaagccaaactgcctgg300 gaaactttatcttacagttcttggtgataagatctgcagtcggctttgcctgaagaggaa360 gaggagaggaggggacaccagctaggacagaaggggcagggaggaatagagatggggcag420 aggcacatttagaaacaacaagggttggtgacaagacgtgaggcaggcttgaggggaaag480 cttgctgatgagtcccaaatatgctttgcaggggggggggggggggaatcaaggctggag540 aagcaagcaagcaagacagcaagacagcgggcgggtagtatgtgggagccagcagaagcg600 ctttgattcaccgctatcctgggctcaatcctctggcctcgcactggggaaatggggtct660 gagtggtccttgctgtcttctggcaaaggctgctgggagcaaaagacttcacagggcgtg720 agaggattaacttttctggtgaattaagcttcttgacatttgcagaacgtcaatgcctta780 aaattctagctctgaaggagaagggaatgaaggggaaagagggaaggttggtgtggagaa840 attcccaagcttctggggtgtaacacagctccagtccctaccctattgggaaagcccaga900 ctcaggagacatggtccaaggaaatccctgacagaaaaccgggagagggcagggctgtgg960 agcctgaaacacaccccacacccatggtgacagtcacttctcacatatgcctaggaacct1020 atctgaaacctttggccatctctctctgaaaagatgaggctgcaaatacacacacacaca1080 cacacacacacacacacacacacacacacacacacacacacacacacacaaatgtccttc1140 aagcctttttgacaaggttttctggtggatcccggggatatgaagttgttctcagcagat1200 atctgggagtcttgactcctggccctctgagtaaatggatgaagcgaagaagaatggggt1260 cctctgagtaacaggtggatctagaaaatcctataggagtcaccagggcacggtggagga1320 gggtaaggtacagaactaacaatagcccgagaaggggaaacagcaggagatgattccaga1380 gacgtagtgaccccaagctgcaagggaaagcatgaggggccagcaggaaggccgacatgg1440 caggttgtcagcttctagatcggaaggcgggtcacacttgctctttctatcctcagggcc1500 acctcctcgcacaaggagtgtctcctacatcaacatcatcatgccttcagtgtttggtac1560 catctgtctcctgggcattgtgggaaactccacagtcatttttgccgtggtgaagaaatc1620 caagctgcactggtgcagcaacgtccctgacatcttcatcatcaacctctctgtggtgga1680 tctgcttttcctgctgggcatg'cctttcatgatccaccagctcatgggtaatggtgtctg1740 gcactttggggaaaccatgtgcaccctcatcacagccatggacgccaacagtcagttcac1800 cagcacctacatcctgactgctatggccattgaccgctacttggccaccgtccatcccat1860 ctcctccaccaagttccggaagccctccatggccaccctggtgatctgcctcctgtgggc1920 tctctcgttcattagcatcactcctgtgtggctctatgccaggcttatccccttcccagg1980 gggtgctgtgggctgtggcatccgcctaccaaacccagatactgatctttactggttcac2040 tctgtatcagtttttcctggccttcgcccttccgtttgtggtcatcactgctgcgtacgt2100 gaaaatactacagcgcatgacgtcttcggtggccccagcctctcaacgcagcatccggct2160 tcggacaaagagggtgacccgcacagccattgccatctgtctggtcttctttgtgtgctg2220 ggcgccctactacgtgctgcagctgacccagttgtccatcagccgcccgaccctcacatt2280 cgtctacctgtacaatgcggccatcagcttgggctatgccaacagctgcctcaatccctt2340 tgtgtacatagtactctgtgagacctttcgaaaacgcttggtgctgtcggtgaagcccgc2400 ggcccaggggcagcttcgcacggtcagcaatgctcagacagctgacgaggagaggacaga2460 aagcaaaggcacctgacaatcccccccggtcacctccaagtcaggtcaccgcatcaaacc2520 atggggagagatactgagataaacccggggctaccctgggaggatgcagaagctggaggc2580 tgggggcttgtagcaaaccacattccacggggcccacaaattgctagggaggcttgcagc2640 ctggtttgggggggaagcctcagactgcagggatccccttgacagaatagaagcggagca2700 agaaggaaagggtggtttgactggttctcggggtctgtatctgttggctcgcatatatct2760 ttctctcaagggaagaaggcggaggtgcctagctgggttcctttaaaactaggcagggct2820 aggatctgagcagctagggctctactgtgagactgggcaagccgagcgttCCCtCCCatC2880 tctcattggtgttgatagaaggcagtctttctcccaagctggtggatctcctgaagcacg2940 ctgcctgggctccagcatcctgtgcggatttcacgttctctttaggggatgcatgttgac3000 actggggtgtgggctctgagcccacaggagtttaaaaaaccaaaagagctcagagtgtcg3060 agagagacccaatcaccgagaatgacaaggcaacctggggtggatgtggatcttgaaact3120 aataaaaaggggttttcacagtgacagcgacattctcttcatagggcacagctgtcagtc3180 tatggctgatccagagcgagcatccatgaattctgcatgtgcaggggtcactctaatacc3240 tgatatgttggcatcatctttgtgcttgagccttccnctcccaaatgggaatgaaataaa3300 ggcaaattcccnccccccccaaaaaaggggnaaaaaaaaaaaaaaaaaaaaaaaaaa 3357 <210>
<211>
<212>
DNA
<213> icial Artif Sequence <220>
<223> short Human form/mouse species chimeric <400>
atggacctggaagcctcgctgctgcccactggtcccaatgccagcaacacctctgatggc60 cccgataacctcacttcggcaggatcacctcctcgcacggggagcatctcctacatcaac120 atcatcatgccttcggtgttcggcaccatctgcctcctgggcatcatcgggaactccacg180 gtcatcttcgcggtcgtgaagaagtccaagctgcactggtgcaacaacgtccccgacatc240 ttcatcatcaacctctcggtagtagatctcctctttctcctgggcatgcccttcatgatc300 caccagctcatgggcaatggggtgtggcactttggggagaccatgtgcaccctcatcacg360 gccatggatgccaatagtcagttcaccagcacctacatcctgaccgccatggccattgac420 cgctacctggccactgtccaccccatctcttccacgaagttccggaagccctccatggcc480 accctggtgatctgcctcctgtgggctctctcgttcattagcatcactcctgtgtggctc540 tatgccaggcttatccccttcccagggggtgctgtgggctgtggcatccgcctaccaaac600 ccagatactgatctttactggttcactctgtatcagtttttcctggccttcgcccttccg660 tttgtggtcatcactgctgcgtacgtgaaaatactacagcgcatgacgtcttcggtggcc720 ccagcctctcaacgcagcatccggcttcggacaaagagggtgacccgcacagccattgcc780 atctgtctggtcttctttgtgtgctgggcgccctactacgtgctgcagctgacccagttg840 tccatcagccgcccgaccctcacattcgtctacctgtacaatgcggccatcagcttgggc900 tatgccaacagctgcctcaatccctttgtgtacatagtactctgtgagacctttcgaaaa960 cgcttggtgctgtcggtgaagcccgcggcccaggggcagcttcgcacggtcagcaatgct1020 cagacagctgacgaggagaggacagaaagcaaaggcacctga 1062 <210>
<211>
<212>
DNA
<213>
Artificial Sequence <220>
<223> MCH1R
Human long form/mouse species chimeric <400>
atgtcagtgggagccatgaagaagggagtggggagggcagttgggcttggaggcggcagc60 ggctgccaggctacggaggaagacccccttcccaactgcggggcttgcgctccgggacaa120 ggtggcaggcgctggaggctgccgcagcctgcgtgggtggaggggagctcagctcggttg180 tgggagcaggcgaccggcactggctggatggacctggaagcctcgctgctgcccactggt240 cccaacgccagcaacacctctgatggccccgataacctcacttcggcaggatcacctcct300 cgcacggggagcatctcctacatcaacatcatcatgccttcggtgttcggcaccatctgc360 ctcctgggcatcatcgggaactccacggtcatcttcgcggtcgtgaagaagtccaagctg420 cactggtgcaacaacgtccccgacatcttcatcatcaacctctcggtagtagatctcctc480 tttctcctgggcatgcccttcatgatccaccagctcatgggcaatggggtgtggcacttt540 ggggagaccatgtgcaccctcatcacggccatggatgccaatagtcagttcaccagcacc600 tacatcctgaccgccatggccattgaccgctacctggccactgtccaccccatctcttcc660 acgaagttccggaagccctccatggccaccctggtgatctgcctcctgtgggctctctcg720 ttcattagcatcactcctgtgtggctctatgccaggcttatccccttcccagggggtgct780 gtgggctgtggcatccgcctaccaaacccagatactgatctttactggttcactctgtat840 cagtttttcctggccttcgcccttccgtttgtggtcatcactgctgcgtacgtgaaaata900 ctacagcgcatgacgtcttcggtggccccagcctctcaacgcagcatccggcttcggaca960 aagagggtgacccgcacagccattgccatctgtctggtcttctttgtgtgctgggcgccc1020 tactacgtgctgcagctgacccagttgtccatcagccgcccgaccctcacattcgtctac1080 ctgtacaatgcggccatcagcttgggctatgccaacagctgcctcaatccctttgtgtac1140 atagtactctgtgagacctttcgaaaacgcttggtgctgtcggtgaagcccgcggcccag1200 gggcagcttcgcacggtcagcaatgctcagacagctgacgaggagaggacagaaagcaaa1260 ggcacctga 1269 <210>
<211>
<212>
DNA
<213>
Aequorea Victoria <400>
tacacacgaataaaagataacaaagatgagtaaaggagaagaacttttcactggagttgt60 cccaattcttgttgaattagatggtgatgttaatgggcacaaattttctgtcagtggaga120 gggtgaaggtgatgcaacatacggaaaacttacccttaaatttatttgcactactggaaa180 actacctgttccatggccaacacttgtcactactttctcttatggtgttcaatgcttttc240 aagatacccagatcatatgaaacagcatgactttttcaagagtgccatgcccgaaggtta300 tgtacaggaaagaactatatttttcaaagatgacgggaactacaagacacgtgctgaagt360 caagtttgaaggtgatacccttgttaatagaatcgagttaaaaggtattgattttaaaga420 agatggaaacattcttggacacaaattggaatacaactataactcacacaatgtatacat480 catggcagacaaacaaaagaatggaatcaaagttaacttcaaaattagacacaacattga540 agatggaagcgttcaactagcagaccattatcaacaaaatactccaattggcgatggccc600 tgtccttttaccagacaaccattacctgtccacacaatctgccctttcgaaagatcccaa660 cgaaaagagagaccacatggtccttcttgagtttgtaacagctgctgggattacacatgg720 catggatgaactatacaaataaatgtccagacttccaattgacactaaagtgtccgaaca780 attactaaaatctcagggttcctggttaaattcaggctgagatattatttatatatttat840 agattcattaaaattgtatgaataatttattgatgttattgatagaggttattttcttat900 taaacaggctacttggagtgtattcttaattctatattaattacaatttgatttgacttg960 ctcaaa 966 <210> 22 <211> 765 <212> DNA
<213> Artificial Sequence <220>
<223> GFP derivative <400> 22 gtcgacggtaccgcgggcccgggatccatcgccaccatggtgagcaagggcgaggagctg 60 ttcaccggggtggtgcccatcctggtcgagctggacggcgacgtaaacggccacaagttc 120 agcgtgtccggcgagggcgagggcgatgccacctacggcaagctgaccctgaagttcatc 180 tgcaccaccggcaagctgcccgtgccctggcccaccctcgtgaccaccctgacctacggc 240 gtgcagtgcttcagccgctaccccgaccacatgaagcagcacgacttcttcaagtccgcc 300 atgcccgaaggctacgtccaggagcgcaccatcttcttcaaggacgacggcaactacaag 360 acccgcgccgaggtgaagttcgagggcgacaccctggtgaaccgcatcgagctgaagggc 420 atcgacttcaaggaggacggcaacatcctggggcacaagctggagtacaactacaacagc 480 cacaacgtctatatcatggccgacaagcagaagaacggcatcaaggtgaacttcaagatc 540 cgccacaacatcgaggacggcagcgtgcagctcgccgaccactaccagcagaacaccccc 600 atcggcgacggccccgtgctgctgcccgacaaccactacctgagcacccagtccgccctg 660 agcaaagaccccaacgagaagcgcgatcacatggtcctgctggagttcgtgaccgccgcc 720 gggatcactctcggcatggacgagctgtacaagtaaagcggccgc 765 <210>
<211>
<212>
DNA
<213> ficial Arti Sequence <220>
<223> derivative GFP
<400>
atggtgagcaagggcgaggagctgttcaccggggtggtgcccatcctggtcgagctggac 60 ggcgacgtaaacggccacaagttcagcgtgtccggcgagggcgagggcgatgccacctac 120 ggcaagctgaccctgaagttcatctgcaccaccggcaagctgcccgtgccctggcccacc 180 ctcgtgaccaccttgacctacggcgtgcagtgcttcgcccgctaccccgaccacatgaag 240 cagcacgacttcttcaagtccgccatgcccgaaggctacgtccaggagcgcaccatcttc 300 ttcaaggacgacggcaactacaagacccgcgccgaggtgaagttcgagggcgacaccctg 360 gtgaaccgcatcgagctgaagggcatcgacttcaaggaggacggcaacatcctggggcac 420 aagctggagtacaactacaacagccacaaggtctatatcaccgccgacaagcagaagaac 480 ggcatcaaggtgaacttcaagacccgccacaacatcgaggacggcagcgtgcagctcgcc 540 gaccactaccagcagaacacccccatcggcgacggccccgtgctgctgcccgacaaccac 600 tacctgagcacccagtccgccctgagcaaagaccccaacgagaagcgcgatcacatggtc 660 ctgctggagttcgtgaccgccgccgggatcactctcggcatggacgagctgtacaagtaa 720 <210>
<211>
<212>
DNA
<213> ficial Arti Sequence <220>
<223> derivative GFP
<400>
atggtgagcaagggcgaggagctgttcaccggggtggtgcccatcctggtcgagctggac60 ggcgacgtaaacggccacaagttcagcgtgtccggcgagggcgagggcgatgccacctac120 ggcaagctgaccctgaagttcatctgcaccaccggcaagctgcccgtgccctggcccacc180 ctcgtgaccaccttcggctacggcgtgcagtgcttcgcccgctaccccgaccacatgcgc240 cagcacgacttcttcaagtccgccatgcccgaaggctacgtccaggagcgcaccatcttc300 ttcaaggacgacggcaactacaagacccgcgccgaggtgaagttcgagggcgacaccctg360 gtgaaccgcatcgagctgaagggcatcgacttcaaggaggacggcaacatcctggggcac420 aagctggagtacaactacaacagccacaacgtctatatcatggccgacaagcagaagaac480 ggcatcaaggtgaacttcaagatccgccacaacatcgaggacggcagcgtgcagctcgcc540 gaccactaccagcagaacacccccatcggcgacggccccgtgctgctgcccgacaaccac600 tacctgagctaccagtccgccctgagcaaagaccccaacgagaagcgcgatcacatggtc660 ctgctggagttcgtgaccgccgccgggatcactctcggcatggacgagctgtacaagtaa720 <210>
<211>
<212>
DNA
<213> Artificial Sequence <220>
<223> GFP derivative <400> 25 atggtgagcaagggcgaggagctgttcaccggggtggtgcccatcctggtcgagctggac60 ggcgacgtaaacggccacaggttcagcgtgtccggcgagggcgagggcgatgccacctac120 ggcaagctgaccctgaagttcatctgcaccaccggcaagctgcccgtgccctggcccacc180 ctcgtgaccaccctgacctggggcgtgcagtgcttcagccgctaccccgaccacatgaag240 cagcacgacttcttcaagtccgccatgcccgaaggctacgtccaggagcgtaccatcttc300 ttcaaggacgacggcaactacaagacccgcgccgaggtgaagttcgagggcgacaccctg360 gtgaaccgcatcgagctgaagggcatcgacttcaaggaggacggcaacatcctggggcac420 aagctggagtacaactacatcagccacaacgtctatatcaccgccgacaagcagaagaac480 ggcatcaaggcccacttcaagatccgccacaacatcgaggacggcagcgtgcagctcgcc540 gaccactaccagcagaacacccccatcggcgacggccccgtgctgctgcccgacaaccac600 tacctgagcacccagtccgccctgagcaaagaccccaacgagaagcgcgatcacatggtc660 ctgctggagttcgtgaccgccgccgggatcactctcggcatggacgagctgtacaagtaa720 <210>
<211>
<212>
DNA
<213>
Artificial Sequence <220>
<223> MCH1R-linker-EGFP
Mouse <400> 26 atggatctgcaagcctcgttgctgtccactggccccaatgccagcaacatctccgatggc 60 caggataatttcacattggcgggtgagtcgagttggagtcctccctcctccgggatgggt 120 gtggaaaatgggaaggtttcacctcccaagccaaactgcctgggaaactttatcttacag 180 ttcttggtgataagatctgcagtcggctttgcctgaagaggaagaggagaggaggggaca 240 ccagctaggacagaaggggcagggaggaatagagatggggcagaggcacatttagaaaca 300 acaagggttggtgacaagacgtgaggcaggcttgaggggaaagcttgctgatgagtccca 360 aatatgctttgcaggggggggggggggggaatcaaggctggagaagcaagcaagcaagac 420 agcaagacagcgggcgggtagtatgtgggagccagcagaagcgctttgattcaccgctat 480 cctgggctcaatcctctggcctcgcactggggaaatggggtctgagtggtccttgctgtc 540 ttctggcaaaggctgctgggagcaaaagacttcacagggcgtgagaggattaacttttct 600 ggtgaattaagcttcttgacatttgcagaacgtcaatgccttaaaattctagctctgaag 660 gagaagggaatgaaggggaaagagggaaggttggtgtggagaaattcccaagcttctggg 720 gtgtaacacagctccagtccctaccctattgggaaagcccagactcaggagacatggtcc 780 aaggaaatccctgacagaaaaccgggagagggcagggctgtggagcctgaaacacacccc 840 acacccatggtgacagtcacttctcacatatgcctaggaacctatctgaaacctttggcc 900 atctctctctgaaaagatgaggctgcaaatacacacacacacacacacacacacacacac 960 acacacacacacacacacacacacacacacacaaatgtccttcaagcctttttgacaagg 1020 ttttctggtggatcccggggatatgaagttgttctcagcagatatctgggagtcttgact 1080 cctggccctctgagtaaatggatgaagcgaagaagaatggggtcctctgagtaacaggtg 1140 gatctagaaaatcctataggagtcaccagggcacggtggaggagggtaaggtacagaact 1200 aacaatagcccgagaaggggaaacagcaggagatgattccagagacgtagtgaccccaag 1260 ctgcaagggaaagcatgaggggccagcaggaaggccgacatggcaggttgtcagcttcta 1320 gatcggaaggcgggtcacacttgctctttctatcctcagggccacctcctcgcacaagga 1380 gtgtctcctacatcaacatcatcatgccttcagtgtttggtaccatctgtctcctgggca 1440 ttgtgggaaactccacagtcatttttgccgtggtgaagaaatccaagctgcactggtgca 1500 gcaacgtccctgacatcttcatcatcaacctctctgtggtggatctgcttttcctgctgg 1560 gcatgccttcatgatccact gtaatggtgtctggcactttggggaaacca 1620 cagctcatgg tgtgcaccctcatcacagccatggacgccaacagtcagttcaccagcacctacatcctga 1680 ctgctatggccattgaccgctacttggccaccgtccatcccatctcctccaccaagttcc 1740 ggaagccctccatggccaccctggtgatctgcctcctgtgggctctctcgttcattagca 1800 tcactcctgtgtggctctatgccaggcttatccccttcccagggggtgctgtgggctgtg 1860 gcatccgcctaccaaacccagatactgatctttactggttcactctgtatcagtttttcc 1920 tggccttcgcccttccgtttgtggtcatcactgctgcgtacgtgaaaatactacagcgca 1980 tgacgtcttcggtggccccagcctctcaacgcagcatccggcttcggacaaagagggtga 2040 cccgcacagccattgccatctgtctggtcttctttgtgtgctgggcgccctactacgtgc2100 tgcagctgacccagttgtccatcagccgcccgaccctcacattcgtctacctgtacaatg2160 cggccatcagcttgggctatgccaacagctgcctcaatccctttgtgtacatagtactct2220 gtgagacctttcgaaaacgcttggtgctgtcggtgaagcccgcggcccaggggcagcttc2280 gcacggtcagcaatgctcagacagctgacgaggagaggacagaaagcaaaggcaccgtcg2340 acggtaccgcgggcccgggatccatcgccaccatggtgagcaagggcgaggagctgttca2400 ccggggtggtgcccatcctggtcgagctggacggcgacgtaaacggccacaagttcagcg2460 tgtccggcgagggcgagggcgatgccacctacggcaagctgaccctgaagttcatctgca2520 ccaccggcaagctgcccgtgccctggcccaccctcgtgaccaccctgacctacggcgtgc2580 agtgcttcagccgctaccccgaccacatgaagcagcacgacttcttcaagtccgccatgc2640 ccgaaggctacgtccaggagcgcaccatcttcttcaaggacgacggcaactacaagaccc2700 gcgccgaggtgaagttcgagggcgacaccctggtgaaccgcatcgagctgaagggcatcg2760 acttcaaggaggacggcaacatcctggggcacaagctggagtacaactacaacagccaca2820 acgtctatatcatggccgacaagcagaagaacggcatcaaggtgaacttcaagatccgcc2880 acaacatcgaggacggcagcgtgcagctcgccgaccactaccagcagaacacccccatcg2940 gcgacggccccgtgctgctgcccgacaaccactacctgagcacccagtccgccctgagca3000 aagaccccaacgagaagcgcgatcacatggtcctgctggagttcgtgaccgccgccggga3060 tcactctcggcatggacgagctgtacaagtas 3092 <210> 27 <211> 3056 <212> DNA
<213> Artificial Sequence <220>
<223> Mouse MCH1R/EGFP direct fusion <400>
atggatctgcaagcctcgttgctgtccactggccccaatgccagcaacatctccgatggc60 caggataatttcacattggcgggtgagtcgagttggagtcctccctcctccgggatgggt120 gtggaaaatgggaaggtttcacctcccaagccaaactgcctgggaaactttatcttacag180 ttcttggtgataagatctgcagtcggctttgcctgaagaggaagaggagaggaggggaca240 ccagctaggacagaaggggcagggaggaatagagatggggcagaggcacatttagaaaca300 acaagggttggtgacaagacgtgaggcaggcttgaggggaaagcttgctgatgagtccca360 aatatgctttgcaggggggggggggggggaatcaaggctggagaagcaagcaagcaagac420 agcaagacagcgggcgggtagtatgtgggagccagcagaagcgctttgattcaccgctat480 cctgggctcaatcctctggcctcgcactggggaaatggggtctgagtggtccttgctgtc540 ttctggcaaaggctgctgggagcaaaagacttcacagggcgtgagaggattaacttttct600 ggtgaattaagcttcttgacatttgcagaacgtcaatgccttaaaattctagctctgaag660 gagaagggaatgaaggggaaagagggaaggttggtgtggagaaattcccaagcttctggg720 gtgtaacacagctccagtccctaccctattgggaaagcccagactcaggagacatggtcc780 aaggaaatccctgacagaaaaccgggagagggcagggctgtggagcctgaaacacacccc840 acacccatggtgacagtcacttctcacatatgcctaggaacctatctgaaacctttggcc900 atctctctctgaaaagatgaggctgcaaatacacacacacacacacacacacacacacac960 acacacacacacacacacacacacacacacacaaatgtccttcaagcctttttgacaagg1020 ttttctggtggatcccggggatatgaagttgttctcagcagatatctgggagtcttgact1080 cctggccctctgagtaaatggatgaagcgaagaagaatggggtcctctgagtaacaggtg1140 gatctagaaaatcctataggagtcaccagggcacggtggaggagggtaaggtacagaact1200 aacaatagcccgagaaggggaaacagcaggagatgattccagagacgtagtgaccccaag1260 ctgcaagggaaagcatgaggggccagcaggaaggccgacatggcaggttgtcagcttcta1320 gatcggaaggcgggtcacacttgctctttctatcctcagggccacctcctcgcacaagga1380 gtgtctcctacatcaacatcatcatgccttcagtgtttggtaccatctgtctcctgggca1440 ttgtgggaaactccacagtcatttttgccgtggtgaagaaatccaagctgcactggtgca1500 gcaacgtccctgacatcttcatcatcaacctctctgtggtggatctgcttttcctgctgg1560 gcatgcctttcatgatccaccagctcatgggtaatggtgtctggcactttggggaaacca1620 tgtgcaccctcatcacagccatggacgccaacagtcagttcaccagcacctacatcctga1680 ctgctatggccattgaccgctacttggccaccgtccatcccatctcctccaccaagttcc1740 ggaagccctccatggccaccctggtgatctgcctcctgtgggctctctcgttcattagca1800 tcactcctgtgtggctctatgccaggcttatccccttcccagggggtgctgtgggctgtg1860 gcatccgcctaccaaacccagatactgatctttactggttcactctgtatcagtttttcc1920 tggccttcgcccttccgtttgtggtcatcactgctgcgtacgtgaaaatactacagcgca1980 tgacgtcttcggtggccccagcctctcaacgcagcatccggcttcggacaaagagggtga2040 cccgcacagccattgccatctgtctggtcttctttgtgtgctgggcgccctactacgtgc2100 tgcagctgacccagttgtccatcagccgcccgaccctcacattcgtctacctgtacaatg2160 cggccatcagcttgggctatgccaacagctgcctcaatccctttgtgtacatagtactct2220 gtgagacctttcgaaaacgcttggtgctgtcggtgaagcccgcggcccaggggcagcttc2280 gcacggtcagcaatgctcagacagctgacgaggagaggacagaaagcaaaggcaccatgg2340 tgagcaagggcgaggagctgttcaccggggtggtgcccatcctggtcgagctggacggcg2400 acgtaaacggccacaagttcagcgtgtccggcgagggcgagggcgatgccacctacggca2460 agctgaccctgaagttcatctgcaccaccggcaagctgcccgtgccctggcccaccctcg2520 tgaccaccctgacctacggcgtgcagtgcttcagccgctaccccgaccacatgaagcagc2580 acgacttcttcaagtccgccatgcccgaaggctacgtccaggagcgcaccatcttcttca2640 aggacgacggcaactacaagacccgcgccgaggtgaagttcgagggcgacaccctggtga2700 accgcatcgagctgaagggcatcgacttcaaggaggacggcaacatcctggggcacaagc2760 tggagtacaactacaacagccacaacgtctatatcatggccgacaagcagaagaacggca2820 tcaaggtgaacttcaagatccgccacaacatcgaggacggcagcgtgcagctcgccgacc2880 actaccagcagaacacccccatcggcgacggccccgtgctgctgcccgacaaccactacc2940 tgagcacccagtccgccctgagcaaagaccccaacgagaagcgcgatcacatggtcctgc3000 tggagttcgtgaccgccgccgggatcactctcggcatggacgagctgtacaagtaa 3056 <210> 28 <211> 1815 <212> DNA
<213> Artificial Sequence <220>
<223> Human short form/mouse species chimeric MCH1R-linker-EGFP
<400> 28 atggacctggaagcctcgctgctgcccactggtcccaatgccagcaacacctctgatggc60 cccgataacctcacttcggcaggatcacctcctcgcacggggagcatctcctacatcaac120 atcatcatgccttcggtgttcggcaccatctgcctcctgggcatcatcgggaactccacg180 gtcatcttcgcggtcgtgaagaagtccaagctgcactggtgcaacaacgtccccgacatc240 ttcatcatcaacctctcggtagtagatctcctctttctcctgggcatgcccttcatgatc300 caccagctcatgggcaatggggtgtggcactttggggagaccatgtgcaccctcatcacg360 gccatggatgccaatagtcagttcaccagcacctacatcctgaccgccatggccattgac420 cgctacctggccactgtccaccccatctcttccacgaagttccggaagccctccatggcc480 accctggtgatctgcctcctgtgggctctctcgttcattagcatcactcctgtgtggctc540 tatgccaggcttatccccttcccagggggtgctgtgggctgtggcatccgcctaccaaac600 ccagatactgatctttactggttcactctgtatcagtttttcctggccttcgcccttccg660 tttgtggtcatcactgctgcgtacgtgaaaatactacagcgcatgacgtcttcggtggcc720 ccagcctctcaacgcagcatccggcttcggacaaagagggtgacccgcacagccattgcc780 atctgtctggtcttctttgtgtgctgggcgccctactacgtgctgcagctgacccagttg840 tccatcagccgcccgaccctcacattcgtctacctgtacaatgcggccatcagcttgggc900 tatgccaacagctgcctcaatccctttgtgtacatagtactctgtgagacctttcgaaaa960 cgcttggtgctgtcggtgaagcccgcggcccaggggcagcttcgcacggtcagcaatgct1020 cagacagctgacgaggagaggacagaaagcaaaggcaccgtcgacggtaccgcgggcccg1080 ggatccatcgccaccatggtgagcaagggcgaggagctgttcaccggggtggtgcccatc1140 ctggtcgagctggacggcgacgtaaacggccacaagttcagcgtgtccggcgagggcgag1200 ggcgatgccacctacggcaagctgaccctgaagttcatctgcaccaccggcaagctgccc1260 gtgccctggcccaccctcgtgaccaccctgacctacggcgtgcagtgcttcagccgctac1320 cccgaccacatgaagcagcacgacttcttcaagtccgccatgcccgaaggctacgtccag1380 gagcgcaccatcttcttcaaggacgacggcaactacaagacccgcgccgaggtgaagttc1440 gagggcgacaccctggtgaaccgcatcgagctgaagggcatcgacttcaaggaggacggc1500 aacatcctggggcacaagctggagtacaactacaacagccacaacgtctatatcatggcc1560 gacaagcagaagaacggcatcaaggtgaacttcaagatccgccacaacatcgaggacggc1620 agcgtgcagctcgccgaccactaccagcagaacacccccatcggcgacggccccgtgctg1680 ctgcccgacaaccactacctgagcacccagtccgccctgagcaaagaccccaacgagaag1740 cgcgatcacatggtcctgctggagttcgtgaccgccgccgggatcactctcggcatggac1800 gagctgtacaagtaa 1815 <210> 29 <211> 2022 <212> DNA
<213> Artificial Sequence <220>
<223> Human long form/mouse species chimeric MCH1R-linker-EGFP
<400>
atgtcagtgggagccatgaagaagggagtggggagggcagttgggcttggaggcggcagc60 ggctgccaggctacggaggaagacccccttcccaactgcggggcttgcgctccgggacaa120 ggtggcaggcgctggaggctgccgcagcctgcgtgggtggaggggagctcagctcggttg180 tgggagcaggcgaccggcactggctggatggacctggaagcctcgctgctgcccactggt240 cccaacgccagcaacacctctgatggccccgataacctcacttcggcaggatcacctcct300 cgcacggggagcatctcctacatcaacatcatcatgccttcggtgttcggcaccatctgc360 ctcctgggcatcatcgggaactccacggtcatcttcgcggtcgtgaagaagtccaagctg420 cactggtgcaacaacgtccccgacatcttcatcatcaacctctcggtagtagatctcctc480 tttctcctgggcatgcccttcatgatccaccagctcatgggcaatggggtgtggcacttt540 ggggagaccatgtgcaccctcatcacggccatggatgccaatagtcagttcaccagcacc600 tacatcctgaccgccatggccattgaccgctaCCtggCCaCtgtCCaCCCCatCtCttCC660 acgaagttccggaagccctccatggccaccctggtgatctgcctcctgtgggctctctcg720 ttcattagcatcactcctgtgtggctctatgccaggcttatccccttcccagggggtgct780 gtgggctgtggcatccgcctaccaaacccagatactgatctttactggttcactctgtat840 cagtttttcctggccttcgcccttccgtttgtggtcatcactgctgcgtacgtgaaaata900 ctacagcgcatgacgtcttcggtggccccagcctctcaacgcagcatccggcttcggaca960 aagagggtgacccgcacagccattgccatctgtctggtcttctttgtgtgctgggcgccc1020 tactacgtgctgCagCtgaCCCagttgtCCatCagCCgCCCgaCCCtC2.CattCgtCtaC1080 ctgtacaatgcggccatcagcttgggctatgccaacagctgcctcaatccctttgtgtac1140 atagtactctgtgagacctttcgaaaacgcttggtgctgtcggtgaagcccgcggcccag1200 gggcagcttcgcacggtcagcaatgctcagacagctgacgaggagaggacagaaagcaaa1260 ggcaccgtcgacggtaccgcgggcccgggatccatcgccaccatggtgagcaagggcgag1320 gagctgttcaccggggtggtgcccatcctggtcgagctggacggcgacgtaaacggccac1380 aagttcagcgtgtccggcgagggcgagggcgatgccacctacggcaagctgaccctgaag1440 ttcatctgcaccaccggcaagctgcccgtgccctggcccaccctcgtgaccaccctgacc1500 tacggcgtgcagtgcttcagccgctaccccgaccacatgaagcagcacgacttcttcaag1560 tccgccatgcccgaaggctacgtccaggagcgcaccatcttcttcaaggacgacggcaac1620 tacaagacccgcgccgaggtgaagttcgagggcgacaccctggtgaaccgcatcgagctg1680 aagggcatcgacttcaaggaggacggcaacatcctggggcacaagctggagtacaactac1740 aacagccacaacgtctatatcatggccgacaagcagaagaacggcatcaaggtgaacttc1800 aagatccgccacaacatcgaggacggcagcgtgcagctcgccgaccactaccagcagaac1860 acccccatcggcgacggccccgtgctgctgcccgacaaccactacctgagcacccagtcc1920 gccctgagcaaagaccccaacgagaagcgcgatcacatggtcctgctggagttcgtgacc1980 gccgccgggatcactctcggcatggacgagctgtacaagtas 2022 <210> 30 <211> 35 <212> DNA
<213> Artificial Sequence <220>
<223> Linker <400> 30 tcgacggtac cgcgggcccg ggatccatcg ccacc 35 <210> 31 <211> 12 <212> PRT
<213> Artificial Sequence <220>
<223> Linker <400> 31 Val Asp Gly Thr Ala Gly Pro Gly Ser Ile Ala Thr <210> 32 <211> 29 <212> DNA
<213> Artificial Sequence <220>
<223> PCR primer <400> 32 gcgaattcac catggatctg caagcctcg 29 <210> 33 <211> 28 <212> DNA
<213> Artificial Sequence <220>
<223> PCR primer <400> 33 gcgtcgacgg tgcctttgct ttctgtcc 28 <210> 34 <211> 33 <212> DNA
<213> Artificial Sequence <220>
<223> PCR primer <400> 34 ccttgctcac catggtgcct ttgctttctg tcc 33 <210> 35 <211> 39 <212> DNA
<213> Artificial Sequence <220>
<223> PCR primer <400> 35 cagaaagcaa aggcaccatg gtgagcaagg gcgaggagc 39 <210> 36 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> PCR primer <400> 36 ggcggatcct ctagagtcgc ggcc 24 <210> 37 <211> 38 <212> DNA
<213> Artificial Sequence <220>
<223> PCR primer <400> 37 gctctagagt cgcggccgct tacttgtaca gctcgtcc 38
Claims (30)
1. A fusion protein comprising:
a) a melanin concentrating hormone receptor polypeptide region comprising a sequence selected from the group consisting of: SEQ. ID.
NO. 1, SEQ. ID. NO. 2, SEQ. SEQ. ID. NO. 3, SEQ. ID. NO. 4, and SEQ. ID. NO. 5; and b) a fluorescent polypeptide region joined directly, or though a linker, to the carboxy side of said melanin concentrating hormone receptor polypeptide region.
a) a melanin concentrating hormone receptor polypeptide region comprising a sequence selected from the group consisting of: SEQ. ID.
NO. 1, SEQ. ID. NO. 2, SEQ. SEQ. ID. NO. 3, SEQ. ID. NO. 4, and SEQ. ID. NO. 5; and b) a fluorescent polypeptide region joined directly, or though a linker, to the carboxy side of said melanin concentrating hormone receptor polypeptide region.
2. The protein of claim 1, wherein said fluorescent polypeptide region consists of an amino acid sequence selected from the group consisting of SEQ.
ID. NO. 6, SEQ. ID. NO. 7, SEQ. ID. NO. 8, SEQ. ID. NO. 9, and SEQ. ID. NO.
10.
ID. NO. 6, SEQ. ID. NO. 7, SEQ. ID. NO. 8, SEQ. ID. NO. 9, and SEQ. ID. NO.
10.
3. The protein of claim 2, wherein said melanin concentrating hormone polypeptide region consists of a sequence selected from the group consisting of: SEQ. ID. NO. 1, SEQ. ID. NO. 2, SEQ. ID. NO. 3, SEQ. ID. NO. 4, and SEQ.
ID.
NO. 5.
ID.
NO. 5.
4. The protein of claim 3, wherein said protein consists essentially of said melanin concentrating hormone receptor polypeptide region and said fluorescent polypeptide region.
5. The protein of claim 4, wherein said protein consists of the amino acid sequence of SEQ. ID. NO. 11 or SEQ. ID. NO. 12.
6. The protein of claim 1, wherein said melanin concentrating hormone polypeptide region is a chimeric polypeptide comprising (a) an MCH
binding region from a first species and (b) a transmembrane and intracellular domain region from a second species joined directly, or though a linker, to the carboxy side of said MCH binding region.
binding region from a first species and (b) a transmembrane and intracellular domain region from a second species joined directly, or though a linker, to the carboxy side of said MCH binding region.
7. The protein of claim 6, wherein said fluorescent polypeptide region consists of an amino acid sequence selected from the group consisting of: SEQ.
ID. NO. 6, SEQ. ID. NO. 7, SEQ. ID. NO. 8, SEQ. ID. NO. 9, and SEQ. ID. NO.
10.
ID. NO. 6, SEQ. ID. NO. 7, SEQ. ID. NO. 8, SEQ. ID. NO. 9, and SEQ. ID. NO.
10.
8. The protein of claim 7, wherein said protein consists of the amino acid sequence of SEQ. ID. NO. 13 or SEQ. ID. NO. 14.
9. A chimeric melanin concentrating hormone protein comprising:
a) a melanin concentrating hormone binding region characteristic of a human melanin concentrating hormone receptor;
b) a transmembrane domain characteristic of a non-human melanin concentrating hormone receptor; and c) an intracellular domain characteristic of a non-human melanin concentrating hormone receptor.
a) a melanin concentrating hormone binding region characteristic of a human melanin concentrating hormone receptor;
b) a transmembrane domain characteristic of a non-human melanin concentrating hormone receptor; and c) an intracellular domain characteristic of a non-human melanin concentrating hormone receptor.
10. The protein of claim 9, wherein said protein comprises a melanin concentrating hormone receptor polypeptide having a sequence similarity of at least 75% with either SEQ. ID. NO. 4 or SEQ. ID. NO. 5.
11. The protein of claim 10, wherein said protein comprises the sequence of SEQ. ID. NO. 4 or SEQ. ID. NO. 5.
12. The protein of claim 11, wherein said protein consists of the sequence of SEQ. ID. NO. 4 or SEQ. ID. NO. 5.
13. A nucleic acid comprising a nucleotide sequence encoding for the protein of claim 1.
14. The nucleic acid of claim 13, wherein said nucleotide sequence is a contiguous sequence.
15. The nucleic acid of claim 13, wherein said nucleotide sequence is selected from the group consisting of SEQ. ID. NO. 26, SEQ. ID. NO. 27, SEQ. ID.
NO. 28 and SEQ. ID. NO. 29.
NO. 28 and SEQ. ID. NO. 29.
16. A nucleic acid comprising a nucleotide sequence encoding for the protein of claim 9.
17. The nucleic acid of claim 16, wherein said nucleotide sequence is a contiguous sequence.
18. The nucleic acid of claim 16, wherein said nucleotide sequence is selected from the group consisting of SEQ. ID. NO. 19 and SEQ. ID. NO. 20.
19. An expression vector comprising the nucleic acid of claim 13.
20. An expression vector comprising the nucleic acid of claim 16.
21. A recombinant cell comprising the nucleic acid of claim 13.
22. The recombinant cell of claim 21, wherein said nucleic acid is present in an expression vector.
23. The recombinant cell of claim 21, wherein said nucleic acid is present in the genome of said cell.
24. A recombinant cell comprising the nucleic acid of claim 16.
25. The recombinant cell of claim 24, wherein said nucleic acid is present in an expression vector.
26. The recombinant cell of claim 24, wherein said nucleic acid is present in the genome of said cell.
27. A non-human transgenic animal comprising the nucleic acid of claim 13.
28. A non-human transgenic animal comprising the nucleic acid of claim 16.
29. A method for assaying for melanin concentrating hormone receptor active compounds comprising the steps of:
a) contacting the cell of claim 21 with a test preparation comprising one or more test compounds; and b) measuring the effect of said test preparation on one or more melanin concentrating hormone receptor activities.
a) contacting the cell of claim 21 with a test preparation comprising one or more test compounds; and b) measuring the effect of said test preparation on one or more melanin concentrating hormone receptor activities.
30. A method for assaying for melanin concentrating hormone receptor active compounds comprising the steps of:
a) contacting the cell of claim 24 with a test preparation comprising one or more test compounds; and b) measuring the effect of said test preparation on one or more melanin concentrating hormone receptor activities.
a) contacting the cell of claim 24 with a test preparation comprising one or more test compounds; and b) measuring the effect of said test preparation on one or more melanin concentrating hormone receptor activities.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US18969800P | 2000-03-15 | 2000-03-15 | |
US60/189,698 | 2000-03-15 | ||
PCT/US2001/008071 WO2001068706A1 (en) | 2000-03-15 | 2001-03-14 | Melanin concentrating hormone receptor chimeric and fusion proteins |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2402147A1 true CA2402147A1 (en) | 2001-09-20 |
Family
ID=22698408
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002402147A Abandoned CA2402147A1 (en) | 2000-03-15 | 2001-03-14 | Melanin concentrating hormone receptor chimeric and fusion proteins |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP1265926A1 (en) |
JP (1) | JP2003532388A (en) |
CA (1) | CA2402147A1 (en) |
WO (1) | WO2001068706A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109813889A (en) * | 2017-11-18 | 2019-05-28 | 镇江亿特生物科技发展有限公司 | The time-resolved fluoroimmunoassay for detecting progesterone chromatographs quantitative testing test paper item |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6221613B1 (en) | 1998-12-31 | 2001-04-24 | Synaptic Pharmaceutical Corporation | DNA encoding a human melanin concentrating hormone receptor (MCH1) and uses thereof |
CA2441708A1 (en) * | 2001-03-22 | 2002-10-03 | Merck & Co., Inc. | Mch1r deficient mice |
US7078484B2 (en) | 2001-04-19 | 2006-07-18 | Neurogen Corporation | Melanin concentrating hormone receptors |
US7078187B2 (en) | 2001-04-19 | 2006-07-18 | Neurogen Corporation | Melanin concentrating hormone receptors |
US6809104B2 (en) | 2001-05-04 | 2004-10-26 | Tularik Inc. | Fused heterocyclic compounds |
US7141391B2 (en) | 2001-11-13 | 2006-11-28 | Neurogen Corporation | Monkey and canine melanin concentrating hormone receptors |
BR0316070A (en) | 2002-11-06 | 2005-09-27 | Tularik Inc | Compound, pharmaceutical composition, use of a compound and method for modulating mchr |
US20090143529A1 (en) * | 2004-06-14 | 2009-06-04 | Milton Thomas William Hearn | Peptide purification by means of hard metal ion affinity chromatography |
-
2001
- 2001-03-14 EP EP01922374A patent/EP1265926A1/en not_active Withdrawn
- 2001-03-14 CA CA002402147A patent/CA2402147A1/en not_active Abandoned
- 2001-03-14 JP JP2001567796A patent/JP2003532388A/en not_active Withdrawn
- 2001-03-14 WO PCT/US2001/008071 patent/WO2001068706A1/en not_active Application Discontinuation
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109813889A (en) * | 2017-11-18 | 2019-05-28 | 镇江亿特生物科技发展有限公司 | The time-resolved fluoroimmunoassay for detecting progesterone chromatographs quantitative testing test paper item |
Also Published As
Publication number | Publication date |
---|---|
JP2003532388A (en) | 2003-11-05 |
WO2001068706A1 (en) | 2001-09-20 |
EP1265926A1 (en) | 2002-12-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU731087B2 (en) | KCNQ potassium channels and methods of modulating same | |
CA2402147A1 (en) | Melanin concentrating hormone receptor chimeric and fusion proteins | |
US6391561B1 (en) | Protein that enhances expression of potassium channels on cell surfaces and nucleic acids that encode the same | |
US7029878B2 (en) | Melanin concentrating hormone receptor chimeric and fusion proteins | |
US7148026B2 (en) | Dog melanin-concentrating hormone receptor | |
WO2001036479A1 (en) | G-protein coupled receptor | |
US7666994B2 (en) | Cloning and expression of gonadotropin-releasing hormone (GnRH) receptors | |
US7208282B2 (en) | Rhesus monkey, dog and ferret melanin-concentrating hormone type 2 receptor | |
An et al. | Identification of a novel bursicon‐regulated transcriptional regulator, md13379, in the house fly Musca domestica | |
US20040248129A1 (en) | Rhesus monkey, dog and feeret melanin-concentrating hormone type 1 receptor | |
US20070037219A1 (en) | Rhesus monkey bombesin receptor subtype-3 (brs-3), nucleotides encoding same, and uses thereof | |
US20070248995A1 (en) | Molecules | |
WO2000044787A1 (en) | Nucleic acid encoding bovine calcitonin receptor-like receptor | |
US20050222402A1 (en) | Dna encoding rat bombesin receptor subtype-3 (brs-3) and uses thereof | |
JP2004537521A (en) | Melanin-concentrating hormone analogues | |
CA2389329A1 (en) | Dog and rabbit motilin receptor orthologs | |
WO2002033124A1 (en) | KChAP-MODULATOR OF POTASSIUM CHANNELS |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FZDE | Discontinued |