CA2325822A1 - Soluble protein ztmpo-1 - Google Patents
Soluble protein ztmpo-1
- Publication number
- CA2325822A1
- Authority
- CA
- Canada
- Prior art keywords
- ser
- leu
- glu
- amino acid
- polypeptide
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
- 108090000623 proteins and genes Proteins 0.000 title abstract description 147
- 102000004169 proteins and genes Human genes 0.000 title abstract description 91
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 242
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 228
- 229920001184 polypeptide Polymers 0.000 claims abstract description 210
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 94
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 94
- 239000002157 polynucleotide Substances 0.000 claims abstract description 94
- 210000004027 cell Anatomy 0.000 claims description 128
- 238000000034 method Methods 0.000 claims description 97
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 71
- 239000002773 nucleotide Substances 0.000 claims description 51
- 125000003729 nucleotide group Chemical group 0.000 claims description 51
- 241000282414 Homo sapiens Species 0.000 claims description 38
- 125000000539 amino acid group Chemical group 0.000 claims description 36
- 238000006467 substitution reaction Methods 0.000 claims description 32
- 230000000295 complement effect Effects 0.000 claims description 27
- 230000002068 genetic effect Effects 0.000 claims description 20
- 239000013604 expression vector Substances 0.000 claims description 17
- 239000007795 chemical reaction product Substances 0.000 claims description 16
- 230000003248 secreting effect Effects 0.000 claims description 15
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 claims description 14
- 108010076504 Protein Sorting Signals Proteins 0.000 claims description 13
- 108010055341 glutamyl-glutamic acid Proteins 0.000 claims description 13
- 238000013518 transcription Methods 0.000 claims description 9
- 230000035897 transcription Effects 0.000 claims description 9
- 108090000790 Enzymes Proteins 0.000 claims description 8
- 108091008324 binding proteins Proteins 0.000 claims description 8
- 102000004190 Enzymes Human genes 0.000 claims description 7
- KOSRFJWDECSPRO-WDSKDSINSA-N Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O KOSRFJWDECSPRO-WDSKDSINSA-N 0.000 claims description 6
- 210000004748 cultured cell Anatomy 0.000 claims description 6
- 108020001507 fusion proteins Proteins 0.000 claims description 6
- 102000037865 fusion proteins Human genes 0.000 claims description 6
- 102000005720 Glutathione transferase Human genes 0.000 claims description 5
- 108010070675 Glutathione transferase Proteins 0.000 claims description 5
- 108010021625 Immunoglobulin Fragments Proteins 0.000 claims description 5
- 102000008394 Immunoglobulin Fragments Human genes 0.000 claims description 5
- 230000005856 abnormality Effects 0.000 claims description 5
- 229920002704 polyhistidine Polymers 0.000 claims description 5
- 102000006496 Immunoglobulin Heavy Chains Human genes 0.000 claims description 4
- 108010019476 Immunoglobulin Heavy Chains Proteins 0.000 claims description 4
- 238000012258 culturing Methods 0.000 claims description 4
- 241001529936 Murinae Species 0.000 claims description 3
- 239000008194 pharmaceutical composition Substances 0.000 claims description 3
- 230000003302 anti-idiotype Effects 0.000 claims description 2
- 230000002103 transcriptional effect Effects 0.000 claims description 2
- 102000023732 binding proteins Human genes 0.000 claims 1
- 108010056197 emerin Proteins 0.000 abstract description 23
- 102100034239 Emerin Human genes 0.000 abstract description 19
- HCHFRAXBELVCGG-JYFOCSDGSA-N (2z,3z)-2,3-bis[(4-methoxyphenyl)methylidene]butanedinitrile Chemical compound C1=CC(OC)=CC=C1\C=C(/C#N)\C(\C#N)=C\C1=CC=C(OC)C=C1 HCHFRAXBELVCGG-JYFOCSDGSA-N 0.000 abstract description 18
- HCHFRAXBELVCGG-UHFFFAOYSA-N Emerin Natural products C1=CC(OC)=CC=C1C=C(C#N)C(C#N)=CC1=CC=C(OC)C=C1 HCHFRAXBELVCGG-UHFFFAOYSA-N 0.000 abstract description 17
- 239000000898 Thymopoietin Substances 0.000 abstract description 17
- 230000004663 cell proliferation Effects 0.000 abstract description 6
- 230000024245 cell differentiation Effects 0.000 abstract description 4
- 108020004414 DNA Proteins 0.000 description 88
- 235000018102 proteins Nutrition 0.000 description 87
- 235000001014 amino acid Nutrition 0.000 description 69
- 239000000523 sample Substances 0.000 description 51
- 229940024606 amino acid Drugs 0.000 description 48
- 150000001413 amino acids Chemical class 0.000 description 46
- 238000009396 hybridization Methods 0.000 description 36
- 108091034117 Oligonucleotide Proteins 0.000 description 34
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 33
- 238000003752 polymerase chain reaction Methods 0.000 description 32
- 108020004635 Complementary DNA Proteins 0.000 description 28
- 238000003556 assay Methods 0.000 description 28
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 26
- 239000013615 primer Substances 0.000 description 26
- 239000013598 vector Substances 0.000 description 26
- 210000001519 tissue Anatomy 0.000 description 25
- 230000027455 binding Effects 0.000 description 24
- 238000010804 cDNA synthesis Methods 0.000 description 24
- 239000002299 complementary DNA Substances 0.000 description 24
- 230000014509 gene expression Effects 0.000 description 21
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 20
- 238000004458 analytical method Methods 0.000 description 20
- 230000000694 effects Effects 0.000 description 19
- 108020004705 Codon Proteins 0.000 description 18
- 102000005962 receptors Human genes 0.000 description 18
- 108020003175 receptors Proteins 0.000 description 18
- 239000000243 solution Substances 0.000 description 18
- 239000012634 fragment Substances 0.000 description 17
- 150000007523 nucleic acids Chemical class 0.000 description 17
- 241000894007 species Species 0.000 description 17
- 108091028043 Nucleic acid sequence Proteins 0.000 description 16
- 102000039446 nucleic acids Human genes 0.000 description 16
- 108020004707 nucleic acids Proteins 0.000 description 16
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 15
- 239000005557 antagonist Substances 0.000 description 15
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 14
- 238000001727 in vivo Methods 0.000 description 14
- 102100023981 Lamina-associated polypeptide 2, isoform alpha Human genes 0.000 description 12
- 241001452677 Ogataea methanolica Species 0.000 description 12
- 230000000747 cardiac effect Effects 0.000 description 12
- 238000006243 chemical reaction Methods 0.000 description 12
- 238000004519 manufacturing process Methods 0.000 description 12
- 108020004999 messenger RNA Proteins 0.000 description 12
- 241000701161 unidentified adenovirus Species 0.000 description 12
- 241001465754 Metazoa Species 0.000 description 11
- 239000000872 buffer Substances 0.000 description 11
- 239000003550 marker Substances 0.000 description 11
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 10
- 241000282326 Felis catus Species 0.000 description 10
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 10
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers 
Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 10
- 229940079593 drug Drugs 0.000 description 10
- 239000003814 drug Substances 0.000 description 10
- 239000000203 mixture Substances 0.000 description 10
- 238000012216 screening Methods 0.000 description 10
- 241000700605 Viruses Species 0.000 description 9
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 9
- 108010050848 glycylleucine Proteins 0.000 description 9
- 238000000338 in vitro Methods 0.000 description 9
- 108010057821 leucylproline Proteins 0.000 description 9
- 239000013612 plasmid Substances 0.000 description 9
- 238000005406 washing Methods 0.000 description 9
- 102000053602 DNA Human genes 0.000 description 8
- 241000282412 Homo Species 0.000 description 8
- 241000880493 Leptailurus serval Species 0.000 description 8
- 230000000890 antigenic effect Effects 0.000 description 8
- 238000013459 approach Methods 0.000 description 8
- 108010013835 arginine glutamate Proteins 0.000 description 8
- 239000011159 matrix material Substances 0.000 description 8
- 230000035772 mutation Effects 0.000 description 8
- 239000000047 product Substances 0.000 description 8
- 230000005855 radiation Effects 0.000 description 8
- 238000010561 standard procedure Methods 0.000 description 8
- 108010049777 Ankyrins Proteins 0.000 description 7
- 102000008102 Ankyrins Human genes 0.000 description 7
- 102000014914 Carrier Proteins Human genes 0.000 description 7
- 108091060211 Expressed sequence tag Proteins 0.000 description 7
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 7
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 7
- 238000000636 Northern blotting Methods 0.000 description 7
- 108091027981 Response element Proteins 0.000 description 7
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 7
- 230000003321 amplification Effects 0.000 description 7
- 230000008901 benefit Effects 0.000 description 7
- 239000003795 chemical substances by application Substances 0.000 description 7
- 230000007423 decrease Effects 0.000 description 7
- 230000002950 deficient Effects 0.000 description 7
- 201000010099 disease Diseases 0.000 description 7
- 239000003446 ligand Substances 0.000 description 7
- 239000012528 membrane Substances 0.000 description 7
- 238000003199 nucleic acid amplification method Methods 0.000 description 7
- 239000002987 primer (paints) Substances 0.000 description 7
- 108010090894 prolylleucine Proteins 0.000 description 7
- 238000000746 purification Methods 0.000 description 7
- 238000001890 transfection Methods 0.000 description 7
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 6
- 101710163560 Lamina-associated polypeptide 2, isoform alpha Proteins 0.000 description 6
- 101710189385 Lamina-associated polypeptide 2, isoforms beta/gamma Proteins 0.000 description 6
- 101710097668 Leucine aminopeptidase 2 Proteins 0.000 description 6
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 6
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 6
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 6
- 206010028980 Neoplasm Diseases 0.000 description 6
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 6
- 241000700159 Rattus Species 0.000 description 6
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 6
- 239000002253 acid Substances 0.000 description 6
- 230000004913 activation Effects 0.000 description 6
- 210000000349 chromosome Anatomy 0.000 description 6
- 150000001875 compounds Chemical class 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- 229940088598 enzyme Drugs 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 230000001965 increasing effect Effects 0.000 description 6
- 238000002955 isolation Methods 0.000 description 6
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 6
- 108010054155 lysyllysine Proteins 0.000 description 6
- 210000004962 mammalian cell Anatomy 0.000 description 6
- 238000002703 mutagenesis Methods 0.000 description 6
- 231100000350 mutagenesis Toxicity 0.000 description 6
- 101710082686 probable leucine aminopeptidase 2 Proteins 0.000 description 6
- 238000012163 sequencing technique Methods 0.000 description 6
- 210000001550 testis Anatomy 0.000 description 6
- 230000014616 translation Effects 0.000 description 6
- 241000894006 Bacteria Species 0.000 description 5
- 241000588724 Escherichia coli Species 0.000 description 5
- 108091029865 Exogenous DNA Proteins 0.000 description 5
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 5
- 241000238631 Hexapoda Species 0.000 description 5
- 101000925840 Homo sapiens Emerin Proteins 0.000 description 5
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 5
- 108010052285 Membrane Proteins Proteins 0.000 description 5
- 241000699666 Mus <mouse, genus> Species 0.000 description 5
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 5
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 5
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 5
- 108010047495 alanylglycine Proteins 0.000 description 5
- 239000011324 bead Substances 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 5
- 229960002685 biotin Drugs 0.000 description 5
- 235000020958 biotin Nutrition 0.000 description 5
- 239000011616 biotin Substances 0.000 description 5
- 231100000433 cytotoxic Toxicity 0.000 description 5
- 230000001472 cytotoxic effect Effects 0.000 description 5
- 238000012217 deletion Methods 0.000 description 5
- 230000037430 deletion Effects 0.000 description 5
- 108010054813 diprotin B Proteins 0.000 description 5
- 230000004927 fusion Effects 0.000 description 5
- -1 guanidinium cations Chemical class 0.000 description 5
- 230000002163 immunogen Effects 0.000 description 5
- 230000003993 interaction Effects 0.000 description 5
- 229960003136 leucine Drugs 0.000 description 5
- 108010034529 leucyl-lysine Proteins 0.000 description 5
- 239000002502 liposome Substances 0.000 description 5
- 108010009298 lysylglutamic acid Proteins 0.000 description 5
- 108010051242 phenylalanylserine Proteins 0.000 description 5
- 108010026333 seryl-proline Proteins 0.000 description 5
- 239000011780 sodium chloride Substances 0.000 description 5
- 239000000758 substrate Substances 0.000 description 5
- 230000001131 transforming effect Effects 0.000 description 5
- 108010073969 valyllysine Proteins 0.000 description 5
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropanoic acid Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 108091026890 Coding region Proteins 0.000 description 4
- 241000196324 Embryophyta Species 0.000 description 4
- 101001047746 Homo sapiens Lamina-associated polypeptide 2, isoform alpha Proteins 0.000 description 4
- 101001047731 Homo sapiens Lamina-associated polypeptide 2, isoforms beta/gamma Proteins 0.000 description 4
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 4
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 4
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 4
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 4
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 4
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 4
- 241000124008 Mammalia Species 0.000 description 4
- 241000699670 Mus sp. Species 0.000 description 4
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 4
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 4
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 4
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 4
- 230000004075 alteration Effects 0.000 description 4
- 230000003171 anti-complementary effect Effects 0.000 description 4
- 239000000427 antigen Substances 0.000 description 4
- 108091007433 antigens Proteins 0.000 description 4
- 102000036639 antigens Human genes 0.000 description 4
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 4
- 108010062796 arginyllysine Proteins 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 201000011510 cancer Diseases 0.000 description 4
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 4
- 230000001086 cytosolic effect Effects 0.000 description 4
- 238000003745 diagnosis Methods 0.000 description 4
- 230000004069 differentiation Effects 0.000 description 4
- 238000009826 distribution Methods 0.000 description 4
- 239000000975 dye Substances 0.000 description 4
- 238000004520 electroporation Methods 0.000 description 4
- 108010078144 glutaminyl-glycine Proteins 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- 108010077515 glycylproline Proteins 0.000 description 4
- 230000012010 growth Effects 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 102000046104 human TMPO Human genes 0.000 description 4
- 238000007901 in situ hybridization Methods 0.000 description 4
- 238000011534 incubation Methods 0.000 description 4
- 239000003112 inhibitor Substances 0.000 description 4
- 230000005764 inhibitory process Effects 0.000 description 4
- 210000004185 liver Anatomy 0.000 description 4
- 238000013507 mapping Methods 0.000 description 4
- 229930182817 methionine Natural products 0.000 description 4
- 238000010369 molecular cloning Methods 0.000 description 4
- 230000007170 pathology Effects 0.000 description 4
- 230000037361 pathway Effects 0.000 description 4
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 230000002062 proliferating effect Effects 0.000 description 4
- 230000035755 proliferation Effects 0.000 description 4
- 108010077112 prolyl-proline Proteins 0.000 description 4
- 108010031719 prolyl-serine Proteins 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 229920005989 resin Polymers 0.000 description 4
- 239000011347 resin Substances 0.000 description 4
- 238000003757 reverse transcription PCR Methods 0.000 description 4
- 239000011734 sodium Substances 0.000 description 4
- 230000021595 spermatogenesis Effects 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 230000001225 therapeutic effect Effects 0.000 description 4
- 238000012384 transportation and delivery Methods 0.000 description 4
- 238000011282 treatment Methods 0.000 description 4
- 239000013603 viral vector Substances 0.000 description 4
- 229920000936 Agarose Polymers 0.000 description 3
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 3
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 3
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 3
- IPWKGIFRRBGCJO-IMJSIDKUSA-N Ala-Ser Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](CO)C([O-])=O IPWKGIFRRBGCJO-IMJSIDKUSA-N 0.000 description 3
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 3
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 3
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 3
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 3
- 239000004475 Arginine Substances 0.000 description 3
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 3
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 3
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 3
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 3
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 3
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 3
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 3
- 108010045403 Calcium-Binding Proteins Proteins 0.000 description 3
- 102000005701 Calcium-Binding Proteins Human genes 0.000 description 3
- 102100035882 Catalase Human genes 0.000 description 3
- 108010053835 Catalase Proteins 0.000 description 3
- IVOMOUWHDPKRLL-KQYNXXCUSA-N Cyclic adenosine monophosphate Chemical compound C([C@H]1O2)OP(O)(=O)O[C@H]1[C@@H](O)[C@@H]2N1C(N=CN=C2N)=C2N=C1 IVOMOUWHDPKRLL-KQYNXXCUSA-N 0.000 description 3
- 241000702421 Dependoparvovirus Species 0.000 description 3
- 229920002307 Dextran Polymers 0.000 description 3
- 101150029662 E1 gene Proteins 0.000 description 3
- 238000002965 ELISA Methods 0.000 description 3
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 3
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 3
- GURIQZQSTBBHRV-SRVKXCTJSA-N Gln-Lys-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GURIQZQSTBBHRV-SRVKXCTJSA-N 0.000 description 3
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 3
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 3
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 3
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 3
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 3
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 3
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 3
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 3
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 3
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 3
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 3
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 3
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 3
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 3
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 3
- BOFAFKVZQUMTID-AVGNSLFASA-N Leu-Gln-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BOFAFKVZQUMTID-AVGNSLFASA-N 0.000 description 3
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 3
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 3
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 3
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 3
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 3
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 3
- 102000018697 Membrane Proteins Human genes 0.000 description 3
- BQHLZUMZOXUWNU-DCAQKATOSA-N Met-Pro-Glu Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BQHLZUMZOXUWNU-DCAQKATOSA-N 0.000 description 3
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 3
- 108010079364 N-glycylalanine Proteins 0.000 description 3
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 3
- HPXVFFIIGOAQRV-DCAQKATOSA-N Pro-Arg-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O HPXVFFIIGOAQRV-DCAQKATOSA-N 0.000 description 3
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 3
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 3
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 3
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- 108700008625 Reporter Genes Proteins 0.000 description 3
- 108010083644 Ribonucleases Proteins 0.000 description 3
- 102000006382 Ribonucleases Human genes 0.000 description 3
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 3
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 3
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 3
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 3
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 3
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 3
- NVNPWELENFJOHH-CIUDSAMLSA-N Ser-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)N NVNPWELENFJOHH-CIUDSAMLSA-N 0.000 description 3
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 3
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 3
- 108020004682 Single-Stranded DNA Proteins 0.000 description 3
- 208000037065 Subacute sclerosing leukoencephalitis Diseases 0.000 description 3
- 206010042297 Subacute sclerosing panencephalitis Diseases 0.000 description 3
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 3
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 3
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 3
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 3
- 239000004473 Threonine Substances 0.000 description 3
- OBKOPLHSRDATFO-XHSDSOJGSA-N Tyr-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OBKOPLHSRDATFO-XHSDSOJGSA-N 0.000 description 3
- IVOMOUWHDPKRLL-UHFFFAOYSA-N UNPD107823 Natural products O1C2COP(O)(=O)OC2C(O)C1N1C(N=CN=C2N)=C2N=C1 IVOMOUWHDPKRLL-UHFFFAOYSA-N 0.000 description 3
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 3
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 3
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 3
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 3
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 3
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Chemical compound CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 3
- 230000002378 acidificating effect Effects 0.000 description 3
- 150000007513 acids Chemical class 0.000 description 3
- 238000007792 addition Methods 0.000 description 3
- 239000000556 agonist Substances 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 108010044940 alanylglutamine Proteins 0.000 description 3
- 108010087924 alanylproline Proteins 0.000 description 3
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 3
- 108010008355 arginyl-glutamine Proteins 0.000 description 3
- 108010077245 asparaginyl-proline Proteins 0.000 description 3
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 3
- 108010093581 aspartyl-proline Proteins 0.000 description 3
- 108010092854 aspartyllysine Proteins 0.000 description 3
- 239000012472 biological sample Substances 0.000 description 3
- 229940098773 bovine serum albumin Drugs 0.000 description 3
- 229910052799 carbon Inorganic materials 0.000 description 3
- 230000001413 cellular effect Effects 0.000 description 3
- 230000036755 cellular response Effects 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 238000007796 conventional method Methods 0.000 description 3
- 229940095074 cyclic amp Drugs 0.000 description 3
- 238000004925 denaturation Methods 0.000 description 3
- 230000036425 denaturation Effects 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000002255 enzymatic effect Effects 0.000 description 3
- 235000020776 essential amino acid Nutrition 0.000 description 3
- 239000003797 essential amino acid Substances 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 3
- 238000001502 gel electrophoresis Methods 0.000 description 3
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 3
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 3
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 3
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 3
- 239000001963 growth medium Substances 0.000 description 3
- 210000002064 heart cell Anatomy 0.000 description 3
- 208000019622 heart disease Diseases 0.000 description 3
- 238000002744 homologous recombination Methods 0.000 description 3
- 230000006801 homologous recombination Effects 0.000 description 3
- 239000005556 hormone Substances 0.000 description 3
- 229940088597 hormone Drugs 0.000 description 3
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 3
- 230000028993 immune response Effects 0.000 description 3
- 230000002401 inhibitory effect Effects 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 230000004807 localization Effects 0.000 description 3
- 229920002521 macromolecule Polymers 0.000 description 3
- 230000005291 magnetic effect Effects 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 229910021645 metal ion Inorganic materials 0.000 description 3
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 3
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 3
- 210000003205 muscle Anatomy 0.000 description 3
- 201000006938 muscular dystrophy Diseases 0.000 description 3
- 210000000633 nuclear envelope Anatomy 0.000 description 3
- 235000015097 nutrients Nutrition 0.000 description 3
- 210000000056 organ Anatomy 0.000 description 3
- 230000036961 partial effect Effects 0.000 description 3
- 210000001322 periplasm Anatomy 0.000 description 3
- 238000002823 phage display Methods 0.000 description 3
- 229920000642 polymer Polymers 0.000 description 3
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 3
- 108010053725 prolylvaline Proteins 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 230000028327 secretion Effects 0.000 description 3
- 238000002741 site-directed mutagenesis Methods 0.000 description 3
- 230000000638 stimulation Effects 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 108010001055 thymocartin Proteins 0.000 description 3
- 239000003053 toxin Substances 0.000 description 3
- 231100000765 toxin Toxicity 0.000 description 3
- 108700012359 toxins Proteins 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 108010003137 tyrosyltyrosine Proteins 0.000 description 3
- 108700026220 vif Genes Proteins 0.000 description 3
- 238000001262 western blot Methods 0.000 description 3
- DFZVZEMNPGABKO-ZETCQYMHSA-N (2s)-2-amino-3-pyridin-3-ylpropanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=CN=C1 DFZVZEMNPGABKO-ZETCQYMHSA-N 0.000 description 2
- FQFVANSXYKWQOT-ZETCQYMHSA-N (2s)-2-azaniumyl-3-pyridin-4-ylpropanoate Chemical compound OC(=O)[C@@H](N)CC1=CC=NC=C1 FQFVANSXYKWQOT-ZETCQYMHSA-N 0.000 description 2
- XWHHYOYVRVGJJY-QMMMGPOBSA-N 4-fluoro-L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(F)C=C1 XWHHYOYVRVGJJY-QMMMGPOBSA-N 0.000 description 2
- 229930024421 Adenine Natural products 0.000 description 2
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 2
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 2
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 2
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 2
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 2
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 description 2
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 2
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 2
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 2
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 2
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 2
- 101100437119 Arabidopsis thaliana AUG2 gene Proteins 0.000 description 2
- USNSOPDIZILSJP-FXQIFTODSA-N Arg-Asn-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O USNSOPDIZILSJP-FXQIFTODSA-N 0.000 description 2
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 2
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 2
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 2
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 2
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 2
- HPSVTWMFWCHKFN-GARJFASQSA-N Arg-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O HPSVTWMFWCHKFN-GARJFASQSA-N 0.000 description 2
- JTZUZBADHGISJD-SRVKXCTJSA-N Arg-His-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JTZUZBADHGISJD-SRVKXCTJSA-N 0.000 description 2
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 2
- FNXCAFKDGBROCU-STECZYCISA-N Arg-Ile-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FNXCAFKDGBROCU-STECZYCISA-N 0.000 description 2
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 2
- NOZYDJOPOGKUSR-AVGNSLFASA-N Arg-Leu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O NOZYDJOPOGKUSR-AVGNSLFASA-N 0.000 description 2
- IIAXFBUTKIDDIP-ULQDDVLXSA-N Arg-Leu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IIAXFBUTKIDDIP-ULQDDVLXSA-N 0.000 description 2
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 2
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 2
- JOADBFCFJGNIKF-GUBZILKMSA-N Arg-Met-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O JOADBFCFJGNIKF-GUBZILKMSA-N 0.000 description 2
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 2
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 2
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 2
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 2
- CIBWFJFMOBIFTE-CIUDSAMLSA-N Asn-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N CIBWFJFMOBIFTE-CIUDSAMLSA-N 0.000 description 2
- HCAUEJAQCXVQQM-ACZMJKKPSA-N Asn-Glu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HCAUEJAQCXVQQM-ACZMJKKPSA-N 0.000 description 2
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 2
- JXMREEPBRANWBY-VEVYYDQMSA-N Asn-Thr-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JXMREEPBRANWBY-VEVYYDQMSA-N 0.000 description 2
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 2
- SOYOSFXLXYZNRG-CIUDSAMLSA-N Asp-Arg-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O SOYOSFXLXYZNRG-CIUDSAMLSA-N 0.000 description 2
- VHWNKSJHQFZJTH-FXQIFTODSA-N Asp-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N VHWNKSJHQFZJTH-FXQIFTODSA-N 0.000 description 2
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 2
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 2
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 2
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 2
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 2
- QTIZKMMLNUMHHU-DCAQKATOSA-N Asp-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QTIZKMMLNUMHHU-DCAQKATOSA-N 0.000 description 2
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 2
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 2
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 2
- 241000972773 Aulopiformes Species 0.000 description 2
- 241000271566 Aves Species 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- 208000002061 Cardiac Conduction System Disease Diseases 0.000 description 2
- 208000020446 Cardiac disease Diseases 0.000 description 2
- 108700010070 Codon Usage Proteins 0.000 description 2
- 208000035473 Communicable disease Diseases 0.000 description 2
- 108050006400 Cyclin Proteins 0.000 description 2
- SHIBSTMRCDJXLN-UHFFFAOYSA-N Digoxigenin Natural products C1CC(C2C(C3(C)CCC(O)CC3CC2)CC2O)(O)C2(C)C1C1=CC(=O)OC1 SHIBSTMRCDJXLN-UHFFFAOYSA-N 0.000 description 2
- AOJJSUZBOXZQNB-TZSSRYMLSA-N Doxorubicin Chemical compound O([C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(=O)CO)[C@H]1C[C@H](N)[C@H](O)[C@H](C)O1 AOJJSUZBOXZQNB-TZSSRYMLSA-N 0.000 description 2
- 206010059866 Drug resistance Diseases 0.000 description 2
- 201000009344 Emery-Dreifuss muscular dystrophy Diseases 0.000 description 2
- 108010067193 Formaldehyde transketolase Proteins 0.000 description 2
- 108090000698 Formate Dehydrogenases Proteins 0.000 description 2
- 108010046649 GDNP peptide Proteins 0.000 description 2
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 2
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 2
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 2
- XFKUFUJECJUQTQ-CIUDSAMLSA-N Gln-Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XFKUFUJECJUQTQ-CIUDSAMLSA-N 0.000 description 2
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 2
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 2
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 2
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 2
- QENSHQJGWGRPQS-QEJZJMRPSA-N Gln-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)N)C(O)=O)=CNC2=C1 QENSHQJGWGRPQS-QEJZJMRPSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 2
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 2
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 2
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 2
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 2
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 2
- XEKAJTCACGEBOK-KKUMJFAQSA-N Glu-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XEKAJTCACGEBOK-KKUMJFAQSA-N 0.000 description 2
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 2
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 2
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 2
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 2
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 2
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 2
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 2
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 2
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 2
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 2
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 2
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 2
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- ZRALSGWEFCBTJO-UHFFFAOYSA-N Guanidine Chemical compound NC(N)=N ZRALSGWEFCBTJO-UHFFFAOYSA-N 0.000 description 2
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 2
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 2
- FHKZHRMERJUXRJ-DCAQKATOSA-N His-Ser-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 FHKZHRMERJUXRJ-DCAQKATOSA-N 0.000 description 2
- 241000700588 Human alphaherpesvirus 1 Species 0.000 description 2
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 2
- LDRALPZEVHVXEK-KBIXCLLPSA-N Ile-Cys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N LDRALPZEVHVXEK-KBIXCLLPSA-N 0.000 description 2
- YBHKCXNNNVDYEB-SPOWBLRKSA-N Ile-Trp-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N YBHKCXNNNVDYEB-SPOWBLRKSA-N 0.000 description 2
- 241000235058 Komagataella pastoris Species 0.000 description 2
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- VYZAGTDAHUIRQA-WHFBIAKZSA-N L-alanyl-L-glutamic acid Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O VYZAGTDAHUIRQA-WHFBIAKZSA-N 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 2
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 2
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 2
- LJKJVTCIRDCITR-SRVKXCTJSA-N Leu-Cys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LJKJVTCIRDCITR-SRVKXCTJSA-N 0.000 description 2
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 2
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 2
- AOFYPTOHESIBFZ-KKUMJFAQSA-N Leu-His-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O AOFYPTOHESIBFZ-KKUMJFAQSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 2
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 2
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 2
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 2
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 2
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 2
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 2
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 2
- 206010025323 Lymphomas Diseases 0.000 description 2
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 2
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 2
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 2
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 2
- PFZWARWVRNTPBR-IHPCNDPISA-N Lys-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N PFZWARWVRNTPBR-IHPCNDPISA-N 0.000 description 2
- WWEWGPOLIJXGNX-XUXIUFHCSA-N Lys-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N WWEWGPOLIJXGNX-XUXIUFHCSA-N 0.000 description 2
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 2
- DNWBUCHHMRQWCZ-GUBZILKMSA-N Lys-Ser-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DNWBUCHHMRQWCZ-GUBZILKMSA-N 0.000 description 2
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 2
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 2
- KYJHWKAMFISDJE-RCWTZXSCSA-N Met-Thr-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCSC KYJHWKAMFISDJE-RCWTZXSCSA-N 0.000 description 2
- 108091092878 Microsatellite Proteins 0.000 description 2
- 108091061960 Naked DNA Proteins 0.000 description 2
- 108020004485 Nonsense Codon Proteins 0.000 description 2
- 108091060545 Nonsense suppressor Proteins 0.000 description 2
- 102000007999 Nuclear Proteins Human genes 0.000 description 2
- 108010089610 Nuclear Proteins Proteins 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 108010067902 Peptide Library Proteins 0.000 description 2
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 2
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 2
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 2
- NHHZWPNMYQUNEH-ACRUOGEOSA-N Phe-Tyr-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N NHHZWPNMYQUNEH-ACRUOGEOSA-N 0.000 description 2
- 101710182846 Polyhedrin Proteins 0.000 description 2
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 2
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 2
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 2
- AQSMZTIEJMZQEC-DCAQKATOSA-N Pro-His-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O AQSMZTIEJMZQEC-DCAQKATOSA-N 0.000 description 2
- ZKQOUHVVXABNDG-IUCAKERBSA-N Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 ZKQOUHVVXABNDG-IUCAKERBSA-N 0.000 description 2
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 2
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 2
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 2
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 2
- FZXSYIPVAFVYBH-KKUMJFAQSA-N Pro-Tyr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O FZXSYIPVAFVYBH-KKUMJFAQSA-N 0.000 description 2
- AWJGUZSYVIVZGP-YUMQZZPRSA-N Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 AWJGUZSYVIVZGP-YUMQZZPRSA-N 0.000 description 2
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 2
- 102000009339 Proliferating Cell Nuclear Antigen Human genes 0.000 description 2
- 108091034057 RNA (poly(A)) Proteins 0.000 description 2
- 108020004518 RNA Probes Proteins 0.000 description 2
- 239000003391 RNA probe Substances 0.000 description 2
- 101100342721 Rattus norvegicus Tmpo gene Proteins 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 2
- LTFSLKWFMWZEBD-IMJSIDKUSA-N Ser-Asn Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O LTFSLKWFMWZEBD-IMJSIDKUSA-N 0.000 description 2
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 2
- BLPYXIXXCFVIIF-FXQIFTODSA-N Ser-Cys-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N)CN=C(N)N BLPYXIXXCFVIIF-FXQIFTODSA-N 0.000 description 2
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 2
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 2
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 2
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 2
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 2
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 2
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 2
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 2
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 2
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 2
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 2
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 2
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 2
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 2
- 108700025832 Serum Response Element Proteins 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- 241000700584 Simplexvirus Species 0.000 description 2
- 241000256251 Spodoptera frugiperda Species 0.000 description 2
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 2
- JHBHMCMKSPXRHV-NUMRIWBASA-N Thr-Asn-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JHBHMCMKSPXRHV-NUMRIWBASA-N 0.000 description 2
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 2
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 2
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- 108020004566 Transfer RNA Proteins 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 2
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 2
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 2
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 2
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 2
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 2
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 2
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 2
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 2
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 2
- 108020005202 Viral DNA Proteins 0.000 description 2
- 229960000643 adenine Drugs 0.000 description 2
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 2
- 239000002671 adjuvant Substances 0.000 description 2
- 238000001261 affinity purification Methods 0.000 description 2
- 235000004279 alanine Nutrition 0.000 description 2
- 229960003767 alanine Drugs 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 125000003282 alkyl amino group Chemical group 0.000 description 2
- 238000010171 animal model Methods 0.000 description 2
- 230000000692 anti-sense effect Effects 0.000 description 2
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 2
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 229960001230 asparagine Drugs 0.000 description 2
- 235000009582 asparagine Nutrition 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 230000002238 attenuated effect Effects 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 230000000975 bioactive effect Effects 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 238000009395 breeding Methods 0.000 description 2
- 230000001488 breeding effect Effects 0.000 description 2
- 239000007975 buffered saline Substances 0.000 description 2
- 125000000484 butyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 2
- AIYUHDOJVYHVIT-UHFFFAOYSA-M caesium chloride Chemical compound [Cl-].[Cs+] AIYUHDOJVYHVIT-UHFFFAOYSA-M 0.000 description 2
- 239000001506 calcium phosphate Substances 0.000 description 2
- 229910000389 calcium phosphate Inorganic materials 0.000 description 2
- 235000011010 calcium phosphates Nutrition 0.000 description 2
- 239000004202 carbamide Substances 0.000 description 2
- 125000000837 carbohydrate group Chemical group 0.000 description 2
- 150000001720 carbohydrates Chemical class 0.000 description 2
- 210000004413 cardiac myocyte Anatomy 0.000 description 2
- 210000000170 cell membrane Anatomy 0.000 description 2
- 230000004640 cellular pathway Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000007385 chemical modification Methods 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 239000012501 chromatography medium Substances 0.000 description 2
- 230000002759 chromosomal effect Effects 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 238000005859 coupling reaction Methods 0.000 description 2
- 230000001351 cycling effect Effects 0.000 description 2
- 108010069495 cysteinyltyrosine Proteins 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 230000007547 defect Effects 0.000 description 2
- 230000007812 deficiency Effects 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 230000000368 destabilizing effect Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 229960002086 dextran Drugs 0.000 description 2
- 238000002405 diagnostic procedure Methods 0.000 description 2
- 238000000502 dialysis Methods 0.000 description 2
- QONQRTHLHBTMGP-UHFFFAOYSA-N digitoxigenin Natural products CC12CCC(C3(CCC(O)CC3CC3)C)C3C11OC1CC2C1=CC(=O)OC1 QONQRTHLHBTMGP-UHFFFAOYSA-N 0.000 description 2
- SHIBSTMRCDJXLN-KCZCNTNESA-N digoxigenin Chemical compound C1([C@@H]2[C@@]3([C@@](CC2)(O)[C@H]2[C@@H]([C@@]4(C)CC[C@H](O)C[C@H]4CC2)C[C@H]3O)C)=CC(=O)OC1 SHIBSTMRCDJXLN-KCZCNTNESA-N 0.000 description 2
- 208000035475 disorder Diseases 0.000 description 2
- 239000012636 effector Substances 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 2
- 238000009472 formulation Methods 0.000 description 2
- 238000005194 fractionation Methods 0.000 description 2
- 230000002538 fungal effect Effects 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 238000001415 gene therapy Methods 0.000 description 2
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 2
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 2
- 239000008187 granular material Substances 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 230000004217 heart function Effects 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 108010044853 histidine-rich proteins Proteins 0.000 description 2
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 210000003917 human chromosome Anatomy 0.000 description 2
- 230000007062 hydrolysis Effects 0.000 description 2
- 238000006460 hydrolysis reaction Methods 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 230000003053 immunization Effects 0.000 description 2
- 238000002649 immunization Methods 0.000 description 2
- 238000010348 incorporation Methods 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 238000001990 intravenous administration Methods 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 2
- 238000007834 ligase chain reaction Methods 0.000 description 2
- 150000002632 lipids Chemical class 0.000 description 2
- 238000001638 lipofection Methods 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 239000006249 magnetic particle Substances 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 238000000520 microinjection Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 239000003068 molecular probe Substances 0.000 description 2
- 201000000585 muscular atrophy Diseases 0.000 description 2
- 230000003472 neutralizing effect Effects 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 230000037434 nonsense mutation Effects 0.000 description 2
- 210000004940 nucleus Anatomy 0.000 description 2
- 230000005298 paramagnetic effect Effects 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 2
- 108010025488 pinealon Proteins 0.000 description 2
- 229920002401 polyacrylamide Polymers 0.000 description 2
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 238000007894 restriction fragment length polymorphism technique Methods 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 230000033764 rhythmic process Effects 0.000 description 2
- 235000019515 salmon Nutrition 0.000 description 2
- FSYKKLYZXJSNPZ-UHFFFAOYSA-N sarcosine Chemical compound C[NH2+]CC([O-])=O FSYKKLYZXJSNPZ-UHFFFAOYSA-N 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 230000019491 signal transduction Effects 0.000 description 2
- 210000002027 skeletal muscle Anatomy 0.000 description 2
- 239000001509 sodium citrate Substances 0.000 description 2
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 238000010532 solid phase synthesis reaction Methods 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 230000009870 specific binding Effects 0.000 description 2
- 238000010186 staining Methods 0.000 description 2
- 125000001424 substituent group Chemical group 0.000 description 2
- 239000011593 sulfur Substances 0.000 description 2
- 229910052717 sulfur Inorganic materials 0.000 description 2
- 230000008685 targeting Effects 0.000 description 2
- 230000002381 testicular Effects 0.000 description 2
- 125000003396 thiol group Chemical group [H]S* 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 2
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- 229960004295 valine Drugs 0.000 description 2
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 2
- 239000003981 vehicle Substances 0.000 description 2
- 230000003612 virological effect Effects 0.000 description 2
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 1
- UJXJZOCXEZPHIE-YFKPBYRVSA-N (2s)-2-(2-hydroxyethylamino)-4-sulfanylbutanoic acid Chemical compound OCCN[C@H](C(O)=O)CCS UJXJZOCXEZPHIE-YFKPBYRVSA-N 0.000 description 1
- CNMAQBJBWQQZFZ-LURJTMIESA-N (2s)-2-(pyridin-2-ylamino)propanoic acid Chemical compound OC(=O)[C@H](C)NC1=CC=CC=N1 CNMAQBJBWQQZFZ-LURJTMIESA-N 0.000 description 1
- KUHSEZKIEJYEHN-BXRBKJIMSA-N (2s)-2-amino-3-hydroxypropanoic acid;(2s)-2-aminopropanoic acid Chemical compound C[C@H](N)C(O)=O.OC[C@H](N)C(O)=O KUHSEZKIEJYEHN-BXRBKJIMSA-N 0.000 description 1
- PDRJLZDUOULRHE-ZETCQYMHSA-N (2s)-2-amino-3-pyridin-2-ylpropanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=N1 PDRJLZDUOULRHE-ZETCQYMHSA-N 0.000 description 1
- JQFLYFRHDIHZFZ-RXMQYKEDSA-N (2s)-3,3-dimethylpyrrolidine-2-carboxylic acid Chemical compound CC1(C)CCN[C@@H]1C(O)=O JQFLYFRHDIHZFZ-RXMQYKEDSA-N 0.000 description 1
- CNPSFBUUYIVHAP-AKGZTFGVSA-N (2s)-3-methylpyrrolidine-2-carboxylic acid Chemical compound CC1CCN[C@@H]1C(O)=O CNPSFBUUYIVHAP-AKGZTFGVSA-N 0.000 description 1
- FXGZFWDCXQRZKI-VKHMYHEASA-N (2s)-5-amino-2-nitramido-5-oxopentanoic acid Chemical compound NC(=O)CC[C@@H](C(O)=O)N[N+]([O-])=O FXGZFWDCXQRZKI-VKHMYHEASA-N 0.000 description 1
- CCAIIPMIAFGKSI-DMTCNVIQSA-N (2s,3r)-3-hydroxy-2-(methylazaniumyl)butanoate Chemical compound CN[C@@H]([C@@H](C)O)C(O)=O CCAIIPMIAFGKSI-DMTCNVIQSA-N 0.000 description 1
- HZKLCOYAVAAQRD-VGMNWLOBSA-N (3s)-3-[[2-[[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]amino]acetyl]amino]-4-[[(1r)-1-carboxyethyl]amino]-4-oxobutanoic acid Chemical compound OC(=O)[C@@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N HZKLCOYAVAAQRD-VGMNWLOBSA-N 0.000 description 1
- 101150084750 1 gene Proteins 0.000 description 1
- PNDPGZBMCMUPRI-HVTJNCQCSA-N 10043-66-0 Chemical compound [131I][131I] PNDPGZBMCMUPRI-HVTJNCQCSA-N 0.000 description 1
- WUAPFZMCVAUBPE-NJFSPNSNSA-N 188Re Chemical compound [188Re] WUAPFZMCVAUBPE-NJFSPNSNSA-N 0.000 description 1
- CKTSBUTUHBMZGZ-SHYZEUOFSA-N 2'‐deoxycytidine Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 CKTSBUTUHBMZGZ-SHYZEUOFSA-N 0.000 description 1
- OMGHIGVFLOPEHJ-UHFFFAOYSA-N 2,5-dihydro-1h-pyrrol-1-ium-2-carboxylate Chemical compound OC(=O)C1NCC=C1 OMGHIGVFLOPEHJ-UHFFFAOYSA-N 0.000 description 1
- WGDNWOMKBUXFHR-UHFFFAOYSA-N 2-[[2-(2-aminopropanoylamino)acetyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound CC(N)C(=O)NCC(=O)NC(C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-UHFFFAOYSA-N 0.000 description 1
- BFSVOASYOCHEOV-UHFFFAOYSA-N 2-diethylaminoethanol Chemical compound CCN(CC)CCO BFSVOASYOCHEOV-UHFFFAOYSA-N 0.000 description 1
- XEVFXAFXZZYFSX-UHFFFAOYSA-N 3-azabicyclo[2.1.1]hexane-4-carboxylic acid Chemical compound C1C2CC1(C(=O)O)NC2 XEVFXAFXZZYFSX-UHFFFAOYSA-N 0.000 description 1
- GUPXYSSGJWIURR-UHFFFAOYSA-N 3-octoxypropane-1,2-diol Chemical compound CCCCCCCCOCC(O)CO GUPXYSSGJWIURR-UHFFFAOYSA-N 0.000 description 1
- WOVKYSAHUYNSMH-RRKCRQDMSA-N 5-bromodeoxyuridine Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(Br)=C1 WOVKYSAHUYNSMH-RRKCRQDMSA-N 0.000 description 1
- LUCHPKXVUGJYGU-XLPZGREQSA-N 5-methyl-2'-deoxycytidine Chemical compound O=C1N=C(N)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 LUCHPKXVUGJYGU-XLPZGREQSA-N 0.000 description 1
- 101150096273 ADE2 gene Proteins 0.000 description 1
- 108010066676 Abrin Proteins 0.000 description 1
- 241000228431 Acremonium chrysogenum Species 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 241000589156 Agrobacterium rhizogenes Species 0.000 description 1
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- XAGIMRPOEJSYER-CIUDSAMLSA-N Ala-Cys-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XAGIMRPOEJSYER-CIUDSAMLSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 1
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 1
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- WPWUFUBLGADILS-WDSKDSINSA-N Ala-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WPWUFUBLGADILS-WDSKDSINSA-N 0.000 description 1
- AUFACLFHBAGZEN-ZLUOBGJFSA-N Ala-Ser-Cys Chemical compound N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O AUFACLFHBAGZEN-ZLUOBGJFSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 1
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 1
- 102100024321 Alkaline phosphatase, placental type Human genes 0.000 description 1
- 108700028369 Alleles Proteins 0.000 description 1
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 1
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 1
- YUIGJDNAGKJLDO-JYJNAYRXSA-N Arg-Arg-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YUIGJDNAGKJLDO-JYJNAYRXSA-N 0.000 description 1
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 1
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 1
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 1
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 1
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 1
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 1
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- KZXPVYVSHUJCEO-ULQDDVLXSA-N Arg-Phe-Lys Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 KZXPVYVSHUJCEO-ULQDDVLXSA-N 0.000 description 1
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 1
- JJIBHAOBNIFUEL-SRVKXCTJSA-N Arg-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCN=C(N)N)N JJIBHAOBNIFUEL-SRVKXCTJSA-N 0.000 description 1
- IJYZHIOOBGIINM-WDSKDSINSA-N Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N IJYZHIOOBGIINM-WDSKDSINSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- LYJXHXGPWDTLKW-HJGDQZAQSA-N Arg-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O LYJXHXGPWDTLKW-HJGDQZAQSA-N 0.000 description 1
- JBQORRNSZGTLCV-WDSOQIARSA-N Arg-Trp-Lys Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 JBQORRNSZGTLCV-WDSOQIARSA-N 0.000 description 1
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 1
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 1
- DAQIJMOLTMGJLO-YUMQZZPRSA-N Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCNC(N)=N DAQIJMOLTMGJLO-YUMQZZPRSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- WVCJSDCHTUTONA-FXQIFTODSA-N Asn-Asp-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WVCJSDCHTUTONA-FXQIFTODSA-N 0.000 description 1
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 1
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 1
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- GWNMUVANAWDZTI-YUMQZZPRSA-N Asn-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GWNMUVANAWDZTI-YUMQZZPRSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- FVKHEKVYFTZWDX-GHCJXIJMSA-N Asn-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N FVKHEKVYFTZWDX-GHCJXIJMSA-N 0.000 description 1
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 1
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 1
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 1
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 1
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 1
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 1
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 1
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 1
- SYZWMVSXBZCOBZ-QXEWZRGKSA-N Asn-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N SYZWMVSXBZCOBZ-QXEWZRGKSA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- ICAYWNTWHRRAQP-FXQIFTODSA-N Asp-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N ICAYWNTWHRRAQP-FXQIFTODSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- ILJQISGMGXRZQQ-IHRRRGAJSA-N Asp-Arg-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ILJQISGMGXRZQQ-IHRRRGAJSA-N 0.000 description 1
- VGRHZPNRCLAHQA-IMJSIDKUSA-N Asp-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O VGRHZPNRCLAHQA-IMJSIDKUSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 1
- KIJLEFNHWSXHRU-NUMRIWBASA-N Asp-Gln-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KIJLEFNHWSXHRU-NUMRIWBASA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- FQHBAQLBIXLWAG-DCAQKATOSA-N Asp-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N FQHBAQLBIXLWAG-DCAQKATOSA-N 0.000 description 1
- WQSXAPPYLGNMQL-IHRRRGAJSA-N Asp-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N WQSXAPPYLGNMQL-IHRRRGAJSA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 1
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 1
- 241000228212 Aspergillus Species 0.000 description 1
- 241001203868 Autographa californica Species 0.000 description 1
- 208000023275 Autoimmune disease Diseases 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 231100000699 Bacterial toxin Toxicity 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- 208000011691 Burkitt lymphomas Diseases 0.000 description 1
- 102100021935 C-C motif chemokine 26 Human genes 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- BHPQYMZQTOCNFJ-UHFFFAOYSA-N Calcium cation Chemical compound [Ca+2] BHPQYMZQTOCNFJ-UHFFFAOYSA-N 0.000 description 1
- 101710106622 Calcium-binding protein LPS1-beta Proteins 0.000 description 1
- 241000222128 Candida maltosa Species 0.000 description 1
- 241000282465 Canis Species 0.000 description 1
- 241000282461 Canis lupus Species 0.000 description 1
- 241000282472 Canis lupus familiaris Species 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 241000218645 Cedrus Species 0.000 description 1
- 102000000844 Cell Surface Receptors Human genes 0.000 description 1
- 108010001857 Cell Surface Receptors Proteins 0.000 description 1
- 108010077544 Chromatin Proteins 0.000 description 1
- 208000031404 Chromosome Aberrations Diseases 0.000 description 1
- 101100007328 Cocos nucifera COS-1 gene Proteins 0.000 description 1
- 108020004394 Complementary RNA Proteins 0.000 description 1
- 241000557626 Corvus corax Species 0.000 description 1
- 241001481833 Coryphaena hippurus Species 0.000 description 1
- 102000004420 Creatine Kinase Human genes 0.000 description 1
- 108010042126 Creatine kinase Proteins 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 1
- QFMCHXSGIZPBKG-ZLUOBGJFSA-N Cys-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N QFMCHXSGIZPBKG-ZLUOBGJFSA-N 0.000 description 1
- UKVGHFORADMBEN-GUBZILKMSA-N Cys-Arg-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UKVGHFORADMBEN-GUBZILKMSA-N 0.000 description 1
- XXDLUZLKHOVPNW-IHRRRGAJSA-N Cys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)O XXDLUZLKHOVPNW-IHRRRGAJSA-N 0.000 description 1
- UISYPAHPLXGLNH-ACZMJKKPSA-N Cys-Asn-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UISYPAHPLXGLNH-ACZMJKKPSA-N 0.000 description 1
- XRTISHJEPHMBJG-SRVKXCTJSA-N Cys-Asp-Tyr Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XRTISHJEPHMBJG-SRVKXCTJSA-N 0.000 description 1
- YUZPQIQWXLRFBW-ACZMJKKPSA-N Cys-Glu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O YUZPQIQWXLRFBW-ACZMJKKPSA-N 0.000 description 1
- NLDWTJBJFVWBDQ-KKUMJFAQSA-N Cys-Lys-Phe Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NLDWTJBJFVWBDQ-KKUMJFAQSA-N 0.000 description 1
- YYLBXQJGWOQZOU-IHRRRGAJSA-N Cys-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N YYLBXQJGWOQZOU-IHRRRGAJSA-N 0.000 description 1
- XBELMDARIGXDKY-GUBZILKMSA-N Cys-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CS)N XBELMDARIGXDKY-GUBZILKMSA-N 0.000 description 1
- CMYVIUWVYHOLRD-ZLUOBGJFSA-N Cys-Ser-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CMYVIUWVYHOLRD-ZLUOBGJFSA-N 0.000 description 1
- NDNZRWUDUMTITL-FXQIFTODSA-N Cys-Ser-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NDNZRWUDUMTITL-FXQIFTODSA-N 0.000 description 1
- 241000701022 Cytomegalovirus Species 0.000 description 1
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 108020003215 DNA Probes Proteins 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 239000003298 DNA probe Substances 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 206010011878 Deafness Diseases 0.000 description 1
- CKTSBUTUHBMZGZ-UHFFFAOYSA-N Deoxycytidine Natural products O=C1N=C(N)C=CN1C1OC(CO)C(O)C1 CKTSBUTUHBMZGZ-UHFFFAOYSA-N 0.000 description 1
- 102000016607 Diphtheria Toxin Human genes 0.000 description 1
- 108010053187 Diphtheria Toxin Proteins 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 241000257465 Echinoidea Species 0.000 description 1
- 102100031780 Endonuclease Human genes 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 244000148064 Enicostema verticillatum Species 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 241000283086 Equidae Species 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- 101700035123 Erbin Proteins 0.000 description 1
- 108010075944 Erythropoietin Receptors Proteins 0.000 description 1
- 102100036509 Erythropoietin receptor Human genes 0.000 description 1
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 1
- 108010074860 Factor Xa Proteins 0.000 description 1
- 241000282324 Felis Species 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- 206010064571 Gene mutation Diseases 0.000 description 1
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 1
- FAQVCWVVIYYWRR-WHFBIAKZSA-N Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O FAQVCWVVIYYWRR-WHFBIAKZSA-N 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- MWLYSLMKFXWZPW-ZPFDUUQYSA-N Gln-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCC(N)=O MWLYSLMKFXWZPW-ZPFDUUQYSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 1
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 1
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 1
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- QZQYITIKPAUDGN-GVXVVHGQSA-N Gln-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QZQYITIKPAUDGN-GVXVVHGQSA-N 0.000 description 1
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- MPZWMIIOPAPAKE-BQBZGAKWSA-N Glu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N MPZWMIIOPAPAKE-BQBZGAKWSA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- MLCPTRRNICEKIS-FXQIFTODSA-N Glu-Asn-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLCPTRRNICEKIS-FXQIFTODSA-N 0.000 description 1
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 1
- FYYSIASRLDJUNP-WHFBIAKZSA-N Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O FYYSIASRLDJUNP-WHFBIAKZSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- ISXJHXGYMJKXOI-GUBZILKMSA-N Glu-Cys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O ISXJHXGYMJKXOI-GUBZILKMSA-N 0.000 description 1
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 1
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 1
- QYPKJXSMLMREKF-BPUTZDHNSA-N Glu-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N QYPKJXSMLMREKF-BPUTZDHNSA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- YBAFDPFAUTYYRW-YUMQZZPRSA-N Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O YBAFDPFAUTYYRW-YUMQZZPRSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- JJSVALISDCNFCU-SZMVWBNQSA-N Glu-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JJSVALISDCNFCU-SZMVWBNQSA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- HQOGXFLBAKJUMH-CIUDSAMLSA-N Glu-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N HQOGXFLBAKJUMH-CIUDSAMLSA-N 0.000 description 1
- XMBSYZWANAQXEV-QWRGUYRKSA-N Glu-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-QWRGUYRKSA-N 0.000 description 1
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- UQHGAYSULGRWRG-WHFBIAKZSA-N Glu-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(O)=O UQHGAYSULGRWRG-WHFBIAKZSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- QLNKFGTZOBVMCS-JBACZVJFSA-N Glu-Tyr-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QLNKFGTZOBVMCS-JBACZVJFSA-N 0.000 description 1
- KCCNSVHJSMMGFS-NRPADANISA-N Glu-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KCCNSVHJSMMGFS-NRPADANISA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- 108010024636 Glutathione Proteins 0.000 description 1
- 108010053070 Glutathione Disulfide Proteins 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- YZACQYVWLCQWBT-BQBZGAKWSA-N Gly-Cys-Arg Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YZACQYVWLCQWBT-BQBZGAKWSA-N 0.000 description 1
- SABZDFAAOJATBR-QWRGUYRKSA-N Gly-Cys-Phe Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SABZDFAAOJATBR-QWRGUYRKSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- YNIMVVJTPWCUJH-KBPBESRZSA-N Gly-His-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YNIMVVJTPWCUJH-KBPBESRZSA-N 0.000 description 1
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 1
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- OLIFSFOFKGKIRH-WUJLRWPWSA-N Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CN OLIFSFOFKGKIRH-WUJLRWPWSA-N 0.000 description 1
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 1
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 1
- DKJWUIYLMLUBDX-XPUUQOCRSA-N Gly-Val-Cys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O DKJWUIYLMLUBDX-XPUUQOCRSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- VPZXBVLAVMBEQI-VKHMYHEASA-N Glycyl-alanine Chemical compound OC(=O)[C@H](C)NC(=O)CN VPZXBVLAVMBEQI-VKHMYHEASA-N 0.000 description 1
- 108010054017 Granulocyte Colony-Stimulating Factor Receptors Proteins 0.000 description 1
- 102100039622 Granulocyte colony-stimulating factor receptor Human genes 0.000 description 1
- 108010092372 Granulocyte-Macrophage Colony-Stimulating Factor Receptors Proteins 0.000 description 1
- 102000016355 Granulocyte-Macrophage Colony-Stimulating Factor Receptors Human genes 0.000 description 1
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 1
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 1
- 102100020948 Growth hormone receptor Human genes 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- HTTJABKRGRZYRN-UHFFFAOYSA-N Heparin Chemical compound OC1C(NC(=O)C)C(O)OC(COS(O)(=O)=O)C1OC1C(OS(O)(=O)=O)C(O)C(OC2C(C(OS(O)(=O)=O)C(OC3C(C(O)C(O)C(O3)C(O)=O)OS(O)(=O)=O)C(CO)O2)NS(O)(=O)=O)C(C(O)=O)O1 HTTJABKRGRZYRN-UHFFFAOYSA-N 0.000 description 1
- PDSUIXMZYNURGI-AVGNSLFASA-N His-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 PDSUIXMZYNURGI-AVGNSLFASA-N 0.000 description 1
- MJICNEVRDVQXJH-WDSOQIARSA-N His-Arg-Trp Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O MJICNEVRDVQXJH-WDSOQIARSA-N 0.000 description 1
- FPNWKONEZAVQJF-GUBZILKMSA-N His-Asn-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N FPNWKONEZAVQJF-GUBZILKMSA-N 0.000 description 1
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 1
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 1
- KNNSUUOHFVVJOP-GUBZILKMSA-N His-Glu-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N KNNSUUOHFVVJOP-GUBZILKMSA-N 0.000 description 1
- HAPWZEVRQYGLSG-IUCAKERBSA-N His-Gly-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O HAPWZEVRQYGLSG-IUCAKERBSA-N 0.000 description 1
- FYTCLUIYTYFGPT-YUMQZZPRSA-N His-Gly-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FYTCLUIYTYFGPT-YUMQZZPRSA-N 0.000 description 1
- ORERHHPZDDEMSC-VGDYDELISA-N His-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ORERHHPZDDEMSC-VGDYDELISA-N 0.000 description 1
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 1
- SAPLASXFNUYUFE-CQDKDKBSSA-N His-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N SAPLASXFNUYUFE-CQDKDKBSSA-N 0.000 description 1
- AYUOWUNWZGTNKB-ULQDDVLXSA-N His-Phe-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AYUOWUNWZGTNKB-ULQDDVLXSA-N 0.000 description 1
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 1
- FFKJUTZARGRVTH-KKUMJFAQSA-N His-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FFKJUTZARGRVTH-KKUMJFAQSA-N 0.000 description 1
- BRQKGRLDDDQWQJ-MBLNEYKQSA-N His-Thr-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O BRQKGRLDDDQWQJ-MBLNEYKQSA-N 0.000 description 1
- FCPSGEVYIVXPPO-QTKMDUPCSA-N His-Thr-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FCPSGEVYIVXPPO-QTKMDUPCSA-N 0.000 description 1
- AHEBIAHEZWQVHB-QTKMDUPCSA-N His-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O AHEBIAHEZWQVHB-QTKMDUPCSA-N 0.000 description 1
- KFQDSSNYWKZFOO-LSJOCFKGSA-N His-Val-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KFQDSSNYWKZFOO-LSJOCFKGSA-N 0.000 description 1
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 1
- 102000008949 Histocompatibility Antigens Class I Human genes 0.000 description 1
- 108010088652 Histocompatibility Antigens Class I Proteins 0.000 description 1
- 101000690301 Homo sapiens Aldo-keto reductase family 1 member C4 Proteins 0.000 description 1
- 101000897493 Homo sapiens C-C motif chemokine 26 Proteins 0.000 description 1
- 101000599951 Homo sapiens Insulin-like growth factor I Proteins 0.000 description 1
- 101001116548 Homo sapiens Protein CBFA2T1 Proteins 0.000 description 1
- YZJSUQQZGCHHNQ-UHFFFAOYSA-N Homoglutamine Chemical compound OC(=O)C(N)CCCC(N)=O YZJSUQQZGCHHNQ-UHFFFAOYSA-N 0.000 description 1
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 1
- 206010020772 Hypertension Diseases 0.000 description 1
- RCFDOSNHHZGBOY-ACZMJKKPSA-N Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(O)=O RCFDOSNHHZGBOY-ACZMJKKPSA-N 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- CWJQMCPYXNVMBS-STECZYCISA-N Ile-Arg-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N CWJQMCPYXNVMBS-STECZYCISA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- CTHAJJYOHOBUDY-GHCJXIJMSA-N Ile-Cys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N CTHAJJYOHOBUDY-GHCJXIJMSA-N 0.000 description 1
- OVPYIUNCVSOVNF-KQXIARHKSA-N Ile-Gln-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N OVPYIUNCVSOVNF-KQXIARHKSA-N 0.000 description 1
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 1
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 1
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 1
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- ZDNNDIJTUHQCAM-MXAVVETBSA-N Ile-Ser-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ZDNNDIJTUHQCAM-MXAVVETBSA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- GMUYXHHJAGQHGB-TUBUOCAGSA-N Ile-Thr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMUYXHHJAGQHGB-TUBUOCAGSA-N 0.000 description 1
- JSLIXOUMAOUGBN-JUKXBJQTSA-N Ile-Tyr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JSLIXOUMAOUGBN-JUKXBJQTSA-N 0.000 description 1
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 1
- 235000003332 Ilex aquifolium Nutrition 0.000 description 1
- 241000209027 Ilex aquifolium Species 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 102000017727 Immunoglobulin Variable Region Human genes 0.000 description 1
- 108010067060 Immunoglobulin Variable Region Proteins 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 241000976924 Inca Species 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 102100037852 Insulin-like growth factor I Human genes 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- 101710203526 Integrase Proteins 0.000 description 1
- 108010038452 Interleukin-3 Receptors Proteins 0.000 description 1
- 102000010790 Interleukin-3 Receptors Human genes 0.000 description 1
- 102000010781 Interleukin-6 Receptors Human genes 0.000 description 1
- 108010038501 Interleukin-6 Receptors Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 244000285963 Kluyveromyces fragilis Species 0.000 description 1
- 235000014663 Kluyveromyces fragilis Nutrition 0.000 description 1
- 241001138401 Kluyveromyces lactis Species 0.000 description 1
- SNDPXSYFESPGGJ-BYPYZUCNSA-N L-2-aminopentanoic acid Chemical compound CCC[C@H](N)C(O)=O SNDPXSYFESPGGJ-BYPYZUCNSA-N 0.000 description 1
- DEFJQIDDEAULHB-IMJSIDKUSA-N L-alanyl-L-alanine Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(O)=O DEFJQIDDEAULHB-IMJSIDKUSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- 239000004395 L-leucine Substances 0.000 description 1
- 235000019454 L-leucine Nutrition 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- SNDPXSYFESPGGJ-UHFFFAOYSA-N L-norVal-OH Natural products CCCC(N)C(O)=O SNDPXSYFESPGGJ-UHFFFAOYSA-N 0.000 description 1
- HXEACLLIILLPRG-YFKPBYRVSA-N L-pipecolic acid Chemical compound [O-]C(=O)[C@@H]1CCCC[NH2+]1 HXEACLLIILLPRG-YFKPBYRVSA-N 0.000 description 1
- DZLNHFMRPBPULJ-VKHMYHEASA-N L-thioproline Chemical compound OC(=O)[C@@H]1CSCN1 DZLNHFMRPBPULJ-VKHMYHEASA-N 0.000 description 1
- KKJQZEWNZXRJFG-UHFFFAOYSA-N L-trans-4-Methyl-2-pyrrolidinecarboxylic acid Chemical compound CC1CNC(C(O)=O)C1 KKJQZEWNZXRJFG-UHFFFAOYSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- 108090001090 Lectins Proteins 0.000 description 1
- 102000004856 Lectins Human genes 0.000 description 1
- 241000270322 Lepidosauria Species 0.000 description 1
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 1
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- NTRAGDHVSGKUSF-AVGNSLFASA-N Leu-Arg-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NTRAGDHVSGKUSF-AVGNSLFASA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 1
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 1
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 1
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 1
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 1
- XBCWOTOCBXXJDG-BZSNNMDCSA-N Leu-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XBCWOTOCBXXJDG-BZSNNMDCSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 1
- OTXBNHIUIHNGAO-UWVGGRQHSA-N Leu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN OTXBNHIUIHNGAO-UWVGGRQHSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 1
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- XGDCYUQSFDQISZ-BQBZGAKWSA-N Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O XGDCYUQSFDQISZ-BQBZGAKWSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 1
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- FPFOYSCDUWTZBF-IHPCNDPISA-N Leu-Trp-Leu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]([NH3+])CC(C)C)C(=O)N[C@@H](CC(C)C)C([O-])=O)=CNC2=C1 FPFOYSCDUWTZBF-IHPCNDPISA-N 0.000 description 1
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 1
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 1
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 1
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 1
- SQXUUGUCGJSWCK-CIUDSAMLSA-N Lys-Asp-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N SQXUUGUCGJSWCK-CIUDSAMLSA-N 0.000 description 1
- RLZDUFRBMQNYIJ-YUMQZZPRSA-N Lys-Cys-Gly Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N RLZDUFRBMQNYIJ-YUMQZZPRSA-N 0.000 description 1
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 1
- KZOHPCYVORJBLG-AVGNSLFASA-N Lys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N KZOHPCYVORJBLG-AVGNSLFASA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- UETQMSASAVBGJY-QWRGUYRKSA-N Lys-Gly-His Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 UETQMSASAVBGJY-QWRGUYRKSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- ATIPDCIQTUXABX-UWVGGRQHSA-N Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ATIPDCIQTUXABX-UWVGGRQHSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- NVGBPTNZLWRQSY-UWVGGRQHSA-N Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN NVGBPTNZLWRQSY-UWVGGRQHSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- DAHQKYYIXPBESV-UWVGGRQHSA-N Lys-Met-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O DAHQKYYIXPBESV-UWVGGRQHSA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- YSZNURNVYFUEHC-BQBZGAKWSA-N Lys-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(O)=O YSZNURNVYFUEHC-BQBZGAKWSA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- TVOOGUNBIWAURO-KATARQTJSA-N Lys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N)O TVOOGUNBIWAURO-KATARQTJSA-N 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 1
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 1
- ZVXSESPJMKNIQA-YXMSTPNBSA-N Lys-Thr-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 ZVXSESPJMKNIQA-YXMSTPNBSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 1
- MYTOTTSMVMWVJN-STQMWFEESA-N Lys-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MYTOTTSMVMWVJN-STQMWFEESA-N 0.000 description 1
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 1
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 1
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 1
- BWECSLVQIWEMSC-IHRRRGAJSA-N Lys-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BWECSLVQIWEMSC-IHRRRGAJSA-N 0.000 description 1
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 1
- 102400001132 Melanin-concentrating hormone Human genes 0.000 description 1
- 101800002739 Melanin-concentrating hormone Proteins 0.000 description 1
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- ZMYHJISLFYTQGK-FXQIFTODSA-N Met-Asp-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMYHJISLFYTQGK-FXQIFTODSA-N 0.000 description 1
- YLLWCSDBVGZLOW-CIUDSAMLSA-N Met-Gln-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O YLLWCSDBVGZLOW-CIUDSAMLSA-N 0.000 description 1
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 1
- XMQZLGBUJMMODC-AVGNSLFASA-N Met-His-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O XMQZLGBUJMMODC-AVGNSLFASA-N 0.000 description 1
- XIGAHPDZLAYQOS-SRVKXCTJSA-N Met-Pro-Pro Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 XIGAHPDZLAYQOS-SRVKXCTJSA-N 0.000 description 1
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 241000699660 Mus musculus Species 0.000 description 1
- 101001038442 Mus musculus Mitochondrial glutamate carrier 1 Proteins 0.000 description 1
- 101100243377 Mus musculus Pepd gene Proteins 0.000 description 1
- 206010028289 Muscle atrophy Diseases 0.000 description 1
- 102000003505 Myosin Human genes 0.000 description 1
- 108030001204 Myosin ATPases Proteins 0.000 description 1
- 102000016349 Myosin Light Chains Human genes 0.000 description 1
- 108010067385 Myosin Light Chains Proteins 0.000 description 1
- NQTADLQHYWFPDB-UHFFFAOYSA-N N-Hydroxysuccinimide Chemical compound ON1C(=O)CCC1=O NQTADLQHYWFPDB-UHFFFAOYSA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- CHJJGSNFBQVOTG-UHFFFAOYSA-N N-methyl-guanidine Natural products CNC(N)=N CHJJGSNFBQVOTG-UHFFFAOYSA-N 0.000 description 1
- 238000005481 NMR spectroscopy Methods 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- BDFNAGOUUFOPSP-UHFFFAOYSA-N Nasvin Natural products O1C2=C(Cl)C(O)=C(Cl)C(C)=C2C(=O)OC2=C1C(C(C)=CC)=C(Cl)C(O)=C2CCCC BDFNAGOUUFOPSP-UHFFFAOYSA-N 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 108090000189 Neuropeptides Proteins 0.000 description 1
- 102000003797 Neuropeptides Human genes 0.000 description 1
- 241000221960 Neurospora Species 0.000 description 1
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 1
- 241000320412 Ogataea angusta Species 0.000 description 1
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 238000002944 PCR assay Methods 0.000 description 1
- 108091008606 PDGF receptors Proteins 0.000 description 1
- 101150029183 PEP4 gene Proteins 0.000 description 1
- 241001631646 Papillomaviridae Species 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 1
- PLNHHOXNVSYKOB-JYJNAYRXSA-N Phe-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N PLNHHOXNVSYKOB-JYJNAYRXSA-N 0.000 description 1
- UMKYAYXCMYYNHI-AVGNSLFASA-N Phe-Gln-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N UMKYAYXCMYYNHI-AVGNSLFASA-N 0.000 description 1
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 1
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 1
- UAMFZRNCIFFMLE-FHWLQOOXSA-N Phe-Glu-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N UAMFZRNCIFFMLE-FHWLQOOXSA-N 0.000 description 1
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- XEXSSIBQYNKFBX-KBPBESRZSA-N Phe-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CC=CC=C1 XEXSSIBQYNKFBX-KBPBESRZSA-N 0.000 description 1
- WFHRXJOZEXUKLV-IRXDYDNUSA-N Phe-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 WFHRXJOZEXUKLV-IRXDYDNUSA-N 0.000 description 1
- NAOVYENZCWFBDG-BZSNNMDCSA-N Phe-His-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 NAOVYENZCWFBDG-BZSNNMDCSA-N 0.000 description 1
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 1
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 1
- ZIQQNOXKEFDPBE-BZSNNMDCSA-N Phe-Lys-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N ZIQQNOXKEFDPBE-BZSNNMDCSA-N 0.000 description 1
- PTLMYJOMJLTMCB-KKUMJFAQSA-N Phe-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N PTLMYJOMJLTMCB-KKUMJFAQSA-N 0.000 description 1
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 1
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 1
- CKJACGQPCPMWIT-UFYCRDLUSA-N Phe-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CKJACGQPCPMWIT-UFYCRDLUSA-N 0.000 description 1
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 1
- 102100027330 Phosphoribosylaminoimidazole carboxylase Human genes 0.000 description 1
- 241000235648 Pichia Species 0.000 description 1
- 231100000742 Plant toxin Toxicity 0.000 description 1
- 102000011653 Platelet-Derived Growth Factor Receptors Human genes 0.000 description 1
- 239000004793 Polystyrene Substances 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 1
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- OYEUSRAZOGIDBY-JYJNAYRXSA-N Pro-Arg-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OYEUSRAZOGIDBY-JYJNAYRXSA-N 0.000 description 1
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 1
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- VPFGPKIWSDVTOY-SRVKXCTJSA-N Pro-Glu-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O VPFGPKIWSDVTOY-SRVKXCTJSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 1
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 1
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 1
- WCNVGGZRTNHOOS-ULQDDVLXSA-N Pro-Lys-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O WCNVGGZRTNHOOS-ULQDDVLXSA-N 0.000 description 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 1
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 1
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 1
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 1
- HOJUNFDJDAPVBI-BZSNNMDCSA-N Pro-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 HOJUNFDJDAPVBI-BZSNNMDCSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 101000762949 Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) Exotoxin A Proteins 0.000 description 1
- 239000013614 RNA sample Substances 0.000 description 1
- 230000004570 RNA-binding Effects 0.000 description 1
- 238000011529 RT qPCR Methods 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 102100025290 Ribonuclease H1 Human genes 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 108010039491 Ricin Proteins 0.000 description 1
- 239000006146 Roswell Park Memorial Institute medium Substances 0.000 description 1
- 108010077895 Sarcosine Proteins 0.000 description 1
- 229920002684 Sepharose Polymers 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- IYCBDVBJWDXQRR-FXQIFTODSA-N Ser-Ala-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IYCBDVBJWDXQRR-FXQIFTODSA-N 0.000 description 1
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 1
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- KMWFXJCGRXBQAC-CIUDSAMLSA-N Ser-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N KMWFXJCGRXBQAC-CIUDSAMLSA-N 0.000 description 1
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 1
- DGHFNYXVIXNNMC-GUBZILKMSA-N Ser-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N DGHFNYXVIXNNMC-GUBZILKMSA-N 0.000 description 1
- LAFKUZYWNCHOHT-WHFBIAKZSA-N Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O LAFKUZYWNCHOHT-WHFBIAKZSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 1
- ZUDXUJSYCCNZQJ-DCAQKATOSA-N Ser-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N ZUDXUJSYCCNZQJ-DCAQKATOSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 1
- WGDYNRCOQRERLZ-KKUMJFAQSA-N Ser-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N WGDYNRCOQRERLZ-KKUMJFAQSA-N 0.000 description 1
- PPQRSMGDOHLTBE-UWVGGRQHSA-N Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PPQRSMGDOHLTBE-UWVGGRQHSA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- 108010068542 Somatotropin Receptors Proteins 0.000 description 1
- 108010019965 Spectrin Proteins 0.000 description 1
- 102000005890 Spectrin Human genes 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 108010008038 Synthetic Vaccines Proteins 0.000 description 1
- 102100036011 T-cell surface glycoprotein CD4 Human genes 0.000 description 1
- 108010006785 Taq Polymerase Proteins 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- JMQUAZXYFAEOIH-XGEHTFHBSA-N Thr-Arg-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)O JMQUAZXYFAEOIH-XGEHTFHBSA-N 0.000 description 1
- LHUBVKCLOVALIA-HJGDQZAQSA-N Thr-Arg-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LHUBVKCLOVALIA-HJGDQZAQSA-N 0.000 description 1
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 1
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 1
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 1
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 1
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 1
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 1
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 1
- BECPPKYKPSRKCP-ZDLURKLDSA-N Thr-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O BECPPKYKPSRKCP-ZDLURKLDSA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 1
- CYVQBKQYQGEELV-NKIYYHGXSA-N Thr-His-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CYVQBKQYQGEELV-NKIYYHGXSA-N 0.000 description 1
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- QOLYAJSZHIJCTO-VQVTYTSYSA-N Thr-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O QOLYAJSZHIJCTO-VQVTYTSYSA-N 0.000 description 1
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 1
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 1
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 108090000190 Thrombin Proteins 0.000 description 1
- 108090000253 Thyrotropin Receptors Proteins 0.000 description 1
- 102100029337 Thyrotropin receptor Human genes 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- SVGAWGVHFIYAEE-JSGCOSHPSA-N Trp-Gly-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 SVGAWGVHFIYAEE-JSGCOSHPSA-N 0.000 description 1
- ILDJYIDXESUBOE-HSCHXYMDSA-N Trp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ILDJYIDXESUBOE-HSCHXYMDSA-N 0.000 description 1
- GEGYPBOPIGNZIF-CWRNSKLLSA-N Trp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O GEGYPBOPIGNZIF-CWRNSKLLSA-N 0.000 description 1
- YBRHKUNWEYBZGT-WLTAIBSBSA-N Trp-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(O)=O)=CNC2=C1 YBRHKUNWEYBZGT-WLTAIBSBSA-N 0.000 description 1
- LNGFWVPNKLWATF-ZVZYQTTQSA-N Trp-Val-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LNGFWVPNKLWATF-ZVZYQTTQSA-N 0.000 description 1
- 241000223105 Trypanosoma brucei Species 0.000 description 1
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 1
- HSVPZJLMPLMPOX-BPNCWPANSA-N Tyr-Arg-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O HSVPZJLMPLMPOX-BPNCWPANSA-N 0.000 description 1
- CRWOSTCODDFEKZ-HRCADAONSA-N Tyr-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CRWOSTCODDFEKZ-HRCADAONSA-N 0.000 description 1
- OEVJGIHPQOXYFE-SRVKXCTJSA-N Tyr-Asn-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OEVJGIHPQOXYFE-SRVKXCTJSA-N 0.000 description 1
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 1
- JFDGVHXRCKEBAU-KKUMJFAQSA-N Tyr-Asp-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JFDGVHXRCKEBAU-KKUMJFAQSA-N 0.000 description 1
- HZZKQZDUIKVFDZ-AVGNSLFASA-N Tyr-Gln-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)O HZZKQZDUIKVFDZ-AVGNSLFASA-N 0.000 description 1
- PMDWYLVWHRTJIW-STQMWFEESA-N Tyr-Gly-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PMDWYLVWHRTJIW-STQMWFEESA-N 0.000 description 1
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 1
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 1
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 1
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 1
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 1
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 1
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 1
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 1
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 1
- VBFVQTPETKJCQW-RPTUDFQQSA-N Tyr-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VBFVQTPETKJCQW-RPTUDFQQSA-N 0.000 description 1
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 1
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 1
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- XTOCLOATLKOZAU-JBACZVJFSA-N Tyr-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N XTOCLOATLKOZAU-JBACZVJFSA-N 0.000 description 1
- 244000301083 Ustilago maydis Species 0.000 description 1
- 235000015919 Ustilago maydis Nutrition 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- WITCOKQIPFWQQD-FSPLSTOPSA-N Val-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O WITCOKQIPFWQQD-FSPLSTOPSA-N 0.000 description 1
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- UPJONISHZRADBH-XPUUQOCRSA-N Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UPJONISHZRADBH-XPUUQOCRSA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- GJNDXQBALKCYSZ-RYUDHWBXSA-N Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 GJNDXQBALKCYSZ-RYUDHWBXSA-N 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 1
- WFTKOJGOOUJLJV-VKOGCVSHSA-N Val-Trp-Ile Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O)NC(=O)[C@@H]([NH3+])C(C)C)=CNC2=C1 WFTKOJGOOUJLJV-VKOGCVSHSA-N 0.000 description 1
- IWADHXDXSQONEL-GUBZILKMSA-N Val-Val-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O IWADHXDXSQONEL-GUBZILKMSA-N 0.000 description 1
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 108700005077 Viral Genes Proteins 0.000 description 1
- 241000269370 Xenopus <genus> Species 0.000 description 1
- VWQVUPCCIRVNHF-OUBTZVSYSA-N Yttrium-90 Chemical compound [90Y] VWQVUPCCIRVNHF-OUBTZVSYSA-N 0.000 description 1
- PCBMGUSDYHYVBQ-SOOFDHNKSA-N [4-amino-2-[(3R,4S,5R)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-1H-imidazol-5-yl]phosphonic acid Chemical compound P(=O)(O)(O)C=1N=C(NC1N)C1[C@H](O)[C@H](O)[C@H](O1)CO PCBMGUSDYHYVBQ-SOOFDHNKSA-N 0.000 description 1
- 108020002494 acetyltransferase Proteins 0.000 description 1
- 102000005421 acetyltransferase Human genes 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 208000009956 adenocarcinoma Diseases 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- 230000001464 adherent effect Effects 0.000 description 1
- 210000004100 adrenal gland Anatomy 0.000 description 1
- 229940009456 adriamycin Drugs 0.000 description 1
- 201000006960 adult spinal muscular atrophy Diseases 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 238000005273 aeration Methods 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 238000012867 alanine scanning Methods 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010056243 alanylalanine Proteins 0.000 description 1
- 229940037003 alum Drugs 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- 150000001412 amines Chemical class 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 230000000689 aminoacylating effect Effects 0.000 description 1
- 238000012870 ammonium sulfate precipitation Methods 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- 208000036878 aneuploidy Diseases 0.000 description 1
- 231100001075 aneuploidy Toxicity 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 230000005875 antibody response Effects 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010086780 arginyl-glycyl-aspartyl-alanine Proteins 0.000 description 1
- 108010007483 arginyl-leucyl-tyrosyl-glutamic acid Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 230000037007 arousal Effects 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000006793 arrhythmia Effects 0.000 description 1
- 206010003119 arrhythmia Diseases 0.000 description 1
- 229940009098 aspartate Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 238000000376 autoradiography Methods 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 239000000688 bacterial toxin Substances 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 102000012740 beta Adrenergic Receptors Human genes 0.000 description 1
- 108010079452 beta Adrenergic Receptors Proteins 0.000 description 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 239000011230 binding agent Substances 0.000 description 1
- 238000011953 bioanalysis Methods 0.000 description 1
- 238000004166 bioassay Methods 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 230000031018 biological processes and functions Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 239000002981 blocking agent Substances 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 230000036772 blood pressure Effects 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 201000008275 breast carcinoma Diseases 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 239000006172 buffering agent Substances 0.000 description 1
- 244000309464 bull Species 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 229910001424 calcium ion Inorganic materials 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 150000001718 carbodiimides Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 206010061592 cardiac fibrillation Diseases 0.000 description 1
- 210000001054 cardiac fibroblast Anatomy 0.000 description 1
- 125000002091 cationic group Chemical group 0.000 description 1
- 230000021164 cell adhesion Effects 0.000 description 1
- 230000008568 cell cell communication Effects 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000022131 cell cycle Effects 0.000 description 1
- 230000032823 cell division Effects 0.000 description 1
- 230000007910 cell fusion Effects 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 230000006037 cell lysis Effects 0.000 description 1
- 238000001516 cell proliferation assay Methods 0.000 description 1
- 210000004671 cell-free system Anatomy 0.000 description 1
- 108091092356 cellular DNA Proteins 0.000 description 1
- 241000902900 cellular organisms Species 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000013522 chelant Substances 0.000 description 1
- 239000002738 chelating agent Substances 0.000 description 1
- 238000002038 chemiluminescence detection Methods 0.000 description 1
- 230000035572 chemosensitivity Effects 0.000 description 1
- 235000013330 chicken meat Nutrition 0.000 description 1
- 210000003483 chromatin Anatomy 0.000 description 1
- 231100000005 chromosome aberration Toxicity 0.000 description 1
- 230000014107 chromosome localization Effects 0.000 description 1
- 230000001684 chronic effect Effects 0.000 description 1
- PMMYEEVYMWASQN-IMJSIDKUSA-N cis-4-Hydroxy-L-proline Chemical compound O[C@@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-IMJSIDKUSA-N 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 238000012411 cloning technique Methods 0.000 description 1
- 210000001072 colon Anatomy 0.000 description 1
- 238000007398 colorimetric assay Methods 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 238000009833 condensation Methods 0.000 description 1
- 230000005494 condensation Effects 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 230000016396 cytokine production Effects 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 239000003398 denaturant Substances 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 230000030609 dephosphorylation Effects 0.000 description 1
- 238000006209 dephosphorylation reaction Methods 0.000 description 1
- 210000001047 desmosome Anatomy 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 229960000633 dextran sulfate Drugs 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 206010012601 diabetes mellitus Diseases 0.000 description 1
- 238000002050 diffraction method Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- SWSQBOPZIKWTGO-UHFFFAOYSA-N dimethylaminoamidine Natural products CN(C)C(N)=N SWSQBOPZIKWTGO-UHFFFAOYSA-N 0.000 description 1
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 1
- 230000005684 electric field Effects 0.000 description 1
- 238000002003 electron diffraction Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 210000002889 endothelial cell Anatomy 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 210000002919 epithelial cell Anatomy 0.000 description 1
- 150000002118 epoxides Chemical class 0.000 description 1
- 235000020774 essential nutrients Nutrition 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 230000004720 fertilization Effects 0.000 description 1
- 230000002600 fibrillogenic effect Effects 0.000 description 1
- 210000002950 fibroblast Anatomy 0.000 description 1
- 235000019688 fish Nutrition 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 1
- 230000004907 flux Effects 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 238000012224 gene deletion Methods 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 238000012254 genetic linkage analysis Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 229930195712 glutamate Natural products 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- 108010040856 glutamyl-cysteinyl-alanine Proteins 0.000 description 1
- 108010037389 glutamyl-cysteinyl-lysine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- RWSXRVCMGQZWBV-WDSKDSINSA-N glutathione Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@@H](CS)C(=O)NCC(O)=O RWSXRVCMGQZWBV-WDSKDSINSA-N 0.000 description 1
- YPZRWBKMTBYPTK-BJDJZHNGSA-N glutathione disulfide Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@H](C(=O)NCC(O)=O)CSSC[C@@H](C(=O)NCC(O)=O)NC(=O)CC[C@H](N)C(O)=O YPZRWBKMTBYPTK-BJDJZHNGSA-N 0.000 description 1
- 229960002449 glycine Drugs 0.000 description 1
- KZNQNBZMBZJQJO-YFKPBYRVSA-N glycylproline Chemical compound NCC(=O)N1CCC[C@H]1C(O)=O KZNQNBZMBZJQJO-YFKPBYRVSA-N 0.000 description 1
- 230000002414 glycolytic effect Effects 0.000 description 1
- 102000035122 glycosylated proteins Human genes 0.000 description 1
- 108091005608 glycosylated proteins Proteins 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 239000005090 green fluorescent protein Substances 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- YQOKLYTXVFAUCW-UHFFFAOYSA-N guanidine;isothiocyanic acid Chemical compound N=C=S.NC(N)=N YQOKLYTXVFAUCW-UHFFFAOYSA-N 0.000 description 1
- ZJYYHGLJYGJLLN-UHFFFAOYSA-N guanidinium thiocyanate Chemical compound SC#N.NC(N)=N ZJYYHGLJYGJLLN-UHFFFAOYSA-N 0.000 description 1
- 229940029575 guanosine Drugs 0.000 description 1
- 208000018706 hematopoietic system disease Diseases 0.000 description 1
- 229960002897 heparin Drugs 0.000 description 1
- 229920000669 heparin Polymers 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 229920001519 homopolymer Polymers 0.000 description 1
- 102000054751 human RUNX1T1 Human genes 0.000 description 1
- 230000008348 humoral response Effects 0.000 description 1
- 210000004408 hybridoma Anatomy 0.000 description 1
- ZMZDMBWJUHKJPS-UHFFFAOYSA-N hydrogen thiocyanate Natural products SC#N ZMZDMBWJUHKJPS-UHFFFAOYSA-N 0.000 description 1
- 238000002169 hydrotherapy Methods 0.000 description 1
- MWFRVMDVLYIXJF-BYPYZUCNSA-N hydroxyethylcysteine Chemical compound OC(=O)[C@@H](N)CSCCO MWFRVMDVLYIXJF-BYPYZUCNSA-N 0.000 description 1
- 229910052588 hydroxylapatite Inorganic materials 0.000 description 1
- 229960002591 hydroxyproline Drugs 0.000 description 1
- 206010020718 hyperplasia Diseases 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 230000008105 immune reaction Effects 0.000 description 1
- 208000026278 immune system disease Diseases 0.000 description 1
- 238000000760 immunoelectrophoresis Methods 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 239000007943 implant Substances 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000005462 in vivo assay Methods 0.000 description 1
- 239000012678 infectious agent Substances 0.000 description 1
- 230000002458 infectious effect Effects 0.000 description 1
- 230000001524 infective effect Effects 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 229910052500 inorganic mineral Inorganic materials 0.000 description 1
- 150000004001 inositols Chemical class 0.000 description 1
- 230000006362 insulin response pathway Effects 0.000 description 1
- 230000008611 intercellular interaction Effects 0.000 description 1
- 239000000543 intermediate Substances 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000010253 intravenous injection Methods 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 238000011813 knockout mouse model Methods 0.000 description 1
- HXEACLLIILLPRG-RXMQYKEDSA-N l-pipecolic acid Natural products OC(=O)[C@H]1CCCCN1 HXEACLLIILLPRG-RXMQYKEDSA-N 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 239000002523 lectin Substances 0.000 description 1
- DVCSNHXRZUVYAM-BQBZGAKWSA-N leu-asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O DVCSNHXRZUVYAM-BQBZGAKWSA-N 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 208000032839 leukemia Diseases 0.000 description 1
- 238000000670 ligand binding assay Methods 0.000 description 1
- 108020001756 ligand binding domains Proteins 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 238000009630 liquid culture Methods 0.000 description 1
- 244000144972 livestock Species 0.000 description 1
- 210000003141 lower extremity Anatomy 0.000 description 1
- 230000001926 lymphatic effect Effects 0.000 description 1
- 210000004698 lymphocyte Anatomy 0.000 description 1
- ORRDHOMWDPJSNL-UHFFFAOYSA-N melanin concentrating hormone Chemical compound N1C(=O)C(C(C)C)NC(=O)C(CCCNC(N)=N)NC(=O)CNC(=O)C(C(C)C)NC(=O)C(CCSC)NC(=O)C(NC(=O)C(CCCNC(N)=N)NC(=O)C(NC(=O)C(NC(=O)C(N)CC(O)=O)C(C)O)CCSC)CSSCC(C(=O)NC(CC=2C3=CC=CC=C3NC=2)C(=O)NC(CCC(O)=O)C(=O)NC(C(C)C)C(O)=O)NC(=O)C2CCCN2C(=O)C(CCCNC(N)=N)NC(=O)C1CC1=CC=C(O)C=C1 ORRDHOMWDPJSNL-UHFFFAOYSA-N 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 230000034153 membrane organization Effects 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- CWWARWOPSKGELM-SARDKLJWSA-N methyl (2s)-2-[[(2s)-2-[[2-[[(2s)-2-[[(2s)-2-[[(2s)-5-amino-2-[[(2s)-5-amino-2-[[(2s)-1-[(2s)-6-amino-2-[[(2s)-1-[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carbonyl]amino]hexanoyl]pyrrolidine-2-carbonyl]amino]-5-oxopentanoyl]amino]-5 Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)OC)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCCCN)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CCCN=C(N)N)C1=CC=CC=C1 CWWARWOPSKGELM-SARDKLJWSA-N 0.000 description 1
- 235000013336 milk Nutrition 0.000 description 1
- 239000008267 milk Substances 0.000 description 1
- 210000004080 milk Anatomy 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 239000011707 mineral Substances 0.000 description 1
- 235000010755 mineral Nutrition 0.000 description 1
- 239000002480 mineral oil Substances 0.000 description 1
- 235000010446 mineral oil Nutrition 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 230000004660 morphological change Effects 0.000 description 1
- 210000003098 myoblast Anatomy 0.000 description 1
- 210000001087 myotubule Anatomy 0.000 description 1
- 210000004897 n-terminal region Anatomy 0.000 description 1
- 230000006654 negative regulation of apoptotic process Effects 0.000 description 1
- 230000009707 neogenesis Effects 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 239000002858 neurotransmitter agent Substances 0.000 description 1
- PGSADBUBUOPOJS-UHFFFAOYSA-N neutral red Chemical compound Cl.C1=C(C)C(N)=CC2=NC3=CC(N(C)C)=CC=C3N=C21 PGSADBUBUOPOJS-UHFFFAOYSA-N 0.000 description 1
- 239000002547 new drug Substances 0.000 description 1
- 239000002853 nucleic acid probe Substances 0.000 description 1
- 125000002347 octyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 210000000287 oocyte Anatomy 0.000 description 1
- 230000003204 osmotic effect Effects 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- YPZRWBKMTBYPTK-UHFFFAOYSA-N oxidized gamma-L-glutamyl-L-cysteinylglycine Natural products OC(=O)C(N)CCC(=O)NC(C(=O)NCC(O)=O)CSSCC(C(=O)NCC(O)=O)NC(=O)CCC(N)C(O)=O YPZRWBKMTBYPTK-UHFFFAOYSA-N 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 230000001575 pathological effect Effects 0.000 description 1
- XYJRXVWERLGGKC-UHFFFAOYSA-D pentacalcium;hydroxide;triphosphate Chemical compound [OH-].[Ca+2].[Ca+2].[Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O XYJRXVWERLGGKC-UHFFFAOYSA-D 0.000 description 1
- 238000010647 peptide synthesis reaction Methods 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 239000000546 pharmaceutical excipient Substances 0.000 description 1
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 description 1
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 150000003904 phospholipids Chemical class 0.000 description 1
- 108010035774 phosphoribosylaminoimidazole carboxylase Proteins 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 238000005222 photoaffinity labeling Methods 0.000 description 1
- 108010031345 placental alkaline phosphatase Proteins 0.000 description 1
- 239000003123 plant toxin Substances 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920002223 polystyrene Polymers 0.000 description 1
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 1
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 1
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 1
- 238000002600 positron emission tomography Methods 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 210000002307 prostate Anatomy 0.000 description 1
- 238000000159 protein binding assay Methods 0.000 description 1
- 230000012846 protein folding Effects 0.000 description 1
- 230000006916 protein interaction Effects 0.000 description 1
- 230000002797 proteolythic effect Effects 0.000 description 1
- 230000006337 proteolytic cleavage Effects 0.000 description 1
- 229950010131 puromycin Drugs 0.000 description 1
- 239000002510 pyrogen Substances 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 238000003127 radioimmunoassay Methods 0.000 description 1
- 238000003156 radioimmunoprecipitation Methods 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 239000011535 reaction buffer Substances 0.000 description 1
- 230000008707 rearrangement Effects 0.000 description 1
- 239000001044 red dye Substances 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000001850 reproductive effect Effects 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 238000012340 reverse transcriptase PCR Methods 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 238000004007 reversed phase HPLC Methods 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 108010052833 ribonuclease HI Proteins 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000013391 scatchard analysis Methods 0.000 description 1
- 238000010845 search algorithm Methods 0.000 description 1
- 238000002805 secondary assay Methods 0.000 description 1
- 210000004739 secretory vesicle Anatomy 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 230000037432 silent mutation Effects 0.000 description 1
- 239000000377 silicon dioxide Substances 0.000 description 1
- 210000002363 skeletal muscle cell Anatomy 0.000 description 1
- 210000002460 smooth muscle Anatomy 0.000 description 1
- 229910001415 sodium ion Inorganic materials 0.000 description 1
- 239000001488 sodium phosphate Substances 0.000 description 1
- 229910000162 sodium phosphate Inorganic materials 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 210000001082 somatic cell Anatomy 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 238000001179 sorption measurement Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 208000002320 spinal muscular atrophy Diseases 0.000 description 1
- 210000000952 spleen Anatomy 0.000 description 1
- 108010018381 streptavidin-binding peptide Proteins 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 238000004114 suspension culture Methods 0.000 description 1
- 230000002195 synergetic effect Effects 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- NPDBDJFLKKQMCM-UHFFFAOYSA-N tert-butylglycine Chemical compound CC(C)(C)C(N)C(O)=O NPDBDJFLKKQMCM-UHFFFAOYSA-N 0.000 description 1
- 229960000814 tetanus toxoid Drugs 0.000 description 1
- 125000003831 tetrazolyl group Chemical group 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 229960004072 thrombin Drugs 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 108091084372 thymopoietin family Proteins 0.000 description 1
- 210000001685 thyroid gland Anatomy 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000011830 transgenic mouse model Methods 0.000 description 1
- 230000007723 transport mechanism Effects 0.000 description 1
- 239000001226 triphosphate Substances 0.000 description 1
- 235000011178 triphosphate Nutrition 0.000 description 1
- 125000002264 triphosphate group Chemical class [H]OP(=O)(O[H])OP(=O)(O[H])OP(=O)(O[H])O* 0.000 description 1
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 210000004881 tumor cell Anatomy 0.000 description 1
- 230000005748 tumor development Effects 0.000 description 1
- 230000004614 tumor growth Effects 0.000 description 1
- 239000000439 tumor marker Substances 0.000 description 1
- 208000005606 type IV spinal muscular atrophy Diseases 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 108010068794 tyrosyl-tyrosyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 210000003606 umbilical vein Anatomy 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241000701366 unidentified nuclear polyhedrosis viruses Species 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 210000004291 uterus Anatomy 0.000 description 1
- 238000002255 vaccination Methods 0.000 description 1
- 230000002861 ventricular Effects 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/575—Hormones
- C07K14/66—Thymopoietins
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P43/00—Drugs for specific purposes, not provided for in groups A61P1/00-A61P41/00
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Medicinal Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Biophysics (AREA)
- Zoology (AREA)
- Genetics & Genomics (AREA)
- Toxicology (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Endocrinology (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Gastroenterology & Hepatology (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Pharmacology & Pharmacy (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Peptides Or Proteins (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
Abstract
The present invention relates to polynucleotide and polypeptide molecules for ZTMPO-1, a soluble protein with homology to emerin and the thymopoietins. The polypeptides, and polynucleotides encoding them, are useful for modulating cellular proliferation and differentiation and may be used for diagnostic purposes. The present invention also includes antibodies to the ZTMPO-1 polypeptides.
Description
DESCRIPTION
BACKGROUND OF THE INVENTION
There is a growing family of proteins which share regions of sequence homology and localization to the nucleus. These proteins include the thymopoietins (Zevin-Sonkin et al., Immuno. Letts. 31:301-10, 1992; Harris et al., Proc. Natl. Acad. Sci. USA 91:6283-87, 1994; Harris et al., Genomics 28:198-205, 1995; Berger et al., Genome Res. 6:361-70, 1996 and Ishijima et al., Biochem. Biophys. Res. Comm. 226:431-8, 1996), lamina associated proteins (Senior and Gerace, J. Cell Biol. 107:2029-36, 1988; Worman et al., J. Cell Biol. 111:1535-42, 1990; Wozniak and Blobel, J. Cell Biol. 119:1441-9, 1992; Foisner and Gerace, Cell 73:1267-79, 1993; Ye and Worman, J. Biol. Chem. 269:11306-11, 1994 and Furukawa et al., EMBO J. 14:1626-36, 1995) and emerin (Bione et al., Nat. Genet. 8:323-7, 1994; Manilal et al., Hum. Mol. Genet. 5:801-8, 1996 and Small et al., Mamm. Genom. 8:337-41, 1997).
Emerin is a nuclear membrane protein responsible for the X-linked recessive disorder Emery-Dreifuss muscular dystrophy. Mouse, rat and human emerin sequences have been reported (Bione et al., Nat. Genet. 8:323-7, 1994; Manilal et al., Hum. Mol. Genet. 5:801-8, 1996 and Small et al., Mamm. Genom. 8:337-41, 1997). The mouse, rat and human emerin share 73-95% nucleotide and amino acid identity. All share some structural homology with the thymopoietins and LAP2, in particular within portions of the conserved N-terminal region and the hydrophobic putative transmembrane domain of thymopoietin. Like the thymopoietins and LAP2, emerin is ubiquitously expressed, and it is predicted that emerin has the same inner nuclear membrane organization as do thymopoietin and LAP2 (Manilal et al., ibid.). Antisera raised against emerin peptides localized expression of the protein to the nuclear membranes of normal skeletal and cardiac muscle cells, but found it to be absent in those cells of patients with muscular dystrophy. It is unclear how a deficiency of a nuclear protein results in the disease (Nagano et al., Nat. Genet. 12:254-9, 1996 and Small et al., ibid.).
The present invention provides associated polypeptides for these and other uses that should be apparent to those skilled in the art from the teachings herein.
SUMMARY OF THE INVENTION
Within one aspect the invention provides an isolated polypeptide comprising a sequence of amino acid residues that is at least 80% identical in amino acid sequence to residues 1 through 876 of SEQ ID NO:2. Within one embodiment the sequence of amino acid residues is at least 90% identical. Within another embodiment any differences between said polypeptide and residues 1 through 876 of SEQ ID NO:2 are due to conservative amino acid substitutions. Within another embodiment the polypeptide specifically binds with an antibody that specifically binds with a polypeptide consisting of the amino acid sequence of SEQ ID NO:2. Within a further embodiment the polypeptide is covalently linked to a moiety selected from the group consisting of affinity tags, radionucleotides, enzymes and fluorophores. Within a related embodiment the moiety is an affinity tag selected from the group consisting of polyhistidine, FLAG, Glu-Glu, glutathione S transferase and an immunoglobulin heavy chain constant region.
Also provided is an isolated polypeptide comprising the amino acid sequence of SEQ ID NO:2.
Within another aspect the invention provides a fusion protein consisting essentially of a first portion and a second portion joined by a peptide bond, said first portion consisting of a polypeptide comprising a sequence of amino acid residues that is at least 80% identical in amino acid sequence to residues 1 through 876 of SEQ ID NO:2; and said second portion comprising another polypeptide.
Within yet another aspect the invention provides a pharmaceutical composition comprising a polypeptide as described above, in combination with a pharmaceutically acceptable vehicle.
Within still another aspect is provided an antibody or antibody fragment that specifically binds to a polypeptide as described above. Within one embodiment the antibody is selected from the group consisting of: a) polyclonal antibody; b) murine monoclonal antibody; c) humanized antibody derived from b); and d) human monoclonal antibody. Within another embodiment the antibody fragment is selected from the group consisting of F(ab'), F(ab), Fab', Fab, Fv, scFv, and minimal recognition unit. Within still another embodiment is provided an anti-idiotype antibody that specifically binds to the antibody described above.
Also provided is a binding protein that specifically binds to an epitope of a polypeptide as described above.
Within another aspect of the invention is provided an isolated polynucleotide selected from the group consisting of: a) a polynucleotide encoding a polypeptide comprising a sequence of amino acid residues that is at least 80% identical in amino acid sequence to residues 1 through 876 of SEQ ID NO:2; b) a polynucleotide comprising the nucleotide sequence of SEQ ID NO:5; c) a polynucleotide that remains hybridized following stringent wash conditions to a polynucleotide consisting of the nucleotide sequence of SEQ ID NO:1, or the complement of SEQ ID NO:1. Within one embodiment the sequence of amino acid residues is at least 90% identical. Within another embodiment any difference between the amino acid sequence encoded by the polynucleotide and the corresponding amino acid sequence of SEQ ID NO:2 is due to a conservative amino acid substitution. Within yet another embodiment the polynucleotide comprises nucleotide 127 to nucleotide 2754 of SEQ ID NO:1. Within still another embodiment the polynucleotide is DNA.
Within another aspect the invention provides an expression vector comprising the following operably linked elements: a transcription promoter; a DNA segment consisting of a polynucleotide as described above; and a transcriptional terminator. Within one embodiment the sequence of amino acid residues is at least 90% identical.
Within another embodiment any difference between the amino acid sequence encoded by the polynucleotide and the corresponding amino acid sequence of SEQ ID NO:2 is due to a conservative amino acid substitution. Within another embodiment the DNA segment encodes a polypeptide covalently linked to an affinity tag selected from the group consisting of polyhistidine, Glu-Glu, glutathione S transferase and an immunoglobulin heavy chain constant region. Within yet another embodiment the expression vector further comprises a secretory signal sequence operably linked to said DNA segment.
Also provided is a cultured cell into which has been introduced an expression vector as described above, wherein the cell expresses the polypeptide encoded by the DNA segment.
Within a further aspect the invention provides a method of producing a ZTMPO-1 polypeptide comprising:
culturing a cell into which has been introduced an expression vector as described above, whereby the cell expresses the polypeptide encoded by the DNA segment; and recovering the expressed polypeptide.
Also provided by the invention is a method for detecting a genetic abnormality in a patient, comprising:
obtaining a genetic sample from a patient; incubating the genetic sample with a polynucleotide comprising at least 14 contiguous nucleotides of SEQ ID NO:1 or the complement of SEQ ID NO:1, under conditions wherein said polynucleotide will hybridize to complementary polynucleotide sequence, to produce a first reaction product; comparing said first reaction product to a control reaction product, wherein a difference between said first reaction product and said control reaction product is indicative of a genetic abnormality in the patient.
These and other aspects of the invention will become evident upon reference to the following detailed description of the invention and attached drawing.
BRIEF DESCRIPTION OF THE DRAWINGS
The figure shows a multiple amino acid sequence alignment for ZTMPO-1 (SEQ ID NO:2), human emerin (EMD HU) Bione et al., Nat. Genet. 8:323-27, 1994 (SEQ ID NO:3), human thymopoietin α (PIR A5) Harris et al., Proc. Natl. Acad. Sci. USA 91:6283-7, 1994 (SEQ ID NO:4), human thymopoietin β (PIR B5) Harris et al., ibid. (SEQ ID NO:30) and human thymopoietin γ (PIR-C5) Harris et al., ibid. (SEQ ID NO:31).
DETAILED DESCRIPTION OF THE INVENTION
Prior to setting forth the invention in detail, it may be helpful to the understanding thereof to define the following terms:
The term "affinity tag" is used herein to denote a polypeptide segment that can be attached to a second polypeptide to provide for purification of the second polypeptide or provide sites for attachment of the second polypeptide to a substrate. In principle, any peptide or protein for which an antibody or other specific binding agent is available can be used as an affinity tag.
Affinity tags include a poly-histidine tract, protein A (Nilsson et al., EMBO J. 4:1075, 1985; Nilsson et al., Methods Enzymol. 198:3, 1991), glutathione S transferase (Smith and Johnson, Gene 67:31, 1988), Glu-Glu affinity tag, substance P, Flag™ peptide (Hopp et al., Biotechnology 6:1204-10, 1988), streptavidin binding peptide, or other antigenic epitope or binding domain. See, in general, Ford et al., Protein Expression and Purification 2:95-107, 1991. DNAs encoding affinity tags are available from commercial suppliers (e.g., Pharmacia Biotech, Piscataway, NJ).
The term "allelic variant" is used herein to denote any of two or more alternative forms of a gene occupying the same chromosomal locus. Allelic variation arises naturally through mutation, and may result in phenotypic polymorphism within populations. Gene mutations can be silent (no change in the encoded polypeptide) or may encode polypeptides having altered amino acid sequence. The term allelic variant is also used herein to denote a protein encoded by an allelic variant of a gene.
The terms "amino-terminal" and "carboxyl-terminal" are used herein to denote positions within polypeptides. Where the context allows, these terms are used with reference to a particular sequence or portion of a polypeptide to denote proximity or relative position.
For example, a certain sequence positioned carboxyl-terminal to a reference sequence within a polypeptide is located proximal to the carboxyl terminus of the reference sequence, but is not necessarily at the carboxyl terminus of the complete polypeptide.
The term "complements of a polynucleotide molecule" is a polynucleotide molecule having a complementary base sequence and reverse orientation as compared to a reference sequence. For example, the sequence 5' ATGCACGGG 3' is complementary to 5' CCCGTGCAT 3'.
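The complement definition above can be checked with a short sketch. The function name `reverse_complement` is an illustrative assumption, not something named in the specification, and the code handles only the four unambiguous DNA bases.

```python
# Illustrative sketch (not part of the specification): compute the
# complement of a DNA sequence in reverse orientation, as defined above.
# Only unambiguous bases A, C, G, T are handled.

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand of `seq`, written 5' to 3'."""
    # Complement each base, then reverse so the result reads 5' to 3'.
    return seq.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    # The example given in the text: 5' ATGCACGGG 3' vs. 5' CCCGTGCAT 3'.
    print(reverse_complement("ATGCACGGG"))
```

Applying the function twice returns the original sequence, which is a quick sanity check on any complement routine.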
(Nilsson et al., EMBO J. 4:1075, 1985; Nilsson et al., Methods Enzymol. 198:3, 1991), glutathione S transferase (Smith and Johnson, Gene 67:31, 1988), Glu-Glu affinity tag, substance P, FlagTM peptide (Hopp et al., BiotechnoloQV 6:1204-10, 1988), streptavidin binding peptide, or other antigenic epitope or binding domain.
See, in general, Ford et al., Protein Expression and Purification 2: 95-107, 1991. DNAs encoding affinity tags are available from commercial suppliers (e. g., Pharmacies Biotech, Piscataway, NJ).
The term "allelic variant" is used herein to denote any of two or more alternative forms of a gene occupying the same chromosomal locus. Allelic variation arises naturally through mutation, and may result in phenotypic polymorphism within populations. Gene mutations can be silent (no change in the encoded polypeptide) or may encode polypeptides having altered amino acid sequence. The term allelic variant is also used herein to denote a protein encoded by an allelic variant of a gene.
The terms "amino-terminal" and "carboxyl-terminal" are used herein to denote positions within polypeptides. Where the context allows, these terms are used with reference to a particular sequence or portion of a polypeptide to denote proximity or relative position.
For example, a certain sequence positioned carboxyl-terminal to a reference sequence within a polypeptide is located proximal to the carboxyl terminus of the reference sequence, but is not necessarily at the carboxyl terminus of the complete polypeptide.
The term "complements of a polynucleotide molecule" denotes polynucleotide molecules having a complementary base sequence and reverse orientation as compared to a reference sequence. For example, the sequence 5' ATGCACGGG 3' is complementary to 5' CCCGTGCAT 3'.
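The definition above can be illustrated with a minimal Python sketch. The function name `reverse_complement` is chosen here for illustration and does not appear in the specification; the sketch assumes an unambiguous DNA sequence written 5' to 3'.

```python
# Watson-Crick pairing for unambiguous DNA bases.
BASE_PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(sequence: str) -> str:
    """Return the complement of a 5'->3' DNA sequence, also written 5'->3'."""
    # Reverse the orientation first, then swap each base for its partner.
    return "".join(BASE_PAIRS[base] for base in reversed(sequence.upper()))


# The example given in the definition: 5' ATGCACGGG 3' pairs with 5' CCCGTGCAT 3'.
print(reverse_complement("ATGCACGGG"))  # CCCGTGCAT
```

Applying the function twice returns the original sequence, which is a quick consistency check on any complement routine.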
The term "contig" denotes a polynucleotide that has a contiguous stretch of identical or complementary sequence to another polynucleotide. Contiguous sequences are said to "overlap" a given stretch of polynucleotide sequence either in their entirety or along a partial stretch of the polynucleotide. For example, representative contigs to the polynucleotide sequence 5'-ATGGCTTAGCTT-3' are 5'-TAGCTTgagtct-3' and 3'-gtcgacTACCGA-5'.
The term "degenerate nucleotide sequence" denotes a sequence of nucleotides that includes one or more degenerate codons (as compared to a reference polynucleotide molecule that encodes a polypeptide). Degenerate codons contain different triplets of nucleotides, but encode the same amino acid residue (e.g., GAU and GAC triplets each encode Asp).
The term "expression vector" is used to denote a DNA molecule, linear or circular, that comprises a segment encoding a polypeptide of interest operably linked to additional segments that provide for its transcription.
Such additional segments include promoter and terminator sequences, and may also include one or more origins of replication, one or more selectable markers, an enhancer, a polyadenylation signal, etc. Expression vectors are generally derived from plasmid or viral DNA, or may contain elements of both.
The term "isolated", when applied to a polynucleotide, denotes that the polynucleotide has been removed from its natural genetic milieu and is thus free of other extraneous or unwanted coding sequences, and is in a form suitable for use within genetically engineered protein production systems. Such isolated molecules are those that are separated from their natural environment and include cDNA and genomic clones. Isolated DNA molecules of the present invention are free of other genes with which they are ordinarily associated, but may include naturally occurring 5' and 3' untranslated regions such as promoters and terminators. The identification of associated regions will be evident to one of ordinary skill in the art (see for example, Dynan and Tjian, Nature 316:774-78, 1985).
An "isolated" polypeptide or protein is a polypeptide or protein that is found in a condition other than its native environment, such as apart from blood and animal tissue. In a preferred form, the isolated polypeptide is substantially free of other polypeptides, particularly other polypeptides of animal origin. It is preferred to provide the polypeptides in a highly purified form, i.e. greater than 95% pure, more preferably greater than 99% pure. When used in this context, the term "isolated" does not exclude the presence of the same polypeptide in alternative physical forms, such as dimers or alternatively glycosylated or derivatized forms.
The term "operably linked", when referring to DNA segments, indicates that the segments are arranged so that they function in concert for their intended purposes, e.g., transcription initiates in the promoter and proceeds through the coding segment to the terminator.
The term "ortholog" denotes a polypeptide or protein obtained from one species that is the functional counterpart of a polypeptide or protein from a different species. Sequence differences among orthologs are the result of speciation.
A "polynucleotide" is a single- or double-stranded polymer of deoxyribonucleotide or ribonucleotide bases read from the 5' to the 3' end. Polynucleotides include RNA and DNA, and may be isolated from natural sources, synthesized in vitro, or prepared from a combination of natural and synthetic molecules. Sizes of polynucleotides are expressed as base pairs (abbreviated "bp"), nucleotides ("nt"), or kilobases ("kb"). Where the context allows, the latter two terms may describe polynucleotides that are single-stranded or double-stranded. When the term is applied to double-stranded molecules it is used to denote overall length and will be understood to be equivalent to the term "base pairs". It will be recognized by those skilled in the art that the two strands of a double-stranded polynucleotide may differ slightly in length and that the ends thereof may be staggered as a result of enzymatic cleavage; thus all nucleotides within a double-stranded polynucleotide molecule may not be paired. Such unpaired ends will in general not exceed 20 nt in length.
A "polypeptide" is a polymer of amino acid residues joined by peptide bonds, whether produced naturally or synthetically. Polypeptides of less than about 10 amino acid residues are commonly referred to as "peptides".
"Probes and/or primers" as used herein can be RNA or DNA. DNA can be either cDNA or genomic DNA.
Polynucleotide probes and primers are single- or double-stranded DNA or RNA, generally synthetic oligonucleotides, but may be generated from cloned cDNA or genomic sequences or their complements. Analytical probes will generally be at least 20 nucleotides in length, although somewhat shorter probes (14-17 nucleotides) can be used. PCR primers are at least 5 nucleotides in length, preferably 15 or more nt, more preferably 20-30 nt. Short polynucleotides can be used when a small region of the gene is targeted for analysis. For gross analysis of genes, a polynucleotide probe may comprise an entire exon or more. Probes can be labeled to provide a detectable signal, such as with an enzyme, biotin, a radionuclide, fluorophore, chemiluminescer, paramagnetic particle and the like, which are commercially available from many sources, such as Molecular Probes, Inc., Eugene, OR, and Amersham Corp., Arlington Heights, IL, using techniques that are well known in the art. Examples of ZTMPO-1 probes and primers include, but are not limited to, the sequences disclosed herein as SEQ ID NOs:6-29.
The term "promoter" is used herein for its art-recognized meaning to denote a portion of a gene containing DNA sequences that provide for the binding of RNA polymerase and initiation of transcription. Promoter sequences are commonly, but not always, found in the 5' non-coding regions of genes.
A "protein" is a macromolecule comprising one or more polypeptide chains. A protein may also comprise non-peptidic components, such as carbohydrate groups.
Carbohydrates and other non-peptidic substituents may be added to a protein by the cell in which the protein is produced, and will vary with the type of cell. Proteins are defined herein in terms of their amino acid backbone structures; substituents such as carbohydrate groups are generally not specified, but may be present nonetheless.
The term "receptor" denotes a cell-associated protein that binds to a bioactive molecule (i.e., a ligand) and mediates the effect of the ligand on the cell.
Membrane-bound receptors are characterized by a multi-domain structure comprising an extracellular ligand-binding domain and an intracellular effector domain that is typically involved in signal transduction. Binding of ligand to receptor results in a conformational change in the receptor that causes an interaction between the effector domain and other molecule(s) in the cell. This interaction in turn leads to an alteration in the metabolism of the cell. Metabolic events that are linked to receptor-ligand interactions include gene transcription, phosphorylation, dephosphorylation, increases in cyclic AMP production, mobilization of cellular calcium, mobilization of membrane lipids, cell adhesion, hydrolysis of inositol lipids and hydrolysis of phospholipids. In general, receptors can be membrane bound, cytosolic or nuclear; monomeric (e.g., thyroid stimulating hormone receptor, beta-adrenergic receptor) or multimeric (e.g., PDGF receptor, growth hormone receptor, IL-3 receptor, GM-CSF receptor, G-CSF receptor, erythropoietin receptor and IL-6 receptor).
The term "secretory signal sequence" denotes a DNA sequence that encodes a polypeptide (a "secretory peptide") that, as a component of a larger polypeptide, directs the larger polypeptide through a secretory pathway of a cell in which it is synthesized. The larger polypeptide is commonly cleaved to remove the secretory peptide during transit through the secretory pathway.
The term "splice variant" is used herein to denote alternative forms of RNA transcribed from a gene. Splice variation arises naturally through use of alternative splicing sites within a transcribed RNA molecule, or less commonly between separately transcribed RNA molecules, and may result in several mRNAs transcribed from the same gene. Splice variants may encode polypeptides having altered amino acid sequence. The term splice variant is also used herein to denote a protein encoded by a splice variant of an mRNA transcribed from a gene.
Molecular weights and lengths of polymers determined by imprecise analytical methods (e.g., gel electrophoresis) will be understood to be approximate values. When such a value is expressed as "about" X or "approximately" X, the stated value of X will be understood to be accurate to ±10%.
The present invention is based in part upon the discovery of a novel protein having regions of homology to members of the thymopoietin-emerin family of nuclear membrane proteins. This protein has been designated "ZTMPO-1". The human ZTMPO-1 nucleotide sequence is represented in SEQ ID NO:1 and the deduced amino acid sequence in SEQ ID NO:2. The ZTMPO-1 proteins and polypeptides encoded by polynucleotides of the present invention were initially identified by querying an EST (Expressed Sequence Tag) database for sequences homologous to conserved motifs within the thymopoietin family.
ZTMPO-1 as represented in SEQ ID NO:1 is a 2,754 bp polynucleotide which has an open reading frame encoding an 876 amino acid residue protein. Sequence analysis of the deduced amino acid sequence as represented in SEQ ID NO:2 does not indicate the presence of a secretion signal sequence or transmembrane domain. There is a putative ankyrin-like region, amino acid residues 333-385 of SEQ ID NO:2, having an ankyrin repeat (residues 347-379 of SEQ ID NO:2), which may indicate that ZTMPO-1 is retained in the plasma membrane. Ankyrin repeats have been described as a 33 amino acid motif, usually found in tandem arrays of four to seven copies, that mediates protein interactions (Michaely and Bennett, J. Biol. Chem. 268:22703-9, 1993).
Ankyrin repeats have been reported in numerous proteins in species from bacteria to man (Sentenac et al., Science 256:663-5, 1992; Zhang et al., Plant Cell 4:1575-88, 1992; Gustine et al., Plant Physiol. 108:1748, 1995; Andrews and Herskowitz, Nature 342:830-3, 1989; Warton et al., Cell 43:567-81, 1995 and Yochem and Greenwald, Cell 58:53-63, 1989). Ankyrin repeats have been proposed as a generalized protein binding motif; one function of ankyrin repeats is to serve as adaptors, associating with the spectrin-based cytoplasmic skeleton and membrane proteins. Ankyrin is used as a membrane attachment site in neurons and may provide a transport mechanism through secretory vesicles.
At the C-terminal end of ZTMPO-1 is a calcium binding protein-like region having two potential calcium binding sites (residues 678-692 and residues 719-731 of SEQ ID NO:2) similar to that seen in the sea urchin calcium binding protein LPS1-beta (Xiang et al., J. Biol. Chem. 16:10524-33, 1991).
The ZTMPO-1 polynucleotide of SEQ ID NO:1 encodes an 876 amino acid residue protein which is much larger than other members of the thymopoietin/emerin family. Human thymopoietin α is a 693 amino acid residue protein, human emerin is a 254 amino acid residue protein and rat LAP2 is a 452 amino acid residue protein.
Like emerin, the amino acid sequence of ZTMPO-1 does not contain the 42 amino acid thymopoietin peptide originally identified by Goldstein (Nature 247:11-14, 1974) but shares discrete regions of homology with the human thymopoietins α, β and γ (Harris et al., ibid., Genbank Accession Nos. α (U09086), β (U09087) and γ (U09088)) and the mouse thymopoietins α, β, γ, ε, δ and ζ (Berger et al., ibid., Genbank Accession Nos. α (U39078), β (U39074), γ (U39077), ε (U39074), δ (U39076) and ζ (U39073)). In particular, over the region defined by amino acid residues 13 to 44 of SEQ ID NO:2, ZTMPO-1 shares 50% amino acid identity with the corresponding regions of the mouse and human thymopoietins and 30% with human emerin. The region defined by amino acid residues 30-44 of SEQ ID NO:2 is highly conserved between the proteins; see Figure.
As would be expected, ZTMPO-1 also shares discrete regions of homology with rat lamina associated protein 2, LAP2 (Furukawa et al., ibid., Genbank Accession No. U18314). These regions correspond to many of the same regions with which ZTMPO-1 shares identity with the thymopoietins. ZTMPO-1 and rat LAP2 share 70% amino acid identity over the region corresponding to amino acid residues 13 to 44 of SEQ ID NO:2.
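The identity percentages quoted above are ratios of matching positions to aligned positions. The following Python sketch shows one simple way to compute such a figure for two pre-aligned, equal-length segments. The function name and the two toy sequences are invented for illustration; they are not ZTMPO-1, thymopoietin, emerin or LAP2 sequences, and the specification does not state which alignment method produced its figures.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of positions carrying the same residue in two aligned segments."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("segments must be aligned to the same length")
    if not aligned_a:
        raise ValueError("segments must not be empty")
    # Count the columns where both segments carry the same residue.
    matches = sum(1 for a, b in zip(aligned_a, aligned_b) if a == b)
    return 100.0 * matches / len(aligned_a)


# Toy 10-residue segments (hypothetical): 8 of 10 positions match.
print(percent_identity("ACDEFGHIKL", "ACDQFGHIKV"))  # 80.0
```

On this definition, 50% identity over the 32-residue window of residues 13 to 44 corresponds to 16 matching positions.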
ZTMPO-1 also shares a limited degree of homology to regions of the yeast transcription factor IIF alpha subunit over the region corresponding to amino acid residues 86 to 160 and amino acid residues 205 to 260 of SEQ ID NO:2.
Additionally, ZTMPO-1 shares 27% amino acid identity with Trypanosoma brucei ribonuclease H1 (Hesslein and Campbell, Mol. Biochem. Parasitol. 86:221-6, 1997, Genbank Accession No. U74470) over the region corresponding to amino acid residues 156 to 203 of SEQ ID NO:2. This homology, along with that shared with LAP2, as well as the possible ankyrin repeat, suggests the possibility that ZTMPO-1 possesses chromatin or DNA binding properties.
Those skilled in the art will recognize that these domain boundaries are approximate, and are based on alignments with known proteins and predictions of protein folding.
Northern blot analysis of various human tissues was performed using a 218 bp human DNA probe (SEQ ID NO:8). Transcripts of 3.2 kb and 5 kb corresponding to ZTMPO-1 were ubiquitously expressed, with the highest level being in testis tissue. Similar ubiquitous expression patterns were also reported for the thymopoietins and emerin (Harris et al., ibid. and Small et al., ibid.).
Chromosomal localization results show that ZTMPO-1 maps 636.18 cR_3000 from the top of the human chromosome 12 linkage group on the WICGR radiation hybrid map. The proximal framework marker was D12S367. The use of surrounding markers positions ZTMPO-1 in the 12q24.33 region on the integrated LDB chromosome 12 map. Among the genes mapping around this region are insulin-like growth factor 1, which is involved in growth and development; melanin concentrating hormone, a neuropeptide associated with goal-associated behaviors and general arousal (Nahon et al., Genomics 12: 846-8, 1992); spinal muscular atrophy, a nonprogressive muscular atrophy involving mainly the lower extremities (van Ravenswaaij et al., Am. J. Hum. Genet. 61 (suppl.): A299, 1997); spinal muscular atrophy 4 (Timmerman et al., Hum. Molec. Genet. 5: 1065-9, 1996); and myosin regulatory light chain, which is involved in regulation of myosin ATPase activity in smooth muscle (Macera et al., Genomics 13: 829-31, 1992). Thymopoietin maps to chromosome 12q22 (Harris et al., ibid.).
The nucleotide sequences encoding regions of conserved amino acid residues between ZTMPO-1 and nuclear proteins such as the thymopoietins, LAP2 and emerin, for example, the region between nucleotides 163 and 258 of SEQ ID NO:1, in particular the region between nucleotides 214 and 258 of SEQ ID NO:1, can be used as a tool to identify new family members. For instance, reverse transcription-polymerase chain reaction (RT-PCR) can be used to amplify sequences encoding these conserved regions from RNA obtained from a variety of tissue sources or cell lines. In particular, highly degenerate primers designed from the ZTMPO-1 sequences are useful for this purpose.
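The nucleotide ranges above line up with the conserved amino acid windows cited earlier (residues 13 to 44 and residues 30 to 44 of SEQ ID NO:2) if the open reading frame is taken to begin at nucleotide 127 of SEQ ID NO:1. That start position is an inference from the stated ranges, not a value given in the text, and the helper name below is likewise illustrative. A short Python sketch of the arithmetic:

```python
CODON_LENGTH = 3

# Assumption: first nucleotide of the start codon in SEQ ID NO:1, inferred
# from the residue and nucleotide ranges quoted in the text.
ASSUMED_CDS_START = 127


def residues_to_nucleotides(first_residue: int, last_residue: int,
                            cds_start: int = ASSUMED_CDS_START) -> tuple:
    """Map a 1-based residue range to the inclusive 1-based nucleotide range."""
    start = cds_start + CODON_LENGTH * (first_residue - 1)
    end = cds_start + CODON_LENGTH * last_residue - 1
    return start, end


print(residues_to_nucleotides(13, 44))  # (163, 258)
print(residues_to_nucleotides(30, 44))  # (214, 258)
print(876 * CODON_LENGTH)               # 2628 coding nucleotides for 876 residues
```

The last line reproduces the 2628-nucleotide length given for the degenerate sequence of SEQ ID NO:5, which covers the 876 codons without a stop codon.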
The present invention also provides polynucleotide molecules, including DNA and RNA molecules, that encode the ZTMPO-1 polypeptides disclosed herein.
Those skilled in the art will readily recognize that, in view of the degeneracy of the genetic code, considerable sequence variation is possible among these polynucleotide molecules. SEQ ID NO:5 is a degenerate DNA sequence that encompasses all DNAs that encode the ZTMPO-1 polypeptide of SEQ ID NO:2. Those skilled in the art will recognize that the degenerate sequence of SEQ ID NO:5 also provides all RNA sequences encoding SEQ ID NO:2 by substituting U (uracil) for T (thymine). Thus, ZTMPO-1 polypeptide-encoding polynucleotides comprising nucleotide 1 to nucleotide 2628 of SEQ ID NO:5 and their RNA equivalents are contemplated by the present invention. Table 1 sets forth the one-letter codes used within SEQ ID NO:5 to denote degenerate nucleotide positions. "Resolutions" are the nucleotides denoted by a code letter. "Nucleotide Complement" indicates the code for the complementary nucleotide(s). For example, the code Y denotes either C (cytosine) or T, and its complement R denotes A (adenine) or G (guanine), A being complementary to T, and G being complementary to C.
Table 1

Nucleotide                  Complement
Base Code    Resolutions    Base Code    Resolutions
A            A              T            T
C            C              G            G
G            G              C            C
T            T              A            A
R            A|G            Y            C|T
Y            C|T            R            A|G
M            A|C            K            G|T
K            G|T            M            A|C
S            C|G            S            C|G
W            A|T            W            A|T
H            A|C|T          D            A|G|T
B            C|G|T          V            A|C|G
V            A|C|G          B            C|G|T
D            A|G|T          H            A|C|T
N            A|C|G|T        N            A|C|G|T
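The code letters of Table 1 can be handled programmatically. The following is a minimal sketch, assuming the resolutions listed in Table 1; the function names (complement_code, reverse_complement, to_rna, degeneracy) are hypothetical and chosen only for illustration. It derives the complement of each code letter from its resolutions, converts a DNA sequence to its RNA equivalent, and counts how many distinct sequences a degenerate sequence (for example, a degenerate primer) represents.

```python
# Degenerate nucleotide codes and their resolutions, as listed in Table 1.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "M": "AC", "K": "GT",
    "S": "CG", "W": "AT", "H": "ACT", "B": "CGT",
    "V": "ACG", "D": "AGT", "N": "ACGT",
}
BASE_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

# Reverse lookup: alphabetically sorted resolutions -> code letter.
CODE_FOR = {"".join(sorted(bases)): code for code, bases in IUPAC.items()}


def complement_code(code: str) -> str:
    """Return the code letter whose resolutions complement those of `code`."""
    complemented = "".join(sorted(BASE_COMPLEMENT[b] for b in IUPAC[code]))
    return CODE_FOR[complemented]


def reverse_complement(seq: str) -> str:
    """Reverse complement of a (possibly degenerate) DNA sequence."""
    return "".join(complement_code(c) for c in reversed(seq.upper()))


def to_rna(seq: str) -> str:
    """RNA equivalent of a DNA sequence: substitute U for T."""
    return seq.upper().replace("T", "U")


def degeneracy(seq: str) -> int:
    """Number of distinct non-degenerate sequences represented by `seq`."""
    count = 1
    for c in seq.upper():
        count *= len(IUPAC[c])
    return count


if __name__ == "__main__":
    print(complement_code("Y"))       # complement of Y (C|T)
    print(reverse_complement("TGY"))  # reverse complement of a degenerate codon
    print(degeneracy("WSN"))          # sequences covered by the codon WSN
```

Because the complement of each code is computed from its resolutions rather than typed in by hand, the right-hand columns of Table 1 serve as a check on the left-hand columns.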
The degenerate codons used in SEQ ID NO:5, encompassing all possible codons for a given amino acid, are set forth in Table 2.
Table 2

Three          One
Letter Code    Letter Code    Synonymous Codons          Degenerate Codon
Cys            C              TGC TGT                    TGY
Ser            S              AGC AGT TCA TCC TCG TCT    WSN
Thr            T              ACA ACC ACG ACT            ACN
Pro            P              CCA CCC CCG CCT            CCN
Ala            A              GCA GCC GCG GCT            GCN
Gly            G              GGA GGC GGG GGT            GGN
Asn            N              AAC AAT                    AAY
Asp            D              GAC GAT                    GAY
Glu            E              GAA GAG                    GAR
Gln            Q              CAA CAG                    CAR
His            H              CAC CAT                    CAY
Arg            R              AGA AGG CGA CGC CGG CGT    MGN
Lys            K              AAA AAG                    AAR
Met            M              ATG                        ATG
Ile            I              ATA ATC ATT                ATH
Leu            L              CTA CTC CTG CTT TTA TTG    YTN
Val            V              GTA GTC GTG GTT            GTN
Phe            F              TTC TTT                    TTY
Tyr            Y              TAC TAT                    TAY
Trp            W              TGG                        TGG
Ter            .              TAA TAG TGA                TRR
Asn|Asp        B                                         RAY
Glu|Gln        Z                                         SAR
Any            X                                         NNN
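The use of Table 2 to write a fully degenerate polynucleotide sequence for a given polypeptide can be sketched as follows. This is an illustration only, assuming the one-letter codes and degenerate codons of Table 2; the function name back_translate is hypothetical, and the short peptide in the example is arbitrary and is not taken from SEQ ID NO:2.

```python
# Degenerate codon for each one-letter amino acid code, as listed in Table 2.
# "." denotes a termination codon, following the table.
DEGENERATE_CODON = {
    "C": "TGY", "S": "WSN", "T": "ACN", "P": "CCN", "A": "GCN",
    "G": "GGN", "N": "AAY", "D": "GAY", "E": "GAR", "Q": "CAR",
    "H": "CAY", "R": "MGN", "K": "AAR", "M": "ATG", "I": "ATH",
    "L": "YTN", "V": "GTN", "F": "TTY", "Y": "TAY", "W": "TGG",
    ".": "TRR", "B": "RAY", "Z": "SAR", "X": "NNN",
}


def back_translate(protein: str) -> str:
    """Return the degenerate DNA sequence covering the codons of Table 2."""
    try:
        return "".join(DEGENERATE_CODON[aa] for aa in protein.upper())
    except KeyError as err:
        raise ValueError(f"unknown amino acid code: {err.args[0]!r}") from None


if __name__ == "__main__":
    # Arbitrary example peptide (Met-Ser-Leu-Glu), for illustration only.
    print(back_translate("MSLE"))
```

Each residue contributes exactly three nucleotide positions, so the degenerate sequence is always three times the length of the polypeptide.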
One of ordinary skill in the art will appreciate that some ambiguity is introduced in determining a degenerate codon, representative of all possible codons encoding each amino acid. For example, the degenerate codon for serine (WSN) can, in some circumstances, encode arginine (AGR), and the degenerate codon for arginine (MGN) can, in some circumstances, encode serine (AGY). A similar relationship exists between codons encoding phenylalanine and leucine. Thus, some polynucleotides encompassed by the degenerate sequence may encode variant amino acid sequences, but one of ordinary skill in the art can easily identify such variant sequences by reference to the amino acid sequence of SEQ ID NO:2. Variant sequences can be readily tested for functionality as described herein.
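The ambiguity described above can be made explicit by expanding a degenerate codon into each of its resolved codons and translating every one. The following is a minimal sketch, assuming the standard genetic code; the function name encoded_amino_acids is hypothetical. Termination codons are reported as "*".

```python
from itertools import product

# Degenerate nucleotide codes and their resolutions (Table 1).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "M": "AC", "K": "GT",
    "S": "CG", "W": "AT", "H": "ACT", "B": "CGT",
    "V": "ACG", "D": "AGT", "N": "ACGT",
}

# Standard genetic code (an assumption of this sketch), written in the
# conventional T, C, A, G order for all three codon positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def encoded_amino_acids(degenerate_codon: str) -> set:
    """Set of one-letter residues ("*" = stop) reachable from the codon."""
    options = [IUPAC[c] for c in degenerate_codon.upper()]
    return {CODON_TABLE["".join(triplet)] for triplet in product(*options)}


if __name__ == "__main__":
    for codon in ("MGN", "YTN", "WSN", "ACN"):
        print(codon, sorted(encoded_amino_acids(codon)))
```

A degenerate codon whose set contains a single residue, such as ACN for threonine, introduces no ambiguity; a set with more than one member marks a position where the encoded sequence should be checked against SEQ ID NO:2.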
One of ordinary skill in the art will also appreciate that different species can exhibit "preferential codon usage." In general, see, Grantham, et al., Nuc. Acids Res., 8:1893-912, 1980; Haas, et al., Curr. Biol., 6:315-24, 1996; Wain-Hobson, et al., Gene, 13:355-64, 1981; Grosjean and Fiers, Gene, 18:199-209, 1982; Holm, Nuc. Acids Res., 14:3075-87, 1986; Ikemura, J. Mol. Biol., 158:573-97, 1982. As used herein, the term "preferential codon usage" or "preferential codons" is a term of art referring to protein translation codons that are most frequently used in cells of a certain species, thus favoring one or a few representatives of the possible codons encoding each amino acid (See Table 2). For example, the amino acid threonine (Thr) may be encoded by ACA, ACC, ACG, or ACT, but in mammalian cells ACC is the most commonly used codon; in other species, for example, insect cells, yeast, viruses or bacteria, different Thr codons may be preferential. Preferential codons for a particular species can be introduced into the polynucleotides of the present invention by a variety of methods known in the art. Introduction of preferential codon sequences into recombinant DNA can, for example, enhance production of the protein by making protein translation more efficient within a particular cell type or species. Therefore, the degenerate codon sequence disclosed in SEQ ID NO:5 serves as a template for optimizing expression of polynucleotides in various cell types and species commonly used in the art and disclosed herein. Sequences containing preferential codons can be tested and optimized for expression in various species, and tested for functionality as disclosed herein.
The present invention also provides polypeptide fragments or peptides comprising an epitope-bearing portion of a ZTMPO-1 polypeptide described herein. Such fragments or peptides may comprise an "immunogenic epitope," which is a part of a protein that elicits an antibody response when the entire protein is used as an immunogen. Immunogenic epitope-bearing peptides can be identified using standard methods (see, for example, Geysen et al., Proc. Nat. Acad. Sci. USA 81:3998, 1983).
In contrast, polypeptide fragments or peptides may comprise an "antigenic epitope," which is a region of a protein molecule to which an antibody can specifically bind. Certain epitopes consist of a linear or contiguous stretch of amino acids, and the antigenicity of such an epitope is not disrupted by denaturing agents. It is known in the art that relatively short synthetic peptides that can mimic epitopes of a protein can be used to stimulate the production of antibodies against the protein (see, for example, Sutcliffe et al., Science 219:660, 1983).
Accordingly, antigenic epitope-bearing peptides and polypeptides of the present invention are useful to raise antibodies that bind with the polypeptides described herein.
Antigenic epitope-bearing peptides and polypeptides preferably contain at least four to ten amino acids, at least ten to fifteen amino acids, or about 15 to about 30 amino acids of SEQ ID NO:2. Such epitope-bearing peptides and polypeptides can be produced by fragmenting a ZTMPO-1 polypeptide, or by chemical peptide synthesis, as described herein. Moreover, epitopes can be selected by phage display of random peptide libraries (see, for example, Lane and Stephen, Curr. Opin. Immunol. 5:268, 1993, and Cortese et al., Curr. Opin. Biotechnol. 7:616, 1996). Standard methods for identifying epitopes and producing antibodies from small peptides that comprise an epitope are described, for example, by Mole, "Epitope Mapping," in Methods in Molecular Biology, Vol. 10, Manson (ed.), pages 105-116 (The Humana Press, Inc. 1992), Price, "Production and Characterization of Synthetic Peptide-Derived Antibodies," in Monoclonal Antibodies: Production, Engineering, and Clinical Application, Ritter and Ladyman (eds.), pages 60-84 (Cambridge University Press 1995), and Coligan et al. (eds.), Current Protocols in Immunology, pages 9.3.1 - 9.3.5 and pages 9.4.1 - 9.4.11 (John Wiley & Sons 1997).
Potential antigenic sites in ZTMPO-1 can be identified using the Jameson-Wolf method (Jameson and Wolf, CABIOS 4:181, 1988), as implemented by the PROTEAN program (version 3.14) of LASERGENE (DNASTAR; Madison, WI). The Jameson-Wolf method predicts potential antigenic determinants by combining six major subroutines for protein structural prediction. Briefly, the Hopp-Woods method (Hopp et al., Proc. Nat. Acad. Sci. USA 78:3824, 1981) is first used to identify amino acid sequences representing areas of greatest local hydrophilicity (parameter: seven residues averaged). In the second step, Emini's method (Emini et al., J. Virology 55:836, 1985) is used to calculate surface probabilities (parameter: surface decision threshold (0.6) = 1). Third, the Karplus-Schultz method (Karplus and Schultz, Naturwissenschaften 72:212, 1985) is used to predict backbone chain flexibility (parameter: flexibility threshold (0.2) = 1). In the fourth and fifth steps of the analysis, secondary structure predictions are applied to the data using the methods of Chou-Fasman (Chou, "Prediction of Protein Structural Classes from Amino Acid Composition," in Prediction of Protein Structure and the Principles of Protein Conformation, Fasman (ed.), pages 549-586 (Plenum Press 1990)) and Garnier-Robson (Garnier et al., J. Mol. Biol. 120:97, 1978) (Chou-Fasman parameters: conformation table = 64 proteins; α region threshold = 103; β region threshold = 105; Garnier-Robson parameters: α and β decision constants = 0). In the sixth subroutine, flexibility parameters and hydropathy/solvent accessibility factors are combined to determine a surface contour value, designated as the "antigenic index." Finally, a peak broadening function is applied to the antigenic index, which broadens major surface peaks by adding 20, 40, 60, or 80% of the respective peak value to account for additional free energy derived from the mobility of surface regions relative to interior regions.
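The first step of the analysis above, averaging a per-residue hydrophilicity value over a window of seven residues, can be sketched as follows. This is an illustration of the windowed averaging only and is not the PROTEAN implementation; the function names are hypothetical, the caller must supply the per-residue scale, and the three-residue scale in the example holds placeholder values for illustration rather than a published scale.

```python
def windowed_mean(sequence: str, scale: dict, window: int = 7) -> list:
    """Mean scale value of every run of `window` consecutive residues."""
    values = [scale[aa] for aa in sequence.upper()]
    if len(values) < window:
        return []
    return [
        sum(values[i:i + window]) / window
        for i in range(len(values) - window + 1)
    ]


def most_hydrophilic_window(sequence: str, scale: dict, window: int = 7):
    """Return (start index, segment, mean) for the highest-scoring window."""
    means = windowed_mean(sequence, scale, window)
    if not means:
        raise ValueError("sequence is shorter than the averaging window")
    start = max(range(len(means)), key=means.__getitem__)
    return start, sequence[start:start + window], means[start]


if __name__ == "__main__":
    # Placeholder scale and sequence, for illustration only.
    toy_scale = {"K": 3.0, "L": -1.8, "G": 0.0}
    print(most_hydrophilic_window("LLLLKKKKKKKGGG", toy_scale))
```

The window length of seven matches the parameter stated above; the remaining subroutines (surface probability, flexibility, secondary structure) would each contribute a further per-residue profile to the combined index.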
Regardless of the particular nucleotide sequence of a variant ZTMPO-1 gene, the gene encodes a polypeptide that is characterized by its glycoprotein synthesis or cell-cell interaction activity, or by the ability to bind specifically to an anti-ZTMPO-1 antibody. More specifically, variant ZTMPO-1 genes encode polypeptides which exhibit at least 50%, and preferably, greater than 70, 80, or 90%, of the activity of the polypeptide encoded by the human ZTMPO-1 gene described herein.
For any ZTMPO-1 polypeptide, including variants and fusion proteins, one of ordinary skill in the art can readily generate a fully degenerate polynucleotide sequence encoding that variant using the information set forth in Tables 1 and 2 above. Moreover, those of skill in the art can use standard software to devise ZTMPO-1 variants based upon the nucleotide and amino acid sequences described herein. Accordingly, the present invention includes a computer-readable medium encoded with a data structure that provides at least one of the following sequences: SEQ ID NO:1, SEQ ID NO:2 or SEQ ID NO:5. Suitable forms of computer-readable media include magnetic media and optically-readable media. Examples of magnetic media include a hard or fixed drive, a random access memory (RAM) chip, a floppy disk, digital linear tape (DLT), a disk cache, and a ZIP disk. Optically readable media are exemplified by compact discs (e.g., CD-read only memory (ROM), CD-rewritable (RW), and CD-recordable), and digital versatile/video discs (DVD) (e.g., DVD-ROM, DVD-RAM, and DVD+RW).
Within preferred embodiments of the invention, the isolated polynucleotides can hybridize under stringent conditions to polynucleotides having the nucleotide sequence of SEQ ID NO:1 or to nucleic acid molecules having a nucleotide sequence complementary to SEQ ID NO:1.
In general, stringent conditions are selected to be about 5°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe.
A pair of nucleic acid molecules, such as DNA-DNA, RNA-RNA and DNA-RNA, can hybridize if the nucleotide sequences have some degree of complementarity. Hybrids can tolerate mismatched base pairs in the double helix, but the stability of the hybrid is influenced by the degree of mismatch. The Tm of the mismatched hybrid decreases by 1°C for every 1-1.5% base pair mismatch.
Varying the stringency of the hybridization conditions allows control over the degree of mismatch that will be present in the hybrid. The degree of stringency increases as the hybridization temperature increases and the ionic strength of the hybridization buffer decreases. Stringent hybridization conditions encompass temperatures of about 5-25°C below the Tm of the hybrid and a hybridization buffer having up to 1 M Na+. Higher degrees of stringency at lower temperatures can be achieved with the addition of formamide, which reduces the Tm of the hybrid about 1°C for each 1% formamide in the buffer solution. Generally, such stringent conditions include temperatures of 20-70°C and a hybridization buffer containing up to 6xSSC and 0-50% formamide. A higher degree of stringency can be achieved at temperatures of from 40-70°C with a hybridization buffer having up to 4xSSC and from 0-50% formamide. Highly stringent conditions typically encompass temperatures of 42-70°C with a hybridization buffer having up to 1xSSC and 0-50% formamide. Different degrees of stringency can be used during hybridization and washing to achieve maximum specific binding to the target sequence.
Typically, the washes following hybridization are performed at increasing degrees of stringency to remove non-hybridized polynucleotide probes from hybridized complexes.
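The two rules of thumb given above, a Tm decrease of 1°C for every 1-1.5% base pair mismatch and of about 1°C for each 1% formamide, can be combined into a simple estimate. The following is a minimal sketch under those stated approximations; the function name adjusted_tm_range and its parameters are hypothetical, and the Tm of the perfectly matched hybrid must be supplied by the user from one of the equations or programs known in the art.

```python
def adjusted_tm_range(perfect_match_tm_c: float,
                      mismatch_percent: float = 0.0,
                      formamide_percent: float = 0.0) -> tuple:
    """Estimate the (low, high) Tm in degrees C of a mismatched hybrid.

    Applies the approximations stated in the text:
      * about 1 degree C lower per 1% formamide in the buffer;
      * 1 degree C lower for every 1-1.5% base pair mismatch, which gives
        a range rather than a single value.
    """
    if mismatch_percent < 0 or formamide_percent < 0:
        raise ValueError("percentages must be non-negative")
    formamide_drop = 1.0 * formamide_percent
    smallest_mismatch_drop = mismatch_percent / 1.5
    largest_mismatch_drop = mismatch_percent / 1.0
    low = perfect_match_tm_c - formamide_drop - largest_mismatch_drop
    high = perfect_match_tm_c - formamide_drop - smallest_mismatch_drop
    return low, high


if __name__ == "__main__":
    # Hypothetical inputs: an 80 degree C perfect-match Tm, 3% mismatch,
    # and 50% formamide in the hybridization buffer.
    print(adjusted_tm_range(80.0, mismatch_percent=3.0, formamide_percent=50.0))
```

With these inputs the estimate falls between 27°C and 28°C, which illustrates how formamide allows stringent hybridization to be carried out at lower temperatures.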
The above conditions are meant to serve as a guide and it is well within the abilities of one skilled in the art to adapt these conditions for use with a particular polynucleotide hybrid. The Tm for a specific target sequence is the temperature (under defined conditions) at which 50% of the target sequence will hybridize to a perfectly matched probe sequence. Those conditions which influence the Tm include the size and base pair content of the polynucleotide probe, the ionic strength of the hybridization solution, and the presence of destabilizing agents in the hybridization solution. Numerous equations for calculating Tm are known in the art, and are specific for DNA, RNA and DNA-RNA hybrids and polynucleotide probe sequences of varying length (see, for example, Sambrook et al., Molecular Cloning: A Laboratory Manual, Second Edition (Cold Spring Harbor Press 1989); Ausubel et al., (eds.), Current Protocols in Molecular Biology (John Wiley and Sons, Inc. 1987); Berger and Kimmel (eds.), Guide to Molecular Cloning Techniques, (Academic Press, Inc. 1987); and Wetmur, Crit. Rev. Biochem. Mol. Biol. 26:227 (1990)). Sequence analysis software, such as OLIGO 6.0 (LSR; Long Lake, MN) and Primer Premier 4.0 (Premier Biosoft International; Palo Alto, CA), as well as sites on the Internet, are available tools for analyzing a given sequence and calculating Tm based on user defined criteria. Such programs can also analyze a given sequence under defined conditions and identify suitable probe sequences. Typically, hybridization of longer polynucleotide sequences, >50 base pairs, is performed at temperatures of about 20-25°C below the calculated Tm. For smaller probes, <50 base pairs, hybridization is typically carried out at the Tm or 5-10°C below. This allows for the maximum rate of hybridization for DNA-DNA and DNA-RNA hybrids.
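The probe-length guideline above can be expressed as a small helper. This is a sketch only; the function name hybridization_temperature_range is hypothetical, and because the text specifies ranges for probes of more than 50 and fewer than 50 base pairs, the treatment of a probe of exactly 50 base pairs as a smaller probe is an assumption of this example.

```python
def hybridization_temperature_range(tm_c: float, probe_length_bp: int) -> tuple:
    """Suggested (low, high) hybridization temperature in degrees C.

    Longer probes (>50 bp): about 20-25 degrees C below the calculated Tm.
    Smaller probes: at the Tm or 5-10 degrees C below it.
    A probe of exactly 50 bp is treated as a smaller probe (assumption).
    """
    if probe_length_bp <= 0:
        raise ValueError("probe length must be positive")
    if probe_length_bp > 50:
        return tm_c - 25.0, tm_c - 20.0
    return tm_c - 10.0, tm_c


if __name__ == "__main__":
    # Hypothetical calculated Tm values, for illustration only.
    print(hybridization_temperature_range(75.0, 800))
    print(hybridization_temperature_range(60.0, 20))
```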
The length of the polynucleotide sequence influences the rate and stability of hybrid formation.
Smaller probe sequences, <50 base pairs, reach equilibrium with complementary sequences rapidly, but may form less stable hybrids. Incubation times of anywhere from minutes to hours can be used to achieve hybrid formation. Longer probe sequences come to equilibrium more slowly, but form more stable complexes even at lower temperatures.
Incubations are typically allowed to proceed overnight or longer. Generally, incubations are carried out for a period equal to three times the calculated Cot time. Cot time, the time it takes for the polynucleotide sequences to reassociate, can be calculated for a particular sequence by methods known in the art.
The base pair composition of the polynucleotide sequence will affect the thermal stability of the hybrid complex, thereby influencing the choice of hybridization temperature and the ionic strength of the hybridization buffer. A-T pairs are less stable than G-C pairs in aqueous solutions containing sodium chloride. Therefore, the higher the G-C content, the more stable the hybrid. Even distribution of G and C residues within the sequence also contributes positively to hybrid stability. In addition, the base pair composition can be manipulated to alter the Tm of a given sequence. For example, 5-methyldeoxycytidine can be substituted for deoxycytidine and 5-bromodeoxyuridine can be substituted for thymidine to increase the Tm, whereas 7-deaza-2'-deoxyguanosine can be substituted for guanosine to reduce dependence on Tm.
The ionic concentration of the hybridization buffer also affects the stability of the hybrid. Hybridization buffers generally contain blocking agents such as Denhardt's solution (Sigma Chemical Co., St. Louis, Mo.), denatured salmon sperm DNA, tRNA, milk powders (BLOTTO), heparin or SDS, and a Na+ source, such as SSC (1x SSC: 0.15 M sodium chloride, 15 mM sodium citrate) or SSPE (1x SSPE: 1.8 M NaCl, 10 mM NaH2PO4, 1 mM EDTA, pH 7.7). By increasing the ionic concentration of the buffer, the stability of the hybrid is increased. Typically, hybridization buffers contain from 10 mM to 1 M Na+. The addition of destabilizing or denaturing agents such as formamide, tetraalkylammonium salts, guanidinium cations or thiocyanate cations to the hybridization solution will alter the Tm of a hybrid. Typically, formamide is used at a concentration of up to 50% to allow incubations to be carried out at more convenient and lower temperatures. Formamide also acts to reduce non-specific background when using RNA probes.
As an illustration, a polynucleotide encoding a variant ZTMPO-1 polypeptide can be hybridized with a polynucleotide having the nucleotide sequence of SEQ ID NO:1 (or its complement) at 42°C overnight in a solution comprising 50% formamide, 5xSSC (1xSSC: 0.15 M sodium chloride and 15 mM sodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution (100x Denhardt's solution: 2% (w/v) Ficoll 400, 2% (w/v) polyvinylpyrrolidone, and 2% (w/v) bovine serum albumin), 10% dextran sulfate, and 20 µg/ml denatured, sheared salmon sperm DNA. One of skill in the art can devise variations of these hybridization conditions. For example, the hybridization mixture can be incubated at a higher or lower temperature, such as about 65°C, in a solution that does not contain formamide. Moreover, premixed hybridization solutions are available (e.g., EXPRESSHYB Hybridization Solution from CLONTECH Laboratories, Inc.), and hybridization can be performed according to the manufacturer's instructions.
Following hybridization, the nucleic acid molecules can be washed to remove non-hybridized nucleic acid molecules under stringent conditions, or under highly stringent conditions. Typical stringent washing conditions include washing in a solution of 0.5x-2x SSC with 0.1% sodium dodecyl sulfate (SDS) at 55-65°C. That is, nucleic acid molecules encoding a variant ZTMPO-1 polypeptide hybridize with a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:1 (or its complement) under stringent washing conditions, in which the wash stringency is equivalent to 0.5x-2x SSC with 0.1% SDS at 50-65°C, including 0.5x SSC with 0.1% SDS at 55°C, or 2x SSC with 0.1% SDS at 65°C. One of skill in the art can readily devise equivalent conditions, for example, by substituting SSPE for SSC in the wash solution.
Typical highly stringent washing conditions include washing in a solution of 0.1x-0.2x SSC with 0.1% sodium dodecyl sulfate (SDS) at 50-65°C. In other words, polynucleotides encoding a variant ZTMPO-1 polypeptide hybridize with a polynucleotide having the nucleotide sequence of SEQ ID NO:1 (or its complement) under highly stringent washing conditions, in which the wash stringency is equivalent to 0.1x-0.2x SSC with 0.1% SDS at 50-65°C, including 0.1x SSC with 0.1% SDS at 50°C, or 0.2x SSC with 0.1% SDS at 65°C.
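The wash ranges quoted above can be written down as a simple lookup. The following is only a minimal sketch: the function name `classify_wash` and its return labels are hypothetical, it assumes 0.1% SDS throughout, and it uses the 50-65°C window given in the "equivalent to" sentences. It does not attempt to judge whether some other buffer (for example, SSPE) is of equivalent stringency.

```python
def classify_wash(ssc_x: float, temp_c: float, sds_percent: float = 0.1) -> str:
    """Classify a wash condition against the SSC/temperature ranges in the text.

    ssc_x       -- SSC concentration as a multiple of 1x SSC (e.g. 0.2 for 0.2x)
    temp_c      -- wash temperature in degrees Celsius
    sds_percent -- SDS concentration in percent; the text always uses 0.1%
    """
    # Both stringency classes are stated for 0.1% SDS at 50-65 degrees C.
    if abs(sds_percent - 0.1) > 1e-9 or not (50.0 <= temp_c <= 65.0):
        return "outside stated ranges"
    if 0.1 <= ssc_x <= 0.2:
        return "highly stringent"
    if 0.5 <= ssc_x <= 2.0:
        return "stringent"
    # Concentrations between 0.2x and 0.5x SSC are not assigned to either class.
    return "outside stated ranges"


if __name__ == "__main__":
    for ssc, temp in [(0.1, 50), (0.2, 65), (0.5, 55), (2.0, 65), (0.3, 60)]:
        print(f"{ssc}x SSC, 0.1% SDS, {temp} C -> {classify_wash(ssc, temp)}")
```

A lookup of this kind is useful only for recording which of the stated ranges a protocol falls into; actual stringency also depends on probe length and base composition, as discussed above.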
The present invention also contemplates ZTMPO-1 variant polypeptides that can be identified using two criteria: a determination of the similarity between the encoded polypeptide and the amino acid sequence of SEQ ID NO:2, and a hybridization assay, as described above. Such ZTMPO-1 variants include nucleic acid molecules (1) that hybridize with a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:1 (or its complement) under stringent washing conditions, in which the wash stringency is equivalent to 0.5x-2x SSC with 0.1% SDS at 50-65°C, and (2) that encode a polypeptide having at least 80%, at least 90%, at least 95% or greater than 95% sequence identity to the amino acid sequence of SEQ ID NO:2. Alternatively, ZTMPO-1 variants can be characterized as nucleic acid molecules (1) that hybridize with a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:1 (or its complement) under highly stringent washing conditions, in which the wash stringency is equivalent to 0.1x-0.2x SSC with 0.1% SDS at 50-65°C, and (2) that encode a polypeptide having at least 80%, at least 90%, at least 95% or greater than 95% sequence identity to the amino acid sequence of SEQ ID NO:2.
As previously noted, the isolated polynucleotides of the present invention include DNA and RNA. Methods for preparing DNA and RNA are well known in the art. In general, RNA is isolated from a tissue or cell that produces large amounts of ZTMPO-1 RNA. Such tissues and cells are identified by Northern blotting (Thomas, Proc. Natl. Acad. Sci. USA 77:5201, 1980), an exemplary source being human testis tissue. Total RNA can be prepared using guanidine HCl extraction followed by isolation by centrifugation in a CsCl gradient (Chirgwin et al., Biochemistry 18:52-94, 1979). Poly(A)+ RNA is prepared from total RNA using the method of Aviv and Leder (Proc. Natl. Acad. Sci. USA 69:1408-12, 1972).
Complementary DNA (cDNA) is prepared from poly(A)+ RNA
using known methods. In the alternative, genomic DNA can be isolated. Polynucleotides encoding ZTMPO-1 polypeptides are then identified and isolated by, for example, hybridization or PCR.
The polynucleotides of the present invention can also be synthesized using techniques widely known in the art. See, for example, Glick and Pasternak, Molecular Biotechnology, Principles & Applications of Recombinant DNA, (ASM Press, Washington, D.C. 1994); Itakura et al., Annu. Rev. Biochem. 53: 323-56, 1984 and Climie et al., Proc. Natl. Acad. Sci. USA 87:633-7, 1990.
The present invention further provides counterpart polypeptides and polynucleotides from other species (orthologs). These species include, but are not limited to mammalian, avian, amphibian, reptile, fish, insect and other vertebrate and invertebrate species. Of particular interest are ZTMPO-1 polypeptides from other mammalian species, including murine, porcine, ovine, bovine, canine, feline, equine, and other primate polypeptides. Orthologs of human ZTMPO-1 can be cloned using information and compositions provided by the present invention in combination with conventional cloning techniques. For example, a cDNA can be cloned using mRNA
obtained from a tissue or cell type that expresses ZTMPO-1 as disclosed herein. Suitable sources of mRNA can be identified by probing Northern blots with probes designed from the sequences disclosed herein. A library is then prepared from mRNA of a positive tissue or cell line. A
ZTMPO-1-encoding cDNA can then be isolated by a variety of methods, such as by probing with a complete or partial human cDNA or with one or more sets of degenerate probes based on the disclosed sequences. A cDNA can also be cloned using the polymerase chain reaction, or PCR
(Mullis, U.S. Patent No. 4,683,202), using primers designed from the representative human ZTMPO-1 sequence disclosed herein. Within an additional method, the cDNA
library can be used to transform or transfect host cells, and expression of the cDNA of interest can be detected with an antibody to ZTMPO-1 polypeptide. Similar techniques can also be applied to the isolation of genomic clones.
Those skilled in the art will recognize that the sequence disclosed in SEQ ID NO:1 represents a single allele of human ZTMPO-1 and that allelic variation and alternative splicing are expected to occur. Allelic variants of this sequence can be cloned by probing cDNA or genomic libraries from different individuals according to standard procedures. Allelic variants of the DNA
sequence shown in SEQ ID NO:2, including those containing silent mutations and those in which mutations result in amino acid sequence changes, are within the scope of the present invention, as are proteins which are allelic variants of SEQ ID NO:2. cDNAs generated from alternatively spliced mRNAs, which retain the properties of the ZTMPO-1 polypeptide, are included within the scope of the present invention, as are polypeptides encoded by such cDNAs and mRNAs. Allelic variants and splice variants of these sequences can be cloned by probing cDNA or genomic libraries from different individuals or tissues according to standard procedures known in the art.
The present invention also provides isolated ZTMPO-1 polypeptides that are substantially homologous to the polypeptides of SEQ ID NO:2 and their orthologs. The term "substantially homologous" is used herein to denote polypeptides having 50%, preferably 60%, more preferably at least 80%, sequence identity to the sequences shown in SEQ ID NO:2 or their orthologs. Such polypeptides will more preferably be at least 90% identical, and most preferably 95% or more identical to SEQ ID NO:2 or its orthologs. Percent sequence identity is determined by conventional methods. See, for example, Altschul et al., Bull. Math. Bio. 48: 603-16, 1986 and Henikoff and Henikoff, Proc. Natl. Acad. Sci. USA 89:10915-9, 1992. The present invention further includes nucleic acid molecules that encode such polypeptides. Methods for determining percent identity are described below.
Briefly, two amino acid sequences are aligned to optimize the alignment scores using a gap opening penalty of 10, a gap extension penalty of 1, and the "blosum 62" scoring matrix of Henikoff and Henikoff (ibid.) as shown in Table 3 (amino acids are indicated by the standard one-letter codes). The percent identity is then calculated as:
\[
\text{percent identity} = \frac{\text{total number of identical matches}}{\text{[length of the longer sequence plus the number of gaps introduced into the longer sequence in order to align the two sequences]}} \times 100
\]
Table 3

     A   R   N   D   C   Q   E   G   H   I   L   K   M   F   P   S   T   W   Y   V
A    4
R   -1   5
N   -2   0   6
D   -2  -2   1   6
C    0  -3  -3  -3   9
Q   -1   1   0   0  -3   5
E   -1   0   0   2  -4   2   5
G    0  -2   0  -1  -3  -2  -2   6
H   -2   0   1  -1  -3   0   0  -2   8
I   -1  -3  -3  -3  -1  -3  -3  -4  -3   4
L   -1  -2  -3  -4  -1  -2  -3  -4  -3   2   4
K   -1   2   0  -1  -3   1   1  -2  -1  -3  -2   5
M   -1  -1  -2  -3  -1   0  -2  -3  -2   1   2  -1   5
F   -2  -3  -3  -3  -2  -3  -3  -3  -1   0   0  -3   0   6
P   -1  -2  -2  -1  -3  -1  -1  -2  -2  -3  -3  -1  -2  -4   7
S    1  -1   1   0  -1   0   0   0  -1  -2  -2   0  -1  -2  -1   4
T    0  -1   0  -1  -1  -1  -1  -2  -2  -1  -1  -1  -1  -2  -1   1   5
W   -3  -3  -4  -4  -2  -2  -3  -2  -2  -3  -2  -3  -1   1  -4  -3  -2  11
Y   -2  -2  -2  -3  -2  -1  -2  -3   2  -1  -1  -2  -1   3  -3  -2  -2   2   7
V    0  -3  -3  -3  -1  -2  -2  -3  -3   3   1  -2   1  -1  -2  -2   0  -3  -1   4
Those skilled in the art appreciate that there are many established algorithms available to align two amino acid sequences. The "FASTA" similarity search algorithm of Pearson and Lipman is a suitable protein alignment method for examining the level of identity shared by an amino acid sequence disclosed herein and the amino acid sequence of a putative variant ZTMPO-1. The FASTA algorithm is described by Pearson and Lipman, Proc.
Nat. Acad. Sci. USA 85:2444, 1988, and by Pearson, Meth.
Enzymol. 183:63, 1990.
Briefly, FASTA first characterizes sequence similarity by identifying regions shared by the query sequence (e.g., SEQ ID NO:2) and a test sequence that have either the highest density of identities (if the ktup variable is 1) or pairs of identities (if ktup=2), without considering conservative amino acid substitutions, insertions, or deletions. The ten regions with the highest density of identities are then re-scored by comparing the similarity of all paired amino acids using an amino acid substitution matrix, and the ends of the regions are "trimmed" to include only those residues that contribute to the highest score. If there are several regions with scores greater than the "cutoff" value (calculated by a predetermined formula based upon the length of the sequence and the ktup value), then the trimmed initial regions are examined to determine whether the regions can be joined to form an approximate alignment with gaps. Finally, the highest scoring regions of the two amino acid sequences are aligned using a modification of the Needleman-Wunsch-Sellers algorithm (Needleman and Wunsch, J. Mol. Biol. 48:444, 1970; Sellers, SIAM J. Appl.
Math. 26:787, 1974), which allows for amino acid insertions and deletions. Illustrative parameters for FASTA analysis are: ktup=1, gap opening penalty=10, gap extension penalty=1, and substitution matrix=BLOSUM62.
These parameters can be introduced into a FASTA program by modifying the scoring matrix file ("SMATRIX"), as explained in Appendix 2 of Pearson, Meth. Enzymol. 183:63, 1990.
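The first stage described above, locating the region with the highest density of shared identities for a given ktup, can be sketched as a count of shared k-tuples per alignment diagonal. This is a simplified illustration only and is not the FASTA program: the function name `best_diagonal` is hypothetical, a whole diagonal stands in for a "region", and the later re-scoring, trimming, joining and Needleman-Wunsch-Sellers steps are omitted.

```python
from collections import Counter, defaultdict


def best_diagonal(query: str, test: str, ktup: int = 1):
    """Return (diagonal, hits) for the diagonal sharing the most k-tuples.

    The diagonal is the offset i - j between a k-tuple starting at position i
    of the query and the same k-tuple starting at position j of the test
    sequence. Returns None when the sequences share no k-tuple.
    """
    # Index every k-tuple of the query by its start positions.
    positions = defaultdict(list)
    for i in range(len(query) - ktup + 1):
        positions[query[i:i + ktup]].append(i)

    # Scan the test sequence and tally exact k-tuple matches per diagonal.
    hits = Counter()
    for j in range(len(test) - ktup + 1):
        for i in positions.get(test[j:j + ktup], ()):
            hits[i - j] += 1

    if not hits:
        return None
    # Prefer the highest count; break ties toward the smallest offset.
    return max(hits.items(), key=lambda item: (item[1], -abs(item[0])))


if __name__ == "__main__":
    # Hypothetical sequences: the test sequence is the query shifted by two residues.
    print(best_diagonal("ACDEFGHIK", "XXACDEFGHIK", ktup=2))
```

Only exact matches are counted at this stage, which mirrors the statement that conservative substitutions, insertions and deletions are not considered until the later steps.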
FASTA can also be used to determine the sequence identity of nucleic acid molecules using a ratio as disclosed above. For nucleotide sequence comparisons, the ktup value can range from one to six, preferably from four to six.
Substantially homologous proteins and polypeptides are characterized as having one or more amino acid substitutions, deletions or additions. These changes are preferably of a minor nature, that is conservative amino acid substitutions and other substitutions that do not significantly affect the folding or activity of the protein or polypeptide; small deletions, typically of one to about 30 amino acids; and small amino- or carboxyl-terminal extensions, such as an amino-terminal methionine residue, a small linker peptide of up to about 20-25 residues, or an affinity tag. Polypeptides comprising affinity tags can further comprise a proteolytic cleavage site between the ZTMPO-1 polypeptide and the affinity tag.
Preferred such sites include thrombin cleavage sites and factor Xa cleavage sites.
The present invention includes nucleic acid molecules that encode a polypeptide having one or more "conservative amino acid substitutions," compared with the amino acid sequence of SEQ ID NO:2. Conservative amino acid substitutions can be based upon the chemical properties of the amino acids. That is, variants can be obtained that contain one or more amino acid substitutions of SEQ ID NO:2, in which an alkyl amino acid is substituted for an alkyl amino acid in a ZTMPO-1 amino acid sequence, an aromatic amino acid is substituted for an aromatic amino acid in a ZTMPO-1 amino acid sequence, a sulfur-containing amino acid is substituted for a sulfur-containing amino acid in a ZTMPO-1 amino acid sequence, a hydroxy-containing amino acid is substituted for a hydroxy-containing amino acid in a ZTMPO-1 amino acid sequence, an acidic amino acid is substituted for an acidic amino acid in a ZTMPO-1 amino acid sequence, a basic amino acid is substituted for a basic amino acid in a ZTMPO-1 amino acid sequence, or a dibasic monocarboxylic amino acid is substituted for a dibasic monocarboxylic amino acid in a ZTMPO-1 amino acid sequence.
Among the common amino acids, for example, a "conservative amino acid substitution" is illustrated by a substitution among amino acids within each of the following groups: (1) glycine, alanine, valine, leucine, and isoleucine, (2) phenylalanine, tyrosine, and tryptophan, (3) serine and threonine, (4) aspartate and glutamate, (5) glutamine and asparagine, and (6) lysine, arginine and histidine. Other conservative amino acid substitutions are provided in Table 4.
Table 4
Conservative amino acid substitutions

Basic: arginine, lysine, histidine
Acidic: glutamic acid, aspartic acid
Polar: glutamine, asparagine
Hydrophobic: leucine, isoleucine, valine
Aromatic: phenylalanine, tryptophan, tyrosine
Small: glycine, alanine, serine, threonine, methionine

The BLOSUM62 table is an amino acid substitution matrix derived from about 2,000 local multiple alignments of protein sequence segments, representing highly conserved regions of more than 500 groups of related proteins (Henikoff and Henikoff, Proc. Natl. Acad. Sci USA 89:10915, 1992). Accordingly, the BLOSUM62 substitution frequencies can be used to define conservative amino acid substitutions that may be introduced into the amino acid sequences of the present invention. Although it is possible to design amino acid substitutions based solely upon chemical properties (as discussed above), the language "conservative amino acid substitution" preferably refers to a substitution represented by a BLOSUM62 value of greater than -1. For example, an amino acid substitution is conservative if the substitution is characterized by a BLOSUM62 value of 0, 1, 2, or 3. According to this system, preferred conservative amino acid substitutions are characterized by a BLOSUM62 value of at least 1 (e.g., 1, 2 or 3), while more preferred conservative amino acid substitutions are characterized by a BLOSUM62 value of at least 2 (e.g., 2 or 3).
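The BLOSUM62 thresholds described above can be applied mechanically to any pair of residues. The following is a minimal sketch: the names `blosum62` and `classify_substitution` and the returned labels are hypothetical, and the matrix is the standard lower-triangular BLOSUM62 for the twenty common amino acids in one-letter code.

```python
ORDER = "ARNDCQEGHILKMFPSTWYV"

# Lower triangle of BLOSUM62, one row per residue in ORDER.
LOWER_TRIANGLE = """
4
-1 5
-2 0 6
-2 -2 1 6
0 -3 -3 -3 9
-1 1 0 0 -3 5
-1 0 0 2 -4 2 5
0 -2 0 -1 -3 -2 -2 6
-2 0 1 -1 -3 0 0 -2 8
-1 -3 -3 -3 -1 -3 -3 -4 -3 4
-1 -2 -3 -4 -1 -2 -3 -4 -3 2 4
-1 2 0 -1 -3 1 1 -2 -1 -3 -2 5
-1 -1 -2 -3 -1 0 -2 -3 -2 1 2 -1 5
-2 -3 -3 -3 -2 -3 -3 -3 -1 0 0 -3 0 6
-1 -2 -2 -1 -3 -1 -1 -2 -2 -3 -3 -1 -2 -4 7
1 -1 1 0 -1 0 0 0 -1 -2 -2 0 -1 -2 -1 4
0 -1 0 -1 -1 -1 -1 -2 -2 -1 -1 -1 -1 -2 -1 1 5
-3 -3 -4 -4 -2 -2 -3 -2 -2 -3 -2 -3 -1 1 -4 -3 -2 11
-2 -2 -2 -3 -2 -1 -2 -3 2 -1 -1 -2 -1 3 -3 -2 -2 2 7
0 -3 -3 -3 -1 -2 -2 -3 -3 3 1 -2 1 -1 -2 -2 0 -3 -1 4
"""
ROWS = [[int(v) for v in line.split()] for line in LOWER_TRIANGLE.strip().splitlines()]


def blosum62(a: str, b: str) -> int:
    """Look up the BLOSUM62 value for a pair of one-letter residue codes."""
    i, j = ORDER.index(a), ORDER.index(b)
    if i < j:  # the matrix is symmetric, so read from the stored triangle
        i, j = j, i
    return ROWS[i][j]


def classify_substitution(original: str, replacement: str) -> str:
    """Apply the thresholds in the text: > -1, at least 1, at least 2."""
    score = blosum62(original, replacement)
    if score >= 2:
        return "more preferred conservative"
    if score >= 1:
        return "preferred conservative"
    if score > -1:
        return "conservative"
    return "not conservative"


if __name__ == "__main__":
    for pair in [("L", "I"), ("S", "T"), ("A", "G"), ("D", "L")]:
        print(pair, blosum62(*pair), classify_substitution(*pair))
```

Note that the matrix-based definition and the chemical groupings in Table 4 do not always agree; for example, a pair can share a chemical group yet have a BLOSUM62 value below 0.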
Conservative amino acid changes in a ZTMPO-1 gene can be introduced by substituting nucleotides for the nucleotides recited in SEQ ID NO:1. Such "conservative amino acid" variants can be obtained, for example, by oligonucleotide-directed mutagenesis, linker-scanning mutagenesis, mutagenesis using the polymerase chain reaction, and the like (see Ausubel (1995) at pages 8-10 to 8-22; and McPherson (ed.), Directed Mutagenesis: A Practical Approach (IRL Press 1991)). The ability of such variants to promote proliferation and cardiac functions as well as other properties of the wild-type protein can be determined using standard methods, such as the assays described herein. Alternatively, a variant ZTMPO-1 polypeptide can be identified by the ability to specifically bind anti-ZTMPO-1 antibodies.
The proteins of the present invention can also comprise non-naturally occurring amino acid residues.
Non-naturally occurring amino acids include, without limitation, trans-3-methylproline, 2,4-methanoproline, cis-4-hydroxyproline, trans-4-hydroxyproline, N-methylglycine, allo-threonine, methylthreonine, hydroxyethylcysteine, hydroxyethylhomocysteine, nitroglutamine, homoglutamine, pipecolic acid, thiazolidine carboxylic acid, dehydroproline, 3- and 4-methylproline, 3,3-dimethylproline, tert-leucine, norvaline, 2-azaphenylalanine, 3-azaphenylalanine, 4-azaphenylalanine, and 4-fluorophenylalanine. Several methods are known in the art for incorporating non-naturally occurring amino acid residues into proteins. For example, an in vitro system can be employed wherein nonsense mutations are suppressed using chemically aminoacylated suppressor tRNAs. Methods for synthesizing amino acids and aminoacylating tRNA are known in the art. Transcription and translation of plasmids containing nonsense mutations is carried out in a cell-free system comprising an E. coli S30 extract and commercially available enzymes and other reagents. Proteins are purified by chromatography. See, for example, Robertson et al., J. Am. Chem. Soc. 113:2722, 1991; Ellman et al., Methods Enzymol. 202:301, 1991; Chung et al., Science 259:806-9, 1993; and Chung et al., Proc. Natl. Acad. Sci. USA 90:10145-9, 1993. In a second method, translation is carried out in Xenopus oocytes by microinjection of mutated mRNA and chemically aminoacylated suppressor tRNAs (Turcatti et al., J. Biol. Chem. 271:19991-8, 1996). Within a third method, E. coli cells are cultured in the absence of a natural amino acid that is to be replaced (e.g., phenylalanine) and in the presence of the desired non-naturally occurring amino acid(s) (e.g., 2-azaphenylalanine, 3-azaphenylalanine, 4-azaphenylalanine, or 4-fluorophenylalanine). The non-naturally occurring amino acid is incorporated into the protein in place of its natural counterpart. See, Koide et al., Biochem. 33:7470-6, 1994. Naturally occurring amino acid residues can be converted to non-naturally occurring species by in vitro chemical modification.
Chemical modification can be combined with site-directed mutagenesis to further expand the range of substitutions (Wynn and Richards, Protein Sci. 2:395-403, 1993).
A limited number of non-conservative amino acids, amino acids that are not encoded by the genetic code, non-naturally occurring amino acids, and unnatural amino acids may be substituted for ZTMPO-1 amino acid residues.
Essential amino acids in the polypeptides of the present invention can be identified according to procedures known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham and Wells, Science 244: 1081-5, 1989; Bass et al., Proc. Natl. Acad. Sci. USA 88:4498-502, 1991). In the latter technique, single alanine mutations are introduced at every residue in the molecule, and the resultant mutant molecules are tested for biological activity as disclosed below to identify amino acid residues that are critical to the activity of the molecule. See also, Hilton et al., J. Biol. Chem. 271:4699-708, 1996. Sites of ligand-receptor interaction can also be determined by physical analysis of structure, as determined by such techniques as nuclear magnetic resonance, crystallography, electron diffraction or photoaffinity labeling, in conjunction with mutation of putative contact site amino acids. See, for example, de Vos et al., Science 255:306-12, 1992; Smith et al., J. Mol. Biol. 224:899-904, 1992; Wlodawer et al., FEBS Lett. 309:59-64, 1992. The identities of essential amino acids can also be inferred from analysis of homologies with related nuclear membrane bound proteins.
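The set of constructs implied by alanine-scanning mutagenesis can be enumerated directly from a sequence. This is a minimal sketch, assuming one-letter amino acid codes and 1-based residue numbering; the function name `alanine_scan` and the mutant label format (for example, "K3A") are illustrative choices, and positions that are already alanine are skipped because no substitution would result.

```python
def alanine_scan(sequence: str):
    """List every single-alanine mutant of a protein sequence.

    Returns (label, mutant_sequence) pairs, where the label gives the
    original residue, its 1-based position, and the replacement "A".
    """
    mutants = []
    for index, residue in enumerate(sequence):
        if residue == "A":
            continue  # already alanine; nothing to substitute
        label = f"{residue}{index + 1}A"
        mutant = sequence[:index] + "A" + sequence[index + 1:]
        mutants.append((label, mutant))
    return mutants


if __name__ == "__main__":
    # Hypothetical three-residue peptide used only to show the output shape.
    for label, mutant in alanine_scan("MAK"):
        print(label, mutant)
```

Each listed mutant would then be expressed and assayed for biological activity, which is the experimental step that actually identifies critical residues.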
Multiple amino acid substitutions can be made and tested using known methods of mutagenesis and screening, such as those disclosed by Reidhaar-Olson and Sauer (Science 241:53-7, 1988) or Bowie and Sauer (Proc.
Natl. Acad. Sci. USA 86:2152-6, 1989). Briefly, these authors disclose methods for simultaneously randomizing two or more positions in a polypeptide, selecting for functional polypeptide, and then sequencing the mutagenized polypeptides to determine the spectrum of allowable substitutions at each position. Other methods that can be used include phage display (e.g., Lowman et al., Biochem. 30:10832-7, 1991; Ladner et al., U.S. Patent No. 5,223,409; Huse, WIPO Publication WO 92/06204) and region-directed mutagenesis (Derbyshire et al., Gene 46:145, 1986; Ner et al., DNA 7:127, 1988).
Variants of the disclosed ZTMPO-1 DNA and polypeptide sequences can be generated through DNA
shuffling as disclosed by Stemmer, Nature 370:389-91, 1994 and Stemmer, Proc. Natl. Acad. Sci. USA 91:10747-51, 1994.
Briefly, variant DNAs are generated by in vitro homologous recombination by random fragmentation of a parent DNA
followed by reassembly using PCR, resulting in randomly introduced point mutations. This technique can be modified by using a family of parent DNAs, such as allelic variants or genes from different species, to introduce additional variability into the process. Selection or screening for the desired activity, followed by additional iterations of mutagenesis and assay provides for rapid "evolution" of sequences by selecting for desirable mutations while simultaneously selecting against detrimental changes.
Mutagenesis methods as disclosed herein can be combined with high-throughput, automated screening methods to detect activity of cloned, mutagenized polypeptides in host cells. Preferred assays in this regard include cell proliferation assays and biosensor-based ligand-binding assays, which are described below. Mutagenized DNA
molecules that encode active polypeptides can be recovered from the host cells and rapidly sequenced using modern equipment. These methods allow the rapid determination of the importance of individual amino acid residues in a polypeptide of interest, and can be applied to polypeptides of unknown structure.
Using the methods discussed herein, one of ordinary skill in the art can identify and/or prepare a variety of polypeptide fragments or variants of SEQ ID NO:2 that retain the receptor binding properties of the wild-type ZTMPO-1 protein. Such polypeptides may also include additional polypeptide segments as generally disclosed herein.
For any ZTMPO-1 polypeptide, including variants and fusion proteins, one of ordinary skill in the art can readily generate a fully degenerate polynucleotide sequence encoding that variant using the information set forth in Tables 1 and 2 above.
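Generating a fully degenerate polynucleotide sequence amounts to replacing each amino acid with a degenerate codon. The sketch below is illustrative only: Tables 1 and 2 are not reproduced in this section, so the mapping used here is an assumption based on the standard genetic code written with IUPAC ambiguity codes (N = any base, R = A or G, Y = C or T, M = A or C, W = A or T, S = C or G, H = A, C or T). The names `DEGENERATE_CODON` and `degenerate_sequence` are hypothetical.

```python
# One degenerate codon per amino acid (one-letter code); "*" marks a stop.
DEGENERATE_CODON = {
    "A": "GCN", "R": "MGN", "N": "AAY", "D": "GAY", "C": "TGY",
    "Q": "CAR", "E": "GAR", "G": "GGN", "H": "CAY", "I": "ATH",
    "L": "YTN", "K": "AAR", "M": "ATG", "F": "TTY", "P": "CCN",
    "S": "WSN", "T": "ACN", "W": "TGG", "Y": "TAY", "V": "GTN",
    "*": "TRR",
}


def degenerate_sequence(protein: str) -> str:
    """Back-translate a protein into a degenerate DNA sequence."""
    codons = []
    for position, residue in enumerate(protein, start=1):
        try:
            codons.append(DEGENERATE_CODON[residue])
        except KeyError:
            raise ValueError(
                f"unknown residue {residue!r} at position {position}"
            ) from None
    return "".join(codons)


if __name__ == "__main__":
    # Hypothetical tripeptide Met-Ser-Leu.
    print(degenerate_sequence("MSL"))
```

A few of these degenerate codons cover more than the intended amino acid: WSN (serine), MGN (arginine) and YTN (leucine) each include triplets that encode a different residue, and TRR includes TGG (tryptophan). A sequence drawn from the degenerate set should therefore be checked by translation against the intended polypeptide.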
As used herein a fusion protein consists essentially of a first portion and a second portion joined by a peptide bond. In one embodiment the first portion consists of a polypeptide comprising a sequence of amino acid residues that is at least 80% identical in amino acid sequence to residues 1 through 876 of SEQ ID NO:2 and the second portion is any other polypeptide. The other polypeptide may be alternative or additional domains from other members of the thymopoietin or emerin family, a signal peptide to facilitate secretion of the fusion protein, affinity tags, Ig domains or the like.
The ZTMPO-1 polypeptides of the present invention, including full-length polypeptides, biologically active fragments, and fusion polypeptides, can be produced in genetically engineered host cells according to conventional techniques. Suitable host cells are those cell types that can be transformed or transfected with exogenous DNA and grown in culture, and include bacteria, fungal cells, and cultured higher eukaryotic cells. Eukaryotic cells, particularly cultured cells of multicellular organisms, are preferred.
Techniques for manipulating cloned DNA molecules and introducing exogenous DNA into a variety of host cells are disclosed by Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989, and Ausubel et al., eds., Current Protocols in Molecular Biology, John Wiley and Sons, Inc., NY, 1987.
In general, a DNA sequence encoding a ZTMPO-1 polypeptide is operably linked to other genetic elements required for its expression, generally including a transcription promoter and terminator, within an expression vector. The vector will also commonly contain one or more selectable markers and one or more origins of replication, although those skilled in the art will recognize that within certain systems selectable markers may be provided on separate vectors, and replication of the exogenous DNA may be provided by integration into the host cell genome. Selection of promoters, terminators, selectable markers, vectors and other elements is a matter of routine design within the level of ordinary skill in the art. Many such elements are described in the literature and are available through commercial suppliers.
To direct a ZTMPO-1 polypeptide into the secretory pathway of a host cell, a secretory signal sequence (also known as a leader sequence, signal sequence, prepro sequence or pre sequence) is provided in the expression vector. The secretory signal sequence may be derived from another secreted protein (e.g., t-PA) or synthesized de novo. The secretory signal sequence is operably linked to the ZTMPO-1 DNA sequence, i.e., the two sequences are joined in the correct reading frame and positioned to direct the newly synthesized polypeptide into the secretory pathway of the host cell. Secretory signal sequences are commonly positioned 5' to the DNA
sequence encoding the polypeptide of interest, although certain secretory signal sequences may be positioned elsewhere in the DNA sequence of interest (see, e.g., Welch et al., U.S. Patent No. 5,037,743; Holland et al., U.S. Patent No. 5,143,830).
Cultured mammalian cells are suitable hosts within the present invention. Methods for introducing exogenous DNA into mammalian host cells include calcium phosphate-mediated transfection (Wigler et al., Cell 14:725, 1978; Corsaro and Pearson, Somatic Cell Genetics 7:603, 1981; Graham and Van der Eb, Virology 52:456, 1973), electroporation (Neumann et al., EMBO J. 1:841-5, 1982), DEAE-dextran mediated transfection (Ausubel et al., ibid.), liposome-mediated transfection (Hawley-Nelson et al., Focus 15:73, 1993; Ciccarone et al., Focus 15:80, 1993), and viral vectors (Miller and Rosman, BioTechniques 7:980-90, 1989; Wang and Finer, Nature Med. 2:714-6, 1996). The production of recombinant polypeptides in cultured mammalian cells is disclosed, for example, by Levinson et al., U.S. Patent No. 4,713,339; Hagen et al., U.S. Patent No. 4,784,950; Palmiter et al., U.S. Patent No. 4,579,821; and Ringold, U.S. Patent No. 4,656,134.
Suitable cultured mammalian cells include the COS-1 (ATCC No. CRL 1650), COS-7 (ATCC No. CRL 1651), BHK (ATCC No. CRL 1632), BHK 570 (ATCC No. CRL 10314), 293 (ATCC No. CRL 1573; Graham et al., J. Gen. Virol. 36:59-72, 1977) and Chinese hamster ovary (e.g., CHO-K1; ATCC No. CCL 61) cell lines. Additional suitable cell lines are known in the art and available from public depositories such as the American Type Culture Collection, Rockville, Maryland. In general, strong transcription promoters are preferred, such as promoters from SV-40 or cytomegalovirus. See, e.g., U.S. Patent No. 4,956,288. Other suitable promoters include those from metallothionein genes (U.S. Patent Nos. 4,579,821 and 4,601,978) and the adenovirus major late promoter.
Drug selection is generally used to select for cultured mammalian cells into which foreign DNA has been inserted. Such cells are commonly referred to as "transfectants". Cells that have been cultured in the presence of the selective agent and are able to pass the gene of interest to their progeny are referred to as "stable transfectants." A preferred selectable marker is a gene encoding resistance to the antibiotic neomycin. Selection is carried out in the presence of a neomycin-type drug, such as G-418 or the like. Selection systems can also be used to increase the expression level of the gene of interest, a process referred to as "amplification." Amplification is carried out by culturing transfectants in the presence of a low level of the selective agent and then increasing the amount of selective agent to select for cells that produce high levels of the products of the introduced genes. A preferred amplifiable selectable marker is dihydrofolate reductase, which confers resistance to methotrexate. Other drug resistance genes (e.g., hygromycin resistance, multi-drug resistance, puromycin acetyltransferase) can also be used. Alternative markers that introduce an altered phenotype, such as green fluorescent protein, or cell surface proteins such as CD4, CD8, Class I MHC, placental alkaline phosphatase may be used to sort transfected cells from untransfected cells by such means as FACS sorting or magnetic bead separation technology.
Other higher eukaryotic cells can also be used as hosts, including plant cells, insect cells and avian cells. The use of Agrobacterium rhizogenes as a vector for expressing genes in plant cells has been reviewed by Sinkar et al., J. Biosci. (Bangalore) 11:47-58, 1987.
Transformation of insect cells and production of foreign polypeptides therein is disclosed by Guarino et al., U.S.
Patent No. 5,162,222 and WIPO publication WO 94/06463.
Insect cells can be infected with recombinant baculovirus vectors, which are commonly derived from Autographa californica multiple nuclear polyhedrosis virus (AcMNPV).
DNA encoding the polypeptide of interest is inserted into the viral genome in place of the polyhedrin gene coding sequence by homologous recombination in cells infected with intact, wild-type AcMNPV and transfected with a transfer vector comprising the cloned gene operably linked to polyhedrin gene promoter, terminator, and flanking sequences. The resulting recombinant virus is used to infect host cells, typically a cell line derived from the fall armyworm, Spodoptera frugiperda. See, in general, Glick and Pasternak, Molecular Biotechnology: Principles and Applications of Recombinant DNA, ASM Press, Washington, D.C., 1994.
Fungal cells, including yeast cells, can also be used within the present invention. Yeast species of particular interest in this regard include Saccharomyces cerevisiae, Pichia pastoris, and Pichia methanolica.
Methods for transforming S. cerevisiae cells with exogenous DNA and producing recombinant polypeptides therefrom are disclosed by, for example, Kawasaki, U.S. Patent No. 4,599,311; Kawasaki et al., U.S. Patent No. 4,931,373; Brake, U.S. Patent No. 4,870,008; Welch et al., U.S. Patent No. 5,037,743; and Murray et al., U.S. Patent No. 4,845,075. Transformed cells are selected by phenotype determined by the selectable marker, commonly drug resistance or the ability to grow in the absence of a particular nutrient (e.g., leucine). A preferred vector system for use in Saccharomyces cerevisiae is the POT1 vector system disclosed by Kawasaki et al. (U.S. Patent No. 4,931,373), which allows transformed cells to be selected by growth in glucose-containing media. Suitable promoters and terminators for use in yeast include those from glycolytic enzyme genes (see, e.g., Kawasaki, U.S. Patent No. 4,599,311; Kingsman et al., U.S. Patent No. 4,615,974; and Bitter, U.S. Patent No. 4,977,092) and alcohol dehydrogenase genes. See also U.S. Patents Nos. 4,990,446; 5,063,154; 5,139,936 and 4,661,454.
Transformation systems for other yeasts, including Hansenula polymorpha, Schizosaccharomyces pombe, Kluyveromyces lactis, Kluyveromyces fragilis, Ustilago maydis, Pichia pastoris, Pichia methanolica, Pichia guillermondii and Candida maltosa are known in the art.
See, for example, Gleeson et al., J. Gen. Microbiol.
132:3459-65, 1986 and Cregg, U.S. Patent No. 4,882,279.
Aspergillus cells may be utilized according to the methods of McKnight et al., U.S. Patent No. 4,935,349. Methods for transforming Acremonium chrysogenum are disclosed by Sumino et al., U.S. Patent No. 5,162,228. Methods for transforming Neurospora are disclosed by Lambowitz, U.S.
Patent No. 4,486,533.
The use of Pichia methanolica as host for the production of recombinant proteins is disclosed in WIPO
Publications WO 9717450 and WO 9717451. DNA molecules for use in transforming P. methanolica will commonly be prepared as double-stranded, circular plasmids, which are preferably linearized prior to transformation. For polypeptide production in P. methanolica, it is preferred that the promoter and terminator in the plasmid be that of a P. methanolica gene, such as a P. methanolica alcohol utilization gene (AUG1 or AUG2). Other useful promoters include those of the dihydroxyacetone synthase (DHAS), formate dehydrogenase (FMD), and catalase (CAT) genes. To facilitate integration of the DNA into the host chromosome, it is preferred to have the entire expression segment of the plasmid flanked at both ends by host DNA
sequences. A preferred selectable marker for use in Pichia methanolica is a P. methanolica ADE2 gene, which encodes phosphoribosyl-5-aminoimidazole carboxylase (AIRC;
EC 4.1.1.21), which allows ade2 host cells to grow in the absence of adenine. For large-scale, industrial processes where it is desirable to minimize the use of methanol, it is preferred to use host cells in which both methanol utilization genes (AUG1 and AUG2) are deleted. For production of secreted proteins, host cells deficient in vacuolar protease genes (PEP4 and PRB1) are preferred.
Electroporation is used to facilitate the introduction of a plasmid containing DNA encoding a polypeptide of interest into P. methanolica cells. It is preferred to transform P. methanolica cells by electroporation using an exponentially decaying, pulsed electric field having a field strength of from 2.5 to 4.5 kV/cm, preferably about 3.75 kV/cm, and a time constant (τ) of from 1 to 40 milliseconds, most preferably about 20 milliseconds.
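The electroporation parameters above can be related to instrument settings with a short calculation. The following is a minimal sketch only: it assumes a parallel-plate cuvette (field strength = voltage / electrode gap) and a simple RC discharge (time constant = resistance x capacitance). The 0.2 cm gap, 800 ohm resistance and 25 microfarad capacitance are hypothetical illustration values, not settings taken from this disclosure.

```python
# Sketch: relate the stated field strength and time constant to pulser settings.
# Assumptions (not from the source): parallel-plate cuvette, ideal RC discharge.

def pulse_voltage_kv(field_kv_per_cm: float, gap_cm: float) -> float:
    """Voltage (kV) needed to reach the given field strength across the gap."""
    return field_kv_per_cm * gap_cm


def time_constant_ms(resistance_ohm: float, capacitance_uf: float) -> float:
    """Time constant tau = R * C of an exponentially decaying pulse, in ms."""
    seconds = resistance_ohm * capacitance_uf * 1e-6
    return seconds * 1e3


def in_disclosed_range(field_kv_per_cm: float, tau_ms: float) -> bool:
    """True if both values fall in the ranges stated in the text
    (2.5 to 4.5 kV/cm and 1 to 40 milliseconds)."""
    return 2.5 <= field_kv_per_cm <= 4.5 and 1.0 <= tau_ms <= 40.0


if __name__ == "__main__":
    gap_cm = 0.2  # hypothetical cuvette gap
    voltage = pulse_voltage_kv(3.75, gap_cm)
    tau = time_constant_ms(800.0, 25.0)  # hypothetical R and C
    print(f"voltage = {voltage:.2f} kV, tau = {tau:.1f} ms")
    print("within disclosed range:", in_disclosed_range(3.75, tau))
```

Real pulse shapes depend on sample conductivity as well as instrument settings, so measured time constants may differ from the ideal RC value.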
Prokaryotic host cells, including strains of the bacteria Escherichia coli, Bacillus and other genera are also useful as host cells within the present invention.
Techniques for transforming these hosts and expressing foreign DNA sequences cloned therein are well known in the art (see, e.g., Sambrook et al., ibid.). When expressing a ZTMPO-1 polypeptide in bacteria such as E. coli, the polypeptide may be retained in the cytoplasm, typically as insoluble granules, or may be directed to the periplasmic space by a bacterial secretion sequence. In the former case, the cells are lysed, and the granules are recovered and denatured using, for example, guanidine isothiocyanate or urea. The denatured polypeptide can then be refolded and dimerized by diluting the denaturant, such as by dialysis against a solution of urea and a combination of reduced and oxidized glutathione, followed by dialysis against a buffered saline solution. In the latter case, the polypeptide can be recovered from the periplasmic space in a soluble and functional form by disrupting the cells (by, for example, sonication or osmotic shock) to release the contents of the periplasmic space and recovering the protein, thereby obviating the need for denaturation and refolding.
Transformed or transfected host cells are cultured according to conventional procedures in a culture medium containing nutrients and other components required for the growth of the chosen host cells. A variety of suitable media, including defined media and complex media, are known in the art and generally include a carbon source, a nitrogen source, essential amino acids, vitamins and minerals. Media may also contain such components as growth factors or serum, as required. The growth medium will generally select for cells containing the exogenously added DNA by, for example, drug selection or deficiency in an essential nutrient which is complemented by the selectable marker carried on the expression vector or co-transfected into the host cell. P. methanolica cells are cultured in a medium comprising adequate sources of carbon, nitrogen and trace nutrients at a temperature of about 25°C to 35°C. Liquid cultures are provided with sufficient aeration by conventional means, such as shaking of small flasks or sparging of fermentors. A preferred culture medium for P. methanolica is YEPD (2% D-glucose, 2% Bacto™ Peptone (Difco Laboratories, Detroit, MI), 1% Bacto™ yeast extract (Difco Laboratories), 0.004% adenine and 0.006% L-leucine).
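As a worked illustration of the YEPD recipe, the percentages can be converted to weighed amounts. This sketch assumes the percentages are weight/volume (1% = 10 g per liter) and assumes the standard YEPD composition of 2% peptone and 1% yeast extract; the function and dictionary names are illustrative only.

```python
# Sketch: convert the YEPD composition to grams per batch.
# Assumption (not stated in the source): percentages are weight/volume.

YEPD_PERCENT_WV = {
    "D-glucose": 2.0,
    "Bacto Peptone": 2.0,
    "Bacto yeast extract": 1.0,
    "adenine": 0.004,
    "L-leucine": 0.006,
}


def grams_needed(percent_wv: float, volume_l: float) -> float:
    """Grams of a component for the given volume; 1% w/v equals 10 g/L."""
    return percent_wv * 10.0 * volume_l


def yepd_recipe(volume_l: float) -> dict:
    """Return a mapping of component name to grams for a batch of volume_l liters."""
    return {name: grams_needed(pct, volume_l) for name, pct in YEPD_PERCENT_WV.items()}


if __name__ == "__main__":
    for component, grams in yepd_recipe(0.5).items():
        print(f"{component}: {grams:.3f} g")
```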
It is preferred to purify the polypeptides of the present invention to ≥80% purity, more preferably to ≥90% purity, even more preferably ≥95% purity, and particularly preferred is a pharmaceutically pure state, that is greater than 99.9% pure with respect to contaminating macromolecules, particularly other proteins and nucleic acids, and free of infectious and pyrogenic agents. Preferably, a purified polypeptide is substantially free of other polypeptides, particularly other polypeptides of animal origin.
Expressed recombinant ZTMPO-1 polypeptides (or fusion or chimeric ZTMPO-1 polypeptides) can be purified using fractionation and/or conventional purification methods and media. Ammonium sulfate precipitation and acid or chaotrope extraction may be used for fractionation of samples. Exemplary purification steps may include hydroxyapatite, size exclusion, FPLC and reverse-phase high performance liquid chromatography. Suitable chromatographic media include derivatized dextrans, agarose, cellulose, polyacrylamide, specialty silicas, and the like. PEI, DEAE, QAE and Q derivatives are preferred.
Exemplary chromatographic media include those media derivatized with phenyl, butyl, or octyl groups, such as Phenyl-Sepharose FF (Pharmacia), Toyopearl butyl 650 (Toso Haas, Montgomeryville, PA), Octyl-Sepharose (Pharmacia) and the like; or polyacrylic resins, such as Amberchrom CG
71 (Toso Haas) and the like. Suitable solid supports include glass beads, silica-based resins, cellulosic resins, agarose beads, cross-linked agarose beads, polystyrene beads, cross-linked polyacrylamide resins and the like that are insoluble under the conditions in which they are to be used. These supports may be modified with reactive groups that allow attachment of proteins by amino groups, carboxyl groups, sulfhydryl groups, hydroxyl groups and/or carbohydrate moieties. Examples of coupling chemistries include cyanogen bromide activation, N-hydroxysuccinimide activation, epoxide activation, sulfhydryl activation, hydrazide activation, and carboxyl and amino derivatives for carbodiimide coupling chemistries. These and other solid media are well known and widely used in the art, and are available from commercial suppliers. Methods for binding receptor polypeptides to support media are well known in the art.
Selection of a particular method is a matter of routine design and is determined in part by the properties of the chosen support. See, for example, Affinity Chromatography: Principles & Methods, Pharmacia LKB
Biotechnology, Uppsala, Sweden, 1988.
The polypeptides of the present invention can be isolated by exploitation of their binding properties. For example, immobilized metal ion adsorption (IMAC) chromatography can be used to purify histidine-rich proteins, including those comprising polyhistidine tags.
Briefly, a gel is first charged with divalent metal ions to form a chelate (Sulkowski, Trends in Biochem. 3:1-7, 1985). Histidine-rich proteins will be adsorbed to this matrix with differing affinities, depending upon the metal ion used, and will be eluted by competitive elution, lowering the pH, or use of strong chelating agents. Other methods of purification include purification of glycosylated proteins by lectin affinity chromatography and ion exchange chromatography (Methods in Enzymol., Vol.
182, "Guide to Protein Purification", M. Deutscher, (ed.), Acad. Press, San Diego, 1990, pp.529-39). Within additional embodiments of the invention, a fusion of the polypeptide of interest and an affinity tag (e.g., Glu-Glu tag) may be constructed to facilitate purification.
ZTMPO-1 polypeptides or fragments thereof may also be prepared through chemical synthesis according to methods known in the art, including exclusive solid phase synthesis, partial solid phase methods, fragment condensation or classical solution synthesis. See, for example, Merrifield, J. Am. Chem. Soc. 85:2149, 1963.
Using methods known in the art, ZTMPO-1 polypeptides may be prepared as monomers or multimers;
glycosylated or non-glycosylated; pegylated or non-pegylated; and may or may not include an initial methionine amino acid residue.
An in vivo approach for assaying proteins of the present invention involves viral delivery systems.
Exemplary viruses for this purpose include adenovirus, herpesvirus, vaccinia virus and adeno-associated virus (AAV). Adenovirus, a double-stranded DNA virus, is currently the best studied gene transfer vector for delivery of heterologous nucleic acid (for a review, see Becker et al., Meth. Cell Biol. 43:161-89, 1994; and Douglas and Curiel, Science & Medicine 4:44-53). The adenovirus system offers several advantages: adenovirus can (i) accommodate relatively large DNA inserts; (ii) be grown to high-titer; (iii) infect a broad range of mammalian cell types; and (iv) be used with a large number of available vectors containing different promoters.
Also, because adenoviruses are stable in the bloodstream, they can be administered by intravenous injection.
By deleting portions of the adenovirus genome, larger inserts (up to 7 kb) of heterologous DNA can be accommodated. These inserts may be incorporated into the viral DNA by direct ligation or by homologous recombination with a co-transfected plasmid. In an exemplary system, the essential E1 gene has been deleted from the viral vector, and the virus will not replicate unless the E1 gene is provided by the host cell (the human 293 cell line is exemplary). When intravenously administered to intact animals, adenovirus primarily targets the liver. If the adenoviral delivery system has an E1 gene deletion, the virus cannot replicate in the host cells. However, the host's tissue (e.g., liver) will express and process (and, if a secretory signal sequence is present, secrete) the heterologous protein. Secreted proteins will enter the circulation in the highly vascularized liver, and effects on the infected animal can be determined.
The adenovirus system can also be used for protein production in vitro. By culturing adenovirus-infected non-293 cells under conditions where the cells are not rapidly dividing, the cells can produce proteins for extended periods of time. For instance, BHK cells are grown to confluence in cell factories, then exposed to the adenoviral vector encoding the secreted protein of interest. The cells are then grown under serum-free conditions, which allows infected cells to survive for several weeks without significant cell division.
Alternatively, adenovirus vector infected 293S cells can be grown in suspension culture at relatively high cell density to produce significant amounts of protein (see Garnier et al., Cytotechnol. 15:145-55, 1994). With either protocol, an expressed, secreted heterologous protein can be repeatedly isolated from the cell culture supernatant. Within the infected 293S cell production protocol, non-secreted proteins may also be effectively obtained.
The broad tissue distribution of ZTMPO-1 suggests it may play a critical role in biological processes of an organism and as such altered expression of ZTMPO-1 is likely involved in numerous pathologies associated with genetic and other human disease states, in particular those related to immunological, reproductive, cardiac and muscle pathologies, such as diabetes, muscular dystrophies, hematopoietic disorders, immune disorders, leukemias, hypertension and cardiac disorders and diseases. ZTMPO-1 polypeptides, agonists and antagonists have potential in both in vitro and in vivo applications.
ZTMPO-1 is expressed ubiquitously, and many of the tissues in which it is expressed are characterized by a high rate of cellular proliferation. ZTMPO-1 polypeptides would find use as regulators of cellular proliferation and/or differentiation. Proliferation and differentiation can be measured using cultured cells or in vivo by administering molecules of the present invention to the appropriate animal model. Suitable cultured cells include, but are not limited to, testicular, muscle, lymphatic and tumor cell lines, which are all readily available to one skilled in the art from such sources as American Type Culture Collection, Rockville, MD. In particular, proliferation can be measured using cultured cardiac cells or in vivo by administering molecules of the present invention to the appropriate animal model. Generally, proliferative effects are seen as an increase in cell number, and may include inhibition of apoptosis as well as stimulation of mitogenesis. Cultured cells for use in these assays include cardiac fibroblasts, cardiac myocytes, skeletal myocytes, and human umbilical vein endothelial cells from primary cultures. Suitable established cell lines include: NIH 3T3 fibroblasts (ATCC No. CRL-1658), CHH-1 chum heart cells (ATCC No. CRL-1680), H9c2 rat heart myoblasts (ATCC No. CRL-1446), Shionogi mammary carcinoma cells (Tanaka et al., Proc. Natl. Acad. Sci. 89:8928-32, 1992), and LNCap.FGC adenocarcinoma cells (ATCC No. CRL-1740). Cultured testicular cells include dolphin DB1.Tes cells (ATCC No. CRL-6258); mouse GC-1 spg cells (ATCC No.
CRL-2053); TM3 cells (ATCC No. CRL-1714); TM4 cells (ATCC
No. CRL-1715); and pig ST cells (ATCC No. CRL-1746).
Mouse skeletal muscle (ATCC No. CRL-2174), human muscle (ATCC No. CRL-7522), Raji (Burkitt's human lymphoma, ATCC No. CCL86), Ramos (Burkitt's lymphoma cell line, ATCC
No. CRL-1596), Daudi (Burkitt's human lymphoma, ATCC No.
CCL213) and RPMI 1788 (a B lymphocyte cell line, CCL-156) cells are all available from American Type Culture Collection, 10801 University Boulevard, Manassas, VA 20110-2209. Assays measuring cell proliferation are well known in the art. For example, assays measuring proliferation include chemosensitivity to neutral red dye (Cavanaugh et al., Investigational New Drugs 8:347-54, 1990), incorporation of radiolabelled nucleotides (Cook et al., Analytical Biochem. 179:1-7, 1989), incorporation of 5-bromo-2'-deoxyuridine (BrdU) in the DNA of proliferating cells (Porstmann et al., J. Immunol. Methods 82:169-79, 1985), and use of tetrazolium salts (Mosmann, J. Immunol. Methods 65:55-63, 1983; Alley et al., Cancer Res. 48:589-601, 1988; Marshall et al., Growth Reg. 5:69-84, 1995; and Scudiero et al., Cancer Res. 48:4827-33, 1988).
Additional methods can be found in the art, for example, Current Protocols in Molecular Biology, John Wiley and Sons, Inc., NY, 1997.
Assays measuring differentiation include, for example, measuring cell-surface markers associated with stage-specific expression of a tissue, enzymatic activity, functional activity or morphological changes (Watt, FASEB, 5:281-4, 1991; Francis, Differentiation 57:63-75, 1994;
Raes, Adv. Anim. Cell Biol. Technol. Bioprocesses, 161-71, 1989). Bioassays and ELISAs are available to measure cellular response to ZTMPO-1, in particular those which measure changes in cytokine production as a measure of cellular response (see, for example, Current Protocols in Immunology, ed. John E. Coligan et al., NIH, 1996).
In vivo assays for evaluating cardiac neogenesis or hyperplasia include treating neonatal and mature rats with the molecules of the present invention. The animals' cardiac function is measured as heart rate, blood pressure, and cardiac output to determine left ventricular function. Post-mortem methods for assessing cardiac decline or improvement include:
increased or decreased cardiac weight, nuclei/cytoplasmic volume, and staining of cardiac histology sections to determine proliferating cell nuclear antigen (PCNA) vs.
cytoplasmic actin levels (Quaini et al., Circulation Res.
75:1050-63, 1994 and Reiss et al., Proc. Natl. Acad. Sci. 93:8630-5, 1996).
Cardiac defects related to conduction have been reported in patients having a deleted emerin gene (Emery, J. Med. Genet. 26:637-41, 1989). The resulting cardiac conduction defect is life threatening in these patients.
Defects in the intrinsic conduction system can cause irregularities in the heart rhythm, such as arrhythmia and fibrillation. Tissue distribution and sequence similarities between emerin and ZTMPO-1 suggest that ZTMPO-1 may be involved in re-polarization of cardiac cell membranes. Localization of emerin to the desmosomes and fasciae adherentes suggests that association with the connection between epithelial cells accounts for the cardiac conduction defect when the gene is absent.
ZTMPO-1 polypeptides and antagonists may influence cell-cell communication, either independently, or in conjunction with other proteins, such as emerin, and may regulate messages between cell membranes. To verify the presence of this capability in ZTMPO-1 polypeptides, agonists or antagonists of the present invention, such ZTMPO-1 polypeptides, agonists or antagonists are evaluated with respect to their ability to modulate cardiac conductance according to procedures known in the art. If desired, ZTMPO-1 polypeptide performance in this regard can be compared to emerin and may be evaluated in combination with emerin to identify synergistic effects.
With respect to cardiac conductance, a resulting increase or decrease is measured by assessing voltage-dependent conductance, sodium or calcium ion flux in an appropriate assay system known in the art. Changes in the voltage conductance or in indicator substrates reflect the activities of ZTMPO-1 polypeptides in enhancing or inhibiting cardiac conductance relative to a control not subjected to treatment. An electrocardiograph is used to monitor the electrical currents generated and transmitted through the heart. Changes in the electrocardiogram (ECG) tracing (wave pattern and/or timing) would indicate an alteration in the heart's conduction system. Therefore a return to a normal ECG pattern following ZTMPO-1 administration would indicate a re-establishment of a regular heart rhythm.
The invention also provides isolated and purified ZTMPO-1 polynucleotide probes or primers. Such polynucleotide probes can be RNA or DNA. DNA can be either cDNA or genomic DNA. Polynucleotide probes are single or double-stranded DNA or RNA, generally synthetic oligonucleotides, but may be generated from cloned cDNA or genomic sequences and will generally comprise at least 16 nucleotides, more often from 17 nucleotides to 25 or more nucleotides, sometimes 40 to 60 nucleotides, and in some instances a substantial portion, domain or even the entire ZTMPO-1 gene or cDNA. Probes and primers are generally synthetic oligonucleotides, but may be generated from cloned cDNA or genomic sequences or its complements.
Analytical probes will generally be at least 20 nucleotides in length, although somewhat shorter probes (14-17 nucleotides) can be used. PCR primers are at least 5 nucleotides in length, preferably 15 or more nucleotides, more preferably 20-30 nucleotides. Short polynucleotides can be used when a small region of the gene is targeted for analysis. For gross analysis of genes, a polynucleotide probe may comprise an entire exon or more. Probes can be labeled to provide a detectable signal, such as with an enzyme, biotin, a radionuclide, fluorophore, chemiluminescer, paramagnetic particle and the like, which are commercially available from many sources, such as Molecular Probes, Inc., Eugene, OR, and Amersham Corp., Arlington Heights, IL, using techniques that are well known in the art. Preferred regions from which to construct probes include regions of homology with other thymopoietins and emerin as described herein, the ankyrin-like region, the calcium binding protein-like region, the signal sequence, and the like. Techniques for developing polynucleotide probes and hybridization techniques are known in the art, see for example, Ausubel et al., eds., Current Protocols in Molecular Biology, John Wiley and Sons, Inc., NY, 1991.
ZTMPO-1 polypeptides may be used within diagnostic systems to detect the presence of ZTMPO-1. The information derived from such detection methods would provide insight into the significance of ZTMPO-1 polypeptides in various diseases, and would serve as diagnostic tools for diseases for which altered levels of ZTMPO-1 are significant. Altered levels of ZTMPO-1 receptor polypeptides may be indicative of pathological conditions including cancer, cardiac and autoimmune disorders and infectious diseases.
In a basic assay, a single-stranded probe molecule is incubated with RNA, isolated from a biological sample, under conditions of temperature and ionic strength that promote base pairing between the probe and target ZTMPO-1 RNA species. After separating unbound probe from hybridized molecules, the amount of hybrids is detected.
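The base pairing that underlies this assay can be illustrated with a short sketch that derives an antisense DNA probe from a target RNA segment. The example sequence is hypothetical and is not taken from the ZTMPO-1 sequence; the function name is illustrative only.

```python
# Sketch: build a single-stranded DNA probe complementary to a target RNA.
# The sequence used below is a made-up example, not ZTMPO-1 sequence.

_DNA_BASE_PAIRING_WITH_RNA = {"A": "T", "U": "A", "G": "C", "C": "G"}


def antisense_dna_probe(target_rna: str) -> str:
    """Return the DNA probe (written 5'->3') that base-pairs with
    target_rna (written 5'->3'), i.e. its reverse complement."""
    bases = target_rna.upper()
    unknown = set(bases) - set(_DNA_BASE_PAIRING_WITH_RNA)
    if unknown:
        raise ValueError(f"unexpected RNA bases: {sorted(unknown)}")
    return "".join(_DNA_BASE_PAIRING_WITH_RNA[b] for b in reversed(bases))


if __name__ == "__main__":
    print(antisense_dna_probe("AUGGCC"))  # hypothetical 6-base target
```

Only the antisense strand hybridizes to the RNA target, which is why strand orientation matters when designing a single-stranded probe.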
Well-established hybridization methods of RNA
detection include northern analysis and dot/slot blot hybridization (see, for example, Ausubel ibid. and Wu et al. (eds.), "Analysis of Gene Expression at the RNA
Level," in Methods in Gene Biotechnology, pages 225-239 (CRC Press, Inc. 1997)). Nucleic acid probes can be detectably labeled with radioisotopes such as 32P or 35S.
Alternatively, ZTMPO-1 RNA can be detected with a nonradioactive hybridization method (see, for example, Isaac (ed.), Protocols for Nucleic Acid Analysis by Nonradioactive Probes, Humana Press, Inc., 1993).
Typically, nonradioactive detection is achieved by enzymatic conversion of chromogenic or chemiluminescent substrates. Illustrative nonradioactive moieties include biotin, fluorescein, and digoxigenin.
ZTMPO-1 oligonucleotide probes are also useful for in vivo diagnosis. As an illustration, 18F-labeled oligonucleotides can be administered to a subject and visualized by positron emission tomography (Tavitian et al., Nature Medicine 4:467, 1998).
Numerous diagnostic procedures take advantage of the polymerase chain reaction (PCR) to increase sensitivity of detection methods. Standard techniques for performing PCR are well-known (see, generally, Mathew (ed.), Protocols in Human Molecular Genetics (Humana Press, Inc. 1991), White (ed.), PCR Protocols: Current Methods and Applications (Humana Press, Inc. 1993), Cotter (ed.), Molecular Diagnosis of Cancer (Humana Press, Inc.
1996), Hanausek and Walaszek (eds.), Tumor Marker Protocols (Humana Press, Inc. 1998), Lo (ed.), Clinical Applications of PCR (Humana Press, Inc. 1998), and Meltzer (ed.), PCR in Bioanalysis (Humana Press, Inc. 1998)).
PCR primers can be designed to amplify a sequence encoding a particular ZTMPO-1 domain or region of homology as described herein.
One variation of PCR for diagnostic assays is reverse transcriptase-PCR (RT-PCR). In the RT-PCR
technique, RNA is isolated from a biological sample, reverse transcribed to cDNA, and the cDNA is incubated with ZTMPO-1 primers (see, for example, Wu et al. (eds.), "Rapid Isolation of Specific cDNAs or Genes by PCR," in Methods in Gene Biotechnology, CRC Press, Inc., pages 15-28, 1997). PCR is then performed and the products are analyzed using standard techniques.
As an illustration, RNA is isolated from a biological sample using, for example, the guanidinium-thiocyanate cell lysis procedure described above.
Alternatively, a solid-phase technique can be used to isolate mRNA from a cell lysate. A reverse transcription reaction can be primed with the isolated RNA using random oligonucleotides, short homopolymers of dT, or ZTMPO-1 anti-sense oligomers. Oligo-dT primers offer the advantage that various mRNA nucleotide sequences are amplified that can provide control target sequences.
ZTMPO-1 sequences are amplified by the polymerase chain reaction using two flanking oligonucleotide primers that are typically at least 5 bases in length.
PCR amplification products can be detected using a variety of approaches. For example, PCR products can be fractionated by gel electrophoresis, and visualized by ethidium bromide staining. Alternatively, fractionated PCR products can be transferred to a membrane, hybridized with a detectably-labeled ZTMPO-1 probe, and examined by autoradiography. Additional alternative approaches include the use of digoxigenin-labeled deoxyribonucleic acid triphosphates to provide chemiluminescence detection, and the C-TRAK colorimetric assay.
Another approach is real time quantitative PCR
(Perkin-Elmer Cetus, Norwalk, CT). A fluorogenic probe, consisting of an oligonucleotide with both a reporter and a quencher dye attached, anneals specifically between the forward and reverse primers. Using the 5' endonuclease activity of Taq DNA polymerase, the reporter dye is separated from the quencher dye and a sequence-specific signal is generated and increases as amplification increases. The fluorescence intensity can be continuously monitored and quantified during the PCR reaction.
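Because the fluorescence is read at every cycle, the monitored signal is commonly reduced to the cycle at which it first crosses a chosen threshold. The sketch below is illustrative only: the threshold value, the linear interpolation between cycles, and the fluorescence readings are assumptions, and commercial instruments may use different baseline and threshold algorithms.

```python
# Sketch: estimate a threshold cycle from per-cycle fluorescence readings.
# Assumptions (not from the source): user-chosen threshold, linear
# interpolation between adjacent cycles, readings indexed from cycle 1.
from typing import Optional, Sequence


def threshold_cycle(fluorescence: Sequence[float], threshold: float) -> Optional[float]:
    """Return the fractional cycle number at which the signal first reaches
    the threshold, or None if it never does.
    fluorescence[0] is the reading taken after cycle 1."""
    if not fluorescence:
        return None
    if fluorescence[0] >= threshold:
        return 1.0
    for i in range(1, len(fluorescence)):
        prev, cur = fluorescence[i - 1], fluorescence[i]
        if prev < threshold <= cur:
            # prev was read after cycle i, cur after cycle i + 1
            return i + (threshold - prev) / (cur - prev)
    return None


if __name__ == "__main__":
    readings = [1.0, 2.0, 4.0, 8.0, 16.0]  # made-up doubling signal
    print(threshold_cycle(readings, 6.0))
```

A lower threshold cycle indicates more starting template, which is what makes the continuously monitored signal quantitative.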
Another approach for detection of ZTMPO-1 expression is cycling probe technology (CPT), in which a single-stranded DNA target binds with an excess of
DNA-RNA-DNA chimeric probe to form a complex, the RNA portion is cleaved with RNase H, and the presence of cleaved chimeric probe is detected (see, for example, Beggs et al., J. Clin. Microbiol. 34:2985, 1996 and Bekkaoui et al., Biotechniques 20:240, 1996). Alternative methods for detection of ZTMPO-1 sequences can utilize approaches such as nucleic acid sequence-based amplification (NASBA), cooperative amplification of templates by cross-hybridization (CATCH), and the ligase chain reaction (LCR) (see, for example, Marshall et al., U.S. Patent No.
5,686,272 (1997), Dyer et al., J. Virol. Methods 60:161, 1996; Ehricht et al., Eur. J. Biochem. 243:358, 1997 and Chadwick et al., J. Virol. Methods 70:59, 1998). Other standard methods are known to those of skill in the art.
ZTMPO-1 probes and primers can also be used to detect and to localize ZTMPO-1 gene expression in tissue samples. Methods for such in situ hybridization are well known to those of skill in the art (see, for example, Choo (ed.), In Situ Hybridization Protocols, Humana Press, Inc., 1999; Wu et al. (eds.), "Analysis of Cellular DNA or Abundance of mRNA by Radioactive In Situ Hybridization (RISH)," in Methods in Gene Biotechnology, CRC Press, Inc., pages 259-278, 1997 and Wu et al. (eds.), "Localization of DNA or Abundance of mRNA by Fluorescence In Situ Hybridization (FISH)," in Methods in Gene Biotechnology, CRC Press, Inc., pages 279-289, 1997).
Various additional diagnostic approaches are well-known to those of skill in the art (see, for example, Mathew (ed.), Protocols in Human Molecular Genetics, Humana Press, Inc., 1991; Coleman and Tsongalis, Molecular Diagnostics, Humana Press, Inc., 1996 and Elles, Molecular Diagnosis of Genetic Diseases, Humana Press, Inc., 1996).
The invention also provides antagonists or inhibitors of ZTMPO-1 activity. Such antagonists would include anti-ZTMPO-1 antibodies, soluble ZTMPO-1 receptors, as well as other peptidic and non-peptidic agents (including ribozymes). Such antagonists would have use as research reagents for characterizing sites of ligand-receptor interaction. Antagonists would also find use in modulating cellular proliferation and differentiation such as in tumor growth and development.
High levels of expression of ZTMPO-1 in testis tissue suggest a role in spermatogenesis. These ZTMPO-1 antagonists would be useful for inhibiting spermatogenesis and sperm activation. Such ZTMPO-1 antagonists can be used for contraception in humans and animals, and in particular, domestic and zoological animals and livestock, where they would act to prevent fertilization of an egg.
Such ZTMPO-1 antagonists could be used, for instance, in place of surgical forms of contraception (such as spaying and neutering), and would allow for the possibility of future breeding of treated animals if desired. ZTMPO-1 antagonists could also be used to mediate immune response, for instance by boosting the humoral response in individuals at risk for an infectious disease or as a supplement to vaccination.
ZTMPO-1 can be used to identify inhibitors (antagonists) of its activity. Test compounds are transfected into cells or possibly added to the assays disclosed herein to identify compounds that inhibit the activity of ZTMPO-1. In addition to those assays disclosed herein, samples can be tested for inhibition of ZTMPO-1 activity within a variety of assays designed to measure receptor binding or the stimulation/inhibition of ZTMPO-1-dependent cellular responses. For example, ZTMPO-1-responsive cell lines can be transfected with a reporter gene construct that is responsive to a ZTMPO-1-stimulated cellular pathway. Reporter gene constructs of this type are known in the art, and will generally comprise a ZTMPO-1-DNA response element operably linked to a gene encoding an assayable protein, such as luciferase. DNA response elements can include, but are not limited to, cyclic AMP
response elements (CRE), hormone response elements (HRE), insulin response element (IRE) (Nasrin et al., Proc. Natl.
Acad. Sci. USA 87:5273-7, 1990) and serum response elements (SRE) (Shaw et al. Cell 56: 563-72, 1989).
Cyclic AMP response elements are reviewed in Roesler et al., J. Biol. Chem. 263: 9063-6; 1988 and Habener, Molec.
Endocrinol. 4:1087-94; 1990. Hormone response elements are reviewed in Beato, Cell 56:335-44; 1989. Candidate compounds, solutions, mixtures or extracts are tested for the ability to inhibit the activity of ZTMPO-1 on the target cells as evidenced by a decrease in ZTMPO-1 stimulation of reporter gene expression. Assays of this type will detect compounds that directly block ZTMPO-1 binding to cell-surface receptors, as well as compounds that block processes in the cellular pathway subsequent to receptor-ligand binding. In the alternative, compounds or other samples can be tested for direct blocking of ZTMPO-1 binding to receptor using ZTMPO-1 tagged with a detectable label (e.g., 125I, biotin, horseradish peroxidase, FITC, and the like). Within assays of this type, the ability of a test sample to inhibit the binding of labeled ZTMPO-1 to the receptor is indicative of inhibitory activity, which can be confirmed through secondary assays. Receptors used within binding assays may be cellular receptors or isolated, immobilized receptors.
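The decrease in ZTMPO-1-stimulated reporter expression can be expressed as a percent inhibition. The following is a minimal sketch under stated assumptions: the readout is a background-corrected luciferase signal in arbitrary units, and the function name and example values are hypothetical rather than taken from this disclosure.

```python
# Sketch: percent inhibition of a stimulated reporter signal by a test compound.
# Assumptions (not from the source): background-subtracted luminescence,
# made-up example readings.

def percent_inhibition(stimulated: float, with_compound: float,
                       background: float = 0.0) -> float:
    """Percent decrease of the stimulated reporter signal caused by a compound.

    stimulated    -- reporter signal with stimulation alone
    with_compound -- reporter signal with stimulation plus test compound
    background    -- signal from unstimulated cells
    """
    window = stimulated - background
    if window <= 0:
        raise ValueError("stimulated signal must exceed background")
    return 100.0 * (1.0 - (with_compound - background) / window)


if __name__ == "__main__":
    # hypothetical luminescence readings
    print(percent_inhibition(stimulated=1000.0, with_compound=400.0, background=200.0))
```

A compound that scores well here could act at receptor binding or anywhere downstream in the pathway, which is why the text calls for confirmation in secondary assays.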
ZTMPO-1 polypeptides can also be used to prepare antibodies that specifically bind to ZTMPO-1 epitopes, peptides or polypeptides. The ZTMPO-1 polypeptide or a fragment thereof serves as an antigen (immunogen) to inoculate an animal and elicit an immune response.
Suitable antigens would be the ZTMPO-1 polypeptide encoded by SEQ ID NO:2 from amino acid number 1 to amino acid number 876, or contiguous 9 to 25 amino acid residue fragments thereof. Antibodies generated from this immune response can be isolated and purified as described herein.
Methods for preparing and isolating polyclonal and monoclonal antibodies are well known in the art. See, for example, Current Protocols in Immunology, Coligan, et al.
(eds.), National Institutes of Health, John Wiley and Sons, Inc., 1995; Sambrook et al., Molecular Cloning: A
Laboratory Manual, Second Edition, Cold Spring Harbor, NY, 1989; and Hurrell, (Ed.), Monoclonal Hybridoma Antibodies:
Techniques and Applications, CRC Press, Inc., Boca Raton, FL, 1982.
As would be evident to one of ordinary skill in the art, polyclonal antibodies can be generated from inoculating a variety of warm-blooded animals such as horses, cows, goats, sheep, dogs, chickens, rabbits, mice, and rats with a ZTMPO-1 polypeptide or a fragment thereof.
The immunogenicity of a ZTMPO-1 polypeptide may be increased through the use of an adjuvant, such as alum (aluminum hydroxide) or Freund's complete or incomplete adjuvant. Polypeptides useful for immunization also include fusion polypeptides, such as fusions of ZTMPO-1 or a portion thereof with an immunoglobulin polypeptide or with maltose binding protein. The polypeptide immunogen may be a full-length molecule or a portion thereof. If the polypeptide portion is "hapten-like", such portion may be advantageously joined or linked to a macromolecular carrier (such as keyhole limpet hemocyanin (KLH), bovine serum albumin (BSA) or tetanus toxoid) for immunization.
As used herein, the term "antibodies" includes polyclonal antibodies, affinity-purified polyclonal antibodies, monoclonal antibodies, and antigen-binding fragments, such as F(ab')2 and Fab proteolytic fragments.
Genetically engineered intact antibodies or fragments, such as chimeric antibodies, Fv fragments, single chain antibodies and the like, as well as synthetic antigen-binding peptides and polypeptides, are also included.
Non-human antibodies may be humanized by grafting non-human CDRs onto human framework and constant regions, or by incorporating the entire non-human variable domains (optionally "cloaking" them with a human-like surface by replacement of exposed residues, wherein the result is a "veneered" antibody). In some instances, humanized antibodies may retain non-human residues within the human variable region framework domains to enhance proper binding characteristics. Through humanizing antibodies, biological half-life may be increased, and the potential for adverse immune reactions upon administration to humans is reduced.
Alternative techniques for generating or selecting antibodies useful herein include in vitro exposure of lymphocytes to ZTMPO-1 protein or peptide, and selection of antibody display libraries in phage or similar vectors (for instance, through use of immobilized or labeled ZTMPO-1 protein or peptide). Genes encoding polypeptides having potential ZTMPO-1 polypeptide binding domains can be obtained by screening random peptide libraries displayed on phage (phage display) or on bacteria, such as E. coli. Nucleotide sequences encoding the polypeptides can be obtained in a number of ways, such as through random mutagenesis and random polynucleotide synthesis. These random peptide display libraries can be used to screen for peptides which interact with a known target which can be a protein or polypeptide, such as a ligand or receptor, a biological or synthetic macromolecule, or organic or inorganic substances.
Techniques for creating and screening such random peptide display libraries are known in the art (Ladner et al., US Patent NO. 5,223,409; Ladner et al., US Patent NO. 4,946,778; Ladner et al., US Patent NO. 5,403,484 and Ladner et al., US Patent NO. 5,571,698) and random peptide display libraries and kits for screening such libraries are available commercially, for instance from Clontech (Palo Alto, CA), Invitrogen Inc. (San Diego, CA), New England Biolabs, Inc. (Beverly, MA) and Pharmacia LKB Biotechnology Inc. (Piscataway, NJ). Random peptide display libraries can be screened using the ZTMPO-1 sequences disclosed herein to identify proteins which bind to ZTMPO-1. These "binding proteins" which interact with ZTMPO-1 polypeptides can be used for tagging cells and for isolating homolog polypeptides by affinity purification; they can also be directly or indirectly conjugated to drugs, toxins, radionuclides and the like. These binding proteins can also be used in analytical methods such as for screening expression libraries and neutralizing activity. The binding proteins can also be used in diagnostic assays for determining circulating levels of polypeptides, and for detecting or quantitating soluble polypeptides as a marker of underlying pathology or disease.
These binding proteins can also act as ZTMPO-1 "antagonists" to block ZTMPO-1 binding and signal transduction in vitro and in vivo. These anti-ZTMPO-1 binding proteins would be useful for inhibiting binding.
Antibodies are determined to be specifically binding if: 1) they exhibit a threshold level of binding activity, and/or 2) they do not significantly cross-react with related polypeptide molecules. First, antibodies herein specifically bind if they bind to a ZTMPO-1 polypeptide, peptide or epitope with a binding affinity (Ka) of 10⁶ M⁻¹ or greater, preferably 10⁷ M⁻¹ or greater, more preferably 10⁸ M⁻¹ or greater, and most preferably 10⁹ M⁻¹ or greater. The binding affinity of an antibody can be readily determined by one of ordinary skill in the art, for example, by Scatchard analysis (Scatchard, Ann. NY Acad. Sci. 51: 660-72, 1949).
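For readers unfamiliar with Scatchard analysis, the following Python sketch shows the underlying arithmetic for the simplest case of a single class of independent binding sites. Under that assumption, bound/free plotted against bound is a straight line, B/F = Ka(Bmax - B), so the slope is -Ka and the x-intercept is Bmax. The function names, the synthetic data and the single-site assumption are illustrative only and do not come from the specification.

```python
def scatchard_fit(bound, free):
    """Estimate (Ka, Bmax) from paired bound and free ligand concentrations (molar).

    Fits an ordinary least-squares line to B/F versus B.
    Assumes one class of independent sites: B/F = Ka * (Bmax - B).
    """
    if len(bound) != len(free) or len(bound) < 2:
        raise ValueError("need at least two paired measurements")
    x = list(bound)
    y = [b / f for b, f in zip(bound, free)]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    ka = -slope            # association constant, M^-1
    bmax = intercept / ka  # total binding-site concentration, M
    return ka, bmax


def meets_threshold(ka, threshold=1e6):
    """True if Ka is at or above the stated threshold (default 10^6 M^-1)."""
    return ka >= threshold


def synthetic_free(bound, ka, bmax):
    """Free ligand concentrations consistent with the single-site model (test data only)."""
    return [b / (ka * (bmax - b)) for b in bound]


if __name__ == "__main__":
    bound = [1e-10, 3e-10, 5e-10, 7e-10, 9e-10]          # hypothetical, molar
    free = synthetic_free(bound, ka=1e8, bmax=1e-9)       # hypothetical, molar
    ka, bmax = scatchard_fit(bound, free)
    print(ka, bmax, meets_threshold(ka))
```

Real binding data are noisy and may show curvature from multiple site classes, in which case a straight-line fit of this kind is only a first approximation.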
Second, antibodies are determined to specifically bind if they do not significantly cross-react with related polypeptides. Antibodies do not significantly cross-react with related polypeptide molecules, for example, if they detect ZTMPO-1 but not known related polypeptides using a standard Western blot analysis (Ausubel et al., ibid.). Examples of known related polypeptides are those disclosed in the prior art, such as known orthologs and paralogs, and similar known members of a protein family. Moreover, antibodies may be "screened against" known related polypeptides, such as non-human ZTMPO-1 and ZTMPO-1 mutant polypeptides, to isolate a population that specifically binds to the inventive polypeptides. For example, antibodies raised to ZTMPO-1 are adsorbed to related polypeptides adhered to an insoluble matrix; antibodies specific to ZTMPO-1 will flow through the matrix under the proper buffer conditions.
Such screening allows isolation of polyclonal and monoclonal antibodies non-crossreactive to closely related polypeptides (Antibodies: A Laboratory Manual, Harlow and Lane (eds.), Cold Spring Harbor Laboratory Press, 1988; Current Protocols in Immunology, Cooligan, et al. (eds.), National Institutes of Health, John Wiley and Sons, Inc., 1995). Screening and isolation of specific antibodies is well known in the art. See, Fundamental Immunology, Paul (ed.), Raven Press, 1993; Getzoff et al., Adv. in Immunol. 43: 1-98, 1988; Monoclonal Antibodies: Principles and Practice, Goding, J.W. (ed.), Academic Press Ltd., 1996; Benjamin et al., Ann. Rev. Immunol. 2: 67-101, 1984.
A variety of assays known to those skilled in the art can be utilized to detect antibodies and binding proteins which specifically bind to ZTMPO-1 proteins or peptides. Exemplary assays are described in detail in Antibodies: A Laboratory Manual, Harlow and Lane (Eds.), Cold Spring Harbor Laboratory Press, 1988. Representative examples of such assays include: concurrent immunoelectrophoresis, radioimmunoassay, radioimmunoprecipitation, enzyme-linked immunosorbent assay (ELISA), dot blot or Western blot assay, inhibition or competition assay, and sandwich assay. In addition, antibodies can be screened for binding to wild-type versus mutant ZTMPO-1 protein or polypeptide.
Antibodies to ZTMPO-1 may be used for tagging cells that express ZTMPO-1; for isolating ZTMPO-1 by affinity purification; for diagnostic assays for determining circulating levels of ZTMPO-1 polypeptides; for detecting or quantitating soluble ZTMPO-1 as a marker of underlying pathology or disease; in analytical methods employing FACS; for screening expression libraries; for generating anti-idiotypic antibodies; and as neutralizing antibodies or as antagonists to block ZTMPO-1 binding in vitro and in vivo. Suitable direct tags or labels include radionuclides, enzymes, substrates, cofactors, inhibitors, fluorescent markers, chemiluminescent markers, magnetic particles and the like; indirect tags or labels may feature use of biotin-avidin or other complement/anti-complement pairs as intermediates. Antibodies herein may also be directly or indirectly conjugated to drugs, toxins, radionuclides and the like, and these conjugates used for in vivo diagnostic or therapeutic applications.
Moreover, antibodies to ZTMPO-1 or fragments thereof may be used in vitro to detect denatured ZTMPO-1 or fragments thereof in assays, for example, Western Blots or other assays known in the art.
Antibodies or polypeptides herein may also be directly or indirectly conjugated to drugs, toxins, radionuclides and the like, and these conjugates used for in vivo diagnostic or therapeutic applications. For instance, polypeptides or antibodies of the present invention may be used to identify or treat tissues or organs that express a corresponding anti-complementary molecule (receptor or antigen, respectively, for instance). More specifically, ZTMPO-1 polypeptides or anti-ZTMPO-1 antibodies, or bioactive fragments or portions thereof, can be coupled to detectable or cytotoxic molecules and delivered to a mammal having cells, tissues or organs that express the anti-complementary molecule.
Suitable detectable molecules may be directly or indirectly attached to the polypeptide or antibody, and include radionuclides, enzymes, substrates, cofactors, inhibitors, fluorescent markers, chemiluminescent markers, magnetic particles and the like. Suitable cytotoxic molecules may be directly or indirectly attached to the polypeptide or antibody, and include bacterial or plant toxins (for instance, diphtheria toxin, Pseudomonas exotoxin, ricin, abrin and the like), as well as therapeutic radionuclides, such as iodine-131, rhenium-188 or yttrium-90 (either directly attached to the polypeptide or antibody, or indirectly attached through means of a chelating moiety, for instance). Polypeptides or antibodies may also be conjugated to cytotoxic drugs, such as adriamycin. For indirect attachment of a detectable or cytotoxic molecule, the detectable or cytotoxic molecule may be conjugated with a member of a complementary/anticomplementary pair, where the other member is bound to the polypeptide or antibody portion. For these purposes, biotin/streptavidin is an exemplary complementary/anticomplementary pair.
Molecules of the present invention can be used to identify and isolate receptors involved in ZTMPO-1 binding. For example, proteins and peptides of the present invention can be immobilized on a column and membrane preparations run over the column (Immobilized Affinity Ligand Techniques, Hermanson et al., eds., Academic Press, San Diego, CA, 1992, pp.195-202).
Proteins and peptides can also be radiolabeled (Methods in Enzymol., vol. 182, "Guide to Protein Purification", M.
Deutscher, ed., Acad. Press, San Diego, 1990, 721-37) or photoaffinity labeled (Brunner et al., Ann. Rev. Biochem.
62:483-514, 1993 and Fedan et al., Biochem. Pharmacol.
33:1167-80, 1984) and specific cell-surface proteins can be identified.
The molecules of the present invention will be useful regulators in multiple cellular organisms. The molecules of the present invention may be used to modulate cellular proliferation and differentiation, for example spermatogenesis. In particular, certain proliferative disorders such as cancers may be amenable to such diagnosis, treatment or prevention. ZTMPO-1 would be useful in modulating the cell cycle, such as during differentiation or in rapidly proliferating cells such as those in tumor tissues. ZTMPO-1 would find application in a diverse array of tissues, such as testis, skeletal muscle, thyroid and adrenal gland, for example.
Polynucleotides encoding ZTMPO-1 polypeptides are useful within gene therapy applications where it is desired to increase or inhibit ZTMPO-1 activity. If a mammal has a mutated or absent ZTMPO-1 gene, the ZTMPO-1 gene can be introduced into the cells of the mammal. In one embodiment, a gene encoding a ZTMPO-1 polypeptide is introduced in vivo in a viral vector. Such vectors include an attenuated or defective DNA virus, such as, but not limited to, herpes simplex virus (HSV), papillomavirus, Epstein Barr virus (EBV), adenovirus, adeno-associated virus (AAV), and the like. Defective viruses, which entirely or almost entirely lack viral genes, are preferred. A defective virus is not infective after introduction into a cell. Use of defective viral vectors allows for administration to cells in a specific, localized area, without concern that the vector can infect other cells. Examples of particular vectors include, but are not limited to, a defective herpes simplex virus 1 (HSV1) vector (Kaplitt et al., Molec. Cell. Neurosci.
2:320-30, 1991); an attenuated adenovirus vector, such as the vector described by Stratford-Perricaudet et al., J.
Clin. Invest. 90:626-30, 1992; and a defective adeno-associated virus vector (Samulski et al., J. Virol. 61:3096-101, 1987; Samulski et al., J. Virol. 63:3822-8, 1989).
In another embodiment, a ZTMPO-1 gene can be introduced in a retroviral vector, e.g., as described in Anderson et al., U.S. Patent No. 5,399,346; Mann et al., Cell 33:153, 1983; Temin et al., U.S. Patent No. 4,650,764; Temin et al., U.S. Patent No. 4,980,289; Markowitz et al., J. Virol. 62:1120, 1988; Temin et al., U.S. Patent No. 5,124,263; International Patent Publication No. WO 95/07358, published March 16, 1995 by Dougherty et al.; and Kuo et al., Blood 82:845, 1993.
Alternatively, the vector can be introduced by lipofection in vivo using liposomes. Synthetic cationic lipids can be used to prepare liposomes for in vivo transfection of a gene encoding a marker (Felgner et al., Proc. Natl. Acad.
Sci. USA 84:7413-7, 1987; Mackey et al., Proc. Natl. Acad.
Sci. USA 85:8027-31, 1988). The use of lipofection to introduce exogenous genes into specific organs in vivo has certain practical advantages. Molecular targeting of liposomes to specific cells represents one area of benefit. More particularly, directing transfection to particular cells represents one area of benefit. For instance, directing transfection to particular cell types would be particularly advantageous in a tissue with cellular heterogeneity, such as the pancreas, liver, kidney, and brain. Lipids may be chemically coupled to other molecules for the purpose of targeting. Targeted peptides (e.g., hormones or neurotransmitters), proteins such as antibodies, or non-peptide molecules can be coupled to liposomes chemically.
It is possible to remove the target cells from the body; to introduce the vector as a naked DNA plasmid; and then to re-implant the transformed cells into the body. Naked DNA vectors for gene therapy can be introduced into the desired host cells by methods known in the art, e.g., transfection, electroporation, microinjection, transduction, cell fusion, DEAE dextran, calcium phosphate precipitation, use of a gene gun or use of a DNA vector transporter. See, e.g., Wu et al., J. Biol. Chem. 267:963-7, 1992; Wu et al., J. Biol. Chem. 263:14621-4, 1988.
The present invention also provides reagents for use in diagnostic applications. For example, the ZTMPO-1 gene, a probe comprising ZTMPO-1 DNA or RNA, or a subsequence thereof can be used to determine if the ZTMPO-1 gene is present on chromosome 12 or if a mutation has occurred. The emerin gene is not detected in samples from patients with Emery-Dreifuss muscular dystrophy, and is present in normal patients (Bione et al., Nat. Genet. 8:323-7, 1994 and Nagano et al., Nat. Genet. 12:254-9, 1996) and thus serves as a marker for the disease.
Detectable chromosomal aberrations at the ZTMPO-1 gene locus include, but are not limited to, aneuploidy, gene copy number changes, insertions, deletions, restriction site changes and rearrangements. These aberrations can occur within the coding sequence, within introns, or within flanking sequences, including upstream promoter and regulatory regions, and may be manifested as physical alterations within a coding sequence or changes in gene expression level.
In general, these diagnostic methods comprise the steps of (a) obtaining a genetic sample from a patient; (b) incubating the genetic sample with a polynucleotide probe or primer as disclosed above, under conditions wherein the polynucleotide will hybridize to complementary polynucleotide sequence, to produce a first reaction product; and (c) comparing the first reaction product to a control reaction product. A difference between the first reaction product and the control reaction product is indicative of a genetic abnormality in the patient. Genetic samples for use within the present invention include genomic DNA, cDNA, and RNA. The polynucleotide probe or primer can be RNA or DNA, and will comprise a portion of SEQ ID NO:1, the complement of SEQ ID NO:1, or an RNA equivalent thereof. Suitable assay methods in this regard include molecular genetic techniques known to those in the art, such as restriction fragment length polymorphism (RFLP) analysis, short tandem repeat (STR) analysis employing PCR techniques, ligation chain reaction (Barany, PCR Methods and Applications 1:5-16, 1991), ribonuclease protection assays, and other genetic linkage analysis techniques known in the art (Sambrook et al., ibid.; Ausubel et al., ibid.; Marian, Chest 108:255-65, 1995). Ribonuclease protection assays (see, e.g., Ausubel et al., ibid., ch. 4) comprise the hybridization of an RNA probe to a patient RNA sample, after which the reaction product (RNA-RNA hybrid) is exposed to RNase. Hybridized regions of the RNA are protected from digestion. Within PCR assays, a patient's genetic sample is incubated with a pair of polynucleotide primers, and the region between the primers is amplified and recovered. Changes in size or amount of recovered product are indicative of mutations in the patient.
Another PCR-based technique that can be employed is single strand conformational polymorphism (SSCP) analysis (Hayashi, PCR Methods and Applications 1:34-8, 1991).
Transgenic mice, engineered to express the ZTMPO-1 gene, and mice that exhibit a complete absence of ZTMPO-1 gene function, referred to as "knockout mice" (Snouwaert et al., Science 257:1083, 1992), may also be generated (Lowell et al., Nature 366:740-42, 1993). These mice may be employed to study the ZTMPO-1 gene and the protein encoded thereby in an in vivo system. Such mice could be used, for example, in breeding studies to determine the effect ZTMPO-1 has on spermatogenesis and sperm function as well as on conductivity of the heart.
For pharmaceutical use, the proteins of the present invention are formulated for parenteral, particularly intravenous or subcutaneous, delivery according to conventional methods. Intravenous administration will be by bolus injection or infusion over a typical period of one to several hours. In general, pharmaceutical formulations will include a ZTMPO-1 protein in combination with a pharmaceutically acceptable vehicle, such as saline, buffered saline, 5% dextrose in water or the like. Formulations may further include one or more excipients, preservatives, solubilizers, buffering agents, albumin to prevent protein loss on vial surfaces, etc.
Methods of formulation are well known in the art and are disclosed, for example, in Remington: The Science and Practice of Pharmacy, Gennaro, ed., Mack Publishing Co., Easton, PA, 19th ed., 1995. Determination of dose is within the level of ordinary skill in the art. The proteins may be administered for acute treatment, over one week or less, often over a period of one to three days, or may be used in chronic treatment, over several months or years. Evaluation of therapeutic effect of ZTMPO-1 for cardiac applications can be done by looking for changes in ECG. Decreases in creatine kinase levels and a decrease in weakness would serve as indicators for changes in muscle wasting associated with muscular dystrophy.
The invention is further illustrated by the following non-limiting examples.
EXAMPLES
Example 1
Isolation of ZTMPO-1
Novel ZTMPO-1 encoding polynucleotides and polypeptides of the present invention were initially identified by querying an EST database. To identify the corresponding cDNA, two clones from which an identified EST was derived that were considered likely to contain the entire human ZTMPO-1 sequence were used for sequencing.
Using a QIAwell 8 plasmid kit (Qiagen, Inc., Chatsworth, CA) according to manufacturer's instructions, a 5 ml overnight culture in LB + 50 µg/ml ampicillin was prepared. The templates were sequenced on an Applied Biosystems™ model 377 DNA sequencer (Perkin-Elmer Cetus, Norwalk, CT) using the ABI PRISM™ Dye Terminator Cycle Sequencing Ready Reaction Kit (Perkin-Elmer Corp.) according to the manufacturer's instructions.
Oligonucleotides ZC694 (SEQ ID NO:9), ZC976 (SEQ ID NO:10) and ZC447 (SEQ ID NO:14) were used as sequencing primers.
Oligonucleotides ZC15976 (SEQ ID NO:11), ZC15485 (SEQ ID NO:12), ZC15526 (SEQ ID NO:13), ZC15620 (SEQ ID NO:15) and ZC15823 (SEQ ID NO:16) were used to complete the sequence from the clones.
Sequencing reactions were carried out in a Hybaid OmniGene Temperature Cycling System (National Labnet Co., Woodbridge, NY). Sequencher™ 3.0 sequence analysis software (Gene Codes Corporation, Ann Arbor, MI) was used for data analysis. The sequences from the two clones overlapped by 740 bp and contained the 3' end of the gene and the poly A tail. A third clone prepared as described above was sequenced resulting in the remaining 5' sequence. Oligonucleotides ZC447 (SEQ ID NO:14), ZC976 (SEQ ID NO:10), ZC16162 (SEQ ID NO:17), ZC16038 (SEQ ID NO:18), ZC16249 (SEQ ID NO:19), ZC16164 (SEQ ID NO:20), ZC16163 (SEQ ID NO:21), ZC16165 (SEQ ID NO:22) and ZC16037 (SEQ ID NO:23) were used in sequencing. Differences between the original EST sequences and the final sequence of ZTMPO-1 were detected. The lack of identity arose from ambiguity in the original EST sequences.
To confirm that the polynucleotide sequence encoding the initial methionine had been identified, a nested 5' RACE (rapid amplification of cDNA ends) was performed. Several Marathon™ cDNA libraries (human prostate, spleen, testis and uterus) were prepared using a Marathon cDNA kit (Clontech) according to the manufacturer's instructions. For the first round PCR, oligonucleotides AP1 (SEQ ID NO:24, supplied with the kit or synthesized) and ZC15527 (SEQ ID NO:25) were used as primers and the 5' RACE reaction was carried out at 94°C for 2 minutes, followed by 25 cycles at 94°C for 15 seconds, 61°C for 20 seconds and 72°C for 30 seconds, followed by a 1 minute extension at 72°C. The PCR products from the first round reaction were diluted 1/100 and used as templates for a second round of PCR using oligonucleotides AP2 (SEQ ID NO:32, supplied with the Marathon Kit or synthesized) and ZC15526 (SEQ ID NO:13) as primers. The PCR derived DNA fragments were resolved by gel electrophoresis, excised and ligated into the vector pCR2.1 (TA Cloning Kit, Invitrogen Inc., San Diego, CA) according to manufacturer's instructions. The sequence of the inserts was determined by sequence analysis using oligos ZC694 (SEQ ID NO:9) and ZC695 (SEQ ID NO:26) as primers, as described above, and confirmed that the Met (amino acid residue 1 of SEQ ID NO:2) was indeed the start methionine. The resulting 2,754 bp polynucleotide (SEQ ID NO:1) had an open reading frame encoding an 876 amino acid residue protein sequence (SEQ ID NO:2) and was designated ZTMPO-1.
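The relationship between the coding region and the 876-residue protein can be checked with simple arithmetic. The sketch below, offered only as an illustration, takes the CDS coordinates given in the sequence listing for SEQ ID NO:1, (127)...(2754), confirms that the span divides evenly into codons, and translates the first 14 codons of the listing. The function names are hypothetical, and the codon table is deliberately partial: it holds only the standard-code entries needed for those 14 codons.

```python
CDS_START, CDS_END = 127, 2754  # 1-based inclusive coordinates from the CDS feature


def cds_codon_count(start, end):
    """Number of codons in an inclusive, 1-based coding interval."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3


# Partial standard genetic code: only the codons used in the example below.
PARTIAL_CODON_TABLE = {
    "atg": "Met", "aca": "Thr", "gat": "Asp", "gct": "Ala",
    "ctg": "Leu", "ttg": "Leu", "cga": "Arg", "aaa": "Lys",
    "ctt": "Leu", "aat": "Asn",
}

# First 14 codons of the coding region as they appear in the sequence listing.
FIRST_CODONS = "atg aca atg gat gct ctg ttg gct cga ttg aaa ctt ctg aat".split()


def translate(codons, table=PARTIAL_CODON_TABLE):
    """Translate a list of lowercase DNA codons into three-letter residue names."""
    return [table[codon] for codon in codons]


if __name__ == "__main__":
    print(cds_codon_count(CDS_START, CDS_END))
    print(" ".join(translate(FIRST_CODONS)))
```

The codon count excludes the stop codon, which in the listing follows immediately after position 2754.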
Example 2
Northern Blot Analysis of ZTMPO-1
Human Multiple Tissue Northern Blots (MTN I, MTN II and MTN III; Clontech) were probed to determine the tissue distribution of human ZTMPO-1 expression. An approximately 218 bp PCR derived probe (SEQ ID NO:8) was amplified using EST clone EST934031 (SEQ ID NO:27) as a template and oligonucleotides ZC15521 (SEQ ID NO:28) and ZC15525 (SEQ ID NO:29) as primers. The amplification was carried out as follows: 1 cycle at 94°C for 2 minutes, 30 cycles of 94°C for 15 seconds, 65°C for 20 seconds and 72°C for [...] seconds, followed by 1 cycle at 72°C for 1 minute. The PCR product was gel purified using the QIAquick method (Qiagen, Chatsworth, CA) and radioactively labeled using the Rediprime DNA labeling kit (Amersham, Arlington Heights, IL), both according to the manufacturer's suggestion. The probe was purified using a NUCTRAP push column (Stratagene). EXPRESSHYB (Clontech) solution was used for prehybridization and as a hybridizing solution for the Northern blots. Hybridization took place overnight at 65°C using 4 x 10⁶ cpm/ml of labeled probe.
The blots were then washed in 2X SSC and 0.05% SDS at RT, followed by washes in 0.1X SSC and 0.1% SDS at 50°C twice and at 55°C once. Two transcripts of approximately 3.2 kb and 5 kb were seen in nearly all the tissues, with the most predominant expression being in testis.
Example 3
Chromosomal Assignment and Placement of ZTMPO-1
ZTMPO-1 was mapped to chromosome 12 using the commercially available GeneBridge 4 Radiation Hybrid Panel (Research Genetics, Inc., Huntsville, AL). The GeneBridge 4 Radiation Hybrid Panel contains PCRable DNAs from each of 93 radiation hybrid clones, plus two control DNAs (the HFL donor and the A23 recipient). A publicly available WWW server (http://www-genome.wi.mit.edu/cgi-bin/contig/rhmapper.pl) allows mapping relative to the Whitehead Institute/MIT Center for Genome Research's radiation hybrid map of the human genome (the "WICGR" radiation hybrid map) which was constructed with the GeneBridge 4 Radiation Hybrid Panel.
For the mapping of ZTMPO-1 with the GeneBridge 4 RH Panel, 20 µl reactions were set up in a 96-well microtiter plate (Stratagene, La Jolla, CA) and used in a RoboCycler Gradient 96 thermal cycler (Stratagene). Each of the 95 PCR reactions consisted of 2 µl 10X KlenTaq PCR reaction buffer (CLONTECH Laboratories, Inc., Palo Alto, CA), 1.6 µl dNTPs mix (2.5 mM each, PERKIN-ELMER, Foster City, CA), 1 µl sense primer, ZC15487 (SEQ ID NO:6), 1 µl antisense primer, ZC15486 (SEQ ID NO:7), 2 µl RediLoad (Research Genetics, Inc.), 0.4 µl 50X Advantage KlenTaq Polymerase Mix (Clontech Laboratories, Inc.), 25 ng of DNA from an individual hybrid clone or control and ddH2O for a total volume of 20 µl. The reactions were overlaid with an equal amount of mineral oil and sealed. The PCR cycler conditions were as follows: an initial 1 cycle 5 minute denaturation at 95°C, 35 cycles of a 1 minute denaturation at 95°C, 1 minute annealing at 62°C and 1.5 minute extension at 72°C, followed by a final 1 cycle extension of 7 minutes at 72°C. The reactions were separated by electrophoresis on a 2% agarose gel (Life Technologies, Gaithersburg, MD).
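The per-reaction volumes listed above can be tallied to see how much of the 20 µl is left for template DNA and water, and scaled up for a plate of 95 reactions. The following Python sketch does that bookkeeping. The idea of pooling the shared reagents into a master mix, the overage fraction and the function names are assumptions added for illustration; the specification gives only the per-reaction composition.

```python
# Per-reaction volumes in microliters, as listed in the text above.
PER_REACTION_UL = {
    "10X KlenTaq PCR reaction buffer": 2.0,
    "dNTPs mix (2.5 mM each)": 1.6,
    "sense primer": 1.0,
    "antisense primer": 1.0,
    "RediLoad": 2.0,
    "50X Advantage KlenTaq Polymerase Mix": 0.4,
}
TOTAL_VOLUME_UL = 20.0


def remaining_volume(per_reaction=PER_REACTION_UL, total=TOTAL_VOLUME_UL):
    """Volume per reaction left for the 25 ng of template DNA plus ddH2O."""
    return total - sum(per_reaction.values())


def master_mix(n_reactions, overage=0.10, per_reaction=PER_REACTION_UL):
    """Scale each shared reagent to n_reactions plus a fractional overage.

    The overage (extra volume to cover pipetting loss) is an assumed
    convenience and is not part of the described protocol.
    """
    factor = n_reactions * (1.0 + overage)
    return {name: volume * factor for name, volume in per_reaction.items()}


if __name__ == "__main__":
    print(f"DNA + ddH2O per reaction: {remaining_volume():.1f} ul")
    for name, volume in master_mix(95, overage=0.0).items():
        print(f"{name}: {volume:.1f} ul")
```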
The results showed that ZTMPO-1 maps 636.18 cR_3000 from the top of the human chromosome 12 linkage group on the WICGR radiation hybrid map. The proximal framework marker was D12S367. This positions ZTMPO-1 in the 12q24.33 region on the integrated LDB chromosome 12 map (The Genetic Location Database, University of Southampton, WWW server: http://cedar.genetics.soton.ac.uk/public_html/).
From the foregoing, it will be appreciated that, although specific embodiments of the invention have been described herein for purposes of illustration, various modifications may be made without deviating from the spirit and scope of the invention. Accordingly, the invention is not limited except as by the appended claims.
SEQUENCE LISTING
<110> ZymoGenetics, Inc.
      1201 Eastlake Avenue East
      Seattle, Washington 98102
      United States of America
<120> SOLUBLE PROTEIN ZTMPO-1
<130> 97-67PC
<150> 60/082,513
<151> 1998-04-21
<160> 32
<170> FastSEQ for Windows Version 3.0
<210> 1
<211> 2884
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (127)...(2754)
<400> 1
aaagttttta atgaaagaaa cagaaactga tgccattata taatgaaccc tagtacccat 60
cacccagctt cagcaggtgt tagtattttg tgactctttg atttttttgt cttgggccta 120
ggtgaa atg aca atg gat gct ctg ttg gct cga ttg aaa ctt ctg aat 168
       Met Thr Met Asp Ala Leu Leu Ala Arg Leu Lys Leu Leu Asn
       1               5                   10
cca gat gac ctt aga gaa gaa atc gtc aaa gcc gga ttg aaa tgt gga 216
Pro Asp Asp Leu Arg Glu Glu Ile Val Lys Ala Gly Leu Lys Cys Gly
ccc att aca tca act aca agg ttc att ttt gag aaa aaa ttg gct cag 264
Pro Ile Thr Ser Thr Thr Arg Phe Ile Phe Glu Lys Lys Leu Ala Gln
gct tta ctg gag caa gga gga agg ctg tct tct ttc tac cac cat gag 312
Ala Leu Leu Glu Gln Gly Gly Arg Leu Ser Ser Phe Tyr His His Glu
gca ggt gtc aca gct ctc agc cag gac cca caa agg att ttg aag cca 360
Ala Gly Val Thr Ala Leu Ser Gln Asp Pro Gln Arg Ile Leu Lys Pro
gct gaa ggg aac cca act gat cag gct ggt ttt tct gaa gac aga gat 408
Ala Glu Gly Asn Pro Thr Asp Gln Ala Gly Phe Ser Glu Asp Arg Asp
ttt ggt tac agt gtg ggc ctg aat cct cca gag gag gaa gct gtg aca 456
Phe Gly Tyr Ser Val Gly Leu Asn Pro Pro Glu Glu Glu Ala Val Thr
tcc aag acc tgc tcg gtg ccc cct agt gac acc gac acc tac aga gct 504
Ser Lys Thr Cys Ser Val Pro Pro Ser Asp Thr Asp Thr Tyr Arg Ala
gga gcg act gcg tct aag gag ccg ccc ctg tac tat ggg gtg tgt cca 552
Gly Ala Thr Ala Ser Lys Glu Pro Pro Leu Tyr Tyr Gly Val Cys Pro
gtg tat gag gac gtc cca gcg aga aat gaa agg atc tat gtt tat gaa 600
Val Tyr Glu Asp Val Pro Ala Arg Asn Glu Arg Ile Tyr Val Tyr Glu
aat aaa aag gaa gca ttg caa gct gtc aag atg atc aaa ggg tcc cga 648
Asn Lys Lys Glu Ala Leu Gln Ala Val Lys Met Ile Lys Gly Ser Arg
ttt aaa gct ttt tct acc aga gaa gac gct gag aaa ttt gct aga gga 696
Phe Lys Ala Phe Ser Thr Arg Glu Asp Ala Glu Lys Phe Ala Arg Gly
att tgt gat tat ttc cct tct cca agc aaa acg tcc tta cca ctg tct 744
Ile Cys Asp Tyr Phe Pro Ser Pro Ser Lys Thr Ser Leu Pro Leu Ser
cct gtg aaa aca gct cca ctc ttt agc aat gac agg ttg aaa gat ggt 792
Pro Val Lys Thr Ala Pro Leu Phe Ser Asn Asp Arg Leu Lys Asp Gly
ttg tgc ttg tcg gaa tca gaa aca gtc aac aaa gag cga gcg aac agt 840
Leu Cys Leu Ser Glu Ser Glu Thr Val Asn Lys Glu Arg Ala Asn Ser
tac aaa aat ccc cgc acg cag gac ctc acc gcc aag ctt cgg aaa gct 888
Tyr Lys Asn Pro Arg Thr Gln Asp Leu Thr Ala Lys Leu Arg Lys Ala
gtg gag aag gga gag gag gac acc ttt tct gac ctt atc tgg agc aac 936
Val Glu Lys Gly Glu Glu Asp Thr Phe Ser Asp Leu Ile Trp Ser Asn
ccc cgg tat ctg ata ggc tca gga gac aac ccc act atc gtg cag gaa 984
Pro Arg Tyr Leu Ile Gly Ser Gly Asp Asn Pro Thr Ile Val Gln Glu
ggg tgc agg tac aac gtg atg cat gtt gct gcc aaa gag aac cag gct 1032
Gly Cys Arg Tyr Asn Val Met His Val Ala Ala Lys Glu Asn Gln Ala
tcc atc tgc cag ctg act ctg gac gtc ctg gag aac cct gac ttc atg 1080
Ser Ile Cys Gln Leu Thr Leu Asp Val Leu Glu Asn Pro Asp Phe Met
agg ctg atg tac cct gat gac gac gag gcc atg ctg cag aag cgt atc 1128
Arg Leu Met Tyr Pro Asp Asp Asp Glu Ala Met Leu Gln Lys Arg Ile
cgt tac gtg gtg gac ctg tac ctc aac acc ccc gac aag atg ggc tat 1176
Arg Tyr Val Val Asp Leu Tyr Leu Asn Thr Pro Asp Lys Met Gly Tyr
gac aca ccg ttg cat ttt gct tgt aag ttt gga aat gca gat gta gtc 1224
Asp Thr Pro Leu His Phe Ala Cys Lys Phe Gly Asn Ala Asp Val Val
aac gtg ctt tcg tca cac cat ttg att gta aaa aac tca agg aat aaa 1272
Asn Val Leu Ser Ser His His Leu Ile Val Lys Asn Ser Arg Asn Lys
tat gat aaa aca cct gaa gat gta att tgt gaa aga agc aaa aat aaa 1320
Tyr Asp Lys Thr Pro Glu Asp Val Ile Cys Glu Arg Ser Lys Asn Lys
tct gtg gaa ctg aag gag cgg atc aga gag tat tta aag ggc cac tac 1368
Ser Val Glu Leu Lys Glu Arg Ile Arg Glu Tyr Leu Lys Gly His Tyr
tac gtg ccc ctc ctg aga gcg gaa gag act tct tct cca gtc atc ggg 1416
Tyr Val Pro Leu Leu Arg Ala Glu Glu Thr Ser Ser Pro Val Ile Gly
gag ctg tgg tcc cca gac cag acg gct gag gcc tct cac gtc agc cgc 1464
Glu Leu Trp Ser Pro Asp Gln Thr Ala Glu Ala Ser His Val Ser Arg
tat gga ggc agc ccc aga gac ccg gta ctg acc ctg aga gcc ttc gca 1512
Tyr Gly Gly Ser Pro Arg Asp Pro Val Leu Thr Leu Arg Ala Phe Ala
ggg ccc ctg agt cca gcc aag gca gaa gat ttt cgc aag ctc tgg aaa 1560
Gly Pro Leu Ser Pro Ala Lys Ala Glu Asp Phe Arg Lys Leu Trp Lys
act cca cct cga gag aaa gca ggc ttc ctt cac cac gtc aag aag tcg 1608
Thr Pro Pro Arg Glu Lys Ala Gly Phe Leu His His Val Lys Lys Ser
gac ccg gaa aga ggc ttt gag aga gtg gga agg gag cta gct cat gag 1656
Asp Pro Glu Arg Gly Phe Glu Arg Val Gly Arg Glu Leu Ala His Glu
ctg ggg tat ccc tgg gtt gaa tac tgg gaa ttt ctg ggc tgt ttt gtt 1704
Leu Gly Tyr Pro Trp Val Glu Tyr Trp Glu Phe Leu Gly Cys Phe Val
gat ctg tct tcc cag gaa ggc ctg caa aga cta gaa gaa tat ctc aca 1752
Asp Leu Ser Ser Gln Glu Gly Leu Gln Arg Leu Glu Glu Tyr Leu Thr
cag cag gaa ata ggc aaa aag gct caa caa gaa aca gga gaa cgg gaa 1800
Gln Gln Glu Ile Gly Lys Lys Ala Gln Gln Glu Thr Gly Glu Arg Glu
gcc tcc tgc cga gat aaa gcc acc acg tct ggc agc aat tcc att tcc 1848
Ala Ser Cys Arg Asp Lys Ala Thr Thr Ser Gly Ser Asn Ser Ile Ser
gtg agg gcg ttt cta gat gaa gat gac atg agc ttg gaa gaa ata aaa 1896
Val Arg Ala Phe Leu Asp Glu Asp Asp Met Ser Leu Glu Glu Ile Lys
aat cgg caa aat gca gct cga aat aac agc ccg ccc aca gtc ggt gct 1944
Asn Arg Gln Asn Ala Ala Arg Asn Asn Ser Pro Pro Thr Val Gly Ala
ttt gga cat acg agg tgc agc gcc ttc ccc ttg gag cag gag gca gac 1992
Phe Gly His Thr Arg Cys Ser Ala Phe Pro Leu Glu Gln Glu Ala Asp
ctc ata gaa gcc gcc gag ccg gga ggt cca cac agc agc aga aat ggg 2040
Leu Ile Glu Ala Ala Glu Pro Gly Gly Pro His Ser Ser Arg Asn Gly
ctc tgc cat cct ctg aat cac agc agg acc ctg gcg ggc aag aga cca 2088
Leu Cys His Pro Leu Asn His Ser Arg Thr Leu Ala Gly Lys Arg Pro
aag gcc ccc cat ggg gag gaa gcc cat ctg cca cct gtc tcg gat ttg 2136
Lys Ala Pro His Gly Glu Glu Ala His Leu Pro Pro Val Ser Asp Leu
act gtt gag ttt gat aaa ctg aat ttg caa aat ata gga cgt agc gtt 2184
Thr Val Glu Phe Asp Lys Leu Asn Leu Gln Asn Ile Gly Arg Ser Val
tcc aag aca cca gat gaa agt aca aaa act aaa gat cag atc ctg act 2232
Ser Lys Thr Pro Asp Glu Ser Thr Lys Thr Lys Asp Gln Ile Leu Thr
tca aga atc aat gca gta gaa aga gac ttg tta gag cct tct ccc gca 2280
Ser Arg Ile Asn Ala Val Glu Arg Asp Leu Leu Glu Pro Ser Pro Ala
gac caa ctc ggg aat ggc cac agg agg aca gaa agt gaa atg tca gcc 2328
Asp Gln Leu Gly Asn Gly His Arg Arg Thr Glu Ser Glu Met Ser Ala
agg atc gct aaa atg tcc ttg agt ccc agc agc ccc agg cac gag gat 2376
Arg Ile Ala Lys Met Ser Leu Ser Pro Ser Ser Pro Arg His Glu Asp
cag ctc gag gtc acc agg gaa ccg gcc agg cgg ctc ttc ctt ttt gga 2424
Gln Leu Glu Val Thr Arg Glu Pro Ala Arg Arg Leu Phe Leu Phe Gly
gag gag cca tca aaa ctc gat cag gat gtt ttg gcc gct ctt gaa tgt 2472
Glu Glu Pro Ser Lys Leu Asp Gln Asp Val Leu Ala Ala Leu Glu Cys
gca gac gtc gac ccc cat cag ttc ccg gcc gtg cac aga tgg aag agt 2520
Ala Asp Val Asp Pro His Gln Phe Pro Ala Val His Arg Trp Lys Ser
gct gtc ctg tgc tac tca ccc tcg gac aga cag agt tgg ccc agt ccc 2568
Ala Val Leu Cys Tyr Ser Pro Ser Asp Arg Gln Ser Trp Pro Ser Pro
gcg gtg aaa gga agg ttc aag tct cag ctg cca gat ctc agt ggc cct 2616
Ala Val Lys Gly Arg Phe Lys Ser Gln Leu Pro Asp Leu Ser Gly Pro
cac agc tac agt ccg ggg aga aac agc gtg gct gga agc aac ccc gca 2664
His Ser Tyr Ser Pro Gly Arg Asn Ser Val Ala Gly Ser Asn Pro Ala
aag cca ggc ctg ggc agt cct ggg cgc tac agc ccc gtg cac ggg agc 2712
Lys Pro Gly Leu Gly Ser Pro Gly Arg Tyr Ser Pro Val His Gly Ser
cag ctc cgc agg atg gcg cgc ctg gct gag ctt gcc gcc ctg 2754
Gln Leu Arg Arg Met Ala Arg Leu Ala Glu Leu Ala Ala Leu
taggcttggc gctgggctct cggtttgttc ttcattttta aagaaggaag ggtcatatgt 2814
ttattgctaa actgtcaaaa aggaatatat tctgattaaa ttattactcc tcaaaaaaaa 2874
aaaaaaaaaa 2884
<210> 2
<211> 876
<212> PRT
<213> Homo sapiens
<400> 2
Met Thr Met Asp Ala Leu Leu Ala Arg Leu Lys Leu Leu Asn Pro Asp Asp Leu Arg Glu Glu Ile Val Lys Ala Gly Leu Lys Cys Gly Pro Ile Thr Ser Thr Thr Arg Phe Ile Phe Glu Lys Lys Leu Ala Gln Ala Leu Leu Glu Gln Gly Gly Arg Leu Ser Ser Phe Tyr His His Glu Ala Gly Val Thr Ala Leu Ser Gln Asp Pro Gln Arg Ile Leu Lys Pro Ala Glu Gly Asn Pro Thr Asp Gln Ala Gly Phe Ser Glu Asp Arg Asp Phe Gly Tyr Ser Val Gly Leu Asn Pro Pro Glu Glu Glu Ala Val Thr Ser Lys Thr Cys Ser Val Pro Pro Ser Asp Thr Asp Thr Tyr Arg Ala Gly Ala Thr Ala Ser Lys Glu Pro Pro Leu Tyr Tyr Gly Val Cys Pro Val Tyr Glu Asp Val Pro Ala Arg Asn Glu Arg Ile Tyr Val Tyr Glu Asn Lys Lys Glu Ala Leu Gln Ala Val Lys Met Ile Lys Gly Ser Arg Phe Lys Ala Phe Ser Thr Arg Glu Asp Ala Glu Lys Phe Ala Arg Gly Ile Cys Asp Tyr Phe Pro Ser Pro Ser Lys Thr Ser Leu Pro Leu Ser Pro Val Lys Thr Ala Pro Leu Phe Ser Asn Asp Arg Leu Lys Asp Gly Leu Cys Leu Ser Glu Ser Glu Thr Val Asn Lys Glu Arg Ala Asn Ser Tyr Lys Asn Pro Arg Thr Gln Asp Leu Thr Ala Lys Leu Arg Lys Ala Val Glu Lys Gly Glu Glu Asp Thr Phe Ser Asp Leu Ile Trp Ser Asn Pro Arg Tyr Leu Ile Gly Ser Gly Asp Asn Pro Thr Ile Val Gln Glu Gly Cys Arg Tyr Asn Val Met His Val Ala Ala Lys Glu Asn Gln Ala Ser Ile Cys Gln Leu Thr Leu Asp Val Leu Glu Asn Pro Asp Phe Met Arg Leu Met Tyr Pro Asp Asp Asp Glu Ala Met Leu Gln Lys Arg Ile Arg Tyr Val Val Asp Leu Tyr Leu Asn Thr Pro Asp Lys Met Gly Tyr Asp Thr Pro Leu His Phe Ala Cys Lys Phe Gly Asn Ala Asp Val Val Asn Val Leu Ser Ser His His Leu Ile Val Lys Asn Ser Arg Asn Lys Tyr Asp Lys Thr Pro Glu Asp Val Ile Cys Glu Arg Ser Lys Asn Lys Ser Val Glu Leu Lys Glu Arg Ile Arg Glu Tyr Leu Lys Gly His Tyr Tyr Val Pro Leu Leu Arg Ala Glu Glu Thr Ser Ser Pro Val Ile Gly Glu Leu Trp Ser Pro Asp Gln Thr Ala Glu Ala Ser His Val Ser Arg Tyr Gly Gly Ser Pro Arg Asp Pro Val Leu Thr Leu Arg Ala Phe Ala Gly Pro Leu Ser Pro Ala Lys Ala Glu Asp Phe Arg Lys Leu Trp Lys Thr Pro Pro Arg Glu Lys Ala Gly Phe Leu His His Val Lys Lys
Ser Asp Pro Glu Arg Gly Phe Glu Arg Val Gly Arg Glu Leu Ala His Glu Leu Gly Tyr Pro Trp Val Glu Tyr Trp Glu Phe Leu Gly Cys Phe Val Asp Leu Ser Ser Gln Glu Gly Leu Gln Arg Leu Glu Glu Tyr Leu Thr Gln Gln Glu Ile Gly Lys Lys Ala Gln Gln Glu Thr Gly Glu Arg Glu Ala Ser Cys Arg Asp Lys Ala Thr Thr Ser Gly Ser Asn Ser Ile Ser Val Arg Ala Phe Leu Asp Glu Asp Asp Met Ser Leu Glu Glu Ile Lys Asn Arg Gln Asn Ala Ala Arg Asn Asn Ser Pro Pro Thr Val Gly Ala Phe Gly His Thr Arg Cys Ser Ala Phe Pro Leu Glu Gln Glu Ala Asp Leu Ile Glu Ala Ala Glu Pro Gly Gly Pro His Ser Ser Arg Asn Gly Leu Cys His Pro Leu Asn His Ser Arg Thr Leu Ala Gly Lys Arg Pro Lys Ala Pro His Gly Glu Glu Ala His Leu Pro Pro Val Ser Asp Leu Thr Val Glu Phe Asp Lys Leu Asn Leu Gln Asn Ile Gly Arg Ser Ual Ser Lys Thr Pro Asp Glu Ser Thr Lys Thr Lys Asp Gln Ile Leu Thr Ser Arg Ile Asn Ala Val Glu Arg Asp Leu Leu Glu Pro Ser Pro Ala Asp Gln Leu Gly Asn Gly His Arg Arg Thr Glu Ser Glu Met Ser Ala Arg Ile Ala Lys Met Ser Leu Ser Pro Ser Ser Pro Arg His Glu Asp Gln Leu Glu Val Thr Arg Glu Pro Ala Arg Arg Leu Phe Leu Phe Gly Glu Glu Pro Ser Lys Leu Asp Gln Asp Val Leu Ala Ala Leu Glu Cys Ala Asp WO 99/544b8 PCT/US99/08601 Ual Asp Pro His Gln Phe Pro Ala Ual His Arg Trp Lys Ser Ala Val Leu Cys Tyr Ser Pro Ser Asp Arg Gln Ser Trp Pro Ser Pro Ala Val Lys Gly Arg Phe Lys Ser Gln Leu Pro Asp Leu Ser Gly Pro His Ser Tyr Ser Pro Gly Arg Asn Ser Val Ala Gly Ser Asn Pro Ala Lys Pro Gly Leu Gly Ser Pro Gly Arg Tyr Ser Pro Val His Gly Ser Gln Leu Arg Arg Met Ala Arg Leu Ala Glu Leu Ala Ala Leu <210> 3 <211> 254 <212> PRT
<213> Homo sapiens <400> 3 Met Asp Asn Tyr Ala Asp Leu Ser Asp Thr Glu Leu Thr Thr Leu Leu Arg Arg Tyr Asn Ile Pro His Gly Pro Val Val Gly Ser Thr Arg Arg Leu Tyr Glu Lys Lys Ile Phe Glu Tyr Glu Thr Gln Arg Arg Arg Leu Ser Pro Pro Ser Ser Ser Ala Ala Ser Ser Tyr Ser Phe Ser Asp Leu Asn Ser Thr Arg Gly Asp Ala Asp Met Tyr Asp Leu Pro Lys Lys Glu Asp Ala Leu Leu Tyr Gln Ser Lys Gly Tyr Asn Asp Asp Tyr Tyr Glu Glu Ser Tyr Phe Thr Thr Arg Thr Tyr Gly Glu Pro Glu Ser Ala Gly Pro Ser Arg Ala Ual Arg Gln Ser Val Thr Ser Phe Pro Asp Ala Asp Ala Phe His His Gln Val His Asp Asp Asp Leu Leu Ser Ser Ser Glu Glu Glu Cys Lys Asp Arg Glu Arg Pro Met Tyr Gly Arg Asp Ser Ala Tyr Gln Ser Ile Thr His Tyr Arg Pro Val Ser Ala Ser Arg Ser Ser Leu Asp Leu Ser Tyr Tyr Pro Thr Ser Ser Ser Thr Ser Phe Met Ser Ser Ser Ser Ser Ser Ser Ser Trp Leu Thr Arg Arg Ala Ile Arg Pro Glu Asn Arg Ala Pro Gly Ala Gly Leu Gly Gln Asp Arg Gln Val Pro Leu Trp Gly Gln Leu Leu Leu Phe Leu Val Phe Val Ile Val Leu Phe Phe Ile Tyr His Phe Met Gln Ala Glu Glu Gly Asn Pro Phe <210>4 <211>694 <212>PRT
<213>Homo Sapiens <400> 4 Met Pro Glu Phe Leu Glu Asp Pro Ser Val Leu Thr Lys Asp Lys Leu Lys Ser Glu Leu Val Ala Asn Asn Val Thr Leu Pro Ala Gly Glu Gln Arg Lys Asp Val Tyr Val Gln Leu Tyr Leu Gln His Leu Thr Ala Arg Asn Arg Pro Pro Leu Pro Ala Gly Thr Asn Ser Lys Gly Pro Pro Asp Phe Ser Ser Asp Glu Glu Arg Glu Pro Thr Pro Val Leu Gly Ser Gly Ala Ala Ala Ala Gly Arg Ser Arg Ala Ala Val Gly Arg Lys Ala Thr Lys Lys Thr Asp Lys Pro Arg Gln Glu Asp Lys Asp Asp Leu Asp Val Thr Glu Leu Thr Asn Glu Asp Leu Leu Asp Gln Leu Ual Lys Tyr Gly Val Asn Pro Gly Pro Ile Val Gly Thr Thr Arg Lys Leu Tyr Glu Lys Lys Leu Leu Lys Leu Arg Glu Gln Gly Thr Glu Ser Arg Ser Ser Thr Pro Leu Pro Thr Ile Ser Ser Ser Ala Glu Asn Thr Arg Gln Asn Gly Ser Asn Asp Ser Asp Arg Tyr Ser Asp Asn Glu Glu Gly Lys Lys Lys Glu His Lys Lys Val Lys Ser Thr Arg Asp Ile Val Pro Phe Ser Glu Leu Gly Thr Thr Pro Ser Gly Gly Gly Phe Phe Gln Gly Ile Ser Phe Pro Glu Ile Ser Thr Arg Pro Pro Leu Gly Ser Thr Glu Leu Gln Ala Ala Lys Lys Val His Thr Ser Lys Gly Asp Leu Pro Arg Glu Pro Leu Val Ala Thr Asn Leu Pro Gly Arg Gly Gln Leu Gln Lys Leu Ala Ser Glu Arg Asn Leu Phe Ile Ser Cys Lys Ser Ser His Asp Arg Cys Leu Glu Lys Ser Ser Ser Ser Ser Ser Gln Pro Glu His Ser Ala Met Leu Val Ser Thr Ala Ala Ser Pro Ser Leu Ile Lys Glu Thr Thr Thr Gly Tyr Tyr Lys Asp Ile Val Glu Asn Ile Cys Gly Arg Glu Lys Ser Gly Ile Gln Pro Leu Cys Pro Glu Arg Ser His Ile Ser Asp Gln Ser Pro Leu Ser Ser Lys Arg Lys Ala Leu Glu Glu Ser Glu Ser Ser Gln Leu Ile Ser Pro Pro Leu Ala Gln Ala Ile Arg Asp Tyr Val Asn Ser Leu Leu Val Gln Gly Gly Ual Gly Ser Leu Pro Gly Thr Ser Asn Ser Met Pro Pro Leu Asp Val Glu Asn Ile Gln Lys Arg Ile Asp Gln Ser Lys Phe Gln Glu Thr G1u Phe Leu Ser Pro Pro Arg Lys Ual Pro Arg Leu Ser Glu Lys Ser Val Glu Glu Arg Asp Ser Gly Ser Phe Val Ala Phe Gln Asn Ile Pro Gly Ser Glu Leu Met Ser Ser Phe Ala Lys Thr Val Val Ser His Ser Leu Thr Thr Leu Gly Leu Glu Ual Ala Lys Gln Ser Gln His Asp Lys Ile Asp Ala Ser Glu Leu Ser Phe Pro 
Phe His Glu Ser Ile Leu Lys Val Ile Glu Glu Glu Trp Gln Gln Val Asp Arg Gln Leu Pro Ser Leu Ala Cys Lys Tyr Pro Val Ser Ser Arg Glu Ala Thr Gln Ile Leu Ser Ual Pro Lys Val Asp Asp Glu Ile Leu Gly Phe Ile Ser Glu Ala Thr Pro Leu Gly Gly Ile Gln Ala Ala Ser Thr Glu Ser Cys Asn Gln Gln Leu Asp Leu Ala Leu Cys Arg Ala Tyr Glu Ala Ala Ala Ser Ala Leu Gln Ile Ala Thr His Thr Ala Phe Ual Ala Lys Ala Met Gln Ala Asp Ile Ser Gln Ala A1a Gln Ile Leu Ser Ser Asp Pro Ser Arg Thr His Gln Ala Leu Gly Ile Leu Ser Lys Thr Tyr Asp Ala Ala Ser Tyr Ile Cys Glu Ala Ala Phe Asp Glu Val Lys Met Ala Ala His Thr Met Gly Asn Ala Thr Val Gly Arg Arg Tyr Leu Trp Leu Lys Asp Cys Lys Ile Asn Leu Ala Ser Lys Asn Lys Leu Ala Ser Thr Pro Phe Lys Giy Gly Thr Leu Phe Gly Gly Glu Val Cys Lys Val Ile Lys Lys Arg Gly Asn Lys His <210> 5 <211> 2628 <212> DNA
<213> Artificial Sequence <220>
<223> Degenerate nucleotide sequence encoding the polypeptide of SEQ ID NO:2 <221> variation <222> (1)...(2628) <223> Each N is independently any one of A, T, G or C.
<400> 5 atgacnatgg aygcnytnytngcnmgnytnaarytnytnaayccngaygayytnmgngar 60 garathgtna argcnggnytnaartgyggnccnathacnwsnacnacnmgnttyathtty 120 garaaraary tngcncargcnytnytngarcarggnggnmgnytnwsnwsnttytaycay 180 caygargcng gngtnacngcnytnwsncargayccncarmgnathytnaarccngcngar 240 ggnaayccna cngaycargcnggnttywsngargaymgngayttyggntaywsngtnggn 300 ytnaayccnc cngargargargcngtnacnwsnaaracntgywsngtnccnccnwsngay 360 acngayacnt aymgngcnggngcnacngcnwsnaargarccnccnytntaytayggngtn 420 tgyccngtnt aygargaygtnccngcnmgnaaygarmgnathtaygtntaygaraayaar 480 aargargcny tncargcngtnaaratgathaarggnwsnmgnttyaargcnttywsnacn 540 mgngargayg cngaraarttygcnmgnggnathtgygaytayttyccnwsnccnwsnaar 600 acnwsnytnc cnytnwsnccngtnaaracngcnccnytnttywsnaaygaymgnytnaar 660 gayggnytnt gyytnwsngarwsngaracngtnaayaargarmgngcnaaywsntayaar 720 aayccnmgna cncargayytnacngcnaarytnmgnaargcngtngaraarggngargar 780 gayacnttyw sngayytnathtggwsnaayccnmgntayytnathggnwsnggngayaay 840 ccnacnathg tncargarggntgymgntayaaygtnatgcaygtngcngcnaargaraay 900 cargcnwsna thtgycarytnacnytngaygtnytngaraayccngayttyatgmgnytn 960 atgtayccng aygaygaygargcnatgytncaraarmgnathmgntaygtngtngayytn 1020 tayytnaaya cnccngayaaratgggntaygayacnccnytncayttygcntgyaartty 1080 ggnaaygcng aygtngtnaaygtnytnwsnwsncaycayytnathgtnaaraaywsnmgn 1140 aayaartaygayaaracnccngargaygtnathtgygarmgnwsnaaraayaarwsngtn 1200 garytnaargarmgnathmgngartayytnaarggncaytaytaygtnccnytnytnmgn 1260 gcngargaracnwsnwsnccngtnathggngarytntggwsnccngaycaracngcngar 1320 gcnwsncaygtnwsnmgntayggnggnwsnccnmgngayccngtnytnacnytnmgngcn 1380 ttygcnggnccnytnwsnccngcnaargcngargayttymgnaarytntggaaracnccn 1440 ccnmgngaraargcnggnttyytncaycaygtnaaraarwsngayccngarmgnggntty 1500 garmgngtnggnmgngarytngcncaygarytnggntayccntgggtngartaytgggar 1560 ttyytnggntgyttygtngayytnwsnwsncargarggnytncarmgnytngargartay 1620 ytnacncarcargarathggnaaraargcncarcargaracnggngarmgngargcnwsn 1680 tgymgngayaargcnacnacnwsnggnwsnaaywsnathwsngtnmgngcnttyytngay 1740 gargaygayatgwsnytngargarathaaraaymgncaraaygcngcnmgnaayaaywsn 1800 
ccnccnacngtnggngcnttyggncayacnmgntgywsngcnttyccnytngarcargar 1860 gcngayytnathgargcngcngarccnggnggnccncaywsnwsnmgnaayggnytntgy 1920 cayccnytnaaycaywsnmgnacnytngcnggnaarmgnccnaargcnccncayggngar 1980 gargcncayytnccnccngtnwsngayytnacngtngarttygayaarytnaayytncar 2040 aayathggnmgnwsngtnwsnaaracnccngaygarwsnacnaaracnaargaycarath 2100 ytnacnwsnmgnathaaygcngtngarmgngayytnytngarccnwsnccngcngaycar 2160 ytnggnaayggncaymgnmgnacngarwsngaratgwsngcnmgnathgcnaaratgwsn 2220 ytnwsnccnwsnwsnccnmgncaygargaycarytngargtnacnmgngarccngcnmgn 2280 mgnytnttyytnttyggngargarccnwsnaarytngaycargaygtnytngcngcnytn 2340 gartgygcngaygtngayccncaycarttyccngcngtncaymgntggaarwsngcngtn 2400 ytntgytaywsnccnwsngaymgncarwsntggccnwsnccngcngtnaarggnmgntty 2460 aarwsncarytnccngayytnwsnggnccncaywsntaywsnccnggnmgnaaywsngtn 2520 gcnggnwsnaayccngcnaarccnggnytnggnwsnccnggnmgntaywsnccngtncay 2580 ggnwsncarytnmgnmgnatggcnmgnytngcngarytngcngcnytn 2628 <210> 6 <211> 18 <2I2> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC15487 <400> 6 ggacccatta catcaact 18 <210> 7 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC15486 <400> 7 cctccttgct ccagtaaa 18 <210> 8 <211> 218 <212> DNA
<213> Artificial Sequence <220>
<223> Northern Blot probe <400> 8 ctcaggcttt actggagcaa ggaggaaggc tgtcttcttt ctaccaccat gaggcaggtg 60 tcacagctct cagccaggac ccacaaagga ttttgaagcc agctgaaggg aacccaactg 120 atcaggctgg tttttctgaa gacagagatt ttggttacag tgtgggcctg aatcctccag 180 aggaggaagc tgtgacatcc aagacctgct cggtgccc 218 <210> 9 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> ZC694 <400> 9 taatacgact cactatag 18 <210> 10 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC976 <400> 10 cgttgtaaaa cgacggcc 18 <210> 11 <211> 22 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC15976 <400> 11 cagctctgta ggtgtcggtg tc 22 <210> 12 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC15485 <400> 12 caccgacacc tacagagc 18 <210> 13 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> ZC15526 <400> 13 tgctccagta aagcctgagc caatt 25 <210> 14 <211> 17 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC447 <400> 14 taacaatttc acacagg 17 <210> 15 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC15620 <400> 15 acagagctgg agcgactgcg 20 <210> 16 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC15823 <400> 16 tctctttggc agcaacatgc 20 <210> 17 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC16162 <400> 17 gtgcaggtac aacgtgatgc 20 <210> 18 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC16035 <400> 18 ctgacttcat gaggctgatg 20 <210> 19 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC16249 <400> 19 cagggtacat cagcctcatg 20 <210> 20 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC16164 <400> 20 tctgtcttcc caggaaggcc 20 <210> 21 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC16163 <400> 21 ggaattgctg ccagacgtgg 20 <210> 22 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC16165 <400> 22 agagccttct cccgcagacc 20 <210> 23 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide 16037 <400> 23 ggctgctggg actcaaggac 20 <210> 24 <211> 27 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide AP1 <400> 24 ccatcctaat acgactcact atagggc 27 <210> 25 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC15527 <400> 25 ctcatggtgg tagaaagaag acagc 25 <210> 26 <211> 19 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC695 <400> 26 gatttaggtg acactatag 19 <210> 27 <211> 424 <212> DNA
<213> Artificial Sequence <220>
<223> EST934031 <400> 27 gctcgattga aacttctgaa tccagatgac cttagagaag aaatcgtcaa agccggattg 60 aaatgtggacccattacatcaactacaaggttcatttttgagaaaaaattggctcaggct 120 ttactggagcaaggaggaaggctgtcttctttctaccaccatgaggcaggtgtcacagct 180 ctcagccaggacccacaaaggattttgaagccagctgaagggaacccaactgatcaggct 240 ggtttttctgaagacagagattttggttacagtgtgggcctgaatcctccagaggaggaa 300 gctgtgacatccaagacctgctcggtgccccctagtgacaccgacacctacagagctgga 360 gcgactgcgtctataggagccgccccctgtactatgngggtgtgtccagttgtatgagga 420 cgtc 424 <210> 28 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC15521 <400> 28 gggcaccgag caggtcttgg atgt 24 <210> 29 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC15525 <400> 29 ctcaggcttt actggagcaa ggagg 25 <210>30 <211>454 <212>PRT
<213>Homo Sapiens <400> 30 Met Pro Glu Phe Leu Glu Asp Pro Ser Val Leu Thr Lys Asp Lys Leu Lys Ser Glu Leu Val Ala Asn Asn Val Thr Leu Pro Ala Gly Glu Gln Arg Lys Asp Val Tyr Val Gln Leu Tyr Leu Gln His Leu Thr Ala Arg Asn Arg Pro Pro Leu Pro Ala Gly Thr Asn Ser Lys Gly Pro Pro Asp Phe Ser Ser Asp Glu Glu Arg Glu Pro Thr Pro Val Leu Giy Ser Gly Ala Ala AlaAiaGly ArgSerArg AlaAla UalGlyArg LysAlaThr Lys Lys ThrAspLys ProArgGln GluAsp LysAspAsp LeuAspVal Thr Glu LeuThrAsn GluAspLeu LeuAsp GlnLeuVal LysTyrGly Val Asn ProGlyPro IleValGly ThrThr ArgLysLeu TyrGluLys Lys Leu LeuLysLeu ArgGluGln GlyThr GluSerArg SerSerThr Pro Leu ProThrIle SerSerSer AlaGlu AsnThrArg GlnAsnGly Ser Asn AspSerAsp ArgTyrSer AspAsn GluGluAsp SerLysIle Glu Leu LysLeuGlu LysArgGlu ProLeu LysGlyArg AlaLysThr Pro Val ThrLeuLys GlnArgArg ValGlu HisAsnGln SerTyrSer Gln Ala GlyIleThr GluThrGlu TrpThr SerGlySer SerLysGly Gly Pro LeuGlnAla LeuThrArg GluSer ThrArgGly SerArgArg Thr Pro ArgLysArg ValGluThr SerGlu HisPheArg IleAspGly Pro Val IleSerGlu SerThrPro IleAla GluThrIle MetAlaSer Ser Asn GluSerLeu ValValAsn ArgVal ThrGlyAsn PheLysHis Ala Ser ProIleLeu ProIleThr GluPhe SerAspIle ProArgArg Ala Pro LysLysPro LeuThrArg AlaGlu UalGlyGlu LysThrGlu Glu Arg ArgUalGlu ArgAspIle LeuLys GluMetPhe ProTyrGlu Ala Ser ThrProThr GlyIleSer AlaSer CysArgArg ProIleLys Gly Ala AlaGlyArg ProLeuGlu LeuSer AspPheArg MetGluGlu Ser Phe SerSerLys TyrValPro LysTyr ValProLeu AlaAspUai Lys Ser GluLysThr LysLysGly ArgSer IleProVal TrpIleLys Ile Leu LeuPheVal ValValAla ValPhe LeuPheLeu ValTyrGln Ala Met Glu Thr Asn Gln Val Asn Pro Phe Ser Asn Phe Leu His Val Asp Pro Arg Lys Ser Asn <210> 31 <211> 345 <212> PRT
<213> Homo sapiens <400> 31 Met Pro Glu Phe Leu Glu Asp Pro Ser Val Leu Thr Lys Asp Lys Leu Lys Ser Glu Leu Val Ala Asn Asn Val Thr Leu Pro Ala Gly Glu Gln Arg Lys Asp Val Tyr Val Gln Leu Tyr Leu Gln His Leu Thr Ala Arg Asn Arg Pro Pro Leu Pro Ala Gly Thr Asn Ser Lys Gly Pro Pro Asp Phe Ser Ser Asp Glu Glu Arg Glu Pro Thr Pro Val Leu Gly Ser Gly 65 70 75 g0 Ala Ala Ala Ala Gly Arg Ser Arg Ala Ala Val Gly Arg Lys Ala Thr Lys Lys Thr Asp Lys Pro Arg Gln Glu Asp Lys Asp Asp Leu Asp Val Thr Glu Leu Thr Asn Glu Asp Leu Leu Asp Gln Leu Val Lys Tyr Gly Val Asn Pro Gly Pro Ile Val Gly Thr Thr Arg Lys Leu Tyr Glu Lys Lys Leu Leu Lys Leu Arg Glu Gln Gly Thr Glu Ser Arg Ser Ser Thr Pro Leu Pro Thr Ile Ser Ser Ser Ala Glu Asn Thr Arg Gln Asn Gly Ser Asn Asp Ser Asp Arg Tyr Ser Asp Asn Glu Glu Asp Ser Lys Ile 180 185 190 ' Glu Leu Lys Leu Glu Lys Arg Glu Pro Leu Lys Gly Arg Ala Lys Thr Pro Val Thr Leu Lys Gln Arg Arg Val Glu His Asn Gln Val Gly Glu Lys Thr Glu Glu Arg Arg Ual Glu Arg Asp Ile Leu Lys Glu Met Phe Pro Tyr Glu Ala Ser Thr Pro Thr Gly Ile Ser Ala Ser Cys Arg Arg Pro Ile Lys Gly Ala Ala Gly Arg Pro Leu Glu Leu Ser Asp Phe Arg Met Glu Glu Ser Phe Ser Ser Lys Tyr Val Pro Lys Tyr Val Pro Leu Ala Asp Val Lys Ser Glu Lys Thr Lys Lys Gly Arg Ser Ile Pro Val Trp Ile Lys Ile Leu Leu Phe Val Val Val Ala Val Phe Leu Phe Leu Val Tyr Gln Ala Met Glu Thr Asn Gln Val Asn Pro Phe Ser Asn Phe Leu His Val Asp Pro Arg Lys Ser Asn <210> 32 <211> 23 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide AP2 <400> 32 actcactata gggctcgagc ggc 23
NO:1 (or its complement) at 42°C overnight in a solution comprising 50% formamide, 5x SSC (1x SSC: 0.15 M sodium chloride and 15 mM sodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution (100x Denhardt's solution: 2% (w/v) Ficoll 400, 2% (w/v) polyvinylpyrrolidone, and 2% (w/v) bovine serum albumin), 10% dextran sulfate, and 20 µg/ml denatured, sheared salmon sperm DNA. One of skill in the art can devise variations of these hybridization conditions. For example, the hybridization mixture can be incubated at a higher or lower temperature, such as about 65°C, in a solution that does not contain formamide. Moreover, premixed hybridization solutions are available (e.g., EXPRESSHYB Hybridization Solution from CLONTECH
Laboratories, Inc.), and hybridization can be performed according to the manufacturer's instructions.
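The working concentrations implied by the "5x SSC" and "5x Denhardt's solution" notation follow from simple scaling of the stock definitions given above. The following is a minimal sketch of that arithmetic; the function name scale_stock and the dictionary layout are illustrative assumptions and are not part of the disclosure.

```python
# Stock definitions taken from the hybridization conditions above.
SSC_1X = {"sodium chloride (M)": 0.15, "sodium citrate (M)": 0.015}
DENHARDT_100X = {  # values in % (w/v)
    "Ficoll 400": 2.0,
    "polyvinylpyrrolidone": 2.0,
    "bovine serum albumin": 2.0,
}


def scale_stock(stock, stock_fold, target_fold):
    """Scale each component of a stock defined at `stock_fold` x to `target_fold` x.

    Hypothetical helper: concentrations are assumed to scale linearly.
    """
    return {name: value * target_fold / stock_fold for name, value in stock.items()}


if __name__ == "__main__":
    print(scale_stock(SSC_1X, 1, 5))           # component molarities in 5x SSC
    print(scale_stock(DENHARDT_100X, 100, 5))  # component % (w/v) in 5x Denhardt's
```

Under these assumptions, 5x SSC corresponds to 0.75 M sodium chloride and 75 mM sodium citrate, and 5x Denhardt's solution contains 0.1% (w/v) of each component.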
Following hybridization, the nucleic acid molecules can be washed to remove non-hybridized nucleic acid molecules under stringent conditions, or under highly stringent conditions. Typical stringent washing conditions include washing in a solution of 0.5x-2x SSC
with 0.1% sodium dodecyl sulfate (SDS) at 55-65°C. That is, nucleic acid molecules encoding a variant ZTMPO-1 polypeptide hybridize with a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:1 (or its complement) under stringent washing conditions, in which the wash stringency is equivalent to 0.5x-2x SSC with 0.1% SDS at 50-65°C, including 0.5x SSC with 0.1% SDS at 55°C, or 2x SSC with 0.1% SDS at 65°C. One of skill in the art can readily devise equivalent conditions, for example, by substituting SSPE for SSC in the wash solution.
Typical highly stringent washing conditions include washing in a solution of 0.1x-0.2x SSC with 0.1%
sodium dodecyl sulfate (SDS) at 50-65°C. In other words, polynucleotides encoding a variant ZTMPO-1 polypeptide hybridize with a polynucleotide having the nucleotide sequence of SEQ ID NO:1 (or its complement) under highly stringent washing conditions, in which the wash stringency is equivalent to 0.1x-0.2x SSC with 0.1% SDS at 50-65°C, including 0.1x SSC with 0.1% SDS at 50°C, or 0.2x SSC with 0.1% SDS at 65°C.
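The two wash categories described above can be summarized as a simple range check. The sketch below is illustrative only: the function name classify_wash is hypothetical, 0.1% SDS is assumed throughout, and the 50-65°C window is applied to both categories as an assumption. It does not model "equivalent" stringency achieved with other buffers such as SSPE.

```python
def classify_wash(ssc_fold, temp_c):
    """Classify a wash in SSC with 0.1% SDS by the ranges recited above.

    ssc_fold: SSC concentration as a multiple of 1x SSC.
    temp_c:   wash temperature in degrees Celsius.
    """
    if not 50 <= temp_c <= 65:
        return "outside described range"
    if 0.1 <= ssc_fold <= 0.2:
        return "highly stringent"
    if 0.5 <= ssc_fold <= 2.0:
        return "stringent"
    return "outside described range"


if __name__ == "__main__":
    for ssc, temp in [(0.1, 50), (0.2, 65), (0.5, 55), (2.0, 65), (0.3, 60)]:
        print(f"{ssc}x SSC at {temp} C: {classify_wash(ssc, temp)}")
```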
The present invention also contemplates ZTMPO-1 variant polypeptides that can be identified using two criteria: a determination of the similarity between the encoded polypeptide with the amino acid sequence of SEQ ID
NO:2, and a hybridization assay, as described above. Such ZTMPO-1 variants include nucleic acid molecules (1) that hybridize with a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:1 (or its complement) under stringent washing conditions, in which the wash stringency is equivalent to 0.5x-2x SSC with 0.1% SDS at 50-65°C, and (2) that encode a polypeptide having at least 80%, at least 90%, at least 95% or greater than 95%
sequence identity to the amino acid sequence of SEQ ID
NO:2. Alternatively, ZTMPO-1 variants can be characterized as nucleic acid molecules (1) that hybridize with a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:1 (or its complement) under highly stringent washing conditions, in which the wash stringency is equivalent to 0.1x-0.2x SSC with 0.1% SDS at 50-65°C, and (2) that encode a polypeptide having at least 80%, at least 90%, at least 95% or greater than 95% sequence identity to the amino acid sequence of SEQ ID NO:2.
As previously noted, the isolated polynucleotides of the present invention include DNA and RNA. Methods for preparing DNA and RNA are well known in the art. In general, RNA is isolated from a tissue or cell that produces large amounts of ZTMPO-1 RNA. Such tissues and cells are identified by Northern blotting (Thomas, Proc. Natl. Acad. Sci. USA 77:5201, 1980), an exemplary source being human testis tissue. Total RNA can be prepared using guanidine HCl extraction followed by isolation by centrifugation in a CsCl gradient (Chirgwin et al., Biochemistry 18:52-94, 1979). Poly(A)+ RNA is prepared from total RNA using the method of Aviv and Leder (Proc. Natl. Acad. Sci. USA 69:1408-12, 1972).
Complementary DNA (cDNA) is prepared from poly(A)+ RNA
using known methods. In the alternative, genomic DNA can be isolated. Polynucleotides encoding ZTMPO-1 polypeptides are then identified and isolated by, for example, hybridization or PCR.
The polynucleotides of the present invention can also be synthesized using techniques widely known in the art. See, for example, Glick and Pasternak, Molecular Biotechnology, Principles & Applications of Recombinant DNA, (ASM Press, Washington, D.C. 1994); Itakura et al., Annu. Rev. Biochem. 53: 323-56, 1984 and Climie et al., Proc. Natl. Acad. Sci. USA 87:633-7, 1990.
The present invention further provides counterpart polypeptides and polynucleotides from other species (orthologs). These species include, but are not limited to mammalian, avian, amphibian, reptile, fish, insect and other vertebrate and invertebrate species. Of particular interest are ZTMPO-1 polypeptides from other mammalian species, including murine, porcine, ovine, bovine, canine, feline, equine, and other primate polypeptides. Orthologs of human ZTMPO-1 can be cloned using information and compositions provided by the present invention in combination with conventional cloning techniques. For example, a cDNA can be cloned using mRNA
obtained from a tissue or cell type that expresses ZTMPO-1 as disclosed herein. Suitable sources of mRNA can be identified by probing Northern blots with probes designed from the sequences disclosed herein. A library is then prepared from mRNA of a positive tissue or cell line. A
ZTMPO-1-encoding cDNA can then be isolated by a variety of methods, such as by probing with a complete or partial human cDNA or with one or more sets of degenerate probes based on the disclosed sequences. A cDNA can also be cloned using the polymerase chain reaction, or PCR
(Mullis, U.S. Patent No. 4,683,202), using primers designed from the representative human ZTMPO-1 sequence disclosed herein. Within an additional method, the cDNA
library can be used to transform or transfect host cells, and expression of the cDNA of interest can be detected with an antibody to ZTMPO-1 polypeptide. Similar techniques can also be applied to the isolation of genomic clones.
Those skilled in the art will recognize that the sequence disclosed in SEQ ID NO:1 represents a single allele of human ZTMPO-1 and that allelic variation and alternative splicing are expected to occur. Allelic variants of this sequence can be cloned by probing cDNA or genomic libraries from different individuals according to standard procedures. Allelic variants of the DNA
sequence shown in SEQ ID NO:2, including those containing silent mutations and those in which mutations result in amino acid sequence changes, are within the scope of the present invention, as are proteins which are allelic variants of SEQ ID NO:2. cDNAs generated from alternatively spliced mRNAs that retain the properties of the ZTMPO-1 polypeptide are included within the scope of the present invention, as are polypeptides encoded by such cDNAs and mRNAs. Allelic variants and splice variants of these sequences can be cloned by probing cDNA or genomic libraries from different individuals or tissues according to standard procedures known in the art.
The present invention also provides isolated ZTMPO-1 polypeptides that are substantially homologous to the polypeptides of SEQ ID NO:2 and their orthologs. The term "substantially homologous" is used herein to denote polypeptides having 50%, preferably 60%, more preferably at least 80%, sequence identity to the sequences shown in SEQ ID NO:2 or their orthologs. Such polypeptides will more preferably be at least 90% identical, and most preferably 95% or more identical, to SEQ ID NO:2 or its orthologs. Percent sequence identity is determined by conventional methods. See, for example, Altschul et al., Bull. Math. Bio. 48:603-16, 1986 and Henikoff and Henikoff, Proc. Natl. Acad. Sci. USA 89:10915-9, 1992. The present invention further includes nucleic acid molecules that encode such polypeptides. Methods for determining percent identity are described below.
Briefly, two amino acid sequences are aligned to optimize the alignment scores using a gap opening penalty of 10, a gap extension penalty of 1, and the "BLOSUM62" scoring matrix of Henikoff and Henikoff (ibid.) as shown in Table 3 (amino acids are indicated by the standard one-letter codes). The percent identity is then calculated as:
(Total number of identical matches / [length of the longer sequence plus the number of gaps introduced into the longer sequence in order to align the two sequences]) x 100
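The ratio above can be computed directly once two sequences have been aligned. The following is a minimal sketch that assumes the alignment has already been produced (for example, with the gap penalties and scoring matrix recited above) and is supplied as two equal-length strings in which "-" marks a gap. The function name percent_identity and the tie-breaking rule for sequences of equal length are assumptions for illustration.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity of two pre-aligned sequences.

    Implements: identical matches / (length of the longer sequence
    + gaps introduced into the longer sequence) x 100.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")

    # Count columns where both residues are identical and neither is a gap.
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap)

    gaps_a = aligned_a.count(gap)
    gaps_b = aligned_b.count(gap)
    len_a = len(aligned_a) - gaps_a  # ungapped length of sequence a
    len_b = len(aligned_b) - gaps_b  # ungapped length of sequence b

    # Assumption: when both sequences have equal length, the first is
    # treated as "the longer sequence".
    if len_a >= len_b:
        longer_len, longer_gaps = len_a, gaps_a
    else:
        longer_len, longer_gaps = len_b, gaps_b

    return 100.0 * matches / (longer_len + longer_gaps)


if __name__ == "__main__":
    # Toy alignment: 7 identical positions, longer sequence of 8 residues, no gaps in it.
    print(percent_identity("MTMDALLA", "MTM-ALLA"))
```

In the toy alignment shown, seven identical matches over a denominator of eight give 87.5% identity.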
Table 3
    A   R   N   D   C   Q   E   G   H   I   L   K   M   F   P   S   T   W   Y   V
A   4
R  -1   5
N  -2   0   6
D  -2  -2   1   6
C   0  -3  -3  -3   9
Q  -1   1   0   0  -3   5
E  -1   0   0   2  -4   2   5
G   0  -2   0  -1  -3  -2  -2   6
H  -2   0   1  -1  -3   0   0  -2   8
I  -1  -3  -3  -3  -1  -3  -3  -4  -3   4
L  -1  -2  -3  -4  -1  -2  -3  -4  -3   2   4
K  -1   2   0  -1  -3   1   1  -2  -1  -3  -2   5
M  -1  -1  -2  -3  -1   0  -2  -3  -2   1   2  -1   5
F  -2  -3  -3  -3  -2  -3  -3  -3  -1   0   0  -3   0   6
P  -1  -2  -2  -1  -3  -1  -1  -2  -2  -3  -3  -1  -2  -4   7
S   1  -1   1   0  -1   0   0   0  -1  -2  -2   0  -1  -2  -1   4
T   0  -1   0  -1  -1  -1  -1  -2  -2  -1  -1  -1  -1  -2  -1   1   5
W  -3  -3  -4  -4  -2  -2  -3  -2  -2  -3  -2  -3  -1   1  -4  -3  -2  11
Y  -2  -2  -2  -3  -2  -1  -2  -3   2  -1  -1  -2  -1   3  -3  -2  -2   2   7
V   0  -3  -3  -3  -1  -2  -2  -3  -3   3   1  -2   1  -1  -2  -2   0  -3  -1   4
Those skilled in the art appreciate that there are many established algorithms available to align two amino acid sequences. The "FASTA" similarity search algorithm of Pearson and Lipman is a suitable protein alignment method for examining the level of identity shared by an amino acid sequence disclosed herein and the amino acid sequence of a putative variant ZTMPO-1. The FASTA algorithm is described by Pearson and Lipman, Proc.
Nat. Acad. Sci. USA 85:2444, 1988, and by Pearson, Meth.
Enzymol. 183:63, 1990.
Briefly, FASTA first characterizes sequence similarity by identifying regions shared by the query sequence (e.g., SEQ ID NO:2) and a test sequence that have either the highest density of identities (if the ktup variable is 1) or pairs of identities (if ktup=2), without considering conservative amino acid substitutions, insertions, or deletions. The ten regions with the highest density of identities are then re-scored by comparing the similarity of all paired amino acids using an amino acid substitution matrix, and the ends of the regions are "trimmed" to include only those residues that contribute to the highest score. If there are several regions with scores greater than the "cutoff" value (calculated by a predetermined formula based upon the length of the sequence and the ktup value), then the trimmed initial regions are examined to determine whether the regions can be joined to form an approximate alignment with gaps. Finally, the highest scoring regions of the two amino acid sequences are aligned using a modification of the Needleman-Wunsch-Sellers algorithm (Needleman and Wunsch, J. Mol. Biol. 48:444, 1970; Sellers, SIAM J. Appl.
Math. 26:787, 1974), which allows for amino acid insertions and deletions. Illustrative parameters for FASTA analysis are: ktup=1, gap opening penalty=10, gap extension penalty=1, and substitution matrix=BLOSUM62.
These parameters can be introduced into a FASTA program by modifying the scoring matrix file ("SMATRIX"), as explained in Appendix 2 of Pearson, Meth. Enzymol. 183:63, 1990.
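The first FASTA step described above, locating identities (ktup=1) or pairs of identities (ktup=2) shared by a query and a test sequence, can be illustrated with a word lookup that tallies hits per alignment diagonal. This is a simplified sketch of that single step only, not the FASTA program itself; the function name ktup_diagonal_hits is hypothetical, and the re-scoring, trimming, joining and final alignment steps are omitted.

```python
from collections import defaultdict


def ktup_diagonal_hits(query, test, ktup=1):
    """Count shared ktup-length words per diagonal (query index minus test index).

    Diagonals with many hits correspond to ungapped regions rich in identities.
    """
    # Index every ktup-length word of the query by its start position.
    index = defaultdict(list)
    for i in range(len(query) - ktup + 1):
        index[query[i:i + ktup]].append(i)

    # Look up each word of the test sequence and tally hits by diagonal.
    hits = defaultdict(int)
    for j in range(len(test) - ktup + 1):
        for i in index.get(test[j:j + ktup], ()):
            hits[i - j] += 1
    return dict(hits)


if __name__ == "__main__":
    # Toy example using the first residues of SEQ ID NO:2 as the query.
    print(ktup_diagonal_hits("MTMDALLA", "TMDALL", ktup=2))
```

In the toy example all five shared two-residue words fall on diagonal 1, which is the kind of high-density region that the later steps would re-score with a substitution matrix.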
FASTA can also be used to determine the sequence identity of nucleic acid molecules using a ratio as disclosed above. For nucleotide sequence comparisons, the ktup value can range from one to six, preferably from four to six.
Substantially homologous proteins and polypeptides are characterized as having one or more amino acid substitutions, deletions or additions. These changes are preferably of a minor nature, that is, conservative amino acid substitutions and other substitutions that do not significantly affect the folding or activity of the protein or polypeptide; small deletions, typically of one to about 30 amino acids; and small amino- or carboxyl-terminal extensions, such as an amino-terminal methionine residue, a small linker peptide of up to about 20-25 residues, or an affinity tag. Polypeptides comprising affinity tags can further comprise a proteolytic cleavage site between the ZTMPO-1 polypeptide and the affinity tag.
Preferred such sites include thrombin cleavage sites and factor Xa cleavage sites.
The present invention includes nucleic acid molecules that encode a polypeptide having one or more "conservative amino acid substitutions," compared with the amino acid sequence of SEQ ID NO:2. Conservative amino acid substitutions can be based upon the chemical properties of the amino acids. That is, variants can be obtained that contain one or more amino acid substitutions of SEQ ID NO:2, in which an alkyl amino acid is substituted for an alkyl amino acid in a ZTMPO-1 amino acid sequence, an aromatic amino acid is substituted for an aromatic amino acid in a ZTMPO-1 amino acid sequence, a sulfur-containing amino acid is substituted for a sulfur-containing amino acid in a ZTMPO-1 amino acid sequence, a hydroxy-containing amino acid is substituted for a hydroxy-containing amino acid in a ZTMPO-1 amino acid sequence, an acidic amino acid is substituted for an acidic amino acid in a ZTMPO-1 amino acid sequence, a basic amino acid is substituted for a basic amino acid in a ZTMPO-1 amino acid sequence, or a dibasic monocarboxylic amino acid is substituted for a dibasic monocarboxylic amino acid in a ZTMPO-1 amino acid sequence.
Among the common amino acids, for example, a "conservative amino acid substitution" is illustrated by a substitution among amino acids within each of the following groups: (1) glycine, alanine, valine, leucine, and isoleucine, (2) phenylalanine, tyrosine, and tryptophan, (3) serine and threonine, (4) aspartate and glutamate, (5) glutamine and asparagine, and (6) lysine, arginine and histidine. Other conservative amino acid substitutions are provided in Table 4.
Table 4
Conservative amino acid substitutions

Basic: arginine, lysine, histidine
Acidic: glutamic acid, aspartic acid
Polar: glutamine, asparagine
Hydrophobic: leucine, isoleucine, valine
Aromatic: phenylalanine, tryptophan, tyrosine
Small: glycine, alanine, serine, threonine, methionine

The BLOSUM62 table is an amino acid substitution matrix derived from about 2,000 local multiple alignments of protein sequence segments, representing highly conserved regions of more than 500 groups of related proteins (Henikoff and Henikoff, Proc. Natl. Acad. Sci. USA 89:10915, 1992). Accordingly, the BLOSUM62 substitution frequencies can be used to define conservative amino acid substitutions that may be introduced into the amino acid sequences of the present invention. Although it is possible to design amino acid substitutions based solely upon chemical properties (as discussed above), the language "conservative amino acid substitution" preferably refers to a substitution represented by a BLOSUM62 value of greater than -1. For example, an amino acid substitution is conservative if the substitution is characterized by a BLOSUM62 value of 0, 1, 2, or 3. According to this system, preferred conservative amino acid substitutions are characterized by a BLOSUM62 value of at least 1 (e.g., 1, 2 or 3), while more preferred conservative amino acid substitutions are characterized by a BLOSUM62 value of at least 2 (e.g., 2 or 3).
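The two definitions of a conservative substitution given above can be sketched in code. The first function checks the six chemical groups enumerated in the text; the second applies the stated BLOSUM62 thresholds to a score that the caller looks up in a BLOSUM62 matrix. The function names and tier labels are illustrative assumptions, and no matrix values are reproduced here.

```python
# Groups (1)-(6) as enumerated in the text, in one-letter amino acid codes.
CONSERVATIVE_GROUPS = [
    set("GAVLI"),  # glycine, alanine, valine, leucine, isoleucine
    set("FYW"),    # phenylalanine, tyrosine, tryptophan
    set("ST"),     # serine, threonine
    set("DE"),     # aspartate, glutamate
    set("QN"),     # glutamine, asparagine
    set("KRH"),    # lysine, arginine, histidine
]


def same_chemical_group(original: str, replacement: str) -> bool:
    """True if both residues fall within one of the six listed groups."""
    a, b = original.upper(), replacement.upper()
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


def blosum62_tier(score: int) -> str:
    """Classify a BLOSUM62 score using the thresholds stated in the text."""
    if score >= 2:
        return "more preferred"    # at least 2
    if score >= 1:
        return "preferred"         # at least 1
    if score > -1:
        return "conservative"      # greater than -1, i.e. 0
    return "not conservative"


if __name__ == "__main__":
    print(same_chemical_group("L", "I"))  # both in group (1)
    print(blosum62_tier(0))
```

The two criteria need not agree for every residue pair, which is why the text states a preference for the BLOSUM62-based definition.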
Conservative amino acid changes in a ZTMPO-1 gene can be introduced by substituting nucleotides for the nucleotides recited in SEQ ID NO:1. Such "conservative amino acid" variants can be obtained, for example, by oligonucleotide-directed mutagenesis, linker-scanning mutagenesis, mutagenesis using the polymerase chain reaction, and the like (see Ausubel (1995) at pages 8-10 to 8-22; and McPherson (ed.), Directed Mutagenesis: A Practical Approach (IRL Press 1991)). The ability of such variants to promote proliferation and cardiac functions, as well as other properties of the wild-type protein, can be determined using standard methods, such as the assays described herein. Alternatively, a variant ZTMPO-1 polypeptide can be identified by the ability to specifically bind anti-ZTMPO-1 antibodies.
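As an in silico illustration of introducing an amino acid change by substituting nucleotides, the sketch below replaces one codon in a coding sequence and translates the result with the standard genetic code. The short sequence used is hypothetical and is not SEQ ID NO:1; the function names are likewise assumptions.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def substitute_codon(dna: str, residue: int, new_codon: str) -> str:
    """Replace the codon encoding the 1-based residue position with new_codon."""
    new_codon = new_codon.upper()
    if new_codon not in CODON_TABLE:
        raise ValueError(f"not a valid codon: {new_codon!r}")
    start = 3 * (residue - 1)
    if residue < 1 or start + 3 > len(dna):
        raise IndexError("residue position outside the coding sequence")
    return dna[:start].upper() + new_codon + dna[start + 3:].upper()


if __name__ == "__main__":
    wild_type = "ATGCTGGAA"  # hypothetical fragment encoding Met-Leu-Glu
    variant = substitute_codon(wild_type, 2, "ATT")  # Leu -> Ile
    print(translate(wild_type), "->", translate(variant))
```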
The proteins of the present invention can also comprise non-naturally occurring amino acid residues.
Non-naturally occurring amino acids include, without limitation, trans-3-methylproline, 2,4-methanoproline, cis-4-hydroxyproline, trans-4-hydroxyproline, N-methylglycine, allo-threonine, methylthreonine, hydroxyethylcysteine, hydroxyethylhomocysteine, nitroglutamine, homoglutamine, pipecolic acid, thiazolidine carboxylic acid, dehydroproline, 3- and 4-methylproline, 3,3-dimethylproline, tert-leucine, norvaline, 2-azaphenylalanine, 3-azaphenylalanine, 4-azaphenylalanine, and 4-fluorophenylalanine. Several methods are known in the art for incorporating non-naturally occurring amino acid residues into proteins. For example, an in vitro system can be employed wherein nonsense mutations are suppressed using chemically aminoacylated suppressor tRNAs. Methods for synthesizing amino acids and aminoacylating tRNA are known in the art. Transcription and translation of plasmids containing nonsense mutations is carried out in a cell-free system comprising an E. coli S30 extract and commercially available enzymes and other reagents.
Proteins are purified by chromatography. See, for example, Robertson et al., J. Am. Chem. Soc. 113:2722, 1991; Ellman et al., Methods Enzymol. 202:301, 1991; Chung et al., Science 259:806-9, 1993; and Chung et al., Proc.
Natl. Acad. Sci. USA 90:10145-9, 1993. In a second method, translation is carried out in Xenopus oocytes by microinjection of mutated mRNA and chemically aminoacylated suppressor tRNAs (Turcatti et al., J. Biol.
Chem. 271:19991-8, 1996). Within a third method, E. coli cells are cultured in the absence of a natural amino acid that is to be replaced (e.g., phenylalanine) and in the presence of the desired non-naturally occurring amino acid(s) (e.g., 2-azaphenylalanine, 3-azaphenylalanine, 4-azaphenylalanine, or 4-fluorophenylalanine). The non-naturally occurring amino acid is incorporated into the protein in place of its natural counterpart. See, Koide et al., Biochem. 33:7470-6, 1994. Naturally occurring amino acid residues can be converted to non-naturally occurring species by in vitro chemical modification.
Chemical modification can be combined with site-directed mutagenesis to further expand the range of substitutions (Wynn and Richards, Protein Sci. 2:395-403, 1993).
A limited number of non-conservative amino acids, amino acids that are not encoded by the genetic code, non-naturally occurring amino acids, and unnatural amino acids may be substituted for ZTMPO-1 amino acid residues.
Essential amino acids in the polypeptides of the present invention can be identified according to procedures known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham and Wells, Science 244: 1081-5, 1989; Bass et al., Proc.
Natl. Acad. Sci. USA 88:4498-502, 1991). In the latter technique, single alanine mutations are introduced at every residue in the molecule, and the resultant mutant molecules are tested for biological activity as disclosed below to identify amino acid residues that are critical to the activity of the molecule. See also, Hilton et al., J. Biol. Chem. 271:4699-708, 1996. Sites of ligand-receptor interaction can also be determined by physical analysis of structure, as determined by such techniques as nuclear magnetic resonance, crystallography, electron diffraction or photoaffinity labeling, in conjunction with mutation of putative contact site amino acids. See, for example, de Vos et al., Science 255:306-12, 1992; Smith et al., J. Mol. Biol. 224:899-904, 1992; Wlodawer et al., FEBS Lett.
309:59-64, 1992. The identities of essential amino acids can also be inferred from analysis of homologies with related nuclear membrane bound proteins.
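The alanine-scanning design described above can be enumerated programmatically. The following is a minimal sketch that lists every single-alanine mutant of a given protein sequence; skipping positions that are already alanine is an assumption made here, and the example sequence is hypothetical.

```python
def alanine_scan(protein: str) -> list[tuple[str, str]]:
    """Return (label, mutant_sequence) for each single residue-to-alanine change.

    Labels use the usual notation, e.g. 'K3A' for lysine 3 replaced by alanine.
    Positions that are already alanine are skipped (an assumption of this sketch).
    """
    protein = protein.upper()
    mutants = []
    for index, residue in enumerate(protein):
        if residue == "A":
            continue
        label = f"{residue}{index + 1}A"
        mutants.append((label, protein[:index] + "A" + protein[index + 1:]))
    return mutants


if __name__ == "__main__":
    for label, sequence in alanine_scan("MAKLE"):  # hypothetical peptide
        print(label, sequence)
```

Each mutant in the list would then be expressed and tested in an activity assay; residues whose replacement abolishes activity are candidates for essential positions.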
Multiple amino acid substitutions can be made and tested using known methods of mutagenesis and screening, such as those disclosed by Reidhaar-Olson and Sauer (Science 241:53-7, 1988) or Bowie and Sauer (Proc.
Natl. Acad. Sci. USA 86:2152-6, 1989). Briefly, these authors disclose methods for simultaneously randomizing two or more positions in a polypeptide, selecting for functional polypeptide, and then sequencing the mutagenized polypeptides to determine the spectrum of allowable substitutions at each position. Other methods that can be used include phage display (e.g., Lowman et al., Biochem. 30:10832-7, 1991; Ladner et al., U.S. Patent No. 5,223,409; Huse, WIPO Publication WO 92/06204) and region-directed mutagenesis (Derbyshire et al., Gene 46:145, 1986; Ner et al., DNA 7:127, 1988).
Variants of the disclosed ZTMPO-1 DNA and polypeptide sequences can be generated through DNA
shuffling as disclosed by Stemmer, Nature 370:389-91, 1994 and Stemmer, Proc. Natl. Acad. Sci. USA 91:10747-51, 1994.
Briefly, variant DNAs are generated by in vitro homologous recombination by random fragmentation of a parent DNA
followed by reassembly using PCR, resulting in randomly introduced point mutations. This technique can be modified by using a family of parent DNAs, such as allelic variants or genes from different species, to introduce additional variability into the process. Selection or screening for the desired activity, followed by additional iterations of mutagenesis and assay provides for rapid "evolution" of sequences by selecting for desirable mutations while simultaneously selecting against detrimental changes.
Mutagenesis methods as disclosed herein can be combined with high-throughput, automated screening methods to detect activity of cloned, mutagenized polypeptides in host cells. Preferred assays in this regard include cell proliferation assays and biosensor-based ligand-binding assays, which are described below. Mutagenized DNA
molecules that encode active polypeptides can be recovered from the host cells and rapidly sequenced using modern equipment. These methods allow the rapid determination of the importance of individual amino acid residues in a polypeptide of interest, and can be applied to polypeptides of unknown structure.
Using the methods discussed herein, one of ordinary skill in the art can identify and/or prepare a variety of polypeptide fragments or variants of SEQ ID NO:2 that retain the receptor binding properties of the wild-type ZTMPO-1 protein. Such polypeptides may also include additional polypeptide segments as generally disclosed herein.
For any ZTMPO-1 polypeptide, including variants and fusion proteins, one of ordinary skill in the art can readily generate a fully degenerate polynucleotide sequence encoding that variant using the information set forth in Tables 1 and 2 above.
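Back-translation into a degenerate polynucleotide can be sketched as a simple lookup. The mapping below uses the conventional IUPAC degenerate codon for each amino acid; it is assumed, not confirmed from this section, that this matches the referenced Tables 1 and 2. The function name and example peptide are hypothetical.

```python
# Conventional IUPAC degenerate codons (assumed to correspond to the
# document's codon tables, which are not reproduced in this section).
# N = A/C/G/T, R = A/G, Y = C/T, H = A/C/T, W = A/T, S = C/G, M = A/C.
DEGENERATE_CODON = {
    "A": "GCN", "C": "TGY", "D": "GAY", "E": "GAR", "F": "TTY",
    "G": "GGN", "H": "CAY", "I": "ATH", "K": "AAR", "L": "YTN",
    "M": "ATG", "N": "AAY", "P": "CCN", "Q": "CAR", "R": "MGN",
    "S": "WSN", "T": "ACN", "V": "GTN", "W": "TGG", "Y": "TAY",
}


def back_translate_degenerate(protein: str) -> str:
    """Return a degenerate DNA sequence covering all codons for each residue.

    Note: the codons for Leu (YTN), Arg (MGN) and Ser (WSN) also match some
    triplets that encode other amino acids, so a degenerate sequence can
    describe molecules encoding variant polypeptides.
    """
    try:
        return "".join(DEGENERATE_CODON[residue] for residue in protein.upper())
    except KeyError as exc:
        raise ValueError(f"unsupported residue: {exc.args[0]!r}") from None


if __name__ == "__main__":
    print(back_translate_degenerate("MSLE"))  # hypothetical peptide
```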
As used herein, a fusion protein consists essentially of a first portion and a second portion joined by a peptide bond. In one embodiment the first portion consists of a polypeptide comprising a sequence of amino acid residues that is at least 80% identical in amino acid sequence to residues 1 through 876 of SEQ ID NO:2 and the second portion is any other polypeptide. The other polypeptide may be alternative or additional domains from other members of the thymopoietin or emerin family, a signal peptide to facilitate secretion of the fusion protein, affinity tags, Ig domains or the like.
The ZTMPO-1 polypeptides of the present invention, including full-length polypeptides, biologically active fragments, and fusion polypeptides, can be produced in genetically engineered host cells according to conventional techniques. Suitable host cells are those cell types that can be transformed or transfected with exogenous DNA and grown in culture, and include bacteria, fungal cells, and cultured higher eukaryotic cells. Eukaryotic cells, particularly cultured cells of multicellular organisms, are preferred.
Techniques for manipulating cloned DNA molecules and introducing exogenous DNA into a variety of host cells are disclosed by Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989, and Ausubel et al., eds., Current Protocols in Molecular Biology, John Wiley and Sons, Inc., NY, 1987.
In general, a DNA sequence encoding a ZTMPO-1 polypeptide is operably linked to other genetic elements required for its expression, generally including a transcription promoter and terminator, within an expression vector. The vector will also commonly contain one or more selectable markers and one or more origins of replication, although those skilled in the art will recognize that within certain systems selectable markers may be provided on separate vectors, and replication of the exogenous DNA may be provided by integration into the host cell genome. Selection of promoters, terminators, selectable markers, vectors and other elements is a matter of routine design within the level of ordinary skill in the art. Many such elements are described in the literature and are available through commercial suppliers.
To direct a ZTMPO-1 polypeptide into the secretory pathway of a host cell, a secretory signal sequence (also known as a leader sequence, signal sequence, prepro sequence or pre sequence) is provided in the expression vector. The secretory signal sequence may be derived from another secreted protein (e.g., t-PA) or synthesized de novo. The secretory signal sequence is operably linked to the ZTMPO-1 DNA sequence, i.e., the two sequences are joined in the correct reading frame and positioned to direct the newly synthesized polypeptide into the secretory pathway of the host cell. Secretory signal sequences are commonly positioned 5' to the DNA
sequence encoding the polypeptide of interest, although certain secretory signal sequences may be positioned elsewhere in the DNA sequence of interest (see, e.g., Welch et al., U.S. Patent No. 5,037,743; Holland et al., U.S. Patent No. 5,143,830).
Cultured mammalian cells are suitable hosts within the present invention. Methods for introducing exogenous DNA into mammalian host cells include calcium phosphate-mediated transfection (Wigler et al., Cell 14:725, 1978; Corsaro and Pearson, Somatic Cell Genetics 7:603, 1981; Graham and Van der Eb, Virology 52:456, 1973), electroporation (Neumann et al., EMBO J. 1:841-5, 1982), DEAE-dextran mediated transfection (Ausubel et al., ibid.), liposome-mediated transfection (Hawley-Nelson et al., Focus 15:73, 1993; Ciccarone et al., Focus 15:80, 1993), and viral vectors (Miller and Rosman, BioTechniques 7:980-90, 1989; Wang and Finer, Nature Med. 2:714-6, 1996). The production of recombinant polypeptides in cultured mammalian cells is disclosed, for example, by Levinson et al., U.S. Patent No. 4,713,339; Hagen et al., U.S. Patent No. 4,784,950; Palmiter et al., U.S. Patent No. 4,579,821; and Ringold, U.S. Patent No. 4,656,134.
Suitable cultured mammalian cells include the COS-1 (ATCC
No. CRL 1650), COS-7 (ATCC No. CRL 1651), BHK (ATCC No.
CRL 1632), BHK 570 (ATCC No. CRL 10314), 293 (ATCC No. CRL
1573; Graham et al., J. Gen. Virol. 36:59-72, 1977) and Chinese hamster ovary (e.g., CHO-K1; ATCC No. CCL 61) cell lines. Additional suitable cell lines are known in the art and available from public depositories such as the American Type Culture Collection, Rockville, Maryland. In general, strong transcription promoters are preferred, such as promoters from SV-40 or cytomegalovirus. See, e.g., U.S. Patent No. 4,956,288. Other suitable promoters include those from metallothionein genes (U.S. Patent Nos.
4,579,821 and 4,601,978) and the adenovirus major late promoter.
Drug selection is generally used to select for cultured mammalian cells into which foreign DNA has been inserted. Such cells are commonly referred to as "transfectants". Cells that have been cultured in the presence of the selective agent and are able to pass the gene of interest to their progeny are referred to as "stable transfectants." A preferred selectable marker is a gene encoding resistance to the antibiotic neomycin.
Selection is carried out in the presence of a neomycin-type drug, such as G-418 or the like. Selection systems can also be used to increase the expression level of the gene of interest, a process referred to as "amplification." Amplification is carried out by culturing transfectants in the presence of a low level of the selective agent and then increasing the amount of selective agent to select for cells that produce high levels of the products of the introduced genes. A
preferred amplifiable selectable marker is dihydrofolate reductase, which confers resistance to methotrexate.
Other drug resistance genes (e.g., hygromycin resistance, multi-drug resistance, puromycin acetyltransferase) can also be used. Alternative markers that introduce an altered phenotype, such as green fluorescent protein, or cell surface proteins such as CD4, CD8, Class I MHC, or placental alkaline phosphatase, may be used to sort transfected cells from untransfected cells by such means as FACS sorting or magnetic bead separation technology.
Other higher eukaryotic cells can also be used as hosts, including plant cells, insect cells and avian cells. The use of Agrobacterium rhizogenes as a vector for expressing genes in plant cells has been reviewed by Sinkar et al., J. Biosci. (Bangalore) 11:47-58, 1987.
Transformation of insect cells and production of foreign polypeptides therein is disclosed by Guarino et al., U.S.
Patent No. 5,162,222 and WIPO publication WO 94/06463.
Insect cells can be infected with recombinant baculovirus vectors, which are commonly derived from Autographa californica multiple nuclear polyhedrosis virus (AcMNPV).
DNA encoding the polypeptide of interest is inserted into the viral genome in place of the polyhedrin gene coding sequence by homologous recombination in cells infected with intact, wild-type AcMNPV and transfected with a transfer vector comprising the cloned gene operably linked to polyhedrin gene promoter, terminator, and flanking sequences. The resulting recombinant virus is used to infect host cells, typically a cell line derived from the fall armyworm, Spodoptera frugiperda. See, in general, Glick and Pasternak, Molecular Biotechnology: Principles and Applications of Recombinant DNA, ASM Press, Washington, D.C., 1994.
Fungal cells, including yeast cells, can also be used within the present invention. Yeast species of particular interest in this regard include Saccharomyces cerevisiae, Pichia pastoris, and Pichia methanolica.
Methods for transforming S. cerevisiae cells with exogenous DNA and producing recombinant polypeptides therefrom are disclosed by, for example, Kawasaki, U.S.
Patent No. 4,599,311; Kawasaki et al., U.S. Patent No.
4,931,373; Brake, U.S. Patent No. 4,870,008; Welch et al., U.S. Patent No. 5,037,743; and Murray et al., U.S. Patent No. 4,845,075. Transformed cells are selected by phenotype determined by the selectable marker, commonly drug resistance or the ability to grow in the absence of a particular nutrient (e.g., leucine). A preferred vector system for use in Saccharomyces cerevisiae is the POT1 vector system disclosed by Kawasaki et al. (U.S. Patent No. 4,931,373), which allows transformed cells to be selected by growth in glucose-containing media. Suitable promoters and terminators for use in yeast include those from glycolytic enzyme genes (see, e.g., Kawasaki, U.S.
Patent No. 4,599,311; Kingsman et al., U.S. Patent No.
4,615,974; and Bitter, U.S. Patent No. 4,977,092) and alcohol dehydrogenase genes. See also U.S. Patents Nos.
4,990,446; 5,063,154; 5,139,936 and 4,661,454.
Transformation systems for other yeasts, including Hansenula polymorpha, Schizosaccharomyces pombe, Kluyveromyces lactis, Kluyveromyces fragilis, Ustilago maydis, Pichia pastoris, Pichia methanolica, Pichia guillermondii and Candida maltosa are known in the art.
See, for example, Gleeson et al., J. Gen. Microbiol.
132:3459-65, 1986 and Cregg, U.S. Patent No. 4,882,279.
Aspergillus cells may be utilized according to the methods of McKnight et al., U.S. Patent No. 4,935,349. Methods for transforming Acremonium chrysogenum are disclosed by Sumino et al., U.S. Patent No. 5,162,228. Methods for transforming Neurospora are disclosed by Lambowitz, U.S.
Patent No. 4,486,533.
The use of Pichia methanolica as host for the production of recombinant proteins is disclosed in WIPO Publications WO 97/17450 and WO 97/17451. DNA molecules for use in transforming P. methanolica will commonly be prepared as double-stranded, circular plasmids, which are preferably linearized prior to transformation. For polypeptide production in P. methanolica, it is preferred that the promoter and terminator in the plasmid be that of a P. methanolica gene, such as a P. methanolica alcohol utilization gene (AUG1 or AUG2). Other useful promoters include those of the dihydroxyacetone synthase (DHAS), formate dehydrogenase (FMD), and catalase (CAT) genes. To facilitate integration of the DNA into the host chromosome, it is preferred to have the entire expression segment of the plasmid flanked at both ends by host DNA sequences. A preferred selectable marker for use in Pichia methanolica is a P. methanolica ADE2 gene, which encodes phosphoribosyl-5-aminoimidazole carboxylase (AIRC; EC 4.1.1.21), which allows ade2 host cells to grow in the absence of adenine. For large-scale, industrial processes where it is desirable to minimize the use of methanol, it is preferred to use host cells in which both methanol utilization genes (AUG1 and AUG2) are deleted. For production of secreted proteins, host cells deficient in vacuolar protease genes (PEP4 and PRB1) are preferred.
Electroporation is used to facilitate the introduction of a plasmid containing DNA encoding a polypeptide of interest into P. methanolica cells. It is preferred to transform P. methanolica cells by electroporation using an exponentially decaying, pulsed electric field having a field strength of from 2.5 to 4.5 kV/cm, preferably about 3.75 kV/cm, and a time constant (τ) of from 1 to 40 milliseconds, most preferably about 20 milliseconds.
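The stated electroporation parameters translate into instrument settings by simple arithmetic: the pulse voltage is the field strength multiplied by the electrode gap, and the time constant of an exponentially decaying pulse is the product of circuit resistance and capacitance. The sketch below assumes a 0.2 cm cuvette gap and illustrative resistance and capacitance values, none of which come from the text.

```python
# Ranges taken from the text.
FIELD_RANGE_KV_PER_CM = (2.5, 4.5)
TIME_CONSTANT_RANGE_MS = (1.0, 40.0)


def pulse_voltage_kv(field_kv_per_cm: float, gap_cm: float) -> float:
    """Voltage (kV) needed across a cuvette gap to reach a field strength."""
    return field_kv_per_cm * gap_cm


def time_constant_ms(resistance_ohm: float, capacitance_uf: float) -> float:
    """Time constant of an exponential decay pulse: ohm x microfarad = microseconds."""
    return resistance_ohm * capacitance_uf / 1000.0


def within_stated_ranges(field_kv_per_cm: float, tau_ms: float) -> bool:
    """Check a setting against the ranges given in the text."""
    lo_f, hi_f = FIELD_RANGE_KV_PER_CM
    lo_t, hi_t = TIME_CONSTANT_RANGE_MS
    return lo_f <= field_kv_per_cm <= hi_f and lo_t <= tau_ms <= hi_t


if __name__ == "__main__":
    gap_cm = 0.2  # assumed cuvette gap, not from the text
    tau = time_constant_ms(resistance_ohm=800.0, capacitance_uf=25.0)  # assumed values
    print(pulse_voltage_kv(3.75, gap_cm), "kV;", tau, "ms;",
          within_stated_ranges(3.75, tau))
```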
Prokaryotic host cells, including strains of the bacteria Escherichia coli, Bacillus and other genera are also useful as host cells within the present invention. Techniques for transforming these hosts and expressing foreign DNA sequences cloned therein are well known in the art (see, e.g., Sambrook et al., ibid.). When expressing a ZTMPO-1 polypeptide in bacteria such as E. coli, the polypeptide may be retained in the cytoplasm, typically as insoluble granules, or may be directed to the periplasmic space by a bacterial secretion sequence. In the former case, the cells are lysed, and the granules are recovered and denatured using, for example, guanidine isothiocyanate or urea. The denatured polypeptide can then be refolded and dimerized by diluting the denaturant, such as by dialysis against a solution of urea and a combination of reduced and oxidized glutathione, followed by dialysis against a buffered saline solution. In the latter case, the polypeptide can be recovered from the periplasmic space in a soluble and functional form by disrupting the cells (by, for example, sonication or osmotic shock) to release the contents of the periplasmic space and recovering the protein, thereby obviating the need for denaturation and refolding.
Transformed or transfected host cells are cultured according to conventional procedures in a culture medium containing nutrients and other components required for the growth of the chosen host cells. A variety of suitable media, including defined media and complex media, are known in the art and generally include a carbon source, a nitrogen source, essential amino acids, vitamins and minerals. Media may also contain such components as growth factors or serum, as required. The growth medium will generally select for cells containing the exogenously added DNA by, for example, drug selection or deficiency in an essential nutrient which is complemented by the selectable marker carried on the expression vector or co-transfected into the host cell. P. methanolica cells are cultured in a medium comprising adequate sources of carbon, nitrogen and trace nutrients at a temperature of about 25°C to 35°C. Liquid cultures are provided with sufficient aeration by conventional means, such as shaking of small flasks or sparging of fermentors. A preferred culture medium for P. methanolica is YEPD (2% D-glucose, 2% Bacto™ Peptone (Difco Laboratories, Detroit, MI), 1% Bacto™ yeast extract (Difco Laboratories), 0.004% adenine and 0.006% L-leucine).
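The YEPD recipe can be converted into weighed amounts for a given batch volume. This sketch assumes the percentages are weight per volume (1% = 10 g per liter) and reads the peptone and yeast extract entries as 2% and 1% respectively; both points are interpretations, not statements from the text.

```python
# Percent (w/v, assumed) of each YEPD component.
YEPD_PERCENT_WV = {
    "D-glucose": 2.0,
    "peptone": 2.0,          # read as 2% (assumption)
    "yeast extract": 1.0,    # read as 1% (assumption)
    "adenine": 0.004,
    "L-leucine": 0.006,
}


def grams_needed(volume_liters: float) -> dict[str, float]:
    """Grams of each component for the requested volume (1% w/v = 10 g/L)."""
    if volume_liters <= 0:
        raise ValueError("volume must be positive")
    return {name: pct * 10.0 * volume_liters
            for name, pct in YEPD_PERCENT_WV.items()}


if __name__ == "__main__":
    for name, grams in grams_needed(0.5).items():
        print(f"{name}: {grams:.3f} g")
```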
It is preferred to purify the polypeptides of the present invention to ≥80% purity, more preferably to ≥90% purity, even more preferably ≥95% purity, and particularly preferred is a pharmaceutically pure state, that is greater than 99.9% pure with respect to contaminating macromolecules, particularly other proteins and nucleic acids, and free of infectious and pyrogenic agents. Preferably, a purified polypeptide is substantially free of other polypeptides, particularly other polypeptides of animal origin.
Expressed recombinant ZTMPO-1 polypeptides (or fusion or chimeric ZTMPO-1 polypeptides) can be purified using fractionation and/or conventional purification methods and media. Ammonium sulfate precipitation and acid or chaotrope extraction may be used for fractionation of samples. Exemplary purification steps may include hydroxyapatite, size exclusion, FPLC and reverse-phase high performance liquid chromatography. Suitable chromatographic media include derivatized dextrans, agarose, cellulose, polyacrylamide, specialty silicas, and the like. PEI, DEAE, QAE and Q derivatives are preferred.
Exemplary chromatographic media include those media derivatized with phenyl, butyl, or octyl groups, such as Phenyl-Sepharose FF (Pharmacia), Toyopearl butyl 650 (Toso Haas, Montgomeryville, PA), Octyl-Sepharose (Pharmacia) and the like; or polyacrylic resins, such as Amberchrom CG
71 (Toso Haas) and the like. Suitable solid supports include glass beads, silica-based resins, cellulosic resins, agarose beads, cross-linked agarose beads, polystyrene beads, cross-linked polyacrylamide resins and the like that are insoluble under the conditions in which they are to be used. These supports may be modified with reactive groups that allow attachment of proteins by amino groups, carboxyl groups, sulfhydryl groups, hydroxyl groups and/or carbohydrate moieties. Examples of coupling chemistries include cyanogen bromide activation, N-hydroxysuccinimide activation, epoxide activation, sulfhydryl activation, hydrazide activation, and carboxyl and amino derivatives for carbodiimide coupling chemistries. These and other solid media are well known and widely used in the art, and are available from commercial suppliers. Methods for binding receptor polypeptides to support media are well known in the art.
Selection of a particular method is a matter of routine design and is determined in part by the properties of the chosen support. See, for example, Affinity Chromatography: Principles & Methods, Pharmacia LKB Biotechnology, Uppsala, Sweden, 1988.
The polypeptides of the present invention can be isolated by exploitation of their binding properties. For example, immobilized metal ion adsorption (IMAC) chromatography can be used to purify histidine-rich proteins, including those comprising polyhistidine tags.
Briefly, a gel is first charged with divalent metal ions to form a chelate (Sulkowski, Trends in Biochem. 3:1-7, 1985). Histidine-rich proteins will be adsorbed to this matrix with differing affinities, depending upon the metal ion used, and will be eluted by competitive elution, lowering the pH, or use of strong chelating agents. Other methods of purification include purification of glycosylated proteins by lectin affinity chromatography and ion exchange chromatography (Methods in Enzymol., Vol.
182, "Guide to Protein Purification", M. Deutscher, (ed.), Acad. Press, San Diego, 1990, pp.529-39). Within additional embodiments of the invention, a fusion of the polypeptide of interest and an affinity tag (e.g., Glu-Glu tag) may be constructed to facilitate purification.
ZTMPO-1 polypeptides or fragments thereof may also be prepared through chemical synthesis according to methods known in the art, including exclusive solid phase synthesis, partial solid phase methods, fragment condensation or classical solution synthesis. See, for example, Merrifield, J. Am. Chem. Soc. 85:2149, 1963.
Using methods known in the art, ZTMPO-1 polypeptides may be prepared as monomers or multimers; glycosylated or non-glycosylated; pegylated or non-pegylated; and may or may not include an initial methionine amino acid residue.
An in vivo approach for assaying proteins of the present invention involves viral delivery systems.
Exemplary viruses for this purpose include adenovirus, herpesvirus, vaccinia virus and adeno-associated virus (AAV). Adenovirus, a double-stranded DNA virus, is currently the best studied gene transfer vector for delivery of heterologous nucleic acid (for a review, see Becker et al., Meth. Cell Biol. 43:161-89, 1994; and Douglas and Curiel, Science & Medicine 4:44-53). The adenovirus system offers several advantages: adenovirus can (i) accommodate relatively large DNA inserts; (ii) be grown to high-titer; (iii) infect a broad range of mammalian cell types; and (iv) be used with a large number of available vectors containing different promoters.
Also, because adenoviruses are stable in the bloodstream, they can be administered by intravenous injection.
By deleting portions of the adenovirus genome, larger inserts (up to 7 kb) of heterologous DNA can be accommodated. These inserts may be incorporated into the viral DNA by direct ligation or by homologous recombination with a co-transfected plasmid. In an exemplary system, the essential E1 gene has been deleted from the viral vector, and the virus will not replicate unless the E1 gene is provided by the host cell (the human 293 cell line is exemplary). When intravenously administered to intact animals, adenovirus primarily targets the liver. If the adenoviral delivery system has an E1 gene deletion, the virus cannot replicate in the host cells. However, the host's tissue (e.g., liver) will express and process (and, if a secretory signal sequence is present, secrete) the heterologous protein. Secreted proteins will enter the circulation in the highly vascularized liver, and effects on the infected animal can be determined.
The adenovirus system can also be used for protein production in vitro. By culturing adenovirus-infected non-293 cells under conditions where the cells are not rapidly dividing, the cells can produce proteins for extended periods of time. For instance, BHK cells are grown to confluence in cell factories, then exposed to the adenoviral vector encoding the secreted protein of interest. The cells are then grown under serum-free conditions, which allows infected cells to survive for several weeks without significant cell division.
Alternatively, adenovirus vector infected 293S cells can be grown in suspension culture at relatively high cell density to produce significant amounts of protein (see Garnier et al., Cytotechnol. 15:145-55, 1994). With either protocol, an expressed, secreted heterologous protein can be repeatedly isolated from the cell culture supernatant. Within the infected 293S cell production protocol, non-secreted proteins may also be effectively obtained.
The broad tissue distribution of ZTMPO-1 suggests it may play a critical role in biological processes of an organism and as such altered expression of ZTMPO-1 is likely involved in numerous pathologies associated with genetic and other human disease states, in particular those related to immunological, reproductive, cardiac and muscle pathologies, such as diabetes, muscular dystrophies, hematopoietic disorders, immune disorders, leukemias, hypertension and cardiac disorders and diseases. ZTMPO-1 polypeptides, agonists and antagonists have potential in both in vitro and in vivo applications.
ZTMPO-1 is expressed ubiquitously, and many of the tissues in which it is expressed are characterized by a high rate of cellular proliferation. ZTMPO-1 polypeptides would find use as regulators of cellular proliferation and/or differentiation. Proliferation and differentiation can be measured using cultured cells or in vivo by administering molecules of the present invention to the appropriate animal model. Suitable cultured cells include, but are not limited to, testicular, muscle, lymphatic and tumor cell lines, which are all readily available to one skilled in the art from such sources as American Type Culture Collection, Rockville, MD. In particular, proliferation can be measured using cultured cardiac cells or in vivo by administering molecules of the present invention to the appropriate animal model. Generally, proliferative effects are seen as an increase in cell number, and may include inhibition of apoptosis as well as stimulation of mitogenesis. Cultured cells for use in these assays include cardiac fibroblasts, cardiac myocytes, skeletal myocytes, and human umbilical vein endothelial cells from primary cultures. Suitable established cell lines include: NIH 3T3 fibroblasts (ATCC No. CRL-1658), CHH-1 chum heart cells (ATCC No. CRL-1680), H9c2 rat heart myoblasts (ATCC No. CRL-1446), Shionogi mammary carcinoma cells (Tanaka et al., Proc. Natl. Acad. Sci. 89:8928-32, 1992), and LNCaP.FGC adenocarcinoma cells (ATCC No. CRL-1740). Cultured testicular cells include dolphin DB1.Tes cells (ATCC No. CRL-6258); mouse GC-1 spg cells (ATCC No.
CRL-2053); TM3 cells (ATCC No. CRL-1714); TM4 cells (ATCC
No. CRL-1715); and pig ST cells (ATCC No. CRL-1746).
Other suitable cell lines include mouse skeletal muscle cells (ATCC No. CRL-2174), human muscle cells (ATCC No. CRL-7522), Raji (Burkitt's human lymphoma, ATCC No. CCL-86), Ramos (Burkitt's lymphoma cell line, ATCC No. CRL-1596), Daudi (Burkitt's human lymphoma, ATCC No. CCL-213) and RPMI 1788 (a B lymphocyte cell line, ATCC No. CCL-156), all available from American Type Culture Collection, 10801 University Boulevard, Manassas, VA 20110-2209. Assays measuring cell proliferation are well known in the art. For example, assays measuring proliferation include chemosensitivity to neutral red dye (Cavanaugh et al., Investigational New Drugs 8:347-54, 1990), incorporation of radiolabelled nucleotides (Cook et al., Analytical Biochem. 179:1-7, 1989), incorporation of 5-bromo-2'-deoxyuridine (BrdU) in the DNA of proliferating cells (Porstmann et al., J. Immunol. Methods 82:169-79, 1985), and use of tetrazolium salts (Mosmann, J. Immunol. Methods 65:55-63, 1983; Alley et al., Cancer Res. 48:589-601, 1988; Marshall et al., Growth Reg. 5:69-84, 1995; and Scudiero et al., Cancer Res. 48:4827-33, 1988).
Additional methods can be found in the art, for example, Current Protocols in Molecular Biology, John Wiley and Sons, Inc., NY, 1997.
Assays measuring differentiation include, for example, measuring cell-surface markers associated with stage-specific expression of a tissue, enzymatic activity, functional activity or morphological changes (Watt, FASEB, 5:281-4, 1991; Francis, Differentiation 57:63-75, 1994;
Raes, Adv. Anim. Cell Biol. Technol. Bioprocesses, 161-71, 1989). Bioassays and ELISAs are available to measure cellular response to ZTMPO-1, in particular those that measure changes in cytokine production as a measure of cellular response (see, for example, Current Protocols in Immunology, ed. John E. Coligan et al., NIH, 1996).
In vivo assays for evaluating cardiac neogenesis or hyperplasia include treating neonatal and mature rats with the molecules of the present invention. The animals' cardiac function is measured as heart rate, blood pressure, and cardiac output to determine left ventricular function. Post-mortem methods for assessing cardiac decline or improvement include: increased or decreased cardiac weight, nuclei/cytoplasmic volume, and staining of cardiac histology sections to determine proliferating cell nuclear antigen (PCNA) vs. cytoplasmic actin levels (Quaini et al., Circulation Res. 75:1050-63, 1994 and Reiss et al., Proc. Natl. Acad. Sci. 93:8630-5, 1996).
Cardiac defects related to conduction have been reported in patients having a deleted emerin gene (Emery, J. Med. Genet. 26:637-41, 1989). The resulting cardiac conduction defect is life threatening in these patients.
Defects in the intrinsic conduction system can cause irregularities in the heart rhythm, such as arrhythmia and fibrillation. Tissue distribution and sequence similarities between emerin and ZTMPO-1 suggest that ZTMPO-1 may be involved in re-polarization of cardiac cell membranes. Localization of emerin to the desmosomes and fasciae adherentes suggests that association with the connection between epithelial cells accounts for the cardiac conduction defect when the gene is absent.
ZTMPO-1 polypeptides and antagonists may influence cell-cell communication, either independently, or in conjunction with other proteins, such as emerin, and may regulate messages between cell membranes. To verify the presence of this capability in ZTMPO-1 polypeptides, agonists or antagonists of the present invention, such ZTMPO-1 polypeptides, agonists or antagonists are evaluated with respect to their ability to modulate cardiac conductance according to procedures known in the art. If desired, ZTMPO-1 polypeptide performance in this regard can be compared to emerin and may be evaluated in combination with emerin to identify synergistic effects.
With respect to cardiac conductance, a resulting increase or decrease is measured by assessing voltage-dependent conductance, or sodium or calcium ion flux, in an appropriate assay system known in the art. Changes in the voltage conductance or in indicator substrates reflect the activities of ZTMPO-1 polypeptides in enhancing or inhibiting cardiac conductance relative to a control not subjected to treatment. An electrocardiograph is used to monitor the electrical currents generated and transmitted through the heart. Changes in the electrocardiogram (ECG) tracing (wave pattern and/or timing) would indicate an alteration in the heart's conduction system. Therefore a return to a normal ECG pattern following ZTMPO-1 administration would indicate a re-establishment of a regular heart rhythm.
The invention also provides isolated and purified ZTMPO-1 polynucleotide probes or primers. Such polynucleotide probes can be RNA or DNA. DNA can be either cDNA or genomic DNA. Polynucleotide probes are single or double-stranded DNA or RNA and will generally comprise at least 16 nucleotides, more often from 17 nucleotides to 25 or more nucleotides, sometimes 40 to 60 nucleotides, and in some instances a substantial portion, domain or even the entire ZTMPO-1 gene or cDNA. Probes and primers are generally synthetic oligonucleotides, but may be generated from cloned cDNA or genomic sequences or their complements.
Analytical probes will generally be at least 20 nucleotides in length, although somewhat shorter probes (14-17 nucleotides) can be used. PCR primers are at least 5 nucleotides in length, preferably 15 or more nucleotides, more preferably 20-30 nucleotides. Short polynucleotides can be used when a small region of the gene is targeted for analysis. For gross analysis of genes, a polynucleotide probe may comprise an entire exon or more. Probes can be labeled to provide a detectable signal, such as with an enzyme, biotin, a radionuclide, fluorophore, chemiluminescer, paramagnetic particle and the like, which are commercially available from many sources, such as Molecular Probes, Inc., Eugene, OR, and Amersham Corp., Arlington Heights, IL, using techniques that are well known in the art. Preferred regions from which to construct probes include regions of homology with other thymopoietins and emerin as described herein, the ankyrin-like region, the calcium binding protein-like region, the signal sequence, and the like. Techniques for developing polynucleotide probes and hybridization techniques are known in the art; see, for example, Ausubel et al., eds., Current Protocols in Molecular Biology, John Wiley and Sons, Inc., NY, 1991.
ZTMPO-1 polypeptides may be used within diagnostic systems to detect the presence of ZTMPO-1. The information derived from such detection methods would provide insight into the significance of ZTMPO-1 polypeptides in various diseases, and would serve as diagnostic tools for diseases for which altered levels of ZTMPO-1 are significant. Altered levels of ZTMPO-1 receptor polypeptides may be indicative of pathological conditions including cancer, cardiac and autoimmune disorders and infectious diseases.
In a basic assay, a single-stranded probe molecule is incubated with RNA, isolated from a biological sample, under conditions of temperature and ionic strength that promote base pairing between the probe and target ZTMPO-1 RNA species. After separating unbound probe from hybridized molecules, the amount of hybrids is detected.
Well-established hybridization methods of RNA detection include northern analysis and dot/slot blot hybridization (see, for example, Ausubel ibid. and Wu et al. (eds.), "Analysis of Gene Expression at the RNA Level," in Methods in Gene Biotechnology, pages 225-239 (CRC Press, Inc. 1997)). Nucleic acid probes can be detectably labeled with radioisotopes such as 32P or 35S.
Alternatively, ZTMPO-1 RNA can be detected with a nonradioactive hybridization method (see, for example, Isaac (ed.), Protocols for Nucleic Acid Analysis by Nonradioactive Probes, Humana Press, Inc., 1993).
Typically, nonradioactive detection is achieved by enzymatic conversion of chromogenic or chemiluminescent substrates. Illustrative nonradioactive moieties include biotin, fluorescein, and digoxigenin.
ZTMPO-1 oligonucleotide probes are also useful for in vivo diagnosis. As an illustration, 18F-labeled oligonucleotides can be administered to a subject and visualized by positron emission tomography (Tavitian et al., Nature Medicine 4:467, 1998).
Numerous diagnostic procedures take advantage of the polymerase chain reaction (PCR) to increase sensitivity of detection methods. Standard techniques for performing PCR are well-known (see, generally, Mathew (ed.), Protocols in Human Molecular Genetics (Humana Press, Inc. 1991), White (ed.), PCR Protocols: Current Methods and Applications (Humana Press, Inc. 1993), Cotter (ed.), Molecular Diagnosis of Cancer (Humana Press, Inc. 1996), Hanausek and Walaszek (eds.), Tumor Marker Protocols (Humana Press, Inc. 1998), Lo (ed.), Clinical Applications of PCR (Humana Press, Inc. 1998), and Meltzer (ed.), PCR in Bioanalysis (Humana Press, Inc. 1998)).
PCR primers can be designed to amplify a sequence encoding a particular ZTMPO-1 domain or region of homology as described herein.
One variation of PCR for diagnostic assays is reverse transcriptase-PCR (RT-PCR). In the RT-PCR
technique, RNA is isolated from a biological sample, reverse transcribed to cDNA, and the cDNA is incubated with ZTMPO-1 primers (see, for example, Wu et al. (eds.), "Rapid Isolation of Specific cDNAs or Genes by PCR," in Methods in Gene Biotechnology, CRC Press, Inc., pages 15-28, 1997). PCR is then performed and the products are analyzed using standard techniques.
As an illustration, RNA is isolated from a biological sample using, for example, the guanidinium-thiocyanate cell lysis procedure described above.
Alternatively, a solid-phase technique can be used to isolate mRNA from a cell lysate. A reverse transcription reaction can be primed with the isolated RNA using random oligonucleotides, short homopolymers of dT, or ZTMPO-1 anti-sense oligomers. Oligo-dT primers offer the advantage that various mRNA nucleotide sequences are amplified that can provide control target sequences.
ZTMPO-1 sequences are amplified by the polymerase chain reaction using two flanking oligonucleotide primers that are typically at least 5 bases in length.
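The relationship between two flanking primers and the amplified product can be illustrated with a minimal sketch. The function names, the template, and both primer sequences below are hypothetical and chosen only for illustration; they are not ZTMPO-1 sequences, and exact string matching is a simplifying assumption that ignores mismatches, melting temperature and other primer design criteria.

```python
# Minimal in-silico sketch: locate the product bounded by two flanking primers.
# All sequences are hypothetical and are NOT taken from SEQ ID NO:1.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def predict_amplicon(template, forward, reverse, min_primer_len=5):
    """Return the predicted product on the sense strand, or None.

    The forward primer must match the template exactly; the reverse
    primer must match the opposite strand downstream of the forward
    primer. min_primer_len mirrors the "at least 5" figure in the text.
    """
    template, forward, reverse = template.upper(), forward.upper(), reverse.upper()
    if len(forward) < min_primer_len or len(reverse) < min_primer_len:
        raise ValueError("primers shorter than the minimum length")
    start = template.find(forward)
    if start < 0:
        return None
    # The reverse primer anneals where its reverse complement appears
    # on the sense strand, downstream of the forward primer.
    site = reverse_complement(reverse)
    end = template.find(site, start + len(forward))
    if end < 0:
        return None
    return template[start:end + len(site)]


if __name__ == "__main__":
    demo_template = "GGATCCATGAAACCCGGGTTTTAGCTCGAG"  # hypothetical
    print(predict_amplicon(demo_template, "ATGAAA", "CTAAAA"))
```

A return value of None indicates that one of the primers has no exact binding site in the orientation required for amplification.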
PCR amplification products can be detected using a variety of approaches. For example, PCR products can be fractionated by gel electrophoresis, and visualized by ethidium bromide staining. Alternatively, fractionated PCR products can be transferred to a membrane, hybridized with a detectably-labeled ZTMPO-1 probe, and examined by autoradiography. Additional alternative approaches include the use of digoxigenin-labeled deoxyribonucleic acid triphosphates to provide chemiluminescence detection, and the C-TRAK colorimetric assay.
Another approach is real time quantitative PCR
(Perkin-Elmer Cetus, Norwalk, Ct.). A fluorogenic probe, consisting of an oligonucleotide with both a reporter and a quencher dye attached, anneals specifically between the forward and reverse primers. Using the 5' endonuclease activity of Taq DNA polymerase, the reporter dye is separated from the quencher dye and a sequence-specific signal is generated and increases as amplification increases. The fluorescence intensity can be continuously monitored and quantified during the PCR reaction.
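As a hedged illustration of how continuously monitored fluorescence can be reduced to a single number, the following sketch finds the fractional cycle at which the signal first crosses a chosen threshold. The threshold-crossing readout, the function name, and the example readings are assumptions introduced here for illustration; the text above does not specify how the fluorescence data are analyzed.

```python
# Minimal sketch, assuming a threshold-crossing readout of per-cycle
# fluorescence. The readings and the threshold are hypothetical values.

def threshold_cycle(fluorescence, threshold):
    """Return the fractional cycle at which the signal first rises to
    the threshold, using linear interpolation between cycles.

    fluorescence[0] is the reading after cycle 1. Returns None when
    the signal never crosses the threshold from below.
    """
    for i in range(1, len(fluorescence)):
        prev, curr = fluorescence[i - 1], fluorescence[i]
        if prev < threshold <= curr:
            # prev was read at cycle i, curr at cycle i + 1
            return i + (threshold - prev) / (curr - prev)
    return None


if __name__ == "__main__":
    readings = [0.0, 0.0, 1.0, 3.0, 7.0]  # hypothetical signal
    print(threshold_cycle(readings, threshold=2.0))
```

Under this assumption, a sample with more starting template crosses the threshold at an earlier cycle than a sample with less.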
Another approach for detection of ZTMPO-1 expression is cycling probe technology (CPT), in which a single-stranded DNA target binds with an excess of DNA-RNA-DNA chimeric probe to form a complex, the RNA portion is cleaved with RNase H, and the presence of cleaved chimeric probe is detected (see, for example, Beggs et al., J. Clin. Microbiol. 34:2985, 1996 and Bekkaoui et al., Biotechniques 20:240, 1996). Alternative methods for detection of ZTMPO-1 sequences can utilize approaches such as nucleic acid sequence-based amplification (NASBA), cooperative amplification of templates by cross-hybridization (CATCH), and the ligase chain reaction (LCR) (see, for example, Marshall et al., U.S. Patent No.
5,686,272 (1997), Dyer et al., J. Virol. Methods 60:161, 1996; Ehricht et al., Eur. J. Biochem. 243:358, 1997 and Chadwick et al., J. Virol. Methods 70:59, 1998). Other standard methods are known to those of skill in the art.
ZTMPO-1 probes and primers can also be used to detect and to localize ZTMPO-1 gene expression in tissue samples. Methods for such in situ hybridization are well known to those of skill in the art (see, for example, Choo (ed.), In Situ Hybridization Protocols, Humana Press, Inc., 1999; Wu et al. (eds.), "Analysis of Cellular DNA or Abundance of mRNA by Radioactive In Situ Hybridization (RISH)," in Methods in Gene Biotechnology, CRC Press, Inc., pages 259-278, 1997 and Wu et al. (eds.), "Localization of DNA or Abundance of mRNA by Fluorescence In Situ Hybridization (FISH)," in Methods in Gene Biotechnology, CRC Press, Inc., pages 279-289, 1997).
Various additional diagnostic approaches are well-known to those of skill in the art (see, for example, Mathew (ed.), Protocols in Human Molecular Genetics, Humana Press, Inc., 1991; Coleman and Tsongalis, Molecular Diagnostics, Humana Press, Inc., 1996 and Elles, Molecular Diagnosis of Genetic Diseases, Humana Press, Inc., 1996).
The invention also provides antagonists or inhibitors of ZTMPO-1 activity. Such antagonists would include anti-ZTMPO-1 antibodies, soluble ZTMPO-1 receptors, as well as other peptidic and non-peptidic agents (including ribozymes). Such antagonists would have use as research reagents for characterizing sites of ligand-receptor interaction. Antagonists would also find use in modulating cellular proliferation and differentiation such as in tumor growth and development.
High levels of expression of ZTMPO-1 in testis tissue suggest a role in spermatogenesis. These ZTMPO-1 antagonists would be useful for inhibiting spermatogenesis and sperm activation. Such ZTMPO-1 antagonists can be used for contraception in humans and animals, and in particular, domestic and zoological animals and livestock, where they would act to prevent fertilization of an egg.
Such ZTMPO-1 antagonists could be used, for instance, in place of surgical forms of contraception (such as spaying and neutering), and would allow for the possibility of future breeding of treated animals if desired. ZTMPO-1 antagonists could also be used to mediate immune response, for instance by boosting the humoral response in individuals at risk for an infectious disease or as a supplement to vaccination.
ZTMPO-1 can be used to identify inhibitors (antagonists) of its activity. Test compounds are transfected into cells or possibly added to the assays disclosed herein to identify compounds that inhibit the activity of ZTMPO-1. In addition to those assays disclosed herein, samples can be tested for inhibition of ZTMPO-1 activity within a variety of assays designed to measure receptor binding or the stimulation/inhibition of ZTMPO-1-dependent cellular responses. For example, ZTMPO-1-responsive cell lines can be transfected with a reporter gene construct that is responsive to a ZTMPO-1-stimulated cellular pathway. Reporter gene constructs of this type are known in the art, and will generally comprise a ZTMPO-1-DNA response element operably linked to a gene encoding an assayable protein, such as luciferase. DNA response elements can include, but are not limited to, cyclic AMP
response elements (CRE), hormone response elements (HRE), insulin response element (IRE) (Nasrin et al., Proc. Natl.
Acad. Sci. USA 87:5273-7, 1990) and serum response elements (SRE) (Shaw et al. Cell 56: 563-72, 1989).
Cyclic AMP response elements are reviewed in Roestler et al., J. Biol. Chem. 263: 9063-6; 1988 and Habener, Molec.
Endocrinol. 4:1087-94; 1990. Hormone response elements are reviewed in Beato, Cell 56:335-44; 1989. Candidate compounds, solutions, mixtures or extracts are tested for the ability to inhibit the activity of ZTMPO-1 on the target cells as evidenced by a decrease in ZTMPO-1 stimulation of reporter gene expression. Assays of this type will detect compounds that directly block ZTMPO-1 binding to cell-surface receptors, as well as compounds that block processes in the cellular pathway subsequent to receptor-ligand binding. In the alternative, compounds or other samples can be tested for direct blocking of ZTMPO-1 binding to receptor using ZTMPO-1 tagged with a detectable label (e.g., 125I, biotin, horseradish peroxidase, FITC, and the like). Within assays of this type, the ability of a test sample to inhibit the binding of labeled ZTMPO-1 to the receptor is indicative of inhibitory activity, which can be confirmed through secondary assays. Receptors used within binding assays may be cellular receptors or isolated, immobilized receptors.
ZTMPO-1 polypeptides can also be used to prepare antibodies that specifically bind to ZTMPO-1 epitopes, peptides or polypeptides. The ZTMPO-1 polypeptide or a fragment thereof serves as an antigen (immunogen) to inoculate an animal and elicit an immune response.
Suitable antigens would be the ZTMPO-1 polypeptide encoded by SEQ ID NO:2 from amino acid number 1 to amino acid number 876, or contiguous 9 to 25 amino acid residue fragments thereof. Antibodies generated from this immune response can be isolated and purified as described herein.
Methods for preparing and isolating polyclonal and monoclonal antibodies are well known in the art. See, for example, Current Protocols in Immunology, Coligan, et al. (eds.), National Institutes of Health, John Wiley and Sons, Inc., 1995; Sambrook et al., Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, NY, 1989; and Hurrell, (Ed.), Monoclonal Hybridoma Antibodies: Techniques and Applications, CRC Press, Inc., Boca Raton, FL, 1982.
As would be evident to one of ordinary skill in the art, polyclonal antibodies can be generated from inoculating a variety of warm-blooded animals such as horses, cows, goats, sheep, dogs, chickens, rabbits, mice, and rats with a ZTMPO-1 polypeptide or a fragment thereof.
The immunogenicity of a ZTMPO-1 polypeptide may be increased through the use of an adjuvant, such as alum (aluminum hydroxide) or Freund's complete or incomplete adjuvant. Polypeptides useful for immunization also include fusion polypeptides, such as fusions of ZTMPO-1 or a portion thereof with an immunoglobulin polypeptide or with maltose binding protein. The polypeptide immunogen may be a full-length molecule or a portion thereof. If the polypeptide portion is "hapten-like", such portion may be advantageously joined or linked to a macromolecular carrier (such as keyhole limpet hemocyanin (KLH), bovine serum albumin (BSA) or tetanus toxoid) for immunization.
As used herein, the term "antibodies" includes polyclonal antibodies, affinity-purified polyclonal antibodies, monoclonal antibodies, and antigen-binding fragments, such as F(ab')2 and Fab proteolytic fragments.
Genetically engineered intact antibodies or fragments, such as chimeric antibodies, Fv fragments, single chain antibodies and the like, as well as synthetic antigen-binding peptides and polypeptides, are also included.
Non-human antibodies may be humanized by grafting non-human CDRs onto human framework and constant regions, or by incorporating the entire non-human variable domains (optionally "cloaking" them with a human-like surface by replacement of exposed residues, wherein the result is a "veneered" antibody). In some instances, humanized antibodies may retain non-human residues within the human variable region framework domains to enhance proper binding characteristics. Through humanizing antibodies, biological half-life may be increased, and the potential for adverse immune reactions upon administration to humans is reduced.
Alternative techniques for generating or selecting antibodies useful herein include in vitro exposure of lymphocytes to ZTMPO-1 protein or peptide, and selection of antibody display libraries in phage or similar vectors (for instance, through use of immobilized or labeled ZTMPO-1 protein or peptide). Genes encoding polypeptides having potential ZTMPO-1 polypeptide binding domains can be obtained by screening random peptide libraries displayed on phage (phage display) or on bacteria, such as E. coli. Nucleotide sequences encoding the polypeptides can be obtained in a number of ways, such as through random mutagenesis and random polynucleotide synthesis. These random peptide display libraries can be used to screen for peptides which interact with a known target which can be a protein or polypeptide, such as a ligand or receptor, a biological or synthetic macromolecule, or organic or inorganic substances.
Techniques for creating and screening such random peptide display libraries are known in the art (Ladner et al., US Patent No. 5,223,409; Ladner et al., US Patent No. 4,946,778; Ladner et al., US Patent No. 5,403,484 and Ladner et al., US Patent No. 5,571,698) and random peptide display libraries and kits for screening such libraries are available commercially, for instance from Clontech (Palo Alto, CA), Invitrogen Inc. (San Diego, CA), New England Biolabs, Inc. (Beverly, MA) and Pharmacia LKB
Biotechnology Inc. (Piscataway, NJ). Random peptide display libraries can be screened using the ZTMPO-1 sequences disclosed herein to identify proteins which bind to ZTMPO-1. These "binding proteins" which interact with ZTMPO-1 polypeptides can be used for tagging cells; for isolating homolog polypeptides by affinity purification;
they can be directly or indirectly conjugated to drugs, toxins, radionuclides and the like. These binding proteins can also be used in analytical methods such as for screening expression libraries and neutralizing activity. The binding proteins can also be used for diagnostic assays for determining circulating levels of polypeptides; for detecting or quantitating soluble polypeptides as marker of underlying pathology or disease.
These binding proteins can also act as ZTMPO-1 "antagonists" to block ZTMPO-1 binding and signal transduction in vitro and in vivo. These anti-ZTMPO-1 binding proteins would be useful for inhibiting binding.
Antibodies are determined to be specifically binding if: 1) they exhibit a threshold level of binding activity, and/or 2) they do not significantly cross-react with related polypeptide molecules. First, antibodies herein specifically bind if they bind to a ZTMPO-1 polypeptide, peptide or epitope with a binding affinity (Ka) of 10⁶ M⁻¹ or greater, preferably 10⁷ M⁻¹ or greater, more preferably 10⁸ M⁻¹ or greater, and most preferably 10⁹ M⁻¹ or greater. The binding affinity of an antibody can be readily determined by one of ordinary skill in the art, for example, by Scatchard analysis (Scatchard, Ann.
NY Acad. Sci. 51: 660-72, 1949).
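By way of illustration only, a Scatchard analysis can be sketched as a straight-line fit of bound/free ligand against bound ligand, where the slope equals -Ka and the x-intercept equals the total concentration of binding sites. The function names and the simulated concentrations below are hypothetical, and the sketch assumes a single class of non-interacting binding sites at equilibrium.

```python
# Minimal Scatchard sketch, assuming one class of independent sites.
# B/F = Ka * (Bmax - B), so a plot of B/F against B has slope -Ka.
# All concentrations are hypothetical and expressed in mol/L.

def simulate_binding(ka, bmax, free_concs):
    """Bound ligand at equilibrium for each free ligand concentration."""
    return [bmax * ka * f / (1.0 + ka * f) for f in free_concs]


def scatchard_fit(bound, free):
    """Least-squares line through (B, B/F); returns (Ka, Bmax)."""
    xs = list(bound)
    ys = [b / f for b, f in zip(bound, free)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    ka = -slope              # association constant, M^-1
    bmax = intercept / ka    # total binding-site concentration, M
    return ka, bmax


def meets_affinity_threshold(ka, threshold=1e6):
    """True when Ka is at or above the stated minimum of 10^6 M^-1."""
    return ka >= threshold


if __name__ == "__main__":
    free = [1e-9, 5e-9, 1e-8, 5e-8, 1e-7]
    bound = simulate_binding(ka=1e8, bmax=2e-9, free_concs=free)
    ka_fit, bmax_fit = scatchard_fit(bound, free)
    print(ka_fit, bmax_fit, meets_affinity_threshold(ka_fit))
```

With experimental rather than simulated data the points scatter about the line, and curvature in the plot suggests that the single-site assumption does not hold.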
Second, antibodies are determined to specifically bind if they do not significantly cross-react with related polypeptides. Antibodies do not significantly cross-react with related polypeptide molecules, for example, if they detect ZTMPO-1 but not known related polypeptides using a standard Western blot analysis (Ausubel et al., ibid.). Examples of known related polypeptides are those disclosed in the prior art, such as known orthologs and paralogs, and similar known members of a protein family. Moreover, antibodies may be "screened against" known related polypeptides, such as non-human ZTMPO-1 and ZTMPO-1 mutant polypeptides, to isolate a population that specifically binds to the inventive polypeptides. For example, antibodies raised to ZTMPO-1 are adsorbed to related polypeptides adhered to an insoluble matrix; antibodies specific to ZTMPO-1 will flow through the matrix under the proper buffer conditions.
Such screening allows isolation of polyclonal and monoclonal antibodies non-crossreactive to closely related polypeptides (Antibodies: A Laboratory Manual, Harlow and Lane (eds.), Cold Spring Harbor Laboratory Press, 1988;
Current Protocols in Immunology, Coligan, et al. (eds.), National Institutes of Health, John Wiley and Sons, Inc., 1995). Screening and isolation of specific antibodies is well known in the art. See, Fundamental Immunology, Paul (eds.), Raven Press, 1993; Getzoff et al., Adv. in Immunol. 43: 1-98, 1988; Monoclonal Antibodies:
Principles and Practice, Goding, J.W. (eds.), Academic Press Ltd., 1996; Benjamin et al., Ann. Rev. Immunol. 2:
67-101, 1984.
A variety of assays known to those skilled in the art can be utilized to detect antibodies and binding proteins which specifically bind to ZTMPO-1 proteins or peptides. Exemplary assays are described in detail in Antibodies: A Laboratory Manual, Harlow and Lane (Eds.), Cold Spring Harbor Laboratory Press, 1988. Representative examples of such assays include: concurrent immunoelectrophoresis, radioimmunoassay, radioimmuno-precipitation, enzyme-linked immunosorbent assay (ELISA), dot blot or Western blot assay, inhibition or competition assay, and sandwich assay. In addition, antibodies can be screened for binding to wild-type versus mutant ZTMPO-1 protein or polypeptide.
Antibodies to ZTMPO-1 may be used for tagging cells that express ZTMPO-1; for isolating ZTMPO-1 by affinity purification; for diagnostic assays for determining circulating levels of ZTMPO-1 polypeptides; for detecting or quantitating soluble ZTMPO-1 as a marker of underlying pathology or disease; in analytical methods employing FACS; for screening expression libraries; for generating anti-idiotypic antibodies; and as neutralizing antibodies or as antagonists to block ZTMPO-1 binding in vitro and in vivo. Suitable direct tags or labels include radionuclides, enzymes, substrates, cofactors, inhibitors, fluorescent markers, chemiluminescent markers, magnetic particles and the like; indirect tags or labels may feature use of biotin-avidin or other complement/anti-complement pairs as intermediates. Antibodies herein may also be directly or indirectly conjugated to drugs, toxins, radionuclides and the like, and these conjugates used for in vivo diagnostic or therapeutic applications.
Moreover, antibodies to ZTMPO-1 or fragments thereof may be used in vitro to detect denatured ZTMPO-1 or fragments thereof in assays, for example, Western Blots or other assays known in the art.
Antibodies or polypeptides herein may also be directly or indirectly conjugated to drugs, toxins, radionuclides and the like, and these conjugates used for in vivo diagnostic or therapeutic applications. For instance, polypeptides or antibodies of the present invention may be used to identify or treat tissues or organs that express a corresponding anti-complementary molecule (receptor or antigen, respectively, for instance). More specifically, ZTMPO-1 polypeptides or anti-ZTMPO-1 antibodies, or bioactive fragments or portions thereof, can be coupled to detectable or cytotoxic molecules and delivered to a mammal having cells, tissues or organs that express the anti-complementary molecule.
Suitable detectable molecules may be directly or indirectly attached to the polypeptide or antibody, and include radionuclides, enzymes, substrates, cofactors, inhibitors, fluorescent markers, chemiluminescent markers, magnetic particles and the like. Suitable cytotoxic molecules may be directly or indirectly attached to the polypeptide or antibody, and include bacterial or plant toxins (for instance, diphtheria toxin, Pseudomonas exotoxin, ricin, abrin and the like), as well as therapeutic radionuclides, such as iodine-131, rhenium-188 or yttrium-90 (either directly attached to the polypeptide or antibody, or indirectly attached through means of a chelating moiety, for instance). Polypeptides or antibodies may also be conjugated to cytotoxic drugs, such as adriamycin. For indirect attachment of a detectable or cytotoxic molecule, the detectable or cytotoxic molecule may be conjugated with a member of a complementary/
anticomplementary pair, where the other member is bound to the polypeptide or antibody portion. For these purposes, biotin/streptavidin is an exemplary complementary/
anticomplementary pair.
Molecules of the present invention can be used to identify and isolate receptors involved in ZTMPO-1 binding. For example, proteins and peptides of the present invention can be immobilized on a column and membrane preparations run over the column (Immobilized Affinity Ligand Techniques, Hermanson et al., eds., Academic Press, San Diego, CA, 1992, pp.195-202).
Proteins and peptides can also be radiolabeled (Methods in Enzymol., vol. 182, "Guide to Protein Purification", M.
Deutscher, ed., Acad. Press, San Diego, 1990, 721-37) or photoaffinity labeled (Brunner et al., Ann. Rev. Biochem.
62:483-514, 1993 and Fedan et al., Biochem. Pharmacol.
33:1167-80, 1984) and specific cell-surface proteins can be identified.
The molecules of the present invention will be useful regulators in multicellular organisms. The molecules of the present invention may be used to modulate cellular proliferation and differentiation, for example spermatogenesis. In particular, certain proliferative disorders such as cancers may be amenable to such diagnosis, treatment or prevention. ZTMPO-1 would be useful in modulating the cell cycle such as during differentiation or in rapidly proliferating cells such as in tumor tissues. ZTMPO-1 would find application in a diverse array of tissues, such as testis, skeletal muscle, thyroid and adrenal gland, for example.
Polynucleotides encoding ZTMPO-1 polypeptides are useful within gene therapy applications where it is desired to increase or inhibit ZTMPO-1 activity. If a mammal has a mutated or absent ZTMPO-1 gene, the ZTMPO-1 gene can be introduced into the cells of the mammal. In one embodiment, a gene encoding a ZTMPO-1 polypeptide is introduced in vivo in a viral vector. Such vectors include an attenuated or defective DNA virus, such as, but not limited to, herpes simplex virus (HSV), papillomavirus, Epstein Barr virus (EBV), adenovirus, adeno-associated virus (AAV), and the like. Defective viruses, which entirely or almost entirely lack viral genes, are preferred. A defective virus is not infective after introduction into a cell. Use of defective viral vectors allows for administration to cells in a specific, localized area, without concern that the vector can infect other cells. Examples of particular vectors include, but are not limited to, a defective herpes simplex virus 1 (HSV1) vector (Kaplitt et al., Molec. Cell. Neurosci.
2:320-30, 1991); an attenuated adenovirus vector, such as the vector described by Stratford-Perricaudet et al., J.
Clin. Invest. 90:626-30, 1992; and a defective adeno-associated virus vector (Samulski et al., J. Virol. 61:3096-101, 1987; Samulski et al., J. Virol. 63:3822-8, 1989).
In another embodiment, a ZTMPO-1 gene can be introduced in a retroviral vector, e.g., as described in Anderson et al., U.S. Patent No. 5,399,346; Mann et al.
Cell 33:153, 1983; Temin et al., U.S. Patent No.
4,650,764; Temin et al., U.S. Patent No. 4,980,289;
Markowitz et al., J. Virol. 62:1120, 1988; Temin et al., U.S. Patent No. 5,124,263; International Patent Publication No. WO 95/07358, published March 16, 1995 by Dougherty et al.; and Kuo et al., Blood 82:845, 1993.
Alternatively, the vector can be introduced by lipofection in vivo using liposomes. Synthetic cationic lipids can be used to prepare liposomes for in vivo transfection of a gene encoding a marker (Felgner et al., Proc. Natl. Acad.
Sci. USA 84:7413-7, 1987; Mackey et al., Proc. Natl. Acad.
Sci. USA 85:8027-31, 1988). The use of lipofection to introduce exogenous genes into specific organs in vivo has certain practical advantages. Molecular targeting of liposomes to specific cells represents one area of benefit. For instance, directing transfection to particular cell types would be particularly advantageous in a tissue with cellular heterogeneity, such as the pancreas, liver, kidney, and brain. Lipids may be chemically coupled to other molecules for the purpose of targeting. Targeted peptides (e.g., hormones or neurotransmitters), proteins such as antibodies, or non-peptide molecules can be coupled to liposomes chemically.
It is possible to remove the target cells from the body; to introduce the vector as a naked DNA plasmid;
and then to re-implant the transformed cells into the body. Naked DNA vectors for gene therapy can be introduced into the desired host cells by methods known in the art, e.g., transfection, electroporation, microinjection, transduction, cell fusion, DEAE dextran, calcium phosphate precipitation, use of a gene gun or use of a DNA vector transporter. See, e.g., Wu et al., J.
Biol. Chem. 267:963-7, 1992; Wu et al., J. Biol. Chem.
263:14621-4, 1988.
The present invention also provides reagents for use in diagnostic applications. For example, the ZTMPO-1 gene, a probe comprising ZTMPO-1 DNA or RNA, or a subsequence thereof can be used to determine if the ZTMPO-1 gene is present on chromosome 12 or if a mutation has occurred. The emerin gene is not detected in samples from patients with Emery-Dreifuss muscular dystrophy, and is present in normal patients (Bione et al., Nat. Genet.
8:323-7, 1994 and Nagano et al., Nat. Genet. 12:254-9, 1996) and thus serves as a marker for the disease.
Detectable chromosomal aberrations at the ZTMPO-1 gene locus include, but are not limited to, aneuploidy, gene copy number changes, insertions, deletions, restriction site changes and rearrangements. These aberrations can occur within the coding sequence, within introns, or within flanking sequences, including upstream promoter and regulatory regions, and may be manifested as physical alterations within a coding sequence or changes in gene expression level.
In general, these diagnostic methods comprise the steps of (a) obtaining a genetic sample from a patient; (b) incubating the genetic sample with a polynucleotide probe or primer as disclosed above, under conditions wherein the polynucleotide will hybridize to complementary polynucleotide sequence, to produce a first reaction product; and (c) comparing the first reaction product to a control reaction product. A difference between the first reaction product and the control reaction product is indicative of a genetic abnormality in the patient. Genetic samples for use within the present invention include genomic DNA, cDNA, and RNA. The polynucleotide probe or primer can be RNA or DNA, and will comprise a portion of SEQ ID NO:1, the complement of SEQ ID NO:1, or an RNA equivalent thereof. Suitable assay methods in this regard include molecular genetic techniques known to those in the art, such as restriction fragment length polymorphism (RFLP) analysis, short tandem repeat (STR) analysis employing PCR techniques, ligation chain reaction (Barany, PCR Methods and Applications 1:5-16, 1991), ribonuclease protection assays, and other genetic linkage analysis techniques known in the art (Sambrook et al., ibid.; Ausubel et al., ibid.; Marian, Chest 108:255-65, 1995). Ribonuclease protection assays (see, e.g., Ausubel et al., ibid., ch. 4) comprise the hybridization of an RNA probe to a patient RNA sample, after which the reaction product (RNA-RNA hybrid) is exposed to RNase. Hybridized regions of the RNA are protected from digestion. Within PCR assays, a patient's genetic sample is incubated with a pair of polynucleotide primers, and the region between the primers is amplified and recovered. Changes in size or amount of recovered product are indicative of mutations in the patient.
Another PCR-based technique that can be employed is single strand conformational polymorphism (SSCP) analysis (Hayashi, PCR Methods and Applications 1:34-8, 1991).
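By way of illustration only, the comparison of a patient PCR product with a control product (step (c) above) could be sketched as follows. This is a minimal sketch and not part of the disclosed method: the `PcrProduct` record, the function name and both tolerance values are hypothetical and would have to be chosen for the particular assay.

```python
from dataclasses import dataclass


@dataclass
class PcrProduct:
    """Hypothetical record of one recovered PCR product."""
    size_bp: int    # fragment size estimated from the gel, in base pairs
    amount: float   # relative band intensity, arbitrary units


def differs_from_control(patient: PcrProduct,
                         control: PcrProduct,
                         size_tol_bp: int = 5,
                         amount_tol: float = 0.25) -> bool:
    """Return True when the patient product differs from the control.

    A difference in size beyond size_tol_bp, or in amount beyond the
    fraction amount_tol of the control amount, is flagged. Both
    tolerances are illustrative assumptions, not values from the text.
    """
    size_changed = abs(patient.size_bp - control.size_bp) > size_tol_bp
    amount_changed = (abs(patient.amount - control.amount)
                      > amount_tol * control.amount)
    return size_changed or amount_changed


if __name__ == "__main__":
    control = PcrProduct(size_bp=218, amount=1.0)
    print(differs_from_control(PcrProduct(250, 1.0), control))
    print(differs_from_control(PcrProduct(219, 1.05), control))
```

A flagged difference would only indicate that the sample merits further analysis, for example by sequencing the recovered product.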
Transgenic mice, engineered to express the ZTMPO-1 gene, and mice that exhibit a complete absence of ZTMPO-1 gene function, referred to as "knockout mice" (Snouwaert et al., Science 257:1083, 1992), may also be generated (Lowell et al., Nature 366:740-42, 1993). These mice may be employed to study the ZTMPO-1 gene and the protein encoded thereby in an in vivo system. Such mice could be used, for example, in breeding studies to determine the effect ZTMPO-1 has on spermatogenesis and sperm function as well as on conductivity of the heart.
For pharmaceutical use, the proteins of the present invention are formulated for parenteral, particularly intravenous or subcutaneous, delivery according to conventional methods. Intravenous administration will be by bolus injection or infusion over a typical period of one to several hours. In general, pharmaceutical formulations will include a ZTMPO-1 protein in combination with a pharmaceutically acceptable vehicle, such as saline, buffered saline, 5% dextrose in water or the like. Formulations may further include one or more excipients, preservatives, solubilizers, buffering agents, albumin to prevent protein loss on vial surfaces, etc.
Methods of formulation are well known in the art and are disclosed, for example, in Remington: The Science and Practice of Pharmacy, Gennaro, ed., Mack Publishing Co., Easton, PA, 19th ed., 1995. Determination of dose is within the level of ordinary skill in the art. The proteins may be administered for acute treatment, over one week or less, often over a period of one to three days, or may be used in chronic treatment, over several months or years. Evaluation of the therapeutic effect of ZTMPO-1 for cardiac applications can be done by looking for changes in ECG. Decreases in creatine kinase levels and a decrease in weakness would serve as indicators for changes in muscle wasting associated with muscular dystrophy.
The invention is further illustrated by the following non-limiting examples.
EXAMPLES
Example 1
Isolation of ZTMPO-1
Novel ZTMPO-1-encoding polynucleotides and polypeptides of the present invention were initially identified by querying an EST database. To identify the corresponding cDNA, two clones from which an identified EST was derived, and which were considered likely to contain the entire human ZTMPO-1 sequence, were used for sequencing.
Plasmid DNA was prepared from a 5 ml overnight culture in LB + 50 µg/ml ampicillin using a QIAwell 8 plasmid kit (Qiagen, Inc., Chatsworth, CA) according to the manufacturer's instructions. The templates were sequenced on an Applied Biosystems™ model 377 DNA sequencer (Perkin-Elmer Cetus, Norwalk, CT) using the ABI PRISM™ Dye Terminator Cycle Sequencing Ready Reaction Kit (Perkin-Elmer Corp.) according to the manufacturer's instructions.
Oligonucleotides ZC694 (SEQ ID NO:9), ZC976 (SEQ ID NO:10) and ZC447 (SEQ ID NO:14) were used as sequencing primers.
Oligonucleotides ZC15976 (SEQ ID NO:11), ZC15485 (SEQ ID NO:12), ZC15526 (SEQ ID NO:13), ZC15620 (SEQ ID NO:15) and ZC15823 (SEQ ID NO:16) were used to complete the sequence from the clones.
Sequencing reactions were carried out in a Hybaid OmniGene Temperature Cycling System (National Labnet Co., Woodbridge, NY). Sequencher™ 3.0 sequence analysis software (Gene Codes Corporation, Ann Arbor, MI) was used for data analysis. The sequences from the two clones overlapped by 740 bp and contained the 3' end of the gene and the poly A tail. A third clone, prepared as described above, was sequenced to obtain the remaining 5' sequence. Oligonucleotides ZC447 (SEQ ID NO:14), ZC976 (SEQ ID NO:10), ZC16162 (SEQ ID NO:17), ZC16038 (SEQ ID NO:18), ZC16249 (SEQ ID NO:19), ZC16164 (SEQ ID NO:20), ZC16163 (SEQ ID NO:21), ZC16165 (SEQ ID NO:22) and ZC16037 (SEQ ID NO:23) were used in sequencing. Differences between the original EST sequences and the final sequence of ZTMPO-1 were detected. The lack of identity arose from ambiguity in the original EST sequences.
To confirm that the polynucleotide sequence encoding the initial methionine had been identified, a nested 5' RACE (rapid amplification of cDNA ends) was performed. Several Marathon™ cDNA libraries (human prostate, spleen, testis and uterus) were prepared using a Marathon cDNA kit (Clontech) according to the manufacturer's instructions. For the first round of PCR, oligonucleotides AP1 (SEQ ID NO:24, supplied with the kit or synthesized) and ZC15527 (SEQ ID NO:25) were used as primers, and the 5' RACE reaction was carried out at 94°C for 2 minutes, followed by 25 cycles at 94°C for 15 seconds, 61°C for 20 seconds and 72°C for 30 seconds, followed by a 1 minute extension at 72°C. The PCR products from the first round reaction were diluted 1/100 and used as templates for a second round of PCR using oligonucleotides AP2 (SEQ ID NO:32, supplied with the Marathon Kit or synthesized) and ZC15526 (SEQ ID NO:13) as primers. The PCR-derived DNA fragments were resolved by gel electrophoresis, excised and ligated into the vector pCR2.1 (TA Cloning Kit, Invitrogen Inc., San Diego, CA) according to the manufacturer's instructions. The inserts were sequenced using oligonucleotides ZC694 (SEQ ID NO:9) and ZC695 (SEQ ID NO:26) as primers, as described above, which confirmed that the Met (amino acid residue 1 of SEQ ID NO:2) was indeed the start methionine. The resulting 2,754 bp polynucleotide (SEQ ID
NO:1) had an open reading frame encoding an 876 amino acid residue protein sequence (SEQ ID NO:2) and was designated ZTMPO-1.
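The stated protein length can be cross-checked against the coding-region coordinates given for SEQ ID NO:1 in the Sequence Listing (CDS at positions 127 to 2754). The following is a minimal sketch of that arithmetic; the function name is hypothetical, and it assumes, as the listing indicates, that the CDS coordinates do not include the stop codon.

```python
def cds_summary(start: int, end: int) -> tuple[int, int]:
    """Return (nucleotides, codons) for a CDS given 1-based inclusive coordinates.

    Raises ValueError if the span is not a whole number of codons.
    """
    length_nt = end - start + 1
    codons, remainder = divmod(length_nt, 3)
    if remainder:
        raise ValueError(f"CDS length {length_nt} is not a multiple of 3")
    return length_nt, codons


if __name__ == "__main__":
    # Coordinates taken from the <222> feature line of SEQ ID NO:1.
    nt, aa = cds_summary(127, 2754)
    print(f"{nt} nucleotides encode {aa} amino acid residues")
```

The result agrees with the 876-residue length recited for SEQ ID NO:2 and with the 2628-nucleotide length of the degenerate sequence of SEQ ID NO:5.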
Example 2
Northern Blot Analysis of ZTMPO-1
Human Multiple Tissue Northern Blots (MTN I, MTN II and MTN III; Clontech) were probed to determine the tissue distribution of human ZTMPO-1 expression. An approximately 218 bp PCR-derived probe (SEQ ID NO:8) was amplified using EST clone EST934031 (SEQ ID NO:27) as a template and oligonucleotides ZC15521 (SEQ ID NO:28) and ZC15525 (SEQ ID NO:29) as primers. The amplification was carried out as follows: 1 cycle at 94°C for 2 minutes, 30 cycles of 94°C for 15 seconds, 65°C for 20 seconds and 72°C
seconds, followed by 1 cycle at 72°C for 1 minute. The PCR
product was gel purified using the QIAquick method (Qiagen, Chatsworth, CA) and radioactively labeled using the Rediprime DNA labeling kit (Amersham, Arlington Heights, IL), both according to the manufacturer's suggestion. The probe was purified using a NUCTRAP push column (Stratagene). EXPRESSHYB (Clontech) solution was used for prehybridization and as a hybridizing solution for the Northern blots. Hybridization took place overnight at 65°C using 4 x 10^6 cpm/ml of labeled probe.
The blots were then washed in 2X SSC and 0.05% SDS at RT, followed by washes in 0.1X SSC and 0.1% SDS at 50°C twice and at 55°C once. Two transcripts of approximately 3.2 kb and 5 kb were seen in nearly all the tissues, with the most predominant expression being in testis.
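As a simple sanity check on the probe described in this example, the length and base composition of SEQ ID NO:8 can be computed directly from the Sequence Listing. The sketch below is illustrative only; the helper names are hypothetical, and the sequence string is transcribed from the listing with its spacing removed.

```python
# SEQ ID NO:8 (Northern blot probe), transcribed from the Sequence Listing.
PROBE = (
    "ctcaggctttactggagcaaggaggaaggctgtcttctttctaccaccatgaggcaggtg"
    "tcacagctctcagccaggacccacaaaggattttgaagccagctgaagggaacccaactg"
    "atcaggctggtttttctgaagacagagattttggttacagtgtgggcctgaatcctccag"
    "aggaggaagctgtgacatccaagacctgctcggtgccc"
)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA sequence (case-insensitive)."""
    seq = seq.lower()
    return (seq.count("g") + seq.count("c")) / len(seq)


def is_plain_dna(seq: str) -> bool:
    """True if the sequence contains only the unambiguous bases a, c, g, t."""
    return set(seq.lower()) <= set("acgt")


if __name__ == "__main__":
    print("length (bp):", len(PROBE))
    print("GC fraction:", round(gc_fraction(PROBE), 3))
    print("unambiguous:", is_plain_dna(PROBE))
```

A length other than the recited 218 bp, or the presence of ambiguous characters, would suggest a transcription error in the sequence as copied.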
Example 3
Chromosomal Assignment and Placement of ZTMPO-1
ZTMPO-1 was mapped to chromosome 12 using the commercially available GeneBridge 4 Radiation Hybrid Panel (Research Genetics, Inc., Huntsville, AL). The GeneBridge 4 Radiation Hybrid Panel contains PCRable DNAs from each of 93 radiation hybrid clones, plus two control DNAs (the HFL donor and the A23 recipient). A publicly available WWW server (http://www-genome.wi.mit.edu/cgi-bin/contig/rhmapper.pl) allows mapping relative to the Whitehead Institute/MIT Center for Genome Research's radiation hybrid map of the human genome (the "WICGR" radiation hybrid map), which was constructed with the GeneBridge 4 Radiation Hybrid Panel.
For the mapping of ZTMPO-1 with the GeneBridge 4 RH Panel, 20 µl reactions were set up in a 96-well microtiter plate (Stratagene, La Jolla, CA) and used in a RoboCycler Gradient 96 thermal cycler (Stratagene). Each of the 95 PCR reactions consisted of 2 µl 10X KlenTaq PCR reaction buffer (CLONTECH Laboratories, Inc., Palo Alto, CA), 1.6 µl dNTPs mix (2.5 mM each, PERKIN-ELMER, Foster City, CA), 1 µl sense primer, ZC15487 (SEQ ID NO:6), 1 µl antisense primer, ZC15486 (SEQ ID NO:7), 2 µl RediLoad (Research Genetics, Inc.), 0.4 µl 50X Advantage KlenTaq Polymerase Mix (Clontech Laboratories, Inc.), 25 ng of DNA from an individual hybrid clone or control and ddH2O for a total volume of 20 µl. The reactions were overlaid with an equal amount of mineral oil and sealed. The PCR cycler conditions were as follows: an initial 1 cycle 5 minute denaturation at 95°C, 35 cycles of a 1 minute denaturation at 95°C, 1 minute annealing at 62°C and 1.5 minute extension at 72°C, followed by a final 1 cycle extension of 7 minutes at 72°C. The reactions were separated by electrophoresis on a 2% agarose gel (Life Technologies, Gaithersburg, MD).
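The per-reaction volumes and cycling conditions above lend themselves to a short worked calculation. The sketch below is illustrative only: the template volume (1 µl carrying the 25 ng of DNA) and the optional pipetting overage are assumptions not stated in the text, the function names are hypothetical, and ramp times between temperatures are ignored.

```python
REACTION_UL = 20.0  # total reaction volume stated in the text

# Per-reaction reagent volumes (µl) as stated in the text.
PER_REACTION_UL = {
    "10X KlenTaq PCR reaction buffer": 2.0,
    "dNTPs mix (2.5 mM each)": 1.6,
    "sense primer ZC15487": 1.0,
    "antisense primer ZC15486": 1.0,
    "RediLoad": 2.0,
    "50X Advantage KlenTaq Polymerase Mix": 0.4,
}


def water_per_reaction(dna_ul: float) -> float:
    """ddH2O needed to bring one reaction to 20 µl (dna_ul is an assumption)."""
    return REACTION_UL - sum(PER_REACTION_UL.values()) - dna_ul


def master_mix(n_reactions: int, dna_ul: float = 1.0,
               overage: float = 0.0) -> dict[str, float]:
    """Scale every template-free component to n_reactions plus an overage."""
    scale = n_reactions * (1.0 + overage)
    mix = {name: vol * scale for name, vol in PER_REACTION_UL.items()}
    mix["ddH2O"] = water_per_reaction(dna_ul) * scale
    return mix


def hold_minutes(initial: float = 5.0, cycles: int = 35,
                 steps: tuple[float, ...] = (1.0, 1.0, 1.5),
                 final: float = 7.0) -> float:
    """Total programmed hold time in minutes, excluding ramp times."""
    return initial + cycles * sum(steps) + final


if __name__ == "__main__":
    for name, vol in master_mix(95, dna_ul=1.0, overage=0.10).items():
        print(f"{name}: {vol:.1f} µl")
    print(f"programmed hold time: {hold_minutes():.1f} min")
```

With the assumed 1 µl template volume, the stated reagents account for 8 µl of each reaction and the balance is water.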
The results showed that ZTMPO-1 maps 636.18 cR_3000 from the top of the human chromosome 12 linkage group on the WICGR radiation hybrid map. The proximal framework marker was D12S367. This positions ZTMPO-1 in the 12q24.33 region on the integrated LDB chromosome 12 map (The Genetic Location Database, University of Southampton, WWW server: http://cedar.genetics.soton.ac.uk/public_html/).
From the foregoing, it will be appreciated that, although specific embodiments of the invention have been described herein for purposes of illustration, various modifications may be made without deviating from the spirit and scope of the invention. Accordingly, the invention is not limited except as by the appended claims.
SEQUENCE LISTING
<110> ZymoGenetics, Inc.
1201 Eastlake Avenue East
Seattle, Washington 98102
United States of America
<120> SOLUBLE PROTEIN ZTMPO-1
<130> 97-67PC
<150> 60/082,513
<151> 1998-04-21
<160> 32
<170> FastSEQ for Windows Version 3.0
<210> 1
<211> 2884
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (127)...(2754) <400> 1 aaagttttta atgaaagaaa cagaaactga tgccattata taatgaaccc tagtacccat 60 cacccagctt cagcaggtgt tagtattttg tgactctttg atttttttgt cttgggccta 120 ggtgaa atg aca atg gat get ctg ttg get cga ttg aaa ctt ctg aat 168 Met Thr Met Asp Ala Leu Leu Ala Arg Leu Lys Leu Leu Asn 1 5 lp ' cca gat gac ctt aga gaa gaa atc gtc aaa gcc gga ttg aaa tgt gga 216 Pro Asp Asp Leu Arg Glu Glu Ile Val Lys Ala Gly Leu Lys Cys Gly ccc att aca tca act aca agg ttc att ttt gag aaa aaa ttg get cag 264 Pro Ile Thr Ser Thr Thr Arg Phe Ile Phe Glu Lys Lys Leu Ala Gln get tta ctg gag caa gga gga agg ctg tct tct ttc tac cac cat gag 312 Ala Leu Leu Glu Gln Gly Gly Arg Leu Ser Ser Phe Tyr His His Glu gca ggt gtc aca get ctc agc cag gac cca caa agg att ttg aag cca 360 Ala Gly Val Thr Ala Leu Ser Gln Asp Pro Gln Arg Ile Leu Lys Pro get gaa ggg aac cca act gat cag get ggt ttt tct gaa gac aga gat 408 Ala Glu Gly Asn Pro Thr Asp Gln Ala Gly Phe Ser Glu Asp Arg Asp ttt ggt tac agt gtg ggc ctg aat cct cca gag gag gaa get gtg aca 456 Phe Gly Tyr Ser Val Gly Leu Asn Pro Pro Glu Glu Glu Ala Val Thr tcc aag acc tgc tcg gtg ccc cct agt gac acc gac acc tac aga get 504 Ser Lys Thr Cys Ser Val Pro Pro Ser Asp Thr Asp Thr Tyr Arg Ala gga gcg act gcg tct aag gag ccg ccc ctg tac tat ggg gtg tgt cca 552 Gly Ala Thr Ala Ser Lys Glu Pro Pro Leu Tyr Tyr Gly Val Cys Pro gtg tat gag gac gtc cca gcg aga aat gaa agg atc tat gtt tat gaa 600 Ual Tyr Glu Asp Val Pro Ala Arg Asn Glu Arg Ile Tyr Val Tyr Glu aat aaa aag gaa gca ttg caa get gtc aag atg atc aaa ggg tcc cga 648 Asn Lys Lys Glu Ala Leu Gln Ala Val Lys Met Ile Lys Gly Ser Arg ttt aaa get ttt tct acc aga gaa gac get gag aaa ttt get aga gga 696 Phe Lys Ala Phe Ser Thr Arg Glu Asp Ala Glu Lys Phe Ala Arg Gly att tgt gat tat ttc cct tct cca agc aaa acg tcc tta cca ctg tct 744 Ile Cys Asp Tyr Phe Pro Ser Pro Ser Lys Thr Ser Leu Pro Leu Ser cct gtg aaa aca get cca ctc ttt agc aat gac agg ttg aaa gat ggt 792 Pro Ual Lys Thr Ala Pro Leu Phe Ser Asn Asp Arg 
Leu Lys Asp Gly ttg tgc ttg tcg gaa tca gaa aca gtc aac aaa gag cga gcg aac agt 840 Leu Cys Leu Ser Glu Ser Glu Thr Val Asn Lys Glu Arg Ala Asn Ser tac aaa aat ccc cgc acg cag gac ctc acc gcc aag ctt cgg aaa get 888 Tyr Lys Asn Pro Arg Thr Gln Asp Leu Thr Ala Lys Leu Arg Lys Ala gtg gag aag gga gag gag gac acc ttt tct gac ctt atc tgg agc aac 936 Val Glu Lys Gly Glu Glu Asp Thr Phe Ser Asp Leu Ile Trp Ser Asn ccc cgg tat ctg ata ggc tca gga gac aac ccc act atc gtg cag gaa 984 Pro Arg Tyr Leu Ile Gly Ser Gly Asp Asn Pro Thr Ile Val Gln Glu ggg tgc agg tac aac gtg atg cat gtt get gcc aaa gag aac cag get 1032 Gly Cys Arg Tyr Asn Ual Met His Val Ala Ala Lys Glu Asn Gln Ala tcc atc tgc cag ctg act ctg gac gtc ctg gag aac cct gac ttc atg 1080 Ser Ile Cys Gln Leu Thr Leu Asp Val Leu Glu Asn Pro Asp Phe Met agg ctg atg tac cct gat gac gac gag gcc atg ctg cag aag cgt atc 1128 Arg Leu Met Tyr Pro Asp Asp Asp Glu Ala Met Leu Gln Lys Arg Ile cgt tac gtg gtg gac ctg tac ctc aac acc ccc gac aag atg ggc tat 1176 Arg Tyr Val Val Asp Leu Tyr Leu Asn Thr Pro Asp Lys Met Gly Tyr gac aca ccg ttg cat ttt get tgt aag ttt gga aat gca gat gta gtc 1224 Asp Thr Pro Leu His Phe Ala Cys Lys Phe Gly Asn Ala Asp Val Val aac gtg ctt tcg tca cac cat ttg att gta aaa aac tca agg aat aaa 1272 Asn Val Leu Ser Ser His His Leu Ile Val Lys Asn Ser Arg Asn Lys tat gat aaa aca cct gaa gat gta att tgt gaa aga agc aaa aat aaa 1320 Tyr Asp Lys Thr Pro Glu Asp Ual Ile Cys Glu Arg Ser Lys Asn Lys tct gtg gaa ctg aag gag cgg atc aga gag tat tta aag ggc cac tac 1368 Ser Val Glu Leu Lys Glu Arg Ile Arg Glu Tyr Leu Lys Gly His Tyr WO 99/54468 PCT/US99l08601 tac gtg ccc ctc ctg aga gcg gaa gag act tct tct cca gtc atc ggg 1416 Tyr Val Pro Leu Leu Arg Ala Glu Glu Thr Ser Ser Pro Val Ile Gly gag ctg tgg tcc cca gac cag acg get gag gcc tct cac gtc agc cgc 1464 Glu Leu Trp Ser Pro Asp Gln Thr Ala Glu Ala Ser His Val Ser Arg tat gga ggc agc ccc aga gac ccg gta ctg acc ctg aga gcc ttc gca 1512 Tyr Gly Gly Ser Pro Arg Asp 
Pro Val Leu Thr Leu Arg Ala Phe Ala ggg ccc ctg agt cca gcc aag gca gaa gat ttt cgc aag ctc tgg aaa 1560 Gly Pro Leu Ser Pro Ala Lys Ala Glu Asp Phe Arg Lys Leu Trp Lys act cca cct cga gag aaa gca ggc ttc ctt cac cac gtc aag aag tcg 1608 Thr Pro Pro Arg Glu Lys Ala Gly Phe Leu His His Ual Lys Lys Ser gac ccg gaa aga ggc ttt gag aga gtg gga agg gag cta get cat gag 1656 Asp Pro Glu Arg Gly Phe Glu Arg Val Gly Arg Glu Leu Ala His Glu ctg ggg tat ccc tgg gtt gaa tac tgg gaa ttt ctg ggc tgt ttt gtt 1704 Leu Gly Tyr Pro Trp Val Glu Tyr Trp Glu Phe Leu Gly Cys Phe Val gat ctg tct tcc cag gaa ggc ctg caa aga cta gaa gaa tat ctc aca 1752 Asp Leu Ser Ser Gln Glu Gly Leu Gln Arg Leu Glu Glu Tyr Leu Thr cag cag gaa ata ggc aaa aag get caa caa gaa aca gga gaa cgg gaa 1800 Gln Gln Glu Ile Gly Lys Lys Ala Gln Gln Glu Thr Gly Glu Arg Glu gcc tcc tgc cga gat aaa gcc acc acg tct ggc agc aat tcc att tcc 1848 Ala Ser Cys Arg Asp Lys Ala Thr Thr Ser Gly Ser Asn Ser Ile Ser gtg agg gcg ttt cta gat gaa gat gac atg agc ttg gaa gaa ata aaa 1896 Val Arg Ala Phe Leu Asp Glu Asp Asp Met Ser Leu Glu Glu Ile Lys aat cgg caa aat gca get cga aat aac agc ccg ccc aca gtc ggt get 1944 Asn Arg Gln Asn Ala Ala Arg Asn Asn Ser Pro Pro Thr Val Gly Ala ttt gga cat acg agg tgc agc gcc ttc ccc ttg gag cag gag gca gac 1992 Phe Gly His Thr Arg Cys Ser Ala Phe Pro Leu Glu Gln Glu Ala Asp ctc ata gaa gcc gcc gag ccg gga ggt cca cac agc agc aga aat ggg 2040 Leu Ile Glu Ala Ala Glu Pro Gly Gly Pro His Ser Ser Arg Asn Gly ctc tgc cat cct ctg aat cac agc agg acc ctg gcg ggc aag aga cca 2088 Leu Cys His Pro Leu Asn His Ser Arg Thr Leu Ala Gly Lys Arg Pro aag gcc ccc cat ggg gag gaa gcc cat ctg cca cct gtc tcg gat ttg 2136 Lys Ala Pro His Gly Glu Glu Ala His Leu Pro Pro Val Ser Asp Leu act gtt gag ttt gat aaa ctg aat ttg caa aat ata gga cgt agc gtt 2184 Thr Ual Glu Phe Asp Lys Leu Asn Leu Gln Asn Ile Gly Arg Ser Val tcc aag aca cca gat gaa agt aca aaa act aaa gat cag atc ctg act 2232 Ser Lys Thr Pro Asp Glu Ser Thr 
Lys Thr Lys Asp Gln Ile Leu Thr tca aga atc aat gca gta gaa aga gac ttg tta gag cct tct ccc gca 2280 Ser Arg Ile Asn Ala Val Glu Arg Asp Leu Leu Glu Pro Ser Pro Ala gac caa ctc ggg aat ggc cac agg agg aca gaa agt gaa atg tca gcc 2328 Asp Gln Leu Gly Asn Gly His Arg Arg Thr Glu Ser Glu Met Ser Ala agg atc get aaa atg tcc ttg agt ccc agc agc ccc agg cac gag gat 2376 Arg Ile Ala Lys Met Ser Leu Ser Pro Ser Ser Pro Arg His Glu Asp cag ctc gag gtc acc agg gaa ccg gcc agg cgg ctc ttc ctt ttt gga 2424 Gln Leu Glu Val Thr Arg Glu Pro Ala Arg Arg Leu Phe Leu Phe Gly gag gag cca tca aaa ctc gat cag gat gtt ttg gcc get ctt gaa tgt 2472 Glu Glu Pro Ser Lys Leu Asp Gln Asp Val Leu Ala Ala Leu Glu Cys gca gac gtc gac ccc cat cag ttc ccg gcc gtg cac aga tgg aag agt 2520 Ala Asp Ual Asp Pro His Gln Phe Pro Ala Val His Arg Trp Lys Ser get gtc ctg tgc tac tca ccc tcg gac aga cag agt tgg ccc agt ccc 2568 Ala Val Leu Cys Tyr Ser Pro Ser Asp Arg Gln Ser Trp Pro Ser Pro gcg gtg aaa gga agg ttc aag tct cag ctg cca gat ctc agt ggc cct 2616 Ala Val Lys Gly Arg Phe Lys Ser Gln Leu Pro Asp Leu Ser Gly Pro cac agc tac agt ccg ggg aga aac agc gtg get gga agc aac ccc gca 2664 His Ser Tyr Ser Pro Gly Arg Asn Ser Val Ala Gly Ser Asn Pro Ala aag cca ggc ctg ggc agt cct ggg cgc tac agc ccc gtg cac ggg agc 2712 Lys Pro Gly Leu Gly Ser Pro Gly Arg Tyr Ser Pro Val His Gly Ser cag ctc cgc agg atg gcg cgc ctg get gag ctt gcc gcc ctg 2754 Gln Leu Arg Arg Met Ala Arg Leu Ala Glu Leu Ala Ala Leu taggcttggc gctgggctct cggtttgttc ttcattttta aagaaggaag ggtcatatgt 2814 ttattgctaa actgtcaaaa aggaatatat tctgattaaa ttattactcc tcaaaaaaaa 2874 aaaaaaaaaa 2884 <210>2 <211>876 <212>PRT
<213>Homo sapiens <400> 2 Met Thr Met Asp Ala Leu Leu Ala Arg Leu Lys Leu Leu Asn Pro Asp Asp Leu Arg Glu Glu Ile Val Lys Ala Gly Leu Lys Cys Gly Pro Ile Thr Ser Thr Thr Arg Phe Ile Phe Glu Lys Lys Leu Ala Gln Ala Leu Leu Glu Gln Gly Gly Arg Leu Ser Ser Phe Tyr His His Glu Ala Gly Val Thr Ala Leu Ser Gln Asp Pro Gln Arg Ile Leu Lys Pro Ala Glu Gly Asn Pro Thr Asp Gln Ala Gly Phe Ser Glu Asp Arg Asp Phe Gly Tyr Ser Val Gly Leu Asn Pro Pro Glu Glu Glu Ala Val Thr Ser Lys Thr Cys Ser Val Pro Pro Ser Asp Thr Asp Thr Tyr Arg Ala Gly Ala Thr Ala Ser Lys Glu Pro Pro Leu Tyr Tyr Gly Ual Cys Pro Val Tyr Glu Asp Val Pro Ala Arg Asn Glu Arg Ile Tyr Val Tyr Glu Asn Lys Lys Glu Ala Leu Gln Ala Val Lys Met Ile Lys Gly Ser Arg Phe Lys Ala Phe Ser Thr Arg Glu Asp Ala Glu Lys Phe Ala Arg Gly Ile Cys Asp Tyr Phe Pro Ser Pro Ser Lys Thr Ser Leu Pro Leu Ser Pro Val Lys Thr Ala Pro Leu Phe Ser Asn Asp Arg Leu Lys Asp Gly Leu Cys Leu Ser Glu Ser Glu Thr Val Asn Lys Glu Arg Ala Asn Ser Tyr Lys Asn Pro Arg Thr Gln Asp Leu Thr Ala Lys Leu Arg Lys Ala Val Glu Lys Gly Glu Glu Asp Thr Phe Ser Asp Leu Ile Trp Ser Asn Pro Arg Tyr Leu Ile Gly Ser Gly Asp Asn Pro Thr Ile Val Gln Glu Gly Cys Arg Tyr Asn Val Met His Val Ala Ala Lys Glu Asn Gln Ala Ser Ile Cys Gln Leu Thr Leu Asp Val Leu Glu Asn Pro Asp Phe Met Arg Leu Met Tyr Pro Asp Asp Asp Glu Ala Met Leu Gln Lys Arg Ile Arg Tyr Val Val Asp Leu Tyr Leu Asn Thr Pro Asp Lys Met Gly Tyr Asp Thr Pro Leu His Phe Ala Cys Lys Phe Gly Asn Ala Asp Val Val Asn Val Leu Ser Ser His His Leu Ile Val Lys Asn Ser Arg Asn Lys Tyr Asp Lys Thr Pro Glu Asp Val Ile Cys Glu Arg Ser Lys Asn Lys Ser Val Glu Leu Lys Glu Arg Ile Arg Glu Tyr Leu Lys Gly His Tyr Tyr Val Pro Leu Leu Arg Ala Glu Glu Thr Ser Ser Pro Val Ile Gly Glu Leu Trp Ser Pro Asp Gln Thr Ala Glu Ala Ser His Val Ser Arg Tyr Gly Gly Ser Pro Arg Asp Pro Val Leu Thr Leu Arg Ala Phe Ala Gly Pro Leu Ser Pro Ala Lys Ala Glu Asp Phe Arg Lys Leu Trp Lys Thr Pro Pro Arg Glu Lys Ala Gly Phe Leu His His Val Lys Lys 
Ser Asp Pro Glu Arg Gly Phe Glu Arg Val Gly Arg Glu Leu Ala His Glu Leu Gly Tyr Pro Trp Val Glu Tyr Trp Glu Phe Leu Gly Cys Phe Val Asp Leu Ser Ser Gln Glu Gly Leu Gln Arg Leu Glu Glu Tyr Leu Thr Gln Gln Glu Ile Gly Lys Lys Ala Gln Gln Glu Thr Gly Glu Arg Glu Ala Ser Cys Arg Asp Lys Ala Thr Thr Ser Gly Ser Asn Ser Ile Ser Val Arg Ala Phe Leu Asp Glu Asp Asp Met Ser Leu Glu Glu Ile Lys Asn Arg Gln Asn Ala Ala Arg Asn Asn Ser Pro Pro Thr Val Gly Ala Phe Gly His Thr Arg Cys Ser Ala Phe Pro Leu Glu Gln Glu Ala Asp Leu Ile Glu Ala Ala Glu Pro Gly Gly Pro His Ser Ser Arg Asn Gly Leu Cys His Pro Leu Asn His Ser Arg Thr Leu Ala Gly Lys Arg Pro Lys Ala Pro His Gly Glu Glu Ala His Leu Pro Pro Val Ser Asp Leu Thr Val Glu Phe Asp Lys Leu Asn Leu Gln Asn Ile Gly Arg Ser Ual Ser Lys Thr Pro Asp Glu Ser Thr Lys Thr Lys Asp Gln Ile Leu Thr Ser Arg Ile Asn Ala Val Glu Arg Asp Leu Leu Glu Pro Ser Pro Ala Asp Gln Leu Gly Asn Gly His Arg Arg Thr Glu Ser Glu Met Ser Ala Arg Ile Ala Lys Met Ser Leu Ser Pro Ser Ser Pro Arg His Glu Asp Gln Leu Glu Val Thr Arg Glu Pro Ala Arg Arg Leu Phe Leu Phe Gly Glu Glu Pro Ser Lys Leu Asp Gln Asp Val Leu Ala Ala Leu Glu Cys Ala Asp WO 99/544b8 PCT/US99/08601 Ual Asp Pro His Gln Phe Pro Ala Ual His Arg Trp Lys Ser Ala Val Leu Cys Tyr Ser Pro Ser Asp Arg Gln Ser Trp Pro Ser Pro Ala Val Lys Gly Arg Phe Lys Ser Gln Leu Pro Asp Leu Ser Gly Pro His Ser Tyr Ser Pro Gly Arg Asn Ser Val Ala Gly Ser Asn Pro Ala Lys Pro Gly Leu Gly Ser Pro Gly Arg Tyr Ser Pro Val His Gly Ser Gln Leu Arg Arg Met Ala Arg Leu Ala Glu Leu Ala Ala Leu <210> 3 <211> 254 <212> PRT
<213> Homo sapiens <400> 3 Met Asp Asn Tyr Ala Asp Leu Ser Asp Thr Glu Leu Thr Thr Leu Leu Arg Arg Tyr Asn Ile Pro His Gly Pro Val Val Gly Ser Thr Arg Arg Leu Tyr Glu Lys Lys Ile Phe Glu Tyr Glu Thr Gln Arg Arg Arg Leu Ser Pro Pro Ser Ser Ser Ala Ala Ser Ser Tyr Ser Phe Ser Asp Leu Asn Ser Thr Arg Gly Asp Ala Asp Met Tyr Asp Leu Pro Lys Lys Glu Asp Ala Leu Leu Tyr Gln Ser Lys Gly Tyr Asn Asp Asp Tyr Tyr Glu Glu Ser Tyr Phe Thr Thr Arg Thr Tyr Gly Glu Pro Glu Ser Ala Gly Pro Ser Arg Ala Ual Arg Gln Ser Val Thr Ser Phe Pro Asp Ala Asp Ala Phe His His Gln Val His Asp Asp Asp Leu Leu Ser Ser Ser Glu Glu Glu Cys Lys Asp Arg Glu Arg Pro Met Tyr Gly Arg Asp Ser Ala Tyr Gln Ser Ile Thr His Tyr Arg Pro Val Ser Ala Ser Arg Ser Ser Leu Asp Leu Ser Tyr Tyr Pro Thr Ser Ser Ser Thr Ser Phe Met Ser Ser Ser Ser Ser Ser Ser Ser Trp Leu Thr Arg Arg Ala Ile Arg Pro Glu Asn Arg Ala Pro Gly Ala Gly Leu Gly Gln Asp Arg Gln Val Pro Leu Trp Gly Gln Leu Leu Leu Phe Leu Val Phe Val Ile Val Leu Phe Phe Ile Tyr His Phe Met Gln Ala Glu Glu Gly Asn Pro Phe <210>4 <211>694 <212>PRT
<213>Homo Sapiens <400> 4 Met Pro Glu Phe Leu Glu Asp Pro Ser Val Leu Thr Lys Asp Lys Leu Lys Ser Glu Leu Val Ala Asn Asn Val Thr Leu Pro Ala Gly Glu Gln Arg Lys Asp Val Tyr Val Gln Leu Tyr Leu Gln His Leu Thr Ala Arg Asn Arg Pro Pro Leu Pro Ala Gly Thr Asn Ser Lys Gly Pro Pro Asp Phe Ser Ser Asp Glu Glu Arg Glu Pro Thr Pro Val Leu Gly Ser Gly Ala Ala Ala Ala Gly Arg Ser Arg Ala Ala Val Gly Arg Lys Ala Thr Lys Lys Thr Asp Lys Pro Arg Gln Glu Asp Lys Asp Asp Leu Asp Val Thr Glu Leu Thr Asn Glu Asp Leu Leu Asp Gln Leu Ual Lys Tyr Gly Val Asn Pro Gly Pro Ile Val Gly Thr Thr Arg Lys Leu Tyr Glu Lys Lys Leu Leu Lys Leu Arg Glu Gln Gly Thr Glu Ser Arg Ser Ser Thr Pro Leu Pro Thr Ile Ser Ser Ser Ala Glu Asn Thr Arg Gln Asn Gly Ser Asn Asp Ser Asp Arg Tyr Ser Asp Asn Glu Glu Gly Lys Lys Lys Glu His Lys Lys Val Lys Ser Thr Arg Asp Ile Val Pro Phe Ser Glu Leu Gly Thr Thr Pro Ser Gly Gly Gly Phe Phe Gln Gly Ile Ser Phe Pro Glu Ile Ser Thr Arg Pro Pro Leu Gly Ser Thr Glu Leu Gln Ala Ala Lys Lys Val His Thr Ser Lys Gly Asp Leu Pro Arg Glu Pro Leu Val Ala Thr Asn Leu Pro Gly Arg Gly Gln Leu Gln Lys Leu Ala Ser Glu Arg Asn Leu Phe Ile Ser Cys Lys Ser Ser His Asp Arg Cys Leu Glu Lys Ser Ser Ser Ser Ser Ser Gln Pro Glu His Ser Ala Met Leu Val Ser Thr Ala Ala Ser Pro Ser Leu Ile Lys Glu Thr Thr Thr Gly Tyr Tyr Lys Asp Ile Val Glu Asn Ile Cys Gly Arg Glu Lys Ser Gly Ile Gln Pro Leu Cys Pro Glu Arg Ser His Ile Ser Asp Gln Ser Pro Leu Ser Ser Lys Arg Lys Ala Leu Glu Glu Ser Glu Ser Ser Gln Leu Ile Ser Pro Pro Leu Ala Gln Ala Ile Arg Asp Tyr Val Asn Ser Leu Leu Val Gln Gly Gly Ual Gly Ser Leu Pro Gly Thr Ser Asn Ser Met Pro Pro Leu Asp Val Glu Asn Ile Gln Lys Arg Ile Asp Gln Ser Lys Phe Gln Glu Thr G1u Phe Leu Ser Pro Pro Arg Lys Ual Pro Arg Leu Ser Glu Lys Ser Val Glu Glu Arg Asp Ser Gly Ser Phe Val Ala Phe Gln Asn Ile Pro Gly Ser Glu Leu Met Ser Ser Phe Ala Lys Thr Val Val Ser His Ser Leu Thr Thr Leu Gly Leu Glu Ual Ala Lys Gln Ser Gln His Asp Lys Ile Asp Ala Ser Glu Leu Ser Phe Pro 
Phe His Glu Ser Ile Leu Lys Val Ile Glu Glu Glu Trp Gln Gln Val Asp Arg Gln Leu Pro Ser Leu Ala Cys Lys Tyr Pro Val Ser Ser Arg Glu Ala Thr Gln Ile Leu Ser Ual Pro Lys Val Asp Asp Glu Ile Leu Gly Phe Ile Ser Glu Ala Thr Pro Leu Gly Gly Ile Gln Ala Ala Ser Thr Glu Ser Cys Asn Gln Gln Leu Asp Leu Ala Leu Cys Arg Ala Tyr Glu Ala Ala Ala Ser Ala Leu Gln Ile Ala Thr His Thr Ala Phe Ual Ala Lys Ala Met Gln Ala Asp Ile Ser Gln Ala A1a Gln Ile Leu Ser Ser Asp Pro Ser Arg Thr His Gln Ala Leu Gly Ile Leu Ser Lys Thr Tyr Asp Ala Ala Ser Tyr Ile Cys Glu Ala Ala Phe Asp Glu Val Lys Met Ala Ala His Thr Met Gly Asn Ala Thr Val Gly Arg Arg Tyr Leu Trp Leu Lys Asp Cys Lys Ile Asn Leu Ala Ser Lys Asn Lys Leu Ala Ser Thr Pro Phe Lys Giy Gly Thr Leu Phe Gly Gly Glu Val Cys Lys Val Ile Lys Lys Arg Gly Asn Lys His <210> 5 <211> 2628 <212> DNA
<213> Artificial Sequence <220>
<223> Degenerate nucleotide sequence encoding the polypeptide of SEQ ID NO:2
<221> variation
<222> (1)...(2628)
<223> Each N is independently any one of A, T, G or C.
<400> 5 atgacnatgg aygcnytnytngcnmgnytnaarytnytnaayccngaygayytnmgngar 60 garathgtna argcnggnytnaartgyggnccnathacnwsnacnacnmgnttyathtty 120 garaaraary tngcncargcnytnytngarcarggnggnmgnytnwsnwsnttytaycay 180 caygargcng gngtnacngcnytnwsncargayccncarmgnathytnaarccngcngar 240 ggnaayccna cngaycargcnggnttywsngargaymgngayttyggntaywsngtnggn 300 ytnaayccnc cngargargargcngtnacnwsnaaracntgywsngtnccnccnwsngay 360 acngayacnt aymgngcnggngcnacngcnwsnaargarccnccnytntaytayggngtn 420 tgyccngtnt aygargaygtnccngcnmgnaaygarmgnathtaygtntaygaraayaar 480 aargargcny tncargcngtnaaratgathaarggnwsnmgnttyaargcnttywsnacn 540 mgngargayg cngaraarttygcnmgnggnathtgygaytayttyccnwsnccnwsnaar 600 acnwsnytnc cnytnwsnccngtnaaracngcnccnytnttywsnaaygaymgnytnaar 660 gayggnytnt gyytnwsngarwsngaracngtnaayaargarmgngcnaaywsntayaar 720 aayccnmgna cncargayytnacngcnaarytnmgnaargcngtngaraarggngargar 780 gayacnttyw sngayytnathtggwsnaayccnmgntayytnathggnwsnggngayaay 840 ccnacnathg tncargarggntgymgntayaaygtnatgcaygtngcngcnaargaraay 900 cargcnwsna thtgycarytnacnytngaygtnytngaraayccngayttyatgmgnytn 960 atgtayccng aygaygaygargcnatgytncaraarmgnathmgntaygtngtngayytn 1020 tayytnaaya cnccngayaaratgggntaygayacnccnytncayttygcntgyaartty 1080 ggnaaygcng aygtngtnaaygtnytnwsnwsncaycayytnathgtnaaraaywsnmgn 1140 aayaartaygayaaracnccngargaygtnathtgygarmgnwsnaaraayaarwsngtn 1200 garytnaargarmgnathmgngartayytnaarggncaytaytaygtnccnytnytnmgn 1260 gcngargaracnwsnwsnccngtnathggngarytntggwsnccngaycaracngcngar 1320 gcnwsncaygtnwsnmgntayggnggnwsnccnmgngayccngtnytnacnytnmgngcn 1380 ttygcnggnccnytnwsnccngcnaargcngargayttymgnaarytntggaaracnccn 1440 ccnmgngaraargcnggnttyytncaycaygtnaaraarwsngayccngarmgnggntty 1500 garmgngtnggnmgngarytngcncaygarytnggntayccntgggtngartaytgggar 1560 ttyytnggntgyttygtngayytnwsnwsncargarggnytncarmgnytngargartay 1620 ytnacncarcargarathggnaaraargcncarcargaracnggngarmgngargcnwsn 1680 tgymgngayaargcnacnacnwsnggnwsnaaywsnathwsngtnmgngcnttyytngay 1740 gargaygayatgwsnytngargarathaaraaymgncaraaygcngcnmgnaayaaywsn 1800 
ccnccnacngtnggngcnttyggncayacnmgntgywsngcnttyccnytngarcargar 1860 gcngayytnathgargcngcngarccnggnggnccncaywsnwsnmgnaayggnytntgy 1920 cayccnytnaaycaywsnmgnacnytngcnggnaarmgnccnaargcnccncayggngar 1980 gargcncayytnccnccngtnwsngayytnacngtngarttygayaarytnaayytncar 2040 aayathggnmgnwsngtnwsnaaracnccngaygarwsnacnaaracnaargaycarath 2100 ytnacnwsnmgnathaaygcngtngarmgngayytnytngarccnwsnccngcngaycar 2160 ytnggnaayggncaymgnmgnacngarwsngaratgwsngcnmgnathgcnaaratgwsn 2220 ytnwsnccnwsnwsnccnmgncaygargaycarytngargtnacnmgngarccngcnmgn 2280 mgnytnttyytnttyggngargarccnwsnaarytngaycargaygtnytngcngcnytn 2340 gartgygcngaygtngayccncaycarttyccngcngtncaymgntggaarwsngcngtn 2400 ytntgytaywsnccnwsngaymgncarwsntggccnwsnccngcngtnaarggnmgntty 2460 aarwsncarytnccngayytnwsnggnccncaywsntaywsnccnggnmgnaaywsngtn 2520 gcnggnwsnaayccngcnaarccnggnytnggnwsnccnggnmgntaywsnccngtncay 2580 ggnwsncarytnmgnmgnatggcnmgnytngcngarytngcngcnytn 2628 <210> 6 <211> 18 <2I2> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC15487 <400> 6 ggacccatta catcaact 18 <210> 7 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC15486 <400> 7 cctccttgct ccagtaaa 18 <210> 8 <211> 218 <212> DNA
<213> Artificial Sequence <220>
<223> Northern Blot probe <400> 8 ctcaggcttt actggagcaa ggaggaaggc tgtcttcttt ctaccaccat gaggcaggtg 60 tcacagctct cagccaggac ccacaaagga ttttgaagcc agctgaaggg aacccaactg 120 atcaggctgg tttttctgaa gacagagatt ttggttacag tgtgggcctg aatcctccag 180 aggaggaagc tgtgacatcc aagacctgct cggtgccc 218 <210> 9 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> ZC694 <400> 9 taatacgact cactatag 18 <210> 10 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC976 <400> 10 cgttgtaaaa cgacggcc 18 <210> 11 <211> 22 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC15976 <400> 11 cagctctgta ggtgtcggtg tc 22 <210> 12 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC15485 <400> 12 caccgacacc tacagagc 18 <210> 13 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> ZC15526 <400> 13 tgctccagta aagcctgagc caatt 25 <210> 14 <211> 17 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC447 <400> 14 taacaatttc acacagg 17 <210> 15 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC15620 <400> 15 acagagctgg agcgactgcg 20 <210> 16 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC15823 <400> 16 tctctttggc agcaacatgc 20 <210> 17 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC16162 <400> 17 gtgcaggtac aacgtgatgc 20 <210> 18 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC16035 <400> 18 ctgacttcat gaggctgatg 20 <210> 19 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC16249 <400> 19 cagggtacat cagcctcatg 20 <210> 20 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC16164 <400> 20 tctgtcttcc caggaaggcc 20 <210> 21 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC16163 <400> 21 ggaattgctg ccagacgtgg 20 <210> 22 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC16165 <400> 22 agagccttct cccgcagacc 20 <210> 23 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide 16037 <400> 23 ggctgctggg actcaaggac 20 <210> 24 <211> 27 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide AP1 <400> 24 ccatcctaat acgactcact atagggc 27 <210> 25 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC15527 <400> 25 ctcatggtgg tagaaagaag acagc 25 <210> 26 <211> 19 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC695 <400> 26 gatttaggtg acactatag 19 <210> 27 <211> 424 <212> DNA
<213> Artificial Sequence <220>
<223> EST934031 <400> 27 gctcgattga aacttctgaa tccagatgac cttagagaag aaatcgtcaa agccggattg 60 aaatgtggacccattacatcaactacaaggttcatttttgagaaaaaattggctcaggct 120 ttactggagcaaggaggaaggctgtcttctttctaccaccatgaggcaggtgtcacagct 180 ctcagccaggacccacaaaggattttgaagccagctgaagggaacccaactgatcaggct 240 ggtttttctgaagacagagattttggttacagtgtgggcctgaatcctccagaggaggaa 300 gctgtgacatccaagacctgctcggtgccccctagtgacaccgacacctacagagctgga 360 gcgactgcgtctataggagccgccccctgtactatgngggtgtgtccagttgtatgagga 420 cgtc 424 <210> 28 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC15521 <400> 28 gggcaccgag caggtcttgg atgt 24 <210> 29 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide ZC15525 <400> 29 ctcaggcttt actggagcaa ggagg 25 <210> 30 <211> 454 <212> PRT
<213> Homo sapiens <400> 30 Met Pro Glu Phe Leu Glu Asp Pro Ser Val Leu Thr Lys Asp Lys Leu Lys Ser Glu Leu Val Ala Asn Asn Val Thr Leu Pro Ala Gly Glu Gln Arg Lys Asp Val Tyr Val Gln Leu Tyr Leu Gln His Leu Thr Ala Arg Asn Arg Pro Pro Leu Pro Ala Gly Thr Asn Ser Lys Gly Pro Pro Asp Phe Ser Ser Asp Glu Glu Arg Glu Pro Thr Pro Val Leu Gly Ser Gly Ala Ala Ala Ala Gly Arg Ser Arg Ala Ala Val Gly Arg Lys Ala Thr Lys Lys Thr Asp Lys Pro Arg Gln Glu Asp Lys Asp Asp Leu Asp Val Thr Glu Leu Thr Asn Glu Asp Leu Leu Asp Gln Leu Val Lys Tyr Gly Val Asn Pro Gly Pro Ile Val Gly Thr Thr Arg Lys Leu Tyr Glu Lys Lys Leu Leu Lys Leu Arg Glu Gln Gly Thr Glu Ser Arg Ser Ser Thr Pro Leu Pro Thr Ile Ser Ser Ser Ala Glu Asn Thr Arg Gln Asn Gly Ser Asn Asp Ser Asp Arg Tyr Ser Asp Asn Glu Glu Asp Ser Lys Ile Glu Leu Lys Leu Glu Lys Arg Glu Pro Leu Lys Gly Arg Ala Lys Thr Pro Val Thr Leu Lys Gln Arg Arg Val Glu His Asn Gln Ser Tyr Ser Gln Ala Gly Ile Thr Glu Thr Glu Trp Thr Ser Gly Ser Ser Lys Gly Gly Pro Leu Gln Ala Leu Thr Arg Glu Ser Thr Arg Gly Ser Arg Arg Thr Pro Arg Lys Arg Val Glu Thr Ser Glu His Phe Arg Ile Asp Gly Pro Val Ile Ser Glu Ser Thr Pro Ile Ala Glu Thr Ile Met Ala Ser Ser Asn Glu Ser Leu Val Val Asn Arg Val Thr Gly Asn Phe Lys His Ala Ser Pro Ile Leu Pro Ile Thr Glu Phe Ser Asp Ile Pro Arg Arg Ala Pro Lys Lys Pro Leu Thr Arg Ala Glu Val Gly Glu Lys Thr Glu Glu Arg Arg Val Glu Arg Asp Ile Leu Lys Glu Met Phe Pro Tyr Glu Ala Ser Thr Pro Thr Gly Ile Ser Ala Ser Cys Arg Arg Pro Ile Lys Gly Ala Ala Gly Arg Pro Leu Glu Leu Ser Asp Phe Arg Met Glu Glu Ser Phe Ser Ser Lys Tyr Val Pro Lys Tyr Val Pro Leu Ala Asp Val Lys Ser Glu Lys Thr Lys Lys Gly Arg Ser Ile Pro Val Trp Ile Lys Ile Leu Leu Phe Val Val Val Ala Val Phe Leu Phe Leu Val Tyr Gln Ala Met Glu Thr Asn Gln Val Asn Pro Phe Ser Asn Phe Leu His Val Asp Pro Arg Lys Ser Asn <210> 31 <211> 345 <212> PRT
<213> Homo sapiens <400> 31 Met Pro Glu Phe Leu Glu Asp Pro Ser Val Leu Thr Lys Asp Lys Leu Lys Ser Glu Leu Val Ala Asn Asn Val Thr Leu Pro Ala Gly Glu Gln Arg Lys Asp Val Tyr Val Gln Leu Tyr Leu Gln His Leu Thr Ala Arg Asn Arg Pro Pro Leu Pro Ala Gly Thr Asn Ser Lys Gly Pro Pro Asp Phe Ser Ser Asp Glu Glu Arg Glu Pro Thr Pro Val Leu Gly Ser Gly Ala Ala Ala Ala Gly Arg Ser Arg Ala Ala Val Gly Arg Lys Ala Thr Lys Lys Thr Asp Lys Pro Arg Gln Glu Asp Lys Asp Asp Leu Asp Val Thr Glu Leu Thr Asn Glu Asp Leu Leu Asp Gln Leu Val Lys Tyr Gly Val Asn Pro Gly Pro Ile Val Gly Thr Thr Arg Lys Leu Tyr Glu Lys Lys Leu Leu Lys Leu Arg Glu Gln Gly Thr Glu Ser Arg Ser Ser Thr Pro Leu Pro Thr Ile Ser Ser Ser Ala Glu Asn Thr Arg Gln Asn Gly Ser Asn Asp Ser Asp Arg Tyr Ser Asp Asn Glu Glu Asp Ser Lys Ile Glu Leu Lys Leu Glu Lys Arg Glu Pro Leu Lys Gly Arg Ala Lys Thr Pro Val Thr Leu Lys Gln Arg Arg Val Glu His Asn Gln Val Gly Glu Lys Thr Glu Glu Arg Arg Val Glu Arg Asp Ile Leu Lys Glu Met Phe Pro Tyr Glu Ala Ser Thr Pro Thr Gly Ile Ser Ala Ser Cys Arg Arg Pro Ile Lys Gly Ala Ala Gly Arg Pro Leu Glu Leu Ser Asp Phe Arg Met Glu Glu Ser Phe Ser Ser Lys Tyr Val Pro Lys Tyr Val Pro Leu Ala Asp Val Lys Ser Glu Lys Thr Lys Lys Gly Arg Ser Ile Pro Val Trp Ile Lys Ile Leu Leu Phe Val Val Val Ala Val Phe Leu Phe Leu Val Tyr Gln Ala Met Glu Thr Asn Gln Val Asn Pro Phe Ser Asn Phe Leu His Val Asp Pro Arg Lys Ser Asn <210> 32 <211> 23 <212> DNA
<213> Artificial Sequence <220>
<223> Oligonucleotide AP2 <400> 32 actcactata gggctcgagc ggc 23
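The strand relationship between two of the listed oligonucleotides and the EST of SEQ ID NO:27 can be checked with a short script. The following is a minimal Python sketch, not part of the original listing: the helper name `reverse_complement` and the variable names are illustrative assumptions, and the fragment shown is nucleotides 61 to 180 of SEQ ID NO:27 as printed above. It only shows that ZC15487 (SEQ ID NO:6) occurs on the printed strand and that ZC15486 (SEQ ID NO:7) matches the opposite strand; whether the two were used together as a primer pair is not stated in this section.

```python
# Illustrative sketch only; names below are assumptions, not from the patent.

# Translation table for the four unambiguous DNA bases (lowercase).
_COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a lowercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


# Nucleotides 61-180 of SEQ ID NO:27 (EST934031), copied from the listing.
EST_FRAGMENT = (
    "aaatgtggacccattacatcaactacaaggttcatttttgagaaaaaattggctcaggct"
    "ttactggagcaaggaggaaggctgtcttctttctaccaccatgaggcaggtgtcacagct"
)

ZC15487 = "ggacccattacatcaact"  # SEQ ID NO:6
ZC15486 = "cctccttgctccagtaaa"  # SEQ ID NO:7

# Position of ZC15487 on the printed strand (-1 would mean "not found").
sense_pos = EST_FRAGMENT.find(ZC15487)

# ZC15486 anneals to the opposite strand, so search for its reverse complement.
antisense_pos = EST_FRAGMENT.find(reverse_complement(ZC15486))

if __name__ == "__main__":
    print("ZC15487 found on printed strand:", sense_pos >= 0)
    print("ZC15486 found as reverse complement:", antisense_pos >= 0)
```

The same check also confirms the declared lengths: both oligonucleotides are 18 nucleotides long, matching their `<211>` fields.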
Claims (27)
1. An isolated polypeptide comprising a sequence of amino acid residues that is at least 80% identical in amino acid sequence to residues 1 through 876 of SEQ ID NO:2.
2. An isolated polypeptide according to claim 1, wherein said sequence of amino acid residues is at least 90% identical.
3. An isolated polypeptide according to claim 1, wherein any differences between said polypeptide and residues 1 through 876 of SEQ ID NO:2 are due to conservative amino acid substitutions.
4. An isolated polypeptide according to claim 1, wherein said polypeptide specifically binds with an antibody that specifically binds with a polypeptide consisting of the amino acid sequence of SEQ ID NO:2.
5. An isolated polypeptide according to claim 1, covalently linked to a moiety selected from the group consisting of affinity tags, radionucleotides, enzymes and fluorophores.
6. An isolated polypeptide according to claim 5, wherein said moiety is an affinity tag selected from the group consisting of polyhistidine, FLAG, Glu-Glu, glutathione S transferase and an immunoglobulin heavy chain constant region.
7. An isolated polypeptide comprising the amino acid sequence of SEQ ID NO:2.
8. A fusion protein consisting essentially of a first portion and a second portion joined by a peptide bond, said first portion consisting of a polypeptide comprising a sequence of amino acid residues that is at least 80% identical in amino acid sequence to residues 1 through 876 of SEQ ID NO:2; and said second portion comprising another polypeptide.
9. A pharmaceutical composition comprising a polypeptide according to claim 1, in combination with a pharmaceutically acceptable vehicle.
10. An antibody or antibody fragment that specifically binds to a polypeptide according to claim 1.
11. An antibody according to claim 10, wherein said antibody is selected from the group consisting of:
a) polyclonal antibody;
b) murine monoclonal antibody;
c) humanized antibody derived from b); and
d) human monoclonal antibody.
12. An antibody fragment according to claim 10, wherein said antibody fragment is selected from the group consisting of F(ab'), F(ab), Fab', Fab, Fv, scFv, and minimal recognition unit.
13. An anti-idiotype antibody that specifically binds to said antibody of claim 10.
14. A binding protein that specifically binds to an epitope of a polypeptide according to claim 1.
15. An isolated polynucleotide selected from the group consisting of:
a) a polynucleotide encoding a polypeptide comprising a sequence of amino acid residues that is at least 80% identical in amino acid sequence to residues 1 through 876 of SEQ ID NO:2;
b) a polynucleotide comprising the nucleotide sequence of SEQ ID NO:5;
c) a polynucleotide that remains hybridized following stringent wash conditions to a polynucleotide consisting of the nucleotide sequence of SEQ ID NO:1, or the complement of SEQ ID NO:1.
16. An isolated polynucleotide according to claim 15, wherein said sequence of amino acid residues is at least 90% identical.
17. An isolated polynucleotide according to claim 15, wherein any difference between the amino acid sequence encoded by the polynucleotide and the corresponding amino acid sequence of SEQ ID NO:2 is due to a conservative amino acid substitution.
18. An isolated polynucleotide according to claim 15 comprising nucleotide 127 to nucleotide 2754 of SEQ ID NO:1.
19. An isolated polynucleotide according to claim 15, wherein said polynucleotide is DNA.
20. An expression vector comprising the following operably linked elements:
a transcription promoter;
a DNA segment consisting of a polynucleotide of claim 15; and
a transcriptional terminator.
21. An expression vector according to claim 20, wherein said sequence of amino acid residues is at least 90% identical.
22. An expression vector according to claim 20, wherein any difference between the amino acid sequence encoded by the polynucleotide and the corresponding amino acid sequence of SEQ ID NO:2 is due to a conservative amino acid substitution.
23. An expression vector according to claim 20, wherein said DNA segment encodes a polypeptide covalently linked to an affinity tag selected from the group consisting of polyhistidine, Glu-Glu, glutathione S transferase and an immunoglobulin heavy chain constant region.
24. An expression vector according to claim 20 further comprising a secretory signal sequence operably linked to said DNA segment.
25. A cultured cell into which has been introduced an expression vector according to claim 20, wherein said cell expresses the polypeptide encoded by said DNA segment.
26. A method of producing a ZTMPO-1 polypeptide comprising:
culturing a cell into which has been introduced an expression vector according to claim 20, whereby said cell expresses the polypeptide encoded by said DNA segment; and
recovering said expressed polypeptide.
27. A method for detecting a genetic abnormality in a patient, comprising:
obtaining a genetic sample from a patient;
incubating the genetic sample with a polynucleotide comprising at least 14 contiguous nucleotides of SEQ ID NO:1 or the complement of SEQ ID NO:1, under conditions wherein said polynucleotide will hybridize to complementary polynucleotide sequence, to produce a first reaction product;
comparing said first reaction product to a control reaction product, wherein a difference between said first reaction product and said control reaction product is indicative of a genetic abnormality in the patient.
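Several of the claims above turn on a percent-identity threshold ("at least 80% identical", "at least 90% identical"). The following is a minimal Python sketch of how such a threshold could be checked for two sequences that have already been aligned. It is an illustration under stated assumptions, not the method defined by the patent: the claims in this section do not specify the alignment procedure or scoring matrix, and the function names, the gap character `-`, and the choice of denominator (all alignment columns containing at least one residue) are assumptions made here.

```python
# Illustrative sketch only; the patent's own definition of percent identity
# (alignment method, gap handling) is not given in this section.

def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over two pre-aligned, equal-length sequences.

    Assumption: identity = identical residue columns divided by the number
    of alignment columns that contain at least one residue, times 100.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Drop columns where both sequences carry a gap.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == gap and b == gap)]
    if not columns:
        raise ValueError("alignment contains no residues")
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float) -> bool:
    """True if the aligned pair is at least `threshold` percent identical."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # One-letter codes; first ten residues of SEQ ID NO:31 and a made-up variant.
    reference = "MPEFLEDPSV"
    variant = "MPEFLEDPAA"  # hypothetical, two substitutions
    print(percent_identity(reference, variant))
    print(meets_threshold(reference, variant, 80.0))
```

In practice the alignment itself would come from whatever procedure the specification prescribes; this sketch covers only the final counting step.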
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US6383898A | 1998-04-21 | 1998-04-21 | |
US09/063,838 | 1998-04-21 | ||
PCT/US1999/008601 WO1999054468A1 (en) | 1998-04-21 | 1999-04-19 | Soluble protein ztmpo-1 |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2325822A1 (en) | 1999-10-28 |
Family
ID=22051855
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002325822A Abandoned CA2325822A1 (en) | 1998-04-21 | 1999-04-19 | Soluble protein ztmpo-1 |
Country Status (5)
Country | Link |
---|---|
EP (1) | EP1071780A1 (en) |
JP (1) | JP2002512033A (en) |
AU (1) | AU3863499A (en) |
CA (1) | CA2325822A1 (en) |
WO (1) | WO1999054468A1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2001234523A1 (en) * | 2000-04-10 | 2001-10-23 | Zymogenetics Inc. | Methods for detecting neurological disorders |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5472856A (en) * | 1993-12-21 | 1995-12-05 | Immunobiology Research Institute, Inc. | Recombinant human thymopoietin proteins and uses therefor |
-
1999
- 1999-04-19 EP EP99921412A patent/EP1071780A1/en not_active Withdrawn
- 1999-04-19 WO PCT/US1999/008601 patent/WO1999054468A1/en not_active Application Discontinuation
- 1999-04-19 JP JP2000544800A patent/JP2002512033A/en active Pending
- 1999-04-19 CA CA002325822A patent/CA2325822A1/en not_active Abandoned
- 1999-04-19 AU AU38634/99A patent/AU3863499A/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
AU3863499A (en) | 1999-11-08 |
JP2002512033A (en) | 2002-04-23 |
WO1999054468A1 (en) | 1999-10-28 |
EP1071780A1 (en) | 2001-01-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6436400B1 (en) | Protease-activated receptor PAR4 ZCHEMR2 | |
US6372889B1 (en) | Soluble protein ZTMPO-1 | |
AU1622200A (en) | Mammalian chondromodulin-like protein | |
CA2402614A1 (en) | Insulin homolog polypeptide zins4 | |
US20020012967A1 (en) | Insulin homolog polypeptide zins4 | |
CA2325822A1 (en) | Soluble protein ztmpo-1 | |
US20010044134A1 (en) | Novel secreted polypeptide zsig87 | |
CA2364330A1 (en) | Secreted protein zsig49 | |
MXPA00010232A (en) | Soluble protein ztmpo-1 | |
CA2294702A1 (en) | Mammalian secretory peptide-9 | |
US20010049432A1 (en) | Human semaphorin ZSMF-16 | |
CA2321176A1 (en) | Connective tissue growth factor homologs | |
US6440697B1 (en) | Ring finger protein zapop3 | |
WO2000029430A1 (en) | Ring finger protein zapop3 | |
US20020160487A1 (en) | Testis specific transcription factor ZGCL-1 | |
WO2000020583A1 (en) | Ribonucleoprotein homolog zrnp1, having also homology to the gnrh receptor | |
WO2001040278A2 (en) | Human semaphorin zsmf-16 | |
CA2360584A1 (en) | Mammalian alpha-helical protein - 12 | |
EP0998563A1 (en) | Human chloride ion channel zsig44 | |
US20040110927A1 (en) | Mammalian secretory peptide - 9 | |
CA2296292A1 (en) | Secreted proteins encoded by human chromosome 13 | |
WO2000073458A1 (en) | Secreted alpha-helical protein-31 | |
WO2001042292A2 (en) | Secreted polypeptide zsig87 | |
EP1323823A2 (en) | Mammalian secretory peptide 9, antibodes against it and their use | |
WO2002079248A2 (en) | Mammalian alpha-helical protein-53 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
FZDE | Dead |