AU2002305834A1 - Compositions and methods for high-level, large-scale production of recombinant proteins - Google Patents
Compositions and methods for high-level, large-scale production of recombinant proteinsInfo
- Publication number
- AU2002305834A1 AU2002305834A1 AU2002305834A AU2002305834A AU2002305834A1 AU 2002305834 A1 AU2002305834 A1 AU 2002305834A1 AU 2002305834 A AU2002305834 A AU 2002305834A AU 2002305834 A AU2002305834 A AU 2002305834A AU 2002305834 A1 AU2002305834 A1 AU 2002305834A1
- Authority
- AU
- Australia
- Prior art keywords
- polypeptide
- composition
- line
- protein
- host cell
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 238000000034 method Methods 0.000 title claims description 132
- 239000000203 mixture Substances 0.000 title claims description 53
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 title claims description 47
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 title claims description 47
- 238000011031 large-scale manufacturing process Methods 0.000 title claims description 8
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 222
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 213
- 229920001184 polypeptide Polymers 0.000 claims description 208
- 108090000623 proteins and genes Proteins 0.000 claims description 124
- 239000013598 vector Substances 0.000 claims description 118
- 230000014509 gene expression Effects 0.000 claims description 98
- 102000004169 proteins and genes Human genes 0.000 claims description 90
- 241000282414 Homo sapiens Species 0.000 claims description 62
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 claims description 40
- 241001529936 Murinae Species 0.000 claims description 33
- 238000012217 deletion Methods 0.000 claims description 31
- 230000037430 deletion Effects 0.000 claims description 31
- 238000001890 transfection Methods 0.000 claims description 29
- 230000012010 growth Effects 0.000 claims description 28
- 229950010131 puromycin Drugs 0.000 claims description 20
- BRZYSWJRSDMWLG-CAXSIQPQSA-N geneticin Natural products O1C[C@@](O)(C)[C@H](NC)[C@@H](O)[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](C(C)O)O2)N)[C@@H](N)C[C@H]1N BRZYSWJRSDMWLG-CAXSIQPQSA-N 0.000 claims description 15
- 239000003795 chemical substances by application Substances 0.000 claims description 14
- 239000000725 suspension Substances 0.000 claims description 13
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 claims description 11
- 101000756632 Homo sapiens Actin, cytoplasmic 1 Proteins 0.000 claims description 11
- 230000002103 transcriptional effect Effects 0.000 claims description 9
- 230000013595 glycosylation Effects 0.000 claims description 8
- 238000006206 glycosylation reaction Methods 0.000 claims description 7
- 239000012679 serum free medium Substances 0.000 claims description 7
- 238000004114 suspension culture Methods 0.000 claims description 7
- 230000002018 overexpression Effects 0.000 claims description 5
- 238000013341 scale-up Methods 0.000 claims description 5
- 108010077544 Chromatin Proteins 0.000 claims description 4
- 210000003483 chromatin Anatomy 0.000 claims description 4
- 230000002459 sustained effect Effects 0.000 claims description 3
- 210000004027 cell Anatomy 0.000 description 90
- 235000018102 proteins Nutrition 0.000 description 72
- 102000040430 polynucleotide Human genes 0.000 description 54
- 108091033319 polynucleotide Proteins 0.000 description 54
- 239000002157 polynucleotide Substances 0.000 description 54
- 239000012634 fragment Substances 0.000 description 50
- 230000027455 binding Effects 0.000 description 41
- 239000000427 antigen Substances 0.000 description 39
- 108091007433 antigens Proteins 0.000 description 39
- 102000036639 antigens Human genes 0.000 description 39
- 235000001014 amino acid Nutrition 0.000 description 37
- 229940024606 amino acid Drugs 0.000 description 35
- 150000001413 amino acids Chemical class 0.000 description 35
- 230000000694 effects Effects 0.000 description 29
- 239000013604 expression vector Substances 0.000 description 28
- 230000004927 fusion Effects 0.000 description 28
- 238000004519 manufacturing process Methods 0.000 description 28
- 230000002163 immunogen Effects 0.000 description 27
- 125000005647 linker group Chemical group 0.000 description 25
- 239000013612 plasmid Substances 0.000 description 24
- 102100035360 Cerebellar degeneration-related antigen 1 Human genes 0.000 description 23
- 108020004414 DNA Proteins 0.000 description 23
- 239000013600 plasmid vector Substances 0.000 description 19
- 230000003321 amplification Effects 0.000 description 18
- 238000003199 nucleic acid amplification method Methods 0.000 description 18
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 17
- 108091028043 Nucleic acid sequence Proteins 0.000 description 17
- 239000003550 marker Substances 0.000 description 17
- 125000003729 nucleotide group Chemical group 0.000 description 17
- 206010028980 Neoplasm Diseases 0.000 description 16
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 14
- 125000003275 alpha amino acid group Chemical group 0.000 description 14
- 108091026890 Coding region Proteins 0.000 description 13
- 239000002773 nucleotide Substances 0.000 description 13
- 238000006467 substitution reaction Methods 0.000 description 13
- 238000004458 analytical method Methods 0.000 description 12
- 238000000746 purification Methods 0.000 description 12
- 238000007792 addition Methods 0.000 description 11
- 239000011230 binding agent Substances 0.000 description 11
- 239000002299 complementary DNA Substances 0.000 description 11
- 239000003814 drug Substances 0.000 description 11
- 230000001900 immune effect Effects 0.000 description 11
- 150000007523 nucleic acids Chemical class 0.000 description 11
- 230000014616 translation Effects 0.000 description 11
- 108060003951 Immunoglobulin Proteins 0.000 description 10
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 10
- 238000011161 development Methods 0.000 description 10
- 102000018358 immunoglobulin Human genes 0.000 description 10
- 238000013518 transcription Methods 0.000 description 10
- 230000035897 transcription Effects 0.000 description 10
- 238000013519 translation Methods 0.000 description 10
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 9
- 210000001744 T-lymphocyte Anatomy 0.000 description 9
- 238000003556 assay Methods 0.000 description 9
- 230000016784 immunoglobulin production Effects 0.000 description 9
- 230000036961 partial effect Effects 0.000 description 9
- 108020004705 Codon Proteins 0.000 description 8
- 102000053602 DNA Human genes 0.000 description 8
- 241000238631 Hexapoda Species 0.000 description 8
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 8
- 241000700605 Viruses Species 0.000 description 8
- 230000003993 interaction Effects 0.000 description 8
- 102000039446 nucleic acids Human genes 0.000 description 8
- 108020004707 nucleic acids Proteins 0.000 description 8
- 230000008569 process Effects 0.000 description 8
- 102000007469 Actins Human genes 0.000 description 7
- 108010085238 Actins Proteins 0.000 description 7
- 238000002965 ELISA Methods 0.000 description 7
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 7
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 7
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 7
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 7
- 241000283984 Rodentia Species 0.000 description 7
- 125000000539 amino acid group Chemical group 0.000 description 7
- 201000011510 cancer Diseases 0.000 description 7
- 150000001720 carbohydrates Chemical group 0.000 description 7
- 239000005090 green fluorescent protein Substances 0.000 description 7
- 230000001976 improved effect Effects 0.000 description 7
- 238000011144 upstream manufacturing Methods 0.000 description 7
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 6
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 6
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 6
- 241000270295 Serpentes Species 0.000 description 6
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 6
- 238000013459 approach Methods 0.000 description 6
- 239000003623 enhancer Substances 0.000 description 6
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 6
- 230000004048 modification Effects 0.000 description 6
- 238000012986 modification Methods 0.000 description 6
- 238000010369 molecular cloning Methods 0.000 description 6
- 238000002360 preparation method Methods 0.000 description 6
- 230000009257 reactivity Effects 0.000 description 6
- 239000000523 sample Substances 0.000 description 6
- 210000002966 serum Anatomy 0.000 description 6
- 239000004475 Arginine Substances 0.000 description 5
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 5
- 239000004471 Glycine Substances 0.000 description 5
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 5
- 239000004472 Lysine Substances 0.000 description 5
- 241001465754 Metazoa Species 0.000 description 5
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 5
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 5
- 239000004473 Threonine Substances 0.000 description 5
- 239000002253 acid Substances 0.000 description 5
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 5
- 229960001230 asparagine Drugs 0.000 description 5
- 235000009582 asparagine Nutrition 0.000 description 5
- 238000010367 cloning Methods 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 5
- 102000037865 fusion proteins Human genes 0.000 description 5
- 108020001507 fusion proteins Proteins 0.000 description 5
- 210000004408 hybridoma Anatomy 0.000 description 5
- 230000005847 immunogenicity Effects 0.000 description 5
- 238000003780 insertion Methods 0.000 description 5
- 230000037431 insertion Effects 0.000 description 5
- 238000012216 screening Methods 0.000 description 5
- 239000000126 substance Substances 0.000 description 5
- 239000000758 substrate Substances 0.000 description 5
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 4
- 108091060211 Expressed sequence tag Proteins 0.000 description 4
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 4
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 4
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 4
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 4
- 108091081024 Start codon Proteins 0.000 description 4
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 4
- 229960003767 alanine Drugs 0.000 description 4
- 235000004279 alanine Nutrition 0.000 description 4
- 230000004071 biological effect Effects 0.000 description 4
- 210000004899 c-terminal region Anatomy 0.000 description 4
- 230000000295 complement effect Effects 0.000 description 4
- 201000010099 disease Diseases 0.000 description 4
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 4
- 238000010494 dissociation reaction Methods 0.000 description 4
- 230000005593 dissociations Effects 0.000 description 4
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 4
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 4
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 4
- 238000009396 hybridization Methods 0.000 description 4
- 230000028993 immune response Effects 0.000 description 4
- 238000003018 immunoassay Methods 0.000 description 4
- 230000001965 increasing effect Effects 0.000 description 4
- 230000002452 interceptive effect Effects 0.000 description 4
- 229960000310 isoleucine Drugs 0.000 description 4
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 4
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 4
- 239000000047 product Substances 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 229940124597 therapeutic agent Drugs 0.000 description 4
- 230000001052 transient effect Effects 0.000 description 4
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 4
- 239000004474 valine Substances 0.000 description 4
- 241000282693 Cercopithecidae Species 0.000 description 3
- 241000699802 Cricetulus griseus Species 0.000 description 3
- 102100037840 Dehydrogenase/reductase SDR family member 2, mitochondrial Human genes 0.000 description 3
- 108010013369 Enteropeptidase Proteins 0.000 description 3
- 102100029727 Enteropeptidase Human genes 0.000 description 3
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 3
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 3
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 3
- 241000699666 Mus <mouse, genus> Species 0.000 description 3
- 229930193140 Neomycin Natural products 0.000 description 3
- 108091034117 Oligonucleotide Proteins 0.000 description 3
- 206010035226 Plasma cell myeloma Diseases 0.000 description 3
- 101710182846 Polyhedrin Proteins 0.000 description 3
- 101710188053 Protein D Proteins 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- 101710132893 Resolvase Proteins 0.000 description 3
- 150000007513 acids Chemical class 0.000 description 3
- 238000001042 affinity chromatography Methods 0.000 description 3
- 210000004102 animal cell Anatomy 0.000 description 3
- 229940009098 aspartate Drugs 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 230000002457 bidirectional effect Effects 0.000 description 3
- 238000004113 cell culture Methods 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 239000007795 chemical reaction product Substances 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- -1 cofactors Substances 0.000 description 3
- 230000001186 cumulative effect Effects 0.000 description 3
- 235000018417 cysteine Nutrition 0.000 description 3
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 3
- 230000029087 digestion Effects 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 229930195712 glutamate Natural products 0.000 description 3
- 102000005396 glutamine synthetase Human genes 0.000 description 3
- 108020002326 glutamine synthetase Proteins 0.000 description 3
- 239000000833 heterodimer Substances 0.000 description 3
- 238000001727 in vivo Methods 0.000 description 3
- 239000006166 lysate Substances 0.000 description 3
- 210000004962 mammalian cell Anatomy 0.000 description 3
- 201000000050 myeloid neoplasm Diseases 0.000 description 3
- 229960004927 neomycin Drugs 0.000 description 3
- 210000001672 ovary Anatomy 0.000 description 3
- 238000003752 polymerase chain reaction Methods 0.000 description 3
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 239000004017 serum-free culture medium Substances 0.000 description 3
- 238000002741 site-directed mutagenesis Methods 0.000 description 3
- 239000007787 solid Substances 0.000 description 3
- 239000002904 solvent Substances 0.000 description 3
- 210000004989 spleen cell Anatomy 0.000 description 3
- 238000010561 standard procedure Methods 0.000 description 3
- 230000001225 therapeutic effect Effects 0.000 description 3
- 239000003053 toxin Substances 0.000 description 3
- 231100000765 toxin Toxicity 0.000 description 3
- 108700012359 toxins Proteins 0.000 description 3
- 108700023418 Amidases Proteins 0.000 description 2
- 241000201370 Autographa californica nucleopolyhedrovirus Species 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- FERIUCNNQQJTOY-UHFFFAOYSA-N Butyric acid Chemical compound CCCC(O)=O FERIUCNNQQJTOY-UHFFFAOYSA-N 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 108090000288 Glycoproteins Proteins 0.000 description 2
- 102000003886 Glycoproteins Human genes 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 108010067060 Immunoglobulin Variable Region Proteins 0.000 description 2
- 102000017727 Immunoglobulin Variable Region Human genes 0.000 description 2
- SIKJAQJRHWYJAI-UHFFFAOYSA-N Indole Chemical compound C1=CC=C2NC=CC2=C1 SIKJAQJRHWYJAI-UHFFFAOYSA-N 0.000 description 2
- 102100034872 Kallikrein-4 Human genes 0.000 description 2
- 125000000998 L-alanino group Chemical group [H]N([*])[C@](C([H])([H])[H])([H])C(=O)O[H] 0.000 description 2
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 2
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 2
- 108091005461 Nucleic proteins Proteins 0.000 description 2
- 241000700159 Rattus Species 0.000 description 2
- 241000256251 Spodoptera frugiperda Species 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- 241000723873 Tobacco mosaic virus Species 0.000 description 2
- 241000255985 Trichoplusia Species 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 2
- 102000005922 amidase Human genes 0.000 description 2
- 230000000890 antigenic effect Effects 0.000 description 2
- 235000003704 aspartic acid Nutrition 0.000 description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 210000004900 c-terminal fragment Anatomy 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 2
- 229960001231 choline Drugs 0.000 description 2
- OEYIOHPDSNJKLS-UHFFFAOYSA-N choline Chemical compound C[N+](C)(C)CCO OEYIOHPDSNJKLS-UHFFFAOYSA-N 0.000 description 2
- 230000004186 co-expression Effects 0.000 description 2
- 230000021615 conjugation Effects 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 238000005859 coupling reaction Methods 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 230000004069 differentiation Effects 0.000 description 2
- 230000003292 diminished effect Effects 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 230000002708 enhancing effect Effects 0.000 description 2
- 229940088598 enzyme Drugs 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 230000030279 gene silencing Effects 0.000 description 2
- 235000013922 glutamic acid Nutrition 0.000 description 2
- 239000004220 glutamic acid Substances 0.000 description 2
- 150000004820 halides Chemical class 0.000 description 2
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 2
- 230000007062 hydrolysis Effects 0.000 description 2
- 238000006460 hydrolysis reaction Methods 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- FDGQSTZJBFJUBT-UHFFFAOYSA-N hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 239000000411 inducer Substances 0.000 description 2
- 230000003834 intracellular effect Effects 0.000 description 2
- 108010024383 kallikrein 4 Proteins 0.000 description 2
- 238000002372 labelling Methods 0.000 description 2
- 230000007774 longterm Effects 0.000 description 2
- 238000012423 maintenance Methods 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 210000004379 membrane Anatomy 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 229910052751 metal Inorganic materials 0.000 description 2
- 239000002184 metal Substances 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 229960000485 methotrexate Drugs 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 238000003127 radioimmunoassay Methods 0.000 description 2
- 239000011541 reaction mixture Substances 0.000 description 2
- 238000010188 recombinant method Methods 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 230000028327 secretion Effects 0.000 description 2
- 239000006152 selective media Substances 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 230000010473 stable expression Effects 0.000 description 2
- 230000000638 stimulation Effects 0.000 description 2
- 125000001424 substituent group Chemical group 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 230000008685 targeting Effects 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 125000003396 thiol group Chemical group [H]S* 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 201000008827 tuberculosis Diseases 0.000 description 2
- 210000004881 tumor cell Anatomy 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- 230000003612 virological effect Effects 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- OCUSNPIJIZCRSZ-ZTZWCFDHSA-N (2s)-2-amino-3-methylbutanoic acid;(2s)-2-amino-4-methylpentanoic acid;(2s,3s)-2-amino-3-methylpentanoic acid Chemical compound CC(C)[C@H](N)C(O)=O.CC[C@H](C)[C@H](N)C(O)=O.CC(C)C[C@H](N)C(O)=O OCUSNPIJIZCRSZ-ZTZWCFDHSA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 1
- NFGXHKASABOEEW-UHFFFAOYSA-N 1-methylethyl 11-methoxy-3,7,11-trimethyl-2,4-dodecadienoate Chemical compound COC(C)(C)CCCC(C)CC=CC(C)=CC(=O)OC(C)C NFGXHKASABOEEW-UHFFFAOYSA-N 0.000 description 1
- NHBKXEKEPDILRR-UHFFFAOYSA-N 2,3-bis(butanoylsulfanyl)propyl butanoate Chemical compound CCCC(=O)OCC(SC(=O)CCC)CSC(=O)CCC NHBKXEKEPDILRR-UHFFFAOYSA-N 0.000 description 1
- BFSVOASYOCHEOV-UHFFFAOYSA-N 2-diethylaminoethanol Chemical compound CCN(CC)CCO BFSVOASYOCHEOV-UHFFFAOYSA-N 0.000 description 1
- TVZGACDUOSZQKY-LBPRGKRZSA-N 4-aminofolic acid Chemical compound C1=NC2=NC(N)=NC(N)=C2N=C1CNC1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 TVZGACDUOSZQKY-LBPRGKRZSA-N 0.000 description 1
- 108010066676 Abrin Proteins 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 102100029457 Adenine phosphoribosyltransferase Human genes 0.000 description 1
- 108010024223 Adenine phosphoribosyltransferase Proteins 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 241000024188 Andala Species 0.000 description 1
- 108010083359 Antigen Receptors Proteins 0.000 description 1
- 102000006306 Antigen Receptors Human genes 0.000 description 1
- 206010003445 Ascites Diseases 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 101710132601 Capsid protein Proteins 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- 241000701489 Cauliflower mosaic virus Species 0.000 description 1
- 239000005496 Chlorsulfuron Substances 0.000 description 1
- 102000009016 Cholera Toxin Human genes 0.000 description 1
- 108010049048 Cholera Toxin Proteins 0.000 description 1
- 101710094648 Coat protein Proteins 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- 108010047041 Complementarity Determining Regions Proteins 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 241000557626 Corvus corax Species 0.000 description 1
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 1
- IGXWBGJHJZYPQS-SSDOTTSWSA-N D-Luciferin Chemical compound OC(=O)[C@H]1CSC(C=2SC3=CC=C(O)C=C3N=2)=N1 IGXWBGJHJZYPQS-SSDOTTSWSA-N 0.000 description 1
- LEVWYRKDKASIDU-QWWZWVQMSA-N D-cystine Chemical compound OC(=O)[C@H](N)CSSC[C@@H](N)C(O)=O LEVWYRKDKASIDU-QWWZWVQMSA-N 0.000 description 1
- 101150074155 DHFR gene Proteins 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- CYCGRDQQIOGCKX-UHFFFAOYSA-N Dehydro-luciferin Natural products OC(=O)C1=CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 CYCGRDQQIOGCKX-UHFFFAOYSA-N 0.000 description 1
- 102100024746 Dihydrofolate reductase Human genes 0.000 description 1
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 1
- 238000012286 ELISA Assay Methods 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 108010074860 Factor Xa Proteins 0.000 description 1
- BJGNCJDXODQBOB-UHFFFAOYSA-N Fivefly Luciferin Natural products OC(=O)C1CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 BJGNCJDXODQBOB-UHFFFAOYSA-N 0.000 description 1
- 238000012413 Fluorescence activated cell sorting analysis Methods 0.000 description 1
- 108700004714 Gelonium multiflorum GEL Proteins 0.000 description 1
- 102000053187 Glucuronidase Human genes 0.000 description 1
- 108010060309 Glucuronidase Proteins 0.000 description 1
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 description 1
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 1
- 241000606768 Haemophilus influenzae Species 0.000 description 1
- 102000018713 Histocompatibility Antigens Class II Human genes 0.000 description 1
- 101000935587 Homo sapiens Flavin reductase (NADPH) Proteins 0.000 description 1
- 101000582320 Homo sapiens Neurogenic differentiation factor 6 Proteins 0.000 description 1
- 108090000144 Human Proteins Proteins 0.000 description 1
- 102000003839 Human Proteins Human genes 0.000 description 1
- 238000009015 Human TaqMan MicroRNA Assay kit Methods 0.000 description 1
- UGQMRVRMYYASKQ-UHFFFAOYSA-N Hypoxanthine nucleoside Natural products OC1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 UGQMRVRMYYASKQ-UHFFFAOYSA-N 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 108020005350 Initiator Codon Proteins 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- 125000000393 L-methionino group Chemical group [H]OC(=O)[C@@]([H])(N([H])[*])C([H])([H])C(SC([H])([H])[H])([H])[H] 0.000 description 1
- 125000000174 L-prolyl group Chemical group [H]N1C([H])([H])C([H])([H])C([H])([H])[C@@]1([H])C(*)=O 0.000 description 1
- 125000000773 L-serino group Chemical group [H]OC(=O)[C@@]([H])(N([H])*)C([H])([H])O[H] 0.000 description 1
- 125000000510 L-tryptophano group Chemical group [H]C1=C([H])C([H])=C2N([H])C([H])=C(C([H])([H])[C@@]([H])(C(O[H])=O)N([H])[*])C2=C1[H] 0.000 description 1
- 108090001090 Lectins Proteins 0.000 description 1
- 102000004856 Lectins Human genes 0.000 description 1
- 108090001030 Lipoproteins Proteins 0.000 description 1
- 102000004895 Lipoproteins Human genes 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- DDWFXDSYGUXRAY-UHFFFAOYSA-N Luciferin Natural products CCc1c(C)c(CC2NC(=O)C(=C2C=C)C)[nH]c1Cc3[nH]c4C(=C5/NC(CC(=O)O)C(C)C5CC(=O)O)CC(=O)c4c3C DDWFXDSYGUXRAY-UHFFFAOYSA-N 0.000 description 1
- 108091054438 MHC class II family Proteins 0.000 description 1
- 101710125418 Major capsid protein Proteins 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 102000012750 Membrane Glycoproteins Human genes 0.000 description 1
- 108010090054 Membrane Glycoproteins Proteins 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- 102000018697 Membrane Proteins Human genes 0.000 description 1
- 101100261636 Methanothermobacter marburgensis (strain ATCC BAA-927 / DSM 2133 / JCM 14651 / NBRC 100331 / OCM 82 / Marburg) trpB2 gene Proteins 0.000 description 1
- 241000204795 Muraena helena Species 0.000 description 1
- MSFSPUZXLOGKHJ-UHFFFAOYSA-N Muraminsaeure Natural products OC(=O)C(C)OC1C(N)C(O)OC(CO)C1O MSFSPUZXLOGKHJ-UHFFFAOYSA-N 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 241000187488 Mycobacterium sp. Species 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- KTHDTJVBEPMMGL-VKHMYHEASA-N N-acetyl-L-alanine Chemical compound OC(=O)[C@H](C)NC(C)=O KTHDTJVBEPMMGL-VKHMYHEASA-N 0.000 description 1
- KTHDTJVBEPMMGL-UHFFFAOYSA-N N-acetyl-L-alanine Natural products OC(=O)C(C)NC(C)=O KTHDTJVBEPMMGL-UHFFFAOYSA-N 0.000 description 1
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 1
- 102100030589 Neurogenic differentiation factor 6 Human genes 0.000 description 1
- 101710141454 Nucleoprotein Proteins 0.000 description 1
- 102000015636 Oligopeptides Human genes 0.000 description 1
- 108010038807 Oligopeptides Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 108090000526 Papain Proteins 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 102000057297 Pepsin A Human genes 0.000 description 1
- 108090000284 Pepsin A Proteins 0.000 description 1
- 108010013639 Peptidoglycan Proteins 0.000 description 1
- 102000004861 Phosphoric Diester Hydrolases Human genes 0.000 description 1
- 108090001050 Phosphoric Diester Hydrolases Proteins 0.000 description 1
- 101100124346 Photorhabdus laumondii subsp. laumondii (strain DSM 15139 / CIP 105565 / TT01) hisCD gene Proteins 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- 101710083689 Probable capsid protein Proteins 0.000 description 1
- 206010036790 Productive cough Diseases 0.000 description 1
- 108010029485 Protein Isoforms Proteins 0.000 description 1
- 102000001708 Protein Isoforms Human genes 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 101000762949 Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) Exotoxin A Proteins 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 108010066717 Q beta Replicase Proteins 0.000 description 1
- 108020004518 RNA Probes Proteins 0.000 description 1
- 239000003391 RNA probe Substances 0.000 description 1
- 101000702488 Rattus norvegicus High affinity cationic amino acid transporter 1 Proteins 0.000 description 1
- 108010039491 Ricin Proteins 0.000 description 1
- 206010039491 Sarcoma Diseases 0.000 description 1
- 108010022999 Serine Proteases Proteins 0.000 description 1
- 102000012479 Serine Proteases Human genes 0.000 description 1
- 108010079723 Shiga Toxin Proteins 0.000 description 1
- 241000700584 Simplexvirus Species 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 241000193998 Streptococcus pneumoniae Species 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- 101710136739 Teichoic acid poly(glycerol phosphate) polymerase Proteins 0.000 description 1
- 206010043376 Tetanus Diseases 0.000 description 1
- 102000002933 Thioredoxin Human genes 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 108010065158 Tumor Necrosis Factor Ligand Superfamily Member 14 Proteins 0.000 description 1
- 102000012883 Tumor Necrosis Factor Ligand Superfamily Member 14 Human genes 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 108020002494 acetyltransferase Proteins 0.000 description 1
- 102000005421 acetyltransferase Human genes 0.000 description 1
- 238000005903 acid hydrolysis reaction Methods 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 238000001261 affinity purification Methods 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 229940126575 aminoglycoside Drugs 0.000 description 1
- 229960003896 aminopterin Drugs 0.000 description 1
- 150000008064 anhydrides Chemical class 0.000 description 1
- 229930002877 anthocyanin Natural products 0.000 description 1
- 235000010208 anthocyanin Nutrition 0.000 description 1
- 239000004410 anthocyanin Substances 0.000 description 1
- 150000004636 anthocyanins Chemical class 0.000 description 1
- 230000000340 anti-metabolite Effects 0.000 description 1
- 210000000612 antigen-presenting cell Anatomy 0.000 description 1
- 229940100197 antimetabolite Drugs 0.000 description 1
- 239000002256 antimetabolite Substances 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- 230000001588 bifunctional effect Effects 0.000 description 1
- 238000004166 bioassay Methods 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 239000012472 biological sample Substances 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- 229910052797 bismuth Inorganic materials 0.000 description 1
- 108010006025 bovine growth hormone Proteins 0.000 description 1
- 229940098773 bovine serum albumin Drugs 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000012832 cell culture technique Methods 0.000 description 1
- 230000007910 cell fusion Effects 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 238000011965 cell line development Methods 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- VJYIFXVZLXQVHO-UHFFFAOYSA-N chlorsulfuron Chemical compound COC1=NC(C)=NC(NC(=O)NS(=O)(=O)C=2C(=CC=CC=2)Cl)=N1 VJYIFXVZLXQVHO-UHFFFAOYSA-N 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 238000012875 competitive assay Methods 0.000 description 1
- 230000009918 complex formation Effects 0.000 description 1
- 239000000356 contaminant Substances 0.000 description 1
- 238000012926 crystallographic analysis Methods 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- 229960003067 cystine Drugs 0.000 description 1
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 108020001096 dihydrofolate reductase Proteins 0.000 description 1
- 150000002016 disaccharides Chemical class 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 230000000763 evoking effect Effects 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 239000012091 fetal bovine serum Substances 0.000 description 1
- 238000000684 flow cytometry Methods 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 1
- 238000000799 fluorescence microscopy Methods 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 208000006454 hepatitis Diseases 0.000 description 1
- 231100000283 hepatitis Toxicity 0.000 description 1
- 230000002363 herbicidal effect Effects 0.000 description 1
- 239000004009 herbicide Substances 0.000 description 1
- 101150113423 hisD gene Proteins 0.000 description 1
- 210000004754 hybrid cell Anatomy 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 230000006058 immune tolerance Effects 0.000 description 1
- 230000003053 immunization Effects 0.000 description 1
- 238000002649 immunization Methods 0.000 description 1
- 229940127121 immunoconjugate Drugs 0.000 description 1
- 229940072221 immunoglobulins Drugs 0.000 description 1
- 238000011293 immunotherapeutic strategy Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- PZOUSPYUWWUPPK-UHFFFAOYSA-N indole Natural products CC1=CC=CC2=C1C=CN2 PZOUSPYUWWUPPK-UHFFFAOYSA-N 0.000 description 1
- RKJUIXBNRJVNHR-UHFFFAOYSA-N indolenine Natural products C1=CC=C2CC=NC2=C1 RKJUIXBNRJVNHR-UHFFFAOYSA-N 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 208000037798 influenza B Diseases 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 238000007852 inverse PCR Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 1
- 239000002523 lectin Substances 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 238000007834 ligase chain reaction Methods 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 230000002132 lysosomal effect Effects 0.000 description 1
- 239000006249 magnetic particle Substances 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 229910021645 metal ion Inorganic materials 0.000 description 1
- 150000002739 metals Chemical class 0.000 description 1
- SXTAYKAGBXMACB-UHFFFAOYSA-N methionine S-imide-S-oxide Natural products CS(=N)(=O)CCC(N)C(O)=O SXTAYKAGBXMACB-UHFFFAOYSA-N 0.000 description 1
- 238000002493 microarray Methods 0.000 description 1
- 238000012775 microarray technology Methods 0.000 description 1
- ZAHQPTJLOCWVPG-UHFFFAOYSA-N mitoxantrone dihydrochloride Chemical compound Cl.Cl.O=C1C2=C(O)C=CC(O)=C2C(=O)C2=C1C(NCCNCCO)=CC=C2NCCNCCO ZAHQPTJLOCWVPG-UHFFFAOYSA-N 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 230000000269 nucleophilic effect Effects 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 210000003200 peritoneal cavity Anatomy 0.000 description 1
- 150000004633 phorbol derivatives Chemical class 0.000 description 1
- 239000002644 phorbol ester Substances 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 108700028325 pokeweed antiviral Proteins 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 238000002953 preparative HPLC Methods 0.000 description 1
- 238000007639 printing Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 230000006337 proteolytic cleavage Effects 0.000 description 1
- 150000003212 purines Chemical class 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 150000003248 quinolines Chemical class 0.000 description 1
- 230000005855 radiation Effects 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 230000000405 serological effect Effects 0.000 description 1
- 238000012807 shake-flask culturing Methods 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 210000003802 sputum Anatomy 0.000 description 1
- 208000024794 sputum Diseases 0.000 description 1
- 229940031000 streptococcus pneumoniae Drugs 0.000 description 1
- 230000008093 supporting effect Effects 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 1
- 108060008226 thioredoxin Proteins 0.000 description 1
- 229940094937 thioredoxin Drugs 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 238000012090 tissue culture technique Methods 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 239000001226 triphosphate Substances 0.000 description 1
- 235000011178 triphosphate Nutrition 0.000 description 1
- 125000002264 triphosphate group Chemical class [H]OP(=O)(O[H])OP(=O)(O[H])OP(=O)(O[H])O* 0.000 description 1
- 101150081616 trpB gene Proteins 0.000 description 1
- 101150111232 trpB-1 gene Proteins 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- QAOHCFGKCWTBGC-QHOAOGIMSA-N wybutosine Chemical compound C1=NC=2C(=O)N3C(CC[C@H](NC(=O)OC)C(=O)OC)=C(C)N=C3N(C)C=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O QAOHCFGKCWTBGC-QHOAOGIMSA-N 0.000 description 1
- QAOHCFGKCWTBGC-UHFFFAOYSA-N wybutosine Natural products C1=NC=2C(=O)N3C(CCC(NC(=O)OC)C(=O)OC)=C(C)N=C3N(C)C=2N1C1OC(CO)C(O)C1O QAOHCFGKCWTBGC-UHFFFAOYSA-N 0.000 description 1
Description
COMPOSITIONS AND METHODS FOR HIGH-LEVEL, LARGE-SCALE PRODUCTION OF RECOMBINANT PROTEINS
BACKGROUND OF THE INVENTION
Field of the Invention The present invention relates generally to gene expression and protein production and, more specifically, to compositions .and methods for the overexpression of recombmant proteins. Such compositions and methods are useful in the high-level, large-scale production of recombinant proteins.
Description of Related Art A major goal of the biotechnology industry is the development of stable cell-line based systems for the large-scale expression of recombinant proteins such as, e.g., recombinant antibodies. Standard methodologies require time consuming and labor intensive development of suitable recombinant host cell-lines. Conventionally, cells, such as, e.g., CHO-K1 or CHO DUX, are grown in the presence of fetal bovine serum . nd transfected by the expression vector- of interest. The entire population of cells subsequently undergoes a process of selection to remove cells that failed to take up the expression vector. The vector cont ning pool is then, typically, subcloned and screened for high-level expression. Each of the resulting high-level expressing clones is then expanded and slowly adapted to serum-free, suspension culture which adaptation often results in the loss of expression of the recombinant protein and/or polypeptide. hi addition to these general limitations in recombinant protein expression, efficient functional expression of multi-subunit proteins, such as, e.g., antibodies, requires appropriately balanced expression of both subunit chains. For example, traditional methodologies for the expression of antibody heavy and light chains rely on the co-tansfection of plasmids independently carrying a heavy and light chain coding region makes the maintenance of an equal copy number difficult and provides the potential for transcriptional interference between the genes if the vectors integrate close to one another in the genome.
Thus, in spite of considerable research, there remains a need in the art for improved compositions and methods for high-level, large-scale expression of recombinant proteins and/or polypeptides including antibody heavy and light chains. The present invention fulfills these needs .and further provides other related advantages by utilizing host cell-lines that are pre-adapted for serum-free, suspension culture in combination with suitable expression vectors for recombinant protein expression. Also provided herein are bi-directional UCOE vectors that permit the simultaneous, high- level expression of two or more recombinant proteins .and/or polypeptides from a single UCOE based plasmid vector.
SUMMARY OF THE INVENTION
The present invention is directed, generally, to compositions and methods for the rapid and efficient development of recombinant cell-lines that are suitable for high-level, large-scale development and manufacture of recombinant proteins and/or polypeptides. In one aspect, the present invention provides compositions, comprising:
(a) an immortalized host cell-line, capable of continuous growth in culture, which host cell-line is capable of growth in serum-free suspension culture, and (b) a vector for sustained overexpression of a recombinant protein and/or polypeptide, such as a UCOE- based vector described herein. The present invention, in another aspect, provides methods for the high- level, large-scale production of polypeptides. Particular methods comprise the steps of (a) obtaining an immortalized host cell-line capable of growth in suspension; (b) adapting the host cell-line for growth in serum-free medium; (c) transfecting the resulting immortalized host cell-line capable of growth in suspension and serum-free medium with a vector suitable for overexpression of a recombinant protein and/or polypeptide.
According to the compositions and methods of the present invention, suitable immortalized host cell-lines may possess one or more of the following properties: (a) doubling times of no more than 16 hours, preferably between 12 and 16 hours; (b) transfection efficiency of at least 70%, preferably at least 75%, 80%, 85%,
90% or 95%; (c) susceptible to standard selection agents such as, for example, hygromycin, G418, and puromycin; (d) absence of gal-gal glycosylation of recombinant protein and/or polypeptide.
Exemplary immortalized host cell-lines that may be adapted for use in the presently claimed invention include, but are not limited to, the following commercially available host cell-lines: (a) CHO-S (a Chinese hamster ovary host cell- line); (b) 293-F (a human host cell-line); (c) 293-H (a human host cell-line); (d) COS- 7L (a monkey host cell-line); (e) D.Mel-2 (an insect host cell-line); (f) Sf21 (an insect host cell-line); and (g) Sf9 (an insect host cell-line). Alternatively, suitable host cell- lines may be obtained through routine experimentation following the methodologies disclosed herein.
Vectors for overexpression of recombinant proteins and/or polypeptides suitable for use in the compositions and methods of the present invention may possess one or more of the following properties: (a) contains one or more elements that facilitate high-level, large-scale expression in the immortalized host cell-line and (b) are resistant to repression of the recombinant protein and/or polypeptide.
Within certain embodiments, vectors of the present invention may further comprise one or more universal chiOmatin opening elements (UCOEs) as defined herein below. Additionally or alternatively, vectors as disclosed herein may comprise one or more transcriptional promoters such as, for example, the CMV promoter.
Preferred compositions and methods of the present invention are capable of achieving expression levels of at least 50 mg recombinant protein and/or polypeptide per liter of culture, more preferably at least 100 mg recombinant protein and/or polypeptide per liter, and still more preferably at least 200 mg recombinant protein and/or polypeptide per liter.
The present invention further provides compositions and methods that are capable of scale-up to at least 100 liter scale with yields (per 100 liter culture) of at least 1 gram of protein and/or polypeptide, more preferably at least 5 grams of protein and/or polypeptide, still more preferably at least 10 grams of protein and/or polypeptide, and most preferably at least 20 grams of protein and/or polypeptide.
The present invention still further provides compositions and methods employing bi-directional vector systems for the high-level expression of two or more recombinant proteins on a single UCOE-based plasmid vector. Exemplary bidirectional vector systems may comprise one or more transcriptional promoter selected from the group consisting of the murine CMV promoter, the human CMV promoter, and the human beta-actin promoter.
The present invention also provides compositions and methods for improved expression of one or more recombinant protein comprising an RNP UCOE- based plasmid vector, such as, e.g., CET720GFP, optionally comprising one or more deletions within the 8 kb RNP UCOE portion. Illustrative UCOE deletion constructs will preferably retain significant UCOE activity, e.g., at least about 50%, preferably at least about 75%, and more preferably at least 90% or more of UCOE activity relative to the activity of the 8 kb RNP UCOE element described herein. Exemplary deletions may, optionally, comprise deletions within regions of the RNP UCOE selected from the group consisting of ΔBS, ΔEcoNI, ΔEM, ΔMluI, and ΔRV, as depicted in Table 4 and Figure 14. Deletions within the scope of the present invention are preferably at least 100 bp, more preferably at least 250 bp, still more preferably at least 1000 bp, still more preferably at least 2500 bp and still more preferably at least 4000 bp. Particularly illustrative UCOE vectors of the present invention will thus minimally comprise at least one or more UCOE portions, wherein the UCOE portions retain a desired level of UCOE activity. In one illustrative embodiment, at least about a 4.1 kb UCOE portion corresponding to nucleotide residues 5152-9254 of CET720GFP (SEQ ID NO: 9) is employed. This UCOE portion, for example, has been demonstrated herein to retain a level of UCOE activity comparable to that observed the full 8kb UCOE element corresponding to nucleotide residues 2225-10525 of CET720GFP (SEQ ID NO: 9). These and other UCOE portions can be readily identified, and their activities evaluated, via routine and art-recognized techniques in view of the disclosure provided herein.
These and other aspects of the present invention will become apparent upon reference to the following detailed description and attached drawings. All references disclosed herein are hereby incorporated by reference in their entirety as if each was incorporated individually.
BRIEF DESCRIPTION OF THE DRAWINGS AND SEQUENCE IDENTIFIERS
Figure 1 is a diagrammatic representation of UCOE-based antibody expression cassettes.
Figure 2A .and 2B are plasmid maps of vectors that may be used for expression of recombinant human antibodies. Figure 2A shows a plasmid for expression of recombinant human Ig heavy chain. Figure 2B shows a plasmid for expression of recombinant human Ig kappa light chain.
Figure 3 is a graph depicting antibody expression levels in CHO cells transfected with and without UCOEs. Figure 4 shows the results of scale-up of a CHO-S cell line transfected with vectors expressing the Heavy and Light chains of antibody Abl in shake-flask culture and in a 2 liter bioreactor. The left-hand panel shows antibody titer determined by ELIS A. The right-hand panel shows cell growth.
Figure 5 is a graph depicting the levels of Gal-Gal residues on the surface of murine hybridoma, CHO-K1, and CHO-S cells.
Figure 6 is a diagrammatic representation of the bi-directional UCOE plasmid vector pBDUneolOO.
Figure 7 is a diagrammatic representation of the bi-directional UCOE plasmid vector pBDUneo200. Figure 8 is a diagrammatic representation of the bi-directional UCOE plasmid vector pBDUpuro300.
Figure 9 is a diagrammatic representation of the bi-directional UCOE plasmid vector pBDUpuro400.
Figure 10 is a diagrammatic representation of the bi-directional UCOE plasmid vector pBDUneo500.
Figure 11 is a diagrammatic representation of the bi-directional UCOE plasmid vector pBDUneo600.
Figure 12 is a diagrammatic representation of the bi-directional UCOE plasmid vector pBDUpuro700. Figure 13 is a diagrammatic representation of the bi-directional UCOE plasmid vector pBDUpuroSOO.
Figure 14 is a diagrammatic representation of deletions within the 8 kb RNP UCOE of CET720GFP.
Figure 15 is a diagrammatic representation of the bi-directional UCOE plasmid vector pBDUpuro350. Figure 16 is a diagrammatic representation of the bi-directional UCOE plasmid vector pBDUpuro450.
Figure 17 is a diagrammatic representation of the bi-directional UCOE plasmid vector pBDUneol200.
Figure 18 is a diagrammatic representation of the bi-directional UCOE plasmid vector pBDUpuro 1450.
Figure 19 is a diagrammatic representation of the bi-directional UCOE plasmid vector pBDUneol600.
Figure 20 is a diagrammatic representation of the bi-directional UCOE plasmid vector pBDUpurolδOO. Figure 21 is a graph depicting the antibody production rates for illustrative cell lines containing bi-directional UCOE plasmid vectors.
BRIEF DESCRIPTION OF THE SEQUENCE IDENTIFIERS
SEQ ID NO:l is the polynucleotide sequence of pBDUneolOO.
SEQ ID NO:2 is the polynucleotide sequence of pBDUneo200. SEQ ID NO:3 is the polynucleotide sequence of pBDUpuro300.
SEQ ID NO:4 is the polynucleotide sequence of ρBDUpuro400.
SEQ ID NO: 5 is the polynucleotide sequence of pBDUneo500.
SEQ ID NO: 6 is the polynucleotide sequence of pBDUneoδOO
SEQ ID NO: 7 is the polynucleotide sequence of pBDUpuro700. SEQ ID NO: 8 is the polynucleotide sequence of pBDUpuroδOO.
SEQ ID NO: 9 is the polynucleotide sequence of vector CET720GFP.
SEQ ID NOs: 10-26 represent illustrative primer sequences employed in Example 4 for the production of improved UCOE vectors according to the invention.
SEQ ID NO: 27 is the polynucleotide sequence of pBDUpuro350. SEQ ID NO: 28 is the polynucleotide sequence of pBDUpuro450.
SEQ ID NO: 29 is the polynucleotide sequence of pBDUneol200. SEQ ID NO: 30 is the polynucleotide sequence of pBDUpurol450. SEQ ID NO: 31 is the polynucleotide sequence of pBDUneol600. SEQ ID NO: 32 is the polynucleotide sequence of pBDUpurol800.
DETAILED DESCRIPTION OF THE INVENTION
The present invention is directed generally to compositions and methods for use in high-level, large-scale production of recombinant proteins and/or polypeptides. As described further below, illustrative compositions of the present invention include, but are not restricted to, immortalized, serum-free, suspension host cell-lines in combination with one or more expression vectors suitable for the high- level, large-scale expression of recombinant proteins and or polypeptides.
The practice of the present invention will employ, unless indicated specifically to the contrary, conventional methods of virology, immunology, microbiology, molecular biology and recombinant DNA techniques within the skill of the art, many of which are described below for the purpose of illustration. Such techniques are explained fully in the literature. See, e.g., Sambrook, et al. Molecular Cloning: A Laboratory Manual (2nd Edition, 1989); Maniatis et al. Molecular Cloning: A Laboratory Manual (1982); DNA Cloning: A Practical Approach, vol. I & II (D. Glover, ed.); Oligonucleotide Synthesis (N. Gait, ed., 1984); Nucleic Acid Hybridization (B. Hames & S. Higgins, eds., 1985); Transcription and Translation (B. Hames & S. Higgins, eds., 1984); Animal Cell Culture (R. Freshney, ed., 1986); Perbal, A Practical Guide to Molecular Cloning (1984).
All publications, patents and patent applications cited herein, whether supra or infra, are hereby incorporated by reference in their entirety. As used in this specification and the appended claims, the singular forms
"a," "an" and "the" include plural references unless the content clearly dictates otherwise.
Preparation and Selection of Serum-free, Suspension Host cell-lines
Host cell-lines ideally suitable for use in the compositions and methods of the present invention may have one or more of the following attributes: (a) capable of immortal, continuous growth in culture; (b) adapted for growth in suspension; (c) rapid growth, preferably 12-16 hour doubling time; (d) high transfection efficiency, preferably at least 70%; (e) susceptibility to selection by standard selection agents, preferably hygromycin, G418 or puromycin; (f) protein glycosylation patterns consistent with use as a human therapeutic, preferably the absence of gal-gal glycosylation pattern; and (g) adapted for growth in serum-free medium, preferably chemically-defined, protein-free growth without indirect animal-derived components.
A host cell-line having one or more of these attributes may be used to develop a system for the rapid development of recombinant host cell-lines that may be transferred into development and manufacturing with reduced effort and time as compared to existing methodologies for the high-level, large-scale production of recombinant proteins and/or polypeptides.
For long-term, high-yield production of recombinant proteins, stable expression is generally preferred. For example, cell-lines that stably express a polynucleotide of interest may be transfected using expression vectors which may contain endogenous expression elements and a selectable marker gene on the same or on a separate vector. Following the introduction of the vector, cells may be allowed to grow for 1-2 days in an enriched media before they are switched to selective media. The purpose of the selectable marker is to confer resistance to selection, and its presence allows growth and recovery of cells that successfully express the introduced sequences. Resistant clones of stably transformed cells may be proliferated using tissue culture techniques appropriate to the cell type.
Any number of selection systems may be used to recover transformed cell-lines. These include, but are not limited to, the herpes simplex virus thymidine kinase (Wigler, M. et al. (1977) Cell 11:223-32) and adenine phosphoribosyltransferase (Lowy, I. et al. (1990) Cell 22:817-23) genes which can be employed in tk.sup.- or aprtsup.- cells, respectively. Also, antimetabolite, antibiotic or herbicide resistance can be used as the basis for selection; for example, dhfr which confers resistance to
methotrexate (Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. 77:3567-70); glutamine synthetase (GS) which confers glutamine - independent growth and resistance to methionine sulphoximine (Bebbington et al. (1992) Biotechnology 10(2):169-15; and Cockett et al. (1991) Nucleic Acids Res. 25;19(2):3\9-25 npt, which confers resistance to the aminoglycosides, neomycin and G-418 (Colbere-Garapin, F. et al (1981) J Mol. Biol. 150: 1-14); and als or pat, which confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively (Murry, supra). Additional selectable genes have been described, for example, trpB, which allows cells to utilize indole in place of tryptophan, or hisD, which allows cells to utilize histinol in place of histidine (Hartman, S. C. and R. C. Mulligan (1988) Proc. Natl. Acad. Sci.-85:%041-5\). The use of visible ir rkers has gained popularity with such markers as anthocyanins, beta- glucuronidase and its substrate GUS, and luciferase and its substrate luciferin, being widely used not only to identify transformants, but also to quantify the amount of transient or stable protein expression attributable to a specific vector system (Rhodes, C. A. et al. (1995) Methods Mol. Biol 55:121-131).
Although the presence/absence of marker gene expression suggests that the gene of interest is also present, its presence and expression may need to be confirmed. For example, if the sequence encoding a polypeptide is inserted within a marker gene sequence, recombinant cells containing sequences can be identified by the absence of marker gene function. Alternatively, a marker gene can be placed in tandem with a polypeptide-encoding sequence under the control of a single promoter. Expression of the marker gene in response to induction or selection usually indicates expression of the tandem gene as well.
Alternatively, host cells that contain and express a desired polynucleotide sequence may be identified by a variety of procedures known to those of skill in the art. These procedures include, but are not limited to, DNA-DNA or DNA- RNA hybridizations and protein bioassay or immunoassay techniques which include, for example, membrane, solution, or chip based technologies for the detection and/or quantification of nucleic acid or protein. A variety of protocols for detecting and measuring the expression of polynucleotide-encoded products, using either polyclonal or monoclonal antibodies
specific for the product are known in the art. Examples include enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA), and fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two non-interfering epitopes on a given polypeptide may be preferred for some applications, but a competitive binding assay may also be employed. These and other assays are described, among other places, in Hampton, R. et al. (1990; Serological Methods, a Laboratory Manual, APS Press, St Paul. Minn.) and Maddox, D. E. et al. (1983; J. Exp. Med. 755:1211-1216).
A wide variety of labels and conjugation techniques are known by those skilled in the art and may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization or PCR probes for detecting sequences related to polynucleotides include oligolabeling, nick tr. nslation, end-labeling or PCR amplification using a labeled nucleotide. Alternatively, the sequences, or any portions thereof may be cloned into a vector for the production of an niRNA probe. Such vectors are known in the art, are commercially available, and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety of commercially available kits. Suitable reporter molecules or labels, which may be used include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents as well as substrates, cofactors, inhibitors, magnetic particles, and the like.
Host cells transformed with a polynucleotide sequence of interest may be cultured under conditions suitable for the expression and recovery of the protein from cell culture. The protein produced by a recombinant cell may be secreted or contained intracellularly depending on the sequence and/or the vector used. As will be understood by those of skill in the art, expression vectors containing polynucleotides of the invention may be designed to contain signal sequences which direct secretion of the encoded polypeptide through a prokaryotic or eukaryotic cell membrane. Other recombinant constructions may be used to join sequences encoding a polypeptide of interest to nucleotide sequence encoding a polypeptide domain which will facilitate purification of soluble proteins. Such purification facilitating domains include, but are not limited to, metal chelating peptides such as histidine-tryptophan modules that allow
purification on immobilized metals, protein A domains that allow purification on immobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity purification system (Immunex Corp., Seattle, WA). The inclusion of cleavable linker sequences such as those specific for Factor XA or enterokinase (Invitrogen) between the purification domain and the encoded polypeptide may be used to facilitate purification. One such expression vector provides for expression of a fusion protein containing a polypeptide of interest and a nucleic acid encoding 6 histidine residues preceding a thioredoxin or an enterokinase cleavage site. The histidine residues facilitate purification on IMLAC (immobilized metal ion affinity chromatography) as described in Porath, J. et al. (1992, Prot. Exp. Purif. 5:263-281) while the enterokinase cleavage site provides a means for purifying the desired polypeptide from the fusion protein. A discussion of vectors which contain fusion proteins is provided in Kroll, D. J. et al. (1993; DNA Cell Biol. 72:441-453).
Serum-free, immortal host cell-lines are readily available from a variety of public and/or commercial sources such as, for example, the American Type Culture Collection (ATCC; Manassas, VA); Celox (St. Paul, MN); Invitrogen (Carlsbad, CA); the European and Japanese Cell Banks (ECACC, Salisbury, Wiltshire (UK) and JCRB, Shinjuky, Japan, respectively).
Suitable host cell-lines may be obtained by selecting an existing host cell-line that possesses one or more of the above attributes and adapt and/or select for variants of that host cell-line to obtained the remaining attributes. The use of pre- adapted host cell-lines ensures that the cells are capable of achieving the desired conditions prior to beginning the process of transfection and recombinant protein expression. As noted below, such cell-lines are ideally suited for use in conjunction with UCOE containing expression vectors because these vector systems are characterized by stable, long-term, high-level protein expression.
Exemplary suitable host cell-lines that may be modified and/or adapted for use according to the compositions and methods of the present invention include, but are not limited to, the following: (a) 293-F, a human host cell-line; (b) 293-H, a human host cell-line; (c) COS-7L, a monkey host cell-line; (d) D.MEL-2, an insect host cell-
line; (e) SF21, an insect host cell-line; (f) SF9, an insect host cell-line; and (g) CHO-S, a Chinese hamster ovary host cell-line.
For example, a Chinese hamster ovary subclone (CHO-S; Invitrogen/Gibco) that has been adapted to a commercially available chemically defined, protein free media may be suitably employed in the compositions and methods of the present invention. See, D'Anna et al., Radiation Research 148:260-271 (1997); D'Anna et al., Methods in Cell Science 18:115-125 (19960; Deaven et al., Chromosoma 41:129-144 (1973); Gorfein et al., Animal Cell Technology: Basic & Applied Aspects 9:247-252 (Kluwer Academic Publishers, Netherlands, 1998). The CHO-S host cell- line has a 12 to 16 hour doubling time in shaker flask cultures reaching a peak cell density of 9-11 x 106 viable cells/ml. They are susceptible to hygromycin at 400 ug/ml and geneticin (G418) at 600 ug/ml. The cells grow as attachment independent single cells even in a stationary culture.
The presence of the Gal l→3Galβl→4GlcNAc-R (Gal-Gal) carbohydrate residue on recombinant proteins used clinically has been associated with rapid protein clearance from the serum. Rodent cells typically introduce the terminal Gal-Gal disaccharide into the carbohydrate structures of secreted glycoproteins although the Gal-Gal residue is not found in human glycoproteins. As a result, the ability to produce recombinant protein without this particular carbohydrate structure is advantageous.
The CHO-S host cell-line is particularly well suited for use in conjunction with expression vectors comprising one or more UCOE elements, as noted herein below. This host cell-line possesses favorable growth characteristics and generates undetectable levels of the Gal-Gal carbohydrate moiety in its surface glycoproteins. Thus, the CHO-S host cell-line is suitable for expression of recombinant proteins and/or polypeptides produced for clinical use.
Preparation and Selection of Expression Vectors
Suitable vector systems for expression of recombinant proteins and/or polypeptides according to the present invention may include one or more of the following attributes: (a) ease of manipulation; (b) elements that make high-level
expression site-of-integration independent; (c) elements that make expression resistant to silencing/repression thereby allowing for sustained, stable expression over long periods of time; and (d) elements that express at high-levels in different cell types and in different species. In order to express a desired protein and/or polypeptide, the nucleotide sequences encoding the polypeptide, or functional equivalents, may be inserted into appropriate expression vector, i.e., a vector which contains the necessary elements for the transcription and translation of the inserted coding sequence. Methods which are well known to those skilled in the art may be used to construct expression vectors containing sequences encoding a polypeptide of interest and appropriate transcriptional and translational control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in vivo genetic recombination. Such techniques are described, for example, in Sambrook, J. et al. (1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, Plainview, N.Y., and Ausubel, F. M. et al. (1989) Current Protocols in Molecular Biology, John Wiley & Sons, New York. N.Y.
A variety of expression vector/host systems may be utilized to contain and express polynucleotide sequences. These include, but are not limited to plasmid or cosmid DNA expression vectors; insect cell systems infected with virus expression vectors (e.g., baculovirus); plant cell systems transformed with virus expression vectors (e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV); or animal cell systems.
The "control elements" or "regulatory sequences" present in an expression vector are those non-translated regions of the vector—enhancers, promoters, 5' and 3' untranslated regions— which interact with host cellular proteins to carry out transcription and translation. Such elements may vary in their strength and specificity. Depending on the vector system and host utilized, any number of suitable transcription and translation elements, including constitutive and inducible promoters, may be used. In mammalian cell systems, promoters from mammalian genes or from mammalian viruses are generally preferred. If it is necessary to generate a cell-line that contains multiple copies of the sequence encoding a polypeptide, vectors containing GS or
DHFR selectable markers or vectors based on SV40 or EBV may be advantageously used with an appropriate selectable marker.
An insect system may also be used to express a polypeptide of interest. For example, in one such system, Autographa californica nuclear polyhedrosis virus (AcNPV) is used as a vector to express foreign genes in Spodoptera frugiperda cells or in Trichoplusia larvae. The sequences encoding the polypeptide may be cloned into a non-essential region of the virus, such as the polyhedrin gene, and placed under control of the polyhedrin promoter. Successful insertion of the polypeptide-encoding sequence will render the polyhedrin gene inactive and produce recombinant virus lacking coat protein. The recombinant viruses may then be used to infect, for example, S. frugiperda cells or Trichoplusia larvae in which the polypeptide of interest may be expressed (Engelhard, E. K. et al. (1994) Proc. Natl. Acad. Sci. 91 :3224-3227).
In mammalian host cells, a number of viral-based expression systems are generally available. For example, in cases where an adenovirus is used as an expression vector, sequences encoding a polypeptide of interest may be ligated into an adenovirus transcription/translation complex consisting of the late promoter and tripartite leader sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to obtain a viable virus which is capable of expressing the polypeptide in infected host cells (Logan, J. and Shenk, T. (1984) Proc. Natl. Acad. Sci. 57:3655-3659). In addition, transcription enhancers, such as the Rous sarcoma vims (RSV) enhancer, may be used to increase expression in mammalian host cells.
Specific initiation signals may also be used to achieve more efficient translation of sequences encoding a polypeptide of interest. Such signals include the ATG initiation codon and adjacent sequences. In cases where sequences encoding the polypeptide, its initiation codon, and upstream sequences are inserted into the appropriate expression vector, no additional transcriptional or translational control signals may be needed. However, in cases where only coding sequence, or a portion thereof, is inserted, exogenous translational control signals including the ATG initiation codon should be provided. Furthermore, the initiation codon should be in the correct reading frame to ensure translation of the entire insert. Exogenous translational elements and initiation codons may be of various origins, both natural and synthetic.
The efficiency of expression may be enhanced by the inclusion of enhancers which are appropriate for the particular cell system which is used, such as those described in the literature (Scharf, D. et al. (1994) Results Probl. Cell Differ. 20:125-162).
Exemplary preferred elements suitable for making high-level expression site-of-integration independent include, for example, universal chromatin opening elements (UCOEs). UCOEs are polynucleotide sequences that maintain chromatin in an "open" configuration. See, e.g., Crombie et al., PCT Patent Application No. WO0005393 (2000). Inclusion of a UCOE in an expression vector upsteam of the promoter provides high-levels of expression that are independent of integration site and are resistant to silencing. Efficient expression can be derived from a single copy of an integrated gene site resulting in a higher percentage of cells expressing the marker gene in the selected pool in comparison to standard non-UCOE containing vectors. This, in combination with the utilization of a serum free, suspension adapted parent cell-line allows for rapid production of large quantities of protein in a short period of time. The increased efficiency obtained with the UCOE vector significantly reduces the number of transfectants which need to be screened in order to obtain a high productivity subclone.
Utilization of vectors containing one or more UCOEs in a suspension- adapted host cell-line allows for rapid development and scale-up for production protein and/or polypeptide such as, for example, antibody or fragment thereof. UCOEs allow for screening of a small number of subclones to obtain a clone capable of producing at least 50 mg/L of protein and/or polypeptide, more preferably at least 100 mg/L of protein and/or polypeptide, and still more preferably at least 200 mg/L of protein and/or polypeptide in a 5 week period in serum free conditions.
Preferably, expression vector systems suitable for use in the compositions and methods of the present invention are capable of yielding expression levels in excess of 1 g protein and/or polypeptide per liter of suspension culture. More preferably, expression vectors are capable of use in stable host cell-lines wherein least 20 pg protein and/or polypeptide per cell are achieved per day.
As discussed in detail herein below, within certain embodiments of the present invention, the protein and/or polypeptide may comprise one or more subunits such as, for example, antibody heavy and light chains or fragments thereof. As is well
understood in the art, efficient functional antibody production requires appropriately balanced expression of the heavy and light chains. Transfection of the two chains on separate plasmids makes maintenance of an equal copy number difficult and provides the potential for transcriptional interference between the genes if the vectors integrate close to one another in the genome. Consequently, bi-directional vectors for the co- expression of two genes on the same vector may be employed. As disclosed in further detail in the Examples herein below, exemplary bi-directional UCOE-based vector systems, within the scope of the present invention, may, optionally, be constructed based on the "hybrid" RNP beta-actin UCOE (Cobra Therapeutics). Vectors may comprise one or more antibiotic resistance markers such as, e.g., the neomycin or puromycin resistance markers, and/or may comprise one or more mammalian promoter such as, e.g., the murine CMV promoter (mCMV), the human CMV promoter (hCMV), or the human actin promoters to drive light or heavy chain expression.
Transfection of Host cell-lines with Expression Vectors of the Present Invention Transfection of a standard host cell-line, preadapted to grow in a large scale setting, allows for more rapid cell-line development thereby increasing the transition rate from research into development and manufacturing. In contrast, the traditional approach of using a parent cell-line which requires serum free and suspension adaptation after transfection further increases the need for screening a large number of subclones, because many of the subclones will not be able to grow under conditions that allow large scale protein production. Use of a preadapted cell-line can reduce the time required to develop a cell-line from months to weeks. The cell-line is preadapted to a chemically defined, protein free media and grows rapidly to high cell densities in a shaker flask or bioreactor. Suitable transfection protocols are readily known and/or available to those of skill in the art. Exemplary transfection protocols that are suitable for achieving high-level, large-scale transfection are those recommended by Invitrogen/Gibco for transfection of the CHO-S host cell-line. Generally, positive selection of transfected cells may be achieved using agents such as, for example, hygromycin, G418, and puromycin. Transfection efficiencies are typically at least 70%, more preferably at least
75%, 80%, 85%, 90% or 95%. Following transfection and selection, the pool of resulting clones may, optionally, be further subcloned to identify individual clones with the highest levels of protein expression.
Selection of Cell Culture Conditions Selection and testing of serum-free media suitable for culture of the immortalized suspension cells according to the present invention may be achieved by the skilled artisan by routine experimentation. For CHO-S cells, described herein above, the CD-CHO media is suitable, (e.g, available from Invitrogen or Gibco).
Exemplary Proteins and/or Polypeptides Suitable for High-level, Large-scale Expression As used herein, the terms "protein" and "polypeptide" are used in their conventional meaning, i.e., as a sequence of amino acids. The polypeptides are not limited to a specific length of the product; thus, peptides, oligopeptides, and proteins are included within the definition of polypeptide, and such terms may be used interchangeably herein unless specifically indicated otherwise. This term also does not refer to or exclude post-expression modifications of the polypeptide, for example, glycosylations, acetylations, phosphorylations and the like, as well as other modifications known in the art, both naturally occurring and non-naturally occurring. As noted above, however, preferred proteins and/or polypeptides according to the present invention lack Gal-Gal glycosylation. A polypeptide may be an entire protein, or a subsequence thereof. Particular polypeptides of interest in the context of this invention are a ino acid subsequences comprising epitopes, i.e., antigenic determinants substantially responsible for the immunogenic properties of a polypeptide and being capable of evoking an immune response.
In certain preferred embodiments, the polypeptides produced and/or employed according to the present invention are immunogenic, i.e., they react detectably within an immunoassay (such as an ELISA or T-cell stimulation assay) with antisera and/or T-cells from a patient with a cancer. Screening for immunogenic activity can be performed using techniques well known to the skilled artisan. For example, such screens can be performed using methods such as those described in
Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In one illustrative example, a polypeptide may be immobilized on a solid support and contacted with patient sera to allow binding of antibodies within the sera to the immobilized polypeptide. Unbound sera may then be removed and bound antibodies detected using, for example, 125I-labeled Protein A.
As would be recognized by the skilled artisan, immunogenic portions of the polypeptides produced according to the disclosure provided herein are also encompassed by the present invention. An "immunogenic portion," as used herein, is a fragment of an immunogenic polypeptide of the invention that itself is immunologically reactive (i.e., specifically binds) with the B-cells and/or T-cell surface antigen receptors that recognize the polypeptide. Immunogenic portions may generally be identified using well known techniques, such as those summarized in Paul, Fundamental Immunology, 3rd ed., 243-247 (Raven Press, 1993) and references cited therein. Such techniques include screening polypeptides for the ability to react with antigen-specific antibodies, antisera and/or T-cell-lines or clones. As used herein, antisera and antibodies are "antigen-specific" if they specifically bind to an antigen (i.e., they react with the protein in an ELISA or other immunoassay, and do not react detectably with unrelated proteins). Such antisera and antibodies may be prepared as described herein, and using well-known techniques. In one preferred embodiment, an immunogenic portion of a polypeptide of the present invention is a portion that reacts with antisera and/or T-cells at a level that is not substantially less than the reactivity of the full-length polypeptide (e.g., in an ELISA and/or T-cell reactivity assay). Preferably, the level of immunogenic activity of the immunogenic portion is at least about 50%, preferably at least about 70% and most preferably greater than about 90% of the immunogenicity for the full-length polypeptide. In some instances, preferred immunogenic portions will be identified that have a level of immunogenic activity greater than that of the corresponding full-length polypeptide, e.g., having greater than about 100% or 150% or more immunogenic activity. In certain other embodiments, illustrative immunogenic portions may include peptides in which an N-terminal leader sequence and/or transmembrane domain
have been deleted. Other illustrative immunogenic portions will contain a small N- and/or C-terminal deletion (e.g., 1-30 amino acids, preferably 5-15 amino acids), relative to the mature protein.
In another embodiment, a protein and/or polypeptide made and/or used according to the present invention may also comprise one or more polypeptides that are immunologically reactive with T cells and/or antibodies generated against a polypeptide of the invention, particularly a polypeptide having an amino acid sequence disclosed herein, or to an immunogenic fragment or variant thereof.
A polypeptide "variant," as the term is used herein, is a polypeptide that typically differs from a polypeptide specifically disclosed herein in one or more substitutions, deletions, additions and/or insertions. Such variants may be naturally occurring or may be synthetically generated, for example, by modifying one or more of the above polypeptide sequences of the invention and evaluating their activity as described herein and/or using any of a number of techniques well known in the art. Illustrative variant sequences according to the present invention are those sequences related by homology to the 8kb RNP UCOE sequence provided herein, or a subsequence thereof, which retain a desired degree of UCOE activity.
In one embodiment, for example, particularly illustrative variant sequences of the invention comprise polynucleotide sequences having at least 70%, 75%, 80%, 85%, 90%, 95% or 99% or more identity with a UCOE polynucleotide specifically disclosed herein. Preferably such variants exhibit at least 70%, 75%, 80%, 85%, 90%, 95% or 100% or more UCOE activity when compared with the UCOE activity exhibited by the 8 kb RNP UCOE element disclosed herein.
In many instances, a variant will contain conservative substitutions. A "conservative substitution" is one in which an amino acid is substituted for another amino acid that has similar properties, such that one skilled in the art of peptide chemistry would expect the secondary structure and hydropathic nature of the polypeptide to be substantially unchanged. As described above, modifications may be made in the structure of the polynucleotides and polypeptides of the present invention and still obtain a functional molecule that encodes a variant or derivative polypeptide with desirable characteristics, e.g., with immunogenic characteristics. When it is
desired to alter the amino acid sequence of a polypeptide to create an equivalent, or even an improved, variant or portion of a polypeptide of the invention, one skilled in the art will typically change one or more of the codons of the encoding DNA sequence according to Table 1. For example, certain amino acids may be substituted for other amino acids in a protein structure without appreciable loss of interactive binding capacity with structures such as, for example, antigen-binding regions of antibodies or binding sites on substrate molecules. Since it is the interactive capacity and nature of a protein that defines that protein's biological functional activity, certain amino acid sequence substitutions can be made in a protein sequence, and, of course, its underlying DNA coding sequence, and nevertheless obtain a protein with like properties. It is thus contemplated that various changes may be made in the peptide sequences of the disclosed compositions, or corresponding DNA sequences which encode said peptides without appreciable loss of their biological utility or activity.
Table 1
Amino Acids Codons
Alanine Ala A GCA GCC GCG GCU
Cysteine Cys C UGC UGU
Aspartic acid Asp D GAC GAU
Glutamic acid Glu E GAA GAG
Phenylalanine Phe F UUC UUU
Glycine Gly G GGA GGC GGG GGU
Histidine His H CAC CAU
Isoleucine He I AUA AUC AUU
Lysine Lys K AAA AAG
Leucine Leu L UUA UUG CUA CUC CUG CUU
Methionine Met M AUG
Asparagine Asn N AAC AAU
Proline Pro P CCA CCC CCG CCU
Glutamine Gin Q CAA CAG
Arginine Arg R AGA AGG CGA CGC CGG CGU
Serine Ser S AGC AGU UCA UCC UCG UCU
Threonine Thr T ACA ACC ACG ACU
Valine Val V GUA GUC GUG GUU
Tryptophan Trp w UGG
Tyrosine Tyr Y UAC UAU
In making such changes, the hydropathic index of amino acids may be considered. The importance of the hydropathic amino acid index in conferring interactive biologic function on a protein is generally understood in the art (Kyte and Doolittle, 1982, incorporated herein by reference). It is accepted that the relative hydropathic character of the amino acid contributes to the secondary stracture of the resultant protein, which in turn defines the interaction of the protein with other molecules, for example, enzymes, substrates, receptors, DNA, antibodies, antigens, and the like. Each amino acid has been assigned a hydropathic index on the basis of its hydrophobicity and charge characteristics (K te and Doolittle, 1982). These values are:
isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); methionine (+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5). It is known in the art that certain amino acids may be substituted by other amino acids having a similar hydropathic index or score and still result in a protein with similar biological activity, i.e. still obtain a biological functionally equivalent protein. In making such changes, the substitution of amino acids whose hydropathic indices are within ±2 is preferred, those within ±1 are particularly preferred, and those within ±0.5 are even more particularly preferred. It is also understood in the art that the substitution of like amino acids can be made effectively on the basis of hydrophilicity. U. S. Patent 4,554,101 (specifically incorporated herein by reference in its entirety), states that the greatest local average hydrophilicity of a protein, as governed by the hydrophilicity of its adjacent amino acids, correlates with a biological property of the protein. As detailed in U. S . Patent 4, 554, 101 , the following hydrophilicity values have been assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0 ± 1); glutamate (+3.0 ± 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); threonine (-0.4); proline (-0.5 ± 1); alanine (-0.5); histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (- 2.3); phenylalanine (-2.5); tryptophan (-3.4). It is understood that an amino acid can be substituted for another having a similar hydrophilicity value and still obtain a biologically equivalent, and in particular, an immunologically equivalent protein. In such changes, the substitution of amino acids whose hydrophilicity values are within ±2 is preferred, those within ±1 are particularly preferred, and those within ±0.5 are even more particularly preferred.
As outlined above, amino acid substitutions are generally therefore based on the relative similarity of the amino acid side-chain substituents, for example, their hydrophobicity, hydrophilicity, charge, size, and the like. Exemplary substitutions that take various of the foregoing characteristics into consideration are well known to those of skill in the art and include: arginine and lysine; glutamate and aspartate; serine and threonine; glutamine and asparagine; and valine, leucine and isoleucine.
h addition, any polynucleotide may be further modified to increase stability in vivo. Possible modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' ends; the use of phosphorothioate or 2' O-methyl rather than phosphodiesterase linkages in the backbone; and/or the inclusion of nontraditional bases such as inosine, queosine and wybutosine, as well as acetyl- methyl-, thio- and other modified forms of adenine, cytidine, guanine, thymine and uridine.
Amino acid substitutions may further be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity and/or the amphipathic nature of the residues. For example, negatively charged amino acids include aspartic acid and glutamic acid; positively charged amino acids include lysine and arginine; and amino acids with uncharged polar head groups having similar hydrophilicity values include leucine, isoleucine and valine; glycine and alanine; asparagine and glutamine; and serine, threonine, phenylalanine and tyrosine. Other groups of amino acids that may represent conservative changes include: (1) ala, pro, gly, glu, asp, gin, asn, ser, thr; (2) cys, ser, tyr, thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, trp, his. A variant may also, or alternatively, contain nonconservative changes. In a preferred embodiment, variant polypeptides differ from a native sequence by substitution, deletion or addition of five amino acids or fewer. Variants may also (or alternatively) be modified by, for example, the deletion or addition of amino acids that have minimal influence on the immunogenicity, secondary structure and hydropathic nature of the polypeptide.
As noted above, polypeptides may comprise a signal (or leader) sequence at the N-terminal end of the protein, which co-translationally or post-translationally directs transfer of the protein. The polypeptide may also be conjugated to a linker or other sequence for ease of synthesis, purification or identification of the polypeptide (e.g., poly-His), or to enhance binding of the polypeptide to a solid support. For example, a polypeptide may be conjugated to an immunoglobulin Fc region.
When comparing polypeptide sequences, two sequences are said to be "identical" if the sequence of amino acids in the two sequences is the same when aligned for maximum correspondence, as described below. Comparisons between two
sequences are typically performed by comparing the sequences over a comparison window to identify and compare local regions of sequence similarity. A "comparison window" as used herein, refers to a segment of at least about 20 contiguous positions, usually 30 to about 75, 40 to about 50, in which a sequence may be compared to a reference sequence of the same number of contiguous positions after the two sequences are optimally aligned.
Optimal alignment of sequences for comparison may be conducted using the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, Inc., Madison, WI), using default parameters. This program embodies several alignment schemes described in the following references: Dayhoff, M.O. (1978) A model of evolutionary change in proteins - Matrices for detecting distant relationships. In Dayhoff, M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical Research Foundation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989) CABIOS 5:151-153; Myers, E.W. and Muller W. (1988) CABIOS 4:11-11; Robinson, E.D. (1971) Comb. Theor 77:105; Saitou, N. Nei, M. (1987) Mol. Biol. Evol. 4:406- 425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy — the Principles and Practice of Numerical Taxonomy, Freeman Press, San Francisco, CA; Wilbur, W.J. and Lipman, D.J. (1983) Proc. Natl. Acad, Sci. USA 80:126-130.
Alternatively, optimal alignment of sequences for comparison may be conducted by the local identity algorithm of Smith and Waterman (1981) Add. APL. Math 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 48:443, by the search for similarity methods of Pearson and Lipman (1988) Proc. Natl. Acad. Sci. USA 85: 2444, by computerized implementations of these algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WI), or by inspection.
One preferred example of algorithms that are suitable for determining percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which are described in Altschul et al. (1977) Nucl. Acids Res. 25:3389-3402
and Altschul et al. (1990) J Mol. Biol. 215:403-410, respectively. BLAST and BLAST 2.0 can be used, for example with the parameters described herein, to determine percent sequence identity for the polynucleotides and polypeptides of the invention. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information. For amino acid sequences, a scoring matrix can be used to calculate the cumulative score. Extension of the word hits in each direction are halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T and X determine the sensitivity and speed of the alignment.
In one preferred approach, the "percentage of sequence identity" is determined by comparing two optimally aligned sequences over a window of comparison of at least 20 positions, wherein the portion of the polypeptide sequence in the comparison window may comprise additions or deletions (i.e., gaps) of 20 percent or less, usually 5 to 15 percent, or 10 to 12 percent, as compared to the reference sequences (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the reference sequence (i.e., the window size) and multiplying the results by 100 to yield the percentage of sequence identity.
Within other illustrative embodiments, a polypeptide produced and/or employed according to the present invention may be a xenogeneic polypeptide that comprises a polypeptide having substantial sequence identity, as described above, to the human polypeptide (also termed autologous antigen) which served as a reference polypeptide, but which xenogeneic polypeptide is derived from a different, non-human species. One skilled in the art will recognize that "self antigens are often poor stimulators of CD8+ and CD4+ T-lymphocyte responses, and therefore efficient immunotherapeutic strategies directed against tumor polypeptides require the development of methods to overcome immune tolerance to particular self tumor
polypeptides. For example, humans immunized with prostase protein from a xenogeneic (non human) origin are capable of mounting an immune response against the counterpart human protein, e.g. the human prostase tumor protein present on human tumor cells. Therefore, one aspect of the present invention provides xenogeneic variants of the protein and/or polypeptides described herein.
More particularly, the invention is directed to mouse, rat, monkey, porcine and other non-human polypeptides which can be used as xenogeneic forms of human polypeptides set forth herein.
Within other illustrative embodiments, the present invention may employ and/or produce a fusion polypeptide that comprises multiple polypeptides and/or polypeptide subunits, as described herein, or that comprises at least one polypeptide as described herein and an unrelated sequence. A fusion partner may, for example, assist in providing T helper epitopes (an immunological fusion partner), preferably T helper epitopes recognized by humans, or may assist in expressing the protein (an expression enhancer) at higher yields than the native recombinant protein. Certain preferred fusion partners are both immunological and expression enhancing fusion partners. Other fusion partners may be selected so as to increase the solubility of the polypeptide or to enable the polypeptide to be targeted to desired intracellular compartments. Still further fusion partners include affinity tags, which facilitate purification of the polypeptide. Fusion polypeptides may generally be prepared using standard techniques, including chemical conjugation. Preferably, a fusion polypeptide is expressed as a recombinant polypeptide employing compositions and methods of the present invention, and allowing the production of increased levels in an expression system. Briefly, for example, DNA sequences encoding the polypeptide components may be assembled separately, and ligated into an appropriate expression vector. The 3' end of the DNA sequence encoding one polypeptide component is ligated, with or without a peptide linker, to the 5' end of a DNA sequence encoding the second polypeptide component so that the reading frames of the sequences are in phase. This permits translation into a single fusion polypeptide that retains the biological activity of both component polypeptides .
A peptide linker sequence may be employed to separate the first and second polypeptide components by a distance sufficient to ensure that each polypeptide folds into its secondary and tertiary structures. Such a peptide linker sequence is incorporated into the fusion polypeptide using standard techniques well known in the art. Suitable peptide linker sequences may be chosen based on the following factors: (1) their ability to adopt a flexible extended conformation; (2) their inability to adopt a secondary structure that could interact with functional epitopes on the first and second polypeptides; and (3) the lack of hydrophobic or charged residues that might react with the polypeptide functional epitopes. Preferred peptide linker sequences contain Gly, Asn and Ser residues. Other near neutral amino acids, such as Thr and Ala may also be used in the linker sequence. Amino acid sequences which may be usefully employed as linkers include those disclosed in Maratea et al., Gene 40:39-46, 1985; Murphy et al, Proc. Natl. Acad. Sci. USA 53:8258-8262, 1986; U.S. Patent No. 4,935,233 and U.S. Patent No. 4,751,180. The linker sequence may generally be from 1 to about 50 amino acids in length. Linker sequences are not required when the first and second polypeptides have non-essential N-terminal amino acid regions that can be used to separate the functional domains and prevent steric interference.
The ligated DNA sequences are operably linked to suitable transcriptional or translational regulatory elements. The regulatory elements responsible for expression of DNA are located only 5' to the DNA sequence encoding the first polypeptides. Similarly, stop codons required to end translation and transcription termination signals are only present 3' to the DNA sequence encoding the second polypeptide.
The fusion polypeptide can comprise a polypeptide made and/or described herein together with an unrelated protein, such as an immunogenic protein capable of eliciting a recall response. Examples of such proteins include tetanus, tuberculosis and hepatitis proteins (see, for example, Stoute et al. New Engl. J. Med., 336:86-91, 1997).
In one preferred embodiment, the immunological fusion partner is derived from a Mycobacterium sp., such as a Mycobacterium tuberculosis-derived Ral2 fragment. Ral2 compositions and methods for their use in enhancing the expression
and/or immunogenicity of heterologous polynucleotide/polypeptide sequences is described in U.S. Patent Application 60/158,585, the disclosure of which is incorporated herein by reference in its entirety. Briefly, Ral2 refers to a polynucleotide region that is a subsequence of a Mycobacterium tuberculosis MTB32A nucleic acid. MTB32A is a serine protease of 32 KD molecular weight encoded by a gene in virulent and avirulent strains of M tuberculosis. The nucleotide sequence and amino acid sequence of MTB32A have been described (for example, U.S. Patent Application 60/158,585; see also, Skeiky et al, Infection and Immun. (1999) 67:3998-4007, incorporated herein by reference). C-terminal fragments of the MTB32A coding sequence express at high levels and remain as a soluble polypeptides throughout the purification process. Moreover, Ral2 may enhance the immunogenicity of heterologous immunogenic polypeptides with which it is fused. One preferred Ral2 fusion polypeptide comprises a 14 KD C-terminal fragment corresponding to amino acid residues 192 to 323 of MTB32A. Other preferred Ral2 polynucleotides generally comprise at least about 15 consecutive nucleotides, at least about 30 nucleotides, at least about 60 nucleotides, at least about 100 nucleotides, at least about 200 nucleotides, or at least about 300 nucleotides that encode a portion of a Ral2 polypeptide. Ral2 polynucleotides may comprise a native sequence (i.e., an endogenous sequence that encodes a Ral2 polypeptide or a portion thereof) or may comprise a variant of such a sequence. Ral2 polynucleotide variants may contain one or more substitutions, additions, deletions and/or insertions such that the biological activity of the encoded fusion polypeptide is not substantially diminished, relative to a fusion polypeptide comprising a native Ral2 polypeptide. Variants preferably exhibit at least about 70% identity, more preferably at least about 80% identity and most preferably at least about 90% identity to a polynucleotide sequence that encodes a native Ral2 polypeptide or a portion thereof.
Within other preferred embodiments, an immunological fusion partner is derived from protein D, a surface protein of the gram-negative bacterium Haemophilus influenza B (WO 91/18926). Preferably, a protein D derivative comprises approximately the first third of the protein (e.g., the first N-terminal 100-110 amino acids), and a protein D derivative may be lipidated. Within certain preferred
embodiments, the first 109 residues of a Lipoprotein D fusion partner is included on the N-terminus to provide the polypeptide with additional exogenous T-cell epitopes and to increase the expression level in E. coli (thus functioning as an expression enhancer). The lipid tail ensures optima presentation of the antigen to antigen presenting cells. Other fusion partners include the non-structural protein from influenzae virus, NS1 (hemaglutinin). Typically, the N-terminal 81 amino acids are used, although different fragments that include T-helper epitopes may be used.
In another embodiment, the immunological fusion partner is the protein known as LYTA, or a portion thereof (preferably a C-terminal portion). LYTA is derived from Streptococcus pneumoniae, which synthesizes an N-acetyl-L-alanine amidase known as amidase LYTA (encoded by the LytA gene; Gene 43:265-292, 1986). LYTA is an autolysin that specifically degrades certain bonds in the peptidoglycan backbone. The C-terminal domain of the LYTA protein is responsible for the affinity to the choline or to some choline analogues such as DEAE. This property has been exploited for the development of E. coli C-LYTA expressing plasmids useful for expression of fusion proteins. Purification of hybrid proteins containing the C-LYTA fragment at the amino terminus has been described (see Biotechnology 10:195-198, 1992). Within a preferred embodiment, a repeat portion of LYTA may be incorporated into a fusion polypeptide. A repeat portion is found in the C-terminal region starting at residue 178. A particularly preferred repeat portion incorporates residues 188-305.
Yet another illustrative embodiment involves fusion polypeptides, and the polynucleotides encoding them, wherein the fusion partner comprises a targeting signal capable of directing a polypeptide to the endosomal/lysosomal compartment, as described in U.S. Patent No. 5,633,234. An immunogenic polypeptide of the invention, when fused with this targeting signal, will associate more efficiently with MHC class II molecules and thereby provide enhanced in vivo stimulation of CD4+ T-cells specific for the polypeptide.
In general, protein and/or polypeptides (including fusion polypeptides) of the invention are isolated. An "isolated" polypeptide is one that is removed from its original environment. For example, a naturally-occurring protein or polypeptide is isolated if it is separated from some or all of the coexisting materials in the natural
system. Preferably, such polypeptides are also purified, e.g., are at least about 90% pure, more preferably at least about 95% pure and most preferably at least about 99% pure.
Particularly preferred polypeptides produced by the methods of the present invention include binding agents, such as antibodies and antigen-binding fragments thereof, that exhibit immunological binding to a target polypeptide of interest, such as a polypeptide associated with a particular disease state, or to a portion, variant or derivative thereof. An antibody, or antigen-binding fragment thereof, is said to "specifically bind," "immunogically bind," and/or is "immunologically reactive" to a polypeptide of the invention if it reacts at a detectable level (within, for example, an ELISA assay) with the polypeptide, and does not react detectably with unrelated polypeptides under similar conditions.
Immunological binding, as used in this context, generally refers to the non-covalent interactions of the type which occur between an immunoglobulin molecule and an antigen for which the immunoglobulin is specific. The strength, or affinity of immunological binding interactions can be expressed in terms of the dissociation constant (K ) of the interaction, wherein a smaller K represents a greater affinity. Immunological binding properties of selected polypeptides can be quantified using methods well known in the art. One such method entails measuring the rates of antigen-binding site/antigen complex formation and dissociation, wherein those rates depend on the concentrations of the complex partners, the affinity of the interaction, and on geometric parameters that equally influence the rate in both directions. Thus, both the "on rate constant" (Kon) and the "off rate constant" (K0ff) can be determined by calculation of the concentrations and the actual rates of association and dissociation. The ratio of Koff /Kon enables cancellation of all parameters not related to affinity, and is thus equal to the dissociation constant Kd. See, generally, Davies et al. (1990) Annual Rev. Biochem. 59:439-473.
An "antigen-binding site," or "binding portion" of an antibody refers to the part of the immunoglobulin molecule that participates in antigen binding. The antigen binding site is formed by amino acid residues of the N-terminal variable ("V") regions of the heavy ("H") and light ("L") chains. Three highly divergent stretches
within the V regions of the heavy and light chains are referred to as "hypervariable regions" which are interposed between more conserved flanking stretches known as "framework regions," or "FRs". Thus the term "FR" refers to amino acid sequences which are naturally found between and adjacent to hypervariable regions in immunoglobulins. In an antibody molecule, the three hypervariable regions of a light chain and the three hypervariable regions of a heavy chain are disposed relative to each other in three dimensional space to form an antigen-binding surface. The antigen- binding surface is complementary to the three-dimensional surface of a bound antigen, and the three hypervariable regions of each of the heavy and ight chains are referred to as "complementarity-determining regions," or "CDRs."
Certain binding agents, such as those specific for a tumor-associated protein, will be further capable of differentiating between patients with and without a cancer using the representative assays provided herein and known in the art. For example, antibodies or other binding agents that bind to a tumor protein will preferably generate a signal indicating the presence of a cancer in at least about 20% of patients with the disease, more preferably at least about 30% of patients. Alternatively, or in addition, the antibody will generate a negative signal indicating the absence of the disease in at least about 90% of individuals without the cancer. To determine whether a binding agent satisfies this requirement, biological samples (e.g., blood, sera, sputum, urine and/or tumor biopsies) from patients with and without a cancer (as determined using standard clinical tests) may be assayed as described herein for the presence of polypeptides that bind to the binding agent. Preferably, a statistically significant number of samples with .and without the disease will be assayed. Each binding agent should satisfy the above criteria; however, those of ordinary skill in the art will recognize that binding agents may be used in combination to improve sensitivity. Other binding agents produced according to the present invention will also have therapeutic value based on their specificity for tumor-associated polypeptide sequences.
Any agent that satisfies the above requirements may be a binding agent. For example, a binding agent may be a ribosome, with or without a peptide component, an RNA molecule or a polypeptide. In a preferred embodiment, a binding agent is an antibody or an antigen-binding fragment thereof. Antibodies may be prepared by any of
a variety of techniques known to those of ordinary skill in the art. See, e.g., Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In addition to the methods exemplified herein according to the present invention, numerous antibody production techniques are available to the skilled artisan. For example, antibodies can also be produced by cell culture techniques, including the generation of monoclonal antibodies as described herein, or via transfection of antibody genes into suitable bacterial or mammalian cell hosts, in order to allow for the production of recombinant antibodies, hi one technique, an immunogen comprising the polypeptide is initially injected into any of a wide variety of mammals (e.g., mice, rats, rabbits, sheep or goats). In this step, the polypeptides of this invention may serve as the immunogen without modification. Alternatively, particularly for relatively short polypeptides, a superior immune response may be elicited if the polypeptide is joined to a carrier protein, such as bovine serum albumin or keyhole limpet hemocyanin. The immunogen is injected into the animal host, preferably according to a predetermined schedule incorporating one or more booster immunizations, and the animals are bled periodically. Polyclonal antibodies specific for the polypeptide may then be purified from such antisera by, for example, affinity chromatography using the polypeptide coupled to a suitable solid support.
Monoclonal antibodies specific for an antigenic polypeptide of interest may be prepared, for example, using the technique of Kohler and Milstein, Eur. J. Immunol. (5:511-519, 1976, and improvements thereto. Briefly, these methods involve the preparation of immortal cell-lines capable of producing antibodies having the desired specificity (i.e., reactivity with the polypeptide of interest). Such cell-lines may be produced, for example, from spleen cells obtained from an animal immunized as described above. The spleen cells are then immortalized by, for example, fusion with a myeloma cell fusion partner, preferably one that is syngeneic with the immunized animal. A variety of fusion techniques may be employed. For example, the spleen cells and myeloma cells may be combined with a nonionic detergent for a few minutes and then plated at low density on a selective medium that supports the growth of hybrid cells, but not myeloma cells. A preferred selection technique uses HAT (hypoxanthine, aminopterin, thymidine) selection. After a sufficient time, usually about 1 to 2 weeks,
colonies of hybrids are observed. Single colonies are selected and their culture supernatants tested for binding activity against the polypeptide. Hybridomas having high reactivity and specificity are preferred.
Monoclonal antibodies may be isolated from the supernatants of growing hybridoma colonies. In addition, various techniques may be employed to enhance the yield, such as injection of the hybridoma cell-line into the peritoneal cavity of a suitable vertebrate host, such as a mouse. Monoclonal antibodies may then be-harvested from the ascites fluid or the blood. Contaminants may be removed from the antibodies by conventional techniques, such as chromatography, gel filtration, precipitation, and extraction. The polypeptides of this invention may be used in the purification process in, for example, an affinity chromatography step.
A number of therapeutically useful molecules are known in the art which comprise antigen-binding sites that are capable of exhibiting immunological binding properties of an antibody molecule. The proteolytic enzyme papain preferentially cleaves IgG molecules to yield several fragments, two of which (the "F(ab)" fragments) each comprise a covalent heterodimer that includes an intact antigen-binding site. The enzyme pepsin is able to cleave IgG molecules to provide several fragments, including the "F(ab')2 " fragment which comprises both antigen-binding sites. An "Fv" fragment can be produced by preferential proteolytic cleavage of an IgM, and on rare occasions IgG or IgA immunoglobulin molecule. Fv fragments are, however, more commonly derived using recombinant techniques known in the art. The Fv fragment includes a non-covalent VH-V heterodimer including an antigen-binding site which retains much of the antigen recognition and binding capabilities of the native antibody molecule. Inbar et al. (1972) Proc. Nat. Acad. Sci. USA 69:2659-2662; Hochman et al. (1976) Biochem 15:2706-2710; and Ehrlich et al. (1980) Biochem 19:4091-4096.
A single chain Fv ("sFv") polypeptide is a covalently linked VH-VL heterodimer which is expressed from a gene fusion including VH- and VL-encoding genes linked by a peptide-encoding linker. Huston et al. (1988) Proc. Nat. Acad. Sci. USA 85(16):5879-5883. A number of methods have been described to discern chemical structures for converting the naturally aggregated— but chemically separated— light and heavy polypeptide chains from an antibody V region into an sFv molecule which will
fold into a three dimensional stracture substantially similar to the structure of an antigen-binding site. See, e.g., U.S. Pat. Nos. 5,091,513 and 5,132,405, to Huston et al.; and U.S. Pat. No. 4,946,778, to Ladner et al.
Each of the above-described molecules includes a heavy chain and a light chain CDR set, respectively interposed between a heavy chain and a light chain FR set which provide support to the CDRS and define the spatial relationship of the CDRs relative to each other. As used herein, the term "CDR set" refers to the three hypervariable regions of a heavy or light chain V region. Proceeding from the N- terminus of a heavy or light chain, these regions are denoted as '^CDRl," "CDR2," and "CDR3" respectively. An antigen-binding site, therefore, includes six CDRs, comprising the CDR set from each of a heavy and a light chain V region. A polypeptide comprising a single CDR, (e.g., a CDR1, CDR2 or CDR3) is referred to herein as a "molecular recognition unit." Crystallographic analysis of a number of antigen-antibody complexes has demonstrated that the amino acid residues of CDRs form extensive contact with bound antigen, wherein the most extensive antigen contact is with the heavy chain CDR3. Thus, the molecular recognition units are primarily responsible for the specificity of an antigen-binding site.
As used herein, the term "FR set" refers to the four flanking amino acid sequences which frame the CDRs of a CDR set of a heavy or light chain V region. Some FR residues may contact bound antigen; however, FRs are primarily responsible for folding the V region into the antigen-binding site, particularly the FR residues directly adjacent to the CDRS. Within FRs, certain amino residues and certain structural features are very highly conserved. In this regard, all V region sequences contain an internal disulfide loop of around 90 amino acid residues. When the V regions fold into a binding-site, the CDRs are displayed as projecting loop motifs which form an antigen- binding surface. It is generally recognized that there are conserved structural regions of FRs which influence the folded shape of the CDR loops into certain "canonical" structures— regardless of the precise CDR amino acid sequence. Further, certain FR residues are known to participate in non-covalent interdomain contacts which stabilize the interaction of the antibody heavy and light chains.
A number of "humanized" antibody molecules comprising an antigen- binding site derived from a non-human immunoglobulin have been described, including chimeric antibodies having rodent V regions and their associated CDRs fused to human constant domains (Winter et al. (1991) Nature 349:293-299; Lobuglio et al. (1989) Proc. Nat. Acad. Sci. USA 86:4220-4224; Shaw et al. (1987) J Immunol. 138:4534- 4538; and Brown et al. (1987) Cancer Res. 47:3577-3583), rodent CDRs grafted into a human supporting FR prior to fusion with an appropriate human antibody constant domain (Riechmann et al. (1988) Nature 332:323-327; Verhoeyen et al. (1988) Science 239:1534-1536; and Jones et al. (1986) Nature 321:522-525), and rodent CDRs supported by recombinantly veneered rodent FRs (European Patent Publication No. 519,596, published Dec. 23, 1992). These "humanized" molecules are designed to minimize unwanted immunological response toward rodent antihuman antibody molecules which limits the duration and effectiveness of therapeutic applications of those moieties in human recipients. As used herein, the terms "veneered FRs" and "recombinantly veneered
FRs" refer to the selective replacement of FR residues from, e.g., a rodent heavy or light chain V region, with human FR residues in order to provide a xenogeneic molecule comprising an antigen-binding site which retains substantially all of the native FR polypeptide folding structure. Veneering techniques are based on the understanding that the ligand binding characteristics of an antigen-binding site are determined primarily by the structure and relative disposition of the heavy and light chain CDR sets within the antigen-binding surface. Davies et al. (1990) Ann. Rev. Biochem. 59:439-473. Thus, antigen binding specificity can be preserved in a humanized antibody only wherein the CDR structures, their interaction with each other, and their interaction with the rest of the V region domains are carefully maintained. By using veneering techniques, exterior (e.g., solvent-accessible) FR residues which are readily encountered by the immune system are selectively replaced with human residues to provide a hybrid molecule that comprises either a weakly immunogenic, or substantially non-immunogenic veneered surface. The process of veneering makes use of the available sequence data for human antibody variable domains compiled by Kabat et al., in Sequences of Proteins of
Immunological Interest, 4th ed., (U.S. Dept. of Health and Human Services, U.S. Government Printing Office, 1987), updates to the Kabat database, and other accessible U.S. and foreign databases (both nucleic acid and protein). Solvent accessibilities of V region amino acids can be deduced from the known three-dimensional stracture for human and murine antibody fragments. There are two general steps in veneering a murine antigen-binding site. Initially, the FRs of the variable domains of an antibody molecule of interest are compared with corresponding FR sequences of human variable domains obtained from the above-identified sources. The most homologous human V regions are then compared residue by residue to corresponding murine amino acids. The residues in the murine FR which differ from the human counterpart are replaced by the residues present in the human moiety using recombinant techniques well known in the art. Residue switching is only carried out with moieties which are at least partially exposed (solvent accessible), and care is exercised in the replacement of amino acid residues which may have a significant effect on the tertiary structure of V region domains, such as proline, glycine and charged amino acids.
In this manner, the resultant "veneered" murine antigen-binding sites are thus designed to retain the murine CDR residues, the residues substantially adjacent to the CDRs, the residues identified as buried or mostly buried (solvent inaccessible), the residues believed to participate in non-covalent (e.g., electrostatic and hydrophobic) contacts between heavy and light chain domains, and the residues from conserved structural regions of the FRs which are believed to influence the "canonical" tertiary structures of the CDR loops. These design criteria are then used to prepare recombinant nucleotide sequences which combine the CDRs of both the heavy and light chain of a murine antigen-binding site into human-appearing FRs that can be used to transfect mammalian cells for the expression of recombinant human antibodies which exhibit the antigen specificity of the murine antibody molecule.
In another embodiment of the invention, antibodies produced according to the present invention may be coupled to one or more therapeutic agents. Suitable agents in this regard include radionuclides, differentiation inducers, drugs, toxins, and derivatives thereof. Preferred radionuclides include 90Y, 123I, 125I, 131L 186Re, Re,
01 1 919
At, and Bi. Preferred drugs include methotrexate, .and pyrimidine and purine
analogs. Preferred differentiation inducers include phorbol esters and butyric acid. Preferred toxins include ricin, abrin, diptheria toxin, cholera toxin, gelonin, Pseudomonas exotoxin, Shigella toxin, and pokeweed antiviral protein.
A therapeutic agent may be coupled (e.g., covalently bonded) to a suitable monoclonal antibody either directly or indirectly (e.g., via a linker group). A direct reaction between an agent and an antibody is possible when each possesses a substituent capable of reacting with the other. For example, a nucleophilic group, such as an amino or sulfhydryl group, on one may be capable of reacting with a carbonyl- containing group, such as an anhydride or an acid halide, or with an alkyl group containing a good leaving group (e.g., a halide) on the other.
Alternatively, it may be desirable to couple a therapeutic agent and an antibody via a linker group. A linker group can function as a spacer to distance an antibody from an agent in order to avoid interference with binding capabilities. A linker group can also serve to increase the chemical reactivity of a substituent on an agent or an antibody, and thus increase the coupling efficiency. An increase in chemical reactivity may also facilitate the use of agents, or functional groups on agents, which otherwise would not be possible.
It will be evident to those skilled in the art that a variety of bifunctional or polyfunctional reagents, both homo- and hetero-functional (such as those described in the catalog of the Pierce Chemical Co., Rockford, IL), may be employed as the linker group. Coupling may be effected, for example, through amino groups, carboxyl groups, sulfhydryl groups or oxidized carbohydrate residues. There are numerous references describing such methodology, e.g., U.S. Patent No. 4,671,958, to Rodwell et al.
Where a therapeutic agent is more potent when free from the antibody portion of the immunoconjugates of the present invention, it may be desirable to use a linker group that is cleavable during or upon internalization into a cell. A number of different cleavable linker groups have been described. The mechanisms for the intracellular release of an agent from these linker groups include cleavage by reduction of a disulfide bond (e.g., U.S. Patent No. 4,489,710, to Spitler), by irradiation of a photolabile bond (e.g., U.S. Patent No. 4,625,014, to Senter et al.), by hydrolysis of derivatized amino acid side chains (e.g., U.S. Patent No. 4,638,045, to Kohn et al.), by
serum complement-mediated hydrolysis (e.g., U.S. Patent No. 4,671,958, to Rodwell et al.), and acid-catalyzed hydrolysis (e.g., U.S. Patent No. 4,569,789, to Blattler et al.).
Polynucleotides Suitable for Expressing Proteins and/or Polypeptides
The present invention, in other aspects, provides polynucleotides that encode the recombinant proteins and/or polypeptides disclosed herein above. The terms "DNA" and "polynucleotide" are used essentially interchangeably herein to refer to a DNA molecule that has been isolated free of total genomic DNA of a particular species. "Isolated," as used herein, means that a polynucleotide is substantially away from other coding sequences, and that the DNA molecule does not contain large portions of unrelated coding DNA, such as large chromosomal fragments or other functional genes or polypeptide coding regions. Of course, this refers to the DNA molecule as originally isolated, and does not exclude genes or coding regions later added to the segment by the hand of man.
Polynucleotides may comprise a native sequence (i.e. an endogenous sequence that encodes a protein and/or polypeptide, for example an antibody, or portion thereof) or may comprise a sequence that encodes a variant or derivative, preferably and immunogenic variant or derivative, of such a sequence. In certain embodiments, the polynucleotide sequences may encode immunogenic polypeptides, as described above.
Typically, polynucleotide variants will contain one or more substitutions, additions, deletions and/or insertions, preferably such that the immunogenicity of the polypeptide encoded by the variant polynucleotide is not substantially diminished relative to a polypeptide encoded by a polynucleotide sequence specifically set forth herein). The term "variants" should also be understood to encompass homologous genes of xenogeneic origin. The polynucleotides of the present invention, or fragments thereof, regardless of the length of the coding sequence itself, may be combined with other DNA sequences, such as promoters, polyadenylation signals, additional restriction enzyme sites, multiple cloning sites, other coding segments, and the like, such that their overall length may vary considerably. It is therefore contemplated that a nucleic acid fragment of almost any length may be employed, with the total length preferably being limited by
the ease of preparation and use in the intended recombinant DNA protocol. For example, illustrative polynucleotide segments with total lengths of about 10,000, about 5000, about 3000, about 2,000, about 1,000, about 500, about 200, about 100, about 50 base pairs in length, and the like, (including all intermediate lengths) are contemplated to be useful in many implementations of this invention.
Polynucleotides suitable for high-level, large-scale expression according to the present invention may be identified, prepared and/or manipulated using any of a variety of well established techniques (see generally, Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratories, Cold Spring Harbor, NY, 1989, and other like references). For example, a polynucleotide may be identified by screening a microarray of cDNAs for tumor-associated expression. Such screens may be performed, for example, using the microarray technology of Affymetrix, Inc. (Santa Clara, CA) according to the manufacturer's instructions (and essentially as described by Schena et al., Proc. Natl. Acad. Sci. USA 93: 10614-10619, 1996 and Heller et al., Proc. Natl Acad. Sci. USA :2150-2155, 1997). Alternatively, polynucleotides may be amplified from cDNA prepared from cells expressing the proteins described herein, such as tumor cells.
Many template dependent processes are available to amplify a target sequences of interest present in a sample. One of the best known amplification methods is the polymerase chain reaction (PCR™) which is described in detail in U.S. Patent Nos. 4,683,195, 4,683,202 and 4,800,159, each of which is incorporated herein by reference in its entirety. Briefly, in PCR™, two primer sequences are prepared which are complementary to regions on opposite complementary strands of the target sequence. An excess of deoxynucleoside triphosphates is added to a reaction mixture along with a DNA polymerase (e.g., Tag polymerase). If the target sequence is present in a sample, the primers will bind to the target and the polymerase will cause the primers to be extended along the target sequence by adding on nucleotides. By raising and lowering the temperature of the reaction mixture, the extended primers will dissociate from the target to form reaction products, excess primers will bind to the target and to the reaction product and the process is repeated. Preferably reverse transcription and PCR™ amplification procedure may be performed in order to quantify
the amount of mRNA amplified. Polymerase chain reaction methodologies are well known in the art.
Any of a number of other template dependent processes, many of which are variations of the PCR ™ amplification technique, are readily known and available in the art. Illustratively, some such methods include the ligase chain reaction (referred to as LCR), described, for example, in Eur. Pat. Appl. Publ. No. 320,308 and U.S. Patent No. 4,883,750; Qbeta Replicase, described in PCT Intl. Pat. Appl. Publ. No. PCT/US87/00880; Strand Displacement Amplification (SDA) and Repair Chain Reaction (RCR). Still other amplification methods are described in Great Britain Pat. Appl. No. 2 202 328, and in PCT Intl. Pat. Appl. Publ. No. PCT/US89/01025. Other nucleic acid amplification procedures include transcription-based amplification systems (TAS) (PCT Intl. Pat. Appl. Publ. No. WO 88/10315), including nucleic acid sequence based amplification (NASBA) and 3SR. Eur. Pat. Appl. Publ. No. 329,822 describes a nucleic acid amplification process involving cyclically synthesizing single-stranded RNA ("ssRNA"), ssDNA, and double-stranded DNA (dsDNA). PCT Intl. Pat. Appl. Publ. No. WO 89/06700 describes a nucleic acid sequence amplification scheme based on the hybridization of a promoter/primer sequence to a target single-stranded DNA ("ssDNA") followed by transcription of many RNA copies of the sequence. Other amplification methods such as "RACE" (Frohman, 1990), and "one-sided PCR" (Ohara, 1989) are also well-known to those of skill in the art.
An amplified portion of a polynucleotide of the present invention may be used to isolate a full length gene from a suitable library (e.g., a tumor cDNA library) using well known techniques. Within such techniques, a library (cDNA or genomic) is screened using one or more polynucleotide probes or primers suitable for amplification. Preferably, a library is size-selected to include larger molecules. Random primed libraries may also be preferred for identifying 5' and upstream regions of genes. Genomic libraries are preferred for obtaining introns and extending 5' sequences. Alternatively, or in addition, essentially any amplified polynucleotide may be employed in routine subcloning techniques in order to arrive at a UCOE-based vector according to this invention.
For hybridization techniques, a partial sequence may be labeled (e.g., by nick-translation or end-labeling with 32P) using well known techniques. A bacterial or bacteriophage library is then generally screened by hybridizing filters containing denatured bacterial colonies (or lawns containing phage plaques) with the labeled probe (see Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratories, Cold Spring Harbor, NY, 1989). Hybridizing colonies or plaques are selected and expanded, and the DNA is isolated for further analysis. cDNA clones may be analyzed to determine the amount of additional sequence by, for example, PCR using a primer from the partial sequence and a primer from the vector. Restriction maps and partial sequences may be generated to identify one or more overlapping clones. The complete sequence may then be determined using standard techniques, which may involve generating a series of deletion clones. The resulting overlapping sequences can then assembled into a single contiguous sequence. A full length cDNA molecule can be generated by ligating suitable fragments, using well known techniques. Alternatively, amplification techniques, such as those described above, can be useful for obtaining a full length coding sequence from a partial cDNA sequence. One such amplification technique is inverse PCR (see Triglia et al., Nucl. Acids Res. 7(5:8186, 1988), which uses restriction enzymes to generate a fragment in the known region of the gene. The fragment is then circularized by intramolecular ligation and used as a template for PCR with divergent primers derived from the known region. Within an alternative approach, sequences adjacent to a partial sequence may be retrieved by amplification with a primer to a linker sequence and a primer specific to a known region. The amplified sequences are typically subjected to a second round of amplification with the same linker primer and a second primer specific to the known region. A variation on this procedure, which employs two primers that initiate extension in opposite directions from the known sequence, is described in WO 96/38591. Another such technique is known as "rapid amplification of cDNA ends" or RACE. This technique involves the use of an internal primer and an external primer, which hybridizes to a polyA region or vector sequence, to identify sequences that are 5' and 3' of a known sequence. Additional techniques include capture PCR (Lagerstrom et al., PCR Methods Applic. 7:111-19, 1991) and walking PCR (Parker et al., Nucl. Acids.
Res. 7P:3055-60, 1991). Other methods employing amplification may also be employed to obtain a full length cDNA sequence.
In certain instances, it is possible to obtain a full length cDNA sequence by analysis of sequences provided in an expressed sequence tag (EST) database, such as that available from GenBank. Searches for overlapping ESTs may generally be performed using well known programs (e.g., NCBI BLAST searches), and such ESTs may be used to generate a contiguous full length sequence. Full length DNA sequences may also be obtained by analysis of genomic fragments.
In certain preferred embodiments of the invention, polynucleotide sequences or fragments thereof are employed in the construction and/or use of UCOE- based vectors and encode one or more polypeptides of interest, such as antibodies or fusion proteins or functional equivalents thereof. Due to the inherent degeneracy of the genetic code, other DNA sequences that encode substantially the same or a functionally equivalent amino acid sequence may be produced and these sequences may be used to clone and express a given polypeptide.
As will be understood by those of skill in the art, it may be advantageous in some instances to produce polypeptide-encoding nucleotide sequences possessing non-naturally occurring codons. For example, codons preferred by a particular prokaryotic or eukaryotic host can be selected to increase the rate of protein expression or to produce a recombinant RNA transcript having desirable properties, such as a half- life which is longer than that of a transcript generated from the naturally occurring sequence.
Moreover, the polynucleotide sequences of the present invention can be engineered using methods generally known in the art in order to alter polypeptide encoding sequences for a variety of reasons, including but not limited to, alterations which modify the cloning, processing, and/or expression of the gene product. For example, DNA shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic oligonucleotides may be used to engineer the nucleotide sequences. In addition, site-directed mutagenesis may be used to insert new restriction sites, alter glycosylation patterns, change codon preference, produce splice variants, or introduce mutations, and so forth.
A newly synthesized peptide may be substantially purified, for example, by preparative high performance liquid chromatography (e.g., Creighton, T. (1983) Proteins, Structures and Molecular Principles, WH Freeman and Co., New York, N.Y.) or other comparable techniques available in the art. The composition of the synthetic peptides may be confirmed by amino acid analysis or sequencing (e.g., the Edman degradation procedure). Additionally, the amino acid sequence of a polypeptide, or any part thereof, may be altered during direct synthesis and/or combined using chemical methods with sequences from other proteins, or any part thereof, to produce a variant polypeptide. The following Examples are offered by way of illustration not limitation.
EXAMPLES
EXAMPLE 1 EXPRESSION OF RECOMBINANT ANTIBODY IN A UCOE-BASED EXPRESSION VECTOR
SYSTEM This example discloses a comparison between the expression levels of recombinant antibodies using vectors with and without UCOEs.
Engineered human antibody Ab3 was expressed from vectors containing a human RNP UCOE as shown in Figure 1. Identical vectors, but without the UCOE element, were also constructed. The Ig heavy chain coding sequence in this example comprises an engineered human V-region sequence introduced upstream of and in frame with a genomic DNA fragment encoding a human Ig gamma- 1 constant region. The Ig light chain coding sequence comprises an engineered human V-region sequence introduced upstream of and in frame with a cDNA fragment encoding a human Ig kappa constant region. The vector for expression of the Ig heavy chain additionally contains a neo selectable marker gene and the vector for expression of the Ig light chain contains a hygromycin selectable marker. See Figure 2A.
CHO-K1 cells were co-transfected with the light-chain and heavy-chain vectors using lipofectamine (Life Technologies) according to the manufacturers' instructions. Cells were selected using hygromycin and G418. Pools of transfectants
were maintained and levels of assembled immunoglobulin secreted into culture medium were determined by ELISA at various times post-transfection. (Figure 3). In the absence of the RNP UCOE, antibody expression levels were low (approximately 48 ng/ml) 48 hours after transfection and declined thereafter. In contrast, in transfection pools from expression vectors containing the RNP UCOE, antibody levels continued to accumulate as the transfected cultures were exp.anded, reaching 3 micrograms/ml 15 days post-transfection. Thus, use of UCOEs permited rapid generation of pools of transfected cells that express high levels of recombinant immunoglobulin.
EXAMPLE 2 HIGH-LEVEL, LARGE-SCALE EXPRESSION ACHIEVED IN CHO HOST CELL-LINE
TRANSFECTED WITH UCOE-BASED EXPRESSION VECTOR SYSTEM
CHO-S cells were co-transfected with vectors containing UCOE antibody expression cassettes (shown in Figure 1) to produce the engineered human antibody Abl. The Ig heavy chain coding sequence comprises an engineered human V- region sequence introduced upstream of and in frame with a genomic DNA fragment encoding a human Ig gamma-4 constant region. The Ig light chain coding sequence comprises an engineered human V-region sequence introduced upstream of and in frame with a cDNA fragment encoding a human Ig kappa constant region. The vector for expression of the Ig Heavy chain additionally contains a neo selectable marker gene and the vector for expression of the Ig light chain contains a hygromycin selectable marker. See Figure 2B.
Transfections were carried out using lipofectamine (Life Technologies) according to the manufacturers' instructions. Cells were selected using hygromycin and G418 in CD-CHO medium (Life Technologies) and subclones were selected. This process took approximately 5 weeks. One subclone was scaled into a 2L bioreactor to perform final parameter optimization before being scaled into a 100L bioreactor. Production rates from the majority of transfectants expressing recombinant antibodies were typically approximately 5 pg/cell/day using this approach. Yields of one antibody in suspension culture reached approximately 200 mg/1. See Figure 4. The inclusion of the UCOE in the two expression vectors co-transfected into CHO-S cells resulted in
rapid isolation of a transfectant clone that could immediately be cultured in suspension in a defined medium.
EXAMPLE 3 Low LEVELS OF GAL-GAL RESIDUES ON CHO-Kl AND CHO-S HOST CELL-LINES As discussed hereinabove, the presence of the
Galαl→3Galβl→4GlcNAc-R (Gal-Gal) carbohydrate residue on antibodies used as human therapeutics has been associated with rapid protein clearance from the serum. As a result, the ability to produce recombinant protein without this residue is advantageous. See, e.g., Borrebaeck et al., Immunology Today 14:477-479 (1993) and Kagawa et al., J Biol Chem. 263:17508-17515 (1988). Utilizing the FITC labeled IB lectin and flow cytometry it was demonstrated that the Gal-Gal residue is not present on the surface of CHO-S cells. See Figure 5; methodology disclosed in Cho et al., J Biol. Chem. 272:13622-13628 (1997) and Gorelik et al., Cancer Res. 55:4185-4173 (1995). In this respect, CHO-S resembles the other widely used CHO line tested, CHO-Kl . In contrast, the mouse hybridoma cell-line tested in this experiment showed high levels of cell-surface associated Gal-Gal carbohydrate. Mass spectroscopy of a purified recombinant protein produced in the cell-line demonstrated the absence of the Gal-Gal residue (data not shown).
EXAMPLE 4 BI-DIRECTIONAL UCOE VECTORS FOR IMPROVED EXPRESSION LEVELS
OF MULTI-SUBUNIT RECOMBINANT PROTEINS
This Example discloses improved expression of recombinant antibody heavy and light protein chains on bi-directional UCOE vector systems.
The two Sfi I sites of pORTl (Cobra Therapeutics) were changed to Mfe I sites by introduction of adapter molecules comprised of annealed oligos Mfe.F, 5'~ AACAATTGGCGGC (SEQ ID NO: 10) and Mfe.R, 5'-GCCAATTGTTGCC (SEQ ID NO: 11). The HSV TK polyA site was then amplified from pVgRXR (Invitrogen) with primers TK.F, 5'ACGCGTCGACGGAAGGAGACAATACCGGAAG (SEQ ID NO: 12) and TK.R, 5'-CCGCTCGAGTTGGGGTGGGGAAAAGGAA (SEQ ID NO: 13),
and the Sal I to Xho I fragment was inserted into the Sal I site. Following this, the murine PGK polyA site was amplified from male BALB/c genomic DNA (Clontech) using primers mPGK.F, 5'-CGGGATCCGCCTGAGAAAGGAAGTGAGCTG (SEQ ID NO: 14) and mPGK.R, 5'-GAAGATCTGGAGGAATGAGCTGGCCCTTA (SEQ ID NO: 15), and the BamH I to Bgl II fragment was cloned into the BamH I site. The Ase I to Sal I fragment of pcDNA3.1 containing the neo expression cassette was treated with T4 DNA polymerase, ligated to Spe I linkers (5'-GACTAGTC; SEQ ID NO: 16) and the Spe I fragment was then cloned into the Spe I site to give pORTneoF; or the EcoR I to Not I fragment of CET700 (Cobra Therapeutics) carrying the puromycin resistance cassette was treated with T4 DNA polymerase, ligated to Xba I linkers, and the Xba I fragment was cloned into the Xba I site to give pORTpuroF. The Hind III to BamH I murine CMV promoter fragment from pCMVEGFPN-1 (Cobra) was subcloned into the Hind III to BamH I sites of the Hybrid UCOE in BKS+ (Cobra). The human CMV promoter was then amplified from plasmid pIRESneo (Clontech) using primers hCMVF, 5'-CTCGAGTTATTAATAGTAATCAATTACGGGGTCAT (SEQ ID NO: 17) and hCMVR, 5'-GTCGACGATCTGACGGTTCACTAAACCAGCTCT (SEQ ID NO: 18) and the Xho I to Sal I fragment was cloned into the Sal I site. The BamH I to Sal I fragment was then cloned into the BamH I to Sal I sites of pORTneoF to give pBDUneolOO, or into pORTpuroF to give pBDUpuro300. The two ATG codons upstream of the Sal I cloning site in the Hybrid UCOE in BKS+ were altered by site- directed mutagenesis, then the BamH I to Sal I fragment was cfoned into the BamH I to Sal I sites of pORTneoF to give pBDUneo200, or into pORTpuroF to give pBDUpuro400.
Human antibody light chains were cloned into either the BamH I or Sal I sites of all four bi-directional UCOE vectors (pBDUneolOO, pBDUneo200, pBDUpuro300 and pBDUpuro400; Figures 6-9 and SEQ ID NOs: 1-4, respectively), followed by the heavy chain at the remaining BamH I or Sal I cloning site to give pBDUneoll2, pBDUneol21, pBDUneo212, pBDUneo221, pBDUpuroll2, pBDUpurol21, pBDUpuro212 and pBDUpuro221. Additional bi-directional UCOE vectors suitable for co-expression of two or more recombinant proteins are disclosed in Figures 10-13 (SEQ ID NOs: 5-8)
and are referred to as pBDUneo500, pBDUneo600, pBDUpuro700 and pBDUpuro800, respectively. These vectors may be employed, for example, to optimize the hybrid UCOE orientation for antibody expression, as well as to provide alternative promoter combinations for optimization. Plasmid pORTpuroF was digested with Xbal (partial) and Nsil to remove the bovine growth hormone polyA site, then ligated to the SV40 early polyA site which was amplified with primers 14506, 5'-
CCAATGCATAGGTTGGGCTTCGGGAATCGT (SEQ ID NO: 19) and 14507, 5'- GCTCTAGATCTCGACGGTATACAGACATGAT (SEQ ID NO: 20) followed by digestion with Xbal and Nsil, to give plasmid pORTpuroF2. The Hybrid UCOE vector containing the murine CMV promoter downstream of the human RNP UCOE and with the two mutated ATG codons between the actin promoter and the Sal I site, was digested with BamHI and Hindlll to remove the murine CMV promoter, then ligated to the human CMV promoter that had been amplified with primers 14425, 5'- CCCAAGCTTATTAATAGTAATCAATTACGGGGTCAT (SEQ ID NO: 21) and 14426, 5,-CAAGGATCCGATCTGACGG'TTCACTAAACCAGCTCT (SEQ ID NO: 22) followed by digestion with BamHI and Hindlll. An adapter comprised of annealed oligos 14466, 5'-TCGAGTCGTTTAAACTCTAG (SEQ ID NO: 23) and 14465, 5'- TCGACTAGAGTTTAAACGAC (SEQ ID NO: 24) was then inserted at the Sail site, digested with Pmel and Sail, and ligated to the murine CMV promoter that had been amplified with primers 14435, 5'-
GAATΓCGAGCTCGCCCAACTCCGCCCGTTTTAT (SEQ ID NO: 25) and 14436, 5'-ATTTGTCGACTCTAGACCCGGGCTGCAGCGAGGAGCTCT (SEQ ID NO: 26) followed by digestion with Sail. The plasmid either with, or without, the murine CMV promoter was then digested with BamHI and Sail, and ligated to BamHI and Sail digested pORTneoF to give plasmids pBDUneo500 and pBDUneo600; or was ligated to BamHI and Sail digested plasmid pORTpuroF2 to give plasmids pBDUpuro700 and pBDUpuro800, respectively.
G418 or puromycin-resistant bi-directional UCOE vectors expressing antibody heavy and light chains were transfected into CHO-Kl or CHO-S cells using Lipofectamine or DMRIE-C (Invitrogen), respectively, following the manufacturer's
instructions, and selected with 500 ug/ml G418 (neo vectors) or 12.5 ug/ml puromycin (puro vectors). Pools were selected and antibody production rates compared between the different constructs to determine the optimal promoter and selectable marker combination for antibody expression in CHO cells. The results of expression studies in CHO-S suspensions cells are depicted in Table 2. These data demonstrated that vectors containing the light chain expressed from the murine CMV promoter gave the best antibody expression. Vectors containing puromycin or G418-resistance markers were used. Additionally, two bidirectional vectors, one containing a puromycin-resistance marker and one containing a G418-resistance marker, were co-transfected. Pools were selected, and antibody production rates determined. Separately, the G418 or puromycin-resistant transfecant pools displayed similar production rates, but the production rate of the co-transfected pool was significantly higher. This suggests that it may be possible to increase production rate by having two copies of the antibody expression vector, maintained with different selectable markers. Selecting pools with higher levels of puromycin (25-50 μg/ml versus 12.5 μg/ml) did not correlate with increased production.
Clonal lines were isolated from the puromycin-resistant pool carrying pBDUpuro421. Fifteen out of twenty-two clonal cell lines expressed measurable amounts of antibody. Initial production-rate determinations indicated that the cell lines had antibody secretion rates of up to 16 pg/cell/day (Table 3). Southern blot analysis identified at least one clone having a production rate of 13 pg/cell/day and has approximately a single copy of the vector DNA (clone S421.7). Clones from this pool were isolated with production rates of 3-18 pg/cell/day. Clones expressing approx. 5 pg/cell/day were used for initial fermentation experiments.
Table 2 Expression of hAbl (IgG4 from bi-directional UCOE vectors
Vector H3 Promoter Kl Promoter Production Rate
(pg/cell/day) pBDUneol l2 murine CMV human CMV 0.3 pBDUneol21 human CMV murine CMV 1.5 pBDUneo212 murine CMV human beta-actin 0.06 pBDUneo221 human beta-actin murine CMV 1.3 ρBDUρuro312 murine CMV human CMV 0.5 ρBDUpuro321 human CMV murine CMV 1.4 pBDUpuro412 murine CMV human beta-actin 0.05 pBDUpuro421 human beta-actin murine CMV 2.3
Cotransfection* * human CMV human CMV 0.7 pBDUneo221 human beta-actin murine CMV 1.3 pBDUpuro421 human beta-actin murine CMV 1 pBDUneo221+ human beta-actin murine CMV 5 pBDUpuro421
** Cotransfection was carried out previously using the same antibody genes each driven from 4kb UCOE CMV vectors (hygromycin and neomycin selection)
Table 3 Expression of hAbl in clonal CHO-S cell lines transfected with pBDUpuro421
PuromycinR Cell Line Production Rate
(pg/cell/day)
S421.2 5.4
S421.3 0.5
S421.4 0.5
S421.7 13.4
S421.8 5.4
S421.9 0.04
S421.12 1.4
S421.14 6.7
S421.15 0.3
S421.16 7.2
S421.17 5
S421.18 0.8
S421.20 1.2
S421.21 0.3
S421.22 16
EXAMPLE 5
DELETION ANALYSIS OF THE RNP UCOE
This Example discloses polynucleotide deletions within an RNP UCOE plasmid vector for improved expression of recombinant proteins. Briefly, a series of deletions within the 8 kb RNP UCOE were prepared to identify both important functional elements and regions that may be removed without affecting UCOE function. A green fluorescent protein gene (GFP) was cloned into plasmid CET720 (Cobra Therapeutics), and deletions were subsequently introduced into the UCOE region (Figure 14). The first set of these deletions was transfected into CHO-S cells, and examined for the ability to express GFP. In a transient assay (two days post transfection), all of the plasmids were able to express GFP as determined by fluorescence microscopy. Stable pools carrying the different constructs were then selected, and GFP expression determined by FACS analysis. One month post- transfection, all of the deletions displayed both a higher percentage of positive cells than a control plasmid which did not contain the UCOE (>50% versus 10% without the UCOE), and a higher mean fluorescence for the positive population than the control vector that did not contain the UCOE (Table 4).
These data defined more precisely the region of the human RNP UCOE required for full activity and identified a shorter (approximately 7kb) UCOE element with full activity. This new 7kb UCOE element was defined by deletion ΔRV and extends from nucleotide 2225 - 9254 in Figure 14.
Table 4 GFP expression from plasmids containing deletions within the 8 kb RNP UCOE
Plasmid Region Deleted Percent Mean Fluorescence of
Positive Positive Population
CET720GFP (8 kb None 68 516
UCOE)
CET700GFP (no nt. 2225-10525 10 136
UCOE)
ΔBS (4 kb UCOE) nt. 2225-6341 61 370
ΔEcoNI nt. 3875-6916 65 439
ΔEX2 nt. 6916-7053 53 384
ΔEM nt. 6916-7209 66 423
ΔMX nt. 7053-7209 66 464
ΔMluI nt. 7209-8293 58 448
ΔRV nt. 9254-10342 72 548
Vector CET720GFP (represented by SEQ ID NO: 9, which contains the 8 kb human RNP UCOE) was digested with EcoRV, MM, EcoNI, or BamHI plus Sail, the ends were blunted with T4 DNA polymerase and religated to produce vectors deltaRV, delta MM, deltaEcoNI and deltaBS, respectively. CET720 was digested with PflMI and blunted with T4 DNA polymerase, then cut with BamHI. The blunt to BamHI fragment was cloned into the EcoRV to BamHI sites of pBluescript II SK (+) to give pPB720. ρPB720 was digested with EcoNI and MM, M and Xhol (partial), or EcoNI and Xhol (partial), the ends were treated with T4 DNA polymerase and recircularized. The PshAI fragment from each of the resulting vectors was cloned into the PshAI sites of CET720GFP to give illustrative vectors deltaEM, deltaEX and deltaMX, respectively.
EXAMPLE 6 ADDITIONAL DELETION ANALYSIS OF THE RNP UCOE Previous examples have identified via deletion analysis that the UCOE regions from nucleotides 2225-6916 and 9254-10342 of vector CET720GFP (SEQ ID NO:9) can be removed without loss of UCOE activity (see Example 5 above). In this example, minimal regions of the 8kb RNP UCOE that are important for its activity are further defined. Importantly, this analysis more precisely defined an illustrative 4.1 kb region of the human RNP UCOE that retains full activity.
Briefly, fragments of the 8kb RNP UCOE were blunted and ligated to Hindlll linkers (New England Biolabs; Catalog Number S1098S), digested with Hindlll and ligated to Hindlll digested and calf-intestinal alkaline phosphatase-treated vector CET700GFP. Vectors were transfected into CHO-S cells using DMRIE-C (Invitrogen), where all constructs were capable of expressing GFP in a transient assay (data not shown). After 2 weeks in puromycin selection, the geometric mean fluorescence of the positive population was determined by FACS, and expressed as a percentage of the
control (CET720GFP), the results of which are summarized in Table 5 below. Vector 700FRV, which contains a 4.1 kb Mfel to EcoRV fragment of the RNP UCOE, corresponding to nucleotide residues 5152-9254 of CET720GFP, retained Ml UCOE activity relative to the 8 kb UCOE region of nucleotide residues 2225-10525 of CET720GFP. Thus, this 4.1kb UCOE fragment represents a new minimal UCOE element that retains activity at levels comparable to that for the Ml 8kb UCOE element.
Table 5
Activity was also determined for the three UCOE fragments contained within 700HRV.R, 700FRV.R and 700BRV.R, but with the UCOE fragments inserted in the opposite orientation, to give plasmids 700HRV.F, 700FRV.F and 700BRV.F, respectively. Again, all plasmids were capable of expressing GFP in a transient assay. After 3 weeks in puromycin selection, the geometric mean fluorescence of the positive population was determined by FACS, and expressed as a percentage of the contiol (CET720GFP), the results of which are summarized in Table 6 below. While lower levels of activity were observed for plasmids containing UCOE in the opposite orientation, all fragments nonetheless retained UCOE activity.
Table 6
EXAMPLE 7 PREPARATION OF ADDITIONAL ILLUSTRATIVE BI-DIRECTIONAL UCOE VECTORS
Previous examples have described the preparation and evaluation of numerous illustrative UCOE vectors. In this example, additional UCOE vectors were constructed. For example, vectors pBDUpuro350 (SEQ ID NO: 27) and pBDUpuro450 (SEQ ID NO: 28) were prepared so as to be equivalent to the previously described vectors pBDUpuro300 and pBDUpuro400, with the exception that the polyA site following the puromycin resistance gene was replaced with the SV40 polyA site (see also Figures 15 and 16). Several additional vectors will replace the 8kb RNP UCOE element with the 4.1kb Mfel-EcoRV fragment identified hereinabove by deletion analysis to retain Ml UCOE activity. To alter the polyA site of the puromycin resistance cassette of the pBDUpuro vector series, the SV40 polyA site was amplified from pBSneo.23 by polymerase chain reaction and the reaction product was digested with Nsil and Xbal and inserted into the Nsil to Xbal site of pORTpuroF to replace the BGH polyA site. This new vector, pORTpuroF' was sequentially digested with BamHI and Sail, and cloned into the BamHI to Sail sites of HUCMV (hybrid UCOE with murine CMV promoter) to give plasmid pBDUpuro350 (SEQ ID NO: 27; see also Figure 15), or cloned into the BamHI site of pUCOEact3 (hybrid UCOE with site directed mutagenesis of the ATG codons in the actin promoter) to give pBDUpuro450 (SEQ ID NO: 28; see also Figure 16). Addditional UCOE vectors are constracted by inserting a Hindlll site at the position of the Kpnl site at the border between the human beta-actin and RNP UCOE fragments in plasmids pUCOEact3 and pUCOEact3hCMV. The 4kb Hindlll fragment carrying the RNP UCOE is then removed and replaced with the 4.1kb RNP UCOE fragment from 700FRV.R. The Sail to BamHI (partial) fragments are then cloned into the Sail to BamHI sites of pORTneoF and pORTpuroF' to give pBDUpurol200 (SEQ ID NO: 29; see also Figure 17), pBDUpurol450 (SEQ ID NO: 30; see also Figure 18), pBDUneol600 (SEQ ID NO: 31; see also Figure 19) and pBDUpurolδOO (SEQ ID NO: 32; see also Figure 20).
EXAMPLE 8 EVALUATION OF VECTOR FEATURES IMPORTANT FOR BI-DIRECTIONAL UCOE ACTIVITY
1. Effect of bi-directional UCOE vector copy number on antibody production rate in CHO-S cells:
CHO-S cell line S421.7 have been shown to contain a single copy of vector pBDUpuro421, which expresses hAbl (IgG4). To determine if additional vector copies could increase antibody expression levels, S421.7 was retiansfected with vector pBDUneo221 that also expresses hAbl, but carries a different selectable marker (G418 resistance). Clonal cell lines were isolated and analyzed for production rate (Figure 21). Many cell lines appear to have higher production rates than the parental line S421.7, indicating that additional vector copies can increase production. Initial copy number analysis indicated that cell lines S7.16, S7.20 and S7.23 contain 1-2 copies of vector pBDUneo221 (data not shown).
2. Effect of hybrid UCOE orientation and promoter choice on antibody production in CHO-S cells
Stable pools of CHO-S cells carrying various bi-directional UCOE vectors expressing hAbl (IgG4) were analyzed to determine both the effect of the orientation of the hybrid UCOE relative to the antibody genes, and the effect of different promoters on antibody expression rates. CHO-S cells were transfected with a series of bi-directional UCOE vectors expressing hAbl (IgG4), and stable pools were selected with either 12.5 μgΛtnl puromycin or 500 μg/ml G418. The location of the heavy chain (H) and the light chain (K) relative to the hybrid UCOE element (actin end versus RNP end) and the promoters used are shown in Table 7 below. Antibody production rates were measured by ELISA, and western blot analysis was performed to determine the distribution of light chain and heavy chain in the supernatant (supe) versus the cell lysate (lysate). The orientation of the hybrid UCOE showed only minor effects on antibody expression levels, however the choice of promoter combination resulted in some differences in production rates. The highest production rates were obtained in these experiments using illustrative vectors expressing the heavy chain from the human
beta-actin promoter, and the light chain from either the murine CMV or human CMV promoters (e.g., pBDUpuro454 and pBDUpuro804).
Table 7
Vector Actin RNP end Heavy Heavy Kappa Kappa Prod. Rate
End Chain Chain Chain Chain (pε/cell/dav
(supe) (lysate) (supe) (lysate) pBDUpuro352 hCMV-K mCMV-H + ++ + - 0.159 pBDUpuro354 CMV-H mCMV-K + + +++ + 0.256 pBDUpuro452 actin-K mCMV-H +/- ++ +/- - 0.0056 pBDUpuro454 actin-H mCMV-K ++ + +++ ++ 0.657 pBDUpuro702 hCMV-K mCMV-H ++ ++ ++ + 0.391 pBDUpuro704 hCMV-H mCMV-K ++ ++ ++ +/- 0.170 pBDUpuro802 actin-K mCMV-H +/- +++ +/- - 0.020 pBDUpuro804 actin-H mCMV-K +++ +++ +++ ++ 0.608
3. Transcription versus production rates in CHO-S cells
Clonal cell lines were isolated from the puromycin resistant pools carrying pBDUpuro452, pBDUpuro454 and pBDUpuro804. Approximately two thirds of clonal lines carrying pBDUpuro454 and pBDUpuro804 had measurable antibody production rates from 1 to 10 pg/cell day, similar to previous results obtained with vector pBDUpuro421 (data not shown). TaqMan assays on genomic DNA samples suggested that clonal lines S452.3, S454.5 and S804.4 carried single copies of the bidirectional UCOE vectors pBDUpuro452, pBDUpuro454 and pBDUpuro804, respectively. Cell line S421.7, previously shown by Southern analysis to have a single copy of pBDUpuro421 (pBDUpuro400 with the heavy chain expressed from the human actin promoter, and the light chain from the murine CMV promoter) was included as a control. To study the correlation between production rate and transcription of the antibody chains, TaqMan RT-PCR assays were carried out on these lines, the results of which are summarized in Table 8 below. Both heavy and light chain RNA levels in line S452.3 were significantly lower than those observed in the control lines D6 and S421.7, that have been shown to express antibody well. However, lines S454.5 and S804.4 had RNA levels as well as production levels similar to the positive contiol lines. Together
with western blot analysis (data not shown), these results indicate that the RNA levels of antibody heavy and light chains observed in these lines correlates with the production rates observed.
Table 8
Ct, cycle number threshold; CHO-S, parental cell line; D6, clonal cell line carrying two pieces of a vector expressing the light chain and 4-8 copies of the heavy chain expressed from the hCMV promoter for hAbl; S421.7, clonal cell line carrying a single copy of pBDupuro421; S454.5, clonal cell line carrying a single copy of pBDUpuro454; S804.4 clonal cell line carrying a single copy of pBDUpuro804; and S452.3, clonal cell line carrying a single copy of pBDUpuro452.
U.S. patents, U.S. patent application publications, U.S. patent applications, foreign patents, foreign patent applications and non-patent publications referred to in this specification and/or listed in the Application Data Sheet are incorporated herein by reference in their entirety.
From the foregoing it will be appreciated that, although specific embodiments of the invention have been described herein for purposes of illustration, various modifications may be made without deviating from the spirit and scope of the invention. Accordingly, the invention is not limited except as by the appended claims.
Claims (48)
1. A composition for achieving high-level, large scale protein and/or polypeptide expression, said composition comprising:
(a) an immortalized host cell-line, capable of continuous growth in culture wherein said host cell-line is capable of growth in serum-free suspension culture, and
(b) a vector for sustained overexpression of a recombinant protein and/or polypeptide, wherein said host cell-line is transfected with said vector.
2. The composition of claim 1 wherein said immortalized host cell- line has a doubling time of no more than 16 hours.
3. The composition of claim 2 wherein said doubling time is no more than 12 hours.
4. The composition of claim 1 having an efficiency of transfection of at least 70%.
5. The composition of claim 4 wherein said efficiency of transfection is at least 75%.
6. The composition of claim 4 wherein said efficiency of transfection is at least 85%.
7. The composition of claim 4 wherein said efficiency of transfection is at least 95%.
8. The composition of claim 1 wherein said host cell-line is susceptible to selection agents selected from the group consisting of: hygromycin, G418, and puromycin.
9. The composition of claim 1 wherein said host cell-line is characterized by the absence of gal-gal glycosylation of said recombinant protein and/or polypeptide.
10. The composition of claim 1 wherein said host cell-line is selected from the group consisting of CHO-S, 293-F, 293-H, COS-7L, D.Mel-2, Sf21, and Sf9.
11. The composition of claim 1 wherein said vector further comprises a property selected from the group consisting of (a) containing one or more elements that facilitate high-level, large-scale expression in the immortalized host cell- line and (b) resistance to repression of the recombinant protein and/or polypeptide.
12. The composition of claim 1 wherein said vector further comprises one or more universal chromatin opening elements (UCOEs).
13. The composition of claim 1 wherein said composition is characterized in being capable of achieving expression levels of at least 50 mg recombinant protein and/or polypeptide per liter of culture.
14. The composition of claim 13 wherein said composition is characterized in being capable of achieving expression levels of at least 100 mg recombinant protein and/or polypeptide per liter of culture.
15. The composition of claim 13 wherein said composition is characterized in being capable of achieving expression levels of at least 200 mg recombinant protein and/or polypeptide per liter of culture.
16. The composition of claim 1 wherein said composition is capable of scale-up to at least 100 liter scale and wherein said composition is capable of yields of at least 1 gram of protein and/or polypeptide.
17. The composition of claim 16 wherein said composition is capable of yields of at least 10 grams of protein and/or polypeptide.
18. The composition of claim 16 wherein said composition is capable of yields of at least 20 grams of protein and/or polypeptide.
19. A method for the high-level, large-scale production of a protein and/or polypeptide, said method comprising the steps of
(a) obtaining an immortilized host cell-line capable of growth in suspension;
(b) adapting said immortilized host cell-line for growth in serum-free medium;
(c) tiansfecting said serum-free growth adapted immortalized cell- line with a vector suitable for high-level expression of a recombinant protein and/or polypeptide.
20. The method of claim 19 wherein said immortalized host cell-line has a doubling time of no more than 16 hours.
21. The method of claim 20 wherein said doubling time is no more than 12 hours.
22. The method of claim 19 having an efficiency of transfection of at least 70%.
23. The method of claim 22 wherein said efficiency of transfection is at least 75%.
24. The method of claim 22 wherein said efficiency of transfection is at least 85%.
25. The method of claim 22 wherein said efficiency of transfection is at least 95%.
26. The method of claim 19 wherein said host cell-line is susceptible to selection agents selected from the group consisting of: hygromycin, G418, and puromycin.
27. The method of claim 19 wherein said host cell-line is characterized by the absence of gal-gal glycosylation of said recombinant of protein and/or polypeptide.
28. The method of claim 19 wherein said host cell-line is selected from the group consisting of CHO-S, 293-F, 293-H, COS-7L, D.Mel-2, Sf21, and Sf9.
29. The method of claim 19 wherein said vector further comprises a property selected from the group consisting of (a) containing one or more elements that facilitate high-level, large-scale expression in the immortalized host cell-line and (b) resistance to repression of the recombinant protein and/or polypeptide.
30. The method of claim 19 wherein said vector further comprises one or more universal chromatin opening elements (UCOEs).
31. The method of claim 19 wherein said method is characterized in being capable of achieving expression levels of at least 50 mg recombinant protein and/or polypeptide per liter of culture.
32. The method of claim 31 wherein said method is characterized in being capable of achieving expression levels of at least 100 mg recombinant protein and/or polypeptide per liter of culture.
33. The method of claim 31 wherein said method is characterized in being capable of achieving expression levels of at least 200 mg recombinant protein and/or polypeptide per liter of culture.
34. The method of claim 19 wherein said method is capable of scale- up to at least 100 liter scale and wherein said method is capable of yields of at least 1 gram of protein and/or polypeptide.
35. The method of claim 34 wherein said method is capable of yields of at least 10 grams of protein and/or polypeptide.
36. The method of claim 34 wherein said method is capable of yields of at least 20 grams of protein and/or polypeptide.
37. A bi-directional vector for high-level, large-scale expression, of a multisubunit protein and/or polypeptide, said composition comprising:
(a) at least one UCOE element; and
(b) a first tianscriptional promoter; and
(c) a second tianscriptional promoter; wherein said UCOE element is operably linked to said first and said second tianscriptional promoter and wherein said first transcriptional promoter is oriented in the opposite direction as said second tianscriptional promoter
38. The bi-directional vector of claim 37 wherein said UCOE element is an RNP UCOE.
39. The bi-directional vector of claim 37 wherein said first transcriptional promoter is selected from the group consisting of a human CMV promoter, a murine CMV promoter and a human beta-actin promoter.
40. A composition for achieving high-level, large scale protein and/or polypeptide expression, said composition comprising:
(a) an immortalized host cell-line, capable of continuous growth in culture wherein said host cell-line is capable of growth in serum-free suspension culture, and
(b) the bi-directional vector of claim 37, wherein said host cell-line is transfected with said vector.
41. A method for the high-level, large-scale production of a protein and or polypeptide, said method comprising the steps of
(a) obtaining a host cell-line capable of continuous growth;
(b) adapting said host cell-line for growth in serum-free medium to create a cell-line capable of continuous growth in serum-free medium;
(c) tiansfecting said cell-line capable of continuous growth in serum- free medium with a vector of claim 37.
42. The method of claim 41 wherein said host cell-line capable of continuous growth is also capable of growth in suspension.
43. The method of claim 42 wherein said host cell-line capable of continuous growth in suspension is a CHO-S cell-line.
44. A vector for high-level, large scale expression, of a multisubunit protein and/or polypeptide, said composition comprising:
(a) at least one UCOE element; and
(b) a tianscriptional promoter; said vector further comprising one or more deletion within regions of the RNP UCOE selected from the group consisting of ΔBS, ΔEcoNI, ΔEM, ΔMM, and ΔRV as depicted in Table 4 and Figure 14.
45. The vector of claim 44 wherein said deletion is within the region of the RNP UCOE depicted by ΔBS in Table 4 and Figure 14.
46. The vector of claim 44 wherein said deletion is at least 100 bp.
47. The vector of claim 44 wherein said deletion is at least 1,000 bp.
48. The vector of claim 44 wherein said deletion is at least 4,000 bp.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US60/295,961 | 2001-06-04 | ||
US60/333,620 | 2001-11-26 | ||
US60/352,404 | 2002-01-29 |
Publications (1)
Publication Number | Publication Date |
---|---|
AU2002305834A1 true AU2002305834A1 (en) | 2002-12-16 |
Family
ID=
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1402006A1 (en) | Compositions and methods for high-level, large-scale production of recombinant proteins | |
JP4139867B2 (en) | Stable protein product with improved bactericidal / permeability and pharmaceutical composition comprising the same | |
JP5432117B2 (en) | Recombinant expression vector element (rEVE) for enhancing expression of recombinant protein in a host cell | |
EP0939763B1 (en) | Directed switch-mediated dna recombination | |
US5733782A (en) | Methods and compositions for high protein production from non-native DNA | |
US20040033561A1 (en) | Immunoglobulin DNA cassette molecules, monobody constructs, methods of production, and methods of use therefor | |
JP6279466B2 (en) | New expression vector | |
CN101292036A (en) | Methods for recombinantly expressing polypeptides | |
KR20040044406A (en) | Recombinant tumor specific antibody and use thereof | |
KR101494072B1 (en) | An expression vector comprising a polynucleotide encoding a modified glutamine synthetase and a method for preparing a target protein employing the same | |
WO1998016654A1 (en) | Production of a multimeric protein by cell fusion method | |
EP1137808A1 (en) | Methods for making recombinant cells | |
EP1702071B1 (en) | Transactivation system for mammalian cells | |
US8426572B2 (en) | Artificial entropic bristle domain sequences and their use in recombinant protein production | |
US6964199B2 (en) | Methods and compositions for enhanced protein expression and/or growth of cultured cells using co-transcription of a Bcl2 encoding nucleic acid | |
EP1910421B1 (en) | Compositions and methods for metabolic selection of transfected cells | |
AU2002305834A1 (en) | Compositions and methods for high-level, large-scale production of recombinant proteins | |
KR20220057596A (en) | Allogeneic Cell Compositions and Methods of Use | |
US20190031752A1 (en) | Method for Producing Antibodies | |
US20090137004A1 (en) | Artificial entropic bristle domain sequences and their use in recombinant protein production | |
JP3570721B2 (en) | Methods and compositions for high protein production from recombinant DNA | |
MXPA00000677A (en) | Identification of human cell lines for the production of human proteins by endogenous gene activation | |
AU7217401A (en) | Production of a multimeric protein by cell fusion method |