CN118265538A - Transmembrane neoantigenic peptides - Google Patents
Transmembrane neoantigenic peptides Download PDFInfo
- Publication number
- CN118265538A CN118265538A CN202280033953.0A CN202280033953A CN118265538A CN 118265538 A CN118265538 A CN 118265538A CN 202280033953 A CN202280033953 A CN 202280033953A CN 118265538 A CN118265538 A CN 118265538A
- Authority
- CN
- China
- Prior art keywords
- cells
- sequence
- cell
- tumor
- peptide
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Peptides Or Proteins (AREA)
- Medicines Containing Material From Animals Or Micro-Organisms (AREA)
Abstract
Description
技术领域Technical Field
本公开提供了可用于癌症治疗的由可转座元件(TE)-外显子融合转录物编码的跨膜嵌合蛋白和肿瘤新抗原肽,核酸、疫苗、抗体和免疫细胞。The present disclosure provides transmembrane chimeric proteins and tumor neoantigen peptides encoded by transposable element (TE)-exon fusion transcripts that can be used for cancer treatment, nucleic acids, vaccines, antibodies and immune cells.
背景技术Background technique
基于T细胞的免疫疗法的出现,使用输注激活的基因工程化T细胞和其他免疫细胞,或通过递送单或双特异性抗体(如BiTE),使高精度肿瘤靶向发生了革命性变化。嵌合抗原受体(CAR)T细胞和BiTE是T细胞重定向免疫疗法的主要形式,利用单链可变片段(scFv)靶向肿瘤以诱导靶细胞死亡。使用这些方法来重新定向免疫细胞靶向能够消除先前对免疫系统“不可见”的恶性细胞,并为复发或难治性肿瘤患者提供了优异的治疗效果。这在CAR T细胞的情况下尤其有效,其中抗体结合结构域与T细胞信号传导蛋白(例如CD3)的融合能够重新定向T细胞对抗原的特异性。CAR的主要优势是T细胞被激活并能发挥效应子功能,例如细胞毒性颗粒和细胞因子的释放,而不识别由主要组织相容性复合物(MHC)呈递的肽,因为CAR直接与细胞表面分子相互作用。The advent of T cell-based immunotherapy has revolutionized high-precision tumor targeting using infusions of activated genetically engineered T cells and other immune cells, or by delivering mono- or bispecific antibodies such as BiTEs. Chimeric antigen receptor (CAR) T cells and BiTEs are the main forms of T cell redirected immunotherapy, using single-chain variable fragments (scFv) to target tumors to induce target cell death. Redirecting immune cell targeting using these approaches has enabled the elimination of malignant cells that were previously "invisible" to the immune system and has provided excellent therapeutic effects for patients with relapsed or refractory tumors. This is particularly effective in the case of CAR T cells, where the fusion of an antibody binding domain to a T cell signaling protein (e.g., CD3) is able to redirect T cells to antigen specificity. The main advantage of CARs is that T cells are activated and can exert effector functions, such as the release of cytotoxic granules and cytokines, without recognizing peptides presented by the major histocompatibility complex (MHC) because CARs interact directly with cell surface molecules.
尽管有工程化、重新定向和影响细胞功能和相互作用的能力,但仍存在与在肿瘤细胞上表达的靶向蛋白相关的挑战。缺乏真正的肿瘤特异性抗原(TSA)和发展肿瘤内抗原逃逸仍然是有效的靶向治疗的主要挑战,并且方法如双抗原靶向已显示出有希望的早期结果。这些挑战促使努力发现血液和实体恶性肿瘤的生物标记物。Despite the ability to engineer, redirect, and influence cellular functions and interactions, challenges remain associated with targeting proteins expressed on tumor cells. The lack of true tumor-specific antigens (TSAs) and the development of intratumoral antigen escape remain major challenges to effective targeted therapy, and approaches such as dual antigen targeting have shown promising early results. These challenges have prompted efforts to discover biomarkers for both hematological and solid malignancies.
TSA仅在恶性肿瘤中表达,并且通常被认为是通过MHC在细胞表面呈递的蛋白突变。然而,TSA的类别可扩大至包括源自肿瘤特异性融合转录物的蛋白、肿瘤特异性糖基化、细胞表面蛋白的肿瘤特异性突变和内质网内逃避重折叠的错折叠蛋白。TSAs are expressed only in malignant tumors and are generally considered to be mutated proteins presented on the cell surface via MHC. However, the category of TSAs can be expanded to include proteins derived from tumor-specific fusion transcripts, tumor-specific glycosylation, tumor-specific mutations of cell surface proteins, and misfolded proteins that escape refolding in the endoplasmic reticulum.
肿瘤相关抗原的治疗性靶向(TAA)在某些情况下取得了成功,但同时也警告了可能与治疗相关的潜在脱靶效应。这类靶点广泛定义为,与匹配的健康组织相比,在恶性组织中具有更高表达水平,如人表皮生长因子受体2(HER2),或者定义为其表达是谱系限制的,如CD19。与TSA不同,TAA通常会有靶内副作用,这使得直接治疗相关的副作用与脱靶(on-target/off-tumor)毒性更难区分。靶向CD19的CAR T在表达CD19的肿瘤中的细胞治疗是表明CAR T细胞具有临床有效性的突破性治疗。因此,靶向谱系特异性的TAA是可能的,但仅在健康组织被视为可有可无或存在可接受的毒性水平时才是合理的。Therapeutic targeting of tumor-associated antigens (TAAs) has been successful in some cases, but comes with the caveat of potential off-target effects that may be associated with treatment. Such targets are broadly defined as having higher expression levels in malignant tissue compared to matched healthy tissue, such as human epidermal growth factor receptor 2 (HER2), or as having expression that is lineage restricted, such as CD19. Unlike TSAs, TAAs often have on-target side effects, making it more difficult to distinguish direct treatment-related side effects from on-target/off-tumor toxicities. CAR T cell therapy targeting CD19 in CD19-expressing tumors was a breakthrough therapy that demonstrated the clinical efficacy of CAR T cells. Therefore, targeting lineage-specific TAAs is possible but only justified if healthy tissue is deemed dispensable or there is an acceptable level of toxicity.
鉴定共享的真实TSA(不存在于组织中)或具有最小脱靶风险的TAA是免疫肿瘤学领域的主要挑战。Identification of shared bona fide TSAs (not present in tissues) or TAAs with minimal off-target risk is a major challenge in the field of immuno-oncology.
关于肿瘤中可转座元件(TE)的一些现有报道包括(Helman,E.et al.(2014).Genome Res.)(Schiavetti,F.et al.(2002).Cancer Res.,Takahashi,Y.et al.(2008).J.Clin.Invest.).(Chiappinelli,K.B.et al.(2015).Cell,Roulois,D.et al.(2015).Cell)。然而,TE与由肿瘤细胞表达的抗原情形之间的关系尚未得到深入研究。Some existing reports on transposable elements (TEs) in tumors include (Helman, E. et al. (2014). Genome Res.)(Schiavetti, F. et al. (2002). Cancer Res., Takahashi, Y. et al. (2008). J. Clin. Invest.).(Chiappinelli, K. B. et al. (2015). Cell, Roulois, D. et al. (2015). Cell). However, the relationship between TEs and the antigenic profile expressed by tumor cells has not been studied in depth.
新的肿瘤新抗原将受到关注,并可能改进或降低癌症治疗的成本,尤其是在采用细胞治疗、通过抗原结合结构域的靶向治疗和疫苗接种策略的情况下。New tumor neoantigens will gain interest and have the potential to improve or reduce the cost of cancer treatment, especially with cell therapy, targeted therapy via antigen-binding domains, and vaccination strategies.
发明概述SUMMARY OF THE INVENTION
本公开提供了嵌合多肽(或蛋白)和编码这种多肽序列的核苷酸序列;抗体或其抗原结合片段、T细胞受体(TCR,特别是非HLA限制性TCR)或特异性结合这种嵌合多肽或蛋白的嵌合抗原受体(CAR);产生这种抗体、TCR或CAR的方法;编码这种新抗原肽、抗体、CAR或TCR的多核苷酸,其任选地与异源调控序列连接;特异性结合这种嵌合多核苷酸或蛋白的免疫细胞;以及方法,特别是使用这种产品的治疗方法。The present disclosure provides chimeric polypeptides (or proteins) and nucleotide sequences encoding such polypeptide sequences; antibodies or antigen-binding fragments thereof, T cell receptors (TCRs, particularly non-HLA restricted TCRs) or chimeric antigen receptors (CARs) that specifically bind to such chimeric polypeptides or proteins; methods for producing such antibodies, TCRs or CARs; polynucleotides encoding such new antigenic peptides, antibodies, CARs or TCRs, which are optionally linked to heterologous regulatory sequences; immune cells that specifically bind to such chimeric polynucleotides or proteins; and methods, particularly therapeutic methods using such products.
本公开提供了包含或由SEQ ID NO:1-21542中任何一个组成并包含新抗原序列的肿瘤嵌合多肽(或蛋白),其中所述蛋白位于细胞膜上。The present disclosure provides a tumor chimeric polypeptide (or protein) comprising or consisting of any one of SEQ ID NOs: 1-21542 and comprising a neoantigen sequence, wherein the protein is located on a cell membrane.
更具体地,本公开进一步提供了分离的肿瘤新抗原序列(通常为表位),其中序列来自SEQ ID NO:1-8202的任何一个嵌合蛋白,包括其片段,并且包含TE衍生的氨基酸序列的至少一部分,或者来自SEQ ID NO:1424-8202、8203-10163和12831-21542中的任何一个,特别是衍生自其中供体为TE的嵌合融合转录物序列。在一些实施方案中,肿瘤新抗原序列与TE衍生的氨基酸序列和外显子衍生的氨基酸序列之间的断点重叠。在其他实施方案中,肿瘤新抗原序列源自纯TE序列。在其他实施方案中,肿瘤新抗原序列由TE衍生的氨基酸序列和外显子衍生的氨基酸序列之间的连接下游的非经典ORF编码。通常,肿瘤新抗原序列来自其所属的嵌合蛋白的胞外部分。More specifically, the present disclosure further provides an isolated tumor neoantigen sequence (generally an epitope), wherein the sequence is derived from any one of the chimeric proteins of SEQ ID NO: 1-8202, including fragments thereof, and comprises at least a portion of a TE-derived amino acid sequence, or is derived from any one of SEQ ID NO: 1424-8202, 8203-10163, and 12831-21542, in particular derived from a chimeric fusion transcript sequence in which the donor is TE. In some embodiments, the tumor neoantigen sequence overlaps the breakpoint between the TE-derived amino acid sequence and the exon-derived amino acid sequence. In other embodiments, the tumor neoantigen sequence is derived from a pure TE sequence. In other embodiments, the tumor neoantigen sequence is encoded by a non-classical ORF downstream of the connection between the TE-derived amino acid sequence and the exon-derived amino acid sequence. Typically, the tumor neoantigen sequence is derived from the extracellular portion of the chimeric protein to which it belongs.
通常,跨膜嵌合蛋白在超过1%,特别是超过5%,并且通常超过10%的肿瘤样品中表达。Typically, the transmembrane chimeric protein is expressed in more than 1%, particularly more than 5%, and often more than 10% of tumor samples.
通常,与正常样品相比,跨膜嵌合蛋白在肿瘤样品中的表达水平更高。Typically, the transmembrane chimeric protein is expressed at higher levels in tumor samples compared to normal samples.
通常,嵌合蛋白在正常样品中的表达低于20%,特别是低于10%,低于5%或低于1%。Typically, the expression of the chimeric protein in a normal sample is less than 20%, particularly less than 10%, less than 5% or less than 1%.
在某些实施方案中,来源于TE核苷酸序列的嵌合蛋白序列部分暴露于细胞表面。In certain embodiments, the portion of the chimeric protein sequence derived from the TE nucleotide sequence is exposed on the cell surface.
本公开进一步包括结合本文定义的跨膜嵌合蛋白,特别是结合来自SEQ ID 1-8202的任何一种嵌合蛋白的肿瘤新抗原序列(通常为表位)的抗原结合结构域,其Kd结合亲和力小于约10-7M。特异性结合本文公开的跨膜嵌合多肽或蛋白的抗体、TCR或CAR可结合至少4个、至少5个、至少6个或至少7个氨基酸的肿瘤新抗原肽序列。The present disclosure further includes an antigen binding domain that binds to a transmembrane chimeric protein as defined herein, in particular a tumor neoantigen sequence (generally an epitope) from any one of the chimeric proteins of SEQ ID 1-8202, with a Kd binding affinity of less than about 10 -7 M. An antibody, TCR or CAR that specifically binds to a transmembrane chimeric polypeptide or protein disclosed herein may bind to a tumor neoantigen peptide sequence of at least 4, at least 5, at least 6 or at least 7 amino acids.
在一些实施方案中,抗原结合结构域结合来自本文公开的任何一种嵌合蛋白的序列,所述序列包含TE衍生的氨基酸序列的至少一部分或来自SEQ ID NO:1424-8202、8203-10163和12831-21542中的任何一个,特别是衍生自其中供体为TE的嵌合融合转录物序列。在一些实施方案中,由抗原结合结构域结合的嵌合蛋白的序列或片段与TE衍生的氨基酸序列和外显子衍生的氨基酸序列之间的断点重叠。在其他实施方案中,新抗原肽源自纯TE序列。在其他实施方案中,由抗原结合结构域结合的嵌合蛋白的序列或片段由TE衍生的氨基酸序列和外显子衍生的氨基酸序列之间的连接的下游的非经典ORF编码。In some embodiments, the antigen binding domain binds to a sequence from any of the chimeric proteins disclosed herein, the sequence comprising at least a portion of a TE-derived amino acid sequence or from any of SEQ ID NOs: 1424-8202, 8203-10163, and 12831-21542, particularly derived from a chimeric fusion transcript sequence in which the donor is a TE. In some embodiments, the sequence or fragment of the chimeric protein bound by the antigen binding domain overlaps the breakpoint between the TE-derived amino acid sequence and the exon-derived amino acid sequence. In other embodiments, the new antigenic peptide is derived from a pure TE sequence. In other embodiments, the sequence or fragment of the chimeric protein bound by the antigen binding domain is encoded by a non-classical ORF downstream of the junction between the TE-derived amino acid sequence and the exon-derived amino acid sequence.
在某些实施方案中,抗原结合结构域包含一个或多个,通常一个或两个免疫球蛋白区域。In certain embodiments, an antigen binding domain comprises one or more, typically one or two, immunoglobulin regions.
值得注意的是,抗原结合结构域可包含抗体的重链可变区(VH)和/或抗体的轻链可变区(VL)。Notably, the antigen binding domain may comprise an antibody heavy chain variable region (VH) and/or an antibody light chain variable region (VL).
本公开还包括包含如本文所定义的抗原结合结构域的抗体,其中所述抗体选自完整的IgG、scFv、BiTE或多特异性抗体。所述抗体可以来源于人、鼠或骆驼科。The present disclosure also includes antibodies comprising an antigen binding domain as defined herein, wherein the antibody is selected from a complete IgG, a scFv, a BiTE or a multispecific antibody. The antibody may be derived from human, mouse or camelid.
本公开还包括嵌合抗原受体(CAR)或非HLA限制性重组T细胞受体(TCR),其包含本文定义的抗原结合结构域。The present disclosure also includes a chimeric antigen receptor (CAR) or a non-HLA restricted recombinant T cell receptor (TCR) comprising an antigen binding domain as defined herein.
本公开的非HLA限制性重组TCR通常包含能够与第二胞外抗原结合结构域二聚的胞外抗原结合结构域。在一些实施方案中,第二胞外抗原结合结构域结合肿瘤抗原,优选地其中肿瘤抗原选自pHER95、CD19、MUC16、MUC1、CAIX、CEA、CD8、CD7、CD10、CD20、CD22、CD30、CD70、CLL1、CD33、CD34、CD38、CD41、CD44、CD49f、CD56、CD74、CD133、CD138、EGP-2、EGP-40、EpCAM、Erb-B2、Erb-B3、Erb-B4、FBP、胎儿乙酰胆碱受体、叶酸受体-a、GD2、GD3、HER-2、hTERT、IL-13R-a2、κ-轻链、KDR、LeY、L1细胞粘附分子、MAGE-A1、间皮素、MAGEA3、p53、MART1,GP100、蛋白酶3(PR1)、酪氨酸酶、生存素、hTERT、EphA2、NKG2D配体、NY-ESO-1、癌胚抗原(h5T4)、PSCA、PSMA、ROR1、TAG-72、TAG-R2、WT-1、BCMA、CD123、CD44V6、NKCS1、EGF1R、EGFR-VIII、CD99、CD70、ADGRE2、CCR1、LILRB2、LILRB4、PRAME和ERBB。The non-HLA restricted recombinant TCRs of the present disclosure generally comprise an extracellular antigen binding domain that can dimerize with a second extracellular antigen binding domain. In some embodiments, the second extracellular antigen binding domain binds to a tumor antigen, preferably wherein the tumor antigen is selected from pHER95, CD19, MUC16, MUC1, CAIX, CEA, CD8, CD7, CD10, CD20, CD22, CD30, CD70, CLL1, CD33, CD34, CD38, CD41, CD44, CD49f, CD56, CD74, CD133, CD138, EGP-2, EGP-40, EpCAM, Erb-B2, Erb-B3, Erb-B4, FBP, fetal acetylcholine receptor, folate receptor-a, GD2, GD3, HER-2, hTERT, IL- 13R-a2, kappa-light chain, KDR, LeY, L1 cell adhesion molecule, MAGE-A1, mesothelin, MAGEA3, p53, MART1, GP100, proteinase 3 (PR1), tyrosinase, survivin, hTERT, EphA2, NKG2D ligand, NY-ESO-1, carcinoembryonic antigen (h5T4), PSCA, PSMA, ROR1, TAG-72, TAG-R2, WT-1, BCMA, CD123, CD44V6, NKCS1, EGF1R, EGFR-VIII, CD99, CD70, ADGRE2, CCR1, LILRB2, LILRB4, PRAME and ERBB.
本公开进一步包括一种CAR,其包含:The present disclosure further includes a CAR comprising:
a)胞外域,其包含一个或多个抗原结合结构域,其至少一个选自本文所述的抗原结合结构域,a) an extracellular domain comprising one or more antigen binding domains, at least one of which is selected from the antigen binding domains described herein,
b)跨膜结构域,b) transmembrane domain,
c)任选的一个或多个共刺激结构域,例如选自CD28、4-1BB(CD137)、ICOS-1、CD27、OX 40(CD137)、DAP10和GITR(AITR)c) optionally one or more co-stimulatory domains, for example selected from CD28, 4-1BB (CD137), ICOS-1, CD27, OX 40 (CD137), DAP10 and GITR (AITR)
d)一个或多个胞内信号传导结构域,其包含一个或多个ITAM,例如:来自CD3-ζ的胞内信号传导结构域或其一部分,或缺少一个或两个ITAM的变体(例如:ITAM3和/或ITAM2,参见上文详述和参考文献)、FcRγ、FcRβ、CD3γ、CD3δ、CD3ε、CDS、CD22、CD79a、CD79b和/或CD66d,特别是包含其中ITAM2和ITAM3已失活的经修饰的CD3ζ胞内信号域的胞内信号传导结构域,d) one or more intracellular signaling domains comprising one or more ITAMs, for example: an intracellular signaling domain from CD3-ζ or a portion thereof, or a variant lacking one or two ITAMs (for example: ITAM3 and/or ITAM2, see above for details and references), FcRγ, FcRβ, CD3γ, CD3δ, CD3ε, CDS, CD22, CD79a, CD79b and/or CD66d, in particular an intracellular signaling domain comprising a modified CD3ζ intracellular signaling domain in which ITAM2 and ITAM3 have been inactivated,
在一些实施方案中,本文公开的CAR包含选自CD28、CD8或CD3-ζ的跨膜结构域。In some embodiments, a CAR disclosed herein comprises a transmembrane domain selected from CD28, CD8, or CD3-ζ.
在一些实施方案中,本文公开的CAR包含一个或多个共刺激结构域,其可选自:CD28、4-1BB(CD137)、ICOS-1、CD27、OX 40(CD137)、DAP10和GITR(AITR)。In some embodiments, the CAR disclosed herein comprises one or more co-stimulatory domains, which can be selected from: CD28, 4-1BB (CD137), ICOS-1, CD27, OX 40 (CD137), DAP10 and GITR (AITR).
在一些实施方案中,本文公开的CAR包含胞内信号传导结构域,其包含CD3-ζ多肽的胞内信号传导结构域,或其片段,任选地包含其中免疫受体酪氨酸基激活基序2(ITAM2)和免疫受体酪氨酸基激活基序3(ITAM3)失活的CD3-ζ多肽。In some embodiments, a CAR disclosed herein comprises an intracellular signaling domain comprising an intracellular signaling domain of a CD3-zeta polypeptide, or a fragment thereof, optionally comprising a CD3-zeta polypeptide in which the immunoreceptor tyrosine-based activation motif 2 (ITAM2) and the immunoreceptor tyrosine-based activation motif 3 (ITAM3) are inactivated.
本公开还涵盖:This disclosure also covers:
-抗体或其抗原结合片段、T细胞受体(TCR)或针对其对来自SEQ ID NO:1-21542中任何一个的嵌合蛋白的结合亲和力而被选择的嵌合抗原受体(CAR),包括其一部分,例如长度至少为4、5、6、7或8个氨基酸,或包含这种抗体、其抗原结合片段、TCR或CAR的组合物。-An antibody or antigen-binding fragment thereof, a T cell receptor (TCR), or a chimeric antigen receptor (CAR) selected for its binding affinity to a chimeric protein from any one of SEQ ID NOs: 1-21542, including a portion thereof, such as at least 4, 5, 6, 7 or 8 amino acids in length, or a composition comprising such an antibody, antigen-binding fragment thereof, TCR or CAR.
-编码如本文定义的新抗原肽、抗体、CAR或TCR的多核苷酸,其通常可操作地与异源调控核苷酸序列连接,以及编码这种多核苷酸的载体;- a polynucleotide encoding a neoantigenic peptide, antibody, CAR or TCR as defined herein, which is usually operably linked to a heterologous regulatory nucleotide sequence, and a vector encoding such a polynucleotide;
-靶向来自SEQ ID NO:1-21542中任何一个的一种或多种嵌合蛋白的免疫细胞或免疫细胞群体,包括其一部分,例如长度至少为4、5、6、7或8个氨基酸,其中免疫细胞群体优选靶向如本文公开的多种不同的嵌合蛋白或其片段,或包含这种免疫细胞或免疫细胞群体的组合物,其任选地与生理或药理学上可接受的缓冲液、载体、赋形剂、免疫刺激剂和/或佐剂组合。- An immune cell or immune cell population targeted to one or more chimeric proteins from any one of SEQ ID NOs: 1-21542, including a portion thereof, such as at least 4, 5, 6, 7 or 8 amino acids in length, wherein the immune cell population is preferably targeted to a plurality of different chimeric proteins or fragments thereof as disclosed herein, or a composition comprising such an immune cell or immune cell population, optionally in combination with a physiologically or pharmacologically acceptable buffer, carrier, excipient, immunostimulant and/or adjuvant.
通常,抗体或其抗原结合片段、TCR或CAR结合在细胞表面表达的嵌合蛋白或其片段,其中Kd亲和力约为10-6M或更低。Typically, the antibody or antigen-binding fragment thereof, TCR or CAR binds to the chimeric protein or fragment thereof expressed on the surface of a cell with a Kd affinity of about 10 -6 M or less.
本公开进一步提供了产生如本文所定义的抗体、非HLA限制性TCR或CAR的方法,其包括以下步骤:选择结合本公开的嵌合蛋白或其片段的抗体、非HLA限制性TCR或CAR,通常为SEQ ID NO:1-21542的任何一种跨膜嵌合多肽序列,其中Kd亲和常数约为10-7M或更小。本公开还包括通过这种方法产生的抗体、CAR和TCR。The present disclosure further provides a method for producing an antibody, a non-HLA restricted TCR or a CAR as defined herein, comprising the following steps: selecting an antibody, a non-HLA restricted TCR or a CAR that binds to a chimeric protein or a fragment thereof of the present disclosure, generally any one of SEQ ID NOs: 1-21542 transmembrane chimeric polypeptide sequences, wherein the Kd affinity constant is about 10-7 M or less. The present disclosure also includes antibodies, CARs and TCRs produced by this method.
本公开进一步涵盖任选地与异源调控序列连接的编码如本文定义的嵌合蛋白或多肽、抗体、CAR和/或非HLA限制性TCR的多核苷酸,以及包含其的载体。本公开还包括包含CAR和/或TCR,特别是本文定义的非HLA限制性TCR的免疫细胞。所述免疫细胞可以是同种异体的或自体的。其通常选自T细胞、天然杀伤T细胞、CD4+/CD8+T细胞、TIL/肿瘤衍生的CD8T细胞、中央记忆CD8+T细胞、Treg、MAIT、YδT细胞、人胚胎干细胞和可分化出淋巴细胞的多能干细胞。在某些实施方案中,免疫细胞是Suv39h1缺陷的,特别是在所述免疫细胞中,Suv39h1基因通过整个基因、外显子或区域的缺失、用外源序列替换、和/或通过suv39h1基因内的移码或错义突变而被破坏。Suv39h1基因通常是编码UniProt中提及的O43463人Suv39h1蛋白的人Suv39h1基因。还预期了制备这种免疫细胞的方法,例如,通过体内或离体地将编码如本文所述任何抗体、TCR或CAR的核酸或载体递送至细胞。The present disclosure further encompasses polynucleotides encoding chimeric proteins or polypeptides, antibodies, CARs and/or non-HLA restricted TCRs as defined herein, optionally connected to heterologous regulatory sequences, and vectors comprising them. The present disclosure also includes immune cells comprising CAR and/or TCR, particularly non-HLA restricted TCRs as defined herein. The immune cells may be allogeneic or autologous. They are generally selected from T cells, natural killer T cells, CD4+/CD8+T cells, TIL/tumor-derived CD8T cells, central memory CD8+T cells, Treg, MAIT, YδT cells, human embryonic stem cells, and pluripotent stem cells that can differentiate into lymphocytes. In certain embodiments, immune cells are Suv39h1 defective, particularly in the immune cells, the Suv39h1 gene is destroyed by deletion of the entire gene, exon or region, replacement with an exogenous sequence, and/or by frameshift or missense mutations within the suv39h1 gene. The Suv39h1 gene is typically a human Suv39h1 gene encoding the O43463 human Suv39h1 protein mentioned in UniProt. Methods of preparing such immune cells are also contemplated, for example, by delivering a nucleic acid or vector encoding any antibody, TCR or CAR as described herein to a cell in vivo or ex vivo.
本公开进一步涵盖包含有效量的如本文定义的免疫细胞和药学上可接受的赋形剂的药物组合物。The present disclosure further encompasses a pharmaceutical composition comprising an effective amount of an immune cell as defined herein and a pharmaceutically acceptable excipient.
本公开还涵盖治疗用途,特别是如本文定义的嵌合蛋白或多肽、抗体、非HLA限制性TCR、CAR、多核苷酸、载体、免疫细胞或包含其的组合物在需要治疗的受试者中抑制癌细胞增殖或癌症治疗的治疗用途。通常,这种组合物进一步包含药物赋形剂。本文使用的治疗包括预防性和治疗性治疗。The present disclosure also encompasses therapeutic uses, in particular, the therapeutic use of chimeric proteins or polypeptides, antibodies, non-HLA restricted TCRs, CARs, polynucleotides, vectors, immune cells or compositions comprising them as defined herein to inhibit cancer cell proliferation or cancer treatment in subjects in need of treatment. Typically, such compositions further comprise a pharmaceutical excipient. The treatment used herein includes preventive and therapeutic treatments.
本公开还涵盖如本文定义的嵌合蛋白或多肽、抗体、非HLA限制性TCR、CAR、多核苷酸、载体、免疫细胞或包含其的组合物在有需要的受试者中在癌症的细胞疗法中的用途。通常,这种组合物进一步包含药物赋形剂。The present disclosure also encompasses the use of chimeric proteins or polypeptides, antibodies, non-HLA restricted TCRs, CARs, polynucleotides, vectors, immune cells, or compositions comprising the same as defined herein in cell therapy for cancer in subjects in need thereof. Typically, such compositions further comprise a pharmaceutical excipient.
还预期了包含任选与无菌的药学上可接受的赋形剂、载体和/或缓冲剂一起的前述任何一种的药物组合物以及使用这种药物组合物的方法。Also contemplated are pharmaceutical compositions comprising any of the foregoing, optionally together with sterile pharmaceutically acceptable excipients, carriers, and/or buffers, and methods of using such pharmaceutical compositions.
在如本文所述的任何实施方案中,上文定义的癌症治疗产品(即如本文定义的跨膜嵌合蛋白或多肽、抗体、非HLA限制性TCR、CAR、多核苷酸、载体、免疫细胞或包含其的组合物)可与至少一种其他治疗剂组合施用。这种进一步的治疗剂通常可以是化疗剂或免疫治疗剂,任选地是检查点抑制剂。In any embodiment as described herein, the cancer treatment product defined above (i.e., a transmembrane chimeric protein or polypeptide, an antibody, a non-HLA restricted TCR, a CAR, a polynucleotide, a carrier, an immune cell, or a composition comprising the same as defined herein) can be administered in combination with at least one other therapeutic agent. This further therapeutic agent can generally be a chemotherapeutic agent or an immunotherapeutic agent, optionally a checkpoint inhibitor.
例如,根据本公开,任何癌症治疗产品均可与抗免疫抑制/免疫刺激剂组合施用。例如,进一步向受试者施用一种或多种检查点抑制剂,通常选自PD-1抑制剂、PD-L1抑制剂、Lag-3抑制剂、Tim-3抑制剂、TIGIT抑制剂、BTLA抑制剂、T细胞活化的V结构域Ig抑制因子(VISTA)抑制剂和CTLA-4抑制剂或IDO抑制剂。For example, according to the present disclosure, any cancer treatment product can be administered in combination with an anti-immunosuppressive/immunostimulatory agent. For example, one or more checkpoint inhibitors are further administered to the subject, typically selected from PD-1 inhibitors, PD-L1 inhibitors, Lag-3 inhibitors, Tim-3 inhibitors, TIGIT inhibitors, BTLA inhibitors, V-domain Ig inhibitors of T cell activation (VISTA) inhibitors, and CTLA-4 inhibitors or IDO inhibitors.
所述方法、嵌合蛋白或多肽和癌症治疗产品的各种实施方案详述如下。除了明确提及的替代方案之外,本申请涵盖了这些实施方案的组合。Various embodiments of the methods, chimeric proteins or polypeptides, and cancer therapeutic products are described in detail below. Except for the alternatives explicitly mentioned, the present application encompasses combinations of these embodiments.
发明详述DETAILED DESCRIPTION OF THE INVENTION
正常组织中可转座元件的表达由于胚胎发育早期建立的DNA甲基化而沉默。通过组蛋白修饰提供额外的抑制层。TE可在肿瘤细胞中被再活化。本发明人发现并提供了明确的证据,即外显子和TE之间的非经典选择性剪接事件可能是肿瘤抗原,尤其是肿瘤特异性抗原的来源。The expression of transposable elements in normal tissues is silenced by DNA methylation established early in embryonic development. An additional layer of inhibition is provided by histone modification. TEs can be reactivated in tumor cells. The inventors have found and provided clear evidence that non-classical alternative splicing events between exons and TEs may be the source of tumor antigens, especially tumor-specific antigens.
本发明人开发了用于鉴定肿瘤抗原,特别是肿瘤特异性抗原的方法。具体地,本发明人已经确定了用于鉴定源自TE和外显子之间的连接(JET)的肿瘤抗原的方法。因此,在一些实施方案中,本发明涉及用于鉴定和选择由融合转录物(即嵌合,在本文也称为连接外显子TE-JET)序列编码的肿瘤新抗原肽的方法,该融合转录物序列包含TE序列的一部分和外显子序列的一部分。The inventors have developed methods for identifying tumor antigens, particularly tumor-specific antigens. Specifically, the inventors have determined methods for identifying tumor antigens derived from junctions (JETs) between TEs and exons. Therefore, in some embodiments, the present invention relates to methods for identifying and selecting tumor neoantigenic peptides encoded by fusion transcript (i.e., chimeric, also referred to herein as junction exon TE-JET) sequences, which fusion transcript sequences comprise a portion of a TE sequence and a portion of an exon sequence.
本发明人在此进一步提供一组代表优异的细胞表面肿瘤新抗原靶候选物的跨膜肿瘤嵌合蛋白或多肽。The present inventors further provide a group of transmembrane tumor chimeric proteins or polypeptides that represent excellent cell surface tumor neoantigen target candidates.
通过根据本公开的方法鉴定的新抗原肿瘤特异性肽具有高免疫原性。事实上,由于它们来源于正常细胞中不存在或以低水平表达的融合转录物(由可转座元件-TE-和外显子序列组成),因此本公开的肽预期表现出非常低的免疫耐受性。The new antigenic tumor-specific peptides identified by the method according to the present disclosure are highly immunogenic. In fact, since they are derived from fusion transcripts (composed of transposable elements -TEs- and exon sequences) that are absent or expressed at low levels in normal cells, the peptides of the present disclosure are expected to exhibit very low immune tolerance.
本公开还允许在患者群体中选择具有共享的肿瘤新表位的肽。这种共享的肿瘤肽具有很高的治疗价值,因为它们可用于大患者群体的免疫治疗。The present disclosure also allows the selection of peptides with shared tumor neo-epitopes in a patient population. Such shared tumor peptides have high therapeutic value because they can be used for immunotherapy in large patient populations.
定义definition
根据本公开,术语“疾病”是指任何病理状态,包括癌症疾病,尤其是本文所述的癌症疾病的形式。According to the present disclosure, the term "disease" refers to any pathological condition, including cancer diseases, particularly the forms of cancer diseases described herein.
术语“正常”是指健康受试者或组织的健康状态或状况,即非病理状况,其中“健康”优选指非癌症。The term "normal" refers to the healthy state or condition of a healthy subject or tissue, ie, a non-pathological condition, wherein "healthy" preferably refers to non-cancerous.
癌症(医学术语:恶性肿瘤)是一类疾病,其中一组细胞表现出不受控制的生长(分裂超出正常范围)、入侵(侵入和破坏邻近组织)以及有时转移(通过淋巴液或血液扩散至身体其他部位)。癌症的这三种恶性特征使其区别于良性肿瘤,良性肿瘤是自限制的,并且不会入侵或转移。大多数癌症形成肿瘤,但有些癌症(如白血病)不形成肿瘤。Cancer (medical term: malignancy) is a class of diseases in which a group of cells exhibit uncontrolled growth (dividing beyond normal limits), invasion (invading and destroying neighboring tissues), and sometimes metastasis (spreading to other parts of the body through lymph fluid or blood). These three malignant characteristics of cancer distinguish it from benign tumors, which are self-limited and do not invade or metastasize. Most cancers form tumors, but some, such as leukemia, do not.
恶性肿瘤本质上是癌症的同义词。恶性瘤、恶性赘生物和恶性肿瘤本质上是癌症的同义词。Malignant neoplasm is essentially a synonym for cancer. Malignant neoplasm, malignant growth, and malignant tumor are essentially synonyms for cancer.
如本文所用,术语“肿瘤”或“肿瘤疾病”是指细胞的异常生长(称为赘生细胞、致瘤细胞或肿瘤细胞),优选形成肿胀或病灶。“肿瘤细胞”是指通过快速、不受控制的细胞增殖而生长,并且在启动新生长的刺激停止后继续生长的异常细胞。肿瘤显示出部分或完全缺乏与正常组织的结构组织和功能协调,并且通常形成明显的组织块,其可以是良性的、恶性前的或恶性的。As used herein, the term "tumor" or "neoplastic disease" refers to an abnormal growth of cells (called neoplastic cells, tumorigenic cells or tumor cells), preferably forming a swelling or lesion. "Tumor cell" refers to an abnormal cell that grows by rapid, uncontrolled cell proliferation and continues to grow after the stimulus that initiated new growth has ceased. Tumors display a partial or complete lack of structural organization and functional coordination with normal tissues, and usually form a distinct mass of tissue, which can be benign, pre-malignant or malignant.
良性肿瘤是缺乏癌症所有三种恶性特征的肿瘤。因此,根据定义,良性肿瘤不会以无限制的侵略性的方式生长,不会入侵周围组织,也不会扩散至非邻近组织(转移)。A benign tumor is one that lacks all three malignant characteristics of cancer. Thus, by definition, a benign tumor does not grow in an unrestrained, aggressive manner, does not invade surrounding tissue, and does not spread to non-adjacent tissues (metastasize).
赘生物是由赘生引起的异常组织块。赘生物(希腊文中的新生长)是细胞的异常增殖。细胞的生长超过其周围正常组织的生长,并且与其不协调。即使在刺激停止后,生长仍以同样过度的方式持续。它通常会导致肿块或肿瘤。赘生物可以是良性的、恶性前的或恶性的。A neoplasm is an abnormal mass of tissue caused by a neoplasm. Neoplasm (Greek for new growth) is an abnormal proliferation of cells. The growth of cells exceeds and is out of harmony with the growth of the normal tissue around it. Even after the stimulus stops, the growth continues in the same excessive manner. It usually results in a lump or tumor. Neoplasms can be benign, pre-malignant, or malignant.
根据本公开的“肿瘤的生长”或“肿瘤生长”涉及肿瘤增加其大小的趋势和/或肿瘤细胞增殖的趋势。"Growth of a tumor" or "tumor growth" according to the present disclosure relates to the tendency of a tumor to increase its size and/or the tendency of tumor cells to proliferate.
为本公开的目的,术语“癌症”和“癌症疾病”与术语“肿瘤”和“肿瘤疾病”可互换使用。For the purposes of this disclosure, the terms "cancer" and "cancer disease" are used interchangeably with the terms "tumor" and "tumor disease."
癌症根据与肿瘤相似的细胞类型进行分类,因此根据推定为肿瘤来源的组织进行分类。这些分别是组织学和位置。Cancers are classified based on the type of cells that resemble the tumor and therefore based on the tissue from which the tumor is presumed to have originated. These are histology and location, respectively.
根据本申请,癌症可影响以下任何一种组织或器官:乳房;肝脏;肾;心脏、纵隔、胸膜;口底;唇;唾液腺;舌头;牙龈;口腔;腭;扁桃体;喉;气管;支气管、肺;咽、下咽、口咽、鼻咽;食道;消化器官,如胃、肝内胆管、胆道、胰腺、小肠、结肠;直肠;膀胱、胆囊、输尿管等泌尿器官;直肠乙状结肠结;肛门、肛管;皮肤;骨头;关节、肢体关节软骨;眼和附件;脑;周围神经、自主神经系统;脊髓、脑神经、脑膜;和中枢神经系统的各个部分;结缔组织、皮下组织和其他软组织;腹膜后,腹膜;肾上腺;甲状腺;内分泌腺和相关结构;女性生殖器官,如卵巢、子宫、子宫颈;子宫体、阴道、外阴;男性生殖器官,如阴茎、精巢和前列腺;造血系统和网状内皮系统;血液;淋巴结;胸腺。According to the present application, cancer may affect any of the following tissues or organs: breast; liver; kidney; heart, mediastinum, pleura; floor of mouth; lip; salivary glands; tongue; gums; oral cavity; palate; tonsils; larynx; trachea; bronchi, lungs; pharynx, hypopharynx, oropharynx, nasopharynx; esophagus; digestive organs such as stomach, intrahepatic bile duct, bile duct, pancreas, small intestine, colon; rectum; urinary organs such as bladder, gallbladder, ureters; rectosigmoid colon; anus, anal canal; skin; bones; joints, limbs Articular cartilage of the body; eyes and appendages; brain; peripheral nerves, autonomic nervous system; spinal cord, cranial nerves, meninges; and various parts of the central nervous system; connective tissue, subcutaneous tissue and other soft tissues; retroperitoneum, peritoneum; adrenal glands; thyroid gland; endocrine glands and related structures; female reproductive organs, such as ovaries, uterus, cervix; uterine body, vagina, vulva; male reproductive organs, such as penis, testes and prostate; hematopoietic system and reticuloendothelial system; blood; lymph nodes; thymus.
因此,根据本公开的术语“癌症”包括白血病、精原细胞瘤、黑色素瘤、畸胎瘤、淋巴瘤、成神经细胞瘤、神经胶质瘤、直肠癌、子宫内膜癌、肾癌、肾上腺癌、甲状腺癌、血癌、皮肤癌、脑癌、宫颈癌、肠癌、肝癌、结肠癌、胃癌、肠癌、头颈癌、胃肠癌、淋巴结癌、食道癌、结直肠癌、胰腺癌、耳、鼻、喉(ENT)癌、乳腺癌、前列腺癌、子宫癌、卵巢癌和肺癌及其转移。其实例为肺癌、乳腺癌、前列腺癌、结肠癌、肾细胞癌、子宫癌或上述癌症类型或肿瘤的转移。根据本公开的术语癌症还包括癌症转移和癌症复发。Thus, the term "cancer" according to the present disclosure includes leukemia, seminoma, melanoma, teratoma, lymphoma, neuroblastoma, glioma, rectal cancer, endometrial cancer, kidney cancer, adrenal cancer, thyroid cancer, blood cancer, skin cancer, brain cancer, cervical cancer, intestinal cancer, liver cancer, colon cancer, stomach cancer, intestinal cancer, head and neck cancer, gastrointestinal cancer, lymph node cancer, esophageal cancer, colorectal cancer, pancreatic cancer, ear, nose, throat (ENT) cancer, breast cancer, prostate cancer, uterine cancer, ovarian cancer and lung cancer and metastasis thereof. Examples thereof are lung cancer, breast cancer, prostate cancer, colon cancer, renal cell carcinoma, uterine cancer or metastasis of the above cancer types or tumors. The term cancer according to the present disclosure also includes cancer metastasis and cancer recurrence.
肺癌的主要类型为小细胞肺癌(SCLC)和非小细胞肺癌(NSCLC)。非小细胞肺癌主要分为三种亚型:鳞状细胞肺癌、肺腺癌(LUAD)和大细胞肺癌。腺癌约占肺癌的10%。这种癌症通常在肺的外周可见,而小细胞肺癌和鳞状细胞肺癌则相反,两者倾向于位于更中央。The main types of lung cancer are small cell lung cancer (SCLC) and non-small cell lung cancer (NSCLC). NSCLC is divided into three main subtypes: squamous cell lung cancer, lung adenocarcinoma (LUAD), and large cell lung cancer. Adenocarcinoma accounts for about 10% of lung cancers. This cancer is usually found in the periphery of the lungs, in contrast to small cell lung cancer and squamous cell lung cancer, which tend to be more centrally located.
“转移”是指癌细胞从其原始部位扩散至身体的另一部位。转移的形成是一个非常复杂的过程,依赖于恶性细胞与原发性肿瘤的分离、细胞外基质的侵入、内皮基底膜的渗透进入体腔和血管,然后在被血液转运后浸润到靶器官。最后,新肿瘤(即继发性肿瘤或转移肿瘤)在靶部位的生长取决于血管生成。即使在原发性肿瘤切除后,肿瘤转移仍经常发生,因为肿瘤细胞或成分可能残留并发展为转移潜能。在一个实施方案中,根据本公开的术语“转移”涉及“远处转移”涉及远离原发性肿瘤和区域淋巴结系统的转移。"Metastasis" refers to the spread of cancer cells from their original site to another part of the body. The formation of metastasis is a very complex process that depends on the separation of malignant cells from the primary tumor, the invasion of the extracellular matrix, the penetration of the endothelial basement membrane into the body cavity and blood vessels, and then infiltrates into the target organ after being transported by the blood. Finally, the growth of new tumors (i.e., secondary tumors or metastatic tumors) at the target site depends on angiogenesis. Even after the primary tumor is resected, tumor metastasis often occurs because tumor cells or components may remain and develop into metastatic potential. In one embodiment, the term "metastasis" according to the present disclosure relates to "distant metastasis" and refers to metastasis away from the primary tumor and the regional lymph node system.
继发性或转移性肿瘤的细胞与原发性肿瘤的细胞相似。这意味着,例如,如果卵巢癌转移至肝脏,则继发性肿瘤由异常的卵巢细胞组成,而不是异常的肝细胞。肝脏中的肿瘤被称为转移性卵巢癌,而非肝癌。The cells of a secondary or metastatic tumor are similar to those of the primary tumor. This means, for example, that if ovarian cancer spreads to the liver, the secondary tumor is made up of abnormal ovarian cells rather than abnormal liver cells. The tumor in the liver is called metastatic ovarian cancer, not liver cancer.
当一个人再次受到过去曾影响其的病症的影响时,则发生复发或再生。例如,如果患者患有肿瘤疾病,接受了所述疾病的成功治疗并再次发展所述疾病,则所述新发展的疾病可被视为复发或再生。然而,根据本公开,肿瘤疾病的复发或再生可能但不一定发生在原肿瘤疾病的部位。因此,例如,如果患者患有卵巢肿瘤并接受了成功的治疗,则复发或再生可以是卵巢肿瘤的发生或在不同于卵巢的部位的肿瘤的发生。肿瘤的复发或再生还包括肿瘤发生在与原肿瘤部位不同的部位以及原肿瘤部位的情况。优选地,患者已接受治疗的原始肿瘤为原发性肿瘤,且位于不同于原始肿瘤部位的肿瘤为继发性或转移性肿瘤。Recurrence or regeneration occurs when a person is again affected by a condition that affected them in the past. For example, if a patient suffers from a neoplastic disease, receives successful treatment for the disease and develops the disease again, the newly developed disease may be considered a recurrence or regeneration. However, according to the present disclosure, the recurrence or regeneration of a neoplastic disease may, but does not necessarily, occur at the site of the original neoplastic disease. Thus, for example, if a patient suffers from an ovarian tumor and receives successful treatment, the recurrence or regeneration may be the occurrence of an ovarian tumor or the occurrence of a tumor at a site other than the ovary. The recurrence or regeneration of a tumor also includes situations where the tumor occurs at a site different from the original tumor site as well as the original tumor site. Preferably, the original tumor for which the patient has been treated is a primary tumor, and the tumor located at a site different from the original tumor site is a secondary or metastatic tumor.
“治疗”是指向受试者施用本文所述的化合物或组合物,以预防或消除疾病,包括减少受试者的肿瘤大小或肿瘤数量;阻止或减缓受试者的疾病;抑制或减缓受试者新疾病的发展;降低当前或先前患有疾病的受试者的症状和/或复发的频率或严重性;和/或延长,即提高受试者的寿命。具体地,术语“疾病的治疗”包括治愈、缩短持续时间、改善、预防、减缓或抑制疾病的进展或恶化,或预防或延迟疾病或其症状的发作。"Treatment" refers to administering a compound or composition described herein to a subject to prevent or eliminate a disease, including reducing the size or number of a tumor in a subject; stopping or slowing a disease in a subject; inhibiting or slowing the development of a new disease in a subject; reducing the frequency or severity of symptoms and/or recurrences in a subject currently or previously suffering from a disease; and/or prolonging, i.e., increasing, the life span of a subject. Specifically, the term "treatment of a disease" includes curing, shortening the duration, ameliorating, preventing, slowing or inhibiting the progression or worsening of a disease, or preventing or delaying the onset of a disease or its symptoms.
“处于风险”是指与一般群体相比,被鉴定为具有比正常几率更高的疾病(尤其是癌症)发生几率的受试者,即患者。此外,曾经患有或目前患有疾病(尤其是癌症)的受试者是疾病发生风险增加的受试者,因为该受试者可能会继续发生疾病。目前患有或曾经患有癌症的受试者也具有更高的癌症转移风险。"At risk" refers to a subject, i.e., a patient, who is identified as having a higher than normal chance of developing a disease, especially cancer, compared to the general population. In addition, a subject who has had or currently has a disease, especially cancer, is a subject at increased risk of developing the disease because the subject may go on to develop the disease. A subject who currently has or has had cancer also has a higher risk of cancer metastasis.
本文所述的治疗活性剂、疫苗和组合物可通过任何常规途径施用,包括注射或输注。The therapeutically active agents, vaccines and compositions described herein may be administered by any conventional route, including injection or infusion.
本文所述的试剂以有效量施用。“有效量”是指单独或与其他剂量或与其他治疗剂一起达到所需反应或所需效果的量。在治疗特定疾病或特定病症的情况下,所需反应优选涉及抑制疾病进程。这包括减缓疾病的进展,特别是中断或逆转疾病的进展。在疾病或病症的治疗中,所需反应也可以是延迟所述疾病或病症的发作或预防所述疾病或病症的发作。The agents described herein are administered in an effective amount. An "effective amount" refers to an amount that, alone or in combination with other doses or other therapeutic agents, achieves a desired response or desired effect. In the case of treating a specific disease or a specific condition, the desired response preferably involves inhibiting the progression of the disease. This includes slowing the progression of the disease, particularly interrupting or reversing the progression of the disease. In the treatment of a disease or condition, the desired response may also be delaying the onset of the disease or condition or preventing the onset of the disease or condition.
本文所述的试剂的有效量将取决于待治疗的病症、疾病的严重程度、患者的个体参数(包括年龄、生理状况、大小和体重)、治疗持续时间、伴随治疗的类型(如有)、具体施用途径和类似因素。因此,本文所述的试剂的施用剂量可取决于各种此类参数。在患者对初始剂量的反应不充分的情况下,可以使用更高的剂量(或通过不同的、更局部的施用途径获得更高的有效剂量)。The effective amount of an agent described herein will depend on the condition to be treated, the severity of the disease, the individual parameters of the patient (including age, physical condition, size and weight), the duration of treatment, the type of concomitant treatment (if any), the specific route of administration, and similar factors. Thus, the dosage of an agent described herein administered may depend on a variety of such parameters. In the event that a patient does not respond adequately to an initial dose, a higher dose may be used (or a higher effective dose obtained by a different, more localized route of administration).
本文所述的药物组合物优选为无菌的,并含有有效量的治疗活性物质以产生所需反应或效果。The pharmaceutical compositions described herein are preferably sterile and contain an effective amount of the therapeutically active substance to produce the desired response or effect.
本文所述的药物组合物通常以药学上相容的量和药学上相容的制剂施用。术语“药学上相容的”是指不与药物组合物活性成分的作用相互作用的无毒材料。此类制剂通常可包含盐、缓冲物质、防腐剂、载体,补充免疫增强物质(例如佐剂,如CpG寡核苷酸、细胞因子、趋化因子、皂苷、GM-CSF和/或RNA),以及其他治疗活性化合物(如适用)。当用于医药时,盐类应是药学上相容的。The pharmaceutical compositions described herein are generally administered in pharmaceutically compatible amounts and pharmaceutically compatible formulations. The term "pharmaceutically compatible" refers to non-toxic materials that do not interact with the effects of the active ingredients of the pharmaceutical composition. Such formulations may generally include salts, buffer substances, preservatives, carriers, supplementary immune enhancing substances (e.g., adjuvants, such as CpG oligonucleotides, cytokines, chemokines, saponins, GM-CSF and/or RNA), and other therapeutically active compounds (if applicable). When used in medicine, salts should be pharmaceutically compatible.
如本文所用,术语“核酸分子”包括编码目标多肽或其片段的任何核酸分子。这种核酸分子无需与内源核酸序列100%同源或相同,但可表现出基本上的同一性。与内源序列具有“基本同一性”或“基本同源性”的多核苷酸通常能够与双链核酸分子的至少一条链杂交。“杂交”是指在各种严格条件下,互补多核苷酸序列(例如,本文所述的基因)或其部分之间形成双链分子的一对。(参见,例如,Wahl,G.M.and S.L.Berger(1987)MethodsEnzymol.152:399;Kimmel,A.R.(1987)Methods Enzymol.152:507)。例如,严格的盐浓度通常低于约750mM NaCl和75mM柠檬酸三钠,例如低于约500mM NaCl和50mM柠檬酸三钠,或低于约250mM NaCl和25mM柠檬酸三钠。低严格性杂交可在不存在有机溶剂(例如甲酰胺)的情况下获得,而高严格性杂交可在存在至少约35%的甲酰胺(例如至少约50%的甲酰胺)的情况下获得。严格的温度条件通常包括至少约30℃、至少约37℃或至少约42℃的温度。本领域技术人员熟知不同的附加参数,例如杂交时间、洗涤剂(例如十二烷基硫酸钠(SDS))的浓度以及载体DNA的纳入或排除。根据需要通过组合这些不同的条件来实现不同的严格程度。在某些实施方案中,杂交将在30℃下在750mM的NaCl、75mM的柠檬酸三钠和1%的SDS中进行。在某些实施方案中,杂交将在37℃下在500mM的NaCl、50mM的柠檬三钠、1%的SDS、35%的甲酰胺和100pg/ml的变性鲑鱼精DNA(ssDNA)中进行。在某些实施方案中,杂交将在42℃下在250mM的NaCl、25mM的柠檬三钠、1%的SDS、50%的甲酰胺和200pg/ml的ssDNA中进行。对于本领域技术人员而言,这些条件的有效变化是显而易见的。对于大多数应用而言,杂交后的洗涤步骤在严格性上也有所不同。洗涤严格条件可由盐浓度和温度定义。如上所述,可通过降低盐浓度或提高温度来提高洗涤严格性。例如,洗涤步骤的严格盐浓度可低于约30mMNaCl和3mM柠檬酸三钠,例如低于约15mM NaCl和1.5mM柠檬酸三钠。洗涤步骤的严格温度条件通常包括至少约25℃的温度、至少约42℃的温度或至少约68℃的温度。在某些实施方案中,洗涤步骤将在25℃下在30mM的NaCl、3mM的柠檬酸三钠和0.1%的SDS中进行。在某些实施方案中,洗涤步骤将在42℃下在15mM的NaCl、1.5mM的柠檬酸三钠和0.1%的SDS中进行。在某些实施方案中,洗涤步骤将在68℃下在15mM的NaCl、1.5mM的柠檬酸三钠和0.1%的SDS中进行。杂交技术为本领域技术人员所熟知,例如描述于Benton and Davis(Science 196:180,1977);Grunstein and Rogness(Proc.Natl.Acad.Sci.,USA 72:3961,1975);Ausubelet al.(Current Protocols in Molecular Biology,Wiley Interscience,New York,2001);Berger and Kimmel(Guide to Molecular Cloning Techniques,1987,AcademicPress,New York);和Sambrook et al.,Molecular Cloning:A Laboratory Manual,ColdSpring Harbor Laboratory Press,New York中。As used herein, the term "nucleic acid molecule" includes any nucleic acid molecule encoding a target polypeptide or a fragment thereof. Such a nucleic acid molecule need not be 100% homologous or identical to an endogenous nucleic acid sequence, but may exhibit substantial identity. A polynucleotide having "substantial identity" or "substantial homology" to an endogenous sequence is generally capable of hybridizing to at least one strand of a double-stranded nucleic acid molecule. "Hybridization" refers to the formation of a pair of double-stranded molecules between complementary polynucleotide sequences (e.g., genes described herein) or portions thereof under various stringent conditions. (See, e.g., Wahl, G.M. and S.L. Berger (1987) Methods Enzymol. 152: 399; Kimmel, A.R. (1987) Methods Enzymol. 152: 507). For example, stringent salt concentrations are typically less than about 750 mM NaCl and 75 mM trisodium citrate, such as less than about 500 mM NaCl and 50 mM trisodium citrate, or less than about 250 mM NaCl and 25 mM trisodium citrate. Low stringency hybridization can be obtained in the absence of an organic solvent (e.g., formamide), while high stringency hybridization can be obtained in the presence of at least about 35% formamide (e.g., at least about 50% formamide). Stringent temperature conditions typically include a temperature of at least about 30° C., at least about 37° C., or at least about 42° C. Those skilled in the art are familiar with different additional parameters, such as hybridization time, the concentration of a detergent (e.g., sodium dodecyl sulfate (SDS)), and the inclusion or exclusion of carrier DNA. Different stringencies are achieved by combining these different conditions as needed. In certain embodiments, hybridization will be carried out at 30° C. in 750 mM NaCl, 75 mM trisodium citrate, and 1% SDS. In certain embodiments, hybridization will be carried out at 37°C in 500 mM NaCl, 50 mM trisodium citrate, 1% SDS, 35% formamide, and 100 pg/ml denatured salmon sperm DNA (ssDNA). In certain embodiments, hybridization will be carried out at 42°C in 250 mM NaCl, 25 mM trisodium citrate, 1% SDS, 50% formamide, and 200 pg/ml ssDNA. Effective variations of these conditions are apparent to those skilled in the art. For most applications, the post-hybridization washing steps also vary in stringency. Washing stringency conditions can be defined by salt concentration and temperature. As described above, washing stringency can be increased by reducing salt concentration or increasing temperature. For example, the stringent salt concentration of the washing step can be less than about 30 mM NaCl and 3 mM trisodium citrate, such as less than about 15 mM NaCl and 1.5 mM trisodium citrate. Stringent temperature conditions for the washing steps typically include a temperature of at least about 25° C., a temperature of at least about 42° C., or a temperature of at least about 68° C. In certain embodiments, the washing steps will be carried out at 25° C. in 30 mM NaCl, 3 mM trisodium citrate, and 0.1% SDS. In certain embodiments, the washing steps will be carried out at 42° C. in 15 mM NaCl, 1.5 mM trisodium citrate, and 0.1% SDS. In certain embodiments, the washing steps will be carried out at 68° C. in 15 mM NaCl, 1.5 mM trisodium citrate, and 0.1% SDS. Hybridization techniques are well known to those skilled in the art and are described, for example, in Benton and Davis (Science 196:180, 1977); Grunstein and Rogness (Proc. Natl. Acad. Sci., USA 72:3961, 1975); Ausubel et al. (Current Protocols in Molecular Biology, Wiley Interscience, New York, 2001); Berger and Kimmel (Guide to Molecular Cloning Techniques, 1987, Academic Press, New York); and Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, New York.
“基本上相同”或“基本上同源”是指与参考氨基酸序列(例如,本文所述的任何一种氨基酸序列)或核酸序列(例如,本文所述的任何一种核酸序列)至少约50%同源或相同的多肽或核酸分子。在某些实施方案中,这种序列与用于比较的氨基酸或核酸序列至少约60%、至少约65%、至少约70%、至少约75%、至少约80%、至少约85%、至少约90%、至少约95%、至少约99%或至少约100%同源或相同。"Substantially identical" or "substantially homologous" refers to a polypeptide or nucleic acid molecule that is at least about 50% homologous or identical to a reference amino acid sequence (e.g., any of the amino acid sequences described herein) or nucleic acid sequence (e.g., any of the nucleic acid sequences described herein). In certain embodiments, such a sequence is at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 99%, or at least about 100% homologous or identical to the amino acid or nucleic acid sequence used for comparison.
序列同一性可通过使用序列分析软件(例如,Sequence Analysis SoftwarePackage of the Genetics Computer Group,University of Wisconsin BiotechnologyCenter,1710 University Avenue,Madison,Wis.53705,BLAST、BESTFIT、GAP或PILEUP/PRETTYBOX程序)进行测量。这种软件通过对各种取代、缺失和/或其他修饰分配同源性程度来匹配相同或相似的序列。保守取代通常包括下列组内的取代:甘氨酸、丙氨酸;缬氨酸、异亮氨酸、亮氨酸;天门冬氨酸、谷氨酸、天门冬酰胺、谷氨酰胺;丝氨酸、苏氨酸;赖氨酸、精氨酸;苯丙氨酸、酪氨酸。在确定同一性程度的示例性方法中,可以使用BLAST程序,通过e-3和e-100之间的概率得分表示密切相关的序列。Sequence identity can be measured using sequence analysis software (e.g., Sequence Analysis Software Package of the Genetics Computer Group, University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, Wis. 53705, BLAST, BESTFIT, GAP or PILEUP/PRETTYBOX programs). Such software matches identical or similar sequences by assigning degrees of homology to various substitutions, deletions and/or other modifications. Conservative substitutions typically include substitutions within the following groups: glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid, asparagine, glutamine; serine, threonine; lysine, arginine; phenylalanine, tyrosine. In an exemplary method of determining the degree of identity, the BLAST program can be used, with closely related sequences represented by probability scores between e-3 and e-100.
“类似物”是指具有参考多肽或核酸分子功能的结构相关的多肽或核酸分子。An "analog" refers to a structurally related polypeptide or nucleic acid molecule that has the function of a reference polypeptide or nucleic acid molecule.
除非在上下文中明确说明或显而易见,本文使用的术语“约”应被理解为在本领域的正常容差范围内,例如在平均值的2个标准偏差内。约可理解为在规定值的50%、45%、40%、35%、30%、25%、20%、15%、10%、9%、8%、7%、6%、5%、4%、3%、2%、1%、0.5%、0.1%、0.05%或0.01%范围内。除非上下文中另有明确规定,本文提供的所有数值均由术语“约”修饰。Unless expressly stated or apparent in the context, the term "about" as used herein is to be understood as being within the normal tolerance range of the art, such as within 2 standard deviations of the mean. Approximately can be understood as being within 50%, 45%, 40%, 35%, 30%, 25%, 20%, 15%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.5%, 0.1%, 0.05% or 0.01% of the specified value. Unless expressly stated otherwise in the context, all numerical values provided herein are modified by the term "about".
本文使用的“可转座元件”是一种重复的DNA序列,其通过逆转录酶(I类TE或逆转录转座子)产生的RNA拷贝或通过从其初始位置(II类TE或DNA转座子)切除自体,从而能在基因组中从一个位置移动到另一个位置。因此,它包括基因组内源性部分(即非来自于感染)的I类(逆转录转座子,包括含有LTR、LINE和SINE的那些)和基因组内源性部分(即非来自于感染)的II类(DNA转座子)。这包括来自两类的自主和非自主元件。根据本公开,TE序列可以,例如,选自I类TE,如包括内源性逆转录病毒(ERV)、长分散核元件(LINE)和短分散核元件(SINE)和哺乳动物长末端重复转座子(MaLR)的逆转录转座子,以及II类TE,如基因组内源性部分的DNA转座子。"Transposable element" as used herein is a repetitive DNA sequence that can be moved from one position to another in the genome by RNA copies produced by reverse transcriptase (class I TE or retrotransposon) or by excision of itself from its original position (class II TE or DNA transposon). Therefore, it includes class I (retrotransposon, including those containing LTR, LINE and SINE) in the endogenous part of the genome (i.e. not from infection) and class II (DNA transposon) in the endogenous part of the genome (i.e. not from infection). This includes autonomous and non-autonomous elements from both classes. According to the present disclosure, TE sequences can, for example, be selected from class I TE, such as retrotransposons including endogenous retroviruses (ERVs), long interspersed nuclear elements (LINEs) and short interspersed nuclear elements (SINEs) and mammalian long terminal repeat transposons (MaLRs), and class II TE, such as DNA transposons in the endogenous part of the genome.
迄今为止,逆转录转座子数量更多,其特征类似于逆转录病毒,如HIV。逆转录转座子通过RNA中间体复制机制的逆转录发挥作用。它们通常分组为三个主要类别:在逆转录因子主体旁侧的具有长末端重复(LTR)的逆转录转座子,其编码与逆转录病毒类似的逆转录酶;具有长分散核元件(LINE、LINE-1或L1)的逆转录子,其编码逆转录酶但缺乏LTR,且由RNA聚合酶II转录;以及具有短分散核元件(SINE)的逆转录转座子,其不编码逆转录酶,且由RNA聚合酶III转录。DNA转座子具有不涉及RNA中间体的转座机制。转座由几种转座酶催化。LTR包括内源性逆转录病毒(ERV),而非LTR TE又细分为长分散(LINE)和短分散元件(SINE),由LINE整合机制动员的非自主转座子。这些谱系由系统发育相关的家族组成,进一步分支成多个亚家族,每个亚家族源自一个前体拷贝。随着时间的推移,突变的积累导致每个亚家族成员的共有序列出现分歧。对于TE逆转录转座子的综述,参见ichardson,SandraR et al.“The Influence of LINE-1 and SINE Retrotransposons on MammalianGenomes.”Microbiology spectrum vol.3,2(2015):MDNA3-0061-2014。To date, there are more retrotransposons, which have characteristics similar to those of retroviruses such as HIV. Retrotransposons work by reverse transcription of an RNA intermediate replication mechanism. They are generally grouped into three main categories: retrotransposons with long terminal repeats (LTRs) flanking the main body of the retrovirus, which encode reverse transcriptases similar to retroviruses; retrotransposons with long interspersed nuclear elements (LINE, LINE-1, or L1), which encode reverse transcriptases but lack LTRs and are transcribed by RNA polymerase II; and retrotransposons with short interspersed nuclear elements (SINEs), which do not encode reverse transcriptases and are transcribed by RNA polymerase III. DNA transposons have a transposition mechanism that does not involve RNA intermediates. Transposition is catalyzed by several transposases. LTRs include endogenous retroviruses (ERVs), while non-LTR TEs are further subdivided into long interspersed (LINEs) and short interspersed elements (SINEs), non-autonomous transposons mobilized by the LINE integration mechanism. These lineages consist of phylogenetically related families that further branch into multiple subfamilies, each derived from a precursor copy. Over time, the accumulation of mutations leads to divergence in the consensus sequence of each subfamily member. For a review of TE retrotransposons, see Richardson, Sandra R et al. "The Influence of LINE-1 and SINE Retrotransposons on Mammalian Genomes." Microbiology spectrum vol. 3, 2 (2015): MDNA3-0061-2014.
经典的L1元件的长度约为6000个碱基对(bp),且由两个非重叠的开放阅读框(ORF)组成,其旁侧为非翻译区(UTR)和靶位点重复。LINE-1逆转录转座子在哺乳动物基因组中的扩增已超过1.6亿年。在人类中,自大约6500万至7500万年前祖先鼠和人类谱系分离以来,绝大多数LINE-1序列均已扩增。单独的基因组LINE-1序列与从现代活跃的LINE-1衍生的共有序列之间的序列比较可用于估算基因组LINE-1的年龄(Khan H,Smit A,Boissinot S;Genome Res.2006 Jan;16(1):78-87)。L1亚家族通常分为老(L1M、AluJ)、中间(L1P、L1PB、AluS)、年轻(L1HS、L1PA、AluY)和相关(HAL、FAM)亚家族。在人类中,唯一自主活跃的家族是长分散元件-1(LINE-1或L1),但少数L1拷贝仍具有逆转录转座能力,所有这些拷贝均属于最年轻的人特异性L1HS亚家族。The classical L1 element is approximately 6000 base pairs (bp) in length and consists of two non-overlapping open reading frames (ORFs) flanked by untranslated regions (UTRs) and target site repeats. LINE-1 retrotransposons have been amplified in mammalian genomes for more than 160 million years. In humans, the vast majority of LINE-1 sequences have been amplified since the separation of the ancestral mouse and human lineages approximately 65 to 75 million years ago. Sequence comparisons between individual genomic LINE-1 sequences and consensus sequences derived from modern active LINE-1 can be used to estimate the age of genomic LINE-1 (Khan H, Smit A, Boissinot S; Genome Res. 2006 Jan; 16(1): 78-87). L1 subfamilies are generally divided into old (L1M, AluJ), intermediate (L1P, L1PB, AluS), young (L1HS, L1PA, AluY) and related (HAL, FAM) subfamilies. In humans, the only autonomously active family is long interspersed element-1 (LINE-1 or L1), but a few L1 copies still have retrotransposition capacity, all of which belong to the youngest, human-specific L1HS subfamily.
SVA元件包括一个进化上年轻的、非自主的逆转录转座子家族,该家族在大约2500万年前的灵长目动物谱系中出现(Hancks DC,Kazazian HH Jr,Semin Cancer Biol.2010Aug;20(4):234-45)。经典的SVA元件约为2000bp,并具有由以下组成的复合结构:1)六聚体CCCTCT重复序列;2)反相类Alu元素重复序列;3)一组富含GC的可变核苷酸串联重复序列(VNTR);4)与HERVK-10(一种无活性LTR逆转录转座子)具有共享同源性的SINE-R序列;和5)经典的聚腺苷酸切割特异性因子(CPSF)结合位点,其后是聚(A)尾。最年轻的SVA亚家族包括SVA-D、SVA-E、SVA-F和SVA-F1亚家族。SVA elements comprise an evolutionarily young, non-autonomous family of retrotransposons that emerged in the primate lineage approximately 25 million years ago (Hancks DC, Kazazian HH Jr, Semin Cancer Biol. 2010 Aug; 20(4): 234-45). The classical SVA element is approximately 2000 bp and has a complex structure consisting of: 1) hexameric CCCTCT repeats; 2) inverted Alu-like repeats; 3) a set of GC-rich variable nucleotide tandem repeats (VNTRs); 4) a SINE-R sequence that shares homology with HERVK-10 (an inactive LTR retrotransposon); and 5) a classical polyadenylation cleavage specificity factor (CPSF) binding site followed by a poly(A) tail. The youngest SVA subfamilies include the SVA-D, SVA-E, SVA-F, and SVA-F1 subfamilies.
“信使RNA(mRNA)”是对应于基因遗传序列的单链RNA分子,并且在生产蛋白质的过程中由核糖体读取。mRNA在转录过程中产生,在此过程中,RNA聚合酶将基因转化为初始转录物mRNA(也称为前体timRNA)。该前体mRNA通常仍含有内含子,这些区域不会继续编码最终的氨基酸序列。它们在RNA剪接过程中被去除,只留下外显子,那些编码蛋白质的区域。该外显子序列构成成熟mRNA。成熟mRNA随后被核糖体读取,并利用转运RNA(tRNA)携带的氨基酸,核糖体产生肽序列的这一过程称为翻译。"Messenger RNA (mRNA)" is a single-stranded RNA molecule that corresponds to the genetic sequence of a gene and is read by the ribosome in the process of producing proteins. mRNA is produced during transcription, during which RNA polymerase converts the gene into an initial transcript mRNA (also called pre-timRNA). This pre-mRNA usually still contains introns, which are regions that do not go on to encode the final amino acid sequence. They are removed during the RNA splicing process, leaving only the exons, those regions that encode proteins. This exon sequence constitutes the mature mRNA. The mature mRNA is then read by the ribosome, and using the amino acids carried by the transfer RNA (tRNA), the ribosome produces a peptide sequence. This process is called translation.
本文所指的“转录物”是由生物体表达的信使RNA(或mRNA)或mRNA的一部分,特别是在特定组织或甚至特定组织中。转录物的表达取决于多种因素而异。与正常健康细胞相比,转录物在癌细胞中的表达可能发生改变。As used herein, a "transcript" is a messenger RNA (or mRNA) or a portion of an mRNA that is expressed by an organism, particularly in a specific tissue or even in a specific tissue. The expression of a transcript varies depending on a variety of factors. The expression of a transcript in cancer cells may be altered compared to normal healthy cells.
本文所指的“转录组”是由细胞基因表达或转录的完整组的信使RNA或mRNA分子。在一些实施方案中,术语“转录组”也可用于描述在特定细胞(或组织类型)中产生的一系列mRNA转录物。与以其稳定性为特征的基因组相比,转录组变化活跃。事实上,生物体的转录组取决于多种因素而异,包括发育阶段、环境和生理条件。通常,与相应的正常健康细胞相比,癌细胞中的转录组也发生了改变。通常,本文所指的转录组是人转录组。术语“转录组模式”和“转录组”在本文中用作同义词。" Transcriptome " referred to herein is the messenger RNA or mRNA molecule of the complete group expressed or transcribed by cell gene.In some embodiments, the term " transcriptome " can also be used to describe a series of mRNA transcripts produced in a specific cell (or tissue type).Compared with the genome characterized by its stability, the transcriptome changes actively.In fact, the transcriptome of an organism depends on a variety of factors and varies, including developmental stage, environment and physiological conditions.Usually, compared with corresponding normal healthy cells, the transcriptome in cancer cells has also changed.Usually, the transcriptome referred to herein is the human transcriptome.The terms " transcriptome pattern " and " transcriptome " are used as synonyms in this article.
阅读框是将核酸(DNA或RNA)分子中的核苷酸序列分成一组连续的、不重叠的三联体的方式。A reading frame is the way in which the nucleotide sequence in a nucleic acid (DNA or RNA) molecule is divided into a set of consecutive, non-overlapping triplets.
开放阅读框架(ORF)是阅读框中能够翻译成肽的部分。ORF是一段连续的密码子串,其包含位于转录起始位点的起始密码子(例如AUG)和终止密码子(例如UAA、UAG或UGA)。ORF中的ATG密码子(不一定是第一个)可以指示翻译的起始位置。转录终止位点位于ORF之后,翻译终止密码子之外。在具有多个外显子的真核基因中,ORF跨越内含子/外显子区域,在ORF转录后可剪接在一起,产生用于蛋白质翻译的最终mRNA。An open reading frame (ORF) is the portion of a reading frame that can be translated into a peptide. An ORF is a continuous string of codons that includes a start codon (e.g., AUG) and a stop codon (e.g., UAA, UAG, or UGA) located at the start site of transcription. The ATG codon in the ORF (not necessarily the first) can indicate the start position of translation. The transcription termination site is located after the ORF, outside the translation stop codon. In eukaryotic genes with multiple exons, the ORF spans the intron/exon region, which can be spliced together after the ORF is transcribed to produce the final mRNA for protein translation.
本文所指的“经典ORF”是在数据库(例如Ensembl基因组/转录组/蛋白质组数据库集合,通常为HG19)中描述或注释的、在一个mRNA序列中具有特定阅读框的蛋白质编码序列。通常,经典ORF与正常健康细胞的外显子之一相同。"Classic ORF" referred to herein is a protein coding sequence with a specific reading frame in an mRNA sequence described or annotated in a database (e.g., Ensembl genome/transcriptome/proteome database collection, usually HG19). Typically, a classic ORF is identical to one of the exons of a normal healthy cell.
本文所指的“非经典ORF”是在基因组数据库(例如Ensembl基因组/转录组/蛋白质组数据库)中未描述的(即未注释的)、在mRNA序列中具有特定阅读框的蛋白质编码序列。因此,通常,非经典ORF意为,与正常健康细胞中的外显子的常规阅读框相比,阅读框发生了移码。然而,在一些实施方案中,非经典的可以在基因组数据库(例如Ensembl数据库)中描述,但是该mRNA序列代表正常细胞中的次要种型。就次要种型而言,其在正常细胞中通常低于5%,特别是低于2%,或优选低于1%。"Non-classical ORF" as referred to herein is a protein coding sequence that is not described (i.e., not annotated) in a genome database (e.g., the Ensembl genome/transcriptome/proteome database) and has a specific reading frame in an mRNA sequence. Therefore, in general, non-classical ORF means that the reading frame is frameshifted compared to the conventional reading frame of the exon in a normal healthy cell. However, in some embodiments, non-classical can be described in a genome database (e.g., the Ensembl database), but the mRNA sequence represents a minor type in a normal cell. With respect to minor types, it is generally less than 5%, particularly less than 2%, or preferably less than 1% in a normal cell.
外显子是基因的任何部分,其将编码该基因在内含子通过RNA剪接被移除后产生的最终成熟RNA的一部分。术语“外显子”是指基因内的DNA序列和RNA转录物中的相应序列。在RNA剪接中,内含子被移除,并且将外显子作为产生成熟信使RNA的一部分彼此共价结合。根据本申请的外显子序列包含一个或多个外显子的至少一部分。通常,外显子序列包含一个或两个外显子的至少一部分。Exon is any part of a gene that will encode a portion of the final mature RNA produced by the gene after the introns are removed by RNA splicing. The term "exon" refers to a DNA sequence within a gene and the corresponding sequence in the RNA transcript. In RNA splicing, introns are removed and exons are covalently bound to each other as part of producing mature messenger RNA. An exon sequence according to the present application comprises at least a portion of one or more exons. Typically, an exon sequence comprises at least a portion of one or two exons.
剪接后成熟RNA中存在的3’端和5’端的非翻译序列(3’UTR和5’UTR)为外显子序列,但它们是非编码序列,因为这些序列位于翻译起始密码子上游(5’UTR)或翻译终止密码子下游(3’UTR)。The untranslated sequences at the 3' and 5' ends (3'UTR and 5'UTR) present in the mature RNA after splicing are exonic sequences, but they are non-coding sequences because these sequences are located upstream of the translation start codon (5'UTR) or downstream of the translation stop codon (3'UTR).
在本申请中,术语“融合转录物”、“嵌合转录物”、“TE-外显子转录物”或“连接外显子TE”(JET)被无区别地用作同义词。根据本公开,“融合或嵌合”“转录物或序列”被定义为部分与外显子序列对齐和部分与可转座元件(TE)序列对齐的转录物。融合或嵌合的转录物在本文也简称为JET(外显子和TE之间的连接)。通常,根据本说明书的融合转录物具有大于2.10-6的读长的标准化数量。读长的标准化数量被定义为覆盖融合的读长数量除以样品文库大小。In the present application, the terms "fusion transcript", "chimeric transcript", "TE-exon transcript" or "joined exon TE" (JET) are used indiscriminately as synonyms. According to the present disclosure, a "fusion or chimeric""transcript or sequence" is defined as a transcript that is partially aligned with an exon sequence and partially aligned with a transposable element (TE) sequence. Fusion or chimeric transcripts are also referred to herein as JETs (junctions between exons and TEs). Typically, fusion transcripts according to the present specification have a normalized number of reads greater than 2.10-6 . The normalized number of reads is defined as the number of reads covering the fusion divided by the sample library size.
术语“多肽”在本说明书中用于表示一系列残基,通常为L-氨基酸,通常通过相邻氨基酸的α-氨基和羧基之间的肽键相互连接。多肽或肽可以是多种长度,以中性(不带电荷)形式或以盐形式,并且不存在例如糖基化、侧链氧化或磷酸化等修饰,或包含这些修饰,前提是这种修饰不会破坏本文所述的多肽的生物活性。除非明确提及术语“多肽”,肽和蛋白可互换地指如本文所述的JET衍生的新抗原肽、多肽或蛋白。The term "polypeptide" is used in this specification to refer to a series of residues, usually L-amino acids, which are usually connected to each other by peptide bonds between the α-amino and carboxyl groups of adjacent amino acids. Polypeptides or peptides can be of various lengths, in neutral (uncharged) form or in salt form, and are free of modifications such as glycosylation, side chain oxidation or phosphorylation, or contain these modifications, provided that such modifications do not destroy the biological activity of the polypeptides described herein. Unless the term "polypeptide" is explicitly mentioned, peptides and proteins refer interchangeably to JET-derived neoantigenic peptides, polypeptides or proteins as described herein.
如本文所用,pJET是来源于(即编码自)嵌合/融合转录物或JET的肽或多肽。pJET在本文中也被称为翻译JET。As used herein, pJET is a peptide or polypeptide derived from (ie, encoded by) a chimeric/fusion transcript or JET. pJET is also referred to herein as translated JET.
“参考基因组”或“代表性基因组”是由科学家组装的数字核酸序列数据库,其作为基因种型集合的代表性例子。由于参考基因组通常是通过对多个供体的DNA进行测序而组装的,因此无法准确代表任何单个个体(动物或人)的基因集合。相反,参考提供了来自每个供体的不同DNA序列的单倍体镶嵌图。A "reference genome" or "representative genome" is a digital nucleic acid sequence database assembled by scientists that serves as a representative example of a collection of genotypes. Because a reference genome is usually assembled by sequencing DNA from multiple donors, it cannot accurately represent the collection of genes of any single individual (animal or human). Instead, the reference provides a haploid mosaic of different DNA sequences from each donor.
RNA-Seq(称为RNA测序的简称)是利用下一代测序(NGS)来揭示生物样品中RNA(通常是信使RNA,mRNA)的存在和数量并产生大量原始测序读长(通常至少数千万)的测序技术。单细胞RNA测序(scRNA-Seq)提供了单个细胞的表达谱。读长是指来自生物样品或单个细胞的RNA片段的RNA序列。被测序的RNA样品称为RNA文库。因此,RNA测序数据通常被称为RNA读长。RNA-Seq (short for RNA sequencing) is a sequencing technology that uses next-generation sequencing (NGS) to reveal the presence and quantity of RNA (usually messenger RNA, mRNA) in biological samples and produces a large number of raw sequencing reads (usually at least tens of millions). Single-cell RNA sequencing (scRNA-Seq) provides expression profiles of individual cells. Reads refer to the RNA sequence of RNA fragments from a biological sample or a single cell. The sequenced RNA sample is called an RNA library. Therefore, RNA sequencing data is often referred to as RNA reads.
在本申请中,“MHC分子”或“HLA分子”是指至少一种MHC/HLA I类分子或至少一种MHC/HLA II类分子。MHC I类蛋白在机体大多数有核细胞上形成功能性受体。在HLA中有3种主要的MHC I类基因:HLA-A、HLA-B、HLA-C,和3种次要基因HLA-E、HLA-F和HLA-G。32-微球蛋白与主要和次要基因亚单位结合产生异二聚体。I类的MHC分子由重链和轻链组成,并且可结合约8-11个氨基酸的肽,但如果该肽具有合适的结合基序,则通常为8或9个氨基酸,并将其呈递给细胞毒性T-淋巴细胞。通过肽主链中的原子与所有MHC I类分子的肽结合槽中的恒定位点之间的接触,使肽的结合在其两端稳定。在结合肽的氨基端和羧基端的槽的两端有恒定位点。肽长度的变化通过肽骨架的扭结来适应,通常是在允许所需灵活性的脯氨酸或甘氨酸残基处。与I类MHC分子结合的肽通常来源于内源性蛋白质抗原。例如,在人中,I类MHC分子的重链通常为HLA-A、HLA-B或HLA-C单体,并且轻链为β-2-微球蛋白。有3种主要的和2种次要的由HLA编码的MHC II类蛋白。II类基因结合形成通常在抗原呈递细胞表面表达的异二聚体(αβ)蛋白受体。通过II类MHC分子结合的肽通常来源于细胞外或外源蛋白抗原。例如,在人中,α链和β链尤其是HLA-DR、HLA-DQ和HLA-DP单体。MHC II类分子能够结合约8-20个氨基酸的肽,特别是10-25个氨基酸或13-25个氨基酸(如果该肽具有合适的结合基序),并将其呈递给辅助性T细胞。该肽沿MHCI肽结合槽呈延伸构象,该结合槽(不同于MHC I类肽结合槽)两端开放。它主要通过主链原子与排列在肽结合槽中的保守残基的接触而被固定。In the present application, "MHC molecule" or "HLA molecule" refers to at least one MHC/HLA class I molecule or at least one MHC/HLA class II molecule. MHC class I proteins form functional receptors on most nucleated cells in the body. There are three major MHC class I genes in HLA: HLA-A, HLA-B, HLA-C, and three minor genes HLA-E, HLA-F and HLA-G. 32-microglobulin binds to the major and minor gene subunits to produce heterodimers. Class I MHC molecules are composed of heavy chains and light chains, and can bind peptides of about 8-11 amino acids, but if the peptide has a suitable binding motif, it is usually 8 or 9 amino acids, and presents it to cytotoxic T-lymphocytes. The binding of the peptide is stabilized at both ends by contact between atoms in the peptide backbone and constant sites in the peptide binding groove of all MHC class I molecules. There are constant sites at both ends of the groove that binds the amino and carboxyl ends of the peptide. Variations in peptide length are accommodated by kinking of the peptide backbone, typically at proline or glycine residues that allow the required flexibility. Peptides that bind to class I MHC molecules are typically derived from endogenous protein antigens. For example, in humans, the heavy chains of class I MHC molecules are typically HLA-A, HLA-B, or HLA-C monomers, and the light chains are beta-2-microglobulin. There are three major and two minor MHC class II proteins encoded by HLA. Class II genes bind to form heterodimeric (αβ) protein receptors that are typically expressed on the surface of antigen presenting cells. Peptides that are bound by class II MHC molecules are typically derived from extracellular or exogenous protein antigens. For example, in humans, the α and β chains are particularly HLA-DR, HLA-DQ, and HLA-DP monomers. MHC class II molecules are able to bind peptides of about 8-20 amino acids, particularly 10-25 amino acids or 13-25 amino acids (if the peptide has a suitable binding motif), and present them to helper T cells. The peptide is in an extended conformation along the MHC class I peptide binding groove, which (unlike the MHC class I peptide binding groove) is open at both ends. It is primarily held in place by contacts between backbone atoms and conserved residues lining the peptide binding groove.
术语“肽组”是指由特定基因组表达的或存在于特定生物体或细胞类型(例如癌细胞)中的一整套肽。因此,蛋白质学组分析(蛋白质组学)是指在特定时间点对基因组、细胞或组织表达的整套肽或蛋白进行分离、鉴定和定量。The term "peptide group" refers to the complete set of peptides expressed by a specific genome or present in a specific organism or cell type (e.g., cancer cells). Thus, proteomic analysis (proteomics) refers to the separation, identification, and quantification of the complete set of peptides or proteins expressed by a genome, cell, or tissue at a specific time point.
蛋白质组学分析通常基于两种主要技术,即双向凝胶电泳(2-DGE)(Harper S etal.,In:Coligan JE,Dunn BM,Speicher DW,Wing-field PT,editors.Current Protocolsin Protein Science.John Wiley&Sons;Hoboken,N.J.:1998.pp.10.4.1-10.4.36.)和质谱(MS)(Aebersold&Mann,2003),其均是分析蛋白复杂质混合物的有力方法。HPLC是蛋白质组研究的替代分离技术,特别是在低分子量蛋白质和肽的分离和鉴定方面(Garbis etal.,2005)。MS允许基于气相中离子的质荷比(m/z)确定蛋白质或肽的分子质量。术语“基于凝胶的”或“无凝胶的”蛋白质组学用于有关于所应用的分离技术,2-DGE或HPLC;蛋白质组学方法也可以是“自下而上”或“自上而下”,其基本上分别从它们的蛋白酶(例如,胰蛋白酶)消化物或通过质谱仪作为整体来鉴定蛋白质。Proteomic analysis is usually based on two main techniques, namely two-dimensional gel electrophoresis (2-DGE) (Harper S et al., In: Coligan JE, Dunn BM, Speicher DW, Wing-field PT, editors. Current Protocols in Protein Science. John Wiley &Sons; Hoboken, N.J.: 1998. pp. 10.4.1-10.4.36.) and mass spectrometry (MS) (Aebersold & Mann, 2003), which are powerful methods for analyzing complex mixtures of proteins. HPLC is an alternative separation technique for proteomic research, especially in the separation and identification of low molecular weight proteins and peptides (Garbis et al., 2005). MS allows the molecular mass of a protein or peptide to be determined based on the mass-to-charge ratio (m/z) of ions in the gas phase. The terms "gel-based" or "gel-free" proteomics are used in relation to the separation technique applied, 2-DGE or HPLC; proteomics approaches may also be "bottom-up" or "top-down," essentially identifying proteins as a whole from their protease (e.g., trypsin) digests or by mass spectrometry, respectively.
自下而上的蛋白质组学是一种从生物样品(组织或细胞)中鉴定蛋白质并通过在质谱分析前对蛋白质进行蛋白水解消化来表征其氨基酸序列和翻译后修饰的常用方法。粗蛋白提取物经酶消化后,通常通过液相色谱结合质谱(一种称为鸟枪蛋白质组学的技术)进行一个或多个维度的肽分离。通过将蛋白水解肽的质量或其串联质谱与从序列数据库或肽谱库中注释的肽谱预测的质量进行比较,可以鉴定肽,并将多种肽鉴定组装成蛋白质鉴定。Bottom-up proteomics is a common approach to identify proteins from biological samples (tissues or cells) and characterize their amino acid sequence and post-translational modifications by proteolytic digestion of proteins prior to mass spectrometric analysis. Crude protein extracts are enzymatically digested and then separated into peptides in one or more dimensions, usually by liquid chromatography coupled to mass spectrometry (a technique known as shotgun proteomics). Peptides can be identified by comparing the masses of proteolytic peptides or their tandem mass spectra with the masses predicted from annotated peptide spectra from sequence databases or peptide spectral libraries, and multiple peptide identifications can be assembled into a protein identification.
在自上而下的蛋白质组学中,完整的蛋白质在消化和/或片段化前在质谱仪中或通过2D电泳进行纯化。自上而下的蛋白质组学使用离子捕获质谱仪以存储分离的蛋白质离子用于质量测量和串联质谱(MS/MS)分析,或使用其他蛋白质纯化方法,例如结合MS/MS的双向凝胶电泳。In top-down proteomics, intact proteins are purified in a mass spectrometer or by 2D electrophoresis prior to digestion and/or fragmentation. Top-down proteomics uses an ion trap mass spectrometer to store isolated protein ions for mass measurement and tandem mass spectrometry (MS/MS) analysis, or other protein purification methods such as two-dimensional gel electrophoresis coupled to MS/MS.
根据MS生成的数据,该蛋白质通过人工质谱分析进行从头测序,或通过SEQUEST、Mascot、Phenyx、X!Tandem和OMSSA等序列搜索引擎自动处理。这些算法是基于实验和理论MS/MS数据之间的相关性开发的;后者由蛋白质数据库如UniProt/Swiss-Prot的计算机模拟酶切产生(Deutsch,Lam,&Aebersold,2008)。Based on the data generated by MS, the protein is sequenced de novo by manual mass spectrometry analysis or automatically processed by sequence search engines such as SEQUEST, Mascot, Phenyx, X! Tandem and OMSSA. These algorithms are developed based on the correlation between experimental and theoretical MS/MS data; the latter are generated by computer-simulated enzyme digestion of protein databases such as UniProt/Swiss-Prot (Deutsch, Lam, & Aebersold, 2008).
术语“免疫肽组”,通常也称为“免疫肽组模式”、“pMHC汇辑(pMHC repertoire)”或“MHC-配体组(MHC-ligandome)”或“HLA配体组(HLA ligandome)”,是指特定细胞类型内的一整套肽,其在细胞表面结合至少一种MHC/HLA分子。相应地,“免疫肽组学”已成为描述MHC/HLA-配体组分析的术语。最常见的免疫肽组学方法依赖于质谱(MS)。免疫肽组学样品通常从裂解的细胞或组织中分离的MHC,例如通过使用等位基因特异性抗体、泛特异性抗体或工程化亲和标签系统来制备。分离的复合物被酸洗脱,并且使用截留分子量(MWCO)、固相提取或其他技术从MHC分子中纯化肽,并且随后通过MS分析(例如参见综述L.E.Stopfer etal.,Immuno-Oncology and Technology,Volume 11,2021,100042)。The term "immunopeptidome", also often referred to as "immunopeptidome pattern", "pMHC repertoire" or "MHC-ligandome" or "HLA ligandome", refers to a complete set of peptides within a specific cell type that bind to at least one MHC/HLA molecule on the cell surface. Accordingly, "immunopeptidomics" has become a term to describe the analysis of MHC/HLA-ligandomes. The most common immunopeptidomics methods rely on mass spectrometry (MS). Immunopeptidomics samples are typically prepared from MHC isolated from lysed cells or tissues, for example by using allele-specific antibodies, pan-specific antibodies, or engineered affinity tag systems. The separated complexes are acid eluted, and the peptides are purified from the MHC molecules using molecular weight cutoff (MWCO), solid phase extraction or other techniques, and subsequently analyzed by MS (see, for example, review L.E.Stopfer et al., Immuno-Oncology and Technology, Volume 11, 2021, 100042).
如本文所用,术语“抗体”不仅指完整的抗体分子,还指保留免疫原结合能力的抗体分子片段。这种片段在本领域中也是众所周知的,并且经常在体外和体内使用。因此,如本文所用,术语“抗体”不仅指完整的免疫球蛋白分子,还指众所周知的活性片段F(ab′)2和Fab。As used herein, the term "antibody" refers not only to complete antibody molecules, but also to antibody molecule fragments that retain the ability to bind to an immunogen. Such fragments are also well known in the art and are often used in vitro and in vivo. Therefore, as used herein, the term "antibody" refers not only to complete immunoglobulin molecules, but also to the well-known active fragments F(ab') 2 and Fab.
F(ab′)2和缺乏完整抗体Fc片段的Fab片段从循环中更快清除,并且可能具有更少的完整抗体的非特异性组织结合(Wahl et ah,J.Nucl.Med.24:316-325(1983))。F(ab') 2 and Fab fragments lacking the Fc fragment of an intact antibody are cleared from the circulation more rapidly and may have less nonspecific tissue binding than intact antibodies (Wahl et ah, J. Nucl. Med. 24:316-325 (1983)).
如本文所用,抗体包括全天然抗体、双特异性抗体;嵌合抗体;Fab、Fab’、单链V区片段(scFv)、融合多肽和非常规抗体。As used herein, antibodies include all-natural antibodies, bispecific antibodies; chimeric antibodies; Fab, Fab', single-chain V region fragment (scFv), fusion polypeptides and unconventional antibodies.
在某些实施方案中,抗体是包含至少两条重链(H)和两条轻链(L)并通过二硫键相互连接的糖蛋白。每条重链由一个重链可变区(在本文简称为VH)和一个重链恒定区(CH)组成。重链恒定区由三个结构域,CH1、CH2和CH3组成。每条轻链由轻链可变区(在本文简称为VL)和轻链恒定区CL组成。轻链恒定区由一个结构域CL组成。VH区和VL区可进一步细分为高变区,称为互补决定区(CDR),与更保守的称为框架区(FR)的区域交叉。每一个VH和VL由三个CDR和四个FR组成,从氨基端至羧基端按下列顺序排列:FR1、CDR1、FR2、CDR2、FR3、CDR3、FR4。In certain embodiments, antibodies are glycoproteins comprising at least two heavy chains (H) and two light chains (L) interconnected by disulfide bonds. Each heavy chain consists of a heavy chain variable region (abbreviated herein as VH ) and a heavy chain constant region ( CH ). The heavy chain constant region consists of three domains, CH1, CH2, and CH3. Each light chain consists of a light chain variable region (abbreviated herein as VL ) and a light chain constant region CL . The light chain constant region consists of one domain CL . The VH and VL regions can be further subdivided into hypervariable regions, called complementarity determining regions (CDRs), which intersect with more conservative regions called framework regions (FRs). Each VH and VL consists of three CDRs and four FRs, arranged in the following order from amino terminus to carboxyl terminus: FR1, CDR1, FR2, CDR2, FR3, CDR3, FR4.
重链和轻链的可变区包含与抗原相互作用的结合结构域。抗体的恒定区可介导免疫球蛋白与宿主组织或因子的结合,包括免疫系统的各种细胞(例如效应细胞)和经典补体系统的第一组分(C1q)。The variable regions of the heavy and light chains contain the binding domain that interacts with the antigen. The constant regions of the antibodies mediate the binding of the immunoglobulin to host tissues or factors, including various cells of the immune system (e.g., effector cells) and the first component (C1q) of the classical complement system.
如本文所用,“CDR”被定义为抗体的互补决定区氨基酸序列,其为免疫球蛋白重链和轻链的高变区(参见,例如,Rabat et al.,Sequences of Proteins of ImmunologicalInterest,4th U.S.Department of Health and Human Services,National Institutesof Health(1987))。通常,抗体在可变区包含三个重链和三个轻链CDR或CDR区。CDR为抗体与抗原或表位的结合提供了大部分接触残基。在某些实施方案中,CDR区使用Rabat系统(Rabat,E.A.,et al.(1991)Sequences of Proteins of Immunological Interest,FifthEdition,ET.S.Department of Health and Human Services,NIH Publication no.91-3242)进行描述。As used herein, "CDR" is defined as the complementary determining region amino acid sequence of an antibody, which is a hypervariable region of an immunoglobulin heavy chain and light chain (see, e.g., Rabat et al., Sequences of Proteins of Immunological Interest, 4th U.S. Department of Health and Human Services, National Institutes of Health (1987)). Typically, an antibody comprises three heavy chain and three light chain CDRs or CDR regions in the variable region. The CDRs provide most of the contact residues for the binding of an antibody to an antigen or epitope. In certain embodiments, the CDR regions are described using the Rabat system (Rabat, E.A., et al. (1991) Sequences of Proteins of Immunological Interest, Fifth Edition, ET.S. Department of Health and Human Services, NIH Publication no. 91-3242).
如本文所用,术语“单链可变片段”或“scFv”是免疫球蛋白重链(VH)和轻链(VL)的可变区的融合蛋白,其共价连接以形成VH::VL异二聚体。VH和VL直接连接,或通过编码肽的接头(例如10、15、20、25个氨基酸)连接,该接头将VH的N端与VL的C端连接,或将VH的C端与VL的N端连接。针对灵活性,接头通常富含甘氨酸,同时针对溶解性,接头还富含丝氨酸或苏氨酸。尽管去除了恒定区并引入了接头,scFv蛋白仍保留了初始免疫球蛋白的特异性。单链Fv多肽抗体可由包含编码VH和VL的序列的核酸表达,如Huston,et al.(Proc.Nat.Acad.Sci.USA,85:5879-5883,1988)。还参见美国专利第5,091,513号、第5,132,405号和第4,956,778号;以及美国专利公开第20050196754号和第20050196754号。As used herein, the term "single-chain variable fragment" or "scFv" is a fusion protein of the variable regions of the heavy chain ( VH ) and light chain ( VL ) of an immunoglobulin, which are covalently linked to form a VH :: VL heterodimer. VH and VL are linked directly or through a linker (e.g., 10, 15, 20, 25 amino acids) encoding a peptide that connects the N-terminus of VH to the C-terminus of VL , or connects the C-terminus of VH to the N-terminus of VL . The linker is usually rich in glycine for flexibility, and rich in serine or threonine for solubility. Despite the removal of the constant region and the introduction of the linker, the scFv protein retains the specificity of the original immunoglobulin. Single-chain Fv polypeptide antibodies can be expressed from nucleic acids comprising sequences encoding VH and VL , such as Huston, et al. (Proc. Nat. Acad. Sci. USA, 85: 5879-5883, 1988). See also U.S. Patent Nos. 5,091,513; 5,132,405; and 4,956,778; and U.S. Patent Publication Nos. 20050196754 and 20050196754.
如本文所用,术语“亲和力”指结合强度的度量。亲和力取决于抗体结合位点和抗原决定簇之间的立体化学匹配的紧密程度、它们之间接触面积的大小和/或带电和疏水基团的分布。如本文所用,术语“亲和力”还包括“亲合力”,其是指在形成可逆复合物后抗原-抗体键的强度。计算抗体对抗原的亲和力的方法在本领域中是已知的,包括但不限于各种抗原结合实验,例如功能性测定(例如流式细胞术测定)、表面等离子体共振测定(例如BIACORE测定)和动力学排阻测定(例如KINEXA测定)。As used herein, the term "affinity" refers to a measure of binding strength. Affinity depends on the closeness of the stereochemical matching between the antibody binding site and the antigenic determinant, the size of the contact area between them and/or the distribution of charged and hydrophobic groups. As used herein, the term "affinity" also includes "avidity", which refers to the strength of the antigen-antibody bond after forming a reversible complex. The method for calculating the affinity of an antibody to an antigen is known in the art, including but not limited to various antigen binding experiments, such as functional assays (such as flow cytometry assays), surface plasmon resonance assays (such as BIACORE assays) and kinetic exclusion assays (such as KINEXA assays).
本文使用的术语“嵌合抗原受体”或“CAR”是指包含与能够激活或刺激免疫响应细胞的胞内信号结构域融合的细胞外抗原结合结构域和跨膜结构域的分子。在某些实施方案中,CAR的细胞外抗原结合结构域包含scFv。scFv可来源于将抗体的重链可变区和轻链可变区融合。作为替代或补充,scFv可来源于Fab’s(而非抗体,例如从Fab文库获得)。在某些实施方案中,scFv融合至跨膜结构域,然后融合至胞内信号传导结构域。在某些实施方案中,CAR对于抗原具有高结合亲和力或亲合力。The term "chimeric antigen receptor" or "CAR" as used herein refers to a molecule comprising an extracellular antigen binding domain and a transmembrane domain fused to an intracellular signaling domain capable of activating or stimulating an immune response cell. In certain embodiments, the extracellular antigen binding domain of CAR comprises scFv. ScFv may be derived from fusing the heavy chain variable region and the light chain variable region of an antibody. As an alternative or in addition, scFv may be derived from Fab's (rather than antibodies, such as obtained from a Fab library). In certain embodiments, scFv is fused to a transmembrane domain and then fused to an intracellular signaling domain. In certain embodiments, CAR has a high binding affinity or avidity for an antigen.
本文使用的术语“抗原结合结构域”是指能够特异性结合细胞上存在的特定抗原决定簇或一组抗原决定簇的结构域。As used herein, the term "antigen binding domain" refers to a domain that is capable of specifically binding to a particular antigenic determinant or a group of antigenic determinants present on a cell.
本文的术语“免疫细胞”通常包括T细胞、天然杀伤T细胞、CD4+/CD8+T细胞、TIL/肿瘤衍生的CD8T细胞、中央记忆CD8+T细胞、Treg、MAIT、YδT细胞、人胚胎干细胞和可分化出淋巴细胞的多能干细胞。The term "immune cells" herein generally includes T cells, natural killer T cells, CD4+/CD8+T cells, TIL/tumor-derived CD8T cells, central memory CD8+T cells, Treg, MAIT, YδT cells, human embryonic stem cells and pluripotent stem cells that can differentiate into lymphocytes.
“分离的细胞”是指与细胞天然伴随的分子和/或细胞成分分离的细胞。An "isolated cell" refers to a cell that is separated from molecules and/or cellular components that are naturally associated with the cell.
术语“分离的”、“纯化的”或“生物纯的”是指在不同程度上不合在其天然状态下通常伴随其的成分的材料。“分离”表示与初始来源或环境的分离程度。“纯化”表示分离程度高于分离。“纯化的”或“生物纯的”蛋白质充分地不含其他物质,从而使任何杂质不会严重影响蛋白质的生物性质或造成其他不利后果。也就是说,如果通过重组DNA技术生产的核酸或肽基本上不含细胞物质、病毒物质或培养基,或通过化学合成的核酸或肽基本上不含化学前体或其他化学物质,则该核酸或肽被纯化。纯度和均一性通常使用分析化学技术确定,例如聚丙烯酰胺凝胶电泳或高效液相色谱。术语“纯化的”可以表示核酸或蛋白质在电泳凝胶中基本上产生一个条带。对于可进行修饰(例如,磷酸化或糖基化)的蛋白质,不同的修饰可产生不同的分离的蛋白质,其可被单独纯化。The terms "isolated", "purified" or "biologically pure" refer to materials that are free to varying degrees from components that normally accompany them in their native state. "Isolated" means a degree of separation from the original source or environment. "Purified" means a degree of separation greater than separation. A "purified" or "biologically pure" protein is sufficiently free of other substances so that any impurities do not seriously affect the biological properties of the protein or cause other adverse consequences. That is, if a nucleic acid or peptide produced by recombinant DNA technology is substantially free of cellular material, viral material or culture medium, or a nucleic acid or peptide synthesized by chemical synthesis is substantially free of chemical precursors or other chemicals, the nucleic acid or peptide is purified. Purity and homogeneity are generally determined using analytical chemistry techniques, such as polyacrylamide gel electrophoresis or high performance liquid chromatography. The term "purified" can mean that a nucleic acid or protein produces essentially one band in an electrophoretic gel. For proteins that can be modified (e.g., phosphorylated or glycosylated), different modifications can produce different isolated proteins, which can be purified separately.
选择肿瘤新抗原肽的方法Method for selecting tumor neoantigen peptides
根据本公开用于选择肿瘤新抗原肽的方法包括The method for selecting tumor neoantigen peptides according to the present disclosure comprises
-从受试者的癌细胞样品中mRNA序列中鉴定融合转录物(或JET)序列的步骤,所述融合转录物(JET)序列包含可转座元件(TE)序列和外显子序列,且包括开放阅读框(ORF),以及- a step of identifying a fusion transcript (or JET) sequence from mRNA sequences in a cancer cell sample of a subject, wherein the fusion transcript (JET) sequence comprises a transposable element (TE) sequence and an exon sequence and includes an open reading frame (ORF), and
-选择由融合转录物序列的所述ORF的一部分编码的至少8个氨基酸的肿瘤新抗原肽的步骤,- a step of selecting a tumor neoantigenic peptide of at least 8 amino acids encoded by a portion of said ORF of the fusion transcript sequence,
其中所述ORF与TE和外显子序列之间的连接重叠,是纯TE和/或非经典的,并且wherein the ORF overlaps with the junction between the TE and exon sequence, is pure TE and/or non-canonical, and
其中所述肿瘤新抗原肽与所述受试者的至少一种主要组织相容性复合物(MHC)分子结合。Wherein the tumor neoantigen peptide binds to at least one major histocompatibility complex (MHC) molecule of the subject.
通常,从外显子序列的非经典ORF的一部分翻译的肽被免疫系统识别为非自体的。Typically, peptides translated from a portion of a non-canonical ORF in an exonic sequence are recognized by the immune system as non-self.
在一些实施方案中,外显子序列来自致癌基因和/或肿瘤抑制基因和/或来自其突变变体之一。In some embodiments, the exonic sequence is from an oncogene and/or a tumor suppressor gene and/or from one of its mutant variants.
从概念上讲,癌症是连续体细胞突变积累的结果。许多研究表明,致癌基因功能的获得和肿瘤抑制基因功能的丧失是正常细胞发展癌症所必须的。对于二倍体生物体,功能获得性突变通常为显性或半显性,而功能丧失性突变通常为隐性。肿瘤发生的两次突变假说认为,癌症的发展是由肿瘤抑制基因的两个等位基因的缺失引起的。Conceptually, cancer is the result of the accumulation of continuous somatic mutations. Many studies have shown that gain of function of oncogenes and loss of function of tumor suppressor genes are required for normal cells to develop cancer. For diploid organisms, gain-of-function mutations are usually dominant or semi-dominant, while loss-of-function mutations are usually recessive. The two-mutation hypothesis of tumorigenesis holds that the development of cancer is caused by the loss of both alleles of tumor suppressor genes.
致癌基因(也称为癌症基因)是其作用积极促进细胞增殖或生长的基因。正常的非突变型被称为原癌基因。该突变型具有过度或不适当的活性,导致肿瘤生长。致癌基因可在癌症基因标记数据库(CGMD)中鉴定(Pradeepkiran,J.,Sainath,S.,Kramthi Kumar,K.etal.CGMD:.Sci Rep 5,12035(2015)“An integrated database of cancer genes andmarkers”)。致癌基因(ONC)也可从癌症基因网络数据库(NCG 5.0)下载(An O,Dall′OlioGM,Mourikis TP,Ciccarelli FD,Nucleic Acids Res.2016 Jan 4;44(D1):D992-9;“NCG5.0:updates of a manually curated repository of cancer genes and associatedproperties from cancer mutational screenings”)。致癌基因的非限制性实例包括:L-MYC、LYL-1、LYT-10、LYT-10/Cα1、MAS、MDM-2、MLL、MOS、MTG8/AML1、MYB、MYH11/CBFB、NEU、N-MYC、OST、PAX-5、PBX1/E2A、PIM-1、PRAD-1、RAF、RAR/PML、RAS-H、RAS-K、RAS-N、REL/NRG、RET、RHOM1、RHOM2、ROS、SKI、SIS、SET/CAN、SRC、TAL1、TAL2、TAN-1、TIAMI、TSC2和TRK。Oncogenes (also called cancer genes) are genes whose action actively promotes cell proliferation or growth. Normal non-mutated types are called proto-oncogenes. The mutant type has excessive or inappropriate activity, leading to tumor growth. Oncogenes can be identified in the Cancer Gene Marker Database (CGMD) (Pradeepkiran, J., Sainath, S., Kramthi Kumar, K. et al. CGMD:. Sci Rep 5, 12035 (2015) "An integrated database of cancer genes and markers"). Oncogenes (ONC) can also be downloaded from the Cancer Gene Network Database (NCG 5.0) (An O, Dall'OlioGM, Mourikis TP, Ciccarelli FD, Nucleic Acids Res. 2016 Jan 4; 44 (D1): D992-9; "NCG5.0: updates of a manually curated repository of cancer genes and associated properties from cancer mutational screenings"). Non-limiting examples of oncogenes include: L-MYC, LYL-1, LYT-10, LYT-10/Cα1, MAS, MDM-2, MLL, MOS, MTG8/AML1, MYB, MYH11/CBFB, NEU, N-MYC, OST, PAX-5, PBX1/E2A, PIM-1, PRAD-1, RAF, RAR/PML, RAS-H, RAS-K, RAS-N, REL/NRG, RET, RHOM1, RHOM2, ROS, SKI, SIS, SET/CAN, SRC, TAL1, TAL2, TAN-1, TIAMI, TSC2, and TRK.
肿瘤抑制基因(也称为抑癌基因)代表细胞生长控制的另一面,通常作用于抑制细胞增殖和肿瘤发展。因此,肿瘤抑制基因通常是抑制细胞分裂或生长的基因。TSG功能的丧失会促进不受控制的细胞分裂和肿瘤生长。通过编码转录调控蛋白的视网膜母细胞瘤的遗传分析而鉴定的Rb(一种肿瘤抑制基因)作为鉴定导致多种不同人类癌症发展的其他肿瘤抑制基因的原型。肿瘤抑制基因尤其描述于“Cooper GM.The Cell:A MolecularApproach.2nd edition.Sunderland(MA):Sinauer Associates;2000.Tumor SuppressorGenes”中。肿瘤抑制基因(TSG)也可从肿瘤抑制基因数据库(TSGene 2.0)中下载(参见参考资料Zhao M,Kim P,Mitra R,Zhao J,Zhao Z;Nucleic Acids Res.2016 Jan 4;44(D1):D1023-31;“TSGene 2.0:an updated literature-based knowledgebase for tumorsuppressor genes”)。在本上下文中,肿瘤抑制基因的非限制性实例包括:APC、BRCA1、BRCA2、DPC4、INK4、MADR2、NF1、NF2、p53、PTC、PTEN、Rb、RB1、VHL、WT1、BUB1、bub 1、TGF-βRII、Axin、DPC4、p300、PPARγ、p16、DPC4、PTEN和hSNF5。Tumor suppressor genes (also called tumor suppressor genes) represent the other side of cell growth control, usually acting to inhibit cell proliferation and tumor development. Therefore, tumor suppressor genes are generally genes that inhibit cell division or growth. Loss of TSG function promotes uncontrolled cell division and tumor growth. Rb (a tumor suppressor gene), identified by genetic analysis of retinoblastoma encoding a transcriptional regulatory protein, serves as a prototype for identifying other tumor suppressor genes that lead to the development of a variety of different human cancers. Tumor suppressor genes are described in particular in "Cooper GM. The Cell: A Molecular Approach. 2nd edition. Sunderland (MA): Sinauer Associates; 2000. Tumor Suppressor Genes". Tumor suppressor genes (TSGs) can also be downloaded from the tumor suppressor gene database (TSGene 2.0) (see reference Zhao M, Kim P, Mitra R, Zhao J, Zhao Z; Nucleic Acids Res. 2016 Jan 4; 44(D1): D1023-31; "TSGene 2.0: an updated literature-based knowledgebase for tumor suppressor genes"). In the present context, non-limiting examples of tumor suppressor genes include: APC, BRCA1, BRCA2, DPC4, INK4, MADR2, NF1, NF2, p53, PTC, PTEN, Rb, RB1, VHL, WT1, BUB1, bub 1, TGF-βRII, Axin, DPC4, p300, PPARγ, p16, DPC4, PTEN and hSNF5.
致癌基因、肿瘤抑制基因或“双因子”基因(同时具有致癌和肿瘤抑制功能)可以通过数据库检索和文本挖掘进行系统性鉴定。事实上,关于致癌基因或肿瘤抑制基因的信息通常可以在Ensembl数据库中找到(但还参见Shen L,Shi Q,Wang W.Double agents:geneswith both oncogenic and tumor-suppressor functions.Oncogenesis.2018;7(3):25.Published 2018 Mar 13)。双因子基因可作为在上述两个数据库之间重叠的基因而被鉴定(还参见如上Shen et al.,Oncogenesis 2018)。Oncogenes, tumor suppressor genes or "double-factor" genes (having both oncogenic and tumor suppressor functions) can be systematically identified by database searching and text mining. In fact, information about oncogenes or tumor suppressor genes can usually be found in the Ensembl database (but see also Shen L, Shi Q, Wang W. Double agents: genes with both oncogenic and tumor-suppressor functions. Oncogenesis. 2018; 7(3): 25. Published 2018 Mar 13). Double-factor genes can be identified as genes that overlap between the above two databases (see also Shen et al., Oncogenesis 2018, above).
不受任何理论的约束,本发明人认为选择其中外显子序列来自致癌基因和/或肿瘤抑制基因的融合具有高度相关性,原因如下:Without being bound by any theory, the inventors believe that choosing fusions in which the exon sequences are from oncogenes and/or tumor suppressor genes is highly relevant for the following reasons:
致癌基因中的TE插入可改变其致癌活性。因此,将TE序列插入致癌基因活性域可能导致癌基因的组成性活性,类似于驱动突变。因此,这些融合产生的嵌合致癌基因可能代表一个新的致癌蛋白家族。如果是这种情况,用小分子拮抗剂靶向这些新的“融合致癌基因”的活性可能代表了表达这些嵌合致癌基因的癌症的潜在治疗方法。TE insertions in oncogenes can alter their oncogenic activity. Therefore, insertion of TE sequences into oncogene active domains may result in constitutive activity of oncogenes, similar to driver mutations. Therefore, chimeric oncogenes generated by these fusions may represent a new family of oncoproteins. If this is the case, targeting the activity of these new “fusion oncogenes” with small molecule antagonists may represent a potential therapeutic approach for cancers expressing these chimeric oncogenes.
将TE插入肿瘤抑制剂中可使其抑制功能失活,通常导致功能丧失(例如通过引入终止密码子、ORF变化或破坏性氨基酸延伸),从而导致致癌过程。Insertion of TEs into tumor suppressors can inactivate their suppressive function, often leading to loss of function (e.g., by introduction of stop codons, ORF changes, or disruptive amino acid stretches), thereby contributing to the oncogenic process.
涉及癌症驱动基因的融合将是过继细胞治疗、抗体、ADC、T细胞结合物等的优异靶标。如果融合致癌基因参与肿瘤发生,则预期它们会对癌细胞更具特异性,从而减少了抗性的发展(因为靶标的致癌活性)。Fusions involving cancer driver genes would be excellent targets for adoptive cell therapy, antibodies, ADCs, T-cell binders, etc. If the fusion oncogenes are involved in tumorigenesis, they would be expected to be more specific to cancer cells, thereby reducing the development of resistance (due to the oncogenic activity of the target).
在一些实施方案中,TE序列位于融合转录物序列的5’端(TE序列也称为供体序列),并且外显子序列相对于连接位于融合转录物序列的3’端(外显子序列因此被称为受体序列)。“位于融合转录物序列的5’端”的表述是指该元件位于融合转录物序列的连接上游。“位于融合转录物序列的3’端”的表述是指该元件位于融合转录物序列的连接下游。In some embodiments, the TE sequence is located at the 5' end of the fusion transcript sequence (TE sequence is also called donor sequence), and the exon sequence is located at the 3' end of the fusion transcript sequence relative to the junction (exon sequence is therefore called acceptor sequence). The expression "located at the 5' end of the fusion transcript sequence" means that the element is located upstream of the junction of the fusion transcript sequence. The expression "located at the 3' end of the fusion transcript sequence" means that the element is located downstream of the junction of the fusion transcript sequence.
在更特定的实施方案中,TE序列位于融合转录物序列的5’端,并且外显子序列位于融合转录物序列的3’端,并且编码新抗原肽的所述融合转录物序列的ORF的一部分与连接重叠。在这种情况下,ORF可以是经典的或非经典的。应理解,ORF可包括连接,但新抗原肽序列可能不源自连接。在一些实施方案中,当新抗原肽序列包含源自连接的序列时,所获得的肽因此由TE序列和外显子序列编码。In a more specific embodiment, the TE sequence is located at the 5' end of the fusion transcript sequence, and the exon sequence is located at the 3' end of the fusion transcript sequence, and a portion of the ORF of the fusion transcript sequence encoding the neoantigenic peptide overlaps with the junction. In this case, the ORF can be classical or non-classical. It should be understood that the ORF may include a junction, but the neoantigenic peptide sequence may not be derived from the junction. In some embodiments, when the neoantigenic peptide sequence includes a sequence derived from a junction, the peptide obtained is therefore encoded by the TE sequence and the exon sequence.
“ORF的一部分与TE序列和外显子序列之间的连接正在重叠或重叠”的表述是指所述连接包含在融合转录物序列的ORF的一部分中,其编码所述新抗原肽。The expression "a portion of the ORF is overlapping or overlaps with the junction between the TE sequence and the exon sequence" means that the junction is contained in a portion of the ORF of the fusion transcript sequence, which encodes the neo-antigenic peptide.
在其中(i)编码新抗原肽的ORF的一部分与TE序列和外显子序列之间的连接重叠,和(ii)TE序列和外显子序列分别位于融合转录物序列的5’端和3’端的实施方案中,所述ORF的一部分通常编码至少8个氨基酸的新抗原肽,包括来自TE序列的至少1-6个氨基酸,特别是2-6个,以及来自外显子序列的至少1-6个,特别是2-6个氨基酸。In embodiments where (i) a portion of the ORF encoding the neo-antigenic peptide overlaps with the junction between the TE sequence and the exon sequence, and (ii) the TE sequence and the exon sequence are located at the 5' end and 3' end of the fusion transcript sequence, respectively, the portion of the ORF generally encodes a neo-antigenic peptide of at least 8 amino acids, including at least 1-6 amino acids, particularly 2-6, from the TE sequence, and at least 1-6, particularly 2-6 amino acids, from the exon sequence.
在其中TE序列位于融合转录物序列的5’端,并且外显子序列位于融合转录物序列的3’端的另一个实施方案中,编码所述新抗原肽的ORF的一部分位于连接下游,因此ORF是非经典的。In another embodiment in which the TE sequence is located at the 5' end of the fusion transcript sequence and the exon sequence is located at the 3' end of the fusion transcript sequence, a portion of the ORF encoding the neoantigenic peptide is located downstream of the junction, so that the ORF is non-classical.
“ORF的一部分位于连接下游”的表述是指编码新抗原肽的ORF的一部分不与连接重叠,但包含在相对于连接的所述融合转录物序列的3’端。在本实施方案中,由于相对于连接的3’端部分是外显子序列,因此编码新抗原肽的ORF的一部分包含在外显子序列中。因此,由于ORF的一部分仅位于外显子序列中,所获得的肽在非经典ORF中通过该外显子序列编码。因此,在其中外显子序列位于相对于连接的融合转录物序列的3’端,并且其中编码新抗原肽的ORF的一部分位于具有非经典阅读框的连接的下游的特定实施方案中,融合转录物序列的ORF的一部分编码包括来自TE序列的0个氨基酸和来自外显子序列的至少8个氨基酸的新抗原肽。The expression “a portion of the ORF is located downstream of the connection” means that a portion of the ORF encoding the neo-antigenic peptide does not overlap with the connection, but is included in the 3’ end of the fusion transcript sequence relative to the connection. In the present embodiment, since the 3’ end portion relative to the connection is an exon sequence, a portion of the ORF encoding the neo-antigenic peptide is contained in the exon sequence. Therefore, since a portion of the ORF is located only in the exon sequence, the obtained peptide is encoded by the exon sequence in the non-classical ORF. Therefore, in a specific embodiment in which the exon sequence is located at the 3’ end of the fusion transcript sequence relative to the connection, and in which a portion of the ORF encoding the neo-antigenic peptide is located downstream of the connection having a non-classical reading frame, a portion of the ORF of the fusion transcript sequence encodes a neo-antigenic peptide comprising 0 amino acids from the TE sequence and at least 8 amino acids from the exon sequence.
在另一个实施方案中,TE序列位于融合转录物序列的3’端,并且外显子序列相对于连接位于融合转录物序列的5’端。In another embodiment, the TE sequence is located 3' to the fusion transcript sequence and the exon sequence is located 5' to the fusion transcript sequence relative to the junction.
在一些实施方案中,TE序列位于融合转录物序列的3’端,并且外显子序列位于融合转录物序列的5’端,并且编码新抗原肽的所述融合转录物序列的ORF的一部分与TE序列和外显子序列之间的连接重叠。在这种情况下,ORF也可以是经典的或非经典的。获得的肽由TE序列和外显子序列编码。In some embodiments, the TE sequence is located at the 3' end of the fusion transcript sequence, and the exon sequence is located at the 5' end of the fusion transcript sequence, and a portion of the ORF of the fusion transcript sequence encoding the new antigenic peptide overlaps with the connection between the TE sequence and the exon sequence. In this case, the ORF can also be classical or non-classical. The obtained peptide is encoded by the TE sequence and the exon sequence.
在其中编码新抗原肽的ORF的一部分与外显子序列和TE序列之间的连接重叠,并且其中外显子序列和TE序列分别位于融合转录物序列的5’端和3’端的特定的实施方案中,所述ORF的一部分编码至少8个氨基酸的新抗原肽,包括来自TE序列的至少1至6个、特别是2至6个氨基酸和来自外显子序列的至少1至6个、特别是2至6个氨基酸。In a specific embodiment in which a portion of the ORF encoding the neo-antigenic peptide overlaps with the junction between the exon sequence and the TE sequence, and in which the exon sequence and the TE sequence are located at the 5' end and 3' end of the fusion transcript sequence, respectively, the portion of the ORF encodes a neo-antigenic peptide of at least 8 amino acids, including at least 1 to 6, particularly 2 to 6 amino acids from the TE sequence and at least 1 to 6, particularly 2 to 6 amino acids from the exon sequence.
在另一个实施方案中,TE序列位于融合转录物序列的3’端,外显子序列位于融合转录物序列的5’端,并且编码新抗原肽的ORF的一部分位于外显子序列和TE序列之间连接的下游。任选地,由纯TE序列编码的肽序列是非标准的。In another embodiment, the TE sequence is located at the 3' end of the fusion transcript sequence, the exon sequence is located at the 5' end of the fusion transcript sequence, and a portion of the ORF encoding the new antigenic peptide is located downstream of the connection between the exon sequence and the TE sequence. Optionally, the peptide sequence encoded by the pure TE sequence is non-standard.
在本实施方案中,由于相对于连接的3’端部分为TE序列,因此编码新抗原肽的ORF的一部分由TE序列编码。因此,该ORF的一部分编码的新抗原肽不包括来自外显子序列的任何氨基酸并包括来自TE序列的至少8个氨基酸。在其中TE序列位于相对于连接的融合转录物序列的3’端,并且编码新抗原肽的ORF的一部分位于连接的下游的特定的实施方案中,融合转录物序列的ORF的一部分编码的新抗原肽包括来自外显子序列的0个氨基酸和来自TE序列的至少8个氨基酸。In this embodiment, since the 3' end relative to the connection is divided into a TE sequence, a portion of the ORF encoding the new antigenic peptide is encoded by the TE sequence. Therefore, the new antigenic peptide encoded by the portion of the ORF does not include any amino acids from the exon sequence and includes at least 8 amino acids from the TE sequence. In a specific embodiment in which the TE sequence is located at the 3' end of the fusion transcript sequence relative to the connection, and a portion of the ORF encoding the new antigenic peptide is located downstream of the connection, the new antigenic peptide encoded by the portion of the ORF of the fusion transcript sequence includes 0 amino acids from the exon sequence and at least 8 amino acids from the TE sequence.
肿瘤新抗原肽是由体细胞改变(DNA序列内的经典突变)产生的肽,被识别为与自体不同,由抗原呈递细胞(APC)如树突状细胞(DC)和肿瘤细胞自体呈递。交叉呈递起着重要作用,因为APC能够将外源性抗原从吞噬体转移到胞质溶胶中,以便由蛋白酶体蛋白水解切割成主要组织相容性复合物I(MHC I)表位。Tumor neoantigen peptides are peptides generated by somatic alterations (classical mutations within the DNA sequence) that are recognized as different from self and presented by antigen presenting cells (APCs) such as dendritic cells (DCs) and tumor cells themselves. Cross-presentation plays an important role because APCs are able to transfer exogenous antigens from phagosomes to the cytosol for proteolytic cleavage into major histocompatibility complex I (MHC I) epitopes by the proteasome.
在本公开中,该改变对应于包含可转座元件(TE)序列和外显子序列的融合mRNA序列的转录。这可能是由体细胞(即,特别是在肿瘤克隆中)转座引起的。它也可能不是由从头转座引起的,而是由肿瘤特定的转录去抑制引起的,使得TE和附近的基因被共转录。In the present disclosure, the change corresponds to the transcription of a fusion mRNA sequence comprising a transposable element (TE) sequence and an exon sequence. This may be caused by somatic transposition (i.e., particularly in tumor clones). It may also not be caused by de novo transposition, but by tumor-specific transcriptional derepression, so that TE and nearby genes are co-transcribed.
根据本公开的新抗原肽可能在正常健康样品中完全不存在(即,在正常健康样品中不表达),因此对肿瘤样品具有特异性。或者,其可以在正常细胞中低水平表达,和/或与正常(健康)样品相比,在肿瘤样品中不成比例地表达。The new antigenic peptides according to the present disclosure may be completely absent in normal healthy samples (i.e., not expressed in normal healthy samples), and therefore specific to tumor samples. Alternatively, they may be expressed at low levels in normal cells and/or expressed disproportionately in tumor samples compared to normal (healthy) samples.
它也可以通过由其形成癌症的细胞谱系选择性表达。It may also be selectively expressed by the cell lineages from which cancer develops.
根据本公开的癌症或肿瘤样品可从任何前文定义的组织或器官的任何实体肿瘤或非实体肿瘤中分离,例如乳腺癌、肺癌和/或黑色素瘤。在一些实施方案中,癌症样品来自急性髓细胞性白血病、肾上腺皮质癌、膀胱尿路上皮癌、乳腺导管癌、乳腺小叶癌、子宫颈癌、胆管癌、结直肠腺癌、食道癌、胃腺癌、多形性成胶质细胞瘤、头颈鳞状细胞癌、肝细胞癌、肾嫌色细胞癌、肾透明细胞癌、肾乳头细胞癌、低级神经胶质瘤、肺腺癌、肺鳞状细胞癌、间皮瘤、卵巢浆液性腺癌、胰腺导管腺癌、副神经节瘤和嗜铬细胞瘤、前列腺癌、肉瘤、皮肤黑色素瘤、睾丸生殖细胞癌、胸腺瘤、甲状腺乳头状癌、子宫癌肉瘤、子宫体宫内膜样癌或葡萄膜黑色素瘤样品。在特定实施例中,癌症样品来自肺癌样品,特别是LUAD样品。Cancer or tumor samples according to the present disclosure can be isolated from any solid tumor or non-solid tumor of any tissue or organ defined above, such as breast cancer, lung cancer and/or melanoma. In some embodiments, the cancer sample is from acute myeloid leukemia, adrenocortical carcinoma, bladder urothelial carcinoma, breast ductal carcinoma, breast lobular carcinoma, cervical cancer, bile duct carcinoma, colorectal adenocarcinoma, esophageal cancer, gastric adenocarcinoma, glioblastoma multiforme, head and neck squamous cell carcinoma, hepatocellular carcinoma, renal chromophobe cell carcinoma, renal clear cell carcinoma, renal papillary cell carcinoma, low-grade glioma, lung adenocarcinoma, lung squamous cell carcinoma, mesothelioma, ovarian serous adenocarcinoma, pancreatic ductal adenocarcinoma, paraganglioma and pheochromocytoma, prostate cancer, sarcoma, skin melanoma, testicular germ cell carcinoma, thymoma, thyroid papillary carcinoma, uterine carcinosarcoma, uterine body endometrioid carcinoma or uveal melanoma sample. In a specific embodiment, the cancer sample is from a lung cancer sample, particularly a LUAD sample.
通常,根据本公开,通过将来自癌症样品的mRNA序列与参考基因组进行比对,然后区分正常和异常(基于数据库信息非注释或非经典的)连接,来进行鉴定所述融合转录物序列的步骤。Generally, according to the present disclosure, the step of identifying the fusion transcript sequence is performed by aligning the mRNA sequence from the cancer sample with a reference genome and then distinguishing between normal and abnormal (non-annotated or non-canonical based on database information) junctions.
根据本公开,正常连接通常对应于其中供体和受体在同一条链上且不相距太远的连接(例如,不在不同的染色体上)。According to the present disclosure, normal ligation generally corresponds to ligation in which the donor and acceptor are on the same strand and not too far apart (eg, not on different chromosomes).
根据本公开,异常连接通常对应于不同染色体上的供体序列和受体序列之间的连接,或顺式(相同的染色体)但不同链上的连接(无论顺序和5’-3’义)。According to the present disclosure, aberrant junctions typically correspond to junctions between donor and acceptor sequences on different chromosomes, or junctions in cis (same chromosome) but on different strands (regardless of order and 5'-3' sense).
根据本公开,通常可使用的mRNA序列是RNA seq数据(如本文结果所示)。RNA seq数据通常从由细胞或组织样品中获得的纯化的RNA中获得,片段化并反向转录成cDNA。然后在高通量平台(例如Illumina GA/HiSeq-参见http://www.illumina.com、SOLiD或Roche454)上对获得的cDNA进行扩增和序列测定(下一代序列测定-NGS)。这一过程会从cDNA片段的一端产生数百万个短读长。这一过程的一个常见变体是从每个cDNA片段的两端产生读长,称为“双端”读长。According to the present disclosure, the mRNA sequence that can be generally used is RNA seq data (as shown in the results herein). RNA seq data is generally obtained from purified RNA obtained from a cell or tissue sample, fragmented and reversely transcribed into cDNA. The obtained cDNA is then amplified and sequenced (next generation sequencing-NGS) on a high-throughput platform (e.g., Illumina GA/HiSeq-see http://www.illumina.com, SOLiD or Roche454). This process generates millions of short reads from one end of the cDNA fragment. A common variant of this process is to generate reads from both ends of each cDNA fragment, referred to as "double-ended" reads.
在一些实施方案中,可以使用适应性软件,例如:剪接的转录物与参考的比对(即STA-参见RDobin,Alexander et al.“STAR:ultrafast universal RNA-seq aligner.”Bioinformatics(Oxford,England)vol.29,1(2013):15-21),TopHat2(Kim,Daehwan etal.“TopHat2:accurate alignment of transcriptomes in the presence ofinsertions,deletions and gene fusions.”Genome biology vol.14,4 R36.25Apr.2013,doi:10.1186/gb-2013-14-4-r36)或HISAT(Kim,Daehwan et al.“HISAT:a fastspliced aligner with low memory requirements.”Nature methods vol.12,4(2015):357-60.doi:10.1038/nmeth.3317)将mRNA序列针对相应的参考基因组(例如人参考基因组Hg19 ENSEMBL(RNA序列,GRCh37))进行比对。STAR是独立的软件,使用顺序最大可比对种子搜索,然后进行种子聚类和拼接,以对准RNA序列读长。它可以检测经典连接、非经典剪接和融合/嵌合转录物。通常,可以根据分别从ENSEMBL和RepeatMasker数据库(从UCSC基因组浏览器下载)中获得的定义,按照结果中的详细说明进行连接检测。因此,在一些实施例中,使用专用数据库(例如Ensembl和Repeatmasker数据库)以计算机模拟确定正常和异常连接,并且以计算机模拟提取TE和外显子序列之间具有连接的融合转录物。In some embodiments, adaptive software can be used, such as: alignment of spliced transcripts to a reference (i.e., STA-see RDobin, Alexander et al. "STAR: ultrafast universal RNA-seq aligner." Bioinformatics (Oxford, England) vol. 29, 1 (2013): 15-21), TopHat2 (Kim, Daehwan et al. "TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions." Genome biology vol. 14, 4 R36. 25Apr. 2013, doi: 10.1186/gb-2013-14-4-r36) or HISAT (Kim, Daehwan et al. "HISAT: a fast spliced aligner with low memory requirements." Nature methods vol.12, 4(2015):357-60.doi:10.1038/nmeth.3317) aligns the mRNA sequence against the corresponding reference genome (e.g., human reference genome Hg19 ENSEMBL (RNA sequence, GRCh37)). STAR is an independent software that uses a sequential maximum alignable seed search, followed by seed clustering and splicing to align RNA sequence reads. It can detect classical connections, non-classical splicing, and fusion/chimeric transcripts. Generally, connection detection can be performed according to the detailed instructions in the results, based on the definitions obtained from the ENSEMBL and RepeatMasker databases (downloaded from the UCSC genome browser), respectively. Therefore, in some embodiments, a dedicated database (e.g., Ensembl and Repeatmasker databases) is used to determine normal and abnormal connections by computer simulation, and fusion transcripts with connections between TE and exon sequences are extracted by computer simulation.
更具体而言,在一些实施方案中,使用经典STAR双程模式27将目标样品(或细胞)的RNAseq读长与参考基因组(例如经典的hg19基因组)比对,以鉴定未注释的连接。如前所述,JET被鉴定为外显子(尤其是编码DNA序列-CD-外显子)和TE(或重复元件RE)之间的连接。根据本公开,TE(或RE)可根据本领域常用数据库的定义(例如ENSEMBL(GRCh37)和RepeatMasker)进行鉴定(即过滤)。More specifically, in some embodiments, the RNAseq reads of the target sample (or cell) are aligned to a reference genome (e.g., a classic hg19 genome) using the classic STAR two-pass mode 27 to identify unannotated connections. As previously described, JETs are identified as connections between exons (especially coding DNA sequences-CD-exons) and TEs (or repeat elements REs). According to the present disclosure, TEs (or REs) can be identified (i.e., filtered) according to definitions of commonly used databases in the art (e.g., ENSEMBL (GRCh37) and RepeatMasker).
根据本发明,mRNA序列可来自所有类型的癌细胞或肿瘤细胞样品。肿瘤可以是实体或非实体肿瘤。具体而言,mRNA基因序列来自任何受先前定义的癌症或肿瘤影响的组织或器官,例如乳腺癌、肺癌和/或黑色素瘤。在一个特定的实施方案中,mRNA序列来自LUAD样品。肿瘤样品可从例如癌症基因组图谱(TCGA)中获得。在一些实施方案中,所述mRNA序列是从细胞系获得的,例如来自癌细胞系百科(CCLE)的肿瘤细胞系。According to the present invention, mRNA sequences can be from all types of cancer cells or tumor cell samples. Tumors can be solid or non-solid tumors. Specifically, mRNA gene sequences are from any tissue or organ affected by previously defined cancer or tumors, such as breast cancer, lung cancer and/or melanoma. In a specific embodiment, mRNA sequences are from LUAD samples. Tumor samples can be obtained from, for example, the Cancer Genome Atlas (TCGA). In some embodiments, the mRNA sequence is obtained from a cell line, such as a tumor cell line from the Cancer Cell Line Encyclopedia (CCLE).
在一些实施方案中,剪接读长的数量可以通过独特比对读长的数量来标准化。通常选择表达水平超过2.10-7的JET。In some embodiments, the number of spliced reads can be normalized by the number of uniquely aligned reads.JETs with expression levels exceeding 2.10-7 are typically selected.
在本公开的一些实施方案中,融合转录物序列共享超过1%;值得注意的是,超过5%、超过10%、超过15%、超过20%甚至超过25%的癌症样品(通常从各种患者处获得,例如针对TCGA中给定癌症类型收集的癌症样品)和/或细胞系。在一些实施方案中,根据本公开的融合转录物序列在癌症样品中共享超过1%;尤其是超过2%、超过5%、超过10%、超过15%、超过20%甚至超过25%的癌症患者。因此,融合转录物序列可对几种癌症之间共有的一种癌症类型具有特异性。In some embodiments of the present disclosure, the fusion transcript sequence is shared by more than 1%; notably, more than 5%, more than 10%, more than 15%, more than 20% or even more than 25% of cancer samples (typically obtained from various patients, such as cancer samples collected for a given cancer type in TCGA) and/or cell lines. In some embodiments, the fusion transcript sequence according to the present disclosure is shared in cancer samples by more than 1%; in particular, more than 2%, more than 5%, more than 10%, more than 15%, more than 20% or even more than 25% of cancer patients. Therefore, the fusion transcript sequence can be specific to a cancer type shared between several cancers.
根据本公开,与正常健康细胞相比,融合转录物序列在肿瘤细胞中以较高水平表达。在一些实施方案中,融合转录物序列在癌细胞(从一种或多种癌症样品或一种或多种细胞系获得)中表达,而不在健康细胞(来自一种或多种组织样品或一种或多种细胞系)中表达,特别是不在胸腺健康细胞中表达。在一些实施方案中,当JET的表达水平低于2.10-7,特别是低于2.10-8且通常不可检测时,JET被认为不在细胞中表达。根据本公开,这种融合转录物可被称为肿瘤特异性融合。根据本发明,与如前所定义的正常细胞相比在肿瘤细胞中以较高水平表达的融合转录物,通常与正常细胞相比在癌细胞中不成比例表达的融合转录物可被称为肿瘤相关融合转录物(TAF)。根据本申请,如果肿瘤相关融合转录物存在于超过1%,特别是超过2%,超过5%,特别是超过10%的肿瘤样品中(来自相同或不同的肿瘤类型,特别是从TCGA数据库获得的,优选针对相同癌症类型),并且存在于低于20%的正常样品中,则可以选择肿瘤相关融合转录物。替代地或额外地,融合转录物序列可在至少1;2;3;4;5;6;7;8;9;10;11;12;13;14;15;16;17;18;19;20个细胞系表达。According to the present disclosure, compared with normal healthy cells, the fusion transcript sequence is expressed at a higher level in tumor cells. In some embodiments, the fusion transcript sequence is expressed in cancer cells (obtained from one or more cancer samples or one or more cell lines), but not in healthy cells (from one or more tissue samples or one or more cell lines), especially not in thymic healthy cells. In some embodiments, when the expression level of JET is lower than 2.10-7 , especially lower than 2.10-8 and usually undetectable, JET is considered not to be expressed in cells. According to the present disclosure, such fusion transcripts may be referred to as tumor-specific fusions. According to the present invention, fusion transcripts expressed at higher levels in tumor cells compared with normal cells as defined above, fusion transcripts that are usually expressed disproportionately in cancer cells compared with normal cells may be referred to as tumor-associated fusion transcripts (TAF). According to the present application, if the tumor-associated fusion transcript is present in more than 1%, especially more than 2%, more than 5%, especially more than 10% of the tumor samples (from the same or different tumor types, especially obtained from the TCGA database, preferably for the same cancer type), and is present in less than 20% of the normal samples, then the tumor-associated fusion transcript may be selected. Alternatively or additionally, the fusion transcript sequence may be expressed in at least 1; 2; 3; 4; 5; 6; 7; 8; 9; 10; 11; 12; 13; 14; 15; 16; 17; 18; 19; 20 cell lines.
在一些实施方案中,该方法进一步包括任选地通过计算机模拟或使用体外技术(阐述特别参见实施例)测定肿瘤新抗原肽与所述癌症受试者的至少一种MHC分子的结合亲和力的步骤。In some embodiments, the method further comprises the step of determining the binding affinity of the tumor neoantigenic peptide to at least one MHC molecule of the cancer subject, optionally by computer simulation or using in vitro techniques (see the Examples for illustration).
当对人样品进行该方法时,该方法可包括测定患者的I类或I类主要组织相容性复合物(MHC,又称人白细胞抗原(HLA)等位基因)的步骤。需要注意的是,由于实验室小鼠的MHC等位基因是众所周知的,因此在该特定情况下可能不需要该步骤。在本申请中,“MHC分子”指至少一种MHC I类分子或至少一种MHC II类分子。When the method is performed on a human sample, the method may include a step of determining the class I or class II major histocompatibility complex (MHC, also known as human leukocyte antigen (HLA) alleles) of the patient. It should be noted that since the MHC alleles of laboratory mice are well known, this step may not be required in this particular case. In this application, "MHC molecule" refers to at least one MHC class I molecule or at least one MHC class II molecule.
通过分析已知的MHC I和MHC II的序列并确定每个结构域的等位基因变异,建立了MHC等位基因数据库。这通常可以通过使用本领域众所周知的适当软件算法在计算机模拟中确定。现已开发了多种工具,用于从全基因组测序数据(全外显子组、全基因组和RNA测序数据)中获取HLA等位基因信息,包括OptiType、Polysolver、PHLAT、HLAreporter、HLAforest、HLAminer和seq2HLA(Kiyotani K et al.,Immunopharmacogenomics towardspersonalized cancer immunotherapy targeting neoantigens;Cancer Science 2018;109:542-549)。例如,seq2hla工具(参见Boegel S,Lower M,Schafer M,et al.HLA typingfrom RNA-Seq sequence reads.Genome Med.2012;4:102)被很好地设计为进行本文所公开的方法,其是一种以python和R编写的计算机模拟方法,该方法以fastq格式将标准RNA-Seq序列读长作为输入,使用包含所有HLA等位基因的bowtie索引(Langmead B,et al.,Ultrafast and memory-efficient alignment of short DNA sequences to the humangenome.Genome Biol.2009,10:R25-10.1186/gb-2009-10-3-r25),并输出最可能的HLA I类和II类基因型(4位分辨率)、每次调用的p值以及每一类的表达。An MHC allele database was established by analyzing the known sequences of MHC I and MHC II and determining the allelic variation of each domain. This can usually be determined in silico using appropriate software algorithms well known in the art. A variety of tools have been developed to obtain HLA allele information from whole genome sequencing data (whole exome, whole genome, and RNA sequencing data), including OptiType, Polysolver, PHLAT, HLAreporter, HLAforest, HLAminer, and seq2HLA (Kiyotani K et al., Immunopharmacogenomics towards personalized cancer immunotherapy targeting neoantigens; Cancer Science 2018; 109: 542-549). For example, the seq2hla tool (see Boegel S, Lower M, Schafer M, et al. HLA typingfrom RNA-Seq sequence reads. Genome Med. 2012; 4: 102) is well designed to perform the method disclosed in this article. It is a computer simulation method written in python and R, which takes standard RNA-Seq sequence reads in fastq format as input, uses a bowtie index containing all HLA alleles (Langmead B, et al., Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009, 10: R25-10.1186/gb-2009-10-3-r25), and outputs the most likely HLA class I and class II genotypes (4-bit resolution), p-values for each call, and expression of each class.
通常,TE和外显子序列之间具有连接的序列通过计算机模拟提取。由每个序列编码的所有可能的肽对来自患者(或小鼠)的每个MHC等位基因的亲和力可以例如使用计算方法在计算机模拟确定,以预测肽对HLA分子的结合亲和力。事实上,准确的预测方法是基于人工神经网络和预测的IC50。例如,经NetMHC修饰后以预测与未报道的配体的等位基因结合的肽的NetMHCpan软件非常适合实施本文所公开的方法((Lundegaard C et al.,NetMHC-3.0:accurate web accessible predictions of human,mouse and monkey MHC class Iaffinities for peptides of length 8-11;Nucleic Acids Res.2008;36:W509-W512;Nielsen M et al.NetMHCpan,a method for quantitative predictions of peptidebinding to any HLA-A and-B locus protein of known sequence.PLoS One.2007;2:e796,but see also Kiyotani K et al.,Immunopharmacogenomics towardspersonalized cancerimmunotherapy targeting neoantigens;Cancer Science 2018;109:542-549 and Yarchoan M et al.,Nat rev.cancer 2017;17(4):209-222)。NetMHCpan软件使用人工神经网络(ANN)预测肽与任何已知序列的MHC分子的结合。该方法基于超过180,000个定量结合数据和MS衍生的MHC洗脱配体的组合进行训练。结合亲和力数据涵盖了172个来自人(HLA-A、B、C、E)、小鼠(H-2)、牛(BoLA)、灵长类动物(Patr、Mamu、Gogo)和猪(SLA)的MHC分子。MS洗脱的配体数据涵盖55个HLA和小鼠等位基因。Typically, sequences with connections between TE and exon sequences are extracted by computer simulation. The affinity of all possible peptides encoded by each sequence to each MHC allele from a patient (or mouse) can be determined in silico, for example, using computational methods to predict the binding affinity of the peptide to HLA molecules. In fact, accurate prediction methods are based on artificial neural networks and predicted IC50 . For example, the NetMHCpan software, which has been modified to predict peptides that bind to alleles of unreported ligands, is well suited for implementing the methods disclosed herein ((Lundegaard C et al., NetMHC-3.0: accurate web accessible predictions of human, mouse and monkey MHC class I affinities for peptides of length 8-11; Nucleic Acids Res. 2008; 36: W509-W512; Nielsen M et al. NetMHCpan, a method for quantitative predictions of peptide binding to any HLA-A and -B locus protein of known sequence. PLoS One. 2007; 2: e796, but see also Kiyotani K et al., Immunopharmacogenomics towards personalized cancer immunotherapy targeting neoantigens; Cancer Science 2018; 109: 542-549 and Yarchoan M et al., Nat rev. cancer 2017;17(4):209-222). NetMHCpan software uses an artificial neural network (ANN) to predict peptide binding to any MHC molecule of known sequence. The method is trained on a combination of over 180,000 quantitative binding data and MS-derived MHC-eluted ligands. Binding affinity data cover 172 MHC molecules from human (HLA-A, B, C, E), mouse (H-2), bovine (BoLA), primate (Patr, Mamu, Gogo), and porcine (SLA). MS-eluted ligand data cover 55 HLA and mouse alleles.
在示例性实施方案中,如上所述通过融合转录物所编码并且针对MHC等位基因的Kd亲和力小于10-4、10-5、10-6、10-7M或小于500nM,特别是小于50nM的新抗原肽被选择作为肿瘤新抗原肽。In an exemplary embodiment, a neoantigenic peptide encoded by the fusion transcript as described above and having a Kd affinity for an MHC allele of less than 10 −4 , 10 −5 , 10 −6 , 10 −7 M or less than 500 nM, particularly less than 50 nM, is selected as a tumor neoantigenic peptide.
如上所述,可以使用适当的软件(例如netMHCpan)通过计算机模拟测定所选肽对MHC等位基因的亲和力。因此,在一些实施方案中,新抗原肽以低于NetMHCpan 4.0预测的2%百分位数轶得分的结合亲和力结合MHC I类。在其他实施方案中,新抗原肽以低于NetMHCpanII 3.2预测的10%百分位数轶得分的结合亲和力结合II类。As described above, the affinity of the selected peptide to the MHC allele can be determined by computer simulation using appropriate software (e.g., netMHCpan). Thus, in some embodiments, the neoantigenic peptide binds to MHC class I with a binding affinity that is lower than the 2% percentile outlier score predicted by NetMHCpan 4.0. In other embodiments, the neoantigenic peptide binds to class II with a binding affinity that is lower than the 10% percentile outlier score predicted by NetMHCpanII 3.2.
亲和力也可以(替代地或额外地)在体外估算,例如通过使用结果中所包含的所述的MHC四聚体形成分析(参见实施例2,第2.1点和第2.2.2点)。技术人员通常可以使用来自的商用测定(根据其培训指南,特别使用来自的试剂盒)。通常,结合亲和力以与阳性对照结合的百分比确定。一般来说,选择结合百分比至少为阳性对照的30%,特别是至少40%,甚至至少50%的肽。通常,根据本公开,通常可根据本方法获得的新抗原肽以足以使该肽作为抗原呈递在细胞表面的亲和力结合至少一种HLA/MHC分子。一般来说,新抗原肽对至少一种HLA/MHC分子(通常为所述癌症患者的分子)具有小于10-4、或10-5、或10-6、或10-7或小于500nM、至少小于250nM、至少小于200nM、至少小于150nM、至少小于100nM、至少小于50nM或更小的IC50亲和力(较低的数字表示更强的结合亲和力)。Affinity can also (alternatively or additionally) be estimated in vitro, for example by using the MHC tetramer formation assay described in the results (see Example 2, points 2.1 and 2.2.2). The skilled person can usually use the data from commercial assays (according to their training guide, specifically using of Kit). Typically, the binding affinity is determined as a percentage of binding to a positive control. In general, a peptide is selected whose binding percentage is at least 30%, particularly at least 40%, or even at least 50% of that of the positive control. Typically, according to the present disclosure, the neoantigenic peptides that can be obtained according to the present method typically bind to at least one HLA/MHC molecule with an affinity sufficient to present the peptide as an antigen on the cell surface. Typically, the neoantigenic peptides have an IC50 affinity of less than 10-4 , or 10-5 , or 10-6 , or 10-7 , or less than 500nM, at least less than 250nM, at least less than 200nM, at least less than 150nM, at least less than 100nM, at least less than 50nM or less for at least one HLA/MHC molecule (typically a molecule of the cancer patient) (lower numbers indicate stronger binding affinity).
因此,根据本方法的进一步可选步骤可以独立地包括:Therefore, further optional steps according to the present method may independently include:
-排除在健康细胞上高水平或高频率表达的融合转录物或预测肽的步骤。融合转录物序列针对健康细胞的RNAseq数据的比对通常允许确定健康细胞中存在的融合转录物序列的相对数量;在一个实施方案中,在健康细胞上表达的融合转录物或预测的肽被舍弃。- A step of excluding fusion transcripts or predicted peptides that are expressed at high levels or frequencies on healthy cells. Alignment of fusion transcript sequences to RNAseq data of healthy cells generally allows the relative quantity of fusion transcript sequences present in healthy cells to be determined; in one embodiment, fusion transcripts or predicted peptides expressed on healthy cells are discarded.
-确认肿瘤新抗原肽未在受试者的健康细胞中表达的步骤。该步骤通常可使用基础的局部比对搜索工具(BLAST)进行,并针对健康细胞的蛋白质组进行新抗原肽序列的比对;优选地,针对正常健康细胞的蛋白质组比对的肽(例如使用BLAST)被舍弃。- A step of confirming that the tumor neoantigen peptide is not expressed in the healthy cells of the subject. This step can usually be performed using a basic local alignment search tool (BLAST) and comparing the neoantigen peptide sequence against the proteome of healthy cells; preferably, peptides compared against the proteome of normal healthy cells (e.g., using BLAST) are discarded.
-确认融合转录物或预测肽在受试者的癌细胞中表达的步骤。通常可以通过从癌细胞样品中提取的mRNA中的RT-PCR来检查所选融合转录物序列在癌细胞中的存在。- A step of confirming that the fusion transcript or predicted peptide is expressed in a cancer cell of the subject. The presence of the selected fusion transcript sequence in cancer cells can usually be examined by RT-PCR in mRNA extracted from a cancer cell sample.
在一些实施方案中,本方法还可包括对鉴定的融合(JET)转录物进行计算机模拟翻译以生成JET衍生蛋白数据库(JET-db)的步骤。通常,作为供体的含有基因的链索引JET可以使用经典ORF翻译,从涉及基因直至断点后的第一个终止密码子。在TE作为供体的JET中,3个可能的ORF可以被翻译,并且通常仅保留更接近断点和两个终止密码子之间的序列。该JET db(通常也结合到人蛋白质组)随后可对从肿瘤样品和/或肿瘤细胞系获得的基于质谱的蛋白质组数据集问询,该数据集通常包含从肿瘤样品和/或肿瘤细胞系获得的蛋白质组数据。在一些实施方案中,可以使用公共质谱数据集。该实施方案在本申请中提供的结果中得到很好的描述。这种分析也用于鉴定JET衍生肽或蛋白。In some embodiments, the method may also include a step of performing computer simulation translation of the identified fusion (JET) transcript to generate a JET-derived protein database (JET-db). Typically, the chain index JET containing genes as donors can use classical ORF translation, from the first stop codon after the involved gene until the breakpoint. In JET with TE as donor, 3 possible ORFs can be translated, and usually only the sequence closer to the breakpoint and between the two stop codons is retained. The JET db (usually also combined with the human proteome) can then query the mass spectrometry-based proteome data set obtained from tumor samples and/or tumor cell lines, which typically contains proteome data obtained from tumor samples and/or tumor cell lines. In some embodiments, a public mass spectrometry data set can be used. This embodiment is well described in the results provided in the present application. This analysis is also used to identify JET-derived peptides or proteins.
在更具体的实施方案中,JETdb(通常结合到人蛋白质组)可对基于免疫肽组学的质谱数据集问询,如本文所包括的实施例中所详述。该实施方案允许鉴定呈递给MHC分子的JET衍生肽或蛋白(pJET)。In a more specific embodiment, JETdb (typically bound to the human proteome) can be interrogated against immunopeptidomics-based mass spectrometry datasets, as detailed in the Examples included herein. This embodiment allows identification of JET-derived peptides or proteins (pJET) presented to MHC molecules.
为确保JET衍生肽与正常样品中发现的经典蛋白或JET衍生肽不匹配,可使用UniProt/TrEMBL数据库和/或来自正常(包括例如近肿瘤)样品或细胞(例如来自TCGA和/或CCLE等公共数据库)的计算机模拟翻译的JET过滤已鉴定的肽。To ensure that JET-derived peptides do not match classical proteins or JET-derived peptides found in normal samples, the identified peptides can be filtered using the UniProt/TrEMBL database and/or in silico translations of JET from normal (including, e.g., near-tumor) samples or cells (e.g., from public databases such as TCGA and/or CCLE).
新抗原肽New antigen peptide
本公开还涉及一种分离的肿瘤新抗原肽,其包含至少8、9、10、11或12个氨基酸,由来自融合转录物的开放阅读框(ORF)的一部分编码,该融合转录物是包含可转座元件(TE)序列和外显子序列的人mRNA序列。该肽的长度可以是8-9、8-10、8-11、12-25、13-25、12-20或13-20个氨基酸。尽管ORF与TE序列和外显子序列之间的连接重叠,但可以理解,肿瘤新抗原肽本身可能不包含该连接。The present disclosure also relates to an isolated tumor neoantigen peptide comprising at least 8, 9, 10, 11 or 12 amino acids, encoded by a portion of an open reading frame (ORF) from a fusion transcript, which is a human mRNA sequence comprising a transposable element (TE) sequence and an exon sequence. The length of the peptide can be 8-9, 8-10, 8-11, 12-25, 13-25, 12-20 or 13-20 amino acids. Although the connection between the ORF and the TE sequence and the exon sequence overlaps, it is understood that the tumor neoantigen peptide itself may not contain the connection.
更具体地,本公开还涵盖由来自癌细胞的人融合mRNA序列的一部分编码的分离的肿瘤新抗原肽,所述融合mRNA包含TE序列和外显子序列。More specifically, the present disclosure also encompasses an isolated tumor neoantigenic peptide encoded by a portion of a human fusion mRNA sequence from a cancer cell, the fusion mRNA comprising a TE sequence and an exon sequence.
该肽的长度可为8-9、8-10、8-11、12-25、13-25、12-20或13-20个氨基酸,并满足上述一种或多种新抗原肽特征。至少8个氨基酸的肽的N端可由起始于1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18或更高的任何一个核苷酸位置的三联密码子编码(应理解,本公开预期起始位置为1至8000的任何整数,而无需列出1至8000的每一个数字)。The length of the peptide may be 8-9, 8-10, 8-11, 12-25, 13-25, 12-20 or 13-20 amino acids and meet one or more of the above-mentioned new antigenic peptide characteristics. The N-terminus of the peptide of at least 8 amino acids may be encoded by a triplet codon starting at any nucleotide position of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18 or higher (it should be understood that the present disclosure contemplates any integer from 1 to 8000 starting position without listing every number from 1 to 8000).
上述定义的肽通常可根据本公开的方法获得,因此包括一种或多种前述特征。具体地,根据本公开的新抗原肽可表现出以下进一步特征中的一种或组合:The peptides defined above are generally obtainable according to the methods of the present disclosure and thus include one or more of the aforementioned characteristics. Specifically, the new antigenic peptides according to the present disclosure may exhibit one or a combination of the following further characteristics:
-其结合或特异性结合受试者的MHC I类,且为8至11个氨基酸,特别是8、9、10或11个氨基酸。通常地,新抗原肽的长度为8或9个氨基酸,并结合受试者的至少一个MHC I类分子;或者,其结合所述受试者的至少一种MHC II类分子,并含有12至25个氨基酸,特别是12、13、14、15、16、17、18、19、20、21、22、23、24或25个氨基酸。- It binds or specifically binds to the subject's MHC class I and is 8 to 11 amino acids, in particular 8, 9, 10 or 11 amino acids. Typically, the neoantigenic peptide is 8 or 9 amino acids in length and binds to at least one MHC class I molecule of the subject; or, it binds to at least one MHC class II molecule of the subject and contains 12 to 25 amino acids, in particular 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 or 25 amino acids.
-其以足以使肽作为抗原呈递在细胞表面的亲和力结合所述癌症患者的至少一种HLA/MHC分子。通常,新抗原肽具有小于10-4、或10-5、或10-6、或10-7或小于500nM、至少小于250nM、至少小于200nM、至少小于150nM、至少小于100nM、至少小于50nM或更小的IC50(较低的数字表示更强的结合亲和力)。- It binds to at least one HLA/MHC molecule of the cancer patient with an affinity sufficient to present the peptide as an antigen on the cell surface. Typically, the neoantigenic peptide has an IC50 of less than 10-4 , or 10-5 , or 10-6 , or 10-7 , or less than 500 nM, at least less than 250 nM, at least less than 200 nM, at least less than 150 nM, at least less than 100 nM, at least less than 50 nM or less (lower numbers indicate stronger binding affinity).
-当向受试者施用时,其不会诱导显著的自体免疫应答和/或引起免疫耐受性。- When administered to a subject, it does not induce a significant autoimmune response and/or induce immune tolerance.
-其在肿瘤样品中的表达水平高于正常健康样品。通常地,根据本公开,如果融合转录物存在于超过1%,特别是超过2%,超过5%,或超过10%的肿瘤样品(来自相同或不同的肿瘤类型,通常来自一个或多个受试者,通常来自TCGA肿瘤样品)和低于20%的正常样品中,则可以选择融合转录物。替代地或额外地,转录物可在一种或多种(至少2、5、10、20、50、100个细胞系,例如来自CCLE的细胞系)中鉴定。在一些实施方案中,新抗原更具体地为肿瘤特异性抗原(TSA),即:其仅在癌症样品中表达,而不在正常样品中表达,或在正常样品中以相对较低的水平表达(例如,表达的mRNA序列代表来自正常样品的正常细胞中的次要种型)。-Its expression level in tumor samples is higher than that in normal healthy samples. Generally, according to the present disclosure, a fusion transcript can be selected if it is present in more than 1%, particularly more than 2%, more than 5%, or more than 10% of tumor samples (from the same or different tumor types, typically from one or more subjects, typically from TCGA tumor samples) and less than 20% of normal samples. Alternatively or additionally, the transcript can be identified in one or more (at least 2, 5, 10, 20, 50, 100 cell lines, such as cell lines from CCLE). In some embodiments, the new antigen is more specifically a tumor-specific antigen (TSA), that is, it is only expressed in cancer samples, not in normal samples, or is expressed at a relatively low level in normal samples (for example, the expressed mRNA sequence represents a minor type in normal cells from normal samples).
-其包含TE序列和外显子序列之间的连接,换句话说,其由TE序列的一部分和外显子序列的一部分编码,ORF为经典或非经典的,或- it contains a junction between a TE sequence and an exon sequence, in other words, it is encoded by a portion of a TE sequence and a portion of an exon sequence, the ORF being canonical or non-canonical, or
-其由外显子序列的非经典ORF编码,或- it is encoded by a non-canonical ORF in an exonic sequence, or
-其由TE序列编码,任选地在非经典ORF中。- which is encoded by a TE sequence, optionally in a non-classical ORF.
肿瘤新抗原肽可首先通过对受试者肿瘤细胞中融合转录物序列的RT转录分析进行验证。同样,根据本发明的肿瘤新抗原肽的免疫通常会引起T细胞应答。The tumor neoantigen peptide can first be verified by RT transcription analysis of the fusion transcript sequence in the subject's tumor cells. Likewise, immunization with the tumor neoantigen peptide according to the present invention will generally induce a T cell response.
在特定实施方案中,本公开涵盖NSCLC新抗原肽,其包含SEQ ID NO:1-117中任何一个的至少8个氨基酸。通常,SEQ ID NO:1-117的所述新抗原肽以足以使肽作为抗原呈递在细胞表面的亲和力结合HLA-A02。对MHC等位基因的亲和力可通过本领域已知技术测定,特别是如上所述的计算机模拟或体外技术;In certain embodiments, the present disclosure encompasses NSCLC neoantigenic peptides comprising at least 8 amino acids of any one of SEQ ID NOs: 1-117. Typically, the neoantigenic peptides of SEQ ID NOs: 1-117 bind to HLA-A02 with an affinity sufficient to present the peptide as an antigen on the cell surface. Affinity for MHC alleles can be determined by techniques known in the art, particularly computer simulations or in vitro techniques as described above;
在特定实施方案中,根据本公开的肿瘤新抗原肽结合至少1%、5%、10%、15%、20%、25%或更多受试者中存在的MHC分子。值得注意的是,在患有癌症的受试者群体中,至少有1%、5%、10%、15%、20%、25%的受试者表达本文公开的肿瘤新抗原肽。In certain embodiments, the tumor neoantigen peptides according to the present disclosure bind to at least 1%, 5%, 10%, 15%, 20%, 25% or more of the MHC molecules present in a subject. Of note, in a population of subjects with cancer, at least 1%, 5%, 10%, 15%, 20%, 25% of the subjects express the tumor neoantigen peptides disclosed herein.
更具体而言,本公开的肿瘤新抗原肽能够引发针对癌症患者群体中至少1%、5%、10%、15%、20%或25%的受试者中存在的肿瘤的免疫应答。More specifically, the tumor neoantigen peptides disclosed herein are capable of eliciting an immune response against tumors present in at least 1%, 5%, 10%, 15%, 20% or 25% of the subjects in a cancer patient population.
如前所定义,癌症可能影响以下任何一种组织或器官:乳房;肝脏;肾;心脏、纵隔、胸膜;口底;唇、唾液腺;舌头;牙龈;口腔;腭;扁桃体;喉;气管;支气管、肺;咽、下咽、口咽、鼻咽;食道;消化器官,如胃、肝内胆管、胆道、胰腺、小肠、结肠;直肠;膀胱、胆囊、输尿管等泌尿器官;直肠乙状结肠结;肛门、肛管;皮肤;骨头;关节、肢体关节软骨;眼睛和附件;脑;周围神经、自主神经系统;脊髓、脑神经、脑膜;和中枢神经系统的各个部分;结缔组织、皮下组织和其他软组织;腹膜后,腹膜;肾上腺;甲状腺;内分泌腺和相关结构;女性生殖器官,如卵巢、子宫、子宫颈;子宫体、阴道、外阴;男性生殖器官,如阴茎、精巢和前列腺;造血系统和网状内皮系统;血液;淋巴结;胸腺。例如,根据本申请的肿瘤或癌症包括白血病、精原细胞瘤、黑色素瘤、畸胎瘤、淋巴瘤、神经母细胞瘤、神经胶质瘤、直肠癌、子宫内膜癌、肾癌、肾上腺癌、甲状腺癌、血癌、皮肤癌、脑癌、宫颈癌、肠癌、肝癌、结肠癌、胃癌、肠癌、头颈癌、胃肠癌、淋巴结癌、食道癌、结直肠癌、胰腺癌、耳、鼻、喉(ENT)癌、乳腺癌、前列腺癌、子宫癌、卵巢癌和其实例为肺癌、乳腺癌、前列腺癌、结肠癌、肾细胞癌、子宫癌或上述癌症类型或肿瘤的转移。根据本公开的术语“癌症”还包括癌症转移和癌症复发。As defined above, cancer may affect any of the following tissues or organs: breast; liver; kidney; heart, mediastinum, pleura; floor of mouth; lips, salivary glands; tongue; gums; oral cavity; palate; tonsils; larynx; trachea; bronchi, lungs; pharynx, hypopharynx, oropharynx, nasopharynx; esophagus; digestive organs such as stomach, intrahepatic bile ducts, bile ducts, pancreas, small intestine, colon; rectum; urinary organs such as bladder, gallbladder, ureters; rectosigmoid colon; anus, anal canal; skin; bones; joints, limbs Articular cartilage of the body; eyes and appendages; brain; peripheral nerves, autonomic nervous system; spinal cord, cranial nerves, meninges; and various parts of the central nervous system; connective tissue, subcutaneous tissue, and other soft tissues; retroperitoneum, peritoneum; adrenal glands; thyroid gland; endocrine glands and related structures; female reproductive organs, such as ovaries, uterus, cervix; uterine body, vagina, vulva; male reproductive organs, such as penis, testes, and prostate; hematopoietic system and reticuloendothelial system; blood; lymph nodes; thymus. For example, tumors or cancers according to the present application include leukemia, seminoma, melanoma, teratoma, lymphoma, neuroblastoma, glioma, rectal cancer, endometrial cancer, kidney cancer, adrenal cancer, thyroid cancer, blood cancer, skin cancer, brain cancer, cervical cancer, intestinal cancer, liver cancer, colon cancer, stomach cancer, intestinal cancer, head and neck cancer, gastrointestinal cancer, lymph node cancer, esophageal cancer, colorectal cancer, pancreatic cancer, ear, nose, throat (ENT) cancer, breast cancer, prostate cancer, uterine cancer, ovarian cancer and examples thereof are lung cancer, breast cancer, prostate cancer, colon cancer, renal cell carcinoma, uterine cancer or metastasis of the above cancer types or tumors. The term "cancer" according to the present disclosure also includes cancer metastasis and cancer recurrence.
通常,当向受试者施用时,根据本公开的新抗原肽不会诱导显著的自体免疫应答和/或引起免疫耐受性。耐受机制涉及宿主中的克隆缺失、忽视、无能或抑制,其中高亲和力自反应性T细胞的数量减少。Typically, when administered to a subject, the neoantigenic peptides according to the present disclosure do not induce a significant autoimmune response and/or induce immune tolerance. Tolerance mechanisms involve clonal deletion, neglect, anergy or suppression in the host, in which the number of high-affinity self-reactive T cells is reduced.
新抗原肽也可以通过延长或减少化合物的氨基酸序列(例如通过增加或删除氨基酸)进行修饰。也可以通过改变某些残基的顺序或组成来修饰肽,很容易理解,对生物活性至关重要的某些氨基酸残基,例如关键接触部位的残基或保守残基,通常不会被改变,从而对生物活性无副作用。非关键氨基酸不必限于天然存在于蛋白质中的那些,例如L-α-氨基酸或其D-异构体,也可以包括非天然氨基酸,例如β-γ-δ-氨基酸,以及许多L-α-氨基酸的衍生物。The new antigenic peptide can also be modified by extending or reducing the amino acid sequence of the compound (for example, by adding or deleting amino acids). The peptide can also be modified by changing the order or composition of certain residues. It is easy to understand that certain amino acid residues that are essential for biological activity, such as residues in key contact sites or conserved residues, are usually not changed, so as to have no side effects on biological activity. Non-critical amino acids are not necessarily limited to those naturally present in proteins, such as L-α-amino acids or their D-isomers, but can also include non-natural amino acids, such as β-γ-δ-amino acids, and many derivatives of L-α-amino acids.
通常,采用一系列单个氨基酸取代的肽来确定静电荷、疏水性等对结合的影响。例如,沿着肽的长度进行一系列带正电(例如Lys或Arg)或带负电(例如Glu)的氨基酸取代,显示出对各种MHC分子和T细胞受体的不同敏感性模式。另外,可以采用使用较小的、相对中性的分子(例如Ala、Gly、Pro或类似残基)的多个取代。取代可以是同源寡聚物或异源寡聚物。被取代或添加的残基的数量和类型取决于必要接触点之间的必要间隔和所寻求的某些功能属性(例如,疏水性与亲水性)。与亲代肽的亲和力相比,通过这种取代也可以提高对MHC分子或T细胞受体的结合亲和力。在任何情况下,这种取代应采用选择的氨基酸残基或其他分子片段,以避免例如可能破坏结合的空间和电荷干扰。Typically, a series of peptides with single amino acid substitutions are used to determine the effects of electrostatic charge, hydrophobicity, etc. on binding. For example, a series of positively charged (e.g., Lys or Arg) or negatively charged (e.g., Glu) amino acid substitutions along the length of the peptide show different sensitivity patterns to various MHC molecules and T cell receptors. In addition, multiple substitutions using smaller, relatively neutral molecules (e.g., Ala, Gly, Pro or similar residues) can be used. The substitutions can be homo-oligomers or hetero-oligomers. The number and type of residues substituted or added depends on the necessary spacing between the necessary contact points and certain functional attributes sought (e.g., hydrophobicity vs. hydrophilicity). The binding affinity to MHC molecules or T cell receptors can also be improved by such substitutions compared to the affinity of the parent peptide. In any case, such substitutions should use selected amino acid residues or other molecular fragments to avoid, for example, steric and charge interference that may destroy binding.
氨基酸取代通常为单个残基。取代、缺失、插入或其任何组合可被组合以获得最终的肽。取代变体是指其中至少一个肽的残基被移除并且另一个残基被插入其位置的变体。当希望精细调节肽的特性时,通常根据下表1进行这种取代。Amino acid substitutions are usually single residues. Substitution, deletion, insertion or any combination thereof can be combined to obtain the final peptide. Substitution variants refer to variants in which at least one residue of the peptide is removed and another residue is inserted in its place. When it is desired to fine-tune the properties of the peptide, such substitutions are usually performed according to Table 1 below.
表1Table 1
功能上的实质性变化(例如对MHC分子或T细胞受体的亲和力)是通过选择比上表中保守程度低的取代来实现的,即选择它们在维持(a)取代区域的肽骨架结构(例如片状或螺旋构象)或(b)分子在靶部位的电荷或疏水性或(c)侧链的主体方面的作用中具有更显著差异的残基。通常预期会产生最大的肽性质变化的取代为(a)亲水性残基(例如丝氨酸)取代(或被取代)疏水性残基(例如亮氨酸、异亮氨酸、苯丙氨酸、缬氨酸或丙氨酰);(b)具有正电侧链的残基(例如赖氨酸、精氨酸或组氨酸)取代(或被取代)负电荷残基(例如谷氨酰或天冬氨酰);或(c)具有庞大侧链的残基(例如苯丙氨酸)取代(或被取代)不具有侧链的残基(例如甘氨酸)。Substantial changes in function (e.g., affinity for MHC molecules or T cell receptors) are achieved by selecting substitutions that are less conservative than those in the table above, i.e., by selecting residues that have a more significant difference in their role in maintaining (a) the structure of the peptide backbone in the area of the substitution (e.g., sheet or helical conformation) or (b) the charge or hydrophobicity of the molecule at the target site or (c) the bulk of the side chain. Substitutions that are generally expected to produce the greatest changes in peptide properties are (a) substitutions of (or with) hydrophobic residues (e.g., leucine, isoleucine, phenylalanine, valine, or alanyl) for hydrophilic residues (e.g., serine); (b) substitutions of (or with) negatively charged residues (e.g., glutamyl or aspartyl) for residues with positively charged side chains (e.g., lysine, arginine, or histidine); or (c) substitutions of (or with) residues with bulky side chains (e.g., phenylalanine) for residues without side chains (e.g., glycine).
所述肽和多肽还可包含新抗原肽或新抗原多肽中两个或多个残基的电子等排体。此处定义的电子等排体是可以取代第二序列的两个或多个残基的序列,因为第一序列的空间构象适配第二序列的特定结合部位。该术语具体包括本领域技术人员熟知的肽骨架修饰。这种修饰包括酰胺氮、α-碳、酰胺羰基的修饰、酰胺键的完全替换、延伸、缺失或骨架交联。大致参见Spatola,Chemistry and Biochemistry of Amino Acids,Peptides andProteins,Vol.VII(Weinstein ed.,1983)。The peptides and polypeptides may also contain isosteres of two or more residues in the neoantigenic peptide or neoantigenic polypeptide. The isosteres defined herein are sequences that can replace two or more residues of the second sequence because the spatial conformation of the first sequence fits the specific binding site of the second sequence. The term specifically includes peptide backbone modifications well known to those skilled in the art. Such modifications include modifications of amide nitrogen, α-carbon, amide carbonyl, complete replacement, extension, deletion or backbone cross-linking of amide bonds. See generally Spatola, Chemistry and Biochemistry of Amino Acids, Peptides and Proteins, Vol. VII (Weinstein ed., 1983).
此外,新抗原肽可以与载体蛋白、配体或抗体结合。肽的半衰期可通过PEG化、糖基化、聚唾液酸化、HES化、重组PEG模拟物、Fc融合、清蛋白融合、纳米颗粒附着、纳米颗粒包封、胆固醇融合、铁融合或酰化来提高。In addition, the neoantigenic peptides can be conjugated to carrier proteins, ligands or antibodies. The half-life of the peptides can be increased by PEGylation, glycosylation, polysialylation, HESylation, recombinant PEG mimics, Fc fusion, albumin fusion, nanoparticle attachment, nanoparticle encapsulation, cholesterol fusion, iron fusion or acylation.
用各种氨基酸模拟物或非天然氨基酸对肽和多肽进行的修饰特别有助于提高肽和多肽在体内的稳定性。稳定性可以通过多种方式测定。例如,肽酶和各种生物介质,如人血浆和血清,已被用于测试稳定性。参见,例如Verhoef et al.,Eur.J.Drug MetabPharmacokin.11:291-302(1986)。使用25%人血清(v/v)测定方便地确定本公开的肽的半衰期。方案大致如下。合并的人血清(AB型,非热灭活)在使用前通过离心进行脱脂。然后用RPMI组织培养液将血清稀释至25%,并用于测试肽的稳定性。在预定的时间间隔内,将少量反应溶液移出并添加到6%三氯乙酸水溶液或乙醇中。将混浊的反应样品冷却(4℃)15分钟,然后旋转来沉淀出的沉淀的血清蛋白。随后使用稳定性特定的色谱条件通过反相HPLC确定肽的存在。Modification of peptides and polypeptides with various amino acid mimetics or unnatural amino acids is particularly helpful in improving the stability of peptides and polypeptides in vivo. Stability can be determined in a variety of ways. For example, peptidases and various biological media, such as human plasma and serum, have been used to test stability. See, for example, Verhoef et al., Eur. J. Drug Metab Pharmacokin. 11: 291-302 (1986). The half-life of the peptides disclosed herein is conveniently determined using a 25% human serum (v/v) assay. The protocol is generally as follows. Pooled human serum (AB type, non-heat inactivated) is defatted by centrifugation prior to use. The serum is then diluted to 25% with RPMI tissue culture medium and used to test the stability of the peptides. At predetermined time intervals, a small amount of the reaction solution is removed and added to 6% trichloroacetic acid aqueous solution or ethanol. The turbid reaction sample is cooled (4° C.) for 15 minutes and then spun to precipitate the precipitated serum proteins. The presence of the peptide is then determined by reverse phase HPLC using stability-specific chromatographic conditions.
可以对肽和多肽进行修饰,以提供除改善的血清半衰期以外的所需属性。例如,肽诱导CTL活性的能力可以通过与包含至少一个能够诱导T辅助细胞应答的表位的序列连接而增强。特别优选的免疫原性肽/T辅助结合物通过间隔分子连接。间隔子通常由相对较小的中性分子组成,例如在生理条件下基本不带电的氨基酸或氨基酸模拟物。间隔子通常选自例如Ala、Gly或其他非极性氨基酸或中性极性氨基酸的中性间隔子。应当理解,任选存在的间隔子不需要由相同的残基组成,因此可以是异源寡聚体或同源寡聚体。当存在时,间隔子通常为至少一个或两个残基,更通常为三至六个残基。或者,该肽可以与T辅助肽连接,而无需间隔子。Peptides and polypeptides can be modified to provide the desired attributes in addition to improved serum half-life. For example, the ability of peptides to induce CTL activity can be enhanced by being connected to a sequence comprising at least one epitope that can induce T helper cell responses. Particularly preferred immunogenic peptide/T helper conjugates are connected by spacer molecules. Spacers are usually composed of relatively small neutral molecules, such as amino acids or amino acid mimetics that are substantially uncharged under physiological conditions. Spacers are usually selected from neutral spacers such as Ala, Gly or other non-polar amino acids or neutral polar amino acids. It should be understood that the optional spacers do not need to be composed of the same residues, and therefore can be hetero-oligomers or homo-oligomers. When present, spacers are usually at least one or two residues, more usually three to six residues. Alternatively, the peptide can be connected to the T helper peptide without spacers.
新抗原肽可直接或通过位于肽的氨基或羧基末端的间隔子与T辅助肽连接。新抗原肽或T辅助肽的氨基端可以被酰化。示例性的T辅助肽包括破伤风类毒素830-843、流行性感冒病毒307-319、疟疾环子虫382-398和378-389The neoantigenic peptide can be linked to the T helper peptide directly or via a spacer located at the amino or carboxyl terminus of the peptide. The amino terminus of the neoantigenic peptide or the T helper peptide can be acylated. Exemplary T helper peptides include tetanus toxoid 830-843, influenza virus 307-319, malarial ringworm 382-398 and 378-389
本文所述的多种新抗原肽也可以任选地通过间隔子连接在一起。The multiple neoantigenic peptides described herein may also be optionally linked together via spacers.
跨膜嵌合多肽(或蛋白)和结合其的抗原结合结构域Transmembrane chimeric polypeptide (or protein) and antigen binding domain binding thereto
本公开提供了一组跨膜嵌合多肽(本文也称为pJET或融合转录物衍生肽),其提供了优异的胞外新抗原候选物。所述嵌合蛋白来源于从生物信息管道中预测的融合转录物,所述生物信息管道被开发用于鉴定如上定义的全基因组非经典剪接区域。使用了在TCGA(癌症基因组图谱)和CCLE(博德研究所癌细胞系百科)公开可用的RNA-Seq数据(在实施例部分描述)。The present disclosure provides a set of transmembrane chimeric polypeptides (also referred to herein as pJET or fusion transcript-derived peptides) that provide excellent extracellular neoantigen candidates. The chimeric proteins are derived from fusion transcripts predicted from a bioinformatics pipeline developed to identify genome-wide non-classical splicing regions as defined above. Publicly available RNA-Seq data from TCGA (Cancer Genome Atlas) and CCLE (Broad Institute Cancer Cell Line Encyclopedia) were used (described in the Examples section).
根据本发明的融合转录物(本文也称为嵌合转录物或JET)的鉴定和表征已在本申请的上述部分(也参见实施例)中详细说明。The identification and characterization of fusion transcripts according to the invention (also referred to herein as chimeric transcripts or JETs) has been described in detail in the above section of the present application (see also the Examples).
如前所述,融合转录物来自已知对产生功能多样性至关重要的替代性剪接机制。事实上,剪接机制允许通过现有外显子和内含子序列的重排,从单个基因表达编码多种蛋白的多种mRNA(即转录物或剪接变体)。本文观察到的剪接改变的类型包括外显子跳读、内含子保留和使用替代性剪接供体或受体位点。在根据本发明的融合转录物中,TE可以作为供体(位于5’位)或受体(位于3’位),相应地,外显子可以是受体或供体。因此,TE-外显子剪接导致“非编码”基因组的一部分并入编码基因组,从而将非编码基因组序列暴露于翻译机制。也称为JET(连接外显子TE)的这些融合(或嵌合)转录物包括ORF(开放阅读框),即它们是阅读框中有能力翻译成多肽或蛋白的部分。当TE为受体时,融合转录物的ORF是经典的(即与经典转录物相同),而当TE为供体时,ORF可以是经典的(通常为ORF1),或可以是与ORF1相比,移码1或2个核苷酸(通常分别为ORF2和3)。融合转录物不仅包括融合的TE和外显子序列(对应于JET),还可进一步包括对应于各种转录物同种型的外显子,如果外显子是供体,则在融合断点的上游(在外显子和TE之间),如果TE是供体,则在融合断点的下游。As previously mentioned, fusion transcripts come from alternative splicing mechanisms known to be essential for producing functional diversity. In fact, the splicing mechanism allows the expression of multiple mRNAs (i.e., transcripts or splice variants) encoding multiple proteins from a single gene through the rearrangement of existing exon and intron sequences. The types of splicing changes observed herein include exon skipping, intron retention, and the use of alternative splicing donor or acceptor sites. In fusion transcripts according to the present invention, TE can be used as a donor (positioned at 5') or an acceptor (positioned at 3'), and accordingly, an exon can be an acceptor or a donor. Therefore, TE-exon splicing causes a part of a "non-coding" genome to be incorporated into a coding genome, thereby exposing a non-coding genome sequence to a translation mechanism. These fusion (or chimeric) transcripts, also referred to as JET (joining exon TE), include ORF (open reading frame), i.e., they are parts that have the ability to be translated into polypeptides or proteins in a reading frame. When TE is the acceptor, the ORF of the fusion transcript is canonical (i.e., the same as the canonical transcript), while when TE is the donor, the ORF can be canonical (usually ORF1), or can be frameshifted by 1 or 2 nucleotides compared to ORF1 (usually ORF2 and 3, respectively). The fusion transcript includes not only the fused TE and exon sequence (corresponding to JET), but may further include exons corresponding to various transcript isoforms, upstream of the fusion breakpoint (between the exon and TE) if the exon is the donor, and downstream of the fusion breakpoint if the TE is the donor.
更具体而言,本公开提供了表9-15和19-20中提及的、SEQ为ID NO:1-21542的跨膜嵌合多肽(或蛋白)。表14和19提供了从融合转录物(JET)翻译的氨基酸序列,其中外显子为供体。表15和20提供了从融合转录物(JET)翻译的氨基酸序列,其中外显子为供体。More specifically, the present disclosure provides transmembrane chimeric polypeptides (or proteins) mentioned in Tables 9-15 and 19-20, SEQ ID NOs: 1-21542. Tables 14 and 19 provide amino acid sequences translated from fusion transcripts (JET), wherein exons are donors. Tables 15 and 20 provide amino acid sequences translated from fusion transcripts (JET), wherein exons are donors.
这组(跨膜)嵌合蛋白是通过进一步选择具有外显子序列的融合转录物获得的,所述序列在正常蛋白质组数据库(例如通常为UniProt)中被注释为属于编码跨膜蛋白的转录物。所选的融合转录物的序列随后被翻译(计算机模拟)成融合(或嵌合)多肽序列(也称为翻译的连接或pJET或翻译的JET)。This set of (transmembrane) chimeric proteins is obtained by further selecting fusion transcripts with exon sequences that are annotated as belonging to transcripts encoding transmembrane proteins in normal proteome databases (e.g., typically UniProt). The sequences of the selected fusion transcripts are then translated (in silico) into fusion (or chimeric) polypeptide sequences (also called translated junctions or pJETs or translated JETs).
其中外显子为供体的融合转录物在转录物的经典ORF后从转录物的起始翻译至外显子和TE之间的断点后的第一个终止密码子。Fusion transcripts in which the exon is the donor are translated after the classical ORF of the transcript from the start of the transcript to the first stop codon after the breakpoint between the exon and the TE.
其中TE为供体的融合转录物在3个ORF(1-3)后从TE的起始或从TE和外显子之间的断点前的最后一个终止密码子后翻译至所述断点后的第一个终止密码子。The fusion transcript in which TE is the donor is translated after 3 ORFs (1-3) from the start of TE or from the last stop codon before the breakpoint between TE and exon to the first stop codon after the breakpoint.
仅保留了含有至少3个源自TE序列的氨基酸的翻译的多肽序列。Only translated polypeptide sequences containing at least 3 amino acids derived from TE sequences were retained.
在一些实施方案中,与UniProt中任何参考的或注释的蛋白序列相匹配的来自翻译的连接的肽序列被舍弃,因此,重点关注未注释的嵌合肽(如结果中所举例说明的)。In some embodiments, peptide sequences from translated junctions that matched any reference or annotated protein sequence in UniProt were discarded, thus focusing on unannotated chimeric peptides (as exemplified in the Results).
表9-13提供了从前述蛋白基因组方法中鉴定的肽(通常是考虑其大小的多肽)序列。通常,在来自总蛋白质组学(所有肽组)或表面蛋白质组学(表面组)的MS原始数据中检索本申请的管道生成的融合转录物(即JET)库(使用各种公共RNA seq数据集)。Tables 9-13 provide peptides (usually polypeptides of size) sequences identified from the aforementioned protein genome methods. Typically, the fusion transcripts (i.e., JET) libraries generated by the pipeline of the present application are searched in MS raw data from total proteomics (all peptide groups) or surface proteomics (surface groups) (using various public RNA seq data sets).
因此,本公开特别涉及在细胞膜上表达的嵌合多肽(或蛋白)。分配至跨膜室(如上所述)的计算机模拟多肽的细胞膜表达可通过几种体外方法在不同分子水平上进行实验验证:Therefore, the present disclosure is particularly directed to chimeric polypeptides (or proteins) expressed on cell membranes. Cell membrane expression of computer-simulated polypeptides partitioned into transmembrane compartments (as described above) can be experimentally verified at different molecular levels by several in vitro methods:
a)包含所选的翻译的连接的蛋白或肽可在宿主细胞例如肿瘤细胞系(如Hela、CHO等)中异位表达,以确认质膜内的稳定性和适当整合。可以设计包含翻译的连接和标签序列(例如FLAG或HA)的表达载体,从而产生带标签的融合嵌合蛋白。包含融合的蛋白或肽也含有表位标签,因此可以通过流式细胞术或使用可商购的抗标签抗体的显微术进行检测。克隆入载体的序列可优选设计为标签序列位于紧接TE序列之前或之后的形式。这种方法允许选择产生稳定蛋白的翻译的连接肽或蛋白,所述稳定蛋白在细胞膜中表达,并且通常将TE序列暴露于胞外空间。A) The protein or peptide containing the selected translation connection can be ectopically expressed in a host cell, such as a tumor cell line (such as Hela, CHO, etc.) to confirm stability and proper integration within the plasma membrane. An expression vector containing a translation connection and a tag sequence (such as FLAG or HA) can be designed to produce a tagged fusion chimeric protein. The fused protein or peptide also contains an epitope tag, so it can be detected by flow cytometry or microscopy using commercially available anti-tag antibodies. The sequence cloned into the vector can preferably be designed in a form in which the tag sequence is located immediately before or after the TE sequence. This method allows the selection of a translation connection peptide or protein that produces a stable protein, which is expressed in the cell membrane and usually exposes the TE sequence to the extracellular space.
b)也可以进行靶向测序实验,以在额外的肿瘤标本或细胞系中扩增和检测融合转录物。这可以通过常规PCR、定量实时PCT或SMRT全长转录物测序(PACBIO技术)进行。b) Targeted sequencing experiments can also be performed to amplify and detect fusion transcripts in additional tumor specimens or cell lines. This can be performed by conventional PCR, quantitative real-time PCR, or SMRT full-length transcript sequencing (PACBIO Technology).
c)翻译的序列也可在“翻译组”(其代表细胞内翻译mRNA的整体)中通过核蛋白体谱或ribo-Seq检测。因此,核糖体谱分析能够监测连接转录物翻译过程并预测嵌合体(翻译的连接肽或蛋白)的丰度。通过这种技术,可以鉴定翻译的JET衍生转录物(mRNA)的区域,从而确定融合多肽的翻译起始和终止位点。使用这种桥接JET序列的基因组学和蛋白质组学验证的方法,可以完全确定融合转录物的翻译区域。c) Translated sequences can also be detected in the "translatome" (which represents the entirety of the translating mRNA in the cell) by ribosome profiling or ribo-Seq. Thus, ribosome profiling can monitor the translation process of the junction transcript and predict the abundance of chimeras (translated junction peptides or proteins). By this technique, the regions of the translated JET-derived transcripts (mRNA) can be identified, thereby determining the translation start and stop sites of the fusion polypeptide. Using this method of genomic and proteomic validation of the bridged JET sequence, the translated regions of the fusion transcript can be fully determined.
d)可以通过肿瘤标本和细胞系中的靶向质谱方法对蛋白表达进行进一步的实验验证。可以进行靶向蛋白质组学实验,以极高的精度、灵敏度、特异性和通量每次定量少量的嵌合蛋白(包含翻译的连接)。d) Further experimental validation of protein expression can be performed by targeted mass spectrometry approaches in tumor specimens and cell lines. Targeted proteomics experiments can be performed to quantify small amounts of chimeric proteins (including translated junctions) at a time with extremely high precision, sensitivity, specificity, and throughput.
在更具体的实施方案中,本公开是指如本文定义的嵌合多肽,其中源自TE核苷酸序列的序列的一部分暴露于细胞表面。TE衍生序列的细胞表面暴露可基于所述TE衍生序列的预测拓扑进行计算机模拟预测。In a more specific embodiment, the present disclosure refers to a chimeric polypeptide as defined herein, wherein a portion of the sequence derived from the TE nucleotide sequence is exposed on the cell surface. The cell surface exposure of the TE-derived sequence can be predicted in silico based on the predicted topology of the TE-derived sequence.
预测蛋白位于细胞表面通常涉及(i)检测TM结构域或脂锚;(ii)膜内蛋白方向的定义,包括胞外暴露结构域的鉴定;和/或(iii)亚细胞位置的预测。已有预测TM结构域、信号肽和GPI连接的蛋白的生物信息工具(参见Kail L,Krogh A,Sonnhammer ELL(2004)Acombined transmembrane topology and signal peptide prediction method.J MolBiol 338:1027-1036;Jones DT(2007)Improving the accuracy of transmembraneprotein topology prediction using evolutionary information.Bioinformatics 23:538-544;Reeb J,Kloppmann E,Bemhofer M,Rost B(2015)Evaluation of transmembranehelix predictions in 2014.Proteins 83:473-484;Krogh A,Larsson B,von Heijne G,Sonnhammer ELL(2001)Predicting transmembrane protein topology with a hiddenMarkov model:Application to complete genomes.J Mol Biol 305:567-580;ViklundH,Elofsson A(2008)OCTOPUS:Improving topology prediction by two-track ANN-based preference scores and an extended topological grammar.Bioinformatics24:1662-1668;Fankhauser N,Maser P(2005)Identification of GPI anchor attachmentsignals by a Kohonen self-organizing map.Bioinformatics 21:1846-1852)。Predicting that a protein is located on the cell surface generally involves (i) detection of TM domains or lipid anchors; (ii) definition of the orientation of the protein within the membrane, including identification of extracellularly exposed domains; and/or (iii) prediction of subcellular location. There are bioinformatic tools for predicting TM domains, signal peptides and GPI-linked proteins (see Kail L, Krogh A, Sonnhammer ELL (2004) A combined transmembrane topology and signal peptide prediction method. J Mol Biol 338: 1027-1036; Jones DT (2007) Improving the accuracy of transmembrane protein topology prediction using evolutionary information. Bioinformatics 23: 538-544; Reeb J, Kloppmann E, Bemhofer M, Rost B (2015) Evaluation of transmembranehelix predictions in 2014. Proteins 83: 473-484; Krogh A, Larsson B, von Heijne G, Sonnhammer ELL (2001) Predicting transmembrane protein topology with a hidden Markov model: Application to complete genomes. J Mol Biol 305:567-580; Viklund H, Elofsson A (2008) OCTOPUS: Improving topology prediction by two-track ANN-based preference scores and an extended topological grammar. Bioinformatics 24:1662-1668; Fankhauser N, Maser P (2005) Identification of GPI anchor attachment signals by a Kohonen self-organizing map. Bioinformatics 21:1846-1852).
对于给定的翻译的连接,TE衍生序列的细胞表面表达的实验验证也可以如上所述进行(参见a点)。For a given translated linkage, experimental verification of cell surface expression of the TE-derived sequence can also be performed as described above (see point a).
如前所述,通过从表面组(具有暴露于胞外空间的至少1个、特别是至少2个、至少3个或至少4个氨基酸残基的所有质膜蛋白)的MS原始数据中检索JET库,可以基于蛋白基因组方法实现对在细胞表面表达的肽的鉴定。因此,表面组是质膜蛋白质组的子集,质膜蛋白质组是膜蛋白质组(所有膜蛋白整体)的子集。附着在胞外脂层上[例如,通过糖基磷脂酰肌醇(GPI)锚]的单侧整合膜蛋白是人表面组的一部分As described above, identification of peptides expressed on the cell surface can be achieved based on a proteomic approach by searching the JET library from the MS raw data of the surface group (all plasma membrane proteins with at least 1, in particular at least 2, at least 3 or at least 4 amino acid residues exposed to the extracellular space). The surface group is therefore a subset of the plasma membrane proteome, which is a subset of the membrane proteome (all membrane proteins as a whole). Unilateral integral membrane proteins attached to the extracellular lipid layer [e.g., via a glycosylphosphatidylinositol (GPI) anchor] are part of the human surface group.
在一些实施方案中,本公开更具体地涉及由TE衍生序列和外显子衍生序列之间的连接的下游的非经典ORF产生的嵌合多肽。通过将标签定位在非经典ORF序列之前或之后,也可以如上所述(参见a点)对具有预测为胞外拓扑结构的非经典ORF翻译序列的细胞表面表达进行实验性确认。In some embodiments, the present disclosure is more specifically directed to chimeric polypeptides generated from a non-classical ORF downstream of the junction between a TE-derived sequence and an exon-derived sequence. By positioning the tag before or after the non-classical ORF sequence, the cell surface expression of a non-classical ORF translation sequence with a predicted extracellular topology can also be experimentally confirmed as described above (see point a).
在一些实施方案中,可基于融合类型(TE供体或受体)和膜蛋白的亚型选择翻译的连接。这种选择可通过以下方式实现:In some embodiments, the translational linkage can be selected based on the fusion type (TE donor or acceptor) and the subtype of the membrane protein. This selection can be achieved by:
-完整膜蛋白-Integral membrane proteins
○I型单程蛋白(如此定位以使其羧基末端朝向胞质溶胶):选择的转录物是由TE作为供体(融合TE->外显子)的融合衍生的,在某些情况下,这种融合在TE作为受体(融合外显子-TE)的第二融合之前。在后一种情况下,转录物由双融合“外显子-TE-外显子”或“元融合(metafusion)”产生,其中产生的转录物包括侧接两个经典外显子的TE外显子外切序列(exonisated sequence)。o Type I single-pass proteins (positioned so that their carboxyl termini face the cytosol): selected transcripts are derived from fusions with TE as donor (fusion TE->exon), which in some cases are preceded by a second fusion with TE as acceptor (fusion exon-TE). In the latter case, transcripts are generated by a double fusion "exon-TE-exon" or "metafusion", in which the resulting transcript includes a TE exon exonisated sequence flanking two classical exons.
○ II类单程蛋白(其氨基末端朝向胞质溶胶):选择的转录物是由TE作为受体(融合外显子->TE)的融合衍生的,在某些情况下,这种融合在TE作为供体(融合TE->外显子)的第二融合之后。在后一种情况下,转录物由双融合“外显子-TE-外显子”或“元融合”产生,其中产生的转录物包括侧接两个经典外显子的TE外显子外切序列。○ Class II single-pass proteins (with their amino termini facing the cytosol): selected transcripts are derived from fusions with TE as acceptor (fusion exon->TE), which in some cases are followed by a second fusion with TE as donor (fusion TE->exon). In the latter case, transcripts are generated from double fusions "exon-TE-exon" or "meta-fusions", where the resulting transcripts include a TE exon exon sequence flanking two canonical exons.
○多程或多跨膜蛋白(多肽链多次穿过膜):TE序列可作为供体或受体和/或元融合的一部分。断点(或元融合中的两个断点之一)位于两个跨膜螺旋之间的膜的胞外侧。o Multipass or multispanning proteins (polypeptide chains pass through the membrane multiple times): TE sequences can act as donors or acceptors and/or as part of a metafusion. The breakpoint (or one of the two breakpoints in a metafusion) is located on the extracellular side of the membrane between the two transmembrane helices.
○单侧整合蛋白(仅附着于膜的一侧且不横跨整个膜的整合膜蛋白):选择的转录物是由TE作为受体(融合外显子->TE)、作为供体(融合TE->外显子)或同时作为两者(元融合)的融合衍生的。○ Unilateral integrins (integral membrane proteins that are attached to only one side of the membrane and do not span the entire membrane): Transcripts selected are derived from fusions with TE as acceptor (fusion exon->TE), as donor (fusion TE->exon), or as both (meta-fusion).
-外周膜蛋白(附着或结合于细胞膜):选择的转录物是由TE作为受体(融合外显子->TE)、作为供体(融合TE->外显子)或同时作为两者(元融合)的融合衍生的。- Peripheral membrane proteins (attached or bound to the cell membrane): Transcripts selected are derived from fusions with TE as acceptor (fusion exon->TE), as donor (fusion TE->exon) or both (meta-fusion).
通常,根据本公开的嵌合蛋白在超过1%,特别是超过5%,并通常超过10%的肿瘤样品中表达(来自一个或多个受试者和/或来自一种或多种肿瘤类型,其中肿瘤样品可以从TCGA获得)。替代地或额外地,嵌合蛋白(或pJET)可在一种或多种细胞系(通常来自CCLE),特别是在至少1;2;3;4;5;6;7;8;9;10;20;50;100种细胞系中表达。Typically, the chimeric protein according to the present disclosure is expressed in more than 1%, particularly more than 5%, and typically more than 10% of tumor samples (from one or more subjects and/or from one or more tumor types, where tumor samples can be obtained from TCGA). Alternatively or additionally, the chimeric protein (or pJET) may be expressed in one or more cell lines (typically from CCLE), particularly in at least 1; 2; 3; 4; 5; 6; 7; 8; 9; 10; 20; 50; 100 cell lines.
通常,与正常样品(包括近肿瘤样品)和/或细胞系相比,本公开的嵌合蛋白在肿瘤样品中的表达水平更高。更具体而言,嵌合蛋白优选在低于20%,特别是低于10%,低于5%或低于1%的正常样品或细胞系中表达。在一些实施方案中,嵌合蛋白在正常样品(包括近肿瘤样品)中不可检测地表达。Typically, the chimeric protein of the present disclosure is expressed at a higher level in tumor samples than in normal samples (including adjacent tumor samples) and/or cell lines. More specifically, the chimeric protein is preferably expressed in less than 20%, particularly less than 10%, less than 5% or less than 1% of normal samples or cell lines. In some embodiments, the chimeric protein is not detectably expressed in normal samples (including adjacent tumor samples).
本公开还涵盖与SEQ ID NO:1-8202的任何一种嵌合多肽(或蛋白)具有至少50%、特别是至少55%、60%、65%、70%、75%、80%、85%、90%、95%、96%、97%、98%、99%同一性的嵌合多肽变体。所述变体嵌合多肽在细胞表面膜上表达。通常,在所述变体中,源自TE核苷酸序列的序列与变体衍生的肽的TE衍生序列保持至少90%、95%、96%、97%、98%、99%的同一性。更优选地,所述变体嵌合蛋白与正常蛋白质组数据库(例如UniProt)中的任何注释的多肽或蛋白都不匹配。The present disclosure also encompasses chimeric polypeptide variants having at least 50%, in particular at least 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% identity to any chimeric polypeptide (or protein) of SEQ ID NO: 1-8202. The variant chimeric polypeptide is expressed on the cell surface membrane. Typically, in the variant, the sequence derived from the TE nucleotide sequence maintains at least 90%, 95%, 96%, 97%, 98%, 99% identity to the TE-derived sequence of the variant-derived peptide. More preferably, the variant chimeric protein does not match any annotated polypeptide or protein in a normal proteome database (e.g., UniProt).
本文公开的(跨膜)嵌合蛋白也可以通过延长或减少化合物的氨基酸序列,例如通过添加或缺失氨基酸进行修饰。嵌合蛋白也可以通过改变某些残基的顺序或组成进行修饰,很容易理解,对于生物活性至关重要的某些氨基酸残基,例如关键接触位点或保守残基,在不对生物活性产生不利影响的情况下通常不会改变。非关键氨基酸不必限于天然存在于蛋白中的那些,例如L-α-氨基酸或其D-异构体,也可以包括非天然氨基酸,例如β-γ-δ-氨基酸,以及许多L-α-氨基酸衍生物。The (transmembrane) chimeric proteins disclosed herein can also be modified by extending or reducing the amino acid sequence of the compound, for example by adding or deleting amino acids. The chimeric protein can also be modified by changing the order or composition of certain residues, and it is easy to understand that certain amino acid residues that are essential for biological activity, such as key contact sites or conserved residues, are usually not changed without adversely affecting the biological activity. Non-critical amino acids are not necessarily limited to those naturally present in proteins, such as L-α-amino acids or their D-isomers, and can also include non-natural amino acids, such as β-γ-δ-amino acids, and many L-α-amino acid derivatives.
通常,采用一系列单个氨基酸取代的肽来确定静电荷、疏水性等对结合的影响。取代可以是均质低聚物或异质低聚物。被取代或添加的残基的数量和类型取决于必要接触点之间的必要间隔和所寻求的某些功能属性(例如,疏水性与亲水性)。在任何情况下,这种取代应采用选择的氨基酸残基或其他分子片段,以避免例如可能破坏结合的空间和电荷干扰。Typically, a series of single amino acid substitutions are made to the peptide to determine the effects of net charge, hydrophobicity, etc. on binding. Substitutions may be homo-oligomeric or hetero-oligomeric. The number and type of residues substituted or added will depend on the necessary spacing between necessary contact points and certain functional attributes sought (e.g., hydrophobicity vs. hydrophilicity). In any case, such substitutions should be made with amino acid residues or other molecular fragments selected to avoid, for example, steric and charge interferences that could disrupt binding.
氨基酸取代通常为单个残基。取代、缺失、插入或其任何组合可被组合以获得最终的肽。取代变体是指其中至少一个肽的残基被移除,并且另一个残基被插入其位置的变体。当希望精细调节肽的特性时,通常根据下表1进行这种取代。Amino acid substitutions are usually single residues. Substitution, deletion, insertion or any combination thereof can be combined to obtain the final peptide. Substitution variants refer to variants in which at least one residue of the peptide is removed and another residue is inserted in its place. When it is desired to fine-tune the properties of the peptide, such substitutions are usually performed according to Table 1 below.
需要提及的是,本文公开的嵌合多肽或蛋白也可通过蛋白酶体机制加工,并产生通常至少8个氨基酸(特别是8-25个氨基酸)的新抗原肽,所述新抗原肽结合如前详述的所述受试者的至少一个主要组织相容性复合物(MHC)分子。It should be mentioned that the chimeric polypeptides or proteins disclosed herein can also be processed by the proteasome mechanism and produce a neoantigenic peptide of generally at least 8 amino acids (particularly 8-25 amino acids), which binds to at least one major histocompatibility complex (MHC) molecule of the subject as described in detail above.
本公开还涵盖本文所述的抗原结合结构域,其结合如上述定义的嵌合蛋白或其片段,特别是其长度至少为4、5、6、7或8个氨基酸的新抗原肿瘤序列(或表位),其中解离常数(Kd)约为2×10-7M或更小。在某些实施例中,Kd为约2×10-7M或更小,约1×10-7M或更小,约9×10-8M或更小,约9×10-9M或更小,约5×10-9M或更小,约4×10-9M或更小,约3×10-9M或更小,约2×10-9M或更小,或约1×10-9M或更小,或约1×10-10M或更小,或约1×10-12M或更小。在某些非限制性实施方案中,Kd为约3×10-9M或更小。在某些非限制性实施方案中,Kd为约1×10-9M至约3×10--7M。在某些非限制性实施方案中,Kd为约1.5×10-9M至约3×10-7M。在某些非限制性实施方案中,Kd为约1.5×10-9M至约2.7×10-7M。在某些非限制性实施方案中,Kd为约1.5×10-10M至约2.7×10-7M。在某些非限制性实施方案中,Kd为约1.5×10-12M至约2.7×10-7M。The present disclosure also encompasses an antigen binding domain as described herein, which binds to a chimeric protein or fragment thereof as defined above, in particular a neoantigenic tumor sequence (or epitope) of at least 4, 5, 6, 7, or 8 amino acids in length, wherein the dissociation constant (Kd) is about 2× 10-7 M or less. In certain embodiments, the Kd is about 2× 10-7 M or less, about 1× 10-7 M or less, about 9× 10-8 M or less, about 9× 10-9 M or less, about 5× 10-9 M or less, about 4× 10-9 M or less, about 3× 10-9 M or less, about 2× 10-9 M or less, or about 1× 10-9 M or less, or about 1× 10-10 M or less, or about 1× 10-12 M or less. In certain non-limiting embodiments, the Kd is about 3× 10-9 M or less. In certain non-limiting embodiments, K d is from about 1×10 -9 M to about 3×10 -7 M. In certain non-limiting embodiments, K d is from about 1.5×10 -9 M to about 3×10 -7 M. In certain non-limiting embodiments, K d is from about 1.5×10 -9 M to about 2.7×10 -7 M. In certain non-limiting embodiments, K d is from about 1.5×10 -10 M to about 2.7×10 -7 M. In certain non-limiting embodiments, K d is from about 1.5×10 -12 M to about 2.7×10 -7 M.
抗原结合结构域(例如Fv或其类似物)的结合可通过例如酶联免疫吸附测定(ELISA)、放射免疫测定(RIA)、FACS分析、生物测定(例如生长抑制)、Western Blot测定或荧光显微术来确认。这些测定的每一种通常通过使用目标复合物特异性的标记试剂(例如,抗体或Fv)来检测特定目标蛋白-抗体复合物的存在。例如,Fv可被放射性标记并用于放射免疫分析(RIA)(参见,例如,Weintraub,B.,Principles of Radioimmunoassays,SeventhTraining Course on Radioligand Assay Techniques,The Endocrine Society,March,1986,其通过引用并入本文)。放射性同位素可通过使用g计数器或闪烁计数器或放射自显影等方法进行检测。The binding of the antigen binding domain (e.g., Fv or its analog) can be confirmed by, for example, enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA), FACS analysis, bioassay (e.g., growth inhibition), Western Blot assay, or fluorescence microscopy. Each of these assays is usually performed by using a labeling agent (e.g., antibody or Fv) specific for the target complex to detect the presence of a specific target protein-antibody complex. For example, Fv can be radiolabeled and used in radioimmunoassay (RIA) (see, e.g., Weintraub, B., Principles of Radioimmunoassays, Seventh Training Course on Radioligand Assay Techniques, The Endocrine Society, March, 1986, which is incorporated herein by reference). Radioisotopes can be detected by using methods such as a g counter or a scintillation counter or autoradiography.
对于显微术,标记试剂(例如,抗体或Fv)可以直接与荧光团缀合,或被针对标记试剂的荧光团缀合的二抗识别。荧光团(也称为荧光染料)的非限制性例子包括花菁(例如Cy3)或罗丹明(例如TRITC)或荧光素(例如FITC)的衍生物。For microscopy, the labeling agent (e.g., antibody or Fv) can be directly conjugated to a fluorophore, or recognized by a secondary antibody conjugated to the fluorophore of the labeling agent. Non-limiting examples of fluorophores (also called fluorescent dyes) include derivatives of cyanines (e.g., Cy3) or rhodamines (e.g., TRITC) or fluoresceins (e.g., FITC).
在某些实施方案中,胞外抗原结合结构域用荧光标记物标记。荧光标记物的非限制性例子包括绿色荧光蛋白(GFP)、蓝色荧光蛋白(例如,EBFP、EBFP2、蓝铜矿和mKalamal)、青色荧光蛋白(例如,ECFP、天蓝和CyPet)和黄色荧光蛋白(例如,YFP、Citrine、Venus和YPet)。In certain embodiments, the extracellular antigen-binding domain is labeled with a fluorescent marker. Non-limiting examples of fluorescent markers include green fluorescent protein (GFP), blue fluorescent protein (e.g., EBFP, EBFP2, Azurite and mKalamal), cyan fluorescent protein (e.g., ECFP, Cerulean and CyPet) and yellow fluorescent protein (e.g., YFP, Citrine, Venus and YPet).
在一些实施方案中,本文公开的抗原结合结构域结合本文所述的嵌合蛋白的氨基酸序列(或表位)的片段(或肿瘤新抗原肽序列),所述嵌合蛋白包含至少一个TE衍生的氨基酸序列或来自SEQ ID NO:1424-8202;8203-10163和12831-21542中的任何一个(通常由融合转录物编码,其中TE是供体)。在一些实施方案中,来自本文所述嵌合蛋白的肽序列与TE衍生的氨基酸序列和外显子衍生的氨基酸序列之间的断点重叠。在其他实施方案中,肽序列源自纯TE序列。在其他实施方案中,肽序列由TE衍生的氨基酸序列和外显子衍生的氨基酸序列之间的连接的下游的非经典ORF编码。In some embodiments, the antigen binding domain disclosed herein binds to a fragment of the amino acid sequence (or epitope) of the chimeric protein described herein (or a tumor neoantigen peptide sequence), the chimeric protein comprising at least one TE-derived amino acid sequence or from any one of SEQ ID NOs: 1424-8202; 8203-10163 and 12831-21542 (generally encoded by a fusion transcript, wherein TE is the donor). In some embodiments, the peptide sequence from the chimeric protein described herein overlaps the breakpoint between the TE-derived amino acid sequence and the exon-derived amino acid sequence. In other embodiments, the peptide sequence is derived from a pure TE sequence. In other embodiments, the peptide sequence is encoded by a non-classical ORF downstream of the junction between the TE-derived amino acid sequence and the exon-derived amino acid sequence.
在一些实施方案中,根据本公开的抗原结合结构域结合来自本公开的任何一种嵌合多肽新抗原肽或其片段的新抗原肽序列,其中所述新抗原肽序列:In some embodiments, an antigen binding domain according to the present disclosure binds to a neoantigenic peptide sequence from any one of the chimeric polypeptide neoantigenic peptides or fragments thereof of the present disclosure, wherein the neoantigenic peptide sequence:
a)来自SEQ ID NO:1-21542中的任何一个或其片段,并且包含至少一个源自TE衍生的氨基酸或核苷酸序列的序列,a) from any one of SEQ ID NOs: 1-21542 or a fragment thereof, and comprising at least one sequence derived from an amino acid or nucleotide sequence derived from TE,
任选地(i)与TE衍生的氨基酸序列和外显子衍生的氨基酸序列之间的断点重叠的片段,或,Optionally (i) a fragment overlapping the breakpoint between the TE-derived amino acid sequence and the exon-derived amino acid sequence, or,
任选地(ii)纯TE序列;或Optionally (ii) pure TE sequence; or
b)来自SEQ ID NO:1-1423;8203-10163和10164-12830中的任何一个(通常由融合转录物编码,其中外显子为供体)或其片段,并且由TE衍生的氨基酸序列和外显子衍生的氨基酸序列之间的连接的下游的非经典ORF编码。b) from any one of SEQ ID NOs: 1-1423; 8203-10163 and 10164-12830 (typically encoded by a fusion transcript, in which the exon is the donor) or a fragment thereof, and encoded by a non-canonical ORF downstream of the junction between the TE-derived amino acid sequence and the exon-derived amino acid sequence.
通常,肽序列来自嵌合蛋白的胞外部分。Typically, the peptide sequence is derived from the extracellular portion of the chimeric protein.
在某些实施方案中,抗原结合结构域包含TCR的抗原结合部分。In certain embodiments, the antigen binding domain comprises an antigen binding portion of a TCR.
在某些实施方案中,抗原结合结构域包含抗体或其片段的抗原结合部分。在某些实施方案中,抗原结合结构域包含抗体的重链可变区(VH)和/或轻链可变区(VL)。在某些实施方案中,抗原结合结构域包含单链可变片段(scFv)。In certain embodiments, the antigen binding domain comprises an antigen binding portion of an antibody or fragment thereof. In certain embodiments, the antigen binding domain comprises a heavy chain variable region (VH) and/or a light chain variable region (VL) of an antibody. In certain embodiments, the antigen binding domain comprises a single chain variable fragment (scFv).
在某些实施方案中,抗原结合结构域包含仅重链的抗体(VHH)或其变体和/或VL结构域或其变体。In certain embodiments, the antigen binding domain comprises a heavy chain only antibody (VHH) or a variant thereof and/or a VL domain or a variant thereof.
在某些实施方案中,抗原结合结构域包含Fab,其任选被交联。在某些实施方案中,抗原结合结构域包含F(ab)2。在某些实施方案中,任何前述分子可包含在具有异源序列的融合蛋白中,以形成抗原结合结构域。In certain embodiments, the antigen binding domain comprises a Fab, which is optionally cross-linked. In certain embodiments, the antigen binding domain comprises a F(ab)2. In certain embodiments, any of the foregoing molecules may be included in a fusion protein with a heterologous sequence to form an antigen binding domain.
在某些实施方案中,胞外抗原结合结构域源自源于鼠、人或骆驼科动物(例如美洲驼)的scFv、Fab或抗体。In certain embodiments, the extracellular antigen binding domain is derived from a scFv, Fab or antibody derived from mouse, human or camelid (eg, llama).
肽产品、多核苷酸和载体Peptide products, polynucleotides and vectors
蛋白或肽可通过本领域技术人员已知的任何技术制备,包括通过标准分子生物学技术表达蛋白质、多肽或肽,从天然来源分离蛋白质或肽,或化学合成蛋白质或肽。对应于各种基因的核苷酸和蛋白质、多肽和肽序列先前已经公开,并且可以在本领域普通技术人员已知的计算机化数据库中找到。一个这样的数据库是位于国家卫生研究院网站的国家生物技术信息中心的Genbank和GenPept数据库。已知基因的编码区可使用本文公开的技术或本领域普通技术人员已知的技术进行扩增和/或表达。或者,蛋白、多肽和肽的各种商用制剂为本领域技术人员所知。Protein or peptide can be prepared by any technology known to those skilled in the art, including expression of protein, polypeptide or peptide by standard molecular biology techniques, isolation of protein or peptide from natural sources, or chemical synthesis of protein or peptide. Nucleotide and protein, polypeptide and peptide sequences corresponding to various genes have been previously disclosed and can be found in computerized databases known to those of ordinary skill in the art. One such database is the Genbank and GenPept databases of the National Center for Biotechnology Information located at the National Institutes of Health website. The coding region of a known gene can be amplified and/or expressed using technology disclosed herein or known to those of ordinary skill in the art. Alternatively, various commercial preparations of protein, polypeptide and peptide are known to those skilled in the art.
在另一方面,本公开提供了编码如本文公开的新抗原肽的核酸(例如多核苷酸)。多核苷酸可选自DNA、cDNA、PNA、CNA、RNA、单链和/或双链、或天然或稳定形式的多核苷酸,例如具有硫代磷酸骨架的多核苷酸或其组合,并且只要其编码肽,其可以包含或不包含内含子。只有含有通过天然存在的肽键连接的天然存在的氨基酸残基的肽可被多核苷酸编码。On the other hand, the present disclosure provides nucleic acids (e.g., polynucleotides) encoding new antigenic peptides as disclosed herein. The polynucleotides may be selected from DNA, cDNA, PNA, CNA, RNA, single-stranded and/or double-stranded, or natural or stable forms of polynucleotides, such as polynucleotides with a thiophosphate backbone or a combination thereof, and as long as it encodes a peptide, it may or may not contain introns. Only peptides containing naturally occurring amino acid residues connected by naturally occurring peptide bonds can be encoded by polynucleotides.
本公开的又一方面提供了能够表达如本文公开的新抗原肽的表达载体。不同细胞类型的表达载体在本领域中是众所周知的,并且无需过度实验即可选择。通常,将DNA以适当的方向和正确的阅读框架插入表达载体(如质粒)中进行表达。表达载体将包含所需宿主识别的合适的异源转录和/或翻译调控核苷酸序列。编码肿瘤新抗原肽的多核苷酸可以与该异源调控核苷酸序列连接,或可以与该异源调控核苷酸序列不相邻但仍可操作地连接。然后通过标准技术将载体引入宿主。例如,可以在Sambrook et al.(1989)MolecularCloning,A Laboratory Manual,Cold Spring Harbor Laboratory,Cold Spring Harbor,N.Y.中找到指南。Another aspect of the present disclosure provides an expression vector capable of expressing a new antigenic peptide as disclosed herein. Expression vectors for different cell types are well known in the art and can be selected without undue experimentation. Typically, DNA is inserted into an expression vector (such as a plasmid) in the proper direction and correct reading frame for expression. The expression vector will contain a suitable heterologous transcription and/or translation regulatory nucleotide sequence recognized by the desired host. The polynucleotide encoding the tumor new antigen peptide may be linked to the heterologous regulatory nucleotide sequence, or may be non-adjacent to the heterologous regulatory nucleotide sequence but still operably linked. The vector is then introduced into the host by standard techniques. For example, guidance can be found in Sambrook et al. (1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.
抗原呈递细胞Antigen presenting cells
本公开还包括已经用一种或多种前述定义的肽脉冲和/或通过前述方法获得的抗原呈递细胞群体。优选地,抗原呈递细胞为树突状细胞或人工抗原呈递细胞(参见Neal,Lillian R et al.“The Basics of Artificial Antigen Presenting Cells in T Cell-Based Cancer Immunotherapies.”Journal of immunology research and therapyvol.2,1(2017):68-79)。树突状细胞(DC)是专门的抗原呈递细胞(APC),具有刺激初始T细胞和启动病原体初级免疫应答的非凡能力。事实上,成熟DC的主要作用是感知抗原并产生激活其他免疫细胞(尤其是T细胞)的中介体。由于DC表达触发T细胞上TCR(信号1)和共刺激分子(信号2)的MHC分子,因此它是淋巴细胞激活的有效刺激因子。此外,DC还分泌支持T细胞扩增的细胞因子。T细胞需要以加工肽的形式呈递抗原,以识别外来病原体或肿瘤。病原体/肿瘤蛋白衍生肽表位通过MHC分子呈递。MHC I类(MHC-I)和MHC II类(MHC-II)分子分别向CD8+T细胞和CD4+T细胞呈递加工肽。重要的是,DC归巢至含有大量T细胞群体的炎性部位,以促进免疫应答。因此,DC可能是任何免疫治疗方法的重要组成部分,因为它们与适应性免疫应答的激活密切相关。就疫苗而言,DC疗法可以在健康志愿者或感染性疾病或癌症患者内增强T细胞对所需靶标的免疫应答。在一个实施方案中,APCS是人工APC,其经基因修饰以表达所需的T细胞共刺激分子、人HLA等位基因和/或细胞因子。这种人工抗原呈递细胞(aAPC)能够提供足够的T细胞参与、共刺激以及细胞因子持续释放的要求,从而允许T细胞的受控扩增。这些细胞不受时间和有限可用性的限制,可以小份储存,用于随后从不同供体产生T细胞系,因此代表了用于免疫治疗应用的现成试剂。在这些aAPC上表达有效的共刺激信号赋予该系统更高的效率,从而提高过继免疫治疗的功效。此外,aAPC可被工程化以表达引导特定细胞因子释放的基因,以促进所需T细胞亚群的优先扩增,用于过继转移;例如长寿命记忆T细胞(参见综述Hasan AH et al.,.Artificial Antigen Presenting Cells:AnOff the Shelf Approach for Generation of Desirable T-Cell Populations forBroad Application of Adoptive Immunotherapy;Adv Genet Eng.2015;4(3):130,KimJV,Latouche JB,Rivière I,Sadelain M.The ABCs of artificial antigenpresentation.Nat Biotechnol.2004;22:403-410 or Wang C,Sun W,Ye Y,Bomba HN,GuZ.Bioengineering of Artificial Antigen Presenting Cells and LymphoidOrgans.Theranostics 2017;7(14):3504-3516.)。The present disclosure also includes a population of antigen presenting cells that have been pulsed with one or more of the aforementioned peptides and/or obtained by the aforementioned methods. Preferably, the antigen presenting cells are dendritic cells or artificial antigen presenting cells (see Neal, Lillian R et al. "The Basics of Artificial Antigen Presenting Cells in T Cell-Based Cancer Immunotherapies." Journal of immunology research and therapy vol. 2, 1 (2017): 68-79). Dendritic cells (DC) are specialized antigen presenting cells (APC) with the extraordinary ability to stimulate initial T cells and initiate primary immune responses to pathogens. In fact, the main function of mature DC is to sense antigens and produce mediators that activate other immune cells (especially T cells). Since DC expresses MHC molecules that trigger TCR (signal 1) and co-stimulatory molecules (signal 2) on T cells, it is an effective stimulatory factor for lymphocyte activation. In addition, DC also secretes cytokines that support T cell expansion. T cells need to present antigens in the form of processed peptides to recognize foreign pathogens or tumors. Pathogen/tumor protein-derived peptide epitopes are presented by MHC molecules. MHC class I (MHC-I) and MHC class II (MHC-II) molecules present processed peptides to CD8+T cells and CD4+T cells, respectively. Importantly, DCs home to inflammatory sites containing a large number of T cell populations to promote immune responses. Therefore, DCs may be an important component of any immunotherapy approach because they are closely related to the activation of adaptive immune responses. With regard to vaccines, DC therapy can enhance the immune response of T cells to the desired target in healthy volunteers or patients with infectious diseases or cancer. In one embodiment, APCs are artificial APCs that are genetically modified to express the desired T cell co-stimulatory molecules, human HLA alleles and/or cytokines. This artificial antigen presenting cell (aAPC) can provide sufficient T cell participation, co-stimulation, and the requirement for sustained release of cytokines, thereby allowing controlled expansion of T cells. These cells are not limited by time and limited availability and can be stored in small portions for subsequent generation of T cell lines from different donors, thus representing ready-made reagents for immunotherapy applications. The expression of potent co-stimulatory signals on these aAPCs confers greater efficiency to this system, thereby improving the efficacy of adoptive immunotherapy. In addition, aAPCs can be engineered to express genes that direct the release of specific cytokines to promote the preferential expansion of desired T cell subsets for adoptive transfer; for example, long-lived memory T cells (see review Hasan AH et al., . Artificial Antigen Presenting Cells: An Off the Shelf Approach for Generation of Desirable T-Cell Populations for Broad Application of Adoptive Immunotherapy; Adv Genet Eng. 2015; 4(3): 130, Kim JV, Latouche JB, Rivière I, Sadelain M. The ABCs of artificial antigen presentation. Nat Biotechnol. 2004; 22: 403-410 or Wang C, Sun W, Ye Y, Bomba HN, Gu Z. Bioengineering of Artificial Antigen Presenting Cells and Lymphoid Organs. Theranostics 2017; 7(14): 3504-3516.).
通常,树突状细胞是用本文公开的新抗原肽脉冲的自体树突状细胞。该肽可以是引起适当T细胞反应的任何合适的肽。抗原呈递细胞(或刺激细胞)通常在其表面上具有MHCI类或II类分子,并且在一个实施方案中,其自体基本上不能向MHC I类或II类分子装载所选择的抗原。MHC I类或II类分子可在体外容易地装载选定的抗原。Typically, dendritic cells are autologous dendritic cells pulsed with the new antigenic peptides disclosed herein. The peptide can be any suitable peptide that causes an appropriate T cell response. Antigen presenting cells (or stimulator cells) typically have MHC class I or class II molecules on their surface, and in one embodiment, their own body is substantially unable to load the selected antigen to MHC class I or class II molecules. MHC class I or class II molecules can be easily loaded with selected antigens in vitro.
作为替代方案,抗原呈递细胞可包含编码本文公开的肿瘤新抗原肽的表达构建体。多核苷酸可以是前述定义的任何合适的多核苷酸,并且优选其能够转导树突状细胞,从而导致肽的呈递和免疫的诱导。As an alternative, the antigen presenting cell may comprise an expression construct encoding the tumor neoantigen peptide disclosed herein. The polynucleotide may be any suitable polynucleotide as defined above, and preferably it is capable of transducing dendritic cells, resulting in presentation of the peptide and induction of immunity.
因此,本发明包括这样的APC群体:其可被本文所述的新抗原肽脉冲或装载、经遗传修饰(通过DNA或RNA转移)以表达至少一种本文所述的新抗原肽或包含编码本文所述肿瘤新抗原肽的表达构建体。通常,APC群体被脉冲或装载、修饰以表达或包含至少一种、至少5种、至少10种、至少15种或至少20种不同的新抗原肽或编码其的表达构建体。Thus, the present invention includes such APC populations: which can be pulsed or loaded with the neoantigenic peptides described herein, genetically modified (by DNA or RNA transfer) to express at least one neoantigenic peptide described herein, or contain expression constructs encoding tumor neoantigenic peptides described herein. Typically, the APC population is pulsed or loaded, modified to express or contain at least one, at least 5, at least 10, at least 15 or at least 20 different neoantigenic peptides or expression constructs encoding them.
本公开还包括包含本文公开的APC的组合物。APC可悬浮于任何已知的生理学上相容的药物载体中,例如细胞培养基、生理盐水、磷酸盐缓冲盐水、细胞培养基等,以形成生理上可接受的水性药物组合物。肠胃外介质包括氯化钠溶液、林格氏葡萄糖、葡萄糖和氯化钠、乳酸林格氏。可以根据需要添加其他物质,例如抗微生物剂。如本文所用,“载体”是指适合作为将APC递送至合适的体外或体内作用部位的介质的任何物质。因此,载体可以充当用于配制含有APC的治疗试剂或实验试剂的赋形剂。优选的载体能够以能够与T细胞相互作用的形式维持APC。这种载体的实例包括但不限于水、磷酸盐缓冲盐水、盐水、林格氏溶液、葡萄糖溶液、含血清溶液、Hank’s溶液和其他生理学上平衡的水溶液或细胞培养基。水性载体还可包含接近接受者生理条件所需的合适的辅助物质,例如增强化学稳定性和等张性。合适的辅助物质包括,例如,乙酸钠、氯化钠、乳酸钠、氯化钾、氯化钙、脱水山梨醇单月桂酸酯、三乙醇胺油酸酯,以及用于生产磷酸盐缓冲液、Tris缓冲液和碳酸氢盐缓冲液的其他物质。The present disclosure also includes compositions comprising the APCs disclosed herein. The APCs can be suspended in any known physiologically compatible pharmaceutical carrier, such as cell culture medium, physiological saline, phosphate buffered saline, cell culture medium, etc., to form a physiologically acceptable aqueous pharmaceutical composition. Parenteral media include sodium chloride solution, Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's. Other substances, such as antimicrobial agents, may be added as needed. As used herein, "carrier" refers to any substance suitable as a medium for delivering APCs to a suitable in vitro or in vivo site of action. Therefore, the carrier can serve as an excipient for the preparation of therapeutic agents or experimental agents containing APCs. Preferred carriers are capable of maintaining APCs in a form capable of interacting with T cells. Examples of such carriers include, but are not limited to, water, phosphate buffered saline, saline, Ringer's solution, glucose solution, serum-containing solution, Hank's solution, and other physiologically balanced aqueous solutions or cell culture media. The aqueous carrier may also contain suitable auxiliary substances required to approach the physiological conditions of the recipient, such as enhancing chemical stability and isotonicity. Suitable auxiliary substances include, for example, sodium acetate, sodium chloride, sodium lactate, potassium chloride, calcium chloride, sorbitan monolaurate, triethanolamine oleate, and other substances for the production of phosphate buffers, Tris buffers, and bicarbonate buffers.
疫苗组合物Vaccine composition
本公开进一步包括能够引起特异性T细胞应答的疫苗或免疫原性组合物,包括:The present disclosure further includes vaccines or immunogenic compositions capable of eliciting a specific T cell response, comprising:
-一种或多种如本文定义的新抗原肽,- one or more neoantigenic peptides as defined herein,
-编码如本文定义的新抗原肽的一种或多种多核苷酸;和/或- one or more polynucleotides encoding a neoantigenic peptide as defined herein; and/or
-如上所述的抗原呈递细胞群体(例如自体树突状细胞或人工APC)。- A population of antigen presenting cells as described above (eg autologous dendritic cells or artificial APCs).
优选地,根据本公开,由先前定义的肿瘤特异性融合编码的新抗原肽用于疫苗组合物。所述新抗原肽也可称为肿瘤特异性肽。优选地,根据本公开,还使用编码肿瘤特异性肽的多核苷酸。Preferably, according to the present disclosure, the neoantigenic peptides encoded by the previously defined tumor-specific fusions are used in the vaccine composition. The neoantigenic peptides may also be referred to as tumor-specific peptides. Preferably, according to the present disclosure, polynucleotides encoding tumor-specific peptides are also used.
合适的疫苗或免疫原性组合物将优选包含1至20种新抗原肽,更优选包含2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25种不同的新抗原肽,进一步优选包含6、7、8、9、10、11、12、13或14种不同的新抗原肽,最优选包含12、13或14种新抗原肽。A suitable vaccine or immunogenic composition will preferably comprise 1 to 20 neoantigenic peptides, more preferably 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 different neoantigenic peptides, further preferably 6, 7, 8, 9, 10, 11, 12, 13 or 14 different neoantigenic peptides, and most preferably 12, 13 or 14 neoantigenic peptides.
一种或多种新抗原肽可与载体蛋白连接。当组合物包含两种或多种新抗原肽时,两种或多种(例如2-25)肽可通过如上所述的间隔分子(例如包含2-6个非极性或中性氨基酸的间隔子)线性连接。One or more neoantigenic peptides may be linked to a carrier protein. When the composition comprises two or more neoantigenic peptides, the two or more (e.g., 2-25) peptides may be linearly linked by a spacer molecule as described above (e.g., a spacer comprising 2-6 non-polar or neutral amino acids).
在本公开的一个实施方案中,选择不同的新抗原肽、编码多核苷酸、载体或APC,使得一种疫苗或免疫原性组合物包含能够与不同的MHC分子(例如不同的MHC I类分子)结合的新抗原肽。优选地,该新抗原肽能够与最高频率出现的MHC I类分子结合,例如能够与至少2种优选的、更优选至少3种优选的、甚至更优选至少4种优选的MHC I类分子结合的不同片段。在一些实施方案中,组合物包含能够与一种或多种II类分子结合的肽、编码多核苷酸、载体或APC。MHC任选地为HLA-A、-B、-C、-DP、-DQ或-DR。In one embodiment of the present disclosure, different neoantigenic peptides, encoding polynucleotides, vectors or APCs are selected so that a vaccine or immunogenic composition comprises neoantigenic peptides that can bind to different MHC molecules (e.g., different MHC class I molecules). Preferably, the neoantigenic peptide is capable of binding to the MHC class I molecules that appear with the highest frequency, for example, different fragments that can bind to at least 2 preferred, more preferably at least 3 preferred, and even more preferably at least 4 preferred MHC class I molecules. In some embodiments, the composition comprises peptides, encoding polynucleotides, vectors or APCs that can bind to one or more class II molecules. MHC is optionally HLA-A, -B, -C, -DP, -DQ or -DR.
疫苗或免疫原性组合物能够引起特异性细胞毒性T细胞反应和/或特异性辅助T细胞反应。The vaccine or immunogenic composition is capable of eliciting a specific cytotoxic T cell response and/or a specific helper T cell response.
因此,在特定实施方案中,本公开还涉及如上所述的新抗原肽,其中该新抗原肽具有肿瘤特异性新表位并包含在疫苗或免疫原性组合物中。疫苗组合物应理解为指用于产生预防和/或治疗疾病的免疫的组合物。因此,疫苗是包含或产生抗原的药物,旨在用于人或动物通过疫苗产生特异性抗性和保护性物质。“免疫原性组合物”应理解为指包含或产生抗原并能够引发抗原特异性体液或细胞免疫应答(例如T细胞反应)的组合物。Therefore, in a specific embodiment, the present disclosure also relates to a new antigenic peptide as described above, wherein the new antigenic peptide has a tumor-specific new epitope and is contained in a vaccine or immunogenic composition. Vaccine compositions should be understood to refer to compositions used to produce immunity for preventing and/or treating diseases. Therefore, vaccines are drugs that contain or produce antigens, and are intended for humans or animals to produce specific resistance and protective substances through vaccines. "Immunogenic composition" should be understood to refer to a composition that contains or produces an antigen and is capable of eliciting an antigen-specific humoral or cellular immune response (e.g., a T cell response).
在一个优选实施方案中,根据本公开的新抗原肽的长度为8或9个残基,或13至25个残基。当该肽少于20个残基时,为了使该肽更适合于体内免疫,所述新抗原肽任选地侧接额外的氨基酸,以获得更多氨基酸(通常超过20个)的免疫肽。In a preferred embodiment, the length of the new antigenic peptide according to the present disclosure is 8 or 9 residues, or 13 to 25 residues. When the peptide is less than 20 residues, in order to make the peptide more suitable for in vivo immunization, the new antigenic peptide is optionally flanked by additional amino acids to obtain an immune peptide with more amino acids (usually more than 20).
包含本文所述肽的药物组合物(即疫苗或免疫原性组合物)可施用已患癌症的个体。在治疗应用中,以足以引起对肿瘤抗原的有效CTL反应并治愈或至少部分抑制症状和/或并发症的量向患者施用组合物。足以达到该目的的量被定义为“治疗有效剂量”。该用途的有效量取决于,例如肽组成、施用方式、所治疗疾病的阶段和严重程度、患者的体重和一般健康状况以及处方医生的判断,但初始免疫(即治疗性或预防性施用)的范围通常为针对70kg患者约1.0μg至约50,000μg的肽,随后根据患者的反应和状况,通过测量患者血液中的特异性CTL活性,来提高肽的剂量或按照数周至数月的增强方案将肽的剂量从约1.0μg提高至约10,000μg。必须记住,本发明的肽和组合物通常可用于严重疾病状态,即威胁生命或潜在威胁生命的情况,尤其是当癌症已经转移时。在这种情况下,考虑到外来物质的最小化和肽的相对无毒性质,治疗医生很可能且可能认为需要施用基本上过量的这些肽组合物。Pharmaceutical compositions (i.e., vaccines or immunogenic compositions) comprising the peptides described herein can be administered to individuals who already have cancer. In therapeutic applications, the composition is administered to the patient in an amount sufficient to induce an effective CTL response to the tumor antigen and to cure or at least partially inhibit symptoms and/or complications. An amount sufficient to achieve this purpose is defined as a "therapeutically effective dose". The effective amount for this purpose depends on, for example, the peptide composition, the mode of administration, the stage and severity of the disease being treated, the patient's weight and general health, and the judgment of the prescribing physician, but the initial immunization (i.e., therapeutic or prophylactic administration) is generally in the range of about 1.0 μg to about 50,000 μg of peptide for a 70 kg patient, followed by increasing the dose of the peptide or increasing the dose of the peptide from about 1.0 μg to about 10,000 μg according to a booster regimen of several weeks to several months, depending on the patient's response and condition, by measuring the specific CTL activity in the patient's blood. It must be remembered that the peptides and compositions of the present invention are generally used in serious disease states, i.e., life-threatening or potentially life-threatening situations, especially when the cancer has metastasized. In such cases, it is likely and likely that the treating physician will desirably administer a substantial excess of these peptide compositions in view of the minimization of foreign matter and the relatively nontoxic nature of the peptides.
就治疗用途而言,应在检测或手术切除肿瘤时开始施用。随后提高剂量,直至至少症状基本上减轻,并持续一段时间。For therapeutic use, administration should be initiated upon detection or surgical removal of a tumor, and the dosage should be increased thereafter until at least substantial amelioration of symptoms is achieved and sustained for a period of time.
用于治疗性治疗的疫苗或免疫原性组合物旨在用于肠胃外、局部、经鼻、经口或经局部施用。优选地,药物组合物通过肠胃外施用,例如静脉内、皮下、皮内或肌内施用。该组合物可在手术切除部位施用,以诱导对肿瘤的局部免疫应答。Vaccines or immunogenic compositions for therapeutic treatment are intended for parenteral, topical, nasal, oral or topical administration. Preferably, the pharmaceutical composition is administered parenterally, such as intravenously, subcutaneously, intradermally or intramuscularly. The composition can be administered at the site of surgical resection to induce a local immune response to the tumor.
疫苗或免疫原性组合物可以是药物组合物,其额外包含药学上可接受的佐剂、免疫刺激剂、稳定剂、载体、稀释剂、赋形剂和/或本领域技术人员熟知的任何其他材料。此类材料应当是无毒性的且不应干扰活性成分的功效。载体优选为水性载体,但其载体或其他材料的确切性质将取决于施用途径。可以使用多种水性载体,例如水、缓冲水、0.9%盐水、0.3%甘氨酸、透明质酸等。这些组合物可以通过常规的、众所周知的灭菌技术进行灭菌,或者可以进行无菌过滤。所得水溶液可按原样包装使用,或冻干,冻干制剂在施用前与无菌溶液混合。该组合物可进一步包含接近生理条件所需的药学上可接受的辅助物质,例如pH调节剂和缓冲剂、张力调节剂、湿润剂等,例如乙酸钠、乳酸钠、氯化钠、氯化钾、氯化钙、脱水山梨醇单月桂酸酯、三乙醇胺油酸酯等。例如,参见Butterfield,BMJ.2015 22;350 for adiscussion of cancer vaccines。The vaccine or immunogenic composition can be a pharmaceutical composition, which additionally comprises a pharmaceutically acceptable adjuvant, an immunostimulant, a stabilizer, a carrier, a diluent, an excipient and/or any other material well known to those skilled in the art. Such materials should be non-toxic and should not interfere with the efficacy of the active ingredient. The carrier is preferably an aqueous carrier, but the exact nature of its carrier or other materials will depend on the route of administration. A variety of aqueous carriers can be used, such as water, buffered water, 0.9% saline, 0.3% glycine, hyaluronic acid, etc. These compositions can be sterilized by conventional, well-known sterilization techniques, or can be aseptically filtered. The resulting aqueous solution can be packaged and used as is, or lyophilized, and the lyophilized preparation is mixed with a sterile solution before administration. The composition may further comprise pharmaceutically acceptable auxiliary substances required to approach physiological conditions, such as pH regulators and buffers, tension regulators, wetting agents, etc., such as sodium acetate, sodium lactate, sodium chloride, potassium chloride, calcium chloride, sorbitan monolaurate, triethanolamine oleate, etc. See, e.g., Butterfield, BMJ. 2015 22; 350 for a discussion of cancer vaccines.
增加或扩大宿主对抗原化合物的免疫应答的佐剂实例包括乳化剂、胞壁酰二肽、阿夫立定、水性佐剂如氢氧化铝、基于壳聚糖的佐剂、皂苷、油、两性蛋白、LPS、细菌细胞壁提取物、细菌DNA、CpG序列、合成寡核苷酸、细胞因子及其组合。乳化剂包括,例如,十二酸和油酸的钾、钠和铵盐,脂肪酸的钙、镁和铝盐,有机磺酸盐,例如十二烷基硫酸钠、十六烷基三乙基溴化铵(cetyltrhethylammonlum bromide)、甘油酯、聚氧乙烯乙二醇酯和醚,以及脱水山梨醇脂肪酸酯及其聚氧乙烯、阿拉伯胶、明胶、卵磷脂和/或胆固醇。包含油成分的辅料包括矿物油、植物油或动物油。其他佐剂包括弗氏完全佐剂(FCA)或弗氏不完全佐剂(FIA)。用作额外免疫刺激剂的细胞因子包括干扰素α、白细胞介素-2(IL-2)和粒细胞巨噬细胞集落刺激因子(GM-CSF)或其组合。Examples of adjuvants that increase or expand the host's immune response to antigenic compounds include emulsifiers, muramyl dipeptides, avridine, aqueous adjuvants such as aluminum hydroxide, chitosan-based adjuvants, saponins, oils, amphoteric proteins, LPS, bacterial cell wall extracts, bacterial DNA, CpG sequences, synthetic oligonucleotides, cytokines and combinations thereof. Emulsifiers include, for example, potassium, sodium and ammonium salts of lauric acid and oleic acid, calcium, magnesium and aluminum salts of fatty acids, organic sulfonates, such as sodium lauryl sulfate, cetyltriethylammonium bromide, glycerides, polyoxyethylene glycol esters and ethers, and sorbitan fatty acid esters and polyoxyethylene thereof, gum arabic, gelatin, lecithin and/or cholesterol. The auxiliary material comprising an oil component includes mineral oil, vegetable oil or animal oil. Other adjuvants include Freund's complete adjuvant (FCA) or Freund's incomplete adjuvant (FIA). Cytokines useful as additional immunostimulants include interferon alpha, interleukin-2 (IL-2), and granulocyte macrophage colony stimulating factor (GM-CSF), or a combination thereof.
疫苗或免疫原性制剂中本文所述的肽的浓度可以变化很大,即根据重量从低于约0.1%,通常为或至少约2%至高达20%至50%或更高,并且根据所选择的特定施用方式,主要根据液体体积、粘度等进行选择。The concentration of the peptides described herein in the vaccine or immunogenic formulation can vary widely, i.e., from less than about 0.1%, typically at or at least about 2% to as high as 20% to 50% or more by weight, and is selected primarily based on liquid volume, viscosity, etc., depending on the particular mode of administration chosen.
本文所述的肽也可通过脂质体施用,其将肽靶向特定的细胞组织,例如淋巴组织。脂质体也可用于增加肽的半衰期。脂质体包括乳剂、泡沫、胶束、不溶性单层、液晶、磷脂分散体、层状层等。在这些制剂中,待递送的肽作为脂质体的一部分单独或与结合例如淋巴细胞中普遍存在的受体的分子(例如结合CD45抗原的单克隆抗体)结合,或与其他治疗性或免疫原性组合物结合。因此,填充有本发明所需肽的脂质体可被定向至淋巴细胞部位,在该部位,脂质体随后递送选定的治疗性/免疫原性肽组合物。用于本发明的脂质体由标准的囊泡形成脂质形成,该脂质通常包括中性和带负电荷的磷脂和固醇,例如胆固醇。脂质的选择通常基于对例如脂质体尺寸、酸不稳定性和脂质体在血液中稳定性的考虑。多种方法可用于制备脂质体,如Szoka et al.,Ann.Rev.Biophys.Bioeng.9;467(1980),美国专利号4,235,871、4501728、4,501,728、4,837,028和5,019,369。The peptides described herein can also be administered via liposomes, which target the peptides to specific cell tissues, such as lymphoid tissue. Liposomes can also be used to increase the half-life of the peptides. Liposomes include emulsions, foams, micelles, insoluble monolayers, liquid crystals, phospholipid dispersions, lamellar layers, etc. In these preparations, the peptide to be delivered is combined as part of the liposome alone or with molecules that bind to receptors that are ubiquitous in lymphocytes (e.g., monoclonal antibodies that bind to CD45 antigens), or with other therapeutic or immunogenic compositions. Therefore, liposomes filled with the desired peptides of the present invention can be directed to the lymphocyte site, where the liposomes subsequently deliver the selected therapeutic/immunogenic peptide composition. The liposomes used in the present invention are formed by standard vesicle-forming lipids, which typically include neutral and negatively charged phospholipids and sterols, such as cholesterol. The choice of lipids is typically based on considerations such as liposome size, acid instability, and stability of the liposomes in the blood. Various methods are available for preparing liposomes, such as Szoka et al., Ann. Rev. Biophys. Bioeng. 9; 467 (1980), U.S. Pat. Nos. 4,235,871, 4501728, 4,501,728, 4,837,028 and 5,019,369.
为了靶向免疫细胞,待掺入到脂质体中的配体可包括例如对所需免疫系统细胞的细胞表面决定簇具有特异性的抗体或其片段。可以将含有肽的脂质体悬液以根据施用方式、递送的肽和待治疗的疾病阶段等而变化的剂量通过静脉、经局部、局部等方式施用。To target immune cells, the ligand to be incorporated into the liposomes may include, for example, antibodies or fragments thereof that are specific for cell surface determinants of the desired immune system cells. The liposomal suspension containing the peptide may be administered intravenously, topically, topically, etc., at doses that vary depending on the mode of administration, the peptide being delivered, and the stage of the disease to be treated.
对于固体组合物,可以使用常规的或纳米颗粒无毒的固体载体,包括例如药物级的甘露醇、乳糖、淀粉、硬脂酸镁、糖精钠、滑石、纤维素、葡萄糖、蔗糖、碳酸镁等。对于口服施用,通过掺入任何通常使用的赋形剂(例如前述载体)和通常10-95%的活性成分(即本发明的一种或多种肽),更优选25%-75%的浓度,形成药学上可接受的无毒组合物。For solid compositions, conventional or nanoparticle non-toxic solid carriers can be used, including, for example, pharmaceutical grade mannitol, lactose, starch, magnesium stearate, sodium saccharin, talc, cellulose, glucose, sucrose, magnesium carbonate, etc. For oral administration, a pharmaceutically acceptable non-toxic composition is formed by incorporating any commonly used excipients (e.g., the aforementioned carriers) and generally 10-95% of the active ingredient (i.e., one or more peptides of the present invention), more preferably 25%-75%.
对于气雾剂施用,免疫原性肽优选与表面活性剂和推进剂一起以细分形式提供。肽的通常百分比为0.01%-20%(重量),优选1%-10%。表面活性剂当然必须是无毒的,并且优选可溶于推进剂。这类试剂的代表是含有6-22个碳原子的脂肪酸的酯或偏酯,例如己酸、辛酸、十二酸、软脂酸、硬脂酸、亚油酸、亚麻酸、胆甾醇酯和油酸与脂族多元醇或其环酸酐的酯或偏酯。可以采用混合酯,例如混合或天然甘油酯。表面活性剂可占组合物重量的0.1%-20%,优选0.25%-5%。组合物的余量通常为推进剂。也可以根据需要包含载体,例如用于鼻内递送的卵磷脂。For aerosol administration, the immunogenic peptide is preferably provided in a subdivided form together with a surfactant and a propellant. The usual percentage of the peptide is 0.01%-20% (by weight), preferably 1%-10%. The surfactant must of course be non-toxic and preferably soluble in the propellant. Representatives of this type of agent are esters or partial esters of fatty acids containing 6-22 carbon atoms, such as esters or partial esters of caproic acid, caprylic acid, dodecanoic acid, palmitic acid, stearic acid, linoleic acid, linolenic acid, cholesterol esters and oleic acid with aliphatic polyols or their cyclic anhydrides. Mixed esters, such as mixed or natural glycerides, can be used. The surfactant may account for 0.1%-20% of the weight of the composition, preferably 0.25%-5%. The balance of the composition is usually a propellant. A carrier may also be included as needed, such as lecithin for intranasal delivery.
细胞毒性T细胞(CTL)识别与MHC分子结合的肽形式的抗原,而非完整的外来抗原本身。MHC分子本身位于抗原呈递细胞的细胞表面。因此,只有当存在肽抗原、MHC分子和抗原呈递细胞(APC)的三聚体复合物时,CTL才可能被激活。相应地,如果不仅将该肽用于激活CTL,而且还额外添加了具有各自MHC分子的APC,则可能增强免疫应答。因此,在一些实施方案中,根据本公开的疫苗或免疫原性组合物替代地或额外地包含至少一种抗原呈递细胞,优选APC群体。Cytotoxic T cells (CTLs) recognize antigens in the form of peptides bound to MHC molecules, rather than the complete foreign antigens themselves. The MHC molecules themselves are located on the cell surface of antigen presenting cells. Therefore, CTLs can only be activated when there is a trimeric complex of peptide antigens, MHC molecules and antigen presenting cells (APCs). Accordingly, if not only the peptide is used to activate CTLs, but also APCs with respective MHC molecules are additionally added, the immune response may be enhanced. Therefore, in some embodiments, the vaccine or immunogenic composition according to the present disclosure alternatively or additionally comprises at least one antigen presenting cell, preferably an APC population.
因此,疫苗或免疫原性组合物可以细胞的形式递送,例如抗原呈递细胞,例如树突状细胞疫苗。抗原呈递细胞(例如树突状细胞)可被本文公开的新抗原肽脉冲或装载,可包含编码本文公开的新抗原肽的表达构建体,或可经遗传修饰(通过DNA或RNA转移)以表达一种、两种或多种本文公开的新抗原肽,例如至少2、3、4、5、6、7、8、9或10种新抗原肽。Thus, the vaccine or immunogenic composition can be delivered in the form of cells, such as antigen presenting cells, such as dendritic cell vaccines. Antigen presenting cells (e.g., dendritic cells) can be pulsed or loaded with the neoantigenic peptides disclosed herein, can contain expression constructs encoding the neoantigenic peptides disclosed herein, or can be genetically modified (by DNA or RNA transfer) to express one, two or more neoantigenic peptides disclosed herein, such as at least 2, 3, 4, 5, 6, 7, 8, 9 or 10 neoantigenic peptides.
合适的疫苗或免疫原性组合物也可以是与本文所述新抗原肽相关的DNA或RNA的形式。例如,编码一种或多种新抗原肽或由其衍生的蛋白的DNA或RNA可用作疫苗,例如通过向受试者直接注射。例如,编码至少2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24或25种新抗原肽或由其衍生的蛋白的DNA或RNA。Suitable vaccines or immunogenic compositions can also be in the form of DNA or RNA associated with the neoantigenic peptides described herein. For example, DNA or RNA encoding one or more neoantigenic peptides or proteins derived therefrom can be used as a vaccine, for example, by direct injection into a subject. For example, DNA or RNA encoding at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 or 25 neoantigenic peptides or proteins derived therefrom.
许多方法被方便地用于向患者递送核酸。例如,核酸可以作为“裸DNA”直接递送。例如,在Wolff et al.,Science 247:1465-1468(1990)以及美国专利号5,580,859和5,589,466中描述了该方法。也可以使用例如美国专利号5,204,253中所述的弹道递送来施用核酸。可以施用仅由DNA组成的颗粒。或者,DNA可以附着在颗粒上,例如金颗粒。Many methods are conveniently used to deliver nucleic acids to patients. For example, nucleic acids can be delivered directly as "naked DNA". For example, this method is described in Wolff et al., Science 247: 1465-1468 (1990) and U.S. Patent Nos. 5,580,859 and 5,589,466. Nucleic acids can also be administered using ballistic delivery, such as that described in U.S. Patent No. 5,204,253. Particles consisting only of DNA can be administered. Alternatively, the DNA can be attached to particles, such as gold particles.
核酸也可以与阳离子化合物(例如阳离子脂质)复合递送。例如,在WO9618372、WO9324640、Mannino&Gould-Fogerite,BioTechniques 6(7):682-691(1988)、美国专利号5279833、WO9106309和Felgner et al.,Proc.Natl.Acad.Sci.USA 84:7413-7414(1987)中描述了脂质介导的基因递送方法。Nucleic acid can also be delivered in combination with a cationic compound (e.g., a cationic lipid). For example, lipid-mediated gene delivery methods are described in WO9618372, WO9324640, Mannino & Gould-Fogerite, BioTechniques 6 (7): 682-691 (1988), U.S. Patent No. 5279833, WO9106309, and Felgner et al., Proc. Natl. Acad. Sci. USA 84: 7413-7414 (1987).
递送系统可选性地包括细胞渗透肽、纳米颗粒包封物、病毒样颗粒、脂质体或其任何组合。细胞渗透肽包括TAT肽、单纯疱疹病毒VP22、穿膜肽(transportan)、Antp。脂质体可用作递送系统。也可以使用利斯特氏菌疫苗或电穿孔。Delivery system optionally includes cell permeation peptide, nanoparticle encapsulation, virus-like particle, liposome or any combination thereof. Cell permeation peptide includes TAT peptide, herpes simplex virus VP22, transmembrane peptide (transportan), Antp. Liposome can be used as delivery system. Listeria vaccine or electroporation can also be used.
一种或多种新抗原肽也可通过包含编码一种或多种新抗原肽的DNA或RNA序列的细菌或病毒载体递送。DNA或RNA可作为载体本身或在减毒细菌病毒或减毒活病毒(如牛痘或鸡痘)中递送。该方法涉及使用牛痘病毒作为载体表达编码本发明肽的核苷酸序列。在被引入急性或慢性感染的宿主或非感染的宿主后,重组牛痘病毒表达免疫原性肽,从而引发宿主CTL应答。可用于免疫方案的牛痘载体和方法在例如美国专利号4,722,848中描述。另一种载体是BCG(Bacille Calmette Guerin,BCG)。Stover et al.(Nature 351:456-460(1991)中描述了BCG载体。根据本文的说明书,本领域技术人员将清楚可用于本发明肽的治疗性施用或免疫的多种其他载体,例如鼠伤寒沙门氏菌等。One or more new antigenic peptides can also be delivered by a bacterial or viral vector comprising a DNA or RNA sequence encoding one or more new antigenic peptides. DNA or RNA can be delivered as a vector itself or in an attenuated bacterial virus or attenuated live virus (such as cowpox or fowlpox). This method involves using cowpox virus as a vector to express a nucleotide sequence encoding the peptide of the present invention. After being introduced into an acutely or chronically infected host or a non-infected host, the recombinant cowpox virus expresses an immunogenic peptide, thereby eliciting a host CTL response. Cowpox vectors and methods that can be used for immunization protocols are described, for example, in U.S. Pat. No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin, BCG). Stover et al. (Nature 351: 456-460 (1991) describes a BCG vector. Based on the description herein, a person skilled in the art will be aware of a variety of other vectors that can be used for therapeutic administration or immunization of the peptides of the present invention, such as Salmonella typhimurium, etc.
施用编码本文所述肽的核酸的适当方法包括使用编码多个表位的微型基因构建体。为了产生编码所选CTL表位(微型基因)的DNA序列以在人细胞中表达,表位的氨基酸序列被反向翻译。人密码子使用表用于指导每个氨基酸的密码子选择。这些编码表位的DNA序列直接邻接,形成连续的多肽序列。为了优化表达和/或免疫原性,可以在微型基因设计中纳入额外的元件。可反向翻译并包含在微型基因序列中的氨基酸序列的实例包括:辅助性T淋巴细胞、表位、前导(信号)序列和内质网滞留信号。此外,通过在CTL表位附近包含合成的(例如聚丙氨酸)或天然存在的侧翼序列,可以改善CTL表位在MHC的呈递。Suitable methods for administering nucleic acids encoding peptides described herein include the use of minigene constructs encoding multiple epitopes. In order to generate a DNA sequence encoding a selected CTL epitope (minigene) for expression in human cells, the amino acid sequence of the epitope is reverse translated. The human codon usage table is used to guide the codon selection for each amino acid. These DNA sequences encoding epitopes are directly adjacent to form a continuous polypeptide sequence. In order to optimize expression and/or immunogenicity, additional elements can be incorporated into the minigene design. Examples of amino acid sequences that can be reverse translated and included in the minigene sequence include: helper T lymphocytes, epitopes, leader (signal) sequences, and endoplasmic reticulum retention signals. In addition, by including synthetic (e.g., polyalanine) or naturally occurring flanking sequences near the CTL epitope, the presentation of the CTL epitope on the MHC can be improved.
微型基因序列通过组装编码微型基因正负链的寡核苷酸转化为DNA。使用众所周知的技术在适当的条件下合成、磷酸化、纯化和退火重叠的寡核苷酸(30-100个碱基长)。寡核苷酸的末端使用T4DNA连接酶连接。然后,编码CTL表位多肽的该合成微型基因可被克隆到所需的表达载体中。The minigene sequence is converted to DNA by assembling oligonucleotides encoding the positive and negative strands of the minigene. Overlapping oligonucleotides (30-100 bases long) are synthesized, phosphorylated, purified, and annealed under appropriate conditions using well-known techniques. The ends of the oligonucleotides are ligated using T4 DNA ligase. This synthetic minigene encoding the CTL epitope polypeptide can then be cloned into the desired expression vector.
载体中包含本领域技术人员熟知的标准调控序列,以确保在靶细胞中的表达。因此,编码一种或多种新抗原肽的DNA或RNA通常可操作地连接至以下一项或多项:The vector contains standard regulatory sequences well known to those skilled in the art to ensure expression in target cells. Therefore, the DNA or RNA encoding one or more new antigenic peptides is usually operably linked to one or more of the following:
-可用于驱动核酸分子表达的启动子。AAV ITR可以作为启动子,并且有利于消除对额外启动子元件的需求。对于普遍存在的表达,可使用以下启动子:CMV(特别是人巨细胞病毒立即早期启动子(hCMV-IE))、CAG、CBh、PGK、SV40、RSV、铁蛋白重链或轻链等。对于脑表达,可以使用以下启动子:Synapsinl用于所有神经元,CaMKIIalpha用于兴奋性神经元,GAD67或GAD65或VGAT用于γ-氨基丁酸能神经元等。用于驱动RNA合成的启动子可包括:PolIII启动子,如U6或HI。Pol II启动子和内含子盒的使用可用于表达向导RNA(gRNA)。通常,该启动子包括用于微型基因插入的下游克隆位点。关于合适的启动子序列的例子,特别参见美国专利号5,580,859和5,589,466。-Promoters that can be used to drive expression of nucleic acid molecules. AAV ITR can be used as a promoter and is conducive to eliminating the need for additional promoter elements. For ubiquitous expression, the following promoters can be used: CMV (particularly human cytomegalovirus immediate early promoter (hCMV-IE)), CAG, CBh, PGK, SV40, RSV, ferritin heavy chain or light chain, etc. For brain expression, the following promoters can be used: Synapsinl is used for all neurons, CaMKIIalpha is used for excitatory neurons, GAD67 or GAD65 or VGAT is used for γ-aminobutyric acid neurons, etc. Promoters for driving RNA synthesis may include: PolIII promoters, such as U6 or HI. The use of Pol II promoters and intron boxes can be used to express guide RNA (gRNA). Typically, the promoter includes a downstream cloning site for minigene insertion. For examples of suitable promoter sequences, see in particular U.S. Patent Nos. 5,580,859 and 5,589,466.
-转录反式激活因子或其他增强子元件也可以提高转录活性,例如人T细胞白血病病毒1型(HTLV-1)的5’长末端重复序列(LTR)的调节性R区(当其与CMV启动子结合时已被证明可诱导更高的细胞免疫应答)。- Transcriptional transactivators or other enhancer elements may also increase transcriptional activity, such as the regulatory R region of the 5′ long terminal repeat (LTR) of human T-cell leukemia virus type 1 (HTLV-1), which has been shown to induce higher cellular immune responses when bound to the CMV promoter.
-翻译优化序列,例如位于mRNA内AUG起始密码子(ACCAUGG)旁侧的Kozak序列,以及密码子优化。- Translation optimization sequences, such as the Kozak sequence flanking the AUG start codon (ACCAUGG) within the mRNA, and codon optimization.
可能需要额外的载体修饰以优化微型基因表达和免疫原性。在某些情况下,高效基因表达需要内含子,并且一个或多个合成的或天然存在的内含子可以被整合到微型基因的转录区域。还可以考虑加入mRNA稳定化序列以增加微型基因的表达。最近提出免疫刺激序列(ISS或CpG)在DNA疫苗的免疫原性中发挥作用。如果发现这些序列增强免疫原性,则可以将其包含在载体中,在微型基因编码序列之外。Additional vector modifications may be required to optimize minigene expression and immunogenicity. In some cases, introns are required for efficient gene expression, and one or more synthetic or naturally occurring introns can be integrated into the transcribed region of the minigene. The addition of mRNA stabilizing sequences to increase minigene expression can also be considered. Immunostimulatory sequences (ISS or CpG) have recently been proposed to play a role in the immunogenicity of DNA vaccines. If these sequences are found to enhance immunogenicity, they can be included in the vector, outside the minigene coding sequence.
在一些实施方案中,可以使用双顺反子表达载体,以允许产生微型基因编码的表位和增强或降低免疫原性的第二蛋白。In some embodiments, a bicistronic expression vector can be used to allow production of both the minigene-encoded epitope and a second protein that enhances or reduces immunogenicity.
本文所述的DNA疫苗或免疫原性组合物可通过共递送促进细胞介导的免疫应答的细胞因子(例如IL-2、IL-12、IL-18、GM-CSF和IFNγ)而得到增强。CXC趋化因子(如IL-8)和CC趋化因子(如巨噬细胞炎性蛋白(MIP)-1α、MIP-3α、MIP-3β和RANTES)可增加免疫应答的效力。还可以通过共递送质粒编码的细胞因子诱导分子(例如LeIF)、共刺激分子和粘附分子(例如B7-1(CD80)和/或B7-2(CD86))来增强DNA疫苗的免疫原性。辅助(HTL)表位可与细胞内靶向信号连接,并与CTL表位分开表达。这将允许HTL表位定向至不同于CTL表位的细胞室。如果需要,这可以促进HTL表位更有效地进入MHC II类途径,从而改善CTL诱导。与CTL诱导相反,通过免疫抑制分子(例如,TGF-β)的共表达降低免疫应答在某些疾病中可能是有益的。DNA vaccines or immunogenic compositions described herein can be enhanced by co-delivery of cytokines (e.g., IL-2, IL-12, IL-18, GM-CSF, and IFNγ) that promote cell-mediated immune responses. CXC chemokines (e.g., IL-8) and CC chemokines (e.g., macrophage inflammatory protein (MIP) -1α, MIP-3α, MIP-3β, and RANTES) can increase the effectiveness of immune responses. The immunogenicity of DNA vaccines can also be enhanced by co-delivery of plasmid-encoded cytokine induction molecules (e.g., LeIF), costimulatory molecules, and adhesion molecules (e.g., B7-1 (CD80) and/or B7-2 (CD86)). Auxiliary (HTL) epitopes can be connected to intracellular targeting signals and expressed separately from CTL epitopes. This will allow HTL epitopes to be directed to cell compartments different from CTL epitopes. If desired, this can promote HTL epitopes to enter MHC class II pathways more effectively, thereby improving CTL induction. In contrast to CTL induction, reduction of immune responses by co-expression of immunosuppressive molecules (eg, TGF-β) may be beneficial in certain diseases.
一旦选择了表达载体,微型基因就被克隆到启动子下游的多接头区域。将该质粒转化至合适的大肠杆菌菌株,并使用标准技术制备DNA。微型基因的方向和DNA序列以及载体中包含的所有其他元件均通过限制性图谱和DNA序列分析进行确认。含有正确质粒的细菌细胞可作为主细胞库和工作细胞库储存。Once the expression vector is selected, the minigene is cloned into the polylinker region downstream of the promoter. The plasmid is transformed into an appropriate strain of E. coli and DNA is prepared using standard techniques. The orientation and DNA sequence of the minigene and all other elements contained in the vector are confirmed by restriction mapping and DNA sequence analysis. Bacterial cells containing the correct plasmid can be stored as master cell banks and working cell banks.
可以使用多种制剂制备用于注射的纯化的质粒DNA。其中最简单的是将冻干的DNA在无菌磷酸盐缓冲液(PBS)中重建。已经描述了多种方法,并且可能会出现新技术。如上所述,核酸可以方便地与阳离子脂质一起配制。另外、糖脂、融合脂质体、肽和化合物统称为保护性、相互作用、非缩合(PINC),也可与纯化的质粒DNA复合,以影响稳定性、肌内分散或向特定器官或细胞类型的转运等变量。Purified plasmid DNA for injection can be prepared using a variety of formulations. The simplest of these is to reconstitute the lyophilized DNA in sterile phosphate buffered saline (PBS). A variety of methods have been described, and new technologies may emerge. As described above, nucleic acids can be conveniently formulated with cationic lipids. In addition, glycolipids, fusogenic liposomes, peptides and compounds, collectively referred to as protective, interactive, non-condensing (PINC), can also be complexed with purified plasmid DNA to affect variables such as stability, intramuscular dispersion or transport to specific organs or cell types.
包含肽的疫苗或免疫原性组合物可与包含编码肽的多核苷酸的疫苗或免疫原性组合物组合施用。例如,肽疫苗和DNA疫苗的施用可以在初始免疫-加强方案中交替进行。例如,预期用肽免疫原性组合物引发初始免疫并用DNA免疫原性组合物加强,如用DNA免疫原性组合物引发初始免疫和用肽免疫原性组合物加强一样。Vaccines or immunogenic compositions comprising peptides can be administered in combination with vaccines or immunogenic compositions comprising polynucleotides encoding the peptides. For example, administration of peptide vaccines and DNA vaccines can be alternating in a prime-boost regimen. For example, it is contemplated that priming with a peptide immunogenic composition and boosting with a DNA immunogenic composition is as likely as priming with a DNA immunogenic composition and boosting with a peptide immunogenic composition.
本公开还包括生产疫苗组合物的方法,包括以下步骤:The present disclosure also includes a method of producing a vaccine composition comprising the steps of:
a)任选地,根据前述方法鉴定至少一种新抗原肽;a) optionally, identifying at least one neoantigenic peptide according to the aforementioned method;
b)产生所述至少一种新抗原肽、至少一种编码新抗原肽的多肽或至少一种包含本文所述的多肽的载体;和b) producing the at least one neo-antigenic peptide, at least one polypeptide encoding the neo-antigenic peptide, or at least one vector comprising a polypeptide as described herein; and
c)任选添加生理上可接受的缓冲液、赋形剂和/或佐剂,并使用所述至少一种新抗原肽、多肽或载体生产疫苗。c) optionally adding a physiologically acceptable buffer, excipient and/or adjuvant and producing a vaccine using the at least one neoantigenic peptide, polypeptide or vector.
本公开的另一方面是生产DC疫苗的方法,其中所述DC存在至少一种本公开的新抗原肽。Another aspect of the present disclosure is a method of producing a DC vaccine, wherein the DCs are presented with at least one neoantigenic peptide of the present disclosure.
抗体TCR、CAR及其衍生物Antibody TCR, CAR and their derivatives
本公开还涉及一种特异性结合如本文公开的嵌合多肽(或蛋白)(最优选地SEQ IDNO:1-8202的嵌合蛋白)或通常与MHC或HLA分子结合的新抗原肽的抗体或其抗原结合片段。The present disclosure also relates to an antibody or antigen-binding fragment thereof that specifically binds to a chimeric polypeptide (or protein) as disclosed herein (most preferably a chimeric protein of SEQ ID NO: 1-8202) or a neoantigenic peptide that is normally bound to an MHC or HLA molecule.
在一些实施方案中,所述抗体或其抗原结合片段包含或由如前定义的抗原结合结构域(其结合嵌合蛋白)组成。In some embodiments, the antibody or antigen-binding fragment thereof comprises or consists of an antigen-binding domain (which binds to a chimeric protein) as defined above.
通常,所述抗体或其抗原结合片段结合新抗原肽,其通常以约2×10-7M或更小的解离常数(Kd)与如前定义的MHC或HLA分子或(跨膜)嵌合蛋白结合。在某些实施方案中,Kd为约2×10-7M或更小,约1×10-7M或更小,约9×10-8M或更小,约9×10-9M或更小,约5×10-9M或更小,约4×10-9M或更小,约3×10-9M或更小,约2×10-9M或更小,或约1×10-9M或更小。在某些非限制性实施方案中,Kd为约1×10-9M至约3×10-7M。在某些非限制性实施方案中,Kd为约1.5×10-9M至约3×10-7M。在某些非限制性实施方案中,Kd为约1.5×10-9M至约2.7×10-7M。Typically, the antibody or antigen-binding fragment thereof binds to a neoantigenic peptide, which typically binds to an MHC or HLA molecule or (transmembrane) chimeric protein as defined above with a dissociation constant ( Kd ) of about 2× 10-7 M or less. In certain embodiments, the Kd is about 2× 10-7 M or less, about 1× 10-7 M or less, about 9× 10-8 M or less, about 9× 10-9 M or less, about 5× 10-9 M or less, about 4× 10-9 M or less, about 3× 10-9 M or less, about 2× 10-9 M or less, or about 1× 10-9 M or less. In certain non-limiting embodiments, the Kd is about 1× 10-9 M to about 3× 10-7 M. In certain non-limiting embodiments, the Kd is about 1.5× 10-9 M to about 3× 10-7 M. In certain non-limiting embodiments, Kd is from about 1.5 x 10 "9 M to about 2.7 x 10 "7 M.
为了促进淋巴细胞T(LT)对肿瘤细胞的渗透和识别,另一种策略在于使用能够同时识别一个以上抗原靶标,尤其是同时识别两个抗原靶标的抗体。双特异性抗体有多种形式。BiTE(双特异性T细胞衔接子)是首先被开发的。这些融合蛋白由两种抗体的两个scFv(重链可变域VH和轻链可变域VL)组成,两种抗体通过结合肽连接:一种识别LT标记物(CD3+),另一种识别肿瘤抗原。目的是促进与肿瘤接触的LT的募集和激活,从而导致肿瘤细胞裂解(参见综述Patrick A.Baeuerle and Carsten Reinhardt;Bispecific T-CellEngaging Antibodies for Cancer Therapy、Cancer Res 2009;69:(12).June 15,2009和Galaine et al.,Innovations&Thérapeutiques en Oncologie,vol.3-n°3-7,2017)。In order to promote the penetration and recognition of tumor cells by lymphocytes T (LT), another strategy is to use antibodies that can recognize more than one antigen target at the same time, especially two antigen targets at the same time. Bispecific antibodies come in many forms. BiTE (bispecific T cell engager) was the first to be developed. These fusion proteins consist of two scFvs (heavy chain variable domain VH and light chain variable domain VL) of two antibodies, which are connected by a binding peptide: one recognizes the LT marker (CD3+) and the other recognizes the tumor antigen. The purpose is to promote the recruitment and activation of LTs in contact with the tumor, thereby leading to tumor cell lysis (see review Patrick A. Baeuerle and Carsten Reinhardt; Bispecific T-Cell Engaging Antibodies for Cancer Therapy, Cancer Res 2009; 69: (12). June 15, 2009 and Galaine et al., Innovations & Therapeutiques en Oncologie, vol. 3-n°3-7, 2017).
因此,在特定实施方案中,所述抗体是靶向如本文定义的嵌合蛋白的双特异性T细胞衔接子,并且特别包含如前定义的抗原结合结构域。Thus, in a particular embodiment, the antibody is a bispecific T cell engager targeting a chimeric protein as defined herein, and in particular comprises an antigen binding domain as defined previously.
术语“抗体”在本文中以最广泛的含义使用,包括多克隆和单克隆抗体,包括完整的抗体和功能性(抗原结合)抗体片段,包括片段抗原结合(Fab)片段、F(ab’)2片段、Fab’片段、Fv片段、重组IgG片段、能够特异性结合抗原的可变重链(VH)区域、单链抗体片段,包括单链可变片段(scFv)和单域抗体(例如,VHH抗体、sdAb、sdFv、纳米抗体)片段。该术语包括经基因工程化和/或其他变体修饰形式的免疫球蛋白,例如细胞内抗体、肽抗体、嵌合抗体、全人抗体、人源化抗体和异缀合物抗体、多特异性(例如双特异性)抗体、抗体、双抗体、三抗体和四抗体、串联双scFv、串联三scFv。除非另有说明,术语“抗体”应理解为包括功能性抗体及其片段。该术语还包括完整的或全长的抗体,包括任何类别或亚类的抗体,包括IgG及其亚类、IgG1、IgG2、IgG3、IgG4、IgM、IgE、IgA和IgD。在一些实施方案中,抗体包括轻链可变区和重链可变区,例如以scFv的形式。The term "antibody" is used in the broadest sense herein, including polyclonal and monoclonal antibodies, including complete antibodies and functional (antigen-binding) antibody fragments, including fragment antigen-binding (Fab) fragments, F(ab')2 fragments, Fab' fragments, Fv fragments, recombinant IgG fragments, variable heavy chain (VH) regions capable of specific binding to antigens, single-chain antibody fragments, including single-chain variable fragments (scFv) and single-domain antibodies (e.g., VHH antibodies, sdAbs, sdFvs, nanobodies) fragments. The term includes immunoglobulins in genetically engineered and/or other variant-modified forms, such as intrabodies, peptide antibodies, chimeric antibodies, fully human antibodies, humanized antibodies and heteroconjugate antibodies, multispecific (e.g., bispecific) antibodies, antibodies, double antibodies, three antibodies and four antibodies, tandem double scFvs, tandem three scFvs. Unless otherwise indicated, the term "antibody" should be understood to include functional antibodies and fragments thereof. The term also includes intact or full-length antibodies, including antibodies of any class or subclass, including IgG and its subclasses, IgG1, IgG2, IgG3, IgG4, IgM, IgE, IgA and IgD. In some embodiments, the antibody includes a light chain variable region and a heavy chain variable region, for example in the form of an scFv.
抗体包括在天然氨基酸序列中具有一个或多个氨基酸取代、插入或缺失的变体多肽种型,前提是抗体保留或基本保留其特异性结合功能。氨基酸的保守取代是众所周知的,并且如上所述。Antibodies include variant polypeptide forms having one or more amino acid substitutions, insertions or deletions in the native amino acid sequence, provided that the antibody retains or substantially retains its specific binding function. Conservative substitutions of amino acids are well known and described above.
本公开进一步包括产生抗体或其抗原结合片段的方法,其包括选择结合本文定义的肿瘤新抗原肽的抗体的步骤,所述抗体通常与如本文定义的MHC或HLA分子或嵌合蛋白结合,其解离常数(Kd)为约2×10-7M或更小。在某些实施例中,Kd为约2×10-7M或更小、约1×10-7M或更小,约9×10-8M或更小,约1×10-8M或更小,约9×10-9M或更小,约5×10-9M或更小,约4×10-9M或更小,约3×10-9M或更小,约2×10-9M或更小,或约1×10-9M或更小,或约1×10-10M或更小,或约1×10-12M或更小。The present disclosure further includes a method of producing an antibody or antigen-binding fragment thereof, comprising the step of selecting an antibody that binds to a tumor neoantigenic peptide as defined herein, said antibody typically binding to an MHC or HLA molecule or chimeric protein as defined herein with a dissociation constant ( Kd ) of about 2× 10-7 M or less. In certain embodiments, the Kd is about 2× 10-7 M or less, about 1× 10-7 M or less, about 9× 10-8 M or less, about 1× 10-8 M or less, about 9× 10-9 M or less, about 5× 10-9 M or less, about 4× 10-9 M or less, about 3× 10-9 M or less, about 2× 10-9 M or less, or about 1× 10-9 M or less, or about 1× 10-10 M or less, or about 1× 10-12 M or less.
在某些实施方案中,抗体来源于鼠、人或骆驼科动物(例如美洲驼)。In certain embodiments, the antibodies are of murine, human, or camelid (eg, llama) origin.
在一些实施方案中,抗体选自人抗体序列库。在一些实施方案中,通过用如前定义的SEQ ID NO:1-8202中任何一个嵌合蛋白或其部分(特别是胞外部分)免疫动物,随后进行选择步骤,来产生抗体。In some embodiments, the antibody is selected from a library of human antibody sequences. In some embodiments, the antibody is produced by immunizing an animal with a chimeric protein of any one of SEQ ID NOs: 1-8202 as defined above, or a portion thereof (particularly an extracellular portion), followed by a selection step.
包括嵌合抗体、人源化抗体或人抗体的抗体可如上所述进一步亲和力成熟和选择。人源化抗体包含啮齿动物序列衍生的CDR区域;通常地,啮齿类动物CDR被植入人框架,并且一些人框架残基可被反向突变为原始的啮齿类动物框架残基以保持亲和力,和/或一个或几个CDR残基可被突变而提高亲和力。全人抗体不含鼠序列,通常通过人抗体库的噬菌体展示技术或其天然免疫球蛋白基因座已被人免疫球蛋白基因座片段取代的转基因小鼠的免疫而产生。Antibodies including chimeric antibodies, humanized antibodies or human antibodies may be further affinity matured and selected as described above. Humanized antibodies contain CDR regions derived from rodent sequences; generally, rodent CDRs are implanted into human frameworks, and some human framework residues may be reverse mutated to original rodent framework residues to maintain affinity, and/or one or more CDR residues may be mutated to increase affinity. Fully human antibodies do not contain mouse sequences and are usually produced by phage display technology of human antibody libraries or immunization of transgenic mice whose natural immunoglobulin loci have been replaced by fragments of human immunoglobulin loci.
通过所述方法产生的抗体以及表达该抗体或其片段的免疫细胞也包含在本公开中。Antibodies produced by the method and immune cells expressing the antibodies or fragments thereof are also included in the present disclosure.
本公开还包括药物组合物,其包含一种或多种单独或与至少一种其他试剂(例如稳定化合物)组合的本文所公开的抗体,该药物组合物可在任何无菌、生物相容的药物载体中施用,并任选地与无菌的药学上可接受的缓冲液、稀释剂和/或赋形剂一起配制。药学上可接受的载体通常为增强或稳定组合物,和/或可用于促进组合物的制备。药学上可接受的载体包括在生理上相容并且在一些实施方案中是药学上惰性的溶剂、分散介质、涂层、抗细菌剂和抗真菌剂、等渗剂和吸收延迟剂等。The present disclosure also includes pharmaceutical compositions comprising one or more antibodies disclosed herein alone or in combination with at least one other agent (e.g., a stabilizing compound), which can be administered in any sterile, biocompatible pharmaceutical carrier, and optionally formulated with sterile pharmaceutically acceptable buffers, diluents, and/or excipients. Pharmaceutically acceptable carriers are typically enhanced or stabilized compositions, and/or can be used to promote the preparation of compositions. Pharmaceutically acceptable carriers include solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic agents, and absorption delay agents that are physiologically compatible and, in some embodiments, pharmaceutically inert.
包含本文公开的抗体的药物组合物的施用可以通过口服或肠胃外来完成。胃肠外的方法包括局部、动脉内(直接至肿瘤)、肌内、脊柱、皮下、髓内、鞘内、心室内、静脉内、腹膜内或鼻内施用。Administration of the pharmaceutical compositions comprising the antibodies disclosed herein can be accomplished orally or parenterally. Parenteral methods include topical, intra-arterial (directly to the tumor), intramuscular, spinal, subcutaneous, intramedullary, intrathecal, intraventricular, intravenous, intraperitoneal, or intranasal administration.
因此,除活性成分以外,这些药物组合物可包含合适的药学上可接受的载体,该载体包含有助于将活性化合物加工成可药用制剂的赋形剂和辅料。有关制剂和施用的技术的更多细节可在雷明顿最新版的《药物科学》(Ed.Maack Publishing Co,Easton,Pa.)中找到。Thus, in addition to the active ingredient, these pharmaceutical compositions may contain a suitable pharmaceutically acceptable carrier comprising excipients and adjuvants which facilitate processing of the active compound into pharmaceutically acceptable preparations. Further details of techniques for formulation and administration may be found in the latest edition of Remington's Pharmaceutical Sciences (Ed. Maack Publishing Co, Easton, Pa.).
根据施用途径,活性化合物(即抗体、双特异性和多特异性分子)可被包衣在材料中,以保护化合物免受酸和其他可能使化合物失去活性的自然条件的影响。Depending on the route of administration, the active compounds (ie, antibodies, bispecific and multispecific molecules) can be coated in a material to protect the compound from acids and other natural conditions that might render the compound inactive.
该组合物通常是无菌的,优选为流体。适当的流动性可以通过例如使用包衣(例如卵磷脂)、在分散体的情况下保持所需的颗粒尺寸以及使用表面活性剂来保持。在许多情况下,优选在组合物中包含等渗剂,例如糖、多元醇(例如甘露醇或山梨醇)和氯化钠。可注射组合物的长期吸收可通过在组合物中包括延迟吸收的试剂来实现,例如单硬脂酸铝或明胶。Said composition is normally sterile, preferably fluid.Suitable fluidity can be maintained by, for example, using a coating (e.g., lecithin), maintaining the required particle size in the case of a dispersion, and using a surfactant. In many cases, it is preferred to include an isotonic agent in the composition, such as sugar, a polyol (e.g., mannitol or sorbitol) and sodium chloride. The long-term absorption of the injectable composition can be achieved by including an agent that delays absorption in the composition, such as aluminum monostearate or gelatin.
用于口服施用的药物组合物可使用本领域熟知的药学上可接受的载体以适合于口服施用的剂量进行配制。这种载体使得药物组合物能够被制成片剂、丸剂、糖衣丸、胶囊、液体、凝胶、糖浆剂、浆液、混悬剂等,供患者食用。Pharmaceutical compositions for oral administration can be formulated with pharmaceutically acceptable carriers well known in the art in dosages suitable for oral administration. Such carriers enable the pharmaceutical compositions to be formulated into tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, etc. for consumption by patients.
口服用药物制剂可通过将活性化合物与固体赋形剂组合,任选地研磨所得混合物,并在添加合适的辅料(如需要)后加工颗粒混合物以获得片剂或糖衣丸芯而获得。合适的赋形剂为碳水化合物或蛋白质填充剂,例如糖,包括乳糖、蔗糖、甘露醇或山梨醇;来自玉米、小麦、大米、土豆或其他植物的淀粉;纤维素,例如甲基、纤维素、羟丙甲基纤维素或羧甲基纤维素钠;和树胶,包括阿拉伯树胶和黄蓍胶;和蛋白质,例如明胶和胶原。如果需要,可以添加崩解剂或增溶剂,例如交联聚乙烯吡咯烷酮、琼脂、海藻酸或其盐,例如藻酸钠。Pharmaceutical preparations for oral administration can be obtained by combining the active compound with a solid excipient, optionally grinding the resulting mixture, and processing the granular mixture after adding suitable excipients (if necessary) to obtain tablets or dragee cores. Suitable excipients are carbohydrate or protein fillers, such as sugars, including lactose, sucrose, mannitol or sorbitol; starch from corn, wheat, rice, potato or other plants; cellulose, such as methyl, cellulose, hydroxypropylmethylcellulose or sodium carboxymethylcellulose; and gums, including gum arabic and tragacanth; and proteins, such as gelatin and collagen. If necessary, disintegrants or solubilizers, such as cross-linked polyvinyl pyrrolidone, agar, alginic acid or its salts, such as sodium alginate, can be added.
糖衣丸芯提供有合适的包衣,例如浓缩糖溶液,其也可包含阿拉伯胶、滑石、聚乙烯吡咯烷酮、卡波姆凝胶、聚乙二醇和/或二氧化钛、漆溶液和合适的有机溶剂或溶剂混合物。染料或颜料可添加到片剂或糖衣包衣中,用于产品鉴定或表征活性化合物的数量,即剂量。Dragee cores are provided with suitable coatings, for example concentrated sugar solutions, which may also contain gum arabic, talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol and/or titanium dioxide, lacquer solutions and suitable organic solvents or solvent mixtures. Dyes or pigments may be added to the tablets or dragee coatings for product identification or to characterize the amount of active compound, i.e., the dosage.
可口服使用的药物制剂包括明胶制成的推入-配合型胶囊,以及明胶制成的软密封胶囊和包衣,如甘油或山梨醇。推入-配合型胶囊可包含与填充剂或粘合剂(例如乳糖或淀粉)、润滑剂(例如滑石或硬脂酸镁)混合的活性成分,以及可选的稳定剂。在软胶囊中,活性化合物可以溶解或悬浮在合适的液体中,例如脂肪油、液体石蜡或液体聚乙二醇(有或无稳定剂)。Pharmaceutical preparations that can be used orally include push-fit capsules made of gelatin, as well as soft, sealed capsules made of gelatin and coatings, such as glycerol or sorbitol. The push-fit capsules may contain the active ingredients mixed with fillers or binders, such as lactose or starches, lubricants, such as talc or magnesium stearate, and optionally stabilizers. In soft capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols, with or without stabilizers.
用于肠胃外施用的药物制剂包括活性化合物的水溶液。对于注射而言,本发明的药物组合物可在水溶液中配制,优选在生理学上相容的缓冲液中,例如Hank’s溶液、林格氏溶液或生理缓冲盐水。水性注射混悬剂可含有增加混悬剂粘度的物质,例如羧甲基纤维素钠、山梨醇或葡聚糖。此外,活性化合物的混悬剂可以制备成适当的油状注射混悬剂。合适的亲脂性溶剂或载体包括脂肪油(如芝麻油)或合成脂肪酸酯(如油酸乙酯或甘油三酯)或脂质体。任选地,该混悬剂还可包含合适的稳定剂或可增加化合物的溶解度以制备高度浓缩的溶液的试剂。Pharmaceutical preparations for parenteral administration include aqueous solutions of the active compound. For injection, the pharmaceutical composition of the present invention can be prepared in an aqueous solution, preferably in a physiologically compatible buffer, such as Hank's solution, Ringer's solution or physiologically buffered saline. Aqueous injection suspensions may contain substances that increase the viscosity of the suspension, such as sodium carboxymethylcellulose, sorbitol or dextran. In addition, the suspension of the active compound can be prepared into a suitable oily injection suspension. Suitable lipophilic solvents or carriers include fatty oils (such as sesame oil) or synthetic fatty acid esters (such as ethyl oleate or triglycerides) or liposomes. Optionally, the suspension may also include a suitable stabilizer or an agent that can increase the solubility of the compound to prepare a highly concentrated solution.
对于局部或经鼻施用,制剂中使用了适合于待渗透的特定屏障的渗透剂。这种渗透剂在本领域中是众所周知的For topical or nasal administration, penetrants appropriate to the particular barrier to be permeated are used in the formulation. Such penetrants are well known in the art.
本公开的药物组合物可根据本领域众所周知和常规实施的方法制备。例如,Remington:The Science and Practice of Pharmacy,Mack Publishing Co.,20th ed.,2000;and Sustained and Controlled Release Drug Delivery Systems,J R.Robinson,ed.,Marcel Dekker,Inc.,New York,1978。药物组合物优选在GMP条件下制备。The pharmaceutical compositions of the present disclosure can be prepared according to methods well known and routinely practiced in the art. For example, Remington: The Science and Practice of Pharmacy, Mack Publishing Co., 20th ed., 2000; and Sustained and Controlled Release Drug Delivery Systems, J R. Robinson, ed., Marcel Dekker, Inc., New York, 1978. The pharmaceutical compositions are preferably prepared under GMP conditions.
本公开内容还包括靶向本文定义的与MHC或HLA分子结合的新抗原肽的重组T细胞受体(TCR)。The present disclosure also encompasses recombinant T cell receptors (TCRs) targeting neoantigenic peptides as defined herein bound to MHC or HLA molecules.
本公开进一步包括产生TCR或其抗原结合片段的方法,该方法包括选择与如本文定义的肿瘤新抗原肽结合的TCR的步骤,该肿瘤新抗原肽任选地与MHC或HLA分子结合,解离常数(Kd)为约2×10-7M或更小。在某些实施方案中,Kd为约2×10-7M或更小,约1×10-7M或更小,约9×10-8M或更小,约1×10-8M或更小,约9×10-9M或更小,约5×10-9M或更小,约4×10- 9M或更小,约3×10-9M或更小,约2×10-9M或更小,或约1×10-9M或更小,或约1×10-10M或更小,或约1×10-12M或更小。The present disclosure further includes a method of producing a TCR or an antigen-binding fragment thereof, the method comprising the step of selecting a TCR that binds to a tumor neoantigenic peptide as defined herein, optionally bound to an MHC or HLA molecule, with a dissociation constant ( Kd ) of about 2× 10-7 M or less. In certain embodiments, the Kd is about 2× 10-7 M or less, about 1× 10-7 M or less, about 9× 10-8 M or less , about 1× 10-8 M or less, about 9× 10-9 M or less, about 5× 10-9 M or less, about 4× 10-9 M or less, about 3× 10-9 M or less, about 2× 10-9 M or less, or about 1× 10-9 M or less, or about 1× 10-10 M or less, or about 1× 10-12 M or less.
编码TCR的核酸可从多种来源获得,例如通过聚合酶链式反应(PCR)扩增天然存在的TCR DNA序列,随后表达抗体可变区,随后进行上述的选择步骤。在一些实施方案中,TCR获自从患者分离的T细胞或培养的T细胞杂交瘤。在一些实施方案中,靶抗原的TCR克隆已在用人免疫系统基因(例如HLA系统或HLA)工程化的转基因小鼠中产生。参见,例如,肿瘤抗原(参见,例如,Parkhurst et al.(2009)Clin Cancer Res.15:169-180 and Cohen et al.(2005)J Immunol.175:5799-5808。在一些实施方案中,噬菌体展示用于分离针对靶抗原的TCR(参见,例如,Varela-Rohena et al.(2008)Nat Med.14:1390-1395 and Li(2005)NatBiotechnol.23:349-354)。Nucleic acids encoding TCRs can be obtained from a variety of sources, such as by amplifying naturally occurring TCR DNA sequences by polymerase chain reaction (PCR), followed by expression of antibody variable regions, followed by the above-described selection steps. In some embodiments, TCRs are obtained from T cells isolated from patients or cultured T cell hybridomas. In some embodiments, TCR clones for target antigens have been generated in transgenic mice engineered with human immune system genes (e.g., HLA system or HLA). See, for example, tumor antigens (see, for example, Parkhurst et al. (2009) Clin Cancer Res. 15: 169-180 and Cohen et al. (2005) J Immunol. 175: 5799-5808. In some embodiments, phage display is used to isolate TCRs for target antigens (see, for example, Varela-Rohena et al. (2008) Nat Med. 14: 1390-1395 and Li (2005) Nat Biotechnol. 23: 349-354).
“T细胞受体”或“TCR”指包含可变α链和β链(分别称为TCRa和TCRb)或可变γ链和δ链(分别称为TCRg和TCRd)并且能够与结合MHC受体的抗原肽特异性结合的分子。在一些实施例中,TCR为αβ形式。通常,以αβ和γδ形式存在的TCR在结构上大致相似,但表达它们的T细胞可能具有不同的解剖位置或功能。TCR可在细胞表面或以可溶形式发现。通常,TCR位于T细胞(或T淋巴细胞)表面,其通常负责识别通过其细胞外结合域与主要组织相容性复合物(MHC)分子结合的抗原。在一些实施方案中,TCR还可包含恒定域、跨膜域和/或短胞质尾(参见,例如Janeway et ah,Immunobiology:The Immune System in Health and Disease,3rd Ed.,Current Biology Publications,p.4:33,1997)。例如,在某些方面,TCR的每条链可在C端具有N端免疫球蛋白可变域、免疫球蛋白恒定域、跨膜区和短胞质尾。在一些实施方案中,TCR与参与介导信号转导的CD3复合物的恒定蛋白结合。除非另有说明,术语“TCR”应理解为包括其功能性TCR片段。该术语还包括完整的或全长的TCR,包括αβ形式或γδ形式的TCR。"T cell receptor" or "TCR" refers to a molecule that comprises a variable α chain and a β chain (referred to as TCRRa and TCRb, respectively) or a variable γ chain and a δ chain (referred to as TCRg and TCRd, respectively) and is capable of specifically binding to an antigenic peptide bound to an MHC receptor. In some embodiments, the TCR is in the αβ form. Typically, TCRs in the αβ and γδ forms are roughly similar in structure, but the T cells expressing them may have different anatomical locations or functions. TCRs can be found on the cell surface or in a soluble form. Typically, TCRs are located on the surface of T cells (or T lymphocytes), which are generally responsible for recognizing antigens that are bound to major histocompatibility complex (MHC) molecules through their extracellular binding domains. In some embodiments, the TCR may also include a constant domain, a transmembrane domain, and/or a short cytoplasmic tail (see, e.g., Janeway et ah, Immunobiology: The Immune System in Health and Disease, 3rd Ed., Current Biology Publications, p. 4: 33, 1997). For example, in some aspects, each chain of a TCR may have an N-terminal immunoglobulin variable domain, an immunoglobulin constant domain, a transmembrane region, and a short cytoplasmic tail at the C-terminus. In some embodiments, the TCR binds to a constant protein of a CD3 complex involved in mediating signal transduction. Unless otherwise indicated, the term "TCR" is understood to include its functional TCR fragments. The term also includes complete or full-length TCRs, including TCRs in αβ or γδ forms.
因此,为本文的目的,提及的TCR包括任何TCR或功能性片段,例如TCR的抗原结合部分,其与结合在MHC分子中的特异性抗原肽,即MHC肽复合物结合。TCR的“抗原结合部分”或“抗原结合片段”可互换使用,是指包含TCR的一部分结构域但与全TCR结合的抗原(例如,MHC肽复合物)结合的分子。在某些情况下,抗原结合部分包含TCR的可变域,例如TCR的可变α链和可变β链,其足以形成结合特异性MHC肽复合物的结合部位,例如通常每条链包含三个互补决定区。Therefore, for the purposes of this article, reference to a TCR includes any TCR or functional fragment, such as an antigen binding portion of a TCR, which binds to a specific antigen peptide bound to an MHC molecule, i.e., an MHC peptide complex. "Antigen binding portion" or "antigen binding fragment" of a TCR is used interchangeably and refers to a molecule that contains a portion of a domain of a TCR but binds to an antigen (e.g., an MHC peptide complex) bound to the full TCR. In some cases, the antigen binding portion comprises a variable domain of a TCR, such as a variable α chain and a variable β chain of a TCR, which is sufficient to form a binding site that binds to a specific MHC peptide complex, such as typically each chain comprising three complementary determining regions.
在一些实施方案中,TCR链的可变域结合以形成类似于免疫球蛋白的环或互补决定区(CDR),其通过形成TCR分子的结合部位赋予抗原识别并决定肽的特异性。与免疫球蛋白一样,CDR通常由框架区(FR)分开(参见Jores et al.,Pwc.Nat′lAcad.Sci.U.S.A.87:9138,1990;Chothia et al.,EMBO J.7:3745,1988,也参见Lefranc et al.,Dev.Comp.Immunol.27:55,2003)。在一些实施方案中,CDR3是负责识别经处理的抗原的主要CDR,尽管也已表明α链的CDR1与抗原肽的N端部分相互作用,而β链的CDR1与肽的C端部分相互作用。CDR2被认为能够识别MHC分子。在一些实施方案中,β链的可变区可包含进一步的高变率(HV4)区。In some embodiments, the variable domains of the TCR chains combine to form a loop or complementary determining region (CDR) similar to that of immunoglobulins, which confers antigen recognition and determines the specificity of the peptide by forming the binding site of the TCR molecule. As with immunoglobulins, CDRs are usually separated by framework regions (FRs) (see Jores et al., Pwc. Nat'l Acad. Sci. U.S.A. 87:9138, 1990; Chothia et al., EMBO J. 7:3745, 1988, see also Lefranc et al., Dev. Comp. Immunol. 27:55, 2003). In some embodiments, CDR3 is the primary CDR responsible for recognizing the processed antigen, although it has also been shown that CDR1 of the α chain interacts with the N-terminal portion of the antigenic peptide, while CDR1 of the β chain interacts with the C-terminal portion of the peptide. CDR2 is believed to be able to recognize MHC molecules. In some embodiments, the variable region of the β chain may contain a further high variability (HV4) region.
在一些实施例中,TCR链包含恒定域。例如,与免疫球蛋白一样,TCR链的胞外部分(例如,α链、β链)可以包含两个免疫球蛋白结构域,一个可变域(例如Va或Vp;通常为根据Kabat编号的1-116氨基酸,Kabat et al.,″Sequences of Proteins of ImmunologicalInterest,US Dept.Health and Human Services,Public Health Service NationalInstitutes of Health,1991,5th ed.)和一个邻近细胞膜的恒定域(例如,a链恒定域或Ca,通常是基于Kabat的117-259氨基酸,β链恒定域或Cp,通常是基于Kabat的117-295氨基酸)。例如,在某些情况下,由两条链形成的TCR的胞外部分包含两个膜近侧的恒定域和两个膜远侧的包含CDR的可变域。TCR域的恒定域包含短连接序列,其中半胱氨酸残基形成二硫键,在两条链之间形成连接。在一些实施方案中,TCR可在α链和β链中的每一个链上具有额外的半胱氨酸残基,使得TCR在恒定域中包含两个二硫键。In some embodiments, the TCR chain comprises a constant domain. For example, as with immunoglobulins, the extracellular portion of the TCR chain (e.g., α chain, β chain) can comprise two immunoglobulin domains, a variable domain (e.g., Va or Vp; typically 1-116 amino acids according to Kabat numbering, Kabat et al., "Sequences of Proteins of Immunological Interest, US Dept. Health and Human Services, Public Health Service National Institutes of Health, 1991, 5th ed.) and a constant domain adjacent to the cell membrane (e.g., an alpha chain constant domain or Ca, typically 117-259 amino acids based on Kabat, a beta chain constant domain or Cp, typically 117-295 amino acids based on Kabat). For example, in some cases, the extracellular portion of the TCR formed by the two chains comprises two constant domains proximal to the membrane and two variable domains comprising CDRs distal to the membrane. The constant domain of the TCR domain comprises a short linker sequence in which cysteine residues form a disulfide bond to form a connection between the two chains. In some embodiments, the TCR may have additional cysteine residues on each of the alpha and beta chains such that the TCR comprises two disulfide bonds in the constant domain.
在一些实施方案中,TCR链可以包含跨膜结构域。在一些实施方案中,跨膜结构域带正电。在某些情况下,TCR链包含胞质尾。在某些情况下,该结构允许TCR与其他分子(如CD3)结合。例如,包含具有跨膜区的恒定域的TCR可以将蛋白质锚定在细胞膜上,并与CD3信号装置或复合物的恒定亚单位结合。In some embodiments, the TCR chain may include a transmembrane domain. In some embodiments, the transmembrane domain is positively charged. In some cases, the TCR chain includes a cytoplasmic tail. In some cases, the structure allows TCR to be combined with other molecules (such as CD3). For example, a TCR comprising a constant domain with a transmembrane region can anchor the protein to the cell membrane and bind to the constant subunit of a CD3 signaling device or a complex.
通常,CD3是一种多蛋白复合物,在哺乳动物和ζ链中可具有三条不同的链(γ、δ和ε)。例如,在哺乳动物中,该复合物可包含一条CD3y链、一条CD35链、两条CD3链和一条CD3ζ链的同源二聚体。CD3y、CD35和CD3链是免疫球蛋白超家族的高度相关的细胞表面蛋白,包含单个免疫球蛋白结构域。CD3y、CD35和CD3链的跨膜区域带负电荷,这是允许这些链与带正电的T细胞受体链结合的特征。CD3y、CD35和CD3链的胞内尾各包含一个保守基序,称为免疫受体酪氨酸基激活基序或ITAM,而每个CD3ζ链有三个保守基序。一般而言,ITAM参与TCR复合体的信号传输能力。这些辅助分子具有带负电荷的跨膜区域,并在将信号从TCR传播至细胞中发挥作用。CD3-和ζ-链与TCR一起形成T细胞受体复合物。Typically, CD3 is a multiprotein complex that can have three different chains (γ, δ, and ε) in mammals and a ζ chain. For example, in mammals, the complex can contain a homodimer of a CD3y chain, a CD35 chain, two CD3 chains, and a CD3ζ chain. The CD3y, CD35, and CD3 chains are highly related cell surface proteins of the immunoglobulin superfamily, containing a single immunoglobulin domain. The transmembrane regions of the CD3y, CD35, and CD3 chains are negatively charged, a feature that allows these chains to bind to positively charged T cell receptor chains. The intracellular tails of the CD3y, CD35, and CD3 chains each contain a conserved motif, called an immunoreceptor tyrosine-based activation motif or ITAM, and each CD3ζ chain has three conserved motifs. In general, ITAMs are involved in the signaling ability of the TCR complex. These accessory molecules have negatively charged transmembrane regions and play a role in propagating signals from the TCR to the cell. The CD3- and ζ- chains form the T cell receptor complex together with the TCR.
在一些实施方案中,TCR可以是两条链α和β(或可选的γ和δ)的异二聚体,或者可以是单链TCR构建体。在一些实施方案中,TCR是包含通过例如二硫键或二硫键连接两个单独的链(α和β链或γ和δ链)的异二聚体。In some embodiments, TCR can be a heterodimer of two chains α and β (or optional γ and δ), or can be a single-chain TCR construct. In some embodiments, TCR is a heterodimer comprising two separate chains (α and β chains or γ and δ chains) connected by, for example, a disulfide bond or a disulfide bond.
虽然T细胞受体(TCR)是跨膜蛋白,且不以可溶形式自然存在,但抗体可被分泌和膜结合。重要的是,TCR相对于抗体的优势在于,当在MHC分子的情况下呈递时,其原则上能够识别所有降解的细胞内和细胞外蛋白产生的肽。因此,TCR具有重要的治疗潜力。Although T cell receptors (TCRs) are transmembrane proteins and do not exist naturally in soluble form, antibodies can be secreted and membrane-bound. Importantly, the advantage of TCRs over antibodies is that, when presented in the context of MHC molecules, they can in principle recognize peptides generated by all degraded intracellular and extracellular proteins. Therefore, TCRs have important therapeutic potential.
本公开还涉及包含针对如本文所公开的肿瘤新抗原肽的抗原识别部分的可溶T细胞受体(sTCR)(特别参见Walseng E,S,Fallang L-E,Yang W,Vefferstad A,Areffard A,et al.(2015)Soluble T-Cell Receptors Produced in Human Cells forTargeted Delivery.PLoS ONE 10(4):e0119559)。在一个特定的实施方案中,可溶的TCR可与针对T细胞抗原的抗体片段融合,任选地,其中靶向的抗原为CD3或CD16(参见例如Boudousquie,Caroline et al.“Polyfunctional response by ImmTAC(IMCgp100)redirected CD8+and CD4+T cells.”Immunology vol.152,3(2017):425-438.doi:10.1111/imm.12779)。The present disclosure also relates to soluble T cell receptors (sTCRs) comprising an antigen recognition portion directed against a tumor neoantigenic peptide as disclosed herein (see in particular Walseng E, S, Fallang LE, Yang W, Vefferstad A, Areffard A, et al. (2015) Soluble T-Cell Receptors Produced in Human Cells for Targeted Delivery. PLoS ONE 10 (4): e0119559). In a specific embodiment, the soluble TCR can be fused with an antibody fragment against a T cell antigen, optionally wherein the targeted antigen is CD3 or CD16 (see, e.g., Boudousquie, Caroline et al. "Polyfunctional response by ImmTAC (IMCgp100) redirected CD8+ and CD4+ T cells." Immunology vol. 152, 3 (2017): 425-438. doi: 10.1111 / imm.12779).
在某些实施方案中,本公开包括重组HLA非依赖性(或非HLA限制性)T细胞受体(称为“HI-TCR”),其以HLA非依赖性方式结合如本文定义的(跨膜)嵌合蛋白(特别是如前定义的SEQ ID NO:1-8202中任何一个的新抗原肽)。在国际申请号WO2019/157454中描述了预期的且非常适合本发明的“HI-TCR”。因此,根据本公开的HI-TCR通常包含抗原结合链,该链包含:(a)以HLA非依赖性方式与抗原结合的抗原结合结构域(如前所定义),例如免疫球蛋白可变区的抗原结合片段;和(b)能够与CD3ζ多肽结合(并因此激活)的恒定域。因为TCR通常以HLA依赖性方式结合抗原,所以以HLA非依赖性方式结合的抗原结合结构域是异源的。优选地,抗原结合结构域或其片段包含:(i)包含或由抗体的重链可变区(VH)和/或(ii)抗体的轻链可变区(VL)组成的抗原结合结构域。TCR的恒定域是例如天然的或经修饰的TRAC多肽,或天然的或经修饰的TRBC多肽。TCR的恒定域是例如天然TCR恒域(α或β)或其片段。与通常自体包含胞内信号传导结构域的嵌合抗原受体不同,HI-TCR不直接产生激活信号;相反,抗原结合链与CD3ζ多肽结合并因此激活该多肽。当抗原在细胞表面的密度低于约10,000个分子/细胞时,包含重组TCR的免疫细胞提供了优异的活性。In certain embodiments, the present disclosure includes a recombinant HLA-independent (or non-HLA-restricted) T cell receptor (referred to as "HI-TCR") that binds to a (transmembrane) chimeric protein as defined herein (particularly a new antigenic peptide as defined above in any one of SEQ ID NOs: 1-8202). The intended "HI-TCR" that is very suitable for the present invention is described in International Application No. WO2019/157454. Therefore, the HI-TCR according to the present disclosure generally comprises an antigen binding chain comprising: (a) an antigen binding domain (as defined above) that binds to an antigen in an HLA-independent manner, such as an antigen binding fragment of an immunoglobulin variable region; and (b) a constant domain that is capable of binding to (and thus activating) a CD3ζ polypeptide. Because TCRs generally bind to antigens in an HLA-dependent manner, the antigen binding domain that binds in an HLA-independent manner is heterologous. Preferably, the antigen binding domain or fragment thereof comprises: (i) an antigen binding domain comprising or consisting of a heavy chain variable region (VH) of an antibody and/or (ii) a light chain variable region (VL) of an antibody. The constant domain of the TCR is, for example, a natural or modified TRAC polypeptide, or a natural or modified TRBC polypeptide. The constant domain of the TCR is, for example, a natural TCR constant domain (α or β) or a fragment thereof. Unlike chimeric antigen receptors that typically contain intracellular signaling domains, HI-TCRs do not directly generate activation signals; instead, the antigen binding chain binds to the CD3ζ polypeptide and thereby activates the polypeptide. Immune cells comprising recombinant TCRs provide excellent activity when the density of antigen on the cell surface is less than about 10,000 molecules/cell.
CD3ζ多肽是例如天然的CD3ζ多肽或经修饰的CD3ζ多肽。CD3ζ多肽任选地与共刺激分子或其片段的胞内结构域融合。或者,抗原结合结构域任选地包含共刺激区域,例如胞内结构域,其能够在抗原结合链与抗原结合时刺激免疫响应性细胞。共刺激分子的例子包括CD28、4-1BB、OX40、ICOS、DAP-10、其片段或其组合。The CD3 ζ polypeptide is, for example, a natural CD3 ζ polypeptide or a modified CD3 ζ polypeptide. The CD3 ζ polypeptide is optionally fused to the intracellular domain of a costimulatory molecule or a fragment thereof. Alternatively, the antigen binding domain optionally comprises a costimulatory region, such as an intracellular domain, which can stimulate immune responsive cells when the antigen binding chain binds to the antigen. Examples of costimulatory molecules include CD28, 4-1BB, OX40, ICOS, DAP-10, fragments thereof, or combinations thereof.
在一些实施方案中,重组HI-TCR通过整合在免疫响应性细胞的内源基因座上的转基因表达,例如,CD3δ基因组、CD3ε基因座、CD247基因座、B2M基因座、TRAC基因座、TRBC基因座、TRDC基因座和/或TRGC基因座。在大多数实施方案中,重组HI-TCR的表达由内源性TRAC或TRBC基因座驱动。在一些实施方案中,编码重组HI-TCR的一部分的转基因以破坏或消除包含天然TCRα链和/或天然TCRβ链的TCR的内源表达的方式整合入内源性TRAC和/或TRBC基因座。该破坏防止或消除了重组TCR与免疫响应性细胞中天然TCRα链和/或天然TCRβ链之间的错配。内源基因座还可包含经修饰的转录终止子区域,例如,TK转录终止子、GCSF转录终止子、TCRA转录终止子、HBB转录终止子、牛生长激素转录终止子、SV40转录终止子和P2A元件。In some embodiments, the recombinant HI-TCR is expressed by a transgene integrated into an endogenous locus of an immune responsive cell, for example, a CD3 δ genome, a CD3 ε locus, a CD247 locus, a B2M locus, a TRAC locus, a TRBC locus, a TRDC locus, and/or a TRGC locus. In most embodiments, the expression of the recombinant HI-TCR is driven by an endogenous TRAC or TRBC locus. In some embodiments, a transgene encoding a portion of the recombinant HI-TCR is integrated into an endogenous TRAC and/or TRBC locus in a manner that destroys or eliminates the endogenous expression of a TCR comprising a natural TCR α chain and/or a natural TCR β chain. The destruction prevents or eliminates mismatches between the recombinant TCR and the natural TCR α chain and/or the natural TCR β chain in the immune responsive cell. The endogenous locus may also include a modified transcription terminator region, for example, a TK transcription terminator, a GCSF transcription terminator, a TCRA transcription terminator, an HBB transcription terminator, a bovine growth hormone transcription terminator, an SV40 transcription terminator, and a P2A element.
在本公开的一些实施方案中,重组TCR和HI-TCR通常包含能够与第二胞外抗原结合结构域二聚的胞外抗原结合结构域。通常,第二胞外抗原结合结构域结合肿瘤抗原,优选其中肿瘤抗原选自pHER95、CD19、MUC16、MUC1、CAIX、CEA、CD8、CD7、CD10、CD20、CD22、CD30、CD70、CLL1、CD33、CD34、CD38、CD41、CD44、CD49f、CD56、CD74、CD133、CD138、EGP-2、EGP-40、EpCAM、Erb-B2、Erb-B3、Erb-B4、FBP、胎儿乙酰胆碱受体、叶酸受体-a、GD2、GD3、HER-2、hTERT、IL-13R-a2、κ-轻链、KDR、LeY、L1细胞粘附分子、MAGE-A1、间皮素、MAGEA3、p53、MART1,GP100、蛋白酶3(PR1)、酪氨酸酶、生存素、hTERT、EphA2、NKG2D配体、NY-ESO-1、癌胚抗原(h5T4)、PSCA、PSMA、ROR1、TAG-72、TAG-R2、WT-1、BCMA、CD123、CD44V6、NKCS1、EGF1R、EGFR-VIII、CD99、CD70、ADGRE2、CCR1、LILRB2、LILRB4、PRAME和ERBB。In some embodiments of the present disclosure, the recombinant TCR and HI-TCR generally comprise an extracellular antigen binding domain that can dimerize with a second extracellular antigen binding domain. Generally, the second extracellular antigen binding domain binds to a tumor antigen, preferably wherein the tumor antigen is selected from pHER95, CD19, MUC16, MUC1, CAIX, CEA, CD8, CD7, CD10, CD20, CD22, CD30, CD70, CLL1, CD33, CD34, CD38, CD41, CD44, CD49f, CD56, CD74, CD133, CD138, EGP-2, EGP-40, EpCAM, Erb-B2, Erb-B3, Erb-B4, FBP, fetal acetylcholine receptor, folate receptor-a, GD2, GD3, HER-2, hTERT, IL-13R -a2, kappa-light chain, KDR, LeY, L1 cell adhesion molecule, MAGE-A1, mesothelin, MAGEA3, p53, MART1, GP100, proteinase 3 (PR1), tyrosinase, survivin, hTERT, EphA2, NKG2D ligand, NY-ESO-1, carcinoembryonic antigen (h5T4), PSCA, PSMA, ROR1, TAG-72, TAG-R2, WT-1, BCMA, CD123, CD44V6, NKCS1, EGF1R, EGFR-VIII, CD99, CD70, ADGRE2, CCR1, LILRB2, LILRB4, PRAME and ERBB.
本公开还涵盖针对本文所公开的嵌合多肽(蛋白)和特别是本文定义的SEQ IDNO:1-8202中任何一个的跨膜嵌合蛋白的嵌合抗原受体(CAR)。在优选实施方案中,CAR包含如前定义的抗原结合结构域。CAR是包含抗原结合结构域的融合蛋白,通常来源于与TCR复合物信号传导结构域相连的抗体。通过选择合适的抗原结合结构域,CAR可用于将免疫细胞(如T细胞或NK T细胞)定向至如先前定义的肿瘤新抗原肽。The present disclosure also encompasses chimeric antigen receptors (CARs) for chimeric polypeptides (proteins) disclosed herein and, in particular, transmembrane chimeric proteins of any one of SEQ ID NOs: 1-8202 defined herein. In a preferred embodiment, CAR comprises an antigen binding domain as defined above. CAR is a fusion protein comprising an antigen binding domain, typically derived from an antibody linked to a TCR complex signaling domain. By selecting a suitable antigen binding domain, CAR can be used to direct immune cells (such as T cells or NK T cells) to tumor neoantigen peptides as previously defined.
CAR的抗原结合结构域通常基于来自抗体的scFv(单链可变片段)。除了N端的胞外抗体结合结构域,CAR通常可包含作为间隔子的铰链结构域,以将抗原结合结构域从其表达的免疫效应细胞的质膜上延伸出去;跨膜(TM)结构域;胞内信号传导结构域(例如来自TCR复合物的CD3分子ζ链的信号传导结构域或同等物);以及可选的一个或多个共刺激结构域,其可协助表达CAR的细胞的信号传导或功能。单独(第二代)或组合(第三代)地添加来自共刺激分子(包括CD28、OX-40(CD134)和4-1BB(CD137))的信号传导结构域以增强CAR修饰的T细胞的存活和提高其增殖。潜在的共刺激结构域还包括ICOS-1、CD27、GITR和DAP10。The antigen binding domain of CAR is usually based on scFv (single-chain variable fragment) from antibodies. In addition to the extracellular antibody binding domain at the N-terminus, CAR may generally include a hinge domain as a spacer to extend the antigen binding domain from the plasma membrane of the immune effector cell it expresses; transmembrane (TM) domain; intracellular signaling domain (e.g., signaling domain or equivalent from the CD3 molecule ζ chain of the TCR complex); and optional one or more costimulatory domains, which can assist in the signaling or function of cells expressing CAR. Signaling domains from costimulatory molecules (including CD28, OX-40 (CD134) and 4-1BB (CD137)) are added alone (second generation) or in combination (third generation) to enhance the survival of CAR-modified T cells and increase their proliferation. Potential costimulatory domains also include ICOS-1, CD27, GITR and DAP10.
因此,CAR可包括Therefore, CAR may include
(1)在其胞外部分,一种或多种抗原结合分子,例如一种或多种抗原结合片段、结构域或抗体的一部分,或一种或多种抗体可变域,和/或抗体分子,以及通常一种或多种如前定义的抗原结合结构域。(1) in its extracellular portion, one or more antigen binding molecules, such as one or more antigen binding fragments, domains or parts of antibodies, or one or more antibody variable domains, and/or antibody molecules, and generally one or more antigen binding domains as defined above.
(2)在其跨膜部分,来源于人T细胞受体-α或-β链、CD3ζ链、CD28、CD3-ε、CD45、CD4、CD5、CD8、CD9、CD16、CD22、CD33、CD37、CD64、CD80、CD86、CD134、CD137、ICOS、CD154或GITR的跨膜结构域。在一些实施方案中,跨膜结构域源自CD28、CD8或CD3-ζ。(2) In its transmembrane portion, the transmembrane domain is derived from human T cell receptor-α or -β chain, CD3ζ chain, CD28, CD3-ε, CD45, CD4, CD5, CD8, CD9, CD16, CD22, CD33, CD37, CD64, CD80, CD86, CD134, CD137, ICOS, CD154 or GITR. In some embodiments, the transmembrane domain is derived from CD28, CD8 or CD3-ζ.
(3)一个或多个共刺激结构域,例如源自人CD28、4-1BB(CD137)、ICOS-1、CD27、OX40(CD137)、DAP10和GITR(AITR)的共刺激结构域。在一些实施方案中,CAR包含CD28和4-1BB的共刺激结构域。(3) one or more costimulatory domains, for example, costimulatory domains derived from human CD28, 4-1BB (CD137), ICOS-1, CD27, OX40 (CD137), DAP10 and GITR (AITR). In some embodiments, CAR comprises a costimulatory domain of CD28 and 4-1BB.
(4)在其胞内信号传导结构域中,包含一个或多个ITAM的胞内信号传导结构域,例如,胞内信号传导结构域为CD3-ζ,或其缺少一个或两个ITAM的变体(例如,ITAM3和ITAM2),或胞内信号传导结构域衍生自FcεRIγ。(4) An intracellular signaling domain comprising one or more ITAMs, for example, the intracellular signaling domain is CD3-ζ, or a variant thereof lacking one or two ITAMs (e.g., ITAM3 and ITAM2), or the intracellular signaling domain is derived from FcεRIγ.
CAR可被设计为单独识别肿瘤新抗原肽或与HLA或MHC分子结合识别肿瘤新抗原肽。CAR can be designed to recognize tumor neoantigen peptides alone or in combination with HLA or MHC molecules.
用于结合抗原的部分包括三大类,来自抗体的单链抗体片段(scFvs)、来自文库的Fab’s或与其同源受体结合的天然配体(用于第一代CAR)。上述每一类的成功实例在Sadelain M,Brentjens R,Riviere I.The basic principles of chimeric antigenreceptor(CAR)design.Cancer discovery.2013;3(4):388-398中均有特别报道(特别参见表1)并包含在本申请中。The parts used to bind antigens include three major categories, single-chain antibody fragments (scFvs) from antibodies, Fab's from libraries, or natural ligands that bind to their cognate receptors (for first-generation CARs). Successful examples of each of the above categories are specifically reported in Sadelain M, Brentjens R, Riviere I. The basic principles of chimeric antigen receptor (CAR) design. Cancer discovery. 2013; 3 (4): 388-398 (see Table 1 in particular) and are included in this application.
抗体包括嵌合的、人源化的或人的抗体,并且可以如上所述进一步亲和力成熟和选择。通常使用来源于啮齿动物免疫球蛋白(例如,小鼠、大鼠)的嵌合或人源化scFv,因为它们容易来源于完全表征的单克隆抗体。人源化抗体包含啮齿动物序列衍生的CDR区域;通常,啮齿类动物CDR被植入人框架,并且一些人框架残基可被反向突变为原始的啮齿类动物框架残基以保持亲和力,和/或一个或几个CDR残基可被突变而提高亲和力。全人抗体不含鼠序列,通常通过人抗体库的噬菌体展示技术或其天然免疫球蛋白基因座已被人免疫球蛋白基因座片段取代的转基因小鼠的免疫而产生。可以产生在天然氨基酸序列中具有一个或多个氨基酸取代、插入或缺失的抗体变体,其中抗体保留或基本上保留其特异性结合功能。氨基酸的保守取代是众所周知的,并且如上所述。还可以产生对抗原具有改善的亲和力的其他变体。Antibodies include chimeric, humanized or human antibodies, and can be further affinity matured and selected as described above. Chimeric or humanized scFvs derived from rodent immunoglobulins (e.g., mice, rats) are generally used because they are easily derived from fully characterized monoclonal antibodies. Humanized antibodies include rodent sequence-derived CDR regions; generally, rodent CDRs are implanted into human frameworks, and some human framework residues may be reversely mutated to original rodent framework residues to maintain affinity, and/or one or more CDR residues may be mutated to improve affinity. Fully human antibodies do not contain mouse sequences and are generally produced by immunization of transgenic mice whose phage display technology of human antibody libraries or their natural immunoglobulin loci have been replaced by human immunoglobulin locus fragments. Antibody variants having one or more amino acid substitutions, insertions or deletions in the native amino acid sequence can be produced, wherein the antibody retains or substantially retains its specific binding function. Conservative substitutions of amino acids are well known and as described above. Other variants with improved affinity for antigens can also be produced.
通常,CAR包括先前定义的来自抗体分子的抗原结合结构域,例如从单克隆抗体(mAb)的可变重链(VH)和可变轻链(VL)衍生的单链抗体片段(scFv)。Typically, CARs include a previously defined antigen-binding domain from an antibody molecule, such as a single-chain antibody fragment (scFv) derived from the variable heavy (VH) and variable light (VL) chains of a monoclonal antibody (mAb).
在某些方面,CAR的抗原结合结构域与一个或多个跨膜和胞内信号传导结构域相连。在一些实施方案中,CAR包括与CAR的胞外区域融合的跨膜结构域。在一个实施方案中,使用与CAR中的一个结构域天然结合的跨膜结构域。在某些情况下,通过氨基酸取代选择或修饰跨膜结构域,以避免该结构域与相同或不同表面膜蛋白的跨膜结构域结合,从而使与受体复合物其他成员的相互作用最小化。In some aspects, the antigen binding domain of CAR is connected to one or more transmembrane and intracellular signaling domains. In some embodiments, CAR includes a transmembrane domain fused to the extracellular region of CAR. In one embodiment, a transmembrane domain naturally bound to a domain in CAR is used. In some cases, the transmembrane domain is selected or modified by amino acid substitution to avoid binding of the domain to the transmembrane domain of the same or different surface membrane proteins, thereby minimizing the interaction with other members of the receptor complex.
在一些实施方案中,跨膜结构域来源于天然或合成来源。如果来源是天然的,则该结构域可来源于任何膜结合或跨膜蛋白。跨膜区包括源自(即至少包含)T细胞受体、CD28、CD3ε、CD45、CD4、CD5、CD8、CD9、CD16、CD22、CD33、CD37、CD64、CD80、CD86、CD 134、CD137、CD154、ICOS或GITR的α、β或ζ链的跨膜区。跨膜结构域也可以是合成的。在一些实施方案中,跨膜结构域源自CD28、CD8或CD3-ζ。In some embodiments, the transmembrane domain is derived from a natural or synthetic source. If the source is natural, the domain can be derived from any membrane-bound or transmembrane protein. The transmembrane region includes a transmembrane region derived from (i.e., at least comprising) α, β or ζ chains of a T cell receptor, CD28, CD3ε, CD45, CD4, CD5, CD8, CD9, CD16, CD22, CD33, CD37, CD64, CD80, CD86, CD 134, CD137, CD154, ICOS or GITR. The transmembrane domain can also be synthetic. In some embodiments, the transmembrane domain is derived from CD28, CD8 or CD3-ζ.
在一些实施方案中,存在短的寡或多肽接头,例如长度为2至10个氨基酸的接头,并在CAR的跨膜结构域和胞质信号传导结构域之间形成连接。In some embodiments, a short oligo- or polypeptide linker is present, e.g., a linker of 2 to 10 amino acids in length, and forms a connection between the transmembrane domain and the cytoplasmic signaling domain of the CAR.
CAR通常包括至少一种胞内信号传导成分。第一代CAR通常具有来自CD3ζ链的胞内区域,其为来自内源性TCR的信号的主要传递者。第二代CAR通常进一步包含从各种共刺激蛋白受体(例如,CD28,41BB(CD28),ICOS)到CAR胞质尾的胞内信号传导结构域,以向T细胞提供额外信号。共刺激结构域包括源自人CD28、4-1BB(CD137)、ICOS-1、CD27、OX 40(CD137)、DAP10和GITR(AITR)的结构域。考虑了两个共刺激结构域的组合,例如CD28和4-1BB,或CD28和OX40。第三代CAR组合了多种信号传导结构域,如CD3z-CD28-4-1BB或CD3z-CD28-OX40,以增强效力。CAR generally includes at least one intracellular signaling component.The first generation CAR generally has an intracellular region from CD3ζ chains, which is the main transmitter of the signal from endogenous TCR.The second generation CAR generally further includes an intracellular signaling domain from various costimulatory protein receptors (e.g., CD28, 41BB (CD28), ICOS) to the cytoplasmic tail of CAR, to provide additional signals to T cells.Co-stimulatory domains include domains derived from human CD28, 4-1BB (CD137), ICOS-1, CD27, OX 40 (CD137), DAP10 and GITR (AITR).Considering the combination of two costimulatory domains, such as CD28 and 4-1BB, or CD28 and OX40.The third generation CAR combines a variety of signaling domains, such as CD3z-CD28-4-1BB or CD3z-CD28-OX40, to enhance effectiveness.
胞内信号传导结构域可以来自TCR复合物的细胞内成分,例如介导T细胞激活和细胞毒性的TCR CD3+链,例如CD3ζ链。其他胞内信号传导结构域包括FcεRIγ。胞内信号传导结构域可包含经修饰的CD3ζ多肽,该多肽缺少其三个免疫受体酪氨酸基激活基序(ITAM)中的一个或两个,其中ITAM为ITAM1、ITAM2和ITAM3(从N端至C端编号)。CD3-ζ的胞内信号传导区为SEQ ID NO:4的第22-164位残基。ITAM1位于第61-89位氨基酸残基周围,ITAM2位于第100-128位氨基酸残基周围,并且ITAM3位于第131-159位残基周围。因此,经修饰的CD3ζ多肽可使ITAM1、ITAM2或ITAM3中的任何一种失活。或者,经修饰的CD3ζ多肽可使任何两种ITAM失活,例如ITAM2和ITAM3,或ITAM1和ITAM2。优选地,ITAM3失活,例如缺失。更优选地,ITAM2和ITAM3失活,例如缺失,保留ITAM1。例如,一个经修饰的CD3ζ多肽仅保留ITAM1,并且其余的CD3ζ域缺失(第90-164位残基)。作为另一个实例,ITAM1被ITAM3的氨基酸序列取代,并且剩余的CD3ζ域缺失(第90-164位残基)。例如,参见Bridgeman et al.,Clin.Exp.Immunol.175(2):258-67(2014)、Zhao et al.,J.Immunol.183(9):5563-74(2009)、Maus et al.,WO 2018/132506;Sadelain et al.、WO/2019/133969、Feucht etal.,Nat Med.25(1):82-88(2019)。The intracellular signaling domain can be derived from the intracellular component of the TCR complex, such as the TCR CD3+ chain that mediates T cell activation and cytotoxicity, such as the CD3ζ chain. Other intracellular signaling domains include FcεRIγ. The intracellular signaling domain may comprise a modified CD3ζ polypeptide that lacks one or two of its three immunoreceptor tyrosine-based activation motifs (ITAMs), wherein ITAMs are ITAM1, ITAM2, and ITAM3 (numbered from the N-terminus to the C-terminus). The intracellular signaling region of CD3-ζ is residues 22-164 of SEQ ID NO: 4. ITAM1 is located around amino acid residues 61-89, ITAM2 is located around amino acid residues 100-128, and ITAM3 is located around residues 131-159. Thus, the modified CD3ζ polypeptide may inactivate any of ITAM1, ITAM2, or ITAM3. Alternatively, the modified CD3 zeta polypeptide may inactivate any two ITAMs, such as ITAM2 and ITAM3, or ITAM1 and ITAM2. Preferably, ITAM3 is inactivated, such as deleted. More preferably, ITAM2 and ITAM3 are inactivated, such as deleted, and ITAM1 is retained. For example, a modified CD3 zeta polypeptide retains only ITAM1, and the remaining CD3 zeta domain is deleted (residues 90-164). As another example, ITAM1 is replaced by the amino acid sequence of ITAM3, and the remaining CD3 zeta domain is deleted (residues 90-164). For example, see Bridgeman et al., Clin. Exp. Immunol. 175(2): 258-67 (2014), Zhao et al., J. Immunol. 183(9): 5563-74 (2009), Maus et al., WO 2018/132506; Sadelain et al., WO/2019/133969, Feucht et al., Nat Med. 25(1): 82-88 (2019).
因此,在某些方面,抗原结合结构域连接到一个或多个细胞信号传导模块。在一些实施方案中,细胞信号传导模块包括CD3跨膜结构域、CD3胞内信号传导结构域和/或其他CD跨膜结构域。CAR还可进一步包括一种或多种额外分子的一部分,例如Fc受体γ、CD8、CD4、CD25或CD16。Therefore, in some aspects, the antigen binding domain is connected to one or more cell signaling modules. In some embodiments, the cell signaling module includes a CD3 transmembrane domain, a CD3 intracellular signaling domain, and/or other CD transmembrane domains. CAR may also further include a portion of one or more additional molecules, such as Fc receptor gamma, CD8, CD4, CD25, or CD16.
在一些实施方案中,在CAR连接后,CAR的胞质结构域或胞内信号传导结构域激活相应非工程化免疫细胞(通常为T细胞)的至少一种正常效应子功能或应答。例如,CAR可诱导T细胞的功能,例如细胞溶解活性或T辅助活性、细胞因子或其他因子的分泌。In some embodiments, after CAR connection, the cytoplasmic domain or intracellular signaling domain of CAR activates at least one normal effector function or response of the corresponding non-engineered immune cell (usually T cell).For example, CAR can induce the function of T cells, such as cytolytic activity or T helper activity, secretion of cytokines or other factors.
在一些实施方案中,胞内信号传导结构域包括T细胞受体(TCR)的胞质序列,并且在一些方面还包括在自然环境中与该受体协同作用以在抗原特异性受体结合后启动信号转导的共受体的胞质序列,和/或该分子的变体,和/或具有相同功能性能力的任何合成序列。In some embodiments, the intracellular signaling domain comprises a cytoplasmic sequence of a T cell receptor (TCR), and in some aspects also comprises a cytoplasmic sequence of a co-receptor that acts synergistically with the receptor in its natural environment to initiate signal transduction following antigen-specific receptor binding, and/or a variant of that molecule, and/or any synthetic sequence having the same functional capability.
T细胞激活在某些方面被描述为由两类胞质信号传导序列介导:通过TCR启动抗原依赖性初级激活(初级胞质信号传导序列),以及以抗原非依赖性方式提供二级或共刺激信号(二级胞质信号传导序列)。在某些方面,CAR包括一个或两个这样的信号传导组分。T cell activation is described in some aspects as being mediated by two types of cytoplasmic signaling sequences: antigen-dependent primary activation (primary cytoplasmic signaling sequence) is initiated by TCR, and secondary or costimulatory signals (secondary cytoplasmic signaling sequence) are provided in an antigen-independent manner. In some aspects, CAR includes one or two such signaling components.
在某些方面,CAR包括以刺激方式或抑制方式调节TCR复合物初级激活的初级胞质信号传导序列。以刺激方式起作用的初级胞质信号传导序列可能包含信号传导基序,该信号传导基序被称为免疫受体酪氨酸基激活基序或ITAM。包含初级胞质信号传导序列的ITAM的实例包括源自TCRζ、FcRγ、FcRβ、CD3γ、CD3δ、CD3ε、CDS、CD22、CD79a、CD79b和CD66d的序列。在一些实施方案中,CAR中的胞质信号传导分子包含胞质信号传导结构域、其部分或来源于CD3ζ的序列。In some aspects, CAR includes a primary cytoplasmic signaling sequence that regulates the primary activation of the TCR complex in a stimulatory or inhibitory manner. The primary cytoplasmic signaling sequence that acts in a stimulatory manner may include a signaling motif, which is referred to as an immunoreceptor tyrosine-based activation motif or ITAM. Examples of ITAMs comprising primary cytoplasmic signaling sequences include sequences derived from TCR ζ, FcR γ, FcR β, CD3 γ, CD3 δ, CD3 ε, CDS, CD22, CD79a, CD79b, and CD66d. In some embodiments, the cytoplasmic signaling molecule in CAR includes a cytoplasmic signaling domain, a portion thereof, or a sequence derived from CD3 ζ.
CAR还可包括共刺激受体的信号传导结构域和/或跨膜部分,例如CD28、4-1BB、OX40、DAP10和ICOS。在某些方面,同一CAR包括激活和共刺激成分;或者,激活域由一种CAR提供,而共刺激成分由识别另一种抗原的另一种CAR提供。CAR can also include the signaling domain and/or transmembrane portion of a co-stimulatory receptor, such as CD28, 4-1BB, OX40, DAP10, and ICOS. In some aspects, the same CAR includes activation and co-stimulatory components; alternatively, the activation domain is provided by one CAR, and the co-stimulatory component is provided by another CAR that recognizes another antigen.
因此,在一些实施方案中,CAR可包括:Thus, in some embodiments, a CAR may include:
(1)在其胞外部分,一种或多种抗原结合分子,例如一种或多种抗原结合片段、结构域或抗体的一部分,或一种或多种抗体可变域(重链和/或轻链),和/或抗体分子。(1) In its extracellular portion, one or more antigen-binding molecules, such as one or more antigen-binding fragments, domains or portions of antibodies, or one or more antibody variable domains (heavy and/or light chains), and/or antibody molecules.
(2)在其跨膜部分,来源于人T细胞受体-α或-β链、CD3ζ链、CD28、CD3-ε、CD45、CD4、CD5、CD8、CD9、CD16、CD22、CD33、CD37、CD64、CD80、CD86、CD134、CD137、ICOS、CD154或GITR的跨膜结构域。在一些实施方案中,跨膜结构域源自CD28、CD8或CD3-ζ。(2) In its transmembrane portion, the transmembrane domain is derived from human T cell receptor-α or -β chain, CD3ζ chain, CD28, CD3-ε, CD45, CD4, CD5, CD8, CD9, CD16, CD22, CD33, CD37, CD64, CD80, CD86, CD134, CD137, ICOS, CD154 or GITR. In some embodiments, the transmembrane domain is derived from CD28, CD8 or CD3-ζ.
(3)一个或多个共刺激结构域,例如源自人CD28、4-1BB(CD137)、ICOS-1、CD27、OX40(CD137)、DAP10和GITR(AITR)的共刺激结构域。在一些实施方案中,CAR包含CD28和4-1BB的共刺激结构域。(3) one or more costimulatory domains, for example, costimulatory domains derived from human CD28, 4-1BB (CD137), ICOS-1, CD27, OX40 (CD137), DAP10 and GITR (AITR). In some embodiments, CAR comprises a costimulatory domain of CD28 and 4-1BB.
(4)在其胞内信号传导结构域中,包含一个或多个ITAM的一个或多个胞内信号传导结构域,例如:来自CD3-ζ的胞内信号传导结构域或其一部分,或其缺少一个或两个ITAM的变体(例如:ITAM3和/或ITAM2,另见上文详述和参考文献)、FcRγ、FcRβ、CD3γ、CD3δ、CD3ε、CDS、CD22、CD79a、CD79b和/或CD66d,特别是选自CD3-ζ的胞内结构域,或其缺少一个或两个ITAM的变体(例如:ITAM3和ITAM2),或FcεRIγ的胞内信号传导及其变体。(4) one or more intracellular signaling domains comprising one or more ITAMs in its intracellular signaling domain, for example: an intracellular signaling domain or a portion thereof from CD3-ζ, or a variant thereof lacking one or two ITAMs (for example: ITAM3 and/or ITAM2, see also the above detailed description and references), FcRγ, FcRβ, CD3γ, CD3δ, CD3ε, CDS, CD22, CD79a, CD79b and/or CD66d, in particular, an intracellular domain selected from CD3-ζ, or a variant thereof lacking one or two ITAMs (for example: ITAM3 and ITAM2), or an intracellular signaling domain of FcεRIγ and variants thereof.
CAR或其他抗原特异性受体也可以是抑制性CAR(例如iCAR),并且包括阻碍或抑制反应(例如免疫应答)的细胞内成分。这种胞内信号传导成分的实例是在免疫检查点分子上发现的,包括PD-1、CTLA4、LAG3、BTLA、OX2R、TIM-3、TIGIT、LAIR-1、PGE2受体、EP2/4腺苷受体(包括A2AR)。在一些方面,工程化细胞包括抑制性CAR,该抑制性CAR包括这种抑制性分子的信号传导结构域或衍生自这种抑制性分子的信号传导结构域,从而其用于阻碍细胞的反应。例如,当激活受体(例如CAR)识别的抗原也在正常细胞表面表达或也可能在正常细胞表面表达时,这种CAR用于降低脱靶效应的可能性。CAR or other antigen-specific receptors may also be inhibitory CAR (e.g., iCAR), and include intracellular components that hinder or inhibit reactions (e.g., immune responses). Examples of such intracellular signaling components are found on immune checkpoint molecules, including PD-1, CTLA4, LAG3, BTLA, OX2R, TIM-3, TIGIT, LAIR-1, PGE2 receptors, EP2/4 adenosine receptors (including A2AR). In some aspects, engineered cells include inhibitory CAR, which includes the signaling domain of such inhibitory molecules or the signaling domain derived from such inhibitory molecules, so that it is used to hinder the reaction of cells. For example, when the antigen recognized by the activation receptor (e.g., CAR) is also expressed on the surface of normal cells or may also be expressed on the surface of normal cells, this CAR is used to reduce the possibility of off-target effects.
示例性的抗原受体(包括CAR和重组TCR)以及将该受体工程化和导入细胞的方法,包括例如在国际专利申请公开号WO200014257、WO2013126726、WO2012/129514、WO2014031687、WO2013/166321、WO2013/071154、WO2013/123061、WO2019157454、美国专利申请公开号US2002131960、US2013287748、US20130149337、美国专利号6,451,995、7,446,190、8,252,592、8,339,645、8,398,282、7,446,179、6,410,319、7,070,995、7,265,209、7,354,762、7,446,191、8,324,353和8,479,118,以及欧洲专利申请号EP2537416中描述的那些,和/或Sadelain et al.,Cancer Discov.2013 April;3(4):388-398;Davila et al.(2013)PLoS ONE 8(4):e61338、Turtle et al.,Curr.Opin.Immunol.,2012 October;24(5):633-39、Wu et al.,Cancer,2012 March 18(2):160-75中描述的那些。在一些方面,包括CAR的基因工程化抗原受体在美国专利号7,446,190和国际专利申请公开号WO/2014055668A1中描述的那些。Exemplary antigen receptors (including CARs and recombinant TCRs) and methods for engineering and introducing the receptors into cells, including, for example, those disclosed in International Patent Application Publication Nos. WO200014257, WO2013126726, WO2012/129514, WO2014031687, WO2013/166321, WO2013/071154, WO2013/123061, WO2019157454, U.S. Patent Application Publication Nos. US2002131960, US201328 7748, US20130149337, U.S. Patent Nos. 6,451,995, 7,446,190, 8,252,592, 8,339,645, 8,398,282, 7,446,179, 6,410,319, 7,070,995, 7,265,209, 7,354,762, 7,446,191, 8,324,353, and 8,479,118, and those described in European Patent Application No. EP2537416, and/or Sadelain et al., Cancer Discov. 2013 April; 3(4): 388-398; Davila et al. (2013) PLoS ONE 8(4): e61338, Turtle et al., Curr. Opin. Immunol., 2012 October; 24(5): 633-39, Wu et al., Cancer, 2012 March 18(2): 160-75. In some aspects, the genetically engineered antigen receptors comprising CAR are those described in U.S. Pat. No. 7,446,190 and International Patent Application Publication No. WO/2014055668A1.
本公开还包括编码前述的抗体、抗原结合片段或其衍生物、TCR和CAR的多核苷酸,以及包含所述多核苷酸的载体。The present disclosure also includes polynucleotides encoding the aforementioned antibodies, antigen-binding fragments or derivatives thereof, TCR and CAR, and vectors comprising the polynucleotides.
免疫细胞Immune Cells
本公开进一步包括免疫细胞,特别是靶向一种或多种如前所述的肿瘤新抗原肽的分离的免疫细胞。在更具体的实施方案中,本公开包括免疫细胞,特别是表达如前所定义的重组CAR或TCR的分离的免疫细胞。The present disclosure further includes immune cells, in particular isolated immune cells targeting one or more tumor neoantigen peptides as described above. In a more specific embodiment, the present disclosure includes immune cells, in particular isolated immune cells expressing recombinant CAR or TCR as defined above.
如本文所用,术语“免疫细胞”包括来源于造血系统并在免疫应答中发挥作用的细胞。免疫细胞包括淋巴细胞(如B细胞和T细胞)、自然杀伤细胞、髓细胞(如单核细胞、巨噬细胞、嗜酸细胞、肥大细胞、嗜碱细胞和粒细胞)。As used herein, the term "immune cell" includes cells derived from the hematopoietic system and that play a role in immune response. Immune cells include lymphocytes (such as B cells and T cells), natural killer cells, myeloid cells (such as monocytes, macrophages, eosinophils, mast cells, basophils and granulocytes).
如本文所用,术语“T细胞”包括携带T细胞受体(TCR)的细胞,特别是针对本文所公开的肿瘤新抗原肽的TCR。根据本公开的T细胞可选自下组:炎性T淋巴细胞、细胞毒性T淋巴细胞、调节性T淋巴细胞、粘膜相关恒定T细胞(MAIT)、YδT细胞、肿瘤浸润性淋巴细胞(TIL)或辅助性T淋巴细胞(包括1型和2型辅助性T细胞和Th17辅助性细胞)。在另一个实施方案中,所述细胞可来源于下组:CD4+T-淋巴细胞和CD8+T-淋巴细胞。所述免疫细胞可以来自健康的供体或癌症患者。在一些实施方案中,免疫细胞是同种异体或自体细胞。在一些实施方案中,免疫细胞选自T细胞、天然杀伤T细胞、CD4+/CD8+T细胞、TIL/肿瘤衍生的CD8T细胞、中央记忆CD8+T细胞、Treg、MAIT、YδT细胞、人胚胎干细胞和可分化出淋巴细胞的多能干细胞。As used herein, the term "T cell" includes cells carrying a T cell receptor (TCR), in particular a TCR for a tumor neoantigen peptide disclosed herein. T cells according to the present disclosure may be selected from the following groups: inflammatory T lymphocytes, cytotoxic T lymphocytes, regulatory T lymphocytes, mucosal-associated constant T cells (MAIT), YδT cells, tumor-infiltrating lymphocytes (TIL) or helper T lymphocytes (including type 1 and type 2 helper T cells and Th17 helper cells). In another embodiment, the cell may be derived from the following group: CD4+T-lymphocytes and CD8+T-lymphocytes. The immune cells may be from a healthy donor or a cancer patient. In some embodiments, the immune cells are allogeneic or autologous cells. In some embodiments, the immune cells are selected from T cells, natural killer T cells, CD4+/CD8+ T cells, TIL/tumor-derived CD8 T cells, central memory CD8+ T cells, Treg, MAIT, Yδ T cells, human embryonic stem cells, and pluripotent stem cells that can differentiate into lymphocytes.
免疫细胞可以从血液中提取或衍生自干细胞。干细胞可以是成人干细胞、胚胎干细胞,特别是非人类干细胞、脐带血干细胞、祖细胞、骨髓干细胞、诱导多能干细胞、全能干细胞或造血干细胞。代表性的人类细胞为CD34+细胞。Immune cells can be extracted from blood or derived from stem cells. Stem cells can be adult stem cells, embryonic stem cells, particularly non-human stem cells, umbilical cord blood stem cells, progenitor cells, bone marrow stem cells, induced pluripotent stem cells, totipotent stem cells or hematopoietic stem cells. Representative human cells are CD34+ cells.
T细胞可从多种非限制性来源获得,包括外周血单个核细胞、骨髓、淋巴结组织、脐带血、胸腺组织、感染部位组织、腹水、胸水、脾组织和肿瘤。在某些实施方案中,T细胞可以使用任何数量的本领域技术人员已知的技术(例如FICOLLTM分离)从收集自受试者的单位血液中获得。在一个实施方案中,通过单采获得受试者循环血液中的细胞。在某些实施方案中,T细胞从PBMC中分离。PBMC可从通过全血密度梯度离心获得的血沉棕黄层细胞中分离,例如通过LYMPHOPREPTM梯度、PERCOLLTM梯度或FICOLLTM梯度进行离心。T细胞可通过消耗单核细胞(例如通过使用CD14)从PBMC中分离。在一些实施方案中,可以在密度梯度离心之前裂解红细胞。T cells can be obtained from a variety of non-limiting sources, including peripheral blood mononuclear cells, bone marrow, lymph node tissue, umbilical cord blood, thymus tissue, infection site tissue, ascites, pleural effusion, spleen tissue and tumor. In certain embodiments, T cells can be obtained from a unit of blood collected from a subject using any number of techniques known to those skilled in the art (e.g., FICOLL TM separation). In one embodiment, cells in the subject's circulating blood are obtained by single sampling. In certain embodiments, T cells are separated from PBMC. PBMC can be separated from buffy coat cells obtained by whole blood density gradient centrifugation, for example, by LYMPHOPREP TM gradient, PERCOLL TM gradient or FICOLL TM gradient centrifugation. T cells can be obtained by consuming mononuclear cells (e.g., by using CD14 ) are isolated from PBMCs. In some embodiments, red blood cells can be lysed prior to density gradient centrifugation.
在另一个实施方案中,所述细胞可来自健康供体、被诊断为癌症的受试者。细胞可以是自体的或同种异体的。In another embodiment, the cells may be from a healthy donor, a subject diagnosed with cancer. The cells may be autologous or allogeneic.
在同种异体免疫细胞治疗中,从健康供体而非患者处收集免疫细胞。通常,这些通过HLA匹配,以降低移植物对抗宿主病的可能性。或者,不需要进行HLA匹配的通用“现成”产品包括被设计成减少移植物对抗宿主疾病的改变,例如破坏或去除TCRαβ受体。参见综述Graham et al.,Cells.2018 Oct;7(10):155。由于单个基因编码α链(TRAC),而非编码β链的两个基因,因此TRAC基因座通常是去除或破坏TCRαβ受体表达的典型靶标。或者,也可以表达TCRαβ信号的抑制剂,例如截短形式的CD3ζ可以用作TCR抑制分子。还采用破坏或去除HLA I类分子。例如,Torikai et al.,Blood.2013;122:1341-1349使用ZFN敲除HLA-A基因座,而Ren et al.,Clin.Cancer Res.2017;23:2255-2266敲除了HLA I类表达所需的β-2微球蛋白(B2M)。Ren等人同时敲除了TCRαβ、B2M和免疫检查点PD1。一般而言,免疫细胞被激活和扩增以用于过继细胞治疗。本文公开的免疫细胞可在体内或离体扩增。通常可使用本领域已知的方法激活和扩增免疫细胞,特别是T细胞。通常,T细胞通过接触表面上附着有刺激CD3/TCR复合物相关信号的试剂和刺激T细胞表面上共刺激分子的配体的抗原而扩增。In allogeneic immune cell therapy, immune cells are collected from healthy donors rather than patients. Typically, these are HLA matched to reduce the likelihood of graft-versus-host disease. Alternatively, generic "off-the-shelf" products that do not require HLA matching include changes designed to reduce graft-versus-host disease, such as disruption or removal of TCR α β receptors. See review Graham et al., Cells. 2018 Oct; 7(10): 155. Since a single gene encodes the α chain (TRAC), rather than two genes encoding the β chain, the TRAC locus is typically a typical target for removal or disruption of TCR α β receptor expression. Alternatively, inhibitors of TCR α β signaling can also be expressed, such as a truncated form of CD3ζ that can be used as a TCR inhibitory molecule. Destruction or removal of HLA class I molecules is also employed. For example, Torikai et al., Blood. 2013; 122: 1341-1349 used ZFN to knock out the HLA-A locus, while Ren et al., Clin. Cancer Res. 2017; 23: 2255-2266 knocked out beta-2 microglobulin (B2M) required for HLA class I expression. Ren et al. simultaneously knocked out TCRαβ, B2M, and immune checkpoint PD1. In general, immune cells are activated and expanded for adoptive cell therapy. The immune cells disclosed herein can be expanded in vivo or in vitro. Immune cells, particularly T cells, can generally be activated and expanded using methods known in the art. Typically, T cells are expanded by contacting an antigen attached to a surface having a reagent that stimulates CD3/TCR complex-related signals and a ligand that stimulates a co-stimulatory molecule on the surface of the T cell.
在本公开的一个实施方案中,免疫细胞可被修饰为定向先前定义的肿瘤新抗原肽。在特定实施方案中,所述免疫细胞可在其细胞表面表达定向所述新抗原肽的重组抗原受体。“重组”是指细胞未在其天然状态下编码的抗原受体,即它是异源的、非内源性的。因此,重组抗原受体的表达可被视为向免疫细胞引入新的抗原特异性,导致细胞识别并结合前述的肽。抗原受体可以从任何有用的来源中分离。在一些实施方案中,细胞包含通过基因工程化引入的编码一种或多种抗原受体的一种或多种核酸,其中所述抗原包含至少一种根据本公开的肿瘤新抗原肽。In one embodiment of the present disclosure, immune cells may be modified to target previously defined tumor neoantigen peptides. In specific embodiments, the immune cells may express recombinant antigen receptors on their cell surfaces that target the neoantigen peptides. "Recombinant" refers to an antigen receptor that is not encoded by the cell in its native state, i.e., it is heterologous and non-endogenous. Therefore, the expression of recombinant antigen receptors can be regarded as introducing new antigen specificity into immune cells, causing the cells to recognize and bind to the aforementioned peptides. Antigen receptors can be isolated from any useful source. In some embodiments, the cells contain one or more nucleic acids encoding one or more antigen receptors introduced by genetic engineering, wherein the antigen contains at least one tumor neoantigen peptide according to the present disclosure.
根据本公开的抗原受体为基因工程化T细胞受体(TCR)及其组分,以及功能性非TCR抗原受体,例如前述的嵌合抗原受体(CAR)。Antigen receptors according to the present disclosure are genetically engineered T cell receptors (TCR) and components thereof, as well as functional non-TCR antigen receptors, such as the aforementioned chimeric antigen receptors (CAR).
免疫细胞可被遗传修饰以表达重组抗原受体的方法在本领域中是众所周知的。可将编码抗原受体的核酸分子以例如载体或任何其他合适的核酸构建体的形式引入细胞。载体及其所需组分在本领域是众所周知的。编码抗原受体的核酸分子可以使用本领域已知的任何方法产生,例如使用PCR进行分子克隆。可以使用常用方法(例如定点诱变)对抗原受体序列进行修饰。Methods by which immune cells can be genetically modified to express recombinant antigen receptors are well known in the art. Nucleic acid molecules encoding antigen receptors can be introduced into cells in the form of, for example, vectors or any other suitable nucleic acid constructs. Vectors and components required therefor are well known in the art. Nucleic acid molecules encoding antigen receptors can be produced using any method known in the art, such as molecular cloning using PCR. Antigen receptor sequences can be modified using common methods (e.g., site-directed mutagenesis).
在本公开的一些实施方案中,免疫细胞是以下细胞:其中(a)SUV39h1基因失活,(b)抗原特异性受体是包含如前定义的异源(或重组)抗原结合结构域和天然TCR恒定域或其片段的经修饰的TCR,并且该抗原特异性受体能够激活CD3ζ多肽。例如,免疫细胞可进一步包含至少一种嵌合共刺激受体(CCR)和/或至少一种例如如前所定义的嵌合抗原受体。In some embodiments of the present disclosure, the immune cell is a cell in which (a) the SUV39h1 gene is inactivated, (b) the antigen-specific receptor is a modified TCR comprising a heterologous (or recombinant) antigen-binding domain and a natural TCR constant domain or a fragment thereof as defined above, and the antigen-specific receptor is capable of activating a CD3ζ polypeptide. For example, the immune cell may further comprise at least one chimeric costimulatory receptor (CCR) and/or at least one chimeric antigen receptor, for example, as defined above.
在相关方面,免疫细胞,尤其是同种异体的免疫细胞,可被设计用于减少移植物对抗宿主的疾病,从而使细胞包含失活的(例如被破坏或缺失的)TCRαβ受体。在这种情况下,编码HI-TCR的抗原结合结构域(通常如前所定义)的核酸被方便地插入免疫细胞的内源性TRAC基因座和/或TRBC基因座。HI-TCR核酸序列的插入或其他较小的突变可破坏或消除包含天然TCRα链和/或天然TCRβ链的TCR的内源性表达。插入或突变可使内源性TCR表达降低至少约50%、60%、70%、75%、80%、85%、90%、95%、98%或99%。由于单个基因编码α链(TRAC),而非编码β链的两个基因,TRAC基因座通常是降低TCRαβ受体表达的靶点。因此,编码抗原特异性受体的核酸(例如CAR或TCR)可整合入TRAC基因座的某个位置,优选第一外显子(SEQ ID NO:3)的5’区域,该位置显著降低了功能性TCRα链的表达。参见,例如,Jantz etal.,WO 2017/062451、Sadelain et al.,WO 2017/180989、Torikai et al,.Blood,119(2):5697-705(2012);Eyquem et al.,Nature.2017 Mar 2;543(7643):113-117。内源性TCR的表达可减少至少约50%、60%、70%、75%、80%、85%、90%、95%、98%或99%。在这些实施方案中,编码抗原特异性受体的核酸的表达任选地受内源性TCR-α或内源性TCR-β启动子的控制。In a related aspect, immune cells, particularly allogeneic immune cells, can be designed to reduce graft-versus-host disease so that the cells contain inactivated (e.g., destroyed or deleted) TCR α β receptors. In this case, nucleic acids encoding the antigen binding domains of HI-TCR (generally as defined above) are conveniently inserted into the endogenous TRAC loci and/or TRBC loci of immune cells. Insertion or other minor mutations of HI-TCR nucleic acid sequences can destroy or eliminate endogenous expression of TCRs containing natural TCR α chains and/or natural TCR β chains. Insertion or mutation can reduce endogenous TCR expression by at least about 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99%. Since a single gene encodes the α chain (TRAC), rather than two genes encoding the β chain, the TRAC locus is usually a target for reducing TCR α β receptor expression. Therefore, nucleic acid encoding an antigen-specific receptor (e.g., CAR or TCR) can be integrated into a position of the TRAC locus, preferably the 5' region of the first exon (SEQ ID NO: 3), which significantly reduces the expression of functional TCR alpha chain. See, for example, Jantz et al., WO 2017/062451, Sadelain et al., WO 2017/180989, Torikai et al., Blood, 119(2): 5697-705 (2012); Eyquem et al., Nature. 2017 Mar 2; 543(7643): 113-117. The expression of endogenous TCR can be reduced by at least about 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99%. In these embodiments, expression of the nucleic acid encoding the antigen-specific receptor is optionally under the control of the endogenous TCR-α or endogenous TCR-β promoter.
任选地,免疫细胞还包含具有单独活性ITAM结构域的经修饰的CD3,并且任选地,CD3可进一步包含一个或多个或两个或多个共刺激结构域。在一些实施方案中,CD3包含两个共刺激结构域,任选地为CD28和4-1BB。具有单独活性ITAM域的经修饰的CD3可包括例如其中ITAM2和ITAM3已失活或ITAM1和ITAM2已失活的经修饰的CD3ζ胞内信号传导结构域。在一些实施方案中,经修饰的CD3ζ多肽仅保留ITAM1,并且其余的CD3ζ域缺失(第90-164位残基)。作为另一个实例,ITAM1被ITAM3的氨基酸序列取代,并且其余的CD3ζ域缺失(第90-164位残基)。Optionally, the immune cell also includes a modified CD3 with a single active ITAM domain, and optionally, CD3 may further include one or more or two or more costimulatory domains. In some embodiments, CD3 includes two costimulatory domains, optionally CD28 and 4-1BB. The modified CD3 with a single active ITAM domain may include, for example, a modified CD3 ζ intracellular signaling domain in which ITAM2 and ITAM3 are inactivated or ITAM1 and ITAM2 are inactivated. In some embodiments, the modified CD3 ζ polypeptide retains only ITAM1, and the remaining CD3 ζ domains are missing (90-164 residues). As another example, ITAM1 is replaced by the amino acid sequence of ITAM3, and the remaining CD3 ζ domains are missing (90-164 residues).
本文公开的经修饰的免疫细胞可包含前述方面的两个或多个、或三个或多个、或四个或多个的组合。The modified immune cells disclosed herein may comprise a combination of two or more, or three or more, or four or more of the aforementioned aspects.
例如,经修饰的免疫细胞是如下免疫细胞:其中(a)抗原特异性受体是包含异源(或重组)抗原结合结构域(通常如前所定义)和天然TCR恒定域或其片段的经修饰的TCR,并且所述抗原特异性受体能够激活CD3ζ多肽,和/或所述抗原特异性受体是CAR,并且任选地(b)SUV39H1基因失活,并且任选地(c)免疫细胞包含具有单独活性ITAM结构域的经修饰的CD3,例如其中ITAM2和ITAM3已失活,并且任选地(d)TCR受内源性TRAC和/或TRBC启动子控制,并且任选地(e)天然TCR-α链和/或天然TCR-β链的表达被破坏或消除。在进一步的实施方案中,细胞可包含至少一种嵌合共刺激受体(CCR)。For example, the modified immune cell is an immune cell in which (a) the antigen-specific receptor is a modified TCR comprising a heterologous (or recombinant) antigen binding domain (generally as defined above) and a natural TCR constant domain or a fragment thereof, and the antigen-specific receptor is capable of activating a CD3 ζ polypeptide, and/or the antigen-specific receptor is a CAR, and optionally (b) the SUV39H1 gene is inactivated, and optionally (c) the immune cell comprises a modified CD3 with a separate active ITAM domain, for example, wherein ITAM2 and ITAM3 are inactivated, and optionally (d) the TCR is controlled by an endogenous TRAC and/or TRBC promoter, and optionally (e) the expression of a natural TCR-α chain and/or a natural TCR-β chain is destroyed or eliminated. In further embodiments, the cell may comprise at least one chimeric costimulatory receptor (CCR).
本公开还涉及提供免疫细胞,特别是靶向本文所公开的肿瘤新抗原肽的T细胞群体,特别是免疫细胞和特别是表达TCR的T细胞群体,特别是如前定义的HLA非依赖性TCR(HITCR)或CAR的方法。The present disclosure also relates to methods for providing immune cells, in particular T cell populations targeting the tumor neoantigen peptides disclosed herein, in particular immune cells and in particular T cell populations expressing TCRs, in particular HLA-independent TCRs (HITCRs) or CARs as defined above.
T细胞群体可包括CD8+T细胞、CD4+T细胞或CD8+和CD4+T细胞。The T cell population may include CD8+ T cells, CD4+ T cells, or CD8+ and CD4+ T cells.
根据本公开所产生的免疫细胞群体可富含对本公开的肿瘤新抗原肽或嵌合蛋白,特别是SEQ ID NO:1-8202中的任何一个跨膜嵌合蛋白具有特异性(即靶向性)的免疫细胞。也就是说,根据本公开产生的免疫细胞群体将具有增加的靶向一种或多种肿瘤新抗原肽(即富含靶向新抗原肽的克隆型)或一种或多种嵌合蛋白的免疫细胞数。例如,与从受试者分离的样品中的免疫细胞相比,本公开的免疫细胞群体将具有增加的靶向肿瘤新抗原肽或嵌合蛋白的免疫细胞数。也就是说,免疫细胞群体的组成将不同于“天然”免疫细胞群体(即未经历本文所讨论的鉴定和扩增步骤的群体),因为靶向肿瘤新抗原肽或嵌合蛋白的免疫细胞的百分比或比例将增加。The immune cell population generated according to the present disclosure can be enriched in immune cells that are specific (i.e., targeted) to the tumor neoantigen peptides or chimeric proteins of the present disclosure, in particular any one of the transmembrane chimeric proteins of SEQ ID NO: 1-8202. That is, the immune cell population generated according to the present disclosure will have an increased number of immune cells targeting one or more tumor neoantigen peptides (i.e., enriched in clonal types targeting neoantigen peptides) or one or more chimeric proteins. For example, compared to the immune cells in a sample isolated from a subject, the immune cell population of the present disclosure will have an increased number of immune cells targeting tumor neoantigen peptides or chimeric proteins. That is, the composition of the immune cell population will be different from that of a "natural" immune cell population (i.e., a population that has not undergone the identification and amplification steps discussed herein) because the percentage or proportion of immune cells targeting tumor neoantigen peptides or chimeric proteins will increase.
根据本公开的免疫细胞群体可具有至少约0.2、0.3、0.4、0.5、0.6、0.7、0.8、0.9、1、2、3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、25、30、35、40、45、50、55、60、65、70、75、80、85、90、95或100%的靶向本文所公开的肿瘤新抗原肽或嵌合蛋白的T细胞。例如,免疫细胞群体可具有约0.2%-5%、5%-10%、10-20%、20-30%、30-40%、40-50%、50-70%或70-100%的靶向本公开的肿瘤新抗原肽或嵌合蛋白的免疫细胞。According to the present disclosure, the immune cell population may have at least about 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95 or 100% of T cells targeting the tumor neoantigen peptides or chimeric proteins disclosed herein. For example, the immune cell population may have about 0.2%-5%, 5%-10%, 10-20%, 20-30%, 30-40%, 40-50%, 50-70% or 70-100% of immune cells targeting the tumor neoantigen peptides or chimeric proteins disclosed herein.
例如当那些细胞暴露于肿瘤新抗原肽或嵌合蛋白时,扩增的肿瘤新抗原肽/或嵌合蛋白反应性免疫细胞群体可能比未扩增的免疫细胞群体具有更高的活性。提及“活性”可代表免疫细胞群体对肿瘤新抗原肽(例如对应于用于扩增的肽的肽)或肿瘤新抗原肽的混合物或本文定义的嵌合蛋白(或其片段,通常是其胞外片段)或嵌合蛋白的混合物(或其片段,通常是其胞外片段)的再刺激的反应。用于测定反应的合适方法在本领域中是已知的。例如,可以测量细胞因子的产生(例如,可以测量IL2或IFNy的产生)。提及“更高的活性”包括例如1-5、5-10、10-20、20-50、50-100、100-500、500-1000倍的活性提高。在一个方面,活性可以高1000倍以上。For example, when those cells are exposed to tumor neoantigen peptides or chimeric proteins, the amplified tumor neoantigen peptide/or chimeric protein reactive immune cell population may have higher activity than the unamplified immune cell population. Reference to "activity" may represent the response of an immune cell population to restimulation of a tumor neoantigen peptide (e.g., a peptide corresponding to a peptide used for amplification) or a mixture of tumor neoantigen peptides or a chimeric protein as defined herein (or a fragment thereof, typically an extracellular fragment thereof) or a mixture of chimeric proteins (or a fragment thereof, typically an extracellular fragment thereof). Suitable methods for determining the response are known in the art. For example, the production of cytokines can be measured (e.g., the production of IL2 or IFNy can be measured). Reference to "higher activity" includes, for example, 1-5, 5-10, 10-20, 20-50, 50-100, 100-500, 500-1000 times the activity increase. In one aspect, the activity can be more than 1000 times higher.
在一个优选实施方案中,本公开提供了多个免疫细胞或免疫细胞群体(即一个以上),其中所述多个免疫细胞包含识别克隆性肿瘤新抗原肽的免疫细胞(特别是T细胞)和识别不同克隆性肿瘤新抗原肽的T细胞。因此,本公开提供了多个识别不同的克隆性肿瘤新抗原肽的免疫细胞,特别是T细胞。多个或群体中不同免疫细胞,特别是T细胞,可选择地具有识别相同肿瘤新抗原肽或嵌合蛋白的不同表位的不同TCR。In a preferred embodiment, the present disclosure provides a plurality of immune cells or immune cell populations (i.e., more than one), wherein the plurality of immune cells comprises immune cells (particularly T cells) that recognize clonal tumor neoantigen peptides and T cells that recognize different clonal tumor neoantigen peptides. Therefore, the present disclosure provides a plurality of immune cells, particularly T cells, that recognize different clonal tumor neoantigen peptides. Different immune cells, particularly T cells, in a plurality or population may optionally have different TCRs that recognize different epitopes of the same tumor neoantigen peptide or chimeric protein.
在一个优选实施方案中,被多个T细胞识别的克隆性肿瘤新抗原肽或嵌合蛋白或一种或多种嵌合蛋白的表位的数量为2至1000个。例如,识别的克隆性新抗原的数量可以是2、3、4、5、6、7、8、9、10、20、50、100、150、200、250、300、350、400、450、500、550、600、650、700、750、800、850、900、950或1000,优选2至100个。可能存在多个免疫细胞,特别是T细胞,其具有不同的TCR,但识别相同的克隆新抗原。In a preferred embodiment, the number of clonal tumor neoantigen peptides or chimeric proteins or epitopes of one or more chimeric proteins recognized by multiple T cells is 2 to 1000. For example, the number of recognized clonal neoantigens can be 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 50, 100, 150, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950 or 1000, preferably 2 to 100. There may be multiple immune cells, particularly T cells, that have different TCRs but recognize the same clonal neoantigens.
免疫细胞(特别是T细胞)群体可以全部或主要由CD8+T细胞组成,或者全部或主要由CD8+T细胞和CD4+T细胞的混合物组成,或者全部或主要由CD4+T细胞组成。The immune cell (particularly T cell) population may consist entirely or mainly of CD8+ T cells, or entirely or mainly of a mixture of CD8+ T cells and CD4+ T cells, or entirely or mainly of CD4+ T cells.
在特定实施方案中,T细胞群体由从癌症患者或健康供体分离的T细胞产生。例如,T细胞群体可由从携带肿瘤的患者分离的样品中的T细胞产生。样品可以是肿瘤样品、外周血样品或来自受试者其他组织的样品。In certain embodiments, the T cell colony is generated by T cells isolated from a cancer patient or a healthy donor. For example, the T cell colony can be generated by T cells in a sample isolated from a patient carrying a tumor. The sample can be a tumor sample, a peripheral blood sample, or a sample from other tissues of the subject.
在一个特定实施方案中,免疫细胞群体是从其中鉴定了肿瘤新抗原肽的肿瘤样品中产生的。换句话说,免疫细胞(特别是T细胞)群体从衍生自癌症患者的肿瘤的生物标本中分离。这种T细胞在本文中被称为“肿瘤浸润性淋巴细胞(TIL)”。In a specific embodiment, the immune cell population is generated from a tumor sample in which a tumor neoantigen peptide is identified. In other words, the immune cell (particularly T cell) population is isolated from a biological specimen derived from a tumor of a cancer patient. Such T cells are referred to herein as "tumor infiltrating lymphocytes (TIL)".
可以使用本领域熟知的方法分离T细胞。例如,T细胞可以从基于CD3、CD4或CD8表达的样品产生的单细胞悬液中纯化。T细胞可通过Ficoll-paque梯度从样品中富集。T cells can be isolated using methods well known in the art. For example, T cells can be purified from a single cell suspension generated from a sample based on CD3, CD4 or CD8 expression. T cells can be enriched from a sample using a Ficoll-paque gradient.
癌症治疗方法Cancer treatment methods
在任何实施方案中,本文所述的癌症治疗产品可用于抑制癌细胞增殖的方法。本文所述的癌症治疗产品也可用于治疗癌症、癌症患者或处于癌症风险的患者的癌症预防性治疗。In any embodiment, the cancer therapeutic products described herein can be used in methods of inhibiting cancer cell proliferation. The cancer therapeutic products described herein can also be used for the preventive treatment of cancer in patients with cancer or at risk of cancer.
可使用本文所述治疗方法进行治疗的癌症包括前述定义的任何实体或非实体肿瘤。根据本公开,特别感兴趣的是乳腺癌、黑色素瘤和肺癌。Cancers that may be treated using the methods of treatment described herein include any solid or non-solid tumor as defined above. Of particular interest in light of the present disclosure are breast cancer, melanoma, and lung cancer.
癌症还包括用其他化学治疗药物治疗难治性癌症。本文中使用的术语“难治性”指在用另一种化学治疗剂治疗后不显示或仅显示弱抗增殖反应(例如,不显示或仅显示弱抑制肿瘤生长)的癌症(和/或其转移)。这些癌症无法用其他化疗药物令人满意地治疗。难治性癌症不仅包括(i)在患者治疗期间一种或多种化疗药物已经失败的癌症,还包括(ii)可通过其他方式(例如活检和在存在化疗药物的情况下培养)显示为难治性癌症。Cancer also includes cancers that are refractory to treatment with other chemotherapeutic drugs. The term "refractory" as used herein refers to cancers (and/or their metastases) that do not show or only show a weak antiproliferative response (e.g., do not show or only show a weak inhibition of tumor growth) after treatment with another chemotherapeutic agent. These cancers cannot be satisfactorily treated with other chemotherapeutic drugs. Refractory cancers include not only (i) cancers in which one or more chemotherapeutic drugs have failed during patient treatment, but also (ii) cancers that can be shown to be refractory by other means (e.g., biopsy and culture in the presence of chemotherapeutic drugs).
本文所述的治疗也适用于之前未接受治疗的有此需要的患者的治疗。The treatments described herein are also applicable to the treatment of previously untreated patients in need thereof.
根据本公开的受试者通常是已被诊断患有癌症或处于发展癌症的风险中的有此需要的患者。受试者通常是人、狗、猫、马或需要肿瘤特异性免疫应答的任何动物。The subject according to the present disclosure is typically a patient in need thereof who has been diagnosed with cancer or is at risk of developing cancer. The subject is typically a human, dog, cat, horse, or any animal in need of a tumor-specific immune response.
本公开还涉及先前定义的用于受试者的癌症疫苗治疗或用于治疗受试者癌症的新抗原肽、APC群体、疫苗或免疫原性组合物、编码新抗原肽的多核苷酸或载体,其中所述肽结合所述受试者的至少一种MHC分子。The present disclosure also relates to the previously defined neoantigenic peptides, APC populations, vaccines or immunogenic compositions, polynucleotides encoding the neoantigenic peptides or vectors for use in cancer vaccine therapy or for treating cancer in a subject, wherein the peptide binds to at least one MHC molecule of the subject.
本公开还提供了一种治疗受试者癌症的方法,其包括向所述受试者施用治疗有效量的本文所述疫苗或免疫原性组合物以治疗受试者。该方法可额外包括鉴别患有癌症的受试者的步骤。The present disclosure also provides a method of treating cancer in a subject, comprising administering to the subject a therapeutically effective amount of the vaccine or immunogenic composition described herein to treat the subject. The method may additionally include the step of identifying a subject with cancer.
本公开还涉及一种治疗癌症的方法,其包括通过本文所述的方法产生抗体或其抗原结合片段,并以治疗有效量向患有癌症的受试者施用所述抗体或其抗原结合片段或表达所述抗体或其抗原结合片段的免疫细胞,以治疗所述受试者。The present disclosure also relates to a method for treating cancer, comprising producing an antibody or an antigen-binding fragment thereof by the method described herein, and administering the antibody or the antigen-binding fragment thereof or an immune cell expressing the antibody or the antigen-binding fragment thereof to a subject suffering from cancer in a therapeutically effective amount to treat the subject.
本公开还涉及针对如本文所述的跨膜嵌合蛋白,或通常与MHC或HLA分子结合的肿瘤新抗原肽的抗体(包括其变体和衍生物)、T细胞受体(TCR)(包括其变体和衍生物)、非HLA限制性TCR(HI TCR)或CAR(包括其变体和衍生物),用于受试者的癌症治疗。The present disclosure also relates to antibodies (including variants and derivatives thereof), T cell receptors (TCRs) (including variants and derivatives thereof), non-HLA restricted TCRs (HI TCRs) or CARs (including variants and derivatives thereof) directed against transmembrane chimeric proteins as described herein, or tumor neoantigen peptides that are normally bound to MHC or HLA molecules, for use in treating cancer in a subject.
在一些实施方案中,所述抗体、TCR(特别是非HLA限制性TCR)或CAR结合如本文定义的跨膜嵌合蛋白和特别是SEQ ID NO:1-8202中任何一个跨膜嵌合蛋白。通常,所述抗体TCR(特别是非HLA限制性TCR)或CAR包含如前定义的抗原结合结构域(其结合SEQ ID NO:1-8202中任何一个跨膜嵌合蛋白)。In some embodiments, the antibody, TCR (particularly non-HLA restricted TCR) or CAR binds to a transmembrane chimeric protein as defined herein and in particular any one of SEQ ID NOs: 1-8202. Typically, the antibody TCR (particularly non-HLA restricted TCR) or CAR comprises an antigen binding domain as defined above (which binds to any one of SEQ ID NOs: 1-8202).
本公开还涉及针对本文所述的肿瘤新抗原肽(通常与MHC或HLA分子结合)或针对本文所述的嵌合蛋白的抗体(包括其变体和衍生物)、T细胞受体(TCR)(包括其变体和衍生物)或CAR(包括其变体和衍生物),或靶向如前所定义的新抗原肽或嵌合蛋白的免疫细胞,用于受试者的过继细胞或CAR-T细胞治疗,其中肿瘤新抗原肽结合所述受试者的至少一种MHC分子。在一些实施方案中,所述抗体、TCR(特别是非HLA限制性TCR)或CAR结合如本文定义的跨膜嵌合蛋白和特别是SEQ ID NO:1-8202中任何一个跨膜嵌合蛋白。通常,所述抗体TCR(特别是非HLA限制性TCR)或CAR包含如前定义的抗原结合结构域(其结合SEQ ID NO:1-8202中任何一个跨膜嵌合蛋白)。因此,通常在一些实施方案中,免疫细胞靶向如本文定义的跨膜嵌合蛋白。通常,本领域技术人员能够选择结合并识别先前定义的肿瘤新抗原肽的合适的抗原受体,以使免疫细胞重新定向用于癌细胞治疗。在特定实施方案中,用于本公开方法的免疫细胞是重定向的T细胞,例如重定向的CD8+和/或CD4+T细胞。The present disclosure also relates to antibodies (including variants and derivatives thereof), T cell receptors (TCRs) (including variants and derivatives thereof) or CARs (including variants and derivatives thereof) directed against the tumor neoantigen peptides described herein (usually bound to MHC or HLA molecules) or against the chimeric proteins described herein, or immune cells targeting the neoantigen peptides or chimeric proteins as defined above, for adoptive cell or CAR-T cell therapy of a subject, wherein the tumor neoantigen peptides bind to at least one MHC molecule of the subject. In some embodiments, the antibody, TCR (particularly non-HLA restricted TCR) or CAR binds to a transmembrane chimeric protein as defined herein and in particular any one of SEQ ID NOs: 1-8202. Typically, the antibody TCR (particularly non-HLA restricted TCR) or CAR comprises an antigen binding domain as defined above (which binds to any one of SEQ ID NOs: 1-8202 transmembrane chimeric proteins). Therefore, typically in some embodiments, immune cells target transmembrane chimeric proteins as defined herein. Generally, one skilled in the art will be able to select suitable antigen receptors that bind and recognize previously defined tumor neoantigenic peptides to redirect immune cells for cancer cell therapy. In certain embodiments, the immune cells used in the disclosed methods are redirected T cells, such as redirected CD8+ and/or CD4+ T cells.
在一些实施方案中,将如上所述的癌症治疗、疫苗接种治疗和/或过继细胞癌症治疗与额外的癌症疗法组合施用。具体而言,根据本公开的T细胞组合物可与检查点阻断治疗、共刺激抗体、化疗和/或放疗、靶向治疗或单克隆抗体治疗组合施用。In some embodiments, the cancer treatment, vaccination therapy and/or adoptive cell cancer therapy described above is administered in combination with an additional cancer therapy. Specifically, the T cell composition according to the present disclosure can be administered in combination with checkpoint blockade therapy, co-stimulatory antibodies, chemotherapy and/or radiotherapy, targeted therapy or monoclonal antibody therapy.
检查点抑制物包括但不限于PD-1抑制物、PD-L1抑制物、Lag-3抑制物、Tim-3抑制物、TIGIT抑制物、BTLA抑制物、T细胞活化的V结构域Ig抑制因子(VISTA)抑制物和CTLA-4抑制物,例如IDO抑制物。共刺激抗体通过免疫调节受体(包括但不限于ICOS、CD137、CD27OX-40和GITR)递送阳性信号。在优选实施方案中,检查点抑制剂是CTLA-4抑制剂。Checkpoint inhibitors include, but are not limited to, PD-1 inhibitors, PD-L1 inhibitors, Lag-3 inhibitors, Tim-3 inhibitors, TIGIT inhibitors, BTLA inhibitors, V domain Ig inhibitors of T cell activation (VISTA) inhibitors and CTLA-4 inhibitors, such as IDO inhibitors. Co-stimulatory antibodies deliver positive signals through immunomodulatory receptors (including but not limited to ICOS, CD137, CD27OX-40 and GITR). In a preferred embodiment, the checkpoint inhibitor is a CTLA-4 inhibitor.
本文所用的化学治疗实体指对细胞具有破坏性的实体,即降低细胞活力的实体。化学治疗实体可以是细胞毒性药物。考虑的化学治疗剂包括但不限于烷化剂、蒽环类、埃博霉素、亚硝基脲、亚乙基亚胺/甲基三聚氰胺、烷基磺酸盐、烷化剂、抗代谢物、嘧啶类似物、表鬼臼毒素、酶,如L-门冬酰胺酶;生物反应调节剂,如IFNa、IL-2、G-CSF和GM-CSF;铂配位复合物,例如顺氯氨铂、奥沙利铂和卡铂;蒽二酮;取代的尿素,例如羟基脲;甲基联氨衍生物,包括N-甲基联氨(MIH)和丙卡巴嗪;肾上腺皮质抑制剂,例如米托坦(o,ρ’-DDD)和氨基谷氨酰胺;激素和拮抗剂,包括肾上腺皮质类固醇拮抗剂,例如泼尼松及其等同物、地塞米松和氨基谷氨酰胺;孕酮,例如羟孕酮、醋酸甲羟孕酮和醋酸甲地孕酮;雌激素,例如己烯雌酚和乙炔雌二醇等同物;抗雌激素,如他莫昔芬;雄激素,包括丙酸睾酮和氟甲氧基丁酮/等同物;抗雄激素,如氟他胺、促性腺激素释放激素类似物和亮丙瑞林;和非甾类抗雄激素类药物如氟他胺。The chemotherapeutic entity used herein refers to an entity that is destructive to cells, i.e., an entity that reduces cell viability. The chemotherapeutic entity can be a cytotoxic drug. The chemotherapeutic agents considered include, but are not limited to, alkylating agents, anthracyclines, epothilones, nitrosoureas, ethyleneimine/methylmelamine, alkyl sulfonates, alkylating agents, antimetabolites, pyrimidine analogs, epipodophyllotoxins, enzymes such as L-asparaginase; biological response modifiers such as IFNa, IL-2, G-CSF and GM-CSF; platinum coordination complexes such as cisplatin, oxaliplatin and carboplatin; anthracenediones; substituted ureas such as hydroxyurea; methylhydrazine derivatives, including N-methylhydrazine (MIH) and procarbazine; adrenocortical inhibitors , such as mitotane (o,ρ'-DDD) and aminoglutamine; hormones and antagonists, including adrenocortical steroid antagonists, such as prednisone and its equivalents, dexamethasone and aminoglutamine; progestins, such as hydroxyprogesterone, medroxyprogesterone acetate and megestrol acetate; estrogens, such as diethylstilbestrol and ethinyl estradiol equivalents; antiestrogens, such as tamoxifen; androgens, including testosterone propionate and flumethoxetine/equivalents; antiandrogens, such as flutamide, gonadotropin-releasing hormone analogs and leuprolide; and non-steroidal antiandrogens such as flutamide.
“组合”可指在根据本发明的T细胞组合物施用之前、同时或之后施用额外治疗。"In combination" may mean that the additional treatment is administered before, simultaneously with, or after the administration of the T cell composition according to the invention.
另外或作为与检查点阻断相结合的替代方案,本公开的T细胞组合物也可使用基因编辑技术(包括但不限于TALEN和Crispr/ca)进行基因修饰,以使其对免疫检查点具有抗性。这种方法在本领域中是已知的,例如参见US20140120622。基因编辑技术可用于阻止T细胞表达的免疫检查点的表达,包括但不限于PD-1、Lag-3、Tim-3、TIGIT、BTLA CTLA-4及其组合。本文讨论的T细胞可以通过任何这些方法进行修饰。In addition or as an alternative to checkpoint blockade, the T cell compositions disclosed herein may also be genetically modified using gene editing techniques (including but not limited to TALEN and Crispr/ca) to make them resistant to immune checkpoints. This method is known in the art, for example, see US20140120622. Gene editing techniques can be used to prevent the expression of immune checkpoints expressed by T cells, including but not limited to PD-1, Lag-3, Tim-3, TIGIT, BTLA CTLA-4 and combinations thereof. The T cells discussed herein can be modified by any of these methods.
根据本公开的T细胞也可经遗传修饰以表达增加归巢至肿瘤的分子和/或向肿瘤微环境递送炎性介质,包括但不限于细胞因子、可溶性免疫调节受体和/或配体。T cells according to the present disclosure may also be genetically modified to express molecules that increase homing to tumors and/or delivery of inflammatory mediators to the tumor microenvironment, including but not limited to cytokines, soluble immunomodulatory receptors and/or ligands.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1:通过根据本发明的计算机模拟方法,在小鼠肿瘤B16F10-OVA细胞系(A)和MCA101-OVA细胞系(B)中鉴定,并在两个细胞系(C)中均鉴定对MHC等位基因的预测亲和力小于500nM的肿瘤新抗原肽(或TE衍生的表位)。Figure 1: Tumor neoantigen peptides (or TE-derived epitopes) with predicted affinity to MHC alleles less than 500 nM were identified in the mouse tumor B16F10-OVA cell line (A) and the MCA101-OVA cell line (B) by the computer simulation method according to the present invention, and in both cell lines (C).
图2:(A)小鼠肿瘤B16F10-OVA和MCA101-OVA细胞系的cDNA中扩增编码新抗原肽N25的融合转录物序列的RT-PCR凝胶。(B)小鼠肿瘤B16F10、B16F10-OVA和MCA101-OVA细胞系的cDNA中扩增编码新抗原肽N26的融合转录物序列的RT-PCR凝胶。Figure 2: (A) RT-PCR gel of the fusion transcript sequence encoding the new antigen peptide N25 amplified from the cDNA of the mouse tumor B16F10-OVA and MCA101-OVA cell lines. (B) RT-PCR gel of the fusion transcript sequence encoding the new antigen peptide N26 amplified from the cDNA of the mouse tumor B16F10, B16F10-OVA and MCA101-OVA cell lines.
图3:(A)用ELISPOT检测用DMSO(阴性对照)、OVA(卵清蛋白)(阳性对照)、N25肽或N26肽免疫的动物的腹股沟淋巴结中的肽反应性IFNg分泌细胞。(B)用DMSO(阴性对照)、SIINFEKL(阳性对照)、N25或N26肽免疫动物的10^5个细胞的IFNg斑点。Figure 3: (A) ELISPOT detection of peptide-reactive IFNg secreting cells in inguinal lymph nodes of animals immunized with DMSO (negative control), OVA (ovalbumin) (positive control), N25 peptide or N26 peptide. (B) IFNg spots of 10^5 cells of animals immunized with DMSO (negative control), SIINFEKL (positive control), N25 or N26 peptide.
图4:(A)在将肿瘤细胞B16F10-OVA注射到所述经免疫的小鼠后的几天,预先用DMSO、OVA或N25L肽免疫的小鼠的肿瘤体积(mm3)的发展。(B)在将肿瘤细胞B16F10-OVA注射到所述经免疫的小鼠中后的几天后,预先用DMSO、OVA或N26L肽免疫的小鼠的肿瘤体积(mm3)的发展。Figure 4: (A) Development of tumor volume (mm 3 ) of mice previously immunized with DMSO, OVA or N25L peptide a few days after injection of tumor cells B16F10-OVA into said immunized mice. (B) Development of tumor volume (mm 3 ) of mice previously immunized with DMSO, OVA or N26L peptide a few days after injection of tumor cells B16F10-OVA into said immunized mice.
图5:通过用于鉴定融合转录序列编码的肿瘤新抗原肽的方法,分析了784个管腔、100个HER2+、197个TNBC、112个正常乳腺组织、516个原发性肺腺癌(原发性肿瘤)和59个正常肺组织(实体组织正常)的TCGA数据集。(A)不同亚型的乳腺癌(HER2+、TNBC、正常乳腺组织和管腔)中融合转录物序列(TE-外显子融合)的数量。(B)不同亚型的肺癌(原发性肺腺癌、正常肺组织)中融合转录物序列(TE-外显子融合)的数量。Figure 5: TCGA datasets of 784 luminal, 100 HER2+, 197 TNBC, 112 normal breast tissues, 516 primary lung adenocarcinomas (primary tumors), and 59 normal lung tissues (solid tissue normal) were analyzed by the method for identifying tumor neoantigenic peptides encoded by fusion transcript sequences. (A) The number of fusion transcript sequences (TE-exon fusions) in different subtypes of breast cancer (HER2+, TNBC, normal breast tissue, and luminal). (B) The number of fusion transcript sequences (TE-exon fusions) in different subtypes of lung cancer (primary lung adenocarcinoma, normal lung tissue).
图6:通过计算机模拟测试从每个样品的TE基因融合产物中预测的8-9个氨基酸长的肽与同一样品中表达的预测的HLA等位基因的结合。显示了针对每个样品中至少一个HLA-A、-B或-C等位基因的预测亲和力低于500nM的肽。(A)不同亚型的乳腺癌样品(HER2+、TNBC、正常乳腺组织和管腔)。(B)不同亚型的肺癌样品(非小细胞肺癌、正常肺组织)。Figure 6: 8-9 amino acid long peptides predicted from TE gene fusion products of each sample were tested by computer simulation for binding to predicted HLA alleles expressed in the same sample. Peptides with predicted affinity less than 500 nM for at least one HLA-A, -B or -C allele in each sample are shown. (A) Breast cancer samples of different subtypes (HER2+, TNBC, normal breast tissue and luminal). (B) Lung cancer samples of different subtypes (non-small cell lung cancer, normal lung tissue).
图7:每个患者的肿瘤特异性肽在乳腺肿瘤亚型中的分布。(A)显示了每种亚型的乳腺癌患者的肿瘤特异性HLA结合肽的数量。(B)管腔亚型样品中共享的预测肿瘤新抗原肽的数量(n=784)(横坐标)。(C)HER2+亚型样品中共享的预测肿瘤新抗原肽的数量(n=100)(横坐标)。(D)TNBC亚型样品中共享的预测肿瘤新抗原肽的数量(n=197)(横坐标)。Figure 7: Distribution of tumor-specific peptides per patient in breast tumor subtypes. (A) The number of tumor-specific HLA binding peptides for breast cancer patients of each subtype is shown. (B) The number of shared predicted tumor neoantigen peptides in luminal subtype samples (n=784) (abscissa). (C) The number of shared predicted tumor neoantigen peptides in HER2+ subtype samples (n=100) (abscissa). (D) The number of shared predicted tumor neoantigen peptides in TNBC subtype samples (n=197) (abscissa).
图8:(A)每个原发性肺腺癌(LUAD)样品(肺癌)中肿瘤特异性HLA结合肽的数量。(B)每个患者的肿瘤特异性肽在肺腺癌中的分布。原发性肿瘤亚型样品中共享的预测肿瘤新抗原肽的数量(n=516)(横坐标)。Figure 8: (A) Number of tumor-specific HLA binding peptides in each primary lung adenocarcinoma (LUAD) sample (lung cancer). (B) Distribution of tumor-specific peptides in lung adenocarcinoma per patient. Number of predicted tumor neoantigen peptides shared among primary tumor subtype samples (n=516) (abscissa).
图9:当供体为外显子(A)且供体为TE(B)时融合核苷酸序列的重建。Figure 9: Reconstruction of fusion nucleotide sequences when the donor is an exon (A) and the donor is a TE (B).
图10:嵌合转录物衍生肽与HLA-A2的结合。使用四聚体形成测定通过流式细胞术验证最高频率的嵌合融合中预测的肽与HLA-A2等位基因的结合。结果显示为相对于阳性对照的结合百分比。虚线表示被视为确认与该等位基因结合的阈值。Figure 10: Binding of chimeric transcript-derived peptides to HLA-A2. Binding of the predicted peptides in the highest frequency chimeric fusions to the HLA-A2 allele was verified by flow cytometry using a tetramer formation assay. Results are shown as percent binding relative to the positive control. The dashed line indicates the threshold that is considered confirmed binding to this allele.
图11:ER衍生肽与HLA-A2分子的结合。用于合成的嵌合转录物衍生肽的肽-HLA-A*02:01复合体形成。显示复合体形成相对于阳性对照的百分比(CMV pp65495-503)。Melan-A的突变(MelA Mut)和非突变(MelA)序列分别用作强结合肽和弱结合肽对照。“阴性”表示染色背景。虚线表示将肽视为与HLA-A*0201良好结合所需的最低复合体形成值(阳性对照的50%)。Figure 11: Binding of ER-derived peptides to HLA-A2 molecules. Peptide-HLA-A*02:01 complex formation for synthetic chimeric transcript-derived peptides. The percentage of complex formation relative to the positive control (CMV pp65495-503) is shown. Mutated (MelA Mut) and non-mutated (MelA) sequences of Melan-A were used as strong binding peptide and weak binding peptide controls, respectively. "Negative" indicates staining background. The dotted line indicates the minimum complex formation value (50% of the positive control) required for the peptide to be considered as binding well to HLA-A*0201.
图12:融合转录物衍生肽和反应性CD8+T细胞产生的免疫原性。(A)在使用6个不同健康供体的体外免疫原性测定中,pJET(融合转录物衍生肽)特异性四聚体阳性CD8+T细胞从6个不同健康供体扩增的频率。(B)用不同浓度的特异性肽刺激后CTL克隆的细胞因子分泌。右侧列出了生成的CTL克隆及其肽特异性。(C)针对CTL-克隆9与装载2种不同肽浓度的靶细胞以及抗MHC-1抗体或同种型对照(左图)或与不同比例的未装载靶细胞(右图)共培养的杀伤测定。(D)当CTL克隆9、80和64与未装载肽的靶细胞和抗MHCI-I抗体或同种型对照共培养时,对其进行杀伤测定。效应子:靶比例显示在每个单独的小图中。H1650用作该图各小图的靶细胞。Figure 12: Immunogenicity of fusion transcript-derived peptides and reactive CD8+T cell production. (A) Frequency of pJET (fusion transcript-derived peptide)-specific tetramer-positive CD8+T cells expanded from 6 different healthy donors in an in vitro immunogenicity assay using 6 different healthy donors. (B) Cytokine secretion of CTL clones after stimulation with different concentrations of specific peptides. The generated CTL clones and their peptide specificity are listed on the right. (C) Killing assay for CTL-clone 9 co-cultured with target cells loaded with 2 different peptide concentrations and anti-MHC-1 antibodies or isotype controls (left figure) or with different ratios of unloaded target cells (right figure). (D) Killing assays were performed when CTL clones 9, 80 and 64 were co-cultured with target cells unloaded with peptides and anti-MHCI-I antibodies or isotype controls. Effector: target ratios are shown in each individual panel. H1650 was used as the target cell for each panel of this figure.
图13:识别融合衍生肽的TCR的表达。将由CTL克隆9衍生的TCR序列转导的Jurkat报告细胞单独与靶细胞共培养,或装载2种不同浓度的肽。小图中显示了通过流式细胞术评估的3个报告基因的Jurkat阳性细胞百分比,使用H1650细胞系作为靶细胞(上图)或H1395细胞系作为靶细胞(下图)。阴性对照:未转导Jurkat细胞。无肽:转导Jurkat细胞与未装载肽的靶细胞共培养。阳性对照:用PMA/离子霉素刺激的转导Jurkat细胞。Figure 13: Expression of TCRs that recognize fusion-derived peptides. Jurkat reporter cells transduced with TCR sequences derived from CTL clone 9 were co-cultured with target cells alone or loaded with 2 different concentrations of peptides. The inset shows the percentage of Jurkat-positive cells for the 3 reporter genes assessed by flow cytometry, using the H1650 cell line as target cells (upper panel) or the H1395 cell line as target cells (lower panel). Negative control: Jurkat cells were not transduced. No peptide: Transduced Jurkat cells were co-cultured with target cells that were not loaded with peptides. Positive control: Transduced Jurkat cells stimulated with PMA/ionomycin.
图14:A.在与装载相关/特异性或无关/不相关肽(Melan-A)的靶细胞共培养后,激活用CTL克隆衍生的TCR转导的Jurkat细胞,其识别嵌合转录物衍生肽。B.在存在或不存在抗MHC-I阻断抗体(W6/32)或同种型对照的情况下,在与装载相关肽或不相关肽(Melan-A)的靶细胞共培养后,激活用CTL克隆衍生的TCR转导的Jurkat细胞,其识别嵌合转录物衍生肽。使用PMA/紫罗霉素作为激活的阳性对照,并且使用未装载肽的靶细胞作为激活的阴性对照。以H1395LUAD细胞系为靶细胞。每来自CTL-克隆的每个TCR来源均显示在顶部,括号内为肽特异性,显示了每一个这些TCR识别的嵌合转录物衍生肽的氨基酸序列。该肽序列是在每种情况下用于装载靶细胞的特异性/相关肽。Melan-A和MelA Mut均指不相关肽(ELAGIGILTV)。Figure 14: A. After co-culturing with target cells loaded with relevant/specific or irrelevant/unrelated peptides (Melan-A), Jurkat cells transduced with TCRs derived from CTL clones were activated, which recognized chimeric transcript-derived peptides. B. In the presence or absence of anti-MHC-I blocking antibodies (W6/32) or isotype controls, after co-culturing with target cells loaded with relevant or unrelated peptides (Melan-A), Jurkat cells transduced with TCRs derived from CTL clones were activated, which recognized chimeric transcript-derived peptides. PMA/viomycin was used as a positive control for activation, and target cells not loaded with peptides were used as negative controls for activation. H1395LUAD cell lines were used as target cells. Each TCR source from CTL-clones is shown at the top, with peptide specificity in brackets, showing the amino acid sequence of the chimeric transcript-derived peptides recognized by each of these TCRs. The peptide sequence is the specific/relevant peptide used to load the target cells in each case. Melan-A and MelA Mut both refer to unrelated peptides (ELAGIGILTV).
图15:识别融合转录物衍生肽的肿瘤浸润性淋巴细胞。在融合转录物衍生肽的混合物+IL2(A)的存在下或仅用IL-2(B)扩增的针对肿瘤浸润性淋巴细胞(TIL)中发现的指定的融合转录物衍生肽的四聚体阳性CD8T细胞的百分比。Figure 15: Tumor infiltrating lymphocytes recognizing fusion transcript-derived peptides. Percentage of tetramer-positive CD8 T cells for the indicated fusion transcript-derived peptides found in tumor infiltrating lymphocytes (TILs) expanded in the presence of a mixture of fusion transcript-derived peptides + IL2 (A) or with IL-2 alone (B).
图16:识别LUAD患者衍生样品中融合转录物衍生肽的CD8+T细胞表型。识别LUAD患者2(A,上图)和患者3(B,上图)的肿瘤、近肿瘤、淋巴结和血液样品中存在的融合转录物衍生肽的四聚体阳性CD8T细胞的百分比。在图(A)和(B)的下图中,显示了四聚体阳性亲代细胞群体中初始(CCR 7+CD45+)、中央记忆(CM、CCR7+CD45-)、效应记忆(EM、CCR7-CD45-)和末端效应(TE、CCR7-CD45+)细胞的百分比。Figure 16: Recognize the CD8+T cell phenotype of fusion transcript-derived peptides in LUAD patient-derived samples. Recognize the percentage of tetramer-positive CD8T cells of fusion transcript-derived peptides present in tumors, near tumors, lymph nodes and blood samples of LUAD patients 2 (A, upper figure) and 3 (B, upper figure). In the figure below (A) and (B), the percentage of initial (CCR 7+CD45+), central memory (CM, CCR7+CD45-), effector memory (EM, CCR7-CD45-) and terminal effector (TE, CCR7-CD45+) cells in tetramer-positive parental cell populations is shown.
图17:A.总结了CD8+T细胞识别离体发现的、不含T细胞扩增的嵌合转录物衍生肽的频率的热度图。仅显示在至少一个组织中发现的肽特异性(总评估患者=4)。B.在对患者2和患者5进行离体染色后,在A中总结的四聚体阳性细胞中CCR7和CD45RA的百分比。(1号病人无可用数据)。C.总结了在5名受分析患者的肿瘤、近肿瘤或肿瘤引流LN样品中的CD8+T细胞上体外扩增后识别嵌合转录物衍生肽的特定四聚体阳性细胞的热度图。仅显示了在至少一个组织中发现的肽特异性。黑色方框突出显示了在同一组织和患者的离体发现的肽特异性。Figure 17: A. Heat map summarizing the frequency of CD8+ T cells recognizing chimeric transcript-derived peptides found ex vivo without T cell expansion. Only peptide specificities found in at least one tissue are shown (total patients evaluated = 4). B. Percentages of CCR7 and CD45RA in tetramer-positive cells summarized in A after ex vivo staining for patients 2 and 5. (No data available for patient 1). C. Heat map summarizing specific tetramer-positive cells recognizing chimeric transcript-derived peptides after in vitro expansion on CD8+ T cells in tumor, near-tumor, or tumor-draining LN samples from the 5 analyzed patients. Only peptide specificities found in at least one tissue are shown. Black boxes highlight peptide specificities found ex vivo in the same tissue and patient.
图18:肺部肿瘤样品的免疫肽组学分析。在公开的MHC-I免疫肽体数据集中检索融合转录物衍生肽序列。每列代表不同的样品。每行代表不同的肽序列(在右侧明确说明)。填色方框表示在哪个样品中发现了每个融合转录物衍生肽。描述每个样品数据集的出版物均在顶部注释。Figure 18: Immunopeptidomic analysis of lung tumor samples. Fusion transcript-derived peptide sequences were searched in the public MHC-I immunopeptidome dataset. Each column represents a different sample. Each row represents a different peptide sequence (explicitly stated on the right). The filled boxes indicate in which sample each fusion transcript-derived peptide was found. The publication describing each sample dataset is annotated at the top.
图19:显示未转染阴性对照(左)和ABHD1-JET转染条件(右)的FACS直方图。红色直方图对应于抗Myc染色,并且灰色线显示非抗体(缓冲液)条件。Figure 19: FACS histograms showing untransfected negative control (left) and ABHD1-JET transfection conditions (right). The red histogram corresponds to anti-Myc staining and the grey line shows non-antibody (buffer) conditions.
具体实施方式Detailed ways
1.实施例1:融合转录物序列编码的肿瘤新抗原肽的鉴定1. Example 1: Identification of tumor neoantigen peptides encoded by fusion transcript sequences
1.1在小鼠中进行概念证明1.1 Proof of concept in mice
为了检测由融合转录物序列产生的单个和共享的肿瘤新抗原肽,开发了生物信息管道。该管道被设计成鉴定由TE序列的一部分和外显子序列的一部分组成的肿瘤特异性mRNA序列。该管道意味着确定MHC等位基因。对于每个人样品,可以使用seq2hla(v2.2)工具(bitbucket.org/sebastian_BOEgel/seq2hla)确定I类和II类MHC等位基因。对于小鼠模型,鼠H-2等位基因通常是众所周知的。生物信息学方法包括将来自RNA测序的转录物与参考基因组进行比对。本文描述的概念证明分析中,mm10用于小鼠,并且hg19用于人。可以使用不同版本的组装基因组,例如hg19、hg38、mm9或mm10。该比对通过STAR(v2.5.3a)(github.com/alexdobin/STAR)进行,设置如下:In order to detect single and shared tumor neoantigenic peptides generated by fusion transcript sequences, a bioinformatics pipeline was developed. The pipeline is designed to identify tumor-specific mRNA sequences consisting of a portion of the TE sequence and a portion of the exon sequence. The pipeline is meant to determine MHC alleles. For each human sample, the seq2hla (v2.2) tool (bitbucket.org/sebastian_BOEgel/seq2hla) can be used to determine class I and class II MHC alleles. For mouse models, the mouse H-2 allele is generally well known. Bioinformatics methods include aligning transcripts from RNA sequencing with a reference genome. In the proof-of-concept analysis described in this article, mm10 was used for mice and hg19 was used for humans. Different versions of assembled genomes can be used, such as hg19, hg38, mm9, or mm10. The alignment was performed using STAR (v2.5.3a) (github.com/alexdobin/STAR) and the settings are as follows:
-为允许多次击中比对,将设置基因座最大数量和允许比对读长的参数outFilterMultimapNmax设置为1000,并且- To allow for multiple hit alignments, the parameter outFilterMultimapNmax, which sets the maximum number of loci and allowed alignment read length, was set to 1000, and
-为了检测异常连接(融合),将设置融合段最小长度的参数chimSegmentMin设置为10,将设置融合连接最小悬垂量的参数chimJunctionoverhangMin设置为10。- To detect abnormal connections (fusions), the parameter chimSegmentMin, which sets the minimum length of a fused segment, was set to 10, and the parameter chimJunctionoverhangMin, which sets the minimum overhang of a fused junction, was set to 10.
正常(来自SJ.out.tab输出文件)和异常(来自Chimeric.out.joint输出文件)连接使用Ensembl和repeatmasker数据库进行注释。正常连接定义了与对比所用参数相匹配的所有连接(最大内含子长度≤1000000bp(由-alignItronMax设置)、相同的染色体和良好定向),并且异常连接是与至少一个前述标准不匹配的连接。这意味着TE/外显子连接可以是两种连接类型,但外显子/外显子连接必须在正常文件中(SJ.out.tab)。在计算机模拟中提取包含TE序列和外显子序列之间连接的转录物序列。从转录物序列中与连接重叠的区域,或离框(阅读框架非经典)时连接的下游,软件在所有阅读框中预测所有可能的8或9-mers的肽。然后,通过netMHCpan(v3.4)(cbs.dtu.dk/services/NetMHCpan/)确定所有这些可能的肽对先前为匹配样品定义的MHC等位基因的结合亲和力。目前有十几种不同的预测算法用于预测肽的结合亲和力,其中NetMHC是新抗原预测管道中使用最广泛和最有效的算法。Normal (from SJ.out.tab output file) and abnormal (from Chimeric.out.joint output file) connections were annotated using Ensembl and repeatmasker databases. Normal connections define all connections that match the parameters used for the alignment (maximum intron length ≤ 1000000 bp (set by -alignItronMax), same chromosome and good orientation), and abnormal connections are connections that do not match at least one of the aforementioned criteria. This means that TE/exon connections can be both connection types, but exon/exon connections must be in the normal file (SJ.out.tab). Transcript sequences containing connections between TE sequences and exon sequences were extracted in silico. From regions in the transcript sequence that overlap with the connection, or downstream of the connection when out-of-frame (reading frame non-canonical), the software predicts peptides of all possible 8 or 9-mers in all reading frames. Then, the binding affinity of all these possible peptides to the MHC alleles previously defined for the matched samples was determined by netMHCpan (v3.4) (cbs.dtu.dk/services/NetMHCpan/). There are currently more than a dozen different prediction algorithms used to predict peptide binding affinity, among which NetMHC is the most widely used and effective algorithm in neoantigen prediction pipelines.
低于500nM或百分位数轶低于2%的肽被视为潜在的新抗原。每个剪接位点(供体或受体)均被独特地标注为TE或外显子。5’端部分为合格的“供体”,3’端部分为合格的“受体”。Peptides with a p-value below 500 nM or a percentile below 2% were considered potential neoantigens. Each splice site (donor or acceptor) was uniquely annotated as TE or exon. The 5' end was classified as a qualified "donor" and the 3' end as a qualified "acceptor".
癌症和正常组织之间共享的预测的HLA结合肽排除在进一步分析之外。Predicted HLA-binding peptides shared between cancer and normal tissues were excluded from further analysis.
该方法已应用于从7种充分表征的鼠肿瘤细胞系(B16F10、B16F10-OVA、MCA101、MCA101-OVA、MC38、MC38-GFP、MC38-GFP-OVA)获得的RNAseq数据。具有延伸OVA的细胞系对应于相同的模型,但进一步表达卵清蛋白。在本研究中,该细胞系被认为是类似的模型,也就是说,例如,在来自B16F10-OVA的细胞系上进行的测定被认为是在来自B16F10的细胞系上进行的测定的重复。The method has been applied to RNAseq data obtained from seven well-characterized murine tumor cell lines (B16F10, B16F10-OVA, MCA101, MCA101-OVA, MC38, MC38-GFP, MC38-GFP-OVA). The cell line with extended OVA corresponds to the same model, but further expresses ovalbumin. In this study, the cell lines were considered as similar models, that is, for example, an assay performed on a cell line from B16F10-OVA was considered a duplicate of an assay performed on a cell line from B16F10.
利用这些参数获得了候选肽的列表(图1A、1B和1C),其中一些特异于特定的细胞系(图1A和1B),并且另一些在两种肿瘤细胞系之间共享(图1C)。Using these parameters, a list of candidate peptides was obtained ( FIGS. 1A , 1B, and 1C ), some of which were specific to a particular cell line ( FIGS. 1A and 1B ) and others that were shared between the two tumor cell lines ( FIG. 1C ).
为了验证,我们选择在B16F10-OVA或MCA101-OVA中表达的一系列肽,其中预测的亲和力低于500nM。选择肽并尝试优化读长的数量与预测的针对MHC-1的亲和力之间的比例。For validation, we selected a series of peptides expressed in B16F10-OVA or MCA101-OVA for which the predicted affinity was below 500 nM. Peptides were selected and an attempt was made to optimize the ratio between the number of reads and the predicted affinity against MHC-1.
选择四种预测的肿瘤新抗原肽,并通过鉴定TE和外显子序列对其进行表征(表2)。Four predicted tumor neoantigenic peptides were selected and characterized by identifying TE and exon sequences (Table 2).
表2:通过该方法选择的4种预测的肿瘤新抗原肽的表征Table 2: Characterization of 4 predicted tumor neoantigenic peptides selected by this method
1.2通过RT-PCR验证融合转录物序列1.2 Verification of fusion transcript sequence by RT-PCR
首先,使用一对引物(其中一个引物在TE序列中,另一个引物在外显子序列中)通过常规RT-PCR进行验证。First, verification was performed by conventional RT-PCR using a pair of primers, one of which was in the TE sequence and the other in the exon sequence.
为了提取和逆转录RNA,将3-5.106个细胞裂解在500μL Trizol中,并在离心前将100μL酚-氯仿添加到裂解物中。收集水相,与100%EtOH以1∶1的比例混合,并转移至RNAeasy minikit柱。然后按照制造商的说明(包括DNAse柱上处理)收集RNA。在RNA洗脱后,根据制造商的说明,通过使用Turbo DNAse(Fisher scientific)进一步去除DNA污染物。使用nanodrop测量RNA浓度,并使用1μg RNA进行逆转录。根据制造商的说明,使用oligodT(15)作为引物,使用Superscript III(Life technologies)进行第一链合成。引物从Eurogentec订购。使用Taq聚合酶进行PCR反应。在鉴定各反应的最佳条件后,从琼脂糖凝胶中提取PCR产物,并使用GATC lightrun进行测序。用APE软件检查序列比对。To extract and reverse transcribe RNA, 3-5.10 6 cells were lysed in 500 μL Trizol and 100 μL phenol-chloroform was added to the lysate before centrifugation. The aqueous phase was collected, mixed with 100% EtOH in a 1:1 ratio, and transferred to an RNAeasy minikit column. RNA was then collected according to the manufacturer's instructions (including DNAse column treatment). After RNA elution, DNA contaminants were further removed by using Turbo DNAse (Fisher scientific) according to the manufacturer's instructions. RNA concentration was measured using nanodrop, and 1 μg RNA was used for reverse transcription. First-strand synthesis was performed using Superscript III (Life technologies) using oligodT (15) as a primer according to the manufacturer's instructions. Primers were ordered from Eurogentec. PCR reactions were performed using Taq polymerase. After identifying the optimal conditions for each reaction, PCR products were extracted from agarose gels and sequenced using GATC lightrun. Sequence alignment was checked using APE software.
使用该方法,在表2中鉴定的细胞系中分别检测到与N25、N26、N90和N94的预测大小相匹配的条带(参见图2A的N25)。有趣的是,尽管如前所述在管道中,在计算机模拟中,通过RNAseq仅在MCA和MC38细胞中检测到N26,但使用RT-PCR,我们在B16F10-OVA细胞中检测到对应于N26的条带(图2B),表明该序列在三个独立的肿瘤细胞系(MCA、MC38和B16F10)之间共享。通过重新分析RNAseq数据,我们发现N26连接存在于B16F10-OVA细胞中,但低于算法的检测阈值。此外,RT-PCR产物的测序显示与算法预测的序列完全匹配。Using this method, bands matching the predicted sizes of N25, N26, N90, and N94 were detected in the cell lines identified in Table 2, respectively (see N25 in Figure 2A). Interestingly, although N26 was only detected in MCA and MC38 cells by RNAseq in silico as described in the pipeline, using RT-PCR, we detected a band corresponding to N26 in B16F10-OVA cells (Figure 2B), indicating that this sequence is shared between three independent tumor cell lines (MCA, MC38, and B16F10). By reanalyzing the RNAseq data, we found that the N26 junction was present in B16F10-OVA cells, but below the detection threshold of the algorithm. In addition, sequencing of the RT-PCR product showed an exact match to the sequence predicted by the algorithm.
1.3小鼠体内免疫1.3 Immunization of mice
为了在体内验证这些候选物,合成了对应于与MHC I类序列结合的新抗原肽的短(9-mers)肽。对于体内试验,合成了长(27-mers)肽,其包括预测的9-mers的MHC结合短肽的侧翼区域,因为该长度更适合于体内免疫。将B16F10OVA和MCA101-OVA在RPMI、谷氨酰胺、10%FCS、1%盘尼西林-链霉素中保存,并使用TrypLE来传代。将细胞在培养物中最多保存一个月,并且每次体内实验均解冻新的小瓶。将100μg长肽(N25L或N26L)、SIINFEKL肽(短OVA肽)、OVA(Sigma)或DMSO(每个含50μg polyI:C)通过皮下注入侧腹的方式免疫C57BL6J受体小鼠。初始免疫后7天重复免疫。3天后(初始免疫后10天),处死动物,用ELISPOT检测腹股沟淋巴结中肽特异性分泌IFNg的CD8T细胞数(图3A)。10μg.mL-1的短肽(N25、N26或SIINFEKL)或DMSO用于再刺激T细胞。或者,在二次免疫后7天,在动物皮下注射在PBS中的2.5.105个B16F10-OVA或5.105个MCA-OVA细胞。我们发现N25和更少程度的N26能够诱导免疫应答(图3B)。To validate these candidates in vivo, short (9-mers) peptides corresponding to neoantigenic peptides that bind to MHC class I sequences were synthesized. For in vivo experiments, long (27-mers) peptides were synthesized that included flanking regions of the predicted 9-mer MHC-binding short peptides, as this length is more suitable for in vivo immunization. B16F10OVA and MCA101-OVA were maintained in RPMI, glutamine, 10% FCS, 1% penicillin-streptomycin and passaged using TrypLE. Cells were maintained in culture for up to one month, and new vials were thawed for each in vivo experiment. C57BL6J recipient mice were immunized with 100 μg of long peptide (N25L or N26L), SIINFEKL peptide (short OVA peptide), OVA (Sigma), or DMSO (each containing 50 μg polyI:C) by subcutaneous injection into the flank. Immunizations were repeated 7 days after the initial immunization. After 3 days (10 days after the initial immunization), the animals were sacrificed and the number of peptide-specific IFNg-secreting CD8 T cells in the inguinal lymph nodes was detected by ELISPOT (Figure 3A). 10μg.mL -1 of short peptides (N25, N26 or SIINFEKL) or DMSO was used to re-stimulate T cells. Alternatively, 7 days after the secondary immunization, the animals were subcutaneously injected with 2.5.10 5 B16F10-OVA or 5.10 5 MCA-OVA cells in PBS. We found that N25 and, to a lesser extent, N26 were able to induce an immune response (Figure 3B).
1.4小鼠肿瘤的体内治疗1.4 In vivo treatment of mouse tumors
为了测试这些肽是否对肿瘤细胞具有保护作用,我们在第0天和第7天用100mg肽N25L或N26L,或OVA(对照肽)和PBS中的50μg polyI:C免疫C57BL6小鼠,并在第14天向用OVA、N25L和N26L免疫的小鼠注射2.5.105个B16F10-OVA细胞。将B16F10 OVA和MCA101-OVA在RPMI、谷氨酰胺、10%FCS、1%盘尼西林-链霉素中保存,并使用TrypLE来传代。将细胞在培养物中最多保存一个月,并且每次体内实验均解冻新的小瓶。将100μg长肽(N25L或N26L)、OVA(Sigma)或DMSO(每个含50μg polyI:C)通过皮下注入侧腹的方式进行免疫。初始免疫后7天重复免疫。To test whether these peptides have a protective effect on tumor cells, we immunized C57BL6 mice with 100 mg of peptide N25L or N26L, or OVA (control peptide) and 50 μg polyI:C in PBS on days 0 and 7, and injected 2.5.10 5 B16F10-OVA cells into mice immunized with OVA, N25L and N26L on day 14. B16F10 OVA and MCA101-OVA were maintained in RPMI, glutamine, 10% FCS, 1% penicillin-streptomycin and passaged using TrypLE. Cells were maintained in culture for up to one month and a new vial was thawed for each in vivo experiment. Immunizations were performed by subcutaneous injection into the flank with 100 μg of long peptide (N25L or N26L), OVA (Sigma) or DMSO (each containing 50 μg polyI:C). Immunizations were repeated 7 days after the initial immunization.
10μg.mL-1的短肽(N25、N26或SIINFEKL)或DMSO用于再刺激T细胞。或者,在二次免疫后7天,在动物皮下注射2.5.105个B16F10-OVA或5.105个MCA-OVA细胞。使用手动卡尺每周两次测量肿瘤大小,并在整个实验时间框架内监测动物健康状况(图4A和图4B)。当肿瘤体积达到1mm3时,处死动物。引人注目的是,我们观察到N25L以比OVA更有效的方式显著延迟了B16OVA肿瘤的形成。此外,我们在N26L免疫后获得了类似的结果。10 μg.mL -1 of short peptides (N25, N26 or SIINFEKL) or DMSO was used to re-stimulate T cells. Alternatively, 7 days after the secondary immunization, the animals were subcutaneously injected with 2.5.10 5 B16F10-OVA or 5.10 5 MCA-OVA cells. Tumor size was measured twice a week using a manual caliper, and animal health was monitored throughout the experimental time frame (Figures 4A and 4B). Animals were sacrificed when the tumor volume reached 1 mm 3. Strikingly, we observed that N25L significantly delayed the formation of B16OVA tumors in a more effective manner than OVA. In addition, we obtained similar results after N26L immunization.
2.实施例2:鉴定源自融合转录物(由TE元件和外显子序列组成)的人肺腺癌新抗原肽2. Example 2: Identification of human lung adenocarcinoma neoantigenic peptides derived from fusion transcripts (consisting of TE elements and exon sequences)
2.1材料和方法2.1 Materials and methods
RNA提取。将肿瘤和邻近组织的样品切割成1mm3的碎片,并重悬于补充有1%的β-巯基乙醇的700μl的RTL裂解缓冲液(Quiagen)中,并使用Perecellys 24组织均质器(Bertin Technoogies)均质化。按照制造商的说明,使用RNeasy Micro Kit(Qiagen)进行总RNA分离。使用相同的流程从5.106个肿瘤细胞系中提取肿瘤细胞系的总RNA。RNA extraction. Samples of tumors and adjacent tissues were cut into 1 mm 3 fragments and resuspended in 700 μl of RTL lysis buffer (Quiagen) supplemented with 1% β-mercaptoethanol and homogenized using a Perecellys 24 tissue homogenizer (Bertin Technoogies). Total RNA was isolated using the RNeasy Micro Kit (Qiagen) according to the manufacturer's instructions. Total RNA of tumor cell lines was extracted from 5.10 6 tumor cell lines using the same process.
PCR和测序。使用APE软件设计引物。对于每个样品,按照供货商的指示,使用SuperScript III逆转录酶(ThermoFisher)将1μg的RNA逆转录成cDNA。使用GoTaq G2热启动聚合酶(Promega)进行PCR反应。所有引物的使用浓度均为0.5μM。反应在VeritiTM 96孔热循环仪(ThermoFisher)中进行。将PCR产物装载入LabChip GX(Caliper LifeSciences),并由LabChip GX软件(v4.2)进行分析。PCR and sequencing. Primers were designed using APE software. For each sample, 1 μg of RNA was reverse transcribed into cDNA using SuperScript III reverse transcriptase (ThermoFisher) according to the supplier's instructions. PCR reactions were performed using GoTaq G2 hot start polymerase (Promega). All primers were used at a concentration of 0.5 μM. Reactions were performed in a VeritiTM 96-well thermal cycler (ThermoFisher). PCR products were loaded into LabChip GX (Caliper LifeSciences) and analyzed by LabChip GX software (v4.2).
对于具有预期大小的扩增产物的样品,重复进行PCR反应。然后,将PCR产物在2%琼脂糖凝胶无SYBR染料(1/10000)(Invitrogen)中运行。按照制造商的说明,使用QIAquick凝胶提取试剂盒(Qiagen)切割特异性条带并纯化DNA产品。最后,通过EuroFinsScientific对这些产品进行测序。使用Serial Cloner软件将所得序列与预期序列进行比较。For samples with amplified products of expected size, PCR reactions were repeated. Then, PCR products were run in 2% agarose gel without SYBR dye (1/10000) (Invitrogen). According to the manufacturer's instructions, QIAquick gel extraction kit (Qiagen) was used to cut specific bands and purify DNA products. Finally, these products were sequenced by EuroFinsScientific. Serial Cloner software was used to compare the resulting sequence with the expected sequence.
四聚体形成。HLA-A2单体购自并按照制造商的说明,用合成的ER衍生肽评估四聚体的形成。简而言之,将合成的HLA-A2单体与合成肽在18℃下孵育48小时。通过将单体与生物素化的琼脂糖进一步孵育完成四聚化。最后,使用PE缀合的抗β2-微球蛋白抗体通过流式细胞术测量四聚体的形成。作为阳性对照,我们使用了制造商提供的来源于CMV的肽。Tetramer formation. HLA-A2 monomers were purchased from The formation of tetramers was assessed with synthetic ER-derived peptides following the manufacturer’s instructions. Briefly, synthetic HLA-A2 monomers were incubated with synthetic peptides at 18 °C for 48 h. Tetramerization was completed by further incubation of the monomers with biotinylated agarose. Finally, tetramer formation was measured by flow cytometry using PE-conjugated anti-β2-microglobulin antibodies. As a positive control, we used a peptide derived from CMV provided by the manufacturer.
在评估特异性CD8+T细胞存在的实验中,四聚化步骤通过将单体与荧光链霉亲和素的不同组合(PE、APC、PE-Cy5、PE-CF594、BV421、BV711和FITC)孵育来进行。In experiments evaluating the presence of specific CD8+ T cells, the tetramerization step was performed by incubating the monomers with different combinations of fluorescent streptavidin (PE, APC, PE-Cy5, PE-CF594, BV421, BV711, and FITC).
引发天然CTL。通过Ficoll梯度分离从HLA-A2+健康献血者中获得PBMC。将CD14+、CD4+和CD8+细胞通过使用磁珠(Miltenyi Biotec)通过阳性选择进行纯化。当CD4+和CD8+T细胞冷冻保存直至实验日时,在IL-4(50ng/mL)和GM-CSF(10ng/mL)存在下培养106个细胞/mL的CD14+级分5天以获得moDC。在这段时间后,将moDC用LPS成熟,并与最终浓度为1μg/mL的合成ER衍生肽孵育2小时。最后,在补充有IL-2(10U/ml)和IL-7(100ng/ml)的培养基中,将装载肽的moDC与自体的CD4+和CD8+T细胞共同培养。ER衍生肽对特异性CD8+CTL群体的刺激通过流式细胞术的MHC-I四聚体染色进行评估,每种肽使用两种颜色的四聚体组合。Initiate natural CTL. PBMCs were obtained from HLA-A2+ healthy blood donors by Ficoll gradient separation. CD14+, CD4+ and CD8+ cells were purified by positive selection using magnetic beads (Miltenyi Biotec). When CD4+ and CD8+ T cells were cryopreserved until the day of the experiment, 10 6 cells/mL of CD14+ fractions were cultured for 5 days in the presence of IL-4 (50ng/mL) and GM-CSF (10ng/mL) to obtain moDC. After this period of time, moDC were matured with LPS and incubated with synthetic ER-derived peptides at a final concentration of 1μg/mL for 2 hours. Finally, peptide-loaded moDC were co-cultured with autologous CD4+ and CD8+ T cells in culture medium supplemented with IL-2 (10U/ml) and IL-7 (100ng/ml). Stimulation of specific CD8 + CTL populations by ER-derived peptides was assessed by MHC-I tetramer staining by flow cytometry using a two-color tetramer combination for each peptide.
四聚体染色。将细胞重悬于PBS中,在4℃下用活/死Aqua-405nm(ThermoFisher)染色20分钟,并洗涤一次。此后,将细胞重悬于含有SA偶联的四聚体混合物的PBS-1%BSA中,并在室温下于黑暗中孵育20分钟。无需进一步洗涤,将表面抗体添加到PBS-1%BSA中,并在4℃下于黑暗中孵育细胞20分钟。表面抗体是抗CD3-BV650+抗CD8-PECy7的组合,如有需要再与抗CCR7-AF700+抗CD45RA-BUV395组合。最后,将细胞洗涤两次并重悬于FACS缓冲液中进行流式细胞分析。Tetramer staining. The cells were resuspended in PBS, stained with live/dead Aqua-405nm (ThermoFisher) for 20 minutes at 4°C, and washed once. Thereafter, the cells were resuspended in PBS-1% BSA containing a mixture of SA-coupled tetramers and incubated in the dark for 20 minutes at room temperature. Without further washing, the surface antibodies were added to PBS-1% BSA and the cells were incubated in the dark at 4°C for 20 minutes. The surface antibodies were a combination of anti-CD3-BV650 + anti-CD8-PECy7, combined with anti-CCR7-AF700 + anti-CD45RA-BUV395 if necessary. Finally, the cells were washed twice and resuspended in FACS buffer for flow cytometry analysis.
CTL-克隆产生。将四聚体阳性细胞在U形底96孔板中进行单细胞FACS(ARIA-sorter,BD)分选。将分选的细胞收集在100μl含有150,000个饲养细胞的RPMI 10%人血清AB(Sigma-Aldrich)中。最后,添加100μl含有IL-2(3000IU/ml)和抗CD3(100μg/ml,来自Miltenyi的OKT3克隆)的AIM-培养基,并且将细胞培养最多15-20天。当在孔中观察到明显的细胞生长时,我们用新的饲养细胞进行第二轮扩增,最多持续15天。在此期间,细胞在必要时用相同的但仅含有500IU/ml的IL-2的培养基(AIM-RPMI 50/50+5%人血清)饲养和裂解。最后,通过FACs-四聚体染色检查扩增克隆的特异性,并且仅使用四聚体阳性克隆超过85%的克隆进行进一步分析。CTL-clone generation. Tetramer-positive cells are sorted by single-cell FACS (ARIA-sorter, BD) in a U-shaped bottom 96-well plate. The sorted cells are collected in 100 μl of RPMI 10% human serum AB (Sigma-Aldrich) containing 150,000 feeder cells. Finally, 100 μl of AIM-medium containing IL-2 (3000 IU/ml) and anti-CD3 (100 μg/ml, OKT3 clone from Miltenyi) is added, and the cells are cultured for up to 15-20 days. When obvious cell growth is observed in the wells, we use new feeder cells for a second round of expansion, which lasts up to 15 days. During this period, cells are fed and lysed with the same medium (AIM-RPMI 50/50+5% human serum) containing only 500 IU/ml of IL-2 when necessary. Finally, the specificity of the amplified clones was checked by FACs-tetramer staining, and only clones with more than 85% tetramer-positive clones were used for further analysis.
杀伤测定。为进行杀伤测定,使用了xCELLigence RTCA S16实时细胞分析仪。将H1650细胞系以0,5x106个细胞/ml的速度在预涂布的16孔板中进行铺板。一天后,将细胞与不同浓度的相应的合成肽孵育1小时,或不进行孵育。此后,用培养基洗涤细胞两次,并用相同浓度的抗MHC-I抗体(W6/32克隆,50μg/孔)或同种型对照再孵育30分钟,或不进行孵育。在不进行额外洗涤的情况下,以相应比例添加CTL-克隆。完整测定在最终体积为200的无血清培养基中于37℃下并连接至xCELLigence系统完成。实时测量阻抗变化(细胞指数)40小时。每种条件均进行重复。Killing assay. For the killing assay, the xCELLigence RTCA S16 real-time cell analyzer was used. The H1650 cell line was plated at 0,5x10 6 cells/ml in pre-coated 16-well plates. One day later, the cells were incubated for 1 hour with different concentrations of the corresponding synthetic peptides or without incubation. Thereafter, the cells were washed twice with medium and incubated for another 30 minutes with the same concentration of anti-MHC-I antibodies (W6/32 clone, 50 μg/well) or isotype controls or without incubation. Without additional washing, CTL-clones were added in the corresponding ratios. The complete assay was done in a final volume of 200 μl of serum-free medium at 37°C and connected to the xCELLigence system. The impedance changes (cell index) were measured in real time for 40 hours. Each condition was repeated.
细胞因子分泌和Jurkat细胞激活。将50.000个H1650细胞在96孔平板中在补充有5%胎牛血清的培养基中进行铺板。第二天,将细胞与不同终浓度的合成肽孵育1-2小时。之后,将细胞洗涤两次,以1∶1的比例添加CTL-克隆,并与装载肽的靶细胞共同培养18小时。按照制造商的说明,收集培养物上清液并通过细胞因子珠阵列(CBA、BD Biosciences)分析细胞因子浓度。Cytokine secretion and Jurkat cell activation. 50.000 H1650 cells were plated in 96-well plates in a medium supplemented with 5% fetal bovine serum. The next day, cells were incubated with synthetic peptides at different final concentrations for 1-2 hours. Afterwards, cells were washed twice, CTL-clones were added at a ratio of 1:1, and co-cultured with target cells loaded with peptides for 18 hours. Culture supernatants were collected and cytokine concentrations were analyzed by cytokine bead array (CBA, BD Biosciences) according to the manufacturer's instructions.
使用转导的Jurkat细胞而非CTL克隆和两种不同类型的靶细胞(H1650和H1395细胞系)进行了相同的实验。在本测定中,在与装载肽的靶细胞共培养后,通过流式细胞术分析报告标记物的表达来评估Jurkat细胞。使用PMA/离子霉素作为阳性对照来激活Jurkat细胞。The same experiment was performed using transduced Jurkat cells instead of CTL clones and two different types of target cells (H1650 and H1395 cell lines). In this assay, Jurkat cells were evaluated by flow cytometry analysis of the expression of reporter markers after co-culture with peptide-loaded target cells. PMA/ionomycin was used as a positive control to activate Jurkat cells.
组织和血液样品。将肺肿瘤、近肿瘤和淋巴结样品切割成小块,并在终体积为2ml的培养基(CO2独立培养基+5)中使用I型胶原酶(2mg/ml)、透明质酸酶(2mg/ml)和DNasa(25μg/ml)的混合物于37℃下进行消化40分钟。消化后,通过细胞过滤器收集单细胞悬液并进行洗涤。肿瘤和近肿瘤悬液通过ficoll梯度在淋巴细胞级分上富集。之后,将细胞染色用于如前所述的FACs进行四聚体分析。Tissue and blood samples. Lung tumors, near tumors and lymph node samples were cut into small pieces and digested for 40 minutes at 37°C using a mixture of type I collagenase (2 mg/ml), hyaluronidase (2 mg/ml) and DNasa (25 μg/ml) in a final volume of 2 ml of culture medium ( CO2 independent culture medium +5). After digestion, single cell suspensions were collected and washed by cell strainers. Tumor and near tumor suspensions were enriched in lymphocyte fractions by ficoll gradients. Afterwards, cells were stained for tetramer analysis using FACs as described above.
将血液样品接种在ficoll梯度上,并分离出PBMC。之后,使用EasyStep人CD8+T细胞富集试剂盒(STEMCELL Technologies)对PBMC进行CD8+T细胞富集。最后,将富集的细胞染色用于如上所述的四聚体分析。Blood samples were plated on a ficoll gradient and PBMCs were isolated. PBMCs were then enriched for CD8+ T cells using the EasyStep Human CD8+ T Cell Enrichment Kit (STEMCELL Technologies). Finally, the enriched cells were stained for tetramer analysis as described above.
肿瘤浸润性淋巴细胞(TIL)培养物。将肿瘤组织切割成小块(1-3mm3大小,最多6-12块)。将每个肿瘤片段从24孔平板转移至单孔中,并在终体积为2ml的RPMI 10%人血清+IL-26000IU/ml中培养。根据需要在15-20天期间对细胞进行饲养/裂解,并冷冻保存或分析四聚体染色。Tumor infiltrating lymphocyte (TIL) culture. Tumor tissue was cut into small pieces (1-3 mm 3 size, 6-12 pieces at most). Each tumor fragment was transferred from a 24-well plate to a single well and cultured in RPMI 10% human serum + IL-26000 IU/ml in a final volume of 2 ml. Cells were fed/lysed during 15-20 days as needed and frozen for storage or analyzed for tetramer staining.
TCR克隆。从CTL-克隆中提取总RNA,并使用SuperScript III(ThermoFisher)将其逆转录成cDNA。如Li et al.2019所述,TCRα和β通过PCR进行扩增。DNA产物在2%琼脂糖凝胶中运行,并提取凝胶条带(Qiagen)后进行测序。将TCR V区(α和β)与鼠TCR恒定链连接,克隆到PEW-PEF1A-inatEGFP载体中,并在转化细菌中扩增。TCR cloning. Total RNA was extracted from CTL-clones and reverse transcribed into cDNA using SuperScript III (ThermoFisher). TCRα and β were amplified by PCR as described by Li et al. 2019. DNA products were run in a 2% agarose gel and sequenced after extraction of gel bands (Qiagen). TCR V regions (α and β) were ligated to the mouse TCR constant chain, cloned into the PEW-PEF1A-inatEGFP vector, and amplified in transformed bacteria.
Jurkat转导。将慢病毒颗粒由用表达TCR质粒与包膜(pvSVG)和包装(psPAX2)质粒一起转染的HEK-293FT细胞系产生。64小时后,收集上清液,并使用100kDa离心过滤器(Sigma-Aldrich)浓缩慢病毒颗粒。将慢病毒悬液通过旋转接种转移至表达报告基因的TCR阴性Jurkat细胞(NFAT-GPF、NF-KB-CFP和AP-1-mCherry)。5天后,使用抗鼠TCR-β抗体(克隆H57-597)通过FACS评估转导效率。Rosskopf S.et al.2018中描述了该Jurkat细胞。Jurkat transduction. Lentiviral particles were produced by a HEK-293FT cell line transfected with a TCR-expressing plasmid together with an envelope (pvSVG) and packaging (psPAX2) plasmid. After 64 hours, the supernatant was collected and the lentiviral particles were concentrated using a 100 kDa centrifugal filter (Sigma-Aldrich). The lentiviral suspension was transferred to TCR-negative Jurkat cells expressing reporter genes (NFAT-GPF, NF-KB-CFP, and AP-1-mCherry) by spin inoculation. After 5 days, the transduction efficiency was assessed by FACS using an anti-mouse TCR-β antibody (clone H57-597). The Jurkat cells are described in Rosskopf S.et al.2018.
质谱数据分析。使用ProteomeDiscoverer 1.4(ThermoFisher)分析源自MHC-洗脱的肽的公共免疫肽组学原始数据,参数如下:无酶、肽长度8-15aa、前体质量耐受20ppm和片段质量耐受性0.02Da。甲硫氨酸可作为可变修饰,并且应用了1%的错误发现率(FDR)。针对来自Uniprot/SwissProt(2020年3月6日更新)的人蛋白质组,结合来自肺TCGA项目的所有融合转录物衍生的蛋白列表,检索MS/MS谱。最后,与Uniprot数据库或肺正常样品中存在的翻译的融合转录物匹配的肽被舍弃。Mass spectrometry data analysis. Public immunopeptidomics raw data derived from MHC-eluted peptides were analyzed using ProteomeDiscoverer 1.4 (ThermoFisher) with the following parameters: no enzyme, peptide length 8-15aa, precursor mass tolerance 20ppm and fragment mass tolerance 0.02Da. Methionine can be used as a variable modification, and a 1% false discovery rate (FDR) was applied. MS/MS spectra were retrieved for the human proteome from Uniprot/SwissProt (updated on March 6, 2020), combined with a list of proteins derived from all fusion transcripts from the lung TCGA project. Finally, peptides matching translated fusion transcripts present in the Uniprot database or normal lung samples were discarded.
2.2结果:鉴定人受试者中编码肿瘤新抗原肽的融合转录物序列2.2 Results: Identification of fusion transcript sequences encoding tumor neoantigenic peptides in human subjects
2.2.1新抗原的表征2.2.1 Characterization of neoantigens
首先,在来自TCGA公共数据库的正常样品中表征TE-外显子融合转录物。在来自19个不同组织(胆管、膀胱、脑、乳房、子宫颈、结肠、头颈、肾、肝、胰腺、PCPG、前列腺、直肠、肉瘤、皮肤、胸腺、甲状腺、子宫)的679个正常样品中,共鉴定了8876例独特融合。发现每种组织类型的特异性融合,其中只有极少量的泛组织融合转录物。这些结果表明,一种专门的组织特异性调节机制与这些融合转录物相关。First, TE-exon fusion transcripts were characterized in normal samples from the TCGA public database. In 679 normal samples from 19 different tissues (bile duct, bladder, brain, breast, cervix, colon, head and neck, kidney, liver, pancreas, PCPG, prostate, rectum, sarcoma, skin, thymus, thyroid, uterus), a total of 8876 unique fusions were identified. Specific fusions of each tissue type were found, with only a very small amount of pan-tissue fusion transcripts. These results show that a special tissue-specific regulatory mechanism is associated with these fusion transcripts.
然后将来自TCGA的514份LUAD样品中鉴定的融合数与TCGA的59份正常相关肺样品进行比较。平均而言,在NSCLC样品中鉴定到235例融合,而在健康肺样品中鉴定到200例融合(Wilcoxon p值=9x10-10)。在NSCLC肿瘤中共鉴定了8269例独特融合。The number of fusions identified in 514 LUAD samples from TCGA was then compared to 59 normal-related lung samples from TCGA. On average, 235 fusions were identified in NSCLC samples, while 200 fusions were identified in healthy lung samples (Wilcoxon p-value = 9x10-10 ). A total of 8269 unique fusions were identified in NSCLC tumors.
获得的第一类融合称为TSF(肿瘤特异性融合),其在至少1%的肿瘤样品中发现,而在任何正常样品中均未发现。210例融合因此被定义为TSF。The first type of fusion obtained was called TSF (tumor-specific fusion), which was found in at least 1% of tumor samples and not in any normal samples. 210 fusions were therefore defined as TSF.
肿瘤中的一些高频融合转录物和正常细胞中的低频融合转录物也可能是新抗原的良好候选物。因此,第二类称为TAF(肿瘤相关融合)的融合被特别定义为在低于4%的正常组织,特别是低于2%的正常组织和超过10%的肿瘤中存在的融合,并且与正常组织样品相比,在肿瘤中过度表达。Some high-frequency fusion transcripts in tumors and low-frequency fusion transcripts in normal cells may also be good candidates for neoantigens. Therefore, the second category of fusions called TAFs (tumor-associated fusions) are specifically defined as fusions present in less than 4% of normal tissues, especially less than 2% of normal tissues and more than 10% of tumors, and are overexpressed in tumors compared with normal tissue samples.
融合序列:Fusion sequence:
-为了重建融合核苷酸序列,从Ensembl HG19人组装数据库中提取了位于染色体“供体_染色体_X”上从“供体_起始_X”至链“供体_链_X”上的“供体_断点_X”的供体的序列以及位于染色体“受体_染色体_X”上从“受体_断点_X”至链“受体_链_X”上的“受体_结束_X”的受体的序列。需要注意的是,Ensembl HG19人数据库的使用并非限制性的,其他适应性的数据库也可使用,例如NCBI参考序列数据库(RefSeq)。- To reconstruct the fusion nucleotide sequence, the sequence of the donor located on chromosome "donor_chromosome_X" from "donor_start_X" to "donor_breakpoint_X" on strand "donor_strand_X" and the sequence of the acceptor located on chromosome "acceptor_chromosome_X" from "acceptor_breakpoint_X" to "acceptor_end_X" on strand "acceptor_strand_X" were extracted from the Ensembl HG19 human assembly database. It should be noted that the use of the Ensembl HG19 human database is not restrictive, and other suitable databases may also be used, such as the NCBI Reference Sequence Database (RefSeq).
-如果融合出现在负链上,应注意取序列的反向补体。- If the fusion occurs on the minus strand, care should be taken to take the reverse complement of the sequence.
-“融合序列”由供体序列和随后的受体序列组成。- A "fusion sequence" consists of a donor sequence followed by an acceptor sequence.
融合转录物的核苷酸序列:Nucleotide sequence of the fusion transcript:
在已知的涉及外显子的经典转录物的基础上,对所有“融合转录物”进行了重建。All “fusion transcripts” were reconstructed based on the known classical transcripts involving exons.
当供体为外显子时(见表9A)When the donor is an exon (see Table 9A)
它从从供体外显子的经典转录物开始,并用融合序列替换完整的经典外显子序列。在这种情况下,融合转录物在受体的TE序列后停止。 It starts with a canonical transcript from the donor exon and replaces the complete canonical exon sequence with the fusion sequence. In this case, the fusion transcript stops after the recipient's TE sequence.
当供体为TE时(表9B)When the donor is TE (Table 9B)
该序列在转录物中受体外显子的经典位置开始,并忽略上游的所有外显子。受体外显子的经典序列用融合序列替换,转录物被重建直至结束。 This sequence starts at the canonical position of the receptor exon in the transcript and all exons upstream are ignored. The canonical sequence of the receptor exon is replaced with the fusion sequence and the transcript is reconstructed until the end.
融合转录物的每个k大小的核苷酸序列(即24-75个核苷酸)(第一个k-mer的翻译从融合转录物的第一个核苷酸开始,第二个k-mer的翻译从融合转录物的第二个核苷酸开始,等)然后被翻译成肽序列。Each k-sized nucleotide sequence (i.e., 24-75 nucleotides) of the fusion transcript (translation of the first k-mer starts from the first nucleotide of the fusion transcript, translation of the second k-mer starts from the second nucleotide of the fusion transcript, etc.) is then translated into a peptide sequence.
获得的肽随后用NetMHCpan进一步分析,用于MHC结合预测。因此,预测了序列中存在的每一个k-mer与至少一个已知人等位基因结合的亲和力(进一步说明参见实施例1)。The peptides obtained were then further analyzed using NetMHCpan for MHC binding prediction. Thus, the affinity of each k-mer present in the sequence for binding to at least one known human allele was predicted (see Example 1 for further explanation).
然后针对照参考蛋白质组进一步筛选肽,通常对于人受试者,针对Uniprot中存在的所有序列(代表人外显子组中编码的所有序列)。如果肽具有相同的氨基酸序列或仅在第一位或最后一位的氨基酸不同,则认为其与Uniprot中的肽等同。所有这些等同的序列随后被从候选列表中舍弃。因此,源自这些230例融合转录物的117个肽序列被预测为将结合至HLA-A2:01(见下表3)。The peptides are then further screened against a reference proteome, typically for human subjects, against all sequences present in Uniprot (representing all sequences encoded in the human exome). If a peptide has the same amino acid sequence or differs only in the first or last amino acid, it is considered equivalent to a peptide in Uniprot. All these equivalent sequences are then discarded from the candidate list. Therefore, 117 peptide sequences derived from these 230 fusion transcripts are predicted to bind to HLA-A2:01 (see Table 3 below).
表3:肽LUADTable 3: Peptide LUAD
2.2.2 HLA-A2相关肽的验证2.2.2 Validation of HLA-A2-associated peptides
鉴于HLA-A2等位基因在近50%的高加索人群中表达,并且存在不同的技术工具,验证主要集中在与HLA-A2相关的肽上。Given that the HLA-A2 allele is expressed in nearly 50% of the Caucasian population and that different technical tools exist, validation has focused primarily on peptides associated with HLA-A2.
在以下段落中,TE-外显子衍生转录物可与“融合转录物”互换使用,术语“TE衍生肽”可与“融合转录物衍生肽”互换使用。In the following paragraphs, TE-exon-derived transcripts may be used interchangeably with “fusion transcripts” and the term “TE-derived peptides” may be used interchangeably with “fusion transcript-derived peptides”.
TE-外显子衍生转录物在肺腺癌样品中的表达Expression of TE-exon-derived transcripts in lung adenocarcinoma samples
为了实验性验证预测的TE-外显子转录物,首先通过PCR验证了在LUAD肿瘤样品和肿瘤细胞系中的表达。因此,设计了用于每种嵌合融合的特异性引物,以使其中一条引物与融合的TE部分结合,另一条引物与融合的外显子部分结合。通过对PCR产物进行测序进一步证实了结果。To experimentally validate the predicted TE-exon transcripts, expression in LUAD tumor samples and tumor cell lines was first verified by PCR. Therefore, specific primers for each chimeric fusion were designed so that one primer binds to the TE portion of the fusion and the other primer binds to the exon portion of the fusion. The results were further confirmed by sequencing the PCR products.
具体而言,设计特异性引物,使得正向引物在重建的融合序列的“供体”序列中结合,反向引物在重建的融合序列的“受体”序列中结合。对来源于肺肿瘤样品和人肿瘤细胞系的RNA运行PCR。将扩增产物接种在琼脂糖凝胶上,并且对在预期大小上发现的条带进行切割和测序。最后,将测序的PCR产物与重建的融合序列进行比较。Specifically, specific primers are designed so that the forward primer binds in the "donor" sequence of the reconstructed fusion sequence and the reverse primer binds in the "acceptor" sequence of the reconstructed fusion sequence. PCR is run on RNA derived from lung tumor samples and human tumor cell lines. The amplified product is inoculated on an agarose gel, and the bands found in the expected size are cut and sequenced. Finally, the PCR product of sequencing is compared with the reconstructed fusion sequence.
使用该方法,可以确认LUAD肿瘤样品和肿瘤细胞系中均存在预测的融合转录物。下表4总结了8个频率最高的具有预测的肽的嵌合融合的结果,其与高亲和力结合HLA-A2等位基因相关。Using this approach, it was possible to confirm the presence of the predicted fusion transcripts in both LUAD tumor samples and tumor cell lines. Table 4 below summarizes the results for the 8 most frequent chimeric fusions with predicted peptides that are associated with high affinity binding to the HLA-A2 allele.
表4:最高频率的融合转录物验证。通过PCR验证了在15个LUAD肿瘤样品和6个LUAD肿瘤细胞系中最高频率的融合肽。下表中的“是”或“否”状态表示预期大小的PCR产物的存在或不存在。当通过测序进一步验证PCR产物时,表示为“是”。Table 4: Validation of the most frequent fusion transcripts. The most frequent fusion peptides in 15 LUAD tumor samples and 6 LUAD tumor cell lines were validated by PCR. The "yes" or "no" status in the table below indicates the presence or absence of a PCR product of the expected size. When the PCR product was further validated by sequencing, it was indicated as "yes".
ER衍生肽与HLA-A2分子的结合Binding of ER-derived peptides to HLA-A2 molecules
一旦确认了嵌合转录物的表达,就合成衍生肽,并确认了它们与HLA-A2的结合。由于单体稳定化和四聚体形成仅在高亲和力结合肽存在的情况下才有可能,因此通过流式细胞术估算了在合成肽存在的情况下HLA-A2四聚体的形成。所有预测的肽均能够稳定四聚体的形成,相对于阳性对照,显示出高于50%的荧光百分比。作为阳性对照,使用了已知的来源于巨细胞病毒(CMV)的HLA-A2高亲和力结合肽。该结果证实了预测的与HLA-A2等位基因的高亲和力结合。图10显示了从最高频率融合肽中获得的10个肽的结果。Once the expression of the chimeric transcripts was confirmed, derivative peptides were synthesized and their binding to HLA-A2 was confirmed. Since monomer stabilization and tetramer formation are only possible in the presence of high affinity binding peptides, the formation of HLA-A2 tetramers in the presence of synthetic peptides was estimated by flow cytometry. All predicted peptides were able to stabilize the formation of tetramers, showing a fluorescence percentage higher than 50% relative to the positive control. As a positive control, a known HLA-A2 high affinity binding peptide derived from cytomegalovirus (CMV) was used. The results confirmed the predicted high affinity binding to the HLA-A2 allele. Figure 10 shows the results of the 10 peptides obtained from the highest frequency fusion peptides.
图11显示了一组新的肽,也来源于高频的嵌合转录物,使用相同的肽-MHC-I复合物形成测定证实了其与HLA-A2的结合。作为复合物形成的阳性对照,我们使用了CMV pp65495-503(NLVPMVATV)和Melan-A的突变序列(MelA mut,ELAGIGILTV),两者均为HLA-A2的已知良好结合物。Melan-A(MelA)的非突变序列被用作低结合肽的对照。阴性是不含任何肽的重组HLA-A2分子。Figure 11 shows a set of new peptides, also derived from high frequency chimeric transcripts, whose binding to HLA-A2 was confirmed using the same peptide-MHC-I complex formation assay. As positive controls for complex formation, we used CMV pp65495-503 (NLVPMVATV) and a mutant sequence of Melan-A (MelA mut, ELAGIGILTV), both of which are known good binders of HLA-A2. The non-mutated sequence of Melan-A (MelA) was used as a control for low binding peptides. Negatives are recombinant HLA-A2 molecules without any peptide.
ER衍生肽的免疫原性Immunogenicity of ER-derived peptides
与HLA-A2等位基因结合验证后的下一步是测试预测肽的免疫原性。因此,进行引发测定以测试所鉴定的肽扩增特异性细胞毒性T细胞的能力。将来自HLA-A2+健康供体的PBMC用于产生单核细胞衍生的树突状细胞(moDC)。在将moDC装载至合成肽混合物后,与CD4+和CD8+T细胞进行自体共培养。最后,通过流式细胞术使用双色四聚体染色分析特异性CD8+T细胞的扩增。作为特异性扩增的对照,在无肽的情况下进行共培养。通过在一个供体中使用该方法,可以鉴定和扩增识别6个最高频率的嵌合融合衍生肽(RLLHLESFL、LLGETKVYV、AILPKANTV、RLADHLSFC、FLIVAEILI、YLWTTFFPL)的特异性CD8+T细胞。通过与总CD8+T细胞中的对照测试相比,四聚体阳性细胞百分比提高了至少一个数量级,使该结果得到证明。The next step after validation in conjunction with the HLA-A2 allele is to test the immunogenicity of the predicted peptide. Therefore, a priming assay was performed to test the ability of the identified peptide to amplify specific cytotoxic T cells. PBMCs from HLA-A2+ healthy donors were used to produce monocyte-derived dendritic cells (moDC). After moDC was loaded into a synthetic peptide mixture, autologous co-culture was performed with CD4+ and CD8+ T cells. Finally, the amplification of specific CD8+ T cells was analyzed by flow cytometry using two-color tetramer staining. As a control for specific amplification, co-culture was performed in the absence of peptide. By using this method in one donor, specific CD8+ T cells that recognize the 6 highest frequencies of chimeric fusion-derived peptides (RLLHLESFL, LLGETKVYV, AILPKANTV, RLADHLSFC, FLIVAIEILI, YLWTTFFPL) can be identified and amplified. The result was demonstrated by increasing the percentage of tetramer-positive cells by at least one order of magnitude compared to the control test in total CD8+ T cells.
为了评估另外5个供体的反应,进行了相同的实验。图12A总结了对总共6个供体进行分析后获得的结果,其中我们发现29个最高频率的融合转录物衍生肽中的23个(YLWTTFFPL、FLGTRVTRV、RLADHLSFC、LLGETKVYV、MLVTWELAL、MLMKTVWQA、SLMQSGSPV、AILPKANTV、AMDGKELSL、LLDRFGYHV、GLLNISHTA、ILTASITSI、ILSGYGPCV、RQAPGFHHA、GLPSHVELA、ILHSLVTGV、LLHLESFLV、VLLTNTIWL、LLTSWHLYL、RLLHLESFL、YLPYFLKSL、VLMWTMAHL、YLQGLPLPL)存在特异性CD8+T扩增。作为扩增的阳性结果,使用了突变的Melan-A肽(ELAGIGILTV)。这些实验表明,这些肽能够诱导免疫应答,并证实ER衍生肽的免疫原性。The same experiment was performed to evaluate the responses of an additional 5 donors. Figure 12A summarizes the results obtained after analyzing a total of 6 donors, where we found specific CD8+ T expansion for 23 of the 29 most frequent fusion transcript-derived peptides (YLWTTFFPL, FLGTRVTRV, RLADHLSFC, LLGETKVYV, MLVTWELAL, MLMKTVWQA, SLMQSGSPV, AILPKANTV, AMDGKELSL, LLDRFGYHV, GLLNISHTA, ILTASITSI, ILSGYGPCV, RQAPGFHHA, GLPSHVELA, ILHSLVTGV, LLHLESFLV, VLLTNTIWL, LLTSWHLYL, RLLHLESFL, YLPYFLKSL, VLMWTMAHL, YLQGLPLPL). As a positive result for amplification, a mutated Melan-A peptide (ELAGIGILTV) was used. These experiments demonstrated that the peptides were able to induce an immune response and confirmed the immunogenicity of the ER-derived peptides.
识别ER衍生肽的细胞毒性T淋巴细胞克隆的生产Production of cytotoxic T lymphocyte clones recognizing ER-derived peptides
为产生细胞毒性T淋巴细胞(CTL)克隆,免疫原性测定(图12A)中扩增的CD8+四聚体阳性T细胞经FACS-分选。产生了10个识别5个不同的ER衍生肽:YLWTTFFPL、LLGETKVYV、MLVTWELAL、MLMKTVWQA、RLADHLSF的克隆。这些肽在表3中分别列为肽9、86、53、80和64。将参考这些数字来表明每个生成的CTL克隆的特异性。例如,CTL-克隆9识别ER衍生肽9。在第二组实验中,产生了识别肽17(LLDRFGYHV)的新CTL-克隆17。To generate cytotoxic T lymphocyte (CTL) clones, CD8+ tetramer-positive T cells expanded in the immunogenicity assay (FIG. 12A) were FACS-sorted. Ten clones were generated that recognized five different ER-derived peptides: YLWTTFFPL, LLGETKVYV, MLVTWELAL, MLMKTVWQA, RLADHLSF. These peptides are listed as peptides 9, 86, 53, 80, and 64, respectively, in Table 3. Reference will be made to these numbers to indicate the specificity of each generated CTL clone. For example, CTL-clone 9 recognized ER-derived peptide 9. In a second set of experiments, a new CTL-clone 17 was generated that recognized peptide 17 (LLDRFGYHV).
为了评估所产生的CTL-克隆的细胞毒性能力,使用H1650细胞系作为靶细胞进行了两种不同的功能性分析。这是一种表达HLA-A2等位基因的LUAD衍生的肿瘤细胞系。To evaluate the cytotoxic capacity of the generated CTL-clones, two different functional assays were performed using the H1650 cell line as target cells. This is a LUAD-derived tumor cell line expressing the HLA-A2 allele.
首先,测量CTL-克隆暴露于ER衍生肽后分泌细胞因子的能力。将CTL-克隆与装载特异性ER衍生肽的靶细胞共培养18小时后,在培养上清液中测量INF-γ、TNF和颗粒酶B(Gr-B)的分泌。所有CTL-克隆在暴露特异性ER衍生肽后均被激活,以剂量依赖的方式分泌细胞因子(图12B)。First, the ability of CTL clones to secrete cytokines after exposure to ER-derived peptides was measured. After co-culture of CTL clones with target cells loaded with specific ER-derived peptides for 18 hours, the secretion of INF-γ, TNF and granzyme B (Gr-B) was measured in the culture supernatant. All CTL clones were activated after exposure to specific ER-derived peptides and secreted cytokines in a dose-dependent manner (Figure 12B).
在第二组实验中,评估了CTL-克隆的杀伤能力。将CTL-克隆在不同条件下与装载或未装载ER衍生肽的靶细胞共培养。利用xCELLigence系统测量了靶细胞单层中的实时阻抗变化。在这些测定中,细胞指数的降低与反映细胞活力的单层中细胞数的减少相关。In a second set of experiments, the killing capacity of CTL-clones was evaluated. CTL-clones were co-cultured with target cells loaded or not with ER-derived peptides under different conditions. Real-time impedance changes in the target cell monolayer were measured using the xCELLigence system. In these assays, a decrease in the cell index correlated with a decrease in the number of cells in the monolayer reflecting cell viability.
当CTL-克隆9与装载ER衍生肽9的靶细胞以1∶1的比例共培养时,与对照细胞(单独的靶细胞)相比,观察到细胞指数随时间降低。当存在阻断性抗MHC-I抗体(+抗MHC-I)的情况下进行共培养时,细胞指数的该降低受到抑制。使用相同浓度的同种型对照(+同种型)进行共培养时,并未抑制细胞指数的降低。此外,当靶细胞装载较高浓度的肽(1pM与1uM比较)时,降低量会增加(图12C,左图)。这些结果表明,细胞毒性T细胞能够识别由本文所述的融合转录物编码的肽,并杀死表达这种肽的靶肿瘤细胞。When CTL-clone 9 was co-cultured with target cells loaded with ER-derived peptide 9 at a ratio of 1:1, a decrease in the cell index was observed over time compared to control cells (target cells alone). This decrease in the cell index was inhibited when co-culture was performed in the presence of a blocking anti-MHC-I antibody (+ anti-MHC-I). The decrease in the cell index was not inhibited when co-culture was performed using an isotype control (+ isotype) at the same concentration. In addition, the decrease increased when the target cells were loaded with a higher concentration of peptide (1 pM compared to 1 uM) (Figure 12C, left). These results indicate that cytotoxic T cells are able to recognize the peptide encoded by the fusion transcript described herein and kill target tumor cells expressing such peptides.
随后证实ER衍生肽由靶细胞天然表达和呈递,因此所述靶细胞可通过将其与CTL-克隆共培养而被杀死,而无需添加外部肽。为此目的,进行了CTL-克隆9与不同比例的H1650靶细胞的共培养。图12C的右图显示,与对照细胞(仅靶细胞)相比,CTL-9能够以4∶1的效应物-靶标比例杀死靶细胞。此外,在更高的比例(8∶1)下,杀伤效果增加。在较低比例(2∶1)下未发现靶细胞被杀死。It was subsequently confirmed that the ER-derived peptides are naturally expressed and presented by target cells, and therefore the target cells can be killed by co-culturing them with CTL-clones without the addition of external peptides. For this purpose, co-culture of CTL-clone 9 with different ratios of H1650 target cells was performed. The right figure of Figure 12C shows that CTL-9 is able to kill target cells at an effector-target ratio of 4:1 compared to control cells (target cells only). In addition, at a higher ratio (8:1), the killing effect increases. No target cells were found to be killed at a lower ratio (2:1).
最后,对CTL-克隆9、CTL-克隆64和CTL-克隆80进行了类似的实验,显示了在抗MCH-I抗体存在的情况下进行共培养时也可以抑制对靶细胞的特异性杀伤(图12D)。Finally, similar experiments were performed on CTL-clone 9, CTL-clone 64, and CTL-clone 80, showing that specific killing of target cells could also be inhibited when co-cultured in the presence of anti-MCH-I antibodies ( FIG. 12D ).
综上所述,这些结果证实,识别由本文所述的融合转录物编码的多种不同肽的细胞毒性T细胞能够识别并杀死表达所述特异性融合转录物衍生肽的肿瘤细胞,并且该效果是由于在MHC-I分子的情况下对肽的特异性识别。此外,CTL-克隆能够在不添加外部肽的情况下杀死靶细胞的事实为融合转录物衍生肽在LUAD肿瘤细胞系中天然表达和呈递提供了明确证据。Taken together, these results confirm that cytotoxic T cells that recognize a variety of different peptides encoded by the fusion transcripts described herein are able to recognize and kill tumor cells expressing the specific fusion transcript-derived peptides, and that this effect is due to specific recognition of the peptides in the context of MHC-I molecules. In addition, the fact that CTL clones are able to kill target cells without the addition of external peptides provides clear evidence that the fusion transcript-derived peptides are naturally expressed and presented in LUAD tumor cell lines.
识别融合衍生肽的工程化T细胞的产生Generation of engineered T cells that recognize fusion-derived peptides
用编码CTL-9TCR序列的慢病毒载体转导的Jurkat细胞与两种不同的靶细胞H1650和H1395共培养。两者均为LUAD衍生的表达HLA-A2等位基因的细胞系。通过流式细胞术中报告基因(NFAT-GPF、NF-KB-CFP和AP-1-mCherry)的荧光的提高来评估TCR介导的Jurkat细胞激活。初步结果显示,与阴性对照(未转导的Jurkat细胞)相比,Jurkat细胞在与两种靶细胞共培养时被激活。此外,当与装载特异性肽的靶细胞进行共培养时,该激活以剂量依赖性方式增加。使用PMA/离子霉素作为阳性对照(图13)。Jurkat cells transduced with the lentiviral vector encoding CTL-9TCR sequences were co-cultured with two different target cells H1650 and H1395. Both were cell lines expressing HLA-A2 alleles derived from LUAD. Jurkat cell activation mediated by TCR was assessed by the increase of the fluorescence of reporter genes (NFAT-GPF, NF-KB-CFP and AP-1-mCherry) in flow cytometry. Preliminary results show that Jurkat cells are activated when co-cultured with two target cells compared with negative control (untransduced Jurkat cells). In addition, when co-culturing with target cells loaded with specific peptides, the activation increases in a dose-dependent manner. PMA/ ionomycin is used as a positive control (Figure 13).
这些结果在另一组实验中重复,并且用编码来自CTL-86、CTL-53和CTL-17的TCR序列的慢病毒载体转导的Jurkat细胞获得了类似的结果。转导的Jurkat细胞通过与装载有相应ER衍生肽(特异性/相关肽)但不装载有对照Melan-A肽(无关/不相关肽,ELAGIGILTV)的靶肿瘤细胞系共培养而被激活(图14A)。如预期的,激活被抗HLA-I抗体的阻断所抑制(图14B)。因此,由产生的CTL克隆表达的TCR对相应的呈递ER衍生肽的HLA-A2具有特异性。These results were repeated in another set of experiments, and similar results were obtained with Jurkat cells transduced with lentiviral vectors encoding TCR sequences from CTL-86, CTL-53, and CTL-17. The transduced Jurkat cells were activated by co-culturing with a target tumor cell line loaded with the corresponding ER-derived peptide (specific/relevant peptide) but not loaded with a control Melan-A peptide (irrelevant/irrelevant peptide, ELAGIGILTV) ( FIG. 14A ). As expected, activation was inhibited by blocking with anti-HLA-I antibodies ( FIG. 14B ). Thus, the TCR expressed by the generated CTL clones is specific for the corresponding HLA-A2 presenting the ER-derived peptide.
这些结果与图12C和D所示的结果一致,表明LUAD衍生的肿瘤细胞表达TE衍生肽。此外,这些结果还强调了CTL-克隆TCR序列在工程化T细胞发育中的技术相关性。These results are consistent with those shown in Figure 12C and D, indicating that LUAD-derived tumor cells express TE-derived peptides. In addition, these results also highlight the technical relevance of CTL-cloned TCR sequences in the development of engineered T cells.
LUAD患者中存在识别融合衍生肽的CD8+细胞CD8+ cells that recognize fusion-derived peptides exist in LUAD patients
随后旨在鉴定LUAD肿瘤样品中识别融合衍生肽的CTL细胞的存在。We then aimed to identify the presence of CTL cells recognizing the fusion-derived peptides in LUAD tumor samples.
在第一组实验中,通过四聚体染色分析用TE衍生肽和Il-2的混合物或仅用Il-2扩增的肿瘤浸润性淋巴细胞(TIL)。如图15A和B所示,在LUAD患者的TIL中发现了识别融合衍生肽的CD8+T细胞。In the first set of experiments, tumor infiltrating lymphocytes (TILs) amplified with a mixture of TE-derived peptides and Il-2 or with Il-2 alone were analyzed by tetramer staining. As shown in Figures 15A and B, CD8+ T cells that recognized the fusion-derived peptides were found in the TILs of LUAD patients.
随后表明可以检测到四聚体阳性细胞,并且进一步评估了其在来源于新鲜肿瘤样品的未扩增CD8+T细胞中的表型。使用该策略,对来源于LUAD患者样品的肿瘤、近肿瘤、侵入的淋巴结和血液中存在的CD8+T细胞进行分析。基于表面标记CCR 7和CD45RA的表达,确定了初始(CCR7+CD45+)、中央记忆(CM、CCR7+CD45RA-)效应记忆(EM、CCR7-CD45-)和末端效应子(TE、CCR7-CD45+)T细胞的细胞表型。有趣的是,在肿瘤组织中发现的四聚体阳性细胞优先共享记忆表型,而初始细胞(CCR7+CD45+)主要发现于源自淋巴结的细胞中(图16A和B)。患者2和3在图14和图15中相同。It was subsequently shown that tetramer positive cells can be detected, and the phenotype in the unamplified CD8+T cells derived from fresh tumor samples was further evaluated. Using this strategy, CD8+T cells present in the tumor, near tumor, invasive lymph nodes and blood derived from LUAD patient samples were analyzed. Based on the expression of surface markers CCR 7 and CD45RA, the cell phenotypes of initial (CCR7+CD45+), central memory (CM, CCR7+CD45RA-) effector memory (EM, CCR7-CD45-) and terminal effector (TE, CCR7-CD45+) T cells were determined. Interestingly, the tetramer positive cells found in tumor tissue preferentially share memory phenotypes, and initial cells (CCR7+CD45+) are mainly found in cells derived from lymph nodes (Figure 16 A and B). Patients 2 and 3 are identical in Figures 14 and 15.
所有测试样品均来自HLA-A2+患者。All tested samples were from HLA-A2+ patients.
肿瘤组织中具有记忆表型的四聚体阳性细胞的存在以及TIL中四聚体阳性细胞的存在,证明这些患者对TE衍生肽产生了免疫应答。此外,淋巴结中存在初始四聚体阳性细胞表明可以产生针对这些,特别是TE衍生肽,的免疫应答。The presence of tetramer-positive cells with a memory phenotype in tumor tissue and the presence of tetramer-positive cells in TILs demonstrated that these patients had developed an immune response to TE-derived peptides. In addition, the presence of naive tetramer-positive cells in lymph nodes indicated that an immune response could be generated against these, especially TE-derived peptides.
在第二个队列中,来自LUAD癌症HLA-A2+患者的5个原发性的、未经治疗的LUAD肿瘤、近肿瘤、肿瘤引流淋巴结和血液样品进行了分析。每个样品的一半通过分离CD8+T细胞直接离体分析,而不进行体外扩增,并且另一半与混合有IL-2的嵌合转录物衍生肽(患者1和2)或与单独的IL-2(患者3、4、5)一起体外培养20天,目的是在样品中扩增识别嵌合转录物衍生肽的特异性T细胞群体。T细胞采用双色四聚体染色法进行鉴定。还将抗CCR7和CD45RA的抗体添加到未扩增的细胞中,以区分初始细胞和记忆细胞。考虑用5个或更多双四聚体标记的细胞进行扩增。In the second cohort, five primary, untreated LUAD tumors, adjacent tumors, tumor-draining lymph nodes, and blood samples from HLA-A2+ patients with LUAD cancer were analyzed. Half of each sample was analyzed directly ex vivo by isolating CD8+ T cells without ex vivo expansion, and the other half was cultured in vitro for 20 days with chimeric transcript-derived peptides mixed with IL-2 (patients 1 and 2) or with IL-2 alone (patients 3, 4, 5) in order to expand the specific T cell population that recognizes the chimeric transcript-derived peptides in the sample. T cells were identified by two-color tetramer staining. Antibodies against CCR7 and CD45RA were also added to the non-expanded cells to distinguish between naïve and memory cells. Cells labeled with five or more dual tetramers were considered for expansion.
图17A显示了在直接离体分析的4名患者中发现的7个“嵌合转录物衍生肽特异性”的四聚体阳性T细胞群体的总结(其中一名患者样品因技术原因无法分析)。CCR7/CD45RA标记表明,在肿瘤样品中检测到的所有四聚体阳性T细胞均具有明确的效应子/记忆表型,而在血液和淋巴结中,“嵌合转录物衍生肽特异性”的四聚体阳性T细胞具有不同比例的低分化的、CCR7+初始和/或中央记忆表型。Figure 17A shows a summary of the 7 tetramer-positive T cell populations found in 4 patients analyzed directly ex vivo (one patient sample could not be analyzed for technical reasons). CCR7/CD45RA labeling showed that all tetramer-positive T cells detected in tumor samples had a clear effector/memory phenotype, while in blood and lymph nodes, tetramer-positive T cells specific for "chimeric transcript-derived peptides" had different proportions of poorly differentiated, CCR7+ naive and/or central memory phenotypes.
因此,这些结果表明,人原发性NSCLC肿瘤包含嵌合转录物衍生肽特异性的记忆T细胞(图17B)。Thus, these results indicate that human primary NSCLC tumors contain memory T cells specific for the chimeric transcript-derived peptide ( FIG. 17B ).
扩增的四聚体+CD8+T细胞的总结见图17C。就大多数肽特异性而言,T细胞仅在2名患者中从肿瘤和匹配的入侵的淋巴结(LN)中扩增,并且在某些情况下从匹配的近肿瘤样品(Jt)中扩增(图17C)。7个特异性四聚体阳性群体中的5个在第20天也在同一患者中发现,并且在离体发现的组织没有T细胞扩增(图17A和图17C中的加粗方框)。A summary of the expanded tetramer+CD8+T cells is shown in Figure 17C. For most peptide specificities, T cells were expanded from tumors and matched invaded lymph nodes (LN) in only 2 patients, and in some cases from matched near-tumor samples (Jt) (Figure 17C). Five of the 7 specific tetramer-positive populations were also found in the same patient at day 20, and no T cells were expanded in tissues found ex vivo (bold boxes in Figures 17A and 17C).
因此,这些结果提供了证据,即在体外扩增前后,LUAD患者的肿瘤、肿瘤引流淋巴结、有时近肿瘤组织和血液中存在嵌合转录物衍生肽特异性T细胞,这与LUAD患者中存在嵌合转录物衍生肽特异性免疫应答相一致。Therefore, these results provide evidence that chimeric transcript-derived peptide-specific T cells exist in tumors, tumor-draining lymph nodes, and sometimes juxtatumor tissues and blood of LUAD patients before and after ex vivo expansion, which is consistent with the presence of chimeric transcript-derived peptide-specific immune responses in LUAD patients.
通过质谱在LUAD活检中鉴定肽。Identification of peptides in LUAD biopsies by mass spectrometry.
ER衍生肽需要肿瘤细胞表面呈递MHC I类分子,以便被细胞毒性T细胞识别。为了确认预测的肽在MHC I类分子上表达,使用了源自3个LUAD活检的来自MHC I免疫肽组的公开数据(Laumont CM et al.,“Noncoding regions are the main source of targe表tumor-specific antigens”Sci Transl Med.2018 10(470))。使用OpenMS软件分析来自3个LUAD肿瘤(PXD009752、PXD009754和PXD009755)的MHC-I免疫纯化的上传至PRIDE数据库的原始数据。需要注意的是,蛋白质组学中的数据依赖性采集仅允许鉴定那些目标数据库中包含的序列(通常是全人类蛋白质组);根据本申请的肽先前未被鉴定,因为其源自非编码序列。重新分析了纳入了目标数据库中本文预测的肽序列的MS/MS鉴定。在3个样品活检中发现了5个肽(肽ID:3304、269、757、1810、3953)。为了进行这一分析,考虑了TCGA中至少在5个样品中存在的与任何MHC I等位基因结合的源自嵌合融合的所有预测的肽。该结果证实了嵌合融合衍生肽在LUAD肿瘤中的MHC I类分子上的表达。ER-derived peptides require MHC class I molecules to be presented on the surface of tumor cells in order to be recognized by cytotoxic T cells. To confirm that the predicted peptides are expressed on MHC class I molecules, public data from MHC I immunopeptidomes derived from three LUAD biopsies were used (Laumont CM et al., "Noncoding regions are the main source of target tumor-specific antigens" Sci Transl Med. 2018 10 (470)). The raw data uploaded to the PRIDE database from MHC-I immunopurification of three LUAD tumors (PXD009752, PXD009754 and PXD009755) were analyzed using OpenMS software. It should be noted that data-dependent acquisition in proteomics only allows the identification of sequences contained in the target database (usually the entire human proteome); the peptides according to the present application have not been previously identified because they are derived from non-coding sequences. The MS/MS identification of the peptide sequences predicted in this paper that were included in the target database was reanalyzed. Five peptides were found in three sample biopsies (peptide IDs: 3304, 269, 757, 1810, 3953). For this analysis, all predicted peptides derived from chimeric fusions that bind to any MHC I allele present in at least five samples in TCGA were considered. The results confirm the expression of chimeric fusion-derived peptides on MHC class I molecules in LUAD tumors.
之后,我们将分析扩展至新的肺免疫肽组学数据集(Bulik-Sullivan etal.Nat.Biotec 2018,Chong et al.Nat.Comm.2020and Javitt et al.Front Immunol2019)。值得注意的是,除包含LUAD肿瘤细胞系的Javitt et al.Front Immunol 2019外,所有数据集均使用新鲜的肺肿瘤样品生成。在第二次分析中,使用了ProteomeDiscoverer1.4软件来鉴定ER衍生肽。考虑了4个数据集,在总共19个免疫肽组样品中至少有一个样品中存在23种独特的ER衍生肽。在图18中,在每个MHC样品(列)中鉴定的ER衍生肽(行)用灰色方框表示。右边显示了找到的肽序列。有趣的是,在1个以上的MHC样品中观察到了其中一些肽序列,这表明它们在样品之间是共享的。这些结果证实了融合转录物衍生肽是由肿瘤细胞表面的HLA-I分子加工和呈递的。We then extended the analysis to new lung immunopeptidomics datasets (Bulik-Sullivan et al. Nat. Biotec 2018, Chong et al. Nat. Comm. 2020 and Javitt et al. Front Immunol 2019). It is worth noting that all datasets were generated using fresh lung tumor samples, except for Javitt et al. Front Immunol 2019, which contained LUAD tumor cell lines. In the second analysis, ProteomeDiscoverer 1.4 software was used to identify ER-derived peptides. Four datasets were considered, and 23 unique ER-derived peptides were present in at least one of the 19 immunopeptidomic samples in total. In Figure 18, the ER-derived peptides (rows) identified in each MHC sample (column) are indicated by gray boxes. The peptide sequences found are shown on the right. Interestingly, some of these peptide sequences were observed in more than one MHC sample, indicating that they are shared between samples. These results confirm that the fusion transcript-derived peptides are processed and presented by HLA-I molecules on the surface of tumor cells.
肽RLADHLSFC源自融合转录物,其中融合的基因部分是肿瘤抑制基因(融合ID:chr22:29117506:->chr 22:29115473:-/相关基因:CHEK2),以及肽GLPSHVELA源自融合转录物,其中基因部分为致癌基因(融合ID:chr 6:117763597:->chr 6:117739669:-/相关基因:ROS1)。有趣的是,发现这两种肽均具有免疫原性(图12A),尤其是对肽RLADHLSFC而言,图12D中所示结果表明,这两种肽均可由H1650细胞系表达。此外,我们发现识别肽GLPSHVELA的TIL(图13A),这表明该融合转录物衍生肽可在LUAD肿瘤样品中表达。The peptide RLADHLSFC is derived from a fusion transcript in which the fused gene portion is a tumor suppressor gene (fusion ID: chr22:29117506:->chr 22:29115473:-/related gene: CHEK2), and the peptide GLPSHVELA is derived from a fusion transcript in which the gene portion is an oncogene (fusion ID: chr 6:117763597:->chr 6:117739669:-/related gene: ROS1). Interestingly, both peptides were found to be immunogenic (Figure 12A), especially for the peptide RLADHLSFC, and the results shown in Figure 12D indicate that both peptides can be expressed by the H1650 cell line. In addition, we found TILs that recognized the peptide GLPSHVELA (Figure 13A), indicating that this fusion transcript-derived peptide can be expressed in LUAD tumor samples.
3.实施例3:从各种癌症样品中鉴定源自由TE元件和外显子序列组成的融合转录物的新抗原肽。3. Example 3: Identification of new antigenic peptides derived from fusion transcripts composed of TE elements and exon sequences from various cancer samples.
表5:来自32个不同的癌症类型(来自TCGA)的9184个样品Table 5: 9184 samples from 32 different cancer types (from TCGA)
根据如前所述的方法分析了来自前述癌症样品的RNA数据集。The RNA datasets from the aforementioned cancer samples were analyzed according to the methods described above.
鉴定了16580个融合转录物。16,580 fusion transcripts were identified.
4跨膜嵌合蛋白4 transmembrane chimeric protein
4.1跨膜融合蛋白的鉴定4.1 Identification of transmembrane fusion proteins
本公开提供了从生物信息管道预测的融合转录物中获得的跨膜嵌合蛋白候选物的第一选择,所述生物信息管道是使用在TCGA(癌症基因组图谱)和CCLE(博德研究所癌细胞系百科)公开可用的RNA-Seq数据来鉴定全基因组非经典剪接区域而开发的(在实施例部分中描述)。The present disclosure provides a first selection of transmembrane chimeric protein candidates obtained from fusion transcripts predicted by a bioinformatic pipeline developed using publicly available RNA-Seq data from TCGA (The Cancer Genome Atlas) and CCLE (Broad Institute Cancer Cell Line Encyclopedia) to identify genome-wide non-canonical splicing regions (described in the Examples section).
这种跨膜嵌合蛋白候选物的鉴定和选择如下所详述。The identification and selection of such transmembrane chimeric protein candidates are described in detail below.
首先在来自TCGA和CCLE数据库的转录组mRNA数据中鉴定所有源自TE和外显子序列之间的剪接事件的转录物。We first identified all transcripts originating from splicing events between TE and exonic sequences in transcriptome mRNA data from TCGA and CCLE databases.
该步骤已在本申请的上述部分(详细描述和先前实施例)中详细说明。This step has been described in detail in the above sections of this application (Detailed Description and previous Examples).
该步骤已在本申请的上述部分(详细描述和先前实施例)中详细说明。如前所述,融合转录物来自已知对产生功能多样性至关重要的替代性剪接机制。通过现有外显子和内含子序列的重排,它允许单个基因表达多种mRNA和编码多种蛋白。观察到的剪接改变的类型包括外显子跳读、内含子保留和使用替代性剪接供体或受体位点。在这些融合转录物中,TE可以作为供体(位于5’位)或受体(位于3’位),相应地,外显子可以是受体或供体。因此,TE-外显子剪接导致“非编码”基因组的一部分并入编码基因组,从而将非编码基因组序列暴露于翻译机制。也称为JET(连接外显子TE)的这些融合(或嵌合)转录物包括ORF(开放阅读框),即它们是阅读框中有能力翻译成多肽或蛋白的部分。当TE为受体时,融合转录物的ORF是经典的(即与经典转录物相同),而当TE为供体时,ORF可以是经典的,或可以是移码1或2个核苷酸。This step has been described in detail in the above-mentioned part (detailed description and previous embodiments) of the application. As previously mentioned, fusion transcripts come from alternative splicing mechanisms known to be essential for producing functional diversity. Through the rearrangement of existing exon and intron sequences, it allows a single gene to express multiple mRNAs and encode multiple proteins. The types of splicing changes observed include exon skipping, intron retention and the use of alternative splicing donor or acceptor sites. In these fusion transcripts, TE can be used as a donor (positioned at 5') or an acceptor (positioned at 3'), and accordingly, an exon can be an acceptor or a donor. Therefore, TE-exon splicing causes a part of a "non-coding" genome to be incorporated into the coding genome, thereby exposing the non-coding genome sequence to the translation mechanism. These fusion (or chimeric) transcripts, also referred to as JET (joining exon TE), include ORF (open reading frame), i.e., they are parts in the reading frame that have the ability to be translated into polypeptides or proteins. When the TE is the acceptor, the ORF of the fusion transcript is canonical (ie, identical to the canonical transcript), whereas when the TE is the donor, the ORF may be canonical, or may be frameshifted by 1 or 2 nucleotides.
融合转录物不仅包括融合的TE和外显子序列(对应于JET),还可进一步包括对应于各种转录物同种型的外显子,如果外显子是供体,则在融合断点的上游(在外显子和TE之间),如果TE是供体,则在融合断点的下游。The fusion transcript includes not only the fused TE and exon sequences (corresponding to JET), but may further include exons corresponding to various transcript isoforms, upstream of the fusion breakpoint (between the exon and TE) if the exon is the donor, and downstream of the fusion breakpoint if the TE is the donor.
本文公开的跨膜新抗原肽的鉴定进一步包括选择融合转录物的步骤,所述融合转录物具有在蛋白质组数据库(例如UniProt)中注释为属于编码膜蛋白的转录物的翻译的外显子序列。The identification of transmembrane neoantigenic peptides disclosed herein further comprises the step of selecting fusion transcripts having translated exon sequences annotated in a proteomic database (e.g., UniProt) as belonging to transcripts encoding membrane proteins.
所选融合转录物的序列随后被翻译成融合肽(也称为翻译的连接)序列。The sequence of the selected fusion transcript is then translated into a fusion peptide (also called a translated junction) sequence.
每个融合转录物的完整序列按照以下规则翻译:The complete sequence of each fusion transcript was translated according to the following rules:
-其中外显子为供体的融合转录物在转录物的经典ORF后从转录物的起始翻译至外显子和TE之间的断点后的第一个终止密码子。- A fusion transcript in which the exon is the donor is translated from the start of the transcript after the classical ORF of the transcript to the first stop codon after the breakpoint between the exon and the TE.
-其中TE为供体的融合转录物在3个ORF(1-3)后从TE的起始或从TE和外显子之间的断点前的最后一个终止密码子后翻译至所述断点后的第一个终止密码子。- A fusion transcript in which TE is the donor is translated after 3 ORFs (1-3) from the start of TE or from the last stop codon before the breakpoint between TE and exon to the first stop codon after said breakpoint.
-仅保留了含有至少3个源自TE序列的氨基酸的翻译的肽序列。- Only translated peptide sequences containing at least 3 amino acids derived from TE sequences were retained.
通常,与UniProt中任何参考的或注释的蛋白序列相匹配的来自翻译的连接的肽序列被舍弃,因此,重点关注未注释的嵌合肽(如表9至12举例说明)。In general, peptide sequences from translated junctions that matched any reference or annotated protein sequence in UniProt were discarded, thus focusing on unannotated chimeric peptides (as exemplified in Tables 9 to 12).
通过异位表达验证点击Validation of hits by ectopic expression
通过应用上述规则,选择了完整的跨膜嵌合蛋白的短列表。预期这些嵌合蛋白通过TE-受体融合或元融合产生。相应的基因和相关的嵌合ID为:By applying the above rules, a short list of complete transmembrane chimeric proteins was selected. These chimeric proteins are expected to be produced by TE-receptor fusion or meta-fusion. The corresponding genes and associated chimeric IDs are:
表6Table 6
表7Table 7
从上述表6和表7中,合成了27个TE-受体融合和17个添加了c-Myc序列的元融合转录物,并将它们克隆入pCDNA 3质粒(可商购)。这些质粒用于在HEK293细胞系中异位表达包括c-Myc标签的预测的嵌合蛋白。在细胞转染和蛋白表达后,使用抗Myc Alexa Fluor 6472233S克隆9B11抗体检测和定量来自胞外区域的c-Myc。From Tables 6 and 7 above, 27 TE-receptor fusions and 17 meta-fusion transcripts with added c-Myc sequences were synthesized and cloned into pCDNA 3 plasmids (commercially available). These plasmids were used to ectopically express the predicted chimeric proteins including the c-Myc tag in the HEK293 cell line. After cell transfection and protein expression, c-Myc from the extracellular region was detected and quantified using anti-Myc Alexa Fluor 6472233S clone 9B11 antibody.
以下19个JET衍生的转录物通过这种方法验证为阳性,从而证明了相应的嵌合蛋白在上述实验设置中被稳定翻译并插入膜中。The following 19 JET-derived transcripts were validated as positive by this approach, thus demonstrating that the corresponding chimeric proteins were stably translated and inserted into the membrane in the above experimental setup.
表8Table 8
图19显示了流式细胞术结果的实例(转染以下构建体:ABHD1ENST00000316470chr Figure 19 shows an example of flow cytometry results (transfected with the following constructs: ABHD1ENST00000316470chr 2:27346930:+>chr 2:27347727:+)。2:27346930:+>chr 2:27347727:+).
4.2总蛋白质组学4.2 Total proteomics
基于质谱的蛋白质组学已成为问询特定细胞制剂的实际蛋白质含量的有力工具。为了确认JET确实被翻译成蛋白质,对从细胞系和新鲜肿瘤中产生的质谱输出文件(称为原始文件)进行了分析,以鉴定JET衍生肽的不同群体。本研究分组为两种不同的分析,每一种分析提供了不同且互补类型的信息,表明JET衍生的蛋白可以在肿瘤样品或肿瘤细胞系中被可靠地检测到。Mass spectrometry-based proteomics has emerged as a powerful tool to interrogate the actual protein content of a particular cell preparation. To confirm that JET is indeed translated into protein, mass spectrometry output files (referred to as raw files) generated from cell lines and fresh tumors were analyzed to identify distinct populations of JET-derived peptides. This study was grouped into two different analyses, each providing different and complementary types of information, demonstrating that JET-derived proteins can be reliably detected in tumor samples or tumor cell lines.
首先,证明了在CCLE数据集中发现从JET衍生的蛋白具有高度的重复性。因此,使用了来自CCLE队列中超过7个不同的细胞系中经预测的所有那些JET mRNA序列的计算机模拟翻译的连接,并针对来自Nusinow et al.2020的质谱原始文件(其在于来自CCLE的375个细胞系的蛋白质组学分析)问询。这些细胞系被分组在TMT 6plex中,产生总计29MS/MS的输出文件。基于MS的蛋白质组学分析鉴定出57个JET衍生的蛋白,其包含至少1个与剪接连接重叠的肽,且其中涉及剪接事件的基因根据Uniprot被注释为定位在质膜(表9)。First, it was demonstrated that the discovery of JET-derived proteins in the CCLE dataset was highly reproducible. Therefore, the connection of the computer-simulated translation of all those JET mRNA sequences predicted in more than 7 different cell lines in the CCLE cohort was used and queried against the mass spectrometry raw files from Nusinow et al. 2020 (which is a proteomic analysis of 375 cell lines from CCLE). These cell lines were grouped in TMT 6plex, generating a total of 29MS/MS output files. MS-based proteomic analysis identified 57 JET-derived proteins that contained at least 1 peptide overlapping with the splicing connection, and genes involved in splicing events were annotated as being located at the plasma membrane according to Uniprot (Table 9).
表9:Table 9:
下表中列的编号是指以下项目。The numbers in the columns in the table below refer to the following items.
尽管同位素标记法(例如TMT)存在固有的限制,即导致通道间出现显著的标签交叉对话,但经证实,在超过1个TMT组中鉴定到JET,因此在超过1个肿瘤样品中也鉴定到JET。Despite the inherent limitations of isotopic labeling methods such as TMT, which result in significant label cross-talk between channels, it was confirmed that JET was identified in more than one TMT group and therefore in more than one tumor sample.
表面蛋白质富集:Surface protein enrichment:
据描述,由于使用了裂解方案,跨膜蛋白在整个蛋白质组学实验中的代表性不足。因此,第二种方法已被用于通过在H1650肺细胞系中进行生物素标记来富集胞外暴露的蛋白质。利用该实验获得的质谱原始文件进行了两种不同的分析。首先,为了从一般角度理解融合衍生蛋白(嵌合蛋白)是否在质膜上表达,在该实验的质谱文件中分析了在来自CCLE队列的7个以上细胞系(肿瘤特异性和非肿瘤特异性)中表达的所有连接。提供了10个嵌合肽的鉴定,其中6个涉及发现了TE和外显子的氨基酸的连接。从连接序列来看,其中4个与膜室中注释的蛋白相关。通过TMHMM算法(http://www.cbs.dtu.dk/)对跨膜螺旋进行预测,并对翻译序列的拓扑结构进行研究。基于序列的预测拓扑结构,仅保留预测TE将暴露于胞外隔室的候选序列。共鉴定出10个融合衍生肽,其中6个与剪接位点重叠(表10)。It has been described that transmembrane proteins are underrepresented in the whole proteomics experiment due to the lysis protocol used. Therefore, a second approach has been used to enrich for extracellularly exposed proteins by biotin labeling in the H1650 lung cell line. Two different analyses were performed using the mass spectrometry raw files obtained from this experiment. First, in order to understand from a general point of view whether fusion-derived proteins (chimeric proteins) are expressed on the plasma membrane, all junctions expressed in more than 7 cell lines (tumor-specific and non-tumor-specific) from the CCLE cohort were analyzed in the mass spectrometry files of this experiment. The identification of 10 chimeric peptides was provided, 6 of which involved the junctions of amino acids where TE and exon were found. From the junction sequence, 4 of them were related to proteins annotated in the membrane compartment. The prediction of transmembrane helices was performed by the TMHMM algorithm (http://www.cbs.dtu.dk/) and the topology of the translated sequences was studied. Based on the predicted topology of the sequences, only candidate sequences were retained for which the TE was predicted to be exposed to the extracellular compartment. A total of 10 fusion-derived peptides were identified, 6 of which overlapped with the splice sites (Table 10).
表10:Table 10:
此外,发现了涉及经典基因产物RHOT2的肽,其携带预期暴露于胞外隔室的TE。根据这些分析,进一步表明也可以使用这种方法鉴定肿瘤特异性融合衍生蛋白。因此,来自肺TCGA和CCLE队列(如前所述)的肺肿瘤特异性JET被用于到MS文件问询它们。在16个被鉴定的序列中,其中8个涉及连接,且被鉴定的氨基酸来源于TE和经典基因产物。根据所涉及的经典蛋白的注释的亚细胞位置,16个嵌合肽中的5个可能位于质膜中,而其余的(11个)可能属于其他膜室或污染物胞质溶胶(表11)。In addition, peptides involving the classical gene product RHOT2 were found, which carried TEs expected to be exposed to the extracellular compartment. Based on these analyses, it was further suggested that tumor-specific fusion-derived proteins could also be identified using this approach. Therefore, lung tumor-specific JETs from the lung TCGA and CCLE cohorts (as described above) were used to interrogate them to MS files. Of the 16 identified sequences, 8 involved junctions, and the identified amino acids were derived from TEs and classical gene products. Based on the annotated subcellular locations of the classical proteins involved, 5 of the 16 chimeric peptides are likely to be located in the plasma membrane, while the rest (11) may belong to other membrane compartments or contaminant cytosols (Table 11).
表11:Table 11:
根据翻译的连接的预测序列,使用TMHMM算法计算跨膜螺旋结构域的存在。这产生了两个感兴趣的候选基因,涉及ATP11A和PARM1基因产物。Based on the predicted sequences of the translated junctions, the presence of transmembrane helical domains was calculated using the TMHMM algorithm. This yielded two candidate genes of interest, involving the ATP11A and PARM1 gene products.
这些初步结果揭示了表面组中嵌合肽的存在,以及我们的方法学来研究暴露于细胞表面的翻译的表观遗传产物的能力。同样,我们的管道的证实的适用性为进一步分析不同的细胞模型铺平了道路,扩大了我们的研究范围,并允许鉴定更大的JET群体(外显子-可转座元件的连接)。These initial results reveal the presence of chimeric peptides in the surface group and the power of our methodology to study epigenetic products of translation exposed to the cell surface. Likewise, the demonstrated applicability of our pipeline paves the way for further analysis in different cell models, expanding the scope of our studies and allowing the identification of a larger population of JETs (exon-transposable element junctions).
用数据集标识符PXD016582分析通过PRIDE合作存储库存入ProteomeXchangeConsortium的原始数据集。这些分析对应于来自保留了异质性并概括了结直肠癌的特征的单个肿瘤细胞的患者来源的类器官克隆。从同一名患者中分离出四个肿瘤克隆,并将其与从同一名患者的无肿瘤结肠粘膜组织活组织切片中产生的正常结肠类器官系一起保存在类器官培养物中。肿瘤克隆T1、T3、T4和T5在形态上不同于同一患者的正常类器官。更多详情请参见Demmers,L.C.,Kretzschmar,K.,Van Hoeck,A.et al.Single-cell derivedtumor organoids display diversity in HLA class I peptide presentation(参见NatCommun 11,5338(2020).https://doi.org/10.1038/s41467-020-19142-9)。The original datasets deposited into the ProteomeXchange Consortium through the PRIDE collaborative repository were analyzed with the dataset identifier PXD016582. These analyses correspond to patient-derived organoid clones from single tumor cells that retain heterogeneity and recapitulate the characteristics of colorectal cancer. Four tumor clones were isolated from the same patient and maintained in organoid culture together with a normal colon organoid line generated from a tumor-free colon mucosal tissue biopsy from the same patient. Tumor clones T1, T3, T4, and T5 are morphologically different from normal organoids from the same patient. For more details, see Demmers, L.C., Kretzschmar, K., Van Hoeck, A. et al. Single-cell derived tumor organoids display diversity in HLA class I peptide presentation (see Nat Commun 11, 5338 (2020). https://doi.org/10.1038/s41467-020-19142-9).
按照与前述相同的管道,从JET转录物中获得的下列肽已被鉴定:Following the same pipeline as described above, the following peptides were identified from the JET transcript:
表12:Table 12:
第二组结果来自使用TMT 10-plex试剂和SPS-MS3采集器在癌细胞系百科(CCLE)中收集的375个细胞系的蛋白质组谱。有关样品来源和特性的更多信息,请参见以下出版物:Nusinow DP,Szpyt J,Ghandi M,Rose CM,McDonald ER,Kalocsay M,Jané-ValbuenaJ,Gelfand E,Schweppe DK,Jedrychowski M,Golji J,Porter DA,Rejtar T,Wang YK,Kryukov GV,Stegmeier F,Erickson BK,Garraway LA,Sellers WR,GygiSP.Quantitative Proteomics of the Cancer Cell Line Encyclopedia.Cell.2020Jan23;180(2):387-402.e16。原始文件均已上传至UCSD的MassIVE存储库。The second set of results comes from the proteomic profiles of 375 cell lines collected in the Cancer Cell Line Encyclopedia (CCLE) using TMT 10-plex reagents and SPS-MS3 acquisition. For more information on sample sources and characteristics, please see the following publications: Nusinow DP, Szpyt J, Ghandi M, Rose CM, McDonald ER, Kalocsay M, Jané-Valbuena J, Gelfand E, Schweppe DK, Jedrychowski M, Golji J, Porter DA, Rejtar T, Wang YK, Kryukov GV, Stegmeier F, Erickson BK, Garraway LA, Sellers WR, Gygi SP. Quantitative Proteomics of the Cancer Cell Line Encyclopedia. Cell. 2020 Jan 23; 180(2): 387-402.e16. The original files have been uploaded to the MassIVE repository at UCSD.
表13:Table 13:
4.3方法4.3 Methods
总蛋白质组学:Total Proteomics:
为了在蛋白质组水平上发现融合或JET(连接外显子-TE),使用Stewart等人于2019年在Cell中发表的肺原发性肿瘤的公开数据(从PRIDE数据库下载的原始文件,登录代码为PXD010357)。To discover fusions or JETs (Junctioning Exons - TEs) at the proteomic level, publicly available data of lung primary tumors published in Cell by Stewart et al. in 2019 were used (raw files downloaded from the PRIDE database with accession code PXD010357).
简而言之,从细胞系或肿瘤中获得总蛋白提取物,并且随后用胰蛋白酶消化。使用TMT用同位素标签对所得肽进行化学标记,在同一实验中用内部参考标准分析不同的样品。通过HPLC离线分馏肽,并在质谱仪上分别运行每个实验的不同级分(来自Thermo-Fisher的Orbitrap Fusion或Orbitrap Fusion Lumos)。有关实验流程和分析的更多详情,请参见相应出版物。In brief, total protein extracts were obtained from cell lines or tumors and subsequently digested with trypsin. The resulting peptides were chemically labeled with isotopic tags using TMT, and different samples were analyzed with internal reference standards in the same experiment. Peptides were fractionated offline by HPLC, and different fractions of each experiment were run separately on a mass spectrometer (Orbitrap Fusion or Orbitrap Fusion Lumos from Thermo-Fisher). For more details on the experimental procedures and analysis, please refer to the corresponding publications.
使用Proteome Discoverer 2.4(Thermo-Fisher)和Sequest-HT作为搜索引擎对来自质谱运行的原始输出文件进行问询。定制数据库用于查询质谱峰,其两者均包括Swissprot和TrEMBL经典序列,以及从不同数据集(肺TCGA和CCLE)预测的肺肿瘤特异性JET序列的计算机模拟翻译。蛋白质切割被指定为最多允许2次缺失切割的胰蛋白酶。肽FDR被设置为1%,而蛋白质FDR被允许为100%,以使我们的搜索集中于肽的研究。肽的质量耐受为4.5ppm,并且片段耐受为0.02Da。半胱氨酸的氨基甲酸乙酯化被设置为固定修饰。为了定量,TMT报告的信号通过使用MS2或MS3片段获得,并与用于肽鉴定的MS2扫描配对。根据Uniprot注释,仅保留包含与连接重叠的已鉴定的肽且涉及位于质膜中的基因的连接。Proteome Discoverer 2.4 (Thermo-Fisher) and Sequest-HT were used as search engines to query the raw output files from the mass spectrometry run. Custom databases were used to query mass spectrometry peaks, both of which included Swissprot and TrEMBL classical sequences, as well as computer-simulated translations of lung tumor-specific JET sequences predicted from different data sets (lung TCGA and CCLE). Protein cleavage was specified as trypsin with a maximum of 2 missed cleavages allowed. The peptide FDR was set to 1%, while the protein FDR was allowed to be 100% to focus our search on the study of peptides. The mass tolerance of the peptide was 4.5ppm, and the fragment tolerance was 0.02Da. The urethanization of cysteine was set to a fixed modification. For quantification, the signal reported by TMT was obtained by using MS2 or MS3 fragments and paired with MS2 scans for peptide identification. According to the Uniprot annotation, only connections containing identified peptides overlapping with the connection and involving genes located in the plasma membrane were retained.
表面富集蛋白质组学Surface enrichment proteomics
将肺腺癌细胞系H1650(ATCC)在补充胎牛血清和1%青霉素/链霉素的RPMI培养基中在37℃和5%CO2的环境中生长。使用Pierce细胞表面蛋白生物素化和分离试剂盒(Thermo Scientific,目录号A44390)将约85%融合率的1000万个细胞用于其表面蛋白质组的富集。Lung adenocarcinoma cell line H1650 (ATCC) was grown in RPMI medium supplemented with fetal bovine serum and 1% penicillin/streptomycin at 37°C and 5% CO 2. 10 million cells at approximately 85% confluence were used for enrichment of their surface proteome using the Pierce Cell Surface Protein Biotinylation and Isolation Kit (Thermo Scientific, catalog number A44390).
从贴壁细胞中去除培养基,用PBS洗涤,并使用试剂盒中提供的磺基-NHS-SS-生物素进行生物素化。在室温下培养10分钟后,去除标记溶液,并用冰冷的TBS洗涤细胞两次。将细胞随后在冰冷的TBS中破碎,在4℃下以500g离心3分钟沉淀,并使用试剂盒中提供的补充蛋白酶抑制剂的裂解缓冲液进行裂解。为了完全破碎细胞,将它们在有缓冲液的情况下在冰上培养30分钟。在4℃下以15000g离心5分钟来清除所得提取物。The culture medium was removed from the adherent cells, washed with PBS, and biotinylated using Sulfo-NHS-SS-Biotin provided in the kit. After incubation at room temperature for 10 minutes, the labeling solution was removed and the cells were washed twice with ice-cold TBS. The cells were then disrupted in ice-cold TBS, pelleted by centrifugation at 500g for 3 minutes at 4°C, and lysed using the lysis buffer supplemented with protease inhibitors provided in the kit. To completely disrupt the cells, they were incubated on ice for 30 minutes in the presence of buffer. The resulting extract was cleared by centrifugation at 15000g for 5 minutes at 4°C.
为了捕获标记的蛋白,将提取物与NeutrAvidin琼脂糖微珠在试剂盒中提供的柱子上,在端对端旋转器上,在室温下孵育30分钟。孵育后,通过离心柱子将未结合的材料丢弃。使用所提供的洗涤缓冲液清洗微珠,并立即离心以丢弃流穿液。用洗涤缓冲液共洗涤4次,随后用20mM Tris-HCl(pH 8)再洗涤3次。使用100μl洗脱缓冲液(10mM Tris-HCl(pH 8)10mM DTT)进行生物素化蛋白的洗脱,允许在室温下在端对端旋转器中与微珠孵育45分钟。将柱子以1000g离心2分钟来最终回收浓缩的蛋白。To capture the labeled protein, the extract was incubated with NeutrAvidin agarose beads on the column provided in the kit on an end-to-end rotator at room temperature for 30 minutes. After incubation, the unbound material was discarded by centrifugation of the column. The beads were washed with the provided wash buffer and immediately centrifuged to discard the flow-through. Washed 4 times with wash buffer, followed by 3 more washes with 20mM Tris-HCl (pH 8). Elution of the biotinylated protein was performed using 100 μl elution buffer (10mM Tris-HCl (pH 8) 10mM DTT), and allowed to incubate with the beads for 45 minutes at room temperature in an end-to-end rotator. The column was centrifuged at 1000 g for 2 minutes to finally recover the concentrated protein.
洗脱液用于后续分析。在黑暗中,在室温下使用5.5mM CAA将蛋白烷基化30分钟。在验证了pH值(7-9之间)后,然后在室温下,使用1∶100比例(蛋白∶酶)的胰蛋白酶对蛋白进行溶液内消化。通过用TFA酸化样品来停止胰蛋白酶化。The eluate was used for subsequent analysis. The protein was alkylated using 5.5 mM CAA for 30 minutes at room temperature in the dark. After verification of the pH value (between 7-9), the protein was then digested in solution using trypsin at a 1:100 ratio (protein: enzyme) at room temperature. Trypsinization was stopped by acidifying the sample with TFA.
使用C18材料的内部填充微柱对所得肽进行脱盐。用70%ACN0.1%TFA洗涤柱子,并用0.1%TFA平衡。将样品装载并进一步用0.1%TFA洗涤,然后用40%ACN 0.1%TFA洗脱肽。然后将清洁后的肽完全干燥,随后在Orbitrap Fusion(Thermo Scientific)中进行LC-MS/MS分析。The obtained peptides were desalted using an internally packed microcolumn of C18 material. The column was washed with 70% ACN 0.1% TFA and balanced with 0.1% TFA. The sample was loaded and further washed with 0.1% TFA, and then the peptides were eluted with 40% ACN 0.1% TFA. The cleaned peptides were then completely dried and subsequently analyzed by LC-MS/MS in an Orbitrap Fusion (Thermo Scientific).
下表14和15分别是指从融合转录物(其中外显子为供体)翻译的SEQ ID NO:1-1423的蛋白(或作为本文所用的肽的同义词)和从融合转录物(其中TE为供体)翻译的SEQID NO:1424-8202的嵌合蛋白质的详细鉴定。该(跨膜)新抗原肽的组是通过选择具有外显子序列的融合转录物获得的,所述序列在正常蛋白质组数据库(如本文的UNIPROT)中被注释为属于编码跨膜蛋白的转录物。断点列给出了外显子衍生aa序列和TE衍生aa序列之间的断点位置。每个表格的最后一列均指从同一JET(或融合)的剪接变体衍生的各种嵌合蛋白(通过其SEQ ID NO鉴定)。Tables 14 and 15 below refer to the detailed identification of proteins (or synonyms as peptides used herein) of SEQ ID NOs: 1-1423 translated from fusion transcripts (wherein exons are donors) and chimeric proteins of SEQ ID NOs: 1424-8202 translated from fusion transcripts (wherein TE is donor), respectively. This set of (transmembrane) new antigenic peptides is obtained by selecting fusion transcripts with exon sequences that are annotated as belonging to transcripts encoding transmembrane proteins in normal proteome databases (such as UNIPROT herein). The breakpoint column gives the breakpoint positions between the exon-derived aa sequence and the TE-derived aa sequence. The last column of each table refers to various chimeric proteins (identified by their SEQ ID NOs) derived from splice variants of the same JET (or fusion).
表16-18涉及元融合。元融合是两个融合的组合。Tables 16-18 refer to meta-fusions. A meta-fusion is a combination of two fusions.
例如,在下表中,元融合_id:chr 1:154709520:->chr 1:154705620:-|chr 1:15474451:->chr 1:154709564:--由作为元融合一部分的两个嵌合id构成。列的编号是指以下各项:For example, in the following table, metafusion_id:chr1:154709520:->chr1:154705620:-|chr1:15474451:->chr1:154709564:--is constructed from the two chimera ids that are part of the metafusion. The column numbers refer to the following:
表18提供了SEQ ID NO:9166-10163的元融合肽(或作为本文所用的肽的同义词)的元融合_id、转录物、ORF和名称。Table 18 provides the metafusion_id, transcript, ORF and name of the metafusion peptides of SEQ ID NOs: 9166-10163 (or synonyms as used herein for the peptides).
表19和表20分别涉及从以外显子为供体的融合转录物翻译的SEQ ID NO:10164-12830的翻译融合肽和从以TE为供体的融合转录物翻译的SEQ ID NO:12331-21452的翻译融合肽的详细鉴定。列的编号是指以下:Tables 19 and 20 relate to detailed identification of the translated fusion peptides of SEQ ID NOs: 10164-12830 translated from the fusion transcript with exon as donor and the translated fusion peptides of SEQ ID NOs: 12331-21452 translated from the fusion transcript with TE as donor, respectively. The column numbers refer to the following:
第11列(位置)给出了从相应的转录物翻译的肽的SEQ ID(在本列中称为tx),并且其与每个融合id相关联。通过将表19第11列的编号加上10163,将表20第11列的编号加上12330,可以获得SEQ ID。第12列给出了第11列中每一个翻译肽的断点位置。Column 11 (position) gives the SEQ ID of the peptide translated from the corresponding transcript (referred to as tx in this column) and is associated with each fusion id. The SEQ ID is obtained by adding 10163 to the number in column 11 of Table 19 and 12330 to the number in column 11 of Table 20. Column 12 gives the breakpoint position of each translated peptide in column 11.
表19bis和表20bis是指从表19和表20的外显子或TE供体融合物中获得的相应肽,并在这些表中提及。所述表提供了融合id、ORF、转录物名称和融合中涉及的TE名称。Table 19bis and Table 20bis refer to the corresponding peptides obtained from the exon or TE donor fusions of Table 19 and Table 20 and are mentioned in these tables. The tables provide the fusion id, ORF, transcript name and TE name involved in the fusion.
表14Table 14
表14Table 14
表14Table 14
表14Table 14
表14Table 14
表14Table 14
表14Table 14
表14Table 14
表14Table 14
表14Table 14
表14Table 14
表14Table 14
表14Table 14
表14Table 14
表14Table 14
表14Table 14
表14Table 14
表14Table 14
表14Table 14
表14Table 14
表14Table 14
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表15Table 15
表16Table 16
表16Table 16
表16Table 16
表16Table 16
表16Table 16
表17Table 17
表17Table 17
表17Table 17
表17Table 17
表17Table 17
表18Table 18
表18Table 18
表18Table 18
表18Table 18
表18Table 18
表18Table 18
表18Table 18
表18Table 18
表18Table 18
表18Table 18
表18Table 18
表18Table 18
表18Table 18
表18Table 18
表18Table 18
表18Table 18
表18Table 18
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19Table 19
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表19bisTable 19bis
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20Table 20
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
表20bisTable 20bis
序列描述:Sequence Description:
Claims (30)
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP21305296.2 | 2021-03-11 | ||
EP22305236 | 2022-03-02 | ||
EP22305236.6 | 2022-03-02 | ||
PCT/EP2022/056318 WO2022189620A1 (en) | 2021-03-11 | 2022-03-11 | Transmembrane neoantigenic peptides |
Publications (1)
Publication Number | Publication Date |
---|---|
CN118265538A true CN118265538A (en) | 2024-06-28 |
Family
ID=80786816
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202280033953.0A Pending CN118265538A (en) | 2021-03-11 | 2022-03-11 | Transmembrane neoantigenic peptides |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN118265538A (en) |
-
2022
- 2022-03-11 CN CN202280033953.0A patent/CN118265538A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20250041412A1 (en) | Transmembrane neoantigenic peptides | |
KR20230172630A (en) | Tumor Neoantigenic Peptide | |
US12152077B1 (en) | Immunotherapy targeting tumor neoantigenic peptides | |
JP6918703B2 (en) | How to treat cancer | |
EP3294333B2 (en) | Claudin-18.2-specific immunoreceptors and t cell epitopes | |
KR20230172047A (en) | Tumor neoantigen peptides and uses thereof | |
AU2018345539A1 (en) | HPV-specific binding molecules | |
US20230212243A1 (en) | Neoantigenic Epitopes Associated with SF3B1 Mutations | |
CN118265538A (en) | Transmembrane neoantigenic peptides | |
CN117597143A (en) | tumor neoantigen peptide | |
CN117440823A (en) | Tumor neoantigen peptides and their uses | |
JP2025510795A (en) | Immunotherapy targeting neo-antigenic peptides derived from tumor metastatic factors in glioblastoma | |
EP4499134A1 (en) | Immunotherapy targeting tumor transposable element derived neoantigenic peptides in glioblastoma | |
JP2024527620A (en) | Novel tumor-specific antigens for cancer stem cells and their uses |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |