CN118786207A

CN118786207A - Modified cells and their uses

Info

Publication number: CN118786207A
Application number: CN202380016609.5A
Authority: CN
Inventors: 高晓飞; 聂小千; 袁扬; 黄彦杰; 刘璇; 葛云
Original assignee: West Lake Laboratory Zhejiang Provincial Laboratory Of Life Sciences And Biomedicine; West Lake Biomedical Technology Shanghai Co ltd
Current assignee: West Lake Laboratory Zhejiang Provincial Laboratory Of Life Sciences And Biomedicine; West Lake Biomedical Technology Shanghai Co ltd
Priority date: 2022-01-12
Filing date: 2023-01-06
Publication date: 2024-10-15
Also published as: WO2023134573A1

Abstract

本公开提供了一种细胞，所述细胞具有与其连接的物质，其中所述物质通过分选酶识别基序、通过包含N‑末端甘氨酸的接头连接至细胞的至少一种膜蛋白。还提供了获得修饰的细胞的方法，以及修饰的细胞用于递送物质例如药物和探针的用途。The present disclosure provides a cell having a substance connected thereto, wherein the substance is connected to at least one membrane protein of the cell through a sortase recognition motif and a linker comprising an N-terminal glycine. Also provided are methods for obtaining modified cells, and uses of modified cells for delivering substances such as drugs and probes.

Description

Modified cells and their uses

技术领域Technical Field

本公开一般涉及修饰的细胞，更具体地涉及膜蛋白修饰的细胞及其用于递送物质的用途，所述物质包括药物和探针等，特别是用于治疗和预防与HPV感染相关的疾病的用途。The present disclosure generally relates to modified cells, and more particularly to membrane protein modified cells and their use in delivering substances, including drugs and probes, etc., especially for treating and preventing diseases associated with HPV infection.

背景技术Background Art

人乳头瘤病毒感染(HPV感染)是由人乳头瘤病毒(HPV)(属于乳头状瘤病毒科(Papillomaviridae)的DNA病毒)引起的感染。许多HPV感染没有症状，并且90％的感染会在两年内自行痊愈。然而，在某些情况下，HPV感染会持续存在并导致疣或癌前病变。根据感染的部位不同，这些病变会增加宫颈癌、外阴癌、阴道癌、阴茎癌、肛门癌、口腔癌、扁桃体癌或喉癌的风险。Human papillomavirus infection (HPV infection) is an infection caused by human papillomavirus (HPV), a DNA virus belonging to the family Papillomaviridae. Many HPV infections have no symptoms, and 90% of infections clear up on their own within two years. However, in some cases, HPV infection persists and causes warts or precancerous lesions. Depending on the site of infection, these lesions increase the risk of cervical cancer, vulvar cancer, vaginal cancer, penile cancer, anal cancer, oral cancer, tonsil cancer, or laryngeal cancer.

绝大多数的宫颈癌都是由HPV引起的；其中HPV16和HPV18这两株占所有病例的70％。将近90％的HPV阳性的口咽癌与HPV16有关。HPV6和HPV11是生殖器疣和喉乳头状瘤病的常见病因。HPV16引发了50％的宫颈癌和癌前宫颈病变。宫颈癌是全球女性中第四常见的癌症。然而，对于复发性宫颈癌患者，治疗选择有限。根据世界卫生组织(WHO)发布的全球癌症流行病学统计报告，2020年全球约有60万例宫颈癌新发病例和34万例宫颈癌死亡病例，其中约85％的新发病例和90％的死亡病例发生在发展中国家。2020年，中国有11万例宫颈癌新发病例，约占全球宫颈癌新发病例的18.3％，有6万例死亡病例，约占全球宫颈癌死亡病例的17.6％。The vast majority of cervical cancers are caused by HPV; two strains, HPV16 and HPV18, account for 70% of all cases. Nearly 90% of HPV-positive oropharyngeal cancers are associated with HPV16. HPV6 and HPV11 are common causes of genital warts and laryngeal papillomatosis. HPV16 causes 50% of cervical cancers and precancerous cervical lesions. Cervical cancer is the fourth most common cancer in women worldwide. However, for patients with recurrent cervical cancer, treatment options are limited. According to the Global Cancer Epidemiology Statistics Report released by the World Health Organization (WHO), there were approximately 600,000 new cases of cervical cancer and 340,000 deaths from cervical cancer worldwide in 2020, of which approximately 85% of new cases and 90% of deaths occurred in developing countries. In 2020, there were 110,000 new cases of cervical cancer in China, accounting for approximately 18.3% of the world's new cases of cervical cancer, and 60,000 deaths, accounting for approximately 17.6% of the world's deaths from cervical cancer.

人类有效的抗肿瘤免疫与针对癌症新抗原的T细胞的存在有关，癌症新抗原是一类由肿瘤特异性突变产生的HLA结合肽。新出现的数据表明，对此类新抗原的识别已成为临床上免疫疗法的一个主要方面。大规模并行全外显子组测序已用于检测肿瘤内的所有突变以预测新抗原。新抗原疫苗接种可以扩大已有新抗原特异性T细胞群体并诱导新的肿瘤特异性T细胞。尽管新抗原已成为抗肿瘤免疫反应的潜在理想靶标，但在临床应用实现之前，仍有许多问题尚待解答。T细胞的激活需要MHC分子向特定T细胞受体呈递MHC限制性肽。特定抗原呈递系统的缺乏是新抗原疫苗接种的问题之一。Effective anti-tumor immunity in humans is associated with the presence of T cells targeting cancer neoantigens, a class of HLA-bound peptides generated by tumor-specific mutations. Emerging data suggest that the identification of such neoantigens has become a major aspect of immunotherapy in the clinic. Massively parallel whole-exome sequencing has been used to detect all mutations within tumors to predict neoantigens. Neoantigen vaccination can expand existing neoantigen-specific T cell populations and induce new tumor-specific T cells. Although neoantigens have become potential ideal targets for anti-tumor immune responses, many questions remain to be answered before clinical applications can be achieved. The activation of T cells requires MHC molecules to present MHC-restricted peptides to specific T cell receptors. The lack of specific antigen presentation systems is one of the problems with neoantigen vaccination.

近年来，用于延长在治疗多种人疾病中药物保留时间的药物递送系统的开发引起了广泛关注。然而，许多系统仍然面临各种挑战和限制，例如稳定性差、不想要的毒性和免疫反应[J.W.Yoo,D.J.Irvine,D.E.Discher,and S.Mitragotri,“Bio-inspired,bioengineered and biomimetic drug delivery carriers,”Nat.Rev.Drug Discov.,vol.10,no.7,pp.521-535,2011.]。红细胞(RBC)是人体内最常见的细胞类型，三十多年来作为理想的体内药物递送系统已被广泛研究，原因是其独特的生物学特性，包括：(i)循环范围广泛遍及全身；(ii)作为生物材料具有良好的生物相容性，体内存活时间长；(iii)大的表面积与体积比；(iv)没有细胞核、线粒体和其他细胞器。In recent years, the development of drug delivery systems for extending drug retention time in the treatment of various human diseases has attracted widespread attention. However, many systems still face various challenges and limitations, such as poor stability, unwanted toxicity and immune response [J.W.Yoo,D.J.Irvine,D.E.Discher,and S.Mitragotri, "Bio-inspired, bioengineered and biomimetic drug delivery carriers," Nat.Rev.Drug Discov.,vol.10,no.7,pp.521-535,2011.]. Red blood cells (RBCs) are the most common cell type in the human body and have been widely studied as ideal in vivo drug delivery systems for more than 30 years due to their unique biological properties, including: (i) wide circulation throughout the body; (ii) good biocompatibility as a biomaterial and long survival time in vivo; (iii) large surface area to volume ratio; (iv) no nucleus, mitochondria and other organelles.

通过直接封装、外源肽的非共价连接、或通过与RBC表面蛋白特异性抗体的融合来安装蛋白质，RBC已被开发为药物递送载体。已经证明这种修饰的RBC的体内应用存在局限性。例如，封装会破坏细胞膜，从而影响仅工程改造的细胞的体内存活率。此外，聚合物颗粒与RBC的非共价连接容易解离，有效载荷将很快在体内降解。RBCs have been developed as drug delivery vehicles by direct encapsulation, non-covalent attachment of exogenous peptides, or by fusion with RBC surface protein-specific antibodies to install proteins. It has been shown that the in vivo application of such modified RBCs has limitations. For example, encapsulation can destroy the cell membrane, thereby affecting the in vivo survival rate of only engineered cells. In addition, the non-covalent attachment of polymer particles to RBCs is easily dissociated, and the payload will soon degrade in vivo.

细菌分选酶是能够以共价和位点特异性方式修饰蛋白质的转肽酶[J.M.Antos,J.Ingram,T.Fang,N.Pishesha,M.C.Truttmann,and H.L.Ploegh,“Site-SpecificProtein Labeling via Sortase-Mediated transpeptidation,”2017.]。来自金黄色葡萄球菌(Staphylococcus aureus)的野生型分选酶A(wt SrtA)识别LPXTG基序，并在苏氨酸和甘氨酸之间进行切割，在酶和底物蛋白之间形成共价酰基酶中间体。该中间体通过N-末端通常具有三个连续甘氨酸残基(3×甘氨酸，G₃)的肽或蛋白质的亲核攻击而解离。先前的研究已经在RBC上基因过表达了KELL膜蛋白，该膜蛋白在其C-末端具有LPXTG基序，通过使用wt SrtA，可以将其连接到3×甘氨酸或G_(n≥3)修饰的蛋白质/肽的N-末端[J.Shi,L.Kundrat,N.Pishesha,A.Bilate,C.Theile,and T.Maruyama,“Engineered red bloodcells as carriers for systemic delivery of a wide array of functionalprobes,”pp.1-6,2014.]。这些携带药物的RBC在动物模型中显示出治疗疾病的功效。然而，这需要工程改造造血干细胞或祖细胞(HSPC)并将这些细胞分化为成熟RBC的步骤，这极大地限制了应用。Bacterial sortases are transpeptidases that can modify proteins in a covalent and site-specific manner [JM Antos, J. Ingram, T. Fang, N. Pishesha, M.C. Truttmann, and H.L. Ploegh, "Site-Specific Protein Labeling via Sortase-Mediated transpeptidation," 2017.]. Wild-type sortase A (wt SrtA) from Staphylococcus aureus recognizes the LPXTG motif and cleaves between threonine and glycine to form a covalent acylase intermediate between the enzyme and the substrate protein. This intermediate is dissociated by nucleophilic attack by peptides or proteins that typically have three consecutive glycine residues (3×glycine, G ₃ ) at the N-terminus. Previous studies have genetically overexpressed the KELL membrane protein on RBCs, which has an LPXTG motif at its C-terminus, which can be linked to the N-terminus of 3×glycine or G _(n≥3) modified proteins/peptides by using wt SrtA [J. Shi, L. Kundrat, N. Pishesha, A. Bilate, C. Theile, and T. Maruyama, "Engineered red blood cells as carriers for systemic delivery of a wide array of functional probes," pp. 1-6, 2014.]. These drug-carrying RBCs have shown efficacy in treating diseases in animal models. However, this requires the steps of engineering hematopoietic stem cells or progenitor cells (HSPCs) and differentiating these cells into mature RBCs, which greatly limits the application.

因此，在本领域仍然需要一种改进的细胞递送系统用于治疗与HPV感染相关的疾病。Therefore, there remains a need in the art for an improved cell delivery system for treating diseases associated with HPV infection.

发明内容Summary of the invention

在第一方面，本公开提供了一种细胞，所述细胞具有与其连接的物质，其中所述物质通过分选酶识别基序与所述细胞的至少一种膜蛋白连接，并且连接到至少一种膜蛋白的物质包含以下结构：A-M-L-Gly_mX_n-M-P，其中A代表物质，L代表分选酶介导的反应后分选酶识别基序的残余部分，Gly_m代表m个甘氨酸，其中m优选为1-5，X_n代表n个间隔氨基酸，其中n优选为0-10，M代表交联后双功能交联剂的残余部分，P代表细胞的至少一种膜蛋白。In a first aspect, the present disclosure provides a cell having a substance attached thereto, wherein the substance is attached to at least one membrane protein of the cell via a sortase recognition motif, and the substance attached to the at least one membrane protein comprises the following structure: AML-Gly _m X _n -MP, wherein A represents the substance, L represents the remainder of the sortase recognition motif after a sortase-mediated reaction, Gly _m represents m glycines, wherein m is preferably 1-5, X _n represents n spacer amino acids, wherein n is preferably 0-10, M represents the remainder of a bifunctional cross-linker after cross-linking, and P represents at least one membrane protein of the cell.

在一些实施方案中，双功能交联剂是胺-巯基型，优选马来酰亚胺羧酸(C_2-8)，例如，6-马来酰亚胺己酸和4-马来酰亚胺丁酸。In some embodiments, the bifunctional crosslinker is of the amine-thiol type, preferably maleimidocarboxylic acid (C _2-8 ), for example, 6-maleimidocaproic acid and 4-maleimidobutyric acid.

在一些实施方案中，双功能交联剂使所述侧链氨基与至少一种膜蛋白的至少一个暴露的巯基形成交联。In some embodiments, the bifunctional cross-linker forms a cross-link between the side chain amino group and at least one exposed sulfhydryl group of at least one membrane protein.

在一些实施方案中，分选酶识别基序包含、或基本上由、或由选自LPXTG、LPXAG、LPXSG、LPXLG、LPXVG、LGXTG、LAXTG、LSXTG、NPXTG、MPXTG、IPXTG、SPXTG、VPXTG、YPXRG、LPXTS和LPXTA的氨基酸序列组成，其中X是任何氨基酸。In some embodiments, the sortase recognition motif comprises, consists essentially of, or consists of an amino acid sequence selected from the group consisting of LPXTG, LPXAG, LPXSG, LPXLG, LPXVG, LGXTG, LAXTG, LSXTG, NPXTG, MPXTG, IPXTG, SPXTG, VPXTG, YPXRG, LPXTS, and LPXTA, wherein X is any amino acid.

在一些实施方案中，分选酶识别基序包含位于分选酶识别基序的N-末端至C-末端方向第5位的非天然氨基酸，其中所述非天然氨基酸是任选取代的羟基羧酸，其具有式CH₂OH-(CH₂)_n-COOH，其中n为0至3的整数，优选n＝0。In some embodiments, the sortase recognition motif comprises an unnatural amino acid at position 5 from the N-terminus to the C-terminus of the sortase recognition motif, wherein the unnatural amino acid is an optionally substituted hydroxycarboxylic acid having the formula _CH2OH- ( _CH2 ) _n -COOH, wherein n is an integer from 0 to 3, preferably n=0.

在一些实施方案中，分选酶识别基序包含、或基本上由、或由选自LPXT*Y、LPXA*Y、LPXS*Y、LPXL*Y、LPXV*Y、LGXT*Y、LAXT*Y、LSXT*Y、NPXT*Y、MPXT*Y、IPXT*Y、SPXT*Y、VPXT*Y和YPXR*Y的氨基酸序列组成，其中*代表任选取代的羟基羧酸；并且X和Y独立地代表任何氨基酸。In some embodiments, the sortase recognition motif comprises, consists essentially of, or consists of an amino acid sequence selected from the group consisting of LPXT*Y, LPXA*Y, LPXS*Y, LPXL*Y, LPXV*Y, LGXT*Y, LAXT*Y, LSXT*Y, NPXT*Y, MPXT*Y, IPXT*Y, SPXT*Y, VPXT*Y, and YPXR*Y, wherein * represents an optionally substituted hydroxycarboxylic acid; and X and Y independently represent any amino acid.

在一些实施方案中，分选酶识别基序包含、或基本上由、或由选自LPXT*G、LPXA*G、LPXS*G、LPXL*G、LPXV*G、LGXT*G、LAXT*G、LSXT*G、NPXT*G、MPXT*G、IPXT*G、SPXT*G、VPXT*G、YPXR*G、LPXT*S和LPXT*A的氨基酸序列组成，其中M优选为LPET*G，*为2-羟基乙酸。In some embodiments, the sortase recognition motif comprises, consists essentially of, or consists of an amino acid sequence selected from the group consisting of LPXT*G, LPXA*G, LPXS*G, LPXL*G, LPXV*G, LGXT*G, LAXT*G, LSXT*G, NPXT*G, MPXT*G, IPXT*G, SPXT*G, VPXT*G, YPXR*G, LPXT*S, and LPXT*A, wherein M is preferably LPET*G and * is 2-hydroxyacetic acid.

在一些实施方案中，L选自LPXT、LPXA、LPXS、LPXL、LPXV、LGXT、LAXT、LSXT、NPXT、MPXT、IPXT、SPXT、VPXT和YPXR，其中X是任何氨基酸。In some embodiments, L is selected from the group consisting of LPXT, LPXA, LPXS, LPXL, LPXV, LGXT, LAXT, LSXT, NPXT, MPXT, IPXT, SPXT, VPXT, and YPXR, wherein X is any amino acid.

在一些实施方案中，物质A包含暴露的巯基，优选暴露的半胱氨酸，更优选末端半胱氨酸，最优选C末端半胱氨酸。In some embodiments, substance A comprises an exposed sulfhydryl group, preferably an exposed cysteine, more preferably a terminal cysteine, most preferably a C-terminal cysteine.

在一些实施方案中，物质从N-末端至C-末端包含HPV抗原肽、任选的第一肽接头和MHC-I。在一些另外的实施方案中，物质从N-末端至C-末端包含HPV抗原肽、任选的第一肽接头、β2-微球蛋白(例如人β2-微球蛋白)、任选的第二肽接头和MHC-I重链(优选缺失跨膜区和胞质区的MHC-I重链)。In some embodiments, the substance comprises an HPV antigen peptide, an optional first peptide linker, and MHC-I from the N-terminus to the C-terminus. In some other embodiments, the substance comprises an HPV antigen peptide, an optional first peptide linker, β2-microglobulin (e.g., human β2-microglobulin), an optional second peptide linker, and an MHC-I heavy chain (preferably an MHC-I heavy chain lacking a transmembrane region and a cytoplasmic region) from the N-terminus to the C-terminus.

在一些实施方案中，HPV抗原肽与主要组织相容性复合物Ⅰ类(MHC-I)蛋白结合，如HLA-A*02:01、HLA-A*02:02、HLA-A*02:06、HLA-A*02:07和HLA-A*02:11。In some embodiments, the HPV antigen peptide binds to major histocompatibility complex class I (MHC-I) proteins, such as HLA-A*02:01, HLA-A*02:02, HLA-A*02:06, HLA-A*02:07, and HLA-A*02:11.

在一些实施方案中，HPV抗原肽是HPV16抗原肽，优选包含或由选自SEQ ID NO:9(YMLDLQPET)、SEQ ID NO:10(LLMGTLGIV)和SEQ ID NO:11(KCLKFYSKI)的氨基酸序列组成，或其功能等效的变体。In some embodiments, the HPV antigen peptide is an HPV16 antigen peptide, preferably comprising or consisting of an amino acid sequence selected from SEQ ID NO: 9 (YMLDLQPET), SEQ ID NO: 10 (LLMGTLGIV) and SEQ ID NO: 11 (KCLKFYSKI), or functionally equivalent variants thereof.

在一些实施方案中，分选酶是分选酶A(SrtA)，例如，金黄色葡萄球菌转肽酶A，例如，金黄色葡萄球菌转肽酶A变体(mgSrtA)。In some embodiments, the sortase is sortase A (SrtA), e.g., Staphylococcus aureus transpeptidase A, e.g., Staphylococcus aureus transpeptidase A variant (mgSrtA).

在一些实施方案中，细胞选自去核细胞，优选红细胞。In some embodiments, the cells are selected from enucleated cells, preferably erythrocytes.

在第二方面，本公开提供了一种修饰细胞的方法，包括：In a second aspect, the present disclosure provides a method of modifying a cell, comprising:

a)提供包含A-M-L'结构的分选酶底物，其中A代表物质，L'代表分选酶识别基序，M代表交联后双功能交联剂的残余部分；a) providing a sortase substrate comprising an A-M-L' structure, wherein A represents a substance, L' represents a sortase recognition motif, and M represents a residual portion of a bifunctional cross-linker after cross-linking;

b)(i)提供具有式Gly_mX_n-M'的肽，其中Gly_m代表m个甘氨酸，m优选为1-5；X_n代表n个间隔氨基酸，n优选为0-10；M'代表双功能交联剂；b) (i) providing a peptide having the formula Gly _m X _n -M', wherein Gly _m represents m glycine, m is preferably 1-5; X _n represents n spacer amino acids, n is preferably 0-10; M' represents a bifunctional cross-linking agent;

(ii)在使Gly_mX_n-M'肽与细胞的至少一种膜蛋白连接从而生成Gly_mX_n-M-P的条件下，用Gly_mX_n-M'肽处理细胞，其中M代表交联后双功能交联剂的残余部分，P代表所述细胞的至少一种膜蛋白；和(ii) treating the cell with a _GlymXn - _M ' peptide, wherein M represents the remaining part of the bifunctional cross-linking agent after cross-linking and P represents at least one membrane protein of the cell, under conditions such that the _GlymXn _- M' peptide is linked to at least one membrane protein of the cell to generate _GlymXn _- MP; and

c)在一种或多种适合于分选酶通过分选酶介导的反应将分选酶底物缀合至Gly_m的条件下，将处理的细胞与分选酶底物在分选酶存在下进行接触；c) contacting the treated cells with a sortase substrate in the presence of a sortase under one or more conditions suitable for the sortase to conjugate the sortase substrate to Gly _m via a sortase-mediated reaction;

a)和b)可以同时，或者a)在b)之前，或者a)在b)之后进行。a) and b) can be performed simultaneously, or a) can be performed before b), or a) can be performed after b).

在一些实施方案中，在处理步骤之前，所述方法进一步包括用还原剂对细胞进行预处理以形成暴露的巯基的步骤。In some embodiments, prior to the treating step, the method further comprises the step of pre-treating the cells with a reducing agent to form exposed sulfhydryl groups.

在一些实施方案中，X_n包含至少一个具有侧链氨基的氨基酸，例如赖氨酸，并且优选地，X_n的C-末端氨基酸是具有侧链氨基的氨基酸。In some embodiments, _Xn comprises at least one amino acid having a side chain amino group, such as lysine, and preferably, the C-terminal amino acid of _Xn is an amino acid having a side chain amino group.

在一些实施方案中，双功能交联剂交联所述侧链氨基和至少一种膜蛋白的至少一个暴露的巯基。In some embodiments, the bifunctional cross-linker cross-links the side chain amino group and at least one exposed thiol group of at least one membrane protein.

在一些实施方案中，分选酶介导的反应形成连接到细胞的至少一种膜蛋白的物质，所述物质包含以下结构：A-M-L-Gly_mX_n-M-P，其中A代表物质，L代表分选酶介导反应后分选酶识别基序的残留部分，Gly_m代表m个甘氨酸，m优选为1-5，X_n代表n个间隔氨基酸，n优选为0-10，M代表交联后双功能交联剂的残留部分，P代表细胞的至少一种膜蛋白。In some embodiments, the sortase-mediated reaction forms a substance attached to at least one membrane protein of a cell, the substance comprising the following structure: AML-Gly _m X _n -MP, wherein A represents the substance, L represents the remaining portion of the sortase recognition motif after the sortase-mediated reaction, Gly _m represents m glycines, m is preferably 1-5, X _n represents n spacer amino acids, n is preferably 0-10, M represents the remaining portion of the bifunctional cross-linker after cross-linking, and P represents at least one membrane protein of the cell.

在第三方面，本公开提供了通过第二方面的方法获得的细胞。In a third aspect, the present disclosure provides a cell obtained by the method of the second aspect.

在第四方面，本公开提供了包含第一和/或第三方面的细胞和任选地生理学上可接受的载体的组合物。In a fourth aspect, the present disclosure provides a composition comprising the cell of the first and/or third aspect and optionally a physiologically acceptable carrier.

在第五方面，本公开提供了用于诊断、治疗或预防与HPV感染相关的疾病的方法。In a fifth aspect, the present disclosure provides methods for diagnosing, treating or preventing diseases associated with HPV infection.

在一些实施方案中，与HPV感染相关的疾病选自宫颈癌、肛门生殖器癌症、头颈癌和口咽癌。In some embodiments, the disease associated with HPV infection is selected from cervical cancer, anogenital cancer, head and neck cancer, and oropharyngeal cancer.

在第六方面，本公开提供了将物质递送至有需要的受试者的方法，包括向受试者施用第一和/或第三方面的细胞或第四方面的组合物。In a sixth aspect, the present disclosure provides a method of delivering a substance to a subject in need thereof, comprising administering the cell of the first and/or third aspect or the composition of the fourth aspect to the subject.

在第七方面，本公开提供了一种增加受试者中物质的循环时间或血浆半衰期的方法，包括根据第二方面的方法将物质与细胞连接。In a seventh aspect, the present disclosure provides a method of increasing the circulation time or plasma half-life of a substance in a subject, comprising linking the substance to a cell according to the method of the second aspect.

在第八方面，本发明提供了第一和/或第三方面的细胞或第四方面的组合物在制备用于诊断、治疗或预防与HPV感染相关的疾病的药物中的用途。In an eighth aspect, the present invention provides use of the cell of the first and/or third aspect or the composition of the fourth aspect in the preparation of a medicament for diagnosing, treating or preventing a disease associated with HPV infection.

在一些实施方案中，药物是疫苗。In some embodiments, the medicament is a vaccine.

在第九方面，提供了第一和/或第三方面的细胞或第四方面的组合物用于在有需要的受试者中诊断、治疗或预防与HPV感染相关的疾病。In a ninth aspect, there is provided the cell of the first and/or third aspect or the composition of the fourth aspect for use in diagnosing, treating or preventing a disease associated with HPV infection in a subject in need thereof.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

在附图中，通过示例的方式示出了本公开的实施方案。应当明确地理解，说明书和附图仅用于说明和帮助理解本公开的目的，而不旨在作为对本发明限制的定义。In the accompanying drawings, embodiments of the present disclosure are shown by way of example. It should be clearly understood that the description and the accompanying drawings are only used to illustrate and help understand the present disclosure, and are not intended to be a definition of the limitations of the present invention.

图1说明了根据本公开在体外用肽标记细胞的示例性方法。FIG. 1 illustrates an exemplary method for labeling cells with peptides in vitro according to the present disclosure.

图2显示根据本公开的甘氨酸GGGSK-6-MAL接头的结构式。FIG. 2 shows the structural formula of the glycine GGGSK-6-MAL linker according to the present disclosure.

图3显示根据本公开的一个示例性实施方案，HPV16(YMLDLQPET)-hMHC1在体外有效地缀合到RBC上。FIG. 3 shows that HPV16(YMLDLQPET)-hMHC1 is efficiently conjugated to RBCs in vitro according to an exemplary embodiment of the present disclosure.

图4显示通过IFN-γELISpot法检测HPV16(YMLDLQPET)-hMHC1-RBC在患者PBMC中诱导的IFN-γ反应。FIG. 4 shows the IFN-γ response induced by HPV16(YMLDLQPET)-hMHC1-RBC in patient PBMCs detected by IFN-γ ELISpot method.

图5显示通过CD8+T细胞表面荧光素缀合的MHC四聚体(A)4-1BB和CD107a(B)的表达来检测由HPV16(YMLDLQPET)-hMHC1-RBC产生的免疫反应的细胞毒性潜力。FIG5 shows the cytotoxic potential of the immune response generated by HPV16(YMLDLQPET)-hMHC1-RBCs detected by the expression of fluorescein-conjugated MHC tetramers (A) 4-1BB and CD107a (B) on the surface of CD8+ T cells.

图6显示T细胞细胞毒性实验后Caski细胞的存活情况。FIG6 shows the survival of Caski cells after T cell cytotoxicity assay.

图7显示T细胞细胞毒性实验后HPV16-MC38细胞的存活情况(A：不同时间点；B：来自不同小鼠的脾脏细胞)。FIG. 7 shows the survival of HPV16-MC38 cells after T cell cytotoxicity assay (A: different time points; B: spleen cells from different mice).

图8显示接受HPV16(KCLKFYSKI)-mMHC1-RBC反复输注的动物体内肿瘤生长情况。FIG8 shows tumor growth in animals receiving repeated infusions of HPV16(KCLKFYSKI)-mMHC1-RBCs.

图9显示6-马来酰亚胺己酸-Leu-Pro-Glu-Thr-2-羟基乙酸-Gly的结构式。FIG. 9 shows the structural formula of 6-maleimidocaproic acid-Leu-Pro-Glu-Thr-2-hydroxyacetic acid-Gly.

具体实施方式DETAILED DESCRIPTION

为了促进对本公开原理的理解，现在将参考附图中示出的实施方案，并且将使用特定语言来描述这些实施方案。然而，应当理解的是，并不旨在限制本公开的范围。For the purposes of promoting an understanding of the principles of the present disclosure, reference will now be made to the embodiments illustrated in the drawings, and specific language will be used to describe these embodiments. It will be understood, however, that no limitation of the scope of the present disclosure is intended.

在本公开中，除非另有说明，本文中使用的科学和技术术语具有本领域技术人员通常理解的含义。尽管与本文描述的那些方法和材料类似或等同的任何方法和材料都可用于本发明的实践，但是本文描述了优选的方法和材料。因此，通过参考整个说明书来更全面地描述本文定义的术语。In the present disclosure, unless otherwise indicated, the scientific and technical terms used herein have the meanings commonly understood by those skilled in the art. Although any methods and materials similar or equivalent to those described herein can be used in the practice of the present invention, preferred methods and materials are described herein. Therefore, the terms defined herein are more fully described by reference to the entire specification.

如本文所用的单数术语一(a)、一种(an)和该(the)包括复数引用，除非上下文另外明确指出。As used herein, the singular terms a, an, and the include plural references unless the context clearly dictates otherwise.

除非另有说明，否则核酸以5'至3'的方向从左至右书写；氨基酸序列以氨基至羧基的方向从左至右书写。Unless otherwise indicated, nucleic acids are written left to right in 5' to 3' orientation; amino acid sequences are written left to right in amino to carboxy orientation.

应当理解，本发明不限于所描述的具体方法、方案和试剂，因为这些可以根据本领域技术人员使用其具体情境而变化。It is to be understood that this invention is not limited to the particular methodology, protocols, and reagents described, as these may vary depending upon the specific context in which it is used by one skilled in the art.

如本文所用，在氨基酸序列的上下文中，术语“主要由......组成”是指所列举的氨基酸序列以及其N-或C-末端处的额外的1、2、3、4或5个氨基酸。As used herein, in the context of an amino acid sequence, the term "consisting essentially of" refers to the recited amino acid sequence plus an additional 1, 2, 3, 4 or 5 amino acids at the N- or C-terminus thereof.

除非上下文另有要求，否则术语“包括”、“包含”和“含有”或类似术语旨在表示非排他性的包含，使得所列举的元素或特征不是仅仅包括那些陈述或列出的元素，而可以包括未列出或陈述的其他元素或特征。Unless the context requires otherwise, the terms "includes," "comprising," and "including" or similar terms are intended to mean a non-exclusive inclusion such that the listed elements or features include not only those stated or listed elements but may include other elements or features not listed or stated.

如本文所用，术语“患者”、“个体”和“受试者”在本文公开的治疗或组合物的任何哺乳动物接受者的上下文中使用。因此，本文公开的方法和组合物可具有医学和/或兽医应用。在优选的形式中，哺乳动物是人。As used herein, the terms "patient," "individual," and "subject" are used in the context of any mammalian recipient of the treatments or compositions disclosed herein. Thus, the methods and compositions disclosed herein may have medical and/or veterinary applications. In a preferred form, the mammal is a human.

如本文所用，术语“序列同一性”旨在包括精确的核苷酸或氨基酸匹配的数目，考虑了使用标准算法的适当比对、考虑了序列在比较窗口上相同的程度。因此，通过以下计算“序列同一性百分比”：在比较窗口上比较两个最佳比对序列，确定在两个序列中出现相同核酸碱基(例如，A、T、C、G)的位置数目，以产生匹配位置的数目，用匹配位置的数目除以比较窗口中的总位置数(即，窗口大小)，再将结果乘以100，得出序列同一性的百分比。例如，“序列同一性”可以理解为表示由DNASIS计算机程序(Windows版本2.5；可从美国加利福尼亚州南旧金山的日立软件工程有限公司获得)计算的“匹配百分比”。As used herein, the term "sequence identity" is intended to include the number of accurate nucleotide or amino acid matches, considering the appropriate comparison using standard algorithms, considering the degree of sequence identical on the comparison window. Therefore, by the following calculation "sequence identity percentage ratio": compare two optimal alignment sequences on the comparison window, determine the number of positions where identical nucleic acid bases (for example, A, T, C, G) occur in the two sequences, to produce the number of matching positions, divide the total number of positions in the comparison window (that is, window size) by the number of matching positions, then multiply the result by 100, and obtain the percentage ratio of sequence identity. For example, "sequence identity" can be understood as representing the "matching percentage ratio" calculated by DNASIS computer program (Windows version 2.5; Available from Hitachi Software Engineering Co., Ltd. in South San Francisco, California, USA).

如本文所用，术语“缀合的”或“缀合”是指两个分子(例如，两个蛋白质或蛋白质和小分子或其他实体)彼此缔合，其通过直接或间接共价或非共价相互作用的方式连接。As used herein, the term "conjugated" or "conjugation" refers to the association of two molecules (eg, two proteins or a protein and a small molecule or other entity) with each other, which are linked by means of direct or indirect covalent or non-covalent interactions.

本发明人在此开发了一种新策略，以通过分选酶介导的反应用物质(例如肽和/或小分子)修饰细胞，例如血细胞，尤其是天然RBC。该技术可以通过直接修饰天然细胞(例如RBC)而不是受资源限制的HSPC来生产细胞产品，标记效率非常高。此外，经修饰的细胞很好地保留了其原始的生物学特性，并保持与天然状态一样稳定。在一些实施方案中，标记的红细胞具有与天然RBC相同的寿命并且在循环中持续信号长达28天或更长时间。The inventors have developed a new strategy to modify cells, such as blood cells, especially natural RBCs, with substances (e.g., peptides and/or small molecules) mediated by sortase reactions. This technology can produce cell products by directly modifying natural cells (e.g., RBCs) rather than HSPCs that are limited by resources, and the labeling efficiency is very high. In addition, the modified cells retain their original biological properties well and remain as stable as the natural state. In some embodiments, the labeled red blood cells have the same lifespan as natural RBCs and continue to signal in the circulation for up to 28 days or longer.

细胞cell

一方面，本公开提供了具有与其连接的物质的细胞，其中所述物质通过分选酶识别基序连接至细胞的至少一种膜蛋白。为了提高分选酶介导的反应的标记效率，将包含N-末端甘氨酸的接头通过双功能交联剂(例如，胺-巯基型双功能交联剂)与细胞的至少一种膜蛋白缀合，优选与至少一种膜蛋白的至少一个暴露的巯基缀合。In one aspect, the disclosure provides a cell having a substance attached thereto, wherein the substance is attached to at least one membrane protein of the cell via a sortase recognition motif. In order to improve the labeling efficiency of the sortase-mediated reaction, a linker comprising an N-terminal glycine is conjugated to at least one membrane protein of the cell via a bifunctional crosslinker (e.g., an amine-thiol type bifunctional crosslinker), preferably to at least one exposed thiol group of at least one membrane protein.

本公开内容涵盖各种活的动物细胞，例如哺乳动物细胞，例如各种血细胞，包括红细胞、T细胞、B细胞、单核细胞、NK细胞和巨核细胞。在一些实施方案中，所述动物细胞是哺乳动物细胞，例如人细胞。在一些实施方案中，所述细胞是免疫系统细胞，例如淋巴细胞(例如T细胞或NK细胞)或树突细胞。在一些实施方案中，所述细胞是细胞毒性细胞。在一些实施方案中，所述细胞是非永生化细胞。在一些实施方案中，所述细胞是原代细胞。在一些实施方案中，所述细胞是天然细胞。在一些实施方案中，所述细胞未被基因工程改造以表达包含分选酶识别序列、在其末端包含一个或多个甘氨酸或丙氨酸的序列、或两者的多肽。The present disclosure encompasses various living animal cells, such as mammalian cells, such as various blood cells, including erythrocytes, T cells, B cells, monocytes, NK cells and megakaryocytes. In some embodiments, the animal cell is a mammalian cell, such as a human cell. In some embodiments, the cell is an immune system cell, such as a lymphocyte (such as a T cell or a NK cell) or a dendritic cell. In some embodiments, the cell is a cytotoxic cell. In some embodiments, the cell is a non-immortalized cell. In some embodiments, the cell is a primary cell. In some embodiments, the cell is a natural cell. In some embodiments, the cell is not genetically engineered to express a polypeptide comprising a sortase recognition sequence, a sequence comprising one or more glycine or alanine at its end, or both.

在一些实施方案中，细胞是成熟红细胞(RBC)。在某些实施方案中，所述RBC是人RBC，例如人天然RBC。在一些实施方案中，所述RBC是未经基因工程改造以表达包含分选酶识别基序或亲核受体序列的蛋白质的红细胞。在一些实施方案中，所述RBC未经基因工程改造。In some embodiments, the cell is a mature red blood cell (RBC). In certain embodiments, the RBC is a human RBC, such as a human natural RBC. In some embodiments, the RBC is a red blood cell that has not been genetically engineered to express a protein that contains a sortase recognition motif or a nucleophilic receptor sequence. In some embodiments, the RBC is not genetically engineered.

如本文所用，术语“非工程改造”、“非基因修饰”和“非重组”是可互换的，并且是指未经基因工程改造、不存在基因修饰等。非工程改造膜蛋白涵盖内源蛋白。在某些实施方案中，非基因工程改造红细胞不含有非内源核酸，例如源自载体、来自不同物种、或包含人工序列的DNA或RNA，例如人为引入的DNA或RNA。在某些实施方案中，未有意将非工程改造细胞与能够在适合于细胞摄取核酸的条件下引起可遗传的遗传改变的核酸接触。As used herein, the terms "non-engineered", "non-genetically modified" and "non-recombinant" are interchangeable and refer to not being genetically engineered, the absence of genetic modification, etc. Non-engineered membrane proteins encompass endogenous proteins. In certain embodiments, non-genetically engineered erythrocytes do not contain non-endogenous nucleic acids, such as DNA or RNA derived from a vector, from a different species, or containing artificial sequences, such as DNA or RNA introduced artificially. In certain embodiments, non-engineered cells are not intentionally contacted with nucleic acids that can cause heritable genetic changes under conditions suitable for cellular uptake of nucleic acids.

在一些实施方案中，细胞未经基因工程改造以用于分选。如果细胞未经基因工程改造以表达包含在分选酶催化反应中分选酶识别基序或亲核受体序列的蛋白质，则认为细胞“未经基因工程改造以用于分选”。In some embodiments, the cell is not genetically engineered for sorting. A cell is considered "not genetically engineered for sorting" if it is not genetically engineered to express a protein comprising a sortase recognition motif or a nucleophilic receptor sequence in a sortase catalyzed reaction.

在一些方面，本公开提供了一种细胞，所述细胞具有与其连接的物质，其中所述物质通过分选酶识别基序与所述细胞的至少一种膜蛋白连接，并且所述与至少一种膜蛋白连接的物质包含以下结构：A-M-L-Gly_mX_n-M-P，其中A代表物质，L代表分选酶介导反应后分选酶识别基序的残留部分，Gly_m代表m个甘氨酸，m优选为1-5，X_n代表n个间隔氨基酸，n优选为0-10，M代表交联后双功能交联剂的残留部分，P代表细胞的至少一种膜蛋白。In some aspects, the present disclosure provides a cell having a substance attached thereto, wherein the substance is attached to at least one membrane protein of the cell via a sortase recognition motif, and the substance attached to at least one membrane protein comprises the following structure: AML-Gly _m X _n -MP, wherein A represents the substance, L represents the remaining portion of the sortase recognition motif after a sortase-mediated reaction, Gly _m represents m glycines, m is preferably 1-5, X _n represents n spacer amino acids, n is preferably 0-10, M represents the remaining portion of a bifunctional cross-linker after cross-linking, and P represents at least one membrane protein of the cell.

在一些实施方案中，如本文所述，本公开提供了具有与其连接的物质的细胞。在一些实施方案中，提供了包含多个此类细胞的组合物。在一些实施方案中，组合物中至少选定百分比的细胞被修饰，例如，具有与细胞的至少一种膜蛋白连接的物质。例如，在一些实施方案中，至少5％、10％、15％、20％、25％、30％、40％、50％、60％、70％、80％、90％、95％、96％、97％、98％、99％或更多的细胞具有与其连接的物质。在一些实施方案中，连接的物质可以是本文描述的一种或多种物质。In some embodiments, as described herein, the disclosure provides cells with substances connected thereto. In some embodiments, compositions comprising a plurality of such cells are provided. In some embodiments, at least a selected percentage of cells in the composition is modified, for example, with substances connected to at least one membrane protein of the cell. For example, in some embodiments, at least 5%, 10%, 15%, 20%, 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or more of the cells have substances connected thereto. In some embodiments, the substances connected may be one or more substances described herein.

在一些实施方案中，本公开提供了一种包含物质的细胞，所述物质通过使用分选酶识别基序和包含末端甘氨酸例如N-末端甘氨酸的接头(例如，具有Gly_mX_n的结构的接头，其中Gly_m代表m个甘氨酸，其中m优选为1-5，并且X_n代表n个间隔氨基酸，其中n优选为0-10)与细胞的膜蛋白连接。在一些实施方案中，所连接的物质可以是相同的，或者细胞可以用多种不同物质进行分选。In some embodiments, the present disclosure provides a cell comprising a substance, wherein the substance is connected to a membrane protein of the cell by using a sortase recognition motif and a linker comprising a terminal glycine, such as an N-terminal glycine (e.g., a linker having a structure of Gly _m X _n , wherein Gly _m represents m glycines, wherein m is preferably 1-5, and X _n represents n spacer amino acids, wherein n is preferably 0-10). In some embodiments, the connected substances can be the same, or the cell can be sorted with a plurality of different substances.

在一些实施方案中，物质经由分选酶识别基序连接至包含末端甘氨酸例如N-末端甘氨酸的接头(例如，具有Gly_mX_n的结构的接头，其中Gly_m代表m个甘氨酸，其中m优选为1-5，并且X_n代表n个间隔氨基酸，其中n优选为0-10)，所述接头与细胞的至少一种膜蛋白交联。在一些实施方案中，分选酶识别基序可以选自：LPXTG、LPXAG、LPXSG、LPXLG、LPXVG、LGXTG、LAXTG、LSXTG、NPXTG、MPXTG、IPXTG、SPXTG、VPXTG、YPXRG、LPXTS和LPXTA，其中X是任何氨基酸。在一些实施方案中，分选酶识别基序可以包含位于分选酶识别基序的N-末端至C-末端方向第5位的非天然氨基酸，其中所述非天然氨基酸是任选取代的羟基羧酸，其具有式CH₂OH-(CH₂)_n-COOH，其中n为0至3的整数，优选n＝0。在一些实施方案中，包含非天然氨基酸的分选酶识别基序可以选自：LPXT*Y、LPXA*Y、LPXS*Y、LPXL*Y、LPXV*Y、LGXT*Y、LAXT*Y、LSXT*Y、NPXT*Y、MPXT*Y、IPXT*Y、SPXT*Y、VPXT*Y和YPXR*Y，其中*表示任选取代的羟基羧酸，并且X和Y独立地代表任何氨基酸。在一些实施方案中，包含非天然氨基酸的分选酶识别基序可以选自：LPXT*G、LPXA*G、LPXS*G、LPXL*G、LPXV*G、LGXT*G、LAXT*G、LSXT*G、NPXT*G、MPXT*G、IPXT*G、SPXT*G、VPXT*G、YPXR*G、LPXT*S和LPXT*A，其中M优选为LPET*G，其中*优选为2-羟基乙酸。In some embodiments, the substance is linked to a linker comprising a terminal glycine, such as an N-terminal _glycine , via a sortase recognition motif (e.g., a linker having a structure of _GlymXn , wherein _Glym represents m glycines, wherein m is preferably 1-5, and _Xn represents n spacer amino acids, wherein n is preferably 0-10), which is cross-linked to at least one membrane protein of the cell. In some embodiments, the sortase recognition motif can be selected from the group consisting of: LPXTG, LPXAG, LPXSG, LPXLG, LPXVG, LGXTG, LAXTG, LSXTG, NPXTG, MPXTG, IPXTG, SPXTG, VPXTG, YPXRG, LPXTS, and LPXTA, wherein X is any amino acid. In some embodiments, the sortase recognition motif may comprise an unnatural amino acid at position 5 from the N-terminal to C-terminal direction of the sortase recognition motif, wherein the unnatural amino acid is an optionally substituted hydroxycarboxylic acid having the formula CH ₂ OH-(CH ₂ ) _n -COOH, wherein n is an integer from 0 to 3, preferably n = 0. In some embodiments, the sortase recognition motif comprising an unnatural amino acid may be selected from the group consisting of LPXT*Y, LPXA*Y, LPXS*Y, LPXL*Y, LPXV*Y, LGXT*Y, LAXT*Y, LSXT*Y, NPXT*Y, MPXT*Y, IPXT*Y, SPXT*Y, VPXT*Y, and YPXR*Y, wherein * represents an optionally substituted hydroxycarboxylic acid, and X and Y independently represent any amino acid. In some embodiments, the sortase recognition motif comprising an unnatural amino acid can be selected from the group consisting of: LPXT*G, LPXA*G, LPXS*G, LPXL*G, LPXV*G, LGXT*G, LAXT*G, LSXT*G, NPXT*G, MPXT*G, IPXT*G, SPXT*G, VPXT*G, YPXR*G, LPXT*S and LPXT*A, wherein M is preferably LPET*G and wherein * is preferably 2-hydroxyacetic acid.

可以理解，物质与膜蛋白连接后，从分选酶识别基序的第5位(从N-末端至C-末端的方向)的最后一个或两个残基被其上发生连接的氨基酸取代，如本文别处所述。例如，与膜蛋白连接的物质可以包含A-M-L-Gly_mX_n-M-P的结构，其中L代表分选酶介导的反应后分选酶识别基序的残余部分，并且可以选自：LPXT、LPXA、LPXS、LPXL、LPXV、LGXT、LAXT、LSXT、NPXT、MPXT、IPXT、SPXT、VPXT和YPXR。It is understood that after the substance is linked to the membrane protein, the last one or two residues from the 5th position (in the direction from the N-terminus to the C-terminus) of the sortase recognition motif are replaced by the amino acid to which the linkage occurs, as described elsewhere herein. For example, the substance linked to the membrane protein can comprise the structure of AML- _GlymXn _- MP, wherein L represents the remaining part of the sortase recognition motif after the sortase-mediated reaction, and can be selected from the group consisting of: LPXT, LPXA, LPXS, LPXL, LPXV, LGXT, LAXT, LSXT, NPXT, MPXT, IPXT, SPXT, VPXT and YPXR.

在一些实施方案中，通过使用分选酶将分选酶底物附着或缀合或连接至细胞的膜蛋白来修饰基因工程改造的细胞。例如，细胞(例如，RBC)已经被基因工程改造以表达多种产物中的任一种，例如多肽或非编码RNA，可以被基因工程改造以具有一个或多个基因的至少一部分的缺失，和/或可以被基因工程改造以在一种或多种内源基因的序列中具有一种或多种精确的改变。在某些实施方案中，根据本文描述的方法用本文描述的各种物质中的任一种对此类基因工程改造的细胞进行分选。In some embodiments, genetically engineered cells are modified by attaching or conjugating or connecting a sortase substrate to a membrane protein of the cell using a sortase. For example, cells (e.g., RBCs) have been genetically engineered to express any of a variety of products, such as polypeptides or non-coding RNAs, can be genetically engineered to have a deletion of at least a portion of one or more genes, and/or can be genetically engineered to have one or more precise changes in the sequence of one or more endogenous genes. In certain embodiments, such genetically engineered cells are sorted with any of the various substances described herein according to the methods described herein.

在一些实施方案中，本公开考虑使用自体细胞，例如红细胞，其从个体分离，在体外修饰后，这些分离的细胞被施用至所述个体。在一些实施方案中，本公开考虑使用与要施用此类细胞的个体具有相同血型的免疫相容性的红细胞(例如，至少在ABO血型系统方面，并且在一些实施方案中，在D血型系统方面)，或者使用可以是相容血型的红细胞。In some embodiments, the present disclosure contemplates the use of autologous cells, such as red blood cells, which are isolated from an individual and, after in vitro modification, these isolated cells are administered to the individual. In some embodiments, the present disclosure contemplates the use of red blood cells that have the same immunological compatibility of the blood type as the individual to whom such cells are to be administered (e.g., at least in terms of the ABO blood group system, and in some embodiments, in terms of the D blood group system), or the use of red blood cells that can be of a compatible blood type.

包含末端甘氨酸的接头Linker containing terminal glycine

在一些实施方案中，包含末端甘氨酸例如N-末端甘氨酸的接头(例如，具有Gly_mX_n的结构的接头，其中Gly_m代表m个甘氨酸，m优选为1-5，并且X_n代表n个间隔氨基酸，其中n优选为0-10)经由双功能交联剂连接至细胞的至少一种膜蛋白。In some embodiments, a linker comprising a terminal glycine, such as an N-terminal _glycine (e.g., a linker having a structure of _GlymXn , wherein _Glym represents m glycines, m is preferably 1-5, and _Xn represents n spacer amino acids, wherein n is preferably 0-10) is attached to at least one membrane protein of a cell via a bifunctional cross-linker.

如本文所用，Gly_m是指m个甘氨酸，其中m优选为1-5，例如一、二、三、四或五个甘氨酸。如本文所用，X_n代表n个任选的间隔氨基酸，其可以是任何氨基酸。在一些实施方案中，n可以是0-10或更大，例如0-5、1-4或0、1、2、3、4、5、6、7、8、9或10。在一些实施方案中，间隔氨基酸可以是任何天然或非天然氨基酸，例如Gly、Ala、Ser、Lys、Asn、Thr、Glu或Gln。在一些实施方案中，X_n可以是AAS或AASK。在一些实施方案中，X_n可以是GGS或GGSK。在一些实施方案中，X_n可以是LGS或LGSK。As used herein, Gly _m refers to m glycines, wherein m is preferably 1-5, such as one, two, three, four or five glycines. As used herein, X _n represents n optional spacer amino acids, which can be any amino acid. In some embodiments, n can be 0-10 or greater, such as 0-5, 1-4 or 0, 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10. In some embodiments, the spacer amino acid can be any natural or non-natural amino acid, such as Gly, Ala, Ser, Lys, Asn, Thr, Glu or Gln. In some embodiments, X _n can be AAS or AASK. In some embodiments, X _n can be GGS or GGSK. In some embodiments, X _n can be LGS or LGSK.

如本文所用，术语“双功能交联剂”是指设计用于连接两个反应基团的试剂。如果双功能交联剂被设计为两个反应基团相同，则称为同双功能交联剂；如果它的两个反应基团不同，则它是异双功能交联剂。如果交联剂的一个或两个反应基团仅由于将交联剂暴露于适当波长的光而引起的光化学反应而变得如此，则交联剂是可光活化的、光反应性的、光敏性的或光活化的。本公开考虑使用能够将包含末端甘氨酸的接头与细胞的至少一种膜蛋白交联的各种双功能交联剂。As used herein, the term "bifunctional crosslinker" refers to a reagent designed to link two reactive groups. If a bifunctional crosslinker is designed so that the two reactive groups are the same, it is called a homobifunctional crosslinker; if its two reactive groups are different, it is a heterobifunctional crosslinker. A crosslinker is photoactivatable, photoreactive, photosensitive, or photoactivated if one or both reactive groups of the crosslinker become so only as a result of a photochemical reaction caused by exposing the crosslinker to light of an appropriate wavelength. The present disclosure contemplates the use of various bifunctional crosslinkers capable of crosslinking a linker comprising a terminal glycine to at least one membrane protein of a cell.

在一些实施方案中，双功能交联剂可包括但不限于：(1)零长度型(例如，EDC；EDC加磺基NHS；CMC；DCC；DIC；N,N'-羰基二咪唑；伍德沃德试剂K)；(2)胺-巯基型，例如NHS酯-马来酰亚胺异双功能交联剂；马来酰亚胺羧酸(C_2-8)(例如，6-马来酰亚胺己酸和4-马来酰亚胺丁酸)；EMCS；SPDP、LC-SPDP、磺基-LC-SPDP；SMPT和磺基-LC-SMPT；SMCC、LC-SMCC和磺基-SMCC；MBS和磺基-MBS；SIAB和磺基-SIAB；SMPB和磺基SMPB；GMBS和磺基GMBS；SIAX和SIAXX；SIAC和SIACX；NPIA；(3)同双功能NHS酯型(例如，DSP；DTSSP；DSS；DST和磺基-DST；BSOCOES和磺基-BSOCOES；EGS和磺基-EGS)；(4)同双功能亚胺酯型(例如，DMA；DMP；DMS；DTBP)；(5)羰基-巯基型(例如，KMUH；EMCH；MPBH；M2C2H；PDPH)；(6)巯基反应型(例如，DPDPB；BMH；HBVS)；(7)巯基-羟基型(例如，PMPI)；或类似的。In some embodiments, the bifunctional cross-linking agent may include, but is not limited to: (1) zero-length type (e.g., EDC; EDC plus sulfo-NHS; CMC; DCC; DIC; N,N'-carbonyldiimidazole;Woodward's reagent K); (2) amine-thiol type, such as NHS ester-maleimide heterobifunctional cross-linking agent; maleimidocarboxylic acid (C _2-8 ) (e.g., 6-maleimidocaproic acid and 4-maleimidobutyric acid); EMCS; SPDP, LC-SPDP, sulfo-LC-SPDP; SMPT and sulfo-LC-SMPT; SMCC, LC-SMCC and sulfo-SMCC; MBS and sulfo-MBS; SIAB and sulfo-SIAB; SMPB and sulfoSMPB; GMBS and sulfoGMBS; SIAX and SIAXX; SIAC and SIACX; NPIA; (3) homobifunctional NHS ester type (e.g. , DSP; DTSSP; DSS; DST and sulfo-DST; BSOCOES and sulfo-BSOCOES; EGS and sulfo-EGS); (4) homobifunctional imide ester type (e.g., DMA; DMP; DMS; DTBP); (5) carbonyl-thiol type (e.g., KMUH; EMCH; MPBH; M2C2H; PDPH); (6) thiol-reactive type (e.g., DPDPB; BMH; HBVS); (7) thiol-hydroxyl type (e.g., PMPI); or the like.

在一些实施方案中，双功能交联剂是胺-巯基型，优选马来酰亚胺羧酸(C_2-8),例如，6-马来酰亚胺己酸和4-马来酰亚胺丁酸。在一些另外的实施方案中，包含末端甘氨酸的接头包含至少一个具有侧链氨基的氨基酸，例如赖氨酸，并且优选地，X_n的C-末端氨基酸是具有侧链氨基的氨基酸，其使得能够在侧链氨基和至少一种膜蛋白的至少一个暴露的巯基之间发生交联反应。In some embodiments, the bifunctional cross-linker is an amine-thiol type, preferably a maleimide carboxylic acid (C _2-8 ), for example, 6-maleimidocaproic acid and 4-maleimidobutyric acid. In some other embodiments, the linker comprising a terminal glycine comprises at least one amino acid having a side chain amino group, such as lysine, and preferably, the C-terminal amino acid of _Xn is an amino acid having a side chain amino group, which enables a cross-linking reaction to occur between the side chain amino group and at least one exposed thiol group of at least one membrane protein.

分选酶Sortase

被称为“分选酶”的酶已从多种革兰氏阳性细菌中分离。分选酶、分选酶介导的转酰基反应及其在蛋白质工程改造中的用途是本领域普通技术人员众所周知的(参见，例如，PCT/US2010/000274(WO/2010/087994)和PCT/US2011/033303(WO/2011/133704))。根据革兰氏阳性细菌基因组中61种分选酶的序列比对和系统发育分析，分选酶被分为4类，分别命名为A、B、C和D(Dramsi S,Trieu-Cuot P,Bierne H,Sorting sortases:a nomenclatureproposal for the various sortases of Gram-positive bacteria.Res Microbiol.156(3):289-97,2005)。本领域技术人员可以容易地基于分选酶的序列和/或其他特征将分选酶归入正确的类别，例如上文Drami等人中描述的那些。本文所用的术语“分选酶A”是指A类分选酶，在任何特定细菌物种中通常命名为SrtA，例如来自金黄色葡萄球菌(S.aureus)或者化脓性链球菌(S.pyogenes)的SrtA。Enzymes called "sortases" have been isolated from a variety of Gram-positive bacteria. Sorting enzymes, sortase-mediated transacylation reactions, and their use in protein engineering are well known to those of ordinary skill in the art (see, e.g., PCT/US2010/000274 (WO/2010/087994) and PCT/US2011/033303 (WO/2011/133704)). According to sequence alignment and phylogenetic analysis of 61 sortases in the genome of Gram-positive bacteria, sortases are divided into four categories, named A, B, C, and D (Dramsi S, Trieu-Cuot P, Bierne H, Sorting sortases: a nomenclature proposal for the various sortases of Gram-positive bacteria. Res Microbiol. 156 (3): 289-97, 2005). Those skilled in the art can readily assign sortases to the correct class based on their sequence and/or other characteristics, such as those described in Drami et al., supra. As used herein, the term "sortase A" refers to class A sortases, generally designated SrtA in any particular bacterial species, such as SrtA from Staphylococcus aureus (S. aureus) or Streptococcus pyogenes (S. pyogenes).

术语“分选酶”也称为转酰胺酶，是指具有转酰胺酶活性的酶。分选酶识别包含分选酶识别基序(例如，氨基酸序列LPXTG)的底物。被分选酶识别的分子(即，包含分选酶识别基序)有时在本文中被称为“分选酶底物”。分选酶可以容忍切割位点附近部分的多种改变，因此只要底物含有适当暴露的分选酶识别基序并且有合适的亲核试剂，就可以实现不同实体的多功能缀合。术语“分选酶介导的转酰基反应”、“分选酶催化的转酰基反应”、“分选酶介导的反应”、“分选酶催化的反应”、“分选酶反应”、“分选酶介导的转肽反应”和类似术语在本文中可互换使用，指此类反应。在被转酰胺酶或分选酶识别的序列方面，术语“分选酶识别基序”、“分选酶识别序列”和“转酰胺酶识别序列”在本文中可互换使用。术语“亲核受体序列”是指能够在分选酶催化的反应中充当亲核体的氨基酸序列，例如，包含N-末端甘氨酸(例如，1、2、3、4或5个N-末端甘氨酸)的序列。The term "sortase", also known as transamidase, refers to an enzyme with transamidase activity. The sortase recognizes a substrate comprising a sortase recognition motif (e.g., the amino acid sequence LPXTG). Molecules recognized by the sortase (i.e., comprising the sortase recognition motif) are sometimes referred to herein as "sortase substrates". The sortase can tolerate a variety of changes in the portion near the cleavage site, so as long as the substrate contains a properly exposed sortase recognition motif and has a suitable nucleophilic reagent, multifunctional conjugation of different entities can be achieved. The terms "sortase-mediated transacylation reaction", "sortase-catalyzed transacylation reaction", "sortase-mediated reaction", "sortase-catalyzed reaction", "sortase reaction", "sortase-mediated transpeptidation reaction" and similar terms are used interchangeably herein to refer to such reactions. In terms of sequences recognized by transamidase or sortase, the terms "sortase recognition motif", "sortase recognition sequence" and "transamidase recognition sequence" are used interchangeably herein. The term "nucleophilic acceptor sequence" refers to an amino acid sequence capable of acting as a nucleophile in a reaction catalyzed by a sortase, for example, a sequence comprising an N-terminal glycine (eg, 1, 2, 3, 4 or 5 N-terminal glycines).

本公开涵盖涉及本领域已知的任何分选酶类别(例如，来自任何细菌物种或菌株的分选酶A、B、C或D)的实施方案。在一些实施方案中，使用分选酶A，例如来自金黄色葡萄球菌的SrtA。在一些实施方案中，可以使用两种或更多种分选酶。在一些实施方案中，分选酶可以利用不同的分选酶识别序列和/或不同的亲核受体序列。The present disclosure encompasses embodiments involving any sortase class known in the art (e.g., sortase A, B, C, or D from any bacterial species or strain). In some embodiments, a sortase A is used, such as SrtA from Staphylococcus aureus. In some embodiments, two or more sortases can be used. In some embodiments, the sortases can utilize different sortase recognition sequences and/or different nucleophilic receptor sequences.

在一些实施方案中，分选酶是分选酶A(SrtA)。SrtA识别基序LPXTG，其中常见的识别基序为，例如，LPKTG、LPATG、LPNTG。在一些实施方案中，使用LPETG。然而，也可以识别落入该共有序列之外的基序。例如，在一些实施方案中，基序在第4位包含A、S、L或V而不是T，例如，LPXAG、LPXSG、LPXLG或LPXVG，例如，LPNAG或LPESG、LPELG或LPEVG。在一些实施方案中，基序在第5位包含A而不是G，例如，LPXTA，例如，LPNTA。在一些实施方案中，基序在第2位包含G或A而不是P，例如，LGXTG或LAXTG，例如，LGATG或LAETG。在一些实施方案中，基序在第1位包含I或M而不是L，例如，MPXTG或IPXTG，例如，MPKTG、IPKTG、IPNTG或IPETG。Pishesha等人2018描述了分选酶A的多种识别基序。In some embodiments, the sortase is sortase A (SrtA). SrtA recognizes the motif LPXTG, where common recognition motifs are, for example, LPKTG, LPATG, LPNTG. In some embodiments, LPETG is used. However, motifs falling outside the consensus sequence can also be recognized. For example, in some embodiments, the motif comprises A, S, L, or V instead of T at position 4, for example, LPXAG, LPXSG, LPXLG, or LPXVG, for example, LPNAG or LPESG, LPELG, or LPEVG. In some embodiments, the motif comprises A instead of G at position 5, for example, LPXTA, for example, LPNTA. In some embodiments, the motif comprises G or A instead of P at position 2, for example, LGXTG or LAXTG, for example, LGATG or LAETG. In some embodiments, the motif comprises I or M instead of L at position 1, for example, MPXTG or IPXTG, for example, MPKTG, IPKTG, IPNTG, or IPETG. Pishesha et al. 2018 described multiple recognition motifs for sortase A.

在一些实施方案中，分选酶识别序列是LPXTG，其中X是标准或非标准氨基酸。在一些实施方案中，X选自D、E、A、N、Q、K或R。在一些实施方案中，识别序列选自LPXTG、LPXAG、LPXSG、LPXLG、LPXVG、LGXTG、LAXTG、LSXTG、NPXTG、MPXTG、IPXTG、SPXTG、VPXTG、YPXRG、LPXTS和LPXTA，其中X可以是任何氨基酸，例如在某些实施方案中选自D、E、A、N、Q、K或R的那些氨基酸。In some embodiments, the sortase recognition sequence is LPXTG, wherein X is a standard or non-standard amino acid. In some embodiments, X is selected from D, E, A, N, Q, K, or R. In some embodiments, the recognition sequence is selected from LPXTG, LPXAG, LPXSG, LPXLG, LPXVG, LGXTG, LAXTG, LSXTG, NPXTG, MPXTG, IPXTG, SPXTG, VPXTG, YPXRG, LPXTS, and LPXTA, wherein X can be any amino acid, such as those selected from D, E, A, N, Q, K, or R in certain embodiments.

在一些实施方案中，分选酶可识别包含非天然氨基酸的基序，优选位于分选酶识别基序的N-末端至C-末端方向的第5位。在一些实施方案中，非天然氨基酸是具有式CH₂OH-(CH₂)_n-COOH的取代或未取代的羟基羧酸，其中n为0至5的整数，例如，0、1、2、3、4和5，并且优选n＝0。在一些实施方案中，非天然氨基酸是取代的羟基羧酸，并且在一些另外的实施方案中，羟基羧酸被一个或多个选自卤素、C_1-6烷基、C_1-6卤代烷基、羟基、C_1-6烷氧基和C_1-6卤代烷氧基的取代基取代。术语“卤代”或“卤素”是指氟、氯、溴或碘，优选氟和氯。术语“烷基”本身或作为另一个取代基的一部分是指式C_nH_2n+1的烃基，其中n是大于或等于1的数。在一些实施方案中，可用于本公开的烷基包含1至6个碳原子，优选1至4个碳原子，更优选1至3个碳原子，还更优选1至2个碳原子。烷基可以是直链或支链的，并且可以如本文所示进一步被取代。C_x-y烷基是指包含x至y个碳原子的烷基。合适的烷基包括甲基、乙基、正丙基、异丙基、正丁基、异丁基、仲丁基、叔丁基、戊基及其异构体(例如正戊基、异戊基)和己基及其异构体(例如正己基、异己基)。优选的烷基包括甲基、乙基、正丙基、异丙基、正丁基、异丁基、仲丁基和叔丁基。术语“卤代烷基”单独或组合，是指具有如上定义的含义的烷基，其中一个或多个氢被如上定义的卤素取代。此类卤代烷基的非限制性实例包括氯甲基、1-溴乙基、氟甲基、二氟甲基、三氟甲基、1,1,1-三氟乙基等。In some embodiments, the sortase can recognize a motif comprising a non-natural amino acid, preferably located at position 5 in the N-terminal to C-terminal direction of the sortase recognition motif. In some embodiments, the non-natural amino acid is a substituted or unsubstituted hydroxycarboxylic acid having the formula CH ₂ OH-(CH ₂ ) _n -COOH, wherein n is an integer from 0 to 5, for example, 0, 1, 2, 3, 4 and 5, and preferably n=0. In some embodiments, the non-natural amino acid is a substituted hydroxycarboxylic acid, and in some other embodiments, the hydroxycarboxylic acid is substituted with one or more substituents selected from halogen, C _1-6 alkyl, C _1-6 haloalkyl, hydroxy, C _1-6 alkoxy and C _1-6 haloalkoxy. The term "halo" or "halogen" refers to fluorine, chlorine, bromine or iodine, preferably fluorine and chlorine. The term "alkyl" by itself or as part of another substituent refers to a hydrocarbon group of the formula C _n H _2n+1 , wherein n is a number greater than or equal to 1. In some embodiments, the alkyl group that can be used for the present disclosure comprises 1 to 6 carbon atoms, preferably 1 to 4 carbon atoms, more preferably 1 to 3 carbon atoms, and even more preferably 1 to 2 carbon atoms. The alkyl group can be straight or branched, and can be further substituted as shown herein. _Cxy alkyl refers to an alkyl group comprising x to y carbon atoms. Suitable alkyl groups include methyl, ethyl, n-propyl, isopropyl, n-butyl, isobutyl, sec-butyl, tert-butyl, pentyl and isomers thereof (e.g., n-pentyl, isopentyl) and hexyl and isomers thereof (e.g., n-hexyl, isohexyl). Preferred alkyl groups include methyl, ethyl, n-propyl, isopropyl, n-butyl, isobutyl, sec-butyl and tert-butyl. The term "haloalkyl" refers to an alkyl group having the meaning as defined above, alone or in combination, in which one or more hydrogens are replaced by halogens as defined above. Non-limiting examples of such haloalkyl groups include chloromethyl, 1-bromoethyl, fluoromethyl, difluoromethyl, trifluoromethyl, 1,1,1-trifluoroethyl, etc.

在一些实施方案中，包含非天然氨基酸的分选酶识别基序可以选自：LPXT*Y、LPXA*Y、LPXS*Y、LPXL*Y、LPXV*Y、LGXT*Y、LAXT*Y、LSXT*Y、NPXT*Y、MPXT*Y、IPXT*Y、SPXT*Y、VPXT*Y和YPXR*Y，其中*表示任选取代的羟基羧酸，并且X和Y独立地代表任何氨基酸。在一些实施方案中，包含非天然氨基酸的分选酶识别基序可以选自：LPXT*G、LPXA*G、LPXS*G、LPXL*G、LPXV*G、LGXT*G、LAXT*G、LSXT*G、NPXT*G、MPXT*G、IPXT*G、SPXT*G、VPXT*G、YPXR*G、LPXT*S和LPXT*A，其中M优选为LPET*G，其中*优选为2-羟基乙酸。In some embodiments, the sortase recognition motif comprising an unnatural amino acid may be selected from the group consisting of LPXT*Y, LPXA*Y, LPXS*Y, LPXL*Y, LPXV*Y, LGXT*Y, LAXT*Y, LSXT*Y, NPXT*Y, MPXT*Y, IPXT*Y, SPXT*Y, VPXT*Y, and YPXR*Y, wherein * represents an optionally substituted hydroxycarboxylic acid, and X and Y independently represent any amino acid. In some embodiments, the sortase recognition motif comprising an unnatural amino acid may be selected from the group consisting of LPXT*G, LPXA*G, LPXS*G, LPXL*G, LPXV*G, LGXT*G, LAXT*G, LSXT*G, NPXT*G, MPXT*G, IPXT*G, SPXT*G, VPXT*G, YPXR*G, LPXT*S, and LPXT*A, wherein M is preferably LPET*G, wherein * is preferably 2-hydroxyacetic acid.

在一些实施方案中，本公开考虑使用天然存在的分选酶的变体。在一些实施方案中，变体能够介导甘氨酸_(n)缀合，其中n优选为1或2。此类变体可以通过如定向进化、位点特异性修饰等方法生成。可以获得关于分选酶(例如分选酶A酶)的大量结构信息，包括单独的SrtA或与分选酶识别序列结合的SrtA的NMR或晶体结构(参见，例如，Zong Y等人，J.BiolChem.2004,279,31383-31389)。已确定了金黄色葡萄球菌SrtA的活性位点和底物结合口袋。本领域普通技术人员可以通过例如避免会破坏或实质上改变分选酶的活性位点或底物结合袋的缺失或取代来产生功能变体。在一些实施方案中，SrtA上的定向进化可以通过利用Chen等人Sci.Rep.2016,6(1),31899中描述的基于FRET(荧光共振能量转移)的选择测定法来进行。在一些实施方案中，金黄色葡萄球菌SrtA的功能变体可以是CN106191015A和CN109797194A中描述的那些。在一些实施方案中，金黄色葡萄球菌SrtA变体可以是截短的变体，其中例如，从N-末端去除(与野生型金黄色葡萄球菌SrtA相比)25-60个(例如，30、35、40、45、50、55、59或60个)氨基酸。In some embodiments, the disclosure contemplates the use of naturally occurring variants of sortase. In some embodiments, the variant is capable of mediating glycine _(n) conjugation, wherein n is preferably 1 or 2. Such variants can be generated by methods such as directed evolution, site-specific modification, etc. A large amount of structural information about sortase (e.g., sortase A enzyme) can be obtained, including NMR or crystal structures of SrtA alone or SrtA bound to a sortase recognition sequence (see, e.g., Zong Y et al., J. Biol Chem. 2004, 279, 31383-31389). The active site and substrate binding pocket of Staphylococcus aureus SrtA have been determined. A person of ordinary skill in the art can generate functional variants by, for example, avoiding deletions or substitutions that would destroy or substantially change the active site or substrate binding pocket of the sortase. In some embodiments, directed evolution on SrtA can be performed by utilizing the FRET (fluorescence resonance energy transfer) selection assay described in Chen et al. Sci. Rep. 2016, 6 (1), 31899. In some embodiments, the functional variants of S. aureus SrtA may be those described in CN106191015A and CN109797194A. In some embodiments, the S. aureus SrtA variant may be a truncated variant, wherein, for example, 25-60 (e.g., 30, 35, 40, 45, 50, 55, 59 or 60) amino acids are removed from the N-terminus (compared to wild-type S. aureus SrtA).

在一些实施方案中，可用于本公开的金黄色葡萄球菌SrtA的功能变体可以是在以下氨基酸位置包含一或多个突变的金黄色葡萄球菌SrtA变体：D124、Y187、E189和F200，例如D124G，Y187L、E189R和F200L，并且任选地还包含P94S/R、D160N、D165A、K190E和K196T中的一或多个突变。在某些实施方案中，金黄色葡萄球菌SrtA变体可以包含D124G；D124G和F200L；P94S/R、D124G、D160N、D165A、K190E和K196T；P94S/R、D160N、D165A、Y187L、E189R、K190E和K196T；P94S/R、D124G、D160N、D165A、Y187L、E189R、K190E和K196T；D124G、Y187L、E189R和F200L；或P94S/R、D124G、D160N、D165A、Y187L、E189R、K190E、K196T和F200L。在一些实施方案中，金黄色葡萄球菌SrtA变体从N-末端去除25至60个(例如，25、30、35、40、45、50、55、59或60个)氨基酸。在一些实施方案中，上述突变的氨基酸位置根据野生型金黄色葡萄球菌SrtA的编号(例如，如SEQ ID NO:1所示)进行编号。在一些实施方案中，野生型金黄色葡萄球菌SrtA的全长核苷酸序列如例如，SEQ ID NO:2所示。In some embodiments, a functional variant of S. aureus SrtA useful in the present disclosure may be a S. aureus SrtA variant comprising one or more mutations at the following amino acid positions: D124, Y187, E189, and F200, such as D124G, Y187L, E189R, and F200L, and optionally further comprising one or more mutations in P94S/R, D160N, D165A, K190E, and K196T. In certain embodiments, the S. aureus SrtA variant may comprise D124G; D124G and F200L; P94S/R, D124G, D160N, D165A, K190E, and K196T; P94S/R, D160N, D165A, Y187L, E189R, K190E, and K196T; P94S/R, D124G, D160N, D165A, Y187L, E189R, K190E, and K196T; D124G, Y187L, E189R, and F200L; or P94S/R, D124G, D160N, D165A, Y187L, E189R, K190E, K196T, and F200L. In some embodiments, the S. aureus SrtA variant removes 25 to 60 (e.g., 25, 30, 35, 40, 45, 50, 55, 59 or 60) amino acids from the N-terminus. In some embodiments, the amino acid positions of the mutations are numbered according to the numbering of wild-type S. aureus SrtA (e.g., as shown in SEQ ID NO: 1). In some embodiments, the full-length nucleotide sequence of wild-type S. aureus SrtA is, for example, as shown in SEQ ID NO: 2.

SEQ ID NO:1(全长，GenBank登录号：CAA3829591.1)SEQ ID NO: 1 (full length, GenBank accession number: CAA3829591.1)

1MKKWTNRLMT IAGVVLILVA AYLFSKPHID NYLHDKDKDE KIEQYDKNVK1MKKWTNRLMT IAGVVLILVA AYLFSKPHID NYLHDKDKDE KIEQYDKNVK

51EQASKDKKQQ AKPQIPKDKS KVAGYIEIPD ADIKEPVYPG PATPEQLNRG51EQASKDKKQQ AKPQIPKDKS KVAGYIEIPD ADIKEPVYPG PATPEQLNRG

101VSFAEENESL DDQNISIAGH TFIDRPNYQF TNLKAAKKGS MVYFKVGNET101VSFAEENESLDDQNISIAGHTFIDRPNYQFTNLKAAKKGSMVYFKVGNET

151RKYKMTSIRD VKPTDVGVLD EQKGKDKQLT LITCDDYNEK TGVWEKRKIF151RKYKMTSIRD VKPTDVGVLD EQKGKDKQLT LITCDDYNEK TGVWEKRKIF

201VATEVK201VATEVK

SEQ ID NO:2(全长，野生型)SEQ ID NO: 2 (full length, wild type)

ATGAAAAAATGGACAAATCGATTAATGACAATCGCTGGTGTGGTACTTATCCTAGATGAAAAAATGGACAAATCGATTAATGACAATCGCTGGTGTGGTACTTATCCTAG

TGGCAGCATATTTGTTTGCTAAACCACATATCGATAATTATCTTCACGATAAAGATGGCAGCATATTTGTTTGCTAAACCACATATCGATAATTATCTTCACGATAAAGA

TAAAGATGAAAAGATTGAACAATATGATAAAAATGTAAAAGAACAGGCGAGTATAAAGATGAAAAGATTGAACAATATGATAAAAATGTAAAAGAACAGGCGAGTA

AAGATAAAAAGCAGCAAGCTAAACCTCAAATTCCGAAAGATAAATCGAAAGTGGAAGATAAAAAGCAGCAAGCTAAACCTCAAATTCCGAAAGATAAATCGAAAGTGG

CAGGCTATATTGAAATTCCAGATGCTGATATTAAAGAACCAGTATATCCAGGACCCAGGCTATATTGAAATTCCAGATGCTGATATTAAAGAACCAGTATATCCAGGACC

AGCAACACCTGAACAATTAAATAGAGGTGTAAGCTTTGCAGAAGAAAATGAATCAGCAACACCTGAACAATTAAATAGAGGTGTAAGCTTTGCAGAAGAAAATGAATC

ACTAGATGATCAAAATATTTCAATTGCAGGACACACTTTCATTGACCGTCCGAACACTAGATGATCAAAATATTTCAATTGCAGGACACACTTTCATTGACCGTCCGAAC

TATCAATTTACAAATCTTAAAGCAGCCAAAAAAGGTAGTATGGTGTACTTTAAAGTATCAATTTACAAATCTTAAAGCAGCCAAAAAAGGTAGTATTGGTGTACTTTAAAG

TTGGTAATGAAACACGTAAGTATAAAATGACAAGTATAAGAGATGTTAAGCCTATTGGTAATGAAACACGTAAGTATAAAATGACAAGTATAAGAGATGTTAAGCCTA

CAGATGTAGGAGTTCTAGATGAACAAAAAGGTAAAGATAAACAATTAACATTAACAGATGTAGGAGTTCTAGATGAACAAAAAGGTAAAGATAAACAATTAACATTAA

TTACTTGTGATGATTACAATGAAAAGACAGGCGTTTGGGAAAAACGTAAAATCTTTTACTTGTGATGATTACAATGAAAAGACAGGCGTTTGGGAAAAACGTAAAATCTT

TGTAGCTACAGAAGTCAAATGTAGCTACAGAAGTCAAA

在一些实施方案中，与野生型金黄色葡萄球菌SrtA相比，金黄色葡萄球菌SrtA变体可以在对应于SEQ ID NO:1的第94、105、108、124、160、165、187、189、190、196和200位中的一个或多个位置包含一个或多个突变。在一些实施方案中，与野生型金黄色葡萄球菌SrtA相比，金黄色葡萄球菌SrtA变体可以包含对应于P94S/R、E105K、E108A、D124G、D160N、D165A、Y187L、E189R、K190E、K196T和F200L的一个或多个突变。在一些实施方案中，与野生型金黄色葡萄球菌SrtA相比，金黄色葡萄球菌SrtA变体可以包含对应于D124G、Y187L、E189R和F200L的一个或多个突变，并且任选地还包含对应于P94S/R、D160N、D165A、K190E和K196T的一个或多个突变，并且任选地还包含对应于E105K和E108A的一个或多个突变。在某些实施方案中，与野生型金黄色葡萄球菌SrtA相比，金黄色葡萄球菌SrtA变体可以包含对应于D124G；D124G和F200L；P94S/R、D124G、D160N、D165A、K190E和K196T；P94S/R、D160N、D165A、Y187L、E189R、K190E和K196T；P94S/R、D124G、D160N、D165A、Y187L、E189R、K190E和K196T；D124G、Y187L、E189R和F200L；或P94S/R、D124G、D160N、D165A、Y187L、E189R、K190E、K196T和F200L的突变。在一些实施方案中，金黄色葡萄球菌SrtA变体可以包含相对于SEQID NO:1的P94S/R、E105K、E108A、D124G、D160N、D165A、Y187L、E189R、K190E、K196T和F200L中的一个或多个突变。在一些实施方案中，金黄色葡萄球菌SrtA变体可以包含D124G、Y187L、E189R和F200L，并且任选地进一步包含P94S/R、D160N、D165A、K190E和K196T中的一个或多个突变，并且任选地进一步包含相对于SEQ ID NO:1的E105K和/或E108A。在某些实施方案中，金黄色葡萄球菌SrtA变体相对于SEQ ID NO:1，可以包含：D124G；D124G和F200L；P94S/R、D124G、D160N、D165A、K190E和K196T；P94S/R、D160N、D165A、Y187L、E189R、K190E和K196T；P94S/R、D124G、D160N、D165A、Y187L、E189R、K190E和K196T；D124G、Y187L、E189R和F200L；或P94S/R、D124G、D160N、D165A、Y187L、E189R、K190E、K196T和F200L。在一些实施方案中，突变E105K和/或E108A/Q允许分选酶介导的反应是Ca²⁺非依赖性的。在一些实施方案中，本文所述的金黄色葡萄球菌SrtA变体可以从N-末端去除25-60个(例如，25、30、35、40、45、50、55、56、57、58、59或60个)氨基酸。在一些实施方案中，上述突变的氨基酸位置根据野生型金黄色葡萄球菌SrtA(例如，如SEQ ID NO:1所示)全长的编号进行编号。In some embodiments, compared to wild-type S. aureus SrtA, the S. aureus SrtA variant can comprise one or more mutations at one or more of positions corresponding to positions 94, 105, 108, 124, 160, 165, 187, 189, 190, 196, and 200 of SEQ ID NO: 1. In some embodiments, compared to wild-type S. aureus SrtA, the S. aureus SrtA variant can comprise one or more mutations corresponding to P94S/R, E105K, E108A, D124G, D160N, D165A, Y187L, E189R, K190E, K196T, and F200L. In some embodiments, compared to wild-type S. aureus SrtA, the S. aureus SrtA variant may comprise one or more mutations corresponding to D124G, Y187L, E189R, and F200L, and optionally further comprises one or more mutations corresponding to P94S/R, D160N, D165A, K190E, and K196T, and optionally further comprises one or more mutations corresponding to E105K and E108A. In certain embodiments, compared to wild-type S. aureus SrtA, the S. aureus SrtA variant may comprise one or more mutations corresponding to D124G; D124G and F200L; P94S/R, D124G, D160N, D165A, K190E, and K196T; P94S/R, D160N, D165A, Y187L, E189R, K190E 87L, E189R, K190E, K196T, and F200L. In some embodiments, the SrtA variant of Staphylococcus aureus may comprise one or more mutations of P94S/R, E105K, E108A, D124G, D160N, D165A, Y187L, E189R, K190E, K196T, and F200L relative to SEQ ID NO: 1. In some embodiments, the S. aureus SrtA variant may comprise D124G, Y187L, E189R, and F200L, and optionally further comprises one or more mutations in P94S/R, D160N, D165A, K190E, and K196T, and optionally further comprises E105K and/or E108A relative to SEQ ID NO: 1. In certain embodiments, the S. aureus SrtA variant may comprise D124G, Y187L, E189R, and F200L, and optionally further comprises one or more mutations in P94S/R, D160N, D165A, K190E, and K196T, and optionally further comprises E105K and/or E108A relative to SEQ ID NO: 1. NO: 1, may contain: D124G; D124G and F200L; P94S/R, D124G, D160N, D165A, K190E and K196T; P94S/R, D160N, D165A, Y187L, E189R, K190E and K196T; P94S/R, D124G, D160N, D165A, Y187L, E189R, K190E and K196T; D124G, Y187L, E189R and F200L; or P94S/R, D124G, D160N, D165A, Y187L, E189R, K190E, K196T and F200L. In some embodiments, mutations E105K and/or E108A/Q allow the sortase-mediated reaction to be Ca2 ⁺ -independent. In some embodiments, the SrtA variants described herein can remove 25-60 (e.g., 25, 30, 35, 40, 45, 50, 55, 56, 57, 58, 59, or 60) amino acids from the N-terminus. In some embodiments, the amino acid positions of the mutations described above are numbered according to the numbering of the full length of wild-type SrtA (e.g., as shown in SEQ ID NO: 1).

在一些实施方案中，可用于本公开的金黄色葡萄球菌SrtA的功能变体可以是包含P94S/R、E105K、E108A/Q、D124G、D160N、D165A、Y187L、E189R、K190E、K196T和F200L中的一个或多个突变的金黄色葡萄球菌SrtA变体。在某些实施方案中，金黄色葡萄球菌SrtA变体可以包含P94S/R、E105K、E108Q、D124G、D160N、D165A、Y187L、E189R、K190E、K196T和F200L；或P94S/R、E105K、E108A、D124G、D160N、D165A、Y187L、E189R、K190E、K196T和F200L。在一些实施方案中，金黄色葡萄球菌SrtA变体可以包含相对于SEQ ID NO:1的P94S/R、E105K、E108A/Q、D124G、D160N、D165A、Y187L、E189R、K190E、K196T和F200L中的一个或多个突变。在某些实施方案中，金黄色葡萄球菌SrtA变体可以包含相对于SEQ ID NO:1的P94S/R、E105K、E108Q、D124G、D160N、D165A、Y187L、E189R、K190E、K196T和F200L；或相对于SEQ ID NO:1的P94S/R、E105K、E108A、D124G、D160N、D165A、Y187L、E189R、K190E、K196T和F200L。在一些实施方案中，金黄色葡萄球菌SrtA变体从N-末端去除25-60个(例如，25、30、35、40、45、50、55、56、57、58、59或60个)氨基酸。在一些实施方案中，上述突变的氨基酸位置根据野生型金黄色葡萄球菌SrtA(例如，如SEQ ID NO:1所示)的编号进行编号。In some embodiments, a functional variant of S. aureus SrtA useful in the present disclosure may be a S. aureus SrtA variant comprising one or more mutations of P94S/R, E105K, E108A/Q, D124G, D160N, D165A, Y187L, E189R, K190E, K196T, and F200L. In certain embodiments, the S. aureus SrtA variant may comprise P94S/R, E105K, E108Q, D124G, D160N, D165A, Y187L, E189R, K190E, K196T, and F200L; or P94S/R, E105K, E108A, D124G, D160N, D165A, Y187L, E189R, K190E, K196T, and F200L. In some embodiments, the S. aureus SrtA variant may comprise one or more mutations relative to P94S/R, E105K, E108A/Q, D124G, D160N, D165A, Y187L, E189R, K190E, K196T, and F200L of SEQ ID NO: 1. In certain embodiments, the S. aureus SrtA variant can comprise P94S/R, E105K, E108Q, D124G, D160N, D165A, Y187L, E189R, K190E, K196T, and F200L relative to SEQ ID NO: 1; or P94S/R, E105K, E108A, D124G, D160N, D165A, Y187L, E189R, K190E, K196T, and F200L relative to SEQ ID NO: 1. In some embodiments, the S. aureus SrtA variant has 25-60 (e.g., 25, 30, 35, 40, 45, 50, 55, 56, 57, 58, 59, or 60) amino acids removed from the N-terminus. In some embodiments, the amino acid positions of the mutations are numbered according to the numbering of wild-type S. aureus SrtA (eg, as shown in SEQ ID NO: 1).

在一些实施方案中，本公开涵盖金黄色葡萄球菌SrtA变体(mg SrtA)，其包含与如SEQ ID No:3中所示的氨基酸序列具有至少60％(例如，至少65％、70％、75％、80％、85％、90％、91％、92％、93％、94％、95％、96％、97％、98％、99％、99.5％、99.9％或更高的)同一性的氨基酸序列，或基本上由其组成，或由其组成。在一些实施方案中，SEQ ID NO:3是截短的SrtA，并且对应于野生型SrtA的突变在下面以粗体和下划线显示。在一些实施方案中，SrtA变体包含与如SEQ ID No:3中所示的氨基酸序列具有至少60％(例如，至少65％、70％、75％、80％、85％、90％、91％、92％、93％、94％、95％、96％、97％、98％、99％、99.5％、99.9％或更高的)同一性的氨基酸序列，或基本上由其组成，或由其组成，并且所述SrtA变体包含P94R/S、D124G、D160N、D165A、Y187L、E189R、K190E、K196T和F200L的突变，以及任选地包含E105K和/或E108A/Q(根据SEQ ID NO:1的编号进行编号)。In some embodiments, the present disclosure encompasses a S. aureus SrtA variant (mg SrtA) comprising, consisting essentially of, or consisting of an amino acid sequence having at least 60% (e.g., at least 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5%, 99.9% or more) identity to the amino acid sequence as shown in SEQ ID No: 3. In some embodiments, SEQ ID NO: 3 is a truncated SrtA, and mutations corresponding to wild-type SrtA are shown below in bold and underlined. In some embodiments, the SrtA variant comprises, consists essentially of, or consists of an amino acid sequence having at least 60% (e.g., at least 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5%, 99.9% or more) identity to the amino acid sequence as shown in SEQ ID No:3, and the SrtA variant comprises mutations of P94R/S, D124G, D160N, D165A, Y187L, E189R, K190E, K196T and F200L, and optionally comprises E105K and/or E108A/Q (numbering according to the numbering of SEQ ID NO:1).

SEQ ID NO:3(以粗体和下划线显示突变)SEQ ID NO: 3 (mutations shown in bold and underlined)

在一些实施方案中，本公开提供了编码金黄色葡萄球菌SrtA变体的核酸，在一些实施方案中，所述核酸在SEQ ID NO:4中显示。In some embodiments, the disclosure provides a nucleic acid encoding a S. aureus SrtA variant, which in some embodiments is shown in SEQ ID NO:4.

SEQ ID NO:4SEQ ID NO:4

AAACCACATATCGATAATTATCTTCACGATAAAGATAAAGATGAAAAGATTGAAAAACCACATATCGATAATTATCTTCACGATAAAGATAAAGATGAAAAGATTGAA

CAATATGATAAAAATGTAAAAGAACAGGCGAGTAAAGATAAAAAGCAGCAAGCCAATATGATAAAAATGTAAAAGAACAGGCGAGTAAAGATAAAAAGCAGCAAGC

TAAACCTCAAATTCCGAAAGATAAATCGAAAGTGGCAGGCTATATTGAAATTCCTAAACCTCAAATTCCGAAAGATAAATCGAAAGTGGCAGGCTATATTGAAATTCC

AGATGCTGATATTAAAGAACCAGTATATCCAGGACCAGCAACACGTGAACAATTAGATGCTGATATTAAAGAACCAGTATATCCAGGACCAGCAACACGTGAACAATT

AAATAGAGGTGTAAGCTTTGCAGAAGAAAATGAATCACTAGATGATCAAAATATAAATAGAGGTGTAAGCTTTGCAGAAGAAAATGAATCACTAGATGATCAAAATAT

TTCAATTGCAGGACACACTTTCATTGGCCGTCCGAACTATCAATTTACAAATCTTTTCAATTGCAGGACACACTTTTCATTGGCCGTCCGAACTATCAATTTACAAATCTT

AAAGCAGCCAAAAAAGGTAGTATGGTGTACTTTAAAGTTGGTAATGAAACACGTAAAGCAGCCAAAAAAGGTAGTATGGGTACTTTAAAGTTGGTAATGAAACACGT

AAGTATAAAATGACAAGTATAAGAAATGTTAAGCCTACAGCTGTAGGAGTTCTAAAGTATAAAATGACAAGTATAAGAAATGTTAAGCCTACAGCTGTAGGAGTTCTA

GATGAACAAAAAGGTAAAGATAAACAATTAACATTAATTACTTGTGATGATCTTGATGAACAAAAAGGTAAAGATAAACAATTAACATTAATTACTTGTGATGATCTT

AATCGGGAGACAGGCGTTTGGGAAACACGTAAAATCTTGGTAGCTACAGAAGTCAATCGGGAGACAGGCGTTTGGGAAACACGTAAAATCTTGGTAGCTACAGAAGTC

AAAAAA

在一些实施方案中，金黄色葡萄球菌SrtA变体可以是截短的变体，其中例如，从N-末端去除(与野生型金黄色葡萄球菌SrtA相比)25-60个(例如，30、35、40、45、50、55、59或60个)氨基酸。在一些实施方案中，截短的变体包含与如SEQ ID No:5或7中所示的氨基酸序列具有至少60％(例如，至少65％、70％、75％、80％、85％、90％、91％、92％、93％、94％、95％、96％、97％、98％、99％、99.5％、99.9％或更高，例如100％的)同一性的氨基酸序列，或基本上由其组成，或由其组成。编码SEQ ID NOs:5和7的核酸在以下的SEQ ID NOs:6和8中显示。In some embodiments, the S. aureus SrtA variant can be a truncated variant, wherein, for example, 25-60 (e.g., 30, 35, 40, 45, 50, 55, 59 or 60) amino acids are removed from the N-terminus (compared to wild-type S. aureus SrtA). In some embodiments, the truncated variant comprises, or consists essentially of, or consists of, an amino acid sequence having at least 60% (e.g., at least 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 99.5%, 99.9% or more, such as 100%) identity to the amino acid sequence as shown in SEQ ID No: 5 or 7. Nucleic acids encoding SEQ ID NOs: 5 and 7 are shown in SEQ ID NOs: 6 and 8 below.

SEQ ID NO:5(与wt SrtA相比的突变以粗体和下划线显示)SEQ ID NO: 5 (mutations compared to wt SrtA are shown in bold and underlined)

SEQ ID NO:6SEQ ID NO:6

CAAGCTAAACCTCAAATTCCGAAAGATAAATCGAAAGTGGCAGGCTATATTGAACAAGCTAAACCTCAAATTCCGAAAGATAAATCGAAAGTGGCAGGCTATATTGAA

ATTCCAGATGCTGATATTAAAGAACCAGTATATCCAGGACCAGCAACACGTGAAATTCCAGATGCTGATATTAAAGAACCAGTATATCCAGGACCAGCAACACGTGAA

CAATTAAATAGAGGTGTAAGCTTTGCAGAAGAAAATGAATCACTAGATGATCAACAATTAAATAGAGGTGTAAGCTTTGCAGAAGAAAATGAATCACTAGATGATCAA

AATATTTCAATTGCAGGACACACTTTCATTGGCCGTCCGAACTATCAATTTACAAAATATTTCAATTGCAGGACACACTTTCATTGGCCGTCCGAACTATCAATTTACAA

ATCTTAAAGCAGCCAAAAAAGGTAGTATGGTGTACTTTAAAGTTGGTAATGAAAATCTTAAAGCAGCCAAAAAAGGTAGTATGTGTGTACTTTAAAGTTGGTAATGAAA

CACGTAAGTATAAAATGACAAGTATAAGAAATGTTAAGCCTACAGCTGTAGGAGCACGTAAGTATAAAATGACAAGTATAAGAAATGTTAAGCCTACAGCTGTAGGAG

TTCTAGATGAACAAAAAGGTAAAGATAAACAATTAACATTAATTACTTGTGATGTTCTAGATGAACAAAAAGGTAAAGATAAACAATTAACATTAATTACTTGTGATG

ATCTTAATCGGGAGACAGGCGTTTGGGAAACACGTAAAATCTTGGTAGCTACAGATCTTAATCGGGAGACAGGCGTTTGGGAAACACGTAAAATCTTGGTAGCTACAG

AAGTCAAAAAGTCAAA

SEQ ID NO:7(与wt SrtA相比的突变以粗体和下划线显示)SEQ ID NO:7 (mutations compared to wt SrtA are shown in bold and underlined)

SEQ ID NO:8SEQ ID NO:8

ATGCAAGCTAAACCTCAAATTCCGAAAGATAAATCGAAAGTGGCAGGCTATATTATGCAAGCTAAACCTCAAATTCCGAAAGATAAATCGAAAGTGGCAGGCTATATT

GAAATTCCAGATGCTGATATTAAAGAACCAGTATATCCAGGACCAGCAACACGTGAAATTCCAGATGCTGATATTAAAGAACCAGTATATCCAGGACCAGCAACACGT

GAACAATTAAATAGAGGTGTAAGCTTTGCAGAAGAAAATGAATCACTAGATGATGAACAATTAAATAGAGGTGTAAGCTTTGCAGAAGAAAATGAATCACTAGATGAT

CAAAATATTTCAATTGCAGGACACACTTTCATTGGCCGTCCGAACTATCAATTTACAAAATATTTCAATTGCAGGACACACTTTCATTGGCCGTCCGAACTATCAATTTA

CAAATCTTAAAGCAGCCAAAAAAGGTAGTATGGTGTACTTTAAAGTTGGTAATGCAAATCTTAAAGCAGCCAAAAAAGGTAGTATGTGTGTACTTTAAAGTTGGTAATG

AAACACGTAAGTATAAAATGACAAGTATAAGAAATGTTAAGCCTACAGCTGTAGAAACACGTAAGTATAAAATGACAAGTATAAGAAATGTTAAGCCTACAGCTGTAG

GAGTTCTAGATGAACAAAAAGGTAAAGATAAACAATTAACATTAATTACTTGTGGAGTTCTAGATGAACAAAAAGGTAAAGATAAACAATTAACATTAATTACTTGTG

ATGATCTTAATCGGGAGACAGGCGTTTGGGAAACACGTAAAATCTTGGTAGCTAATGATCTTAATCGGGAGACAGGCGTTTGGGAAACACGTAAAATCTTGGTAGCTA

CAGAAGTCAAACAGAAGTCAAA

在一些实施方案中，分选酶A变体可包含以下任何一个或多个：第94位的S残基(S94)或第94位的R残基(R94)、第105位的K残基(K105)、第108位的A残基(A108)、第108位的Q残基(Q108)、第124位的G残基(G124)、第160位的N残基(N160)、第165位的A残基(A165)、第189位的R残基(R189)、第190位的E残基(E190)、第196位的T残基(T196)、以及第200位的L残基(L200)(根据野生型SrtA，例如，SEQ ID NO:1的编号进行编号)，任选地从野生型金黄色葡萄球菌SrtA的N-末端去除约25-60个(例如，25、30、35、40、45、50、55、56、57、58、59或60个)氨基酸。例如，在一些实施方案中，分选酶A变体相对于野生型金黄色葡萄球菌SrtA(例如，SEQ ID NO:1)包含两、三、四或五个前述突变。在一些实施方案中，相对于野生型金黄色葡萄球菌SrtA(例如，SEQ ID NO:1)，分选酶A变体包含第94位的S残基(S94)或第94位的R残基(R94)、以及第160位的N残基(N160)、第165位的A残基(A165)、以及第196位的T残基(T196)。例如，在一些实施方案中，相对于野生型金黄色葡萄球菌SrtA(例如，SEQ ID NO:1)，分选酶A变体包含P94S或P94R，以及D160N、D165A和K196T。在一些实施方案中，相对于野生型金黄色葡萄球菌SrtA(例如，SEQ ID NO:1)，分选酶A变体包含第94位的S残基(S94)或第94位的R残基(94)、以及第160位的N残基(N160)、第165位的A残基(A165)、第190位的E残基、以及第196位的T残基。例如，在一些实施方案中，相对于野生型金黄色葡萄球菌SrtA(例如，SEQ ID NO:1)，分选酶A变体包含P94S或P94R，以及D160N、D165A、K190E和K196T。在一些实施方案中，相对于野生型金黄色葡萄球菌SrtA(例如，SEQ ID NO:1)，分选酶A变体包含第94位的R残基(R94)、第160位的N残基(N160)、第165位的A残基(A165)、第190位的E残基、以及第196位的T残基。在一些实施方案中，相对于野生型金黄色葡萄球菌SrtA(例如，SEQ IDNO:1)，分选酶A变体包含P94R、D160N、D165A、K190E和K196T。在一些实施方案中，金黄色葡萄球菌SrtA变体可以从N-末端去除25-60个(例如，25、30、35、40、45、50、55、56、57、58、59或60个)氨基酸。In some embodiments, the sortase A variant may comprise any one or more of the following: an S residue at position 94 (S94) or an R residue at position 94 (R94), a K residue at position 105 (K105), an A residue at position 108 (A108), a Q residue at position 108 (Q108), a G residue at position 124 (G124), an N residue at position 160 (N160), an A residue at position 165 (A165), an R residue at position 189 (R189), an E residue at position 190 (E190), a T residue at position 196 (T196), and an L residue at position 200 (L200) (according to wild-type SrtA, e.g., SEQ ID NO:1), optionally removing about 25-60 (e.g., 25, 30, 35, 40, 45, 50, 55, 56, 57, 58, 59 or 60) amino acids from the N-terminus of wild-type S. aureus SrtA. For example, in some embodiments, the sortase A variant comprises two, three, four or five of the aforementioned mutations relative to wild-type S. aureus SrtA (e.g., SEQ ID NO:1). In some embodiments, the sortase A variant comprises an S residue at position 94 (S94) or an R residue at position 94 (R94), and an N residue at position 160 (N160), an A residue at position 165 (A165), and a T residue at position 196 (T196) relative to wild-type S. aureus SrtA (e.g., SEQ ID NO:1). For example, in some embodiments, the sortase A variant comprises P94S or P94R, and D160N, D165A, and K196T relative to wild-type S. aureus SrtA (e.g., SEQ ID NO: 1). In some embodiments, the sortase A variant comprises an S residue at position 94 (S94) or an R residue at position 94 (94), and an N residue at position 160 (N160), an A residue at position 165 (A165), an E residue at position 190, and a T residue at position 196 relative to wild-type S. aureus SrtA (e.g., SEQ ID NO: 1). For example, in some embodiments, the sortase A variant comprises P94S or P94R, and D160N, D165A, K190E, and K196T relative to wild-type S. aureus SrtA (e.g., SEQ ID NO: 1). In some embodiments, relative to wild-type S. aureus SrtA (e.g., SEQ ID NO: 1), the sortase A variant comprises an R residue at position 94 (R94), an N residue at position 160 (N160), an A residue at position 165 (A165), an E residue at position 190, and a T residue at position 196. In some embodiments, relative to wild-type S. aureus SrtA (e.g., SEQ ID NO: 1), the sortase A variant comprises P94R, D160N, D165A, K190E, and K196T. In some embodiments, the S. aureus SrtA variant can remove 25-60 (e.g., 25, 30, 35, 40, 45, 50, 55, 56, 57, 58, 59, or 60) amino acids from the N-terminus.

在一些实施方案中，可以使用比天然存在的分选酶A具有更高的转酰胺酶活性的分选酶A变体。在一些实施方案中，分选酶A变体的活性是野生型金黄色葡萄球菌分选酶A活性的至少约10、15、20、40、60、80、100、120、140、160、180或200倍。在一些实施方案中，此类分选酶变体用于本公开的组合物或方法中。在一些实施方案中，分选酶变体相对于野生型金黄色葡萄球菌SrtA包含任何一种或多种以下取代：P94S/R、E105K、E108A、E108Q、D124G、D160N、D165A、Y187L、E189R、K190E、K196T和F200L突变。在一些实施方案中，SrtA变体可以从N-末端去除25-60个(例如，30、35、40、45、50、55、59或60个)氨基酸。In some embodiments, a sortase A variant having a higher transamidase activity than naturally occurring sortase A can be used. In some embodiments, the activity of the sortase A variant is at least about 10, 15, 20, 40, 60, 80, 100, 120, 140, 160, 180, or 200 times that of wild-type S. aureus sortase A. In some embodiments, such sortase variants are used in the compositions or methods of the present disclosure. In some embodiments, the sortase variant comprises any one or more of the following substitutions relative to wild-type S. aureus SrtA: P94S/R, E105K, E108A, E108Q, D124G, D160N, D165A, Y187L, E189R, K190E, K196T, and F200L mutations. In some embodiments, the SrtA variant can have 25-60 (eg, 30, 35, 40, 45, 50, 55, 59, or 60) amino acids removed from the N-terminus.

在一些实施方案中，氨基酸突变位置通过比对亲本金黄色葡萄球菌SrtA(本文所述的金黄色葡萄球菌SrtA变体源自所述亲本)与SEQ ID No:1的多肽来确定，即，SEQ IDNO:1的多肽用于确定亲本金黄色葡萄球菌SrtA中相应的氨基酸序列。用于确定对应于本文所述的突变位置的氨基酸位置的方法是本领域众所周知的。另一多肽中相应氨基酸残基的鉴定可以通过使用Needleman-Wunsch算法(Needleman and Wunsch,1970,J.Mol.Biol.48:443-453)来确认，如EMBOSS包的Needle程序中所实施的(EMBOSS:The European MolecularBiology Open Software Suite,Rice et al.,2000,Trends Genet.16:276-277)，优选3.0.0或更高版本。基于上述众所周知的计算机程序，确定本文所述的感兴趣多肽的氨基酸位置是本领域技术人员的常规工作。In some embodiments, the amino acid mutation position is determined by comparing the parent Staphylococcus aureus SrtA (the Staphylococcus aureus SrtA variant described herein is derived from the parent) with the polypeptide of SEQ ID No: 1, that is, the polypeptide of SEQ ID NO: 1 is used to determine the corresponding amino acid sequence in the parent Staphylococcus aureus SrtA. Methods for determining the amino acid position corresponding to the mutation position described herein are well known in the art. The identification of the corresponding amino acid residue in another polypeptide can be confirmed by using the Needleman-Wunsch algorithm (Needleman and Wunsch, 1970, J. Mol. Biol. 48: 443-453), as implemented in the Needle program of the EMBOSS package (EMBOSS: The European Molecular Biology Open Software Suite, Rice et al., 2000, Trends Genet. 16: 276-277), preferably 3.0.0 or higher. Based on the above-mentioned well-known computer programs, it is a routine task for those skilled in the art to determine the amino acid position of the polypeptide of interest described herein.

在一些实施方案中，分选酶变体可以进一步包含1、2、3、4、5、6、7、8、9、10、11、12、13、14或15个保守氨基酸突变。保守氨基酸突变基本上不影响蛋白质活性是本领域众所周知的。In some embodiments, the sortase variant may further comprise 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 or 15 conservative amino acid mutations. It is well known in the art that conservative amino acid mutations do not substantially affect protein activity.

使用分选酶，例如，mg SrtA将蛋白质共价标记到细胞上，在科学研究和临床应用中具有广阔的前景。然而，这可能有一定的限制：不同类型的细胞具有不同类型的膜蛋白，并且含有N-末端甘氨酸(例如，mg SrtA的G1)的蛋白质数量也不同。本公开的策略使得使用各种分选酶来用物质修饰各种细胞成为可能。The use of sortases, such as mg SrtA, to covalently label proteins onto cells has broad prospects in scientific research and clinical applications. However, this may have certain limitations: different types of cells have different types of membrane proteins, and the number of proteins containing N-terminal glycine (e.g., G1 of mg SrtA) is also different. The strategy disclosed herein makes it possible to use various sortases to modify various cells with substances.

不可逆的接头Irreversible joints

由于SrtA介导的蛋白质-细胞缀合是可逆反应，为了提高细胞标记的效率，使逆反应的发生最小化将是有益的。提高产物收率的一种解决方案是提高反应底物的浓度，但在实际应用中，大分子蛋白质可能很难达到很高的浓度；即使可以达到高浓度，高成本也可能限制该技术的使用。另一种解决方案是不断地从反应系统中去除产物，使反应不会因平衡而停止，但由于反应是在细胞上进行的，产物分离可能很困难。本申请的发明人令人惊讶地发现，对于细胞标记，可通过引入不是逆反应底物的羟基乙酰基样副产物来防止逆反应，从而使标记反应不可逆。Since SrtA-mediated protein-cell conjugation is a reversible reaction, in order to improve the efficiency of cell labeling, it would be beneficial to minimize the occurrence of reverse reactions. One solution to increase product yield is to increase the concentration of the reaction substrate, but in practical applications, it may be difficult to reach very high concentrations for large molecular proteins; even if high concentrations can be achieved, high costs may limit the use of this technology. Another solution is to continuously remove products from the reaction system so that the reaction does not stop due to equilibrium, but since the reaction is carried out on cells, product separation may be difficult. The inventors of the present application surprisingly found that for cell labeling, the reverse reaction can be prevented by introducing hydroxyacetyl-like byproducts that are not reverse reaction substrates, thereby making the labeling reaction irreversible.

为了获得羟基乙酰基样副产物，本公开考虑使用包含非天然氨基酸(优选位于分选酶识别基序的N-末端至C-末端方向的第5位)的分选酶识别基序。在一些实施方案中，非天然氨基酸是具有式CH₂OH-(CH₂)_n-COOH的取代或未取代的羟基羧酸，其中n为0至5的整数，例如，0、1、2、3、4和5，并且优选n＝0。在一些实施方案中，包含非天然氨基酸的分选酶识别基序可以选自：LPXT*Y、LPXA*Y、LPXS*Y、LPXL*Y、LPXV*Y、LGXT*Y、LAXT*Y、LSXT*Y、NPXT*Y、MPXT*Y、IPXT*Y、SPXT*Y、VPXT*Y和YPXR*Y，其中*表示任选取代的羟基羧酸，并且X和Y独立地代表任何氨基酸。在一些实施方案中，包含非天然氨基酸的分选酶识别基序可以选自：LPXT*G、LPXA*G、LPXS*G、LPXL*G、LPXV*G、LGXT*G、LAXT*G、LSXT*G、NPXT*G、MPXT*G、IPXT*G、SPXT*G、VPXT*G、YPXR*G、LPXT*S和LPXT*A，其中M优选为LPET*G，其中*优选为2-羟基乙酸。在一些实施方案中，Leu-Pro-Glu-Thr-2-羟基乙酸-Gly(LPET-(2-羟基乙酸)-G)用作接头以确保副产物将使反应不可逆。In order to obtain hydroxyacetyl-like byproducts, the present disclosure contemplates the use of a sortase recognition motif comprising an unnatural amino acid, preferably at position 5 in the N-terminal to C-terminal direction of the sortase recognition motif. In some embodiments, the unnatural amino acid is a substituted or unsubstituted hydroxycarboxylic acid having the formula CH ₂ OH-(CH ₂ ) _n -COOH, wherein n is an integer from 0 to 5, e.g., 0, 1, 2, 3, 4, and 5, and preferably n=0. In some embodiments, the sortase recognition motif comprising an unnatural amino acid may be selected from the group consisting of LPXT*Y, LPXA*Y, LPXS*Y, LPXL*Y, LPXV*Y, LGXT*Y, LAXT*Y, LSXT*Y, NPXT*Y, MPXT*Y, IPXT*Y, SPXT*Y, VPXT*Y, and YPXR*Y, wherein * represents an optionally substituted hydroxycarboxylic acid, and X and Y independently represent any amino acid. In some embodiments, the sortase recognition motif comprising an unnatural amino acid can be selected from the group consisting of: LPXT*G, LPXA*G, LPXS*G, LPXL*G, LPXV*G, LGXT*G, LAXT*G, LSXT*G, NPXT*G, MPXT*G, IPXT*G, SPXT*G, VPXT*G, YPXR*G, LPXT*S, and LPXT*A, wherein M is preferably LPET*G, wherein * is preferably 2-hydroxyacetic acid. In some embodiments, Leu-Pro-Glu-Thr-2-hydroxyacetic acid-Gly (LPET-(2-hydroxyacetic acid)-G) is used as a linker to ensure that the byproduct will make the reaction irreversible.

为了将不可逆接头引入物质，在一些实施方案中，作为接头的包含非天然氨基酸的分选酶识别基序是化学合成的并且可以直接缀合至物质,例如蛋白质或多肽。To introduce an irreversible linker into a substance, in some embodiments, a sortase recognition motif comprising an unnatural amino acid as a linker is chemically synthesized and can be directly conjugated to a substance, such as a protein or polypeptide.

在一些实施方案中，包含非天然氨基酸的分选酶识别基序可以通过各种化学手段与物质缀合以产生所需的分选酶底物。这些方法可包括与双功能交联剂或双功能交联物质(例如，NHS酯-马来酰亚胺异双功能交联剂)化学缀合，以将伯胺基团与还原的硫醇基团连接。可以在分选酶识别基序和物质之间形成其它分子融合，例如通过间隔子。如本文所用，在一些实施方案中，术语“间隔子”是指在交联分选酶识别基序和物质后双功能交联剂的残余部分。In some embodiments, the sortase recognition motif comprising an unnatural amino acid can be conjugated to a substance by various chemical means to produce the desired sortase substrate. These methods may include chemical conjugation with a bifunctional crosslinker or a bifunctional crosslinking substance (e.g., NHS ester-maleimide heterobifunctional crosslinker) to connect the primary amine group to the reduced thiol group. Other molecular fusions can be formed between the sortase recognition motif and the substance, for example, through a spacer. As used herein, in some embodiments, the term "spacer" refers to the remaining portion of the bifunctional crosslinker after crosslinking the sortase recognition motif and the substance.

用于连接双功能交联剂或间隔子的各种化学缀合方式可用于本公开，包括但不限于：(1)零长度型(例如，EDC；EDC加磺基NHS；CMC；DCC；DIC；N,N'-羰基二咪唑；伍德沃德试剂K)；(2)胺-巯基型，例如NHS酯-马来酰亚胺异双功能交联剂；马来酰亚胺羧酸(C_2-8)(例如，6-马来酰亚胺己酸和4-马来酰亚胺丁酸)；EMCS；SPDP、LC-SPDP、磺基-LC-SPDP；SMPT和磺基-LC-SMPT；SMCC、LC-SMCC和磺基-SMCC；MBS和磺基-MBS；SIAB和磺基-SIAB；SMPB和磺基SMPB；GMBS和磺基GMBS；SIAX和SIAXX；SIAC和SIACX；NPIA；(3)同双功能NHS酯型(例如，DSP；DTSSP；DSS；DST和磺基-DST；BSOCOES和磺基-BSOCOES；EGS和磺基-EGS)；(4)同双功能亚胺酯型(例如，DMA；DMP；DMS；DTBP)；(5)羰基-巯基型(例如，KMUH；EMCH；MPBH；M2C2H；PDPH)；(6)巯基反应型(例如，DPDPB；BMH；HBVS)；(7)巯基-羟基型(例如，PMPI)；或类似的。Various chemical conjugation methods for connecting bifunctional crosslinkers or spacers can be used in the present disclosure, including but not limited to: (1) zero-length type (e.g., EDC; EDC plus sulfo-NHS; CMC; DCC; DIC; N,N'-carbonyldiimidazole;Woodward's reagent K); (2) amine-thiol type, such as NHS ester-maleimide heterobifunctional crosslinker; maleimidocarboxylic acid ( _C2-8 ) (e.g., 6-maleimidocaproic acid and 4-maleimidobutyric acid); EMCS; SPDP, LC-SPDP, sulfo-LC-SPDP; SMPT and sulfo-LC-SMPT; SMCC, LC-SMCC and sulfo-SMCC; MBS and sulfo-MBS; SIAB and sulfo-SIAB; SMPB and sulfoSMPB; GMBS and sulfoGMBS; SIAX and SIAXX; SIAC and SIACX; NPIA; (3) homobifunctional NHS ester type (e.g. , DSP; DTSSP; DSS; DST and sulfo-DST; BSOCOES and sulfo-BSOCOES; EGS and sulfo-EGS); (4) homobifunctional imide ester type (e.g., DMA; DMP; DMS; DTBP); (5) carbonyl-thiol type (e.g., KMUH; EMCH; MPBH; M2C2H; PDPH); (6) thiol-reactive type (e.g., DPDPB; BMH; HBVS); (7) thiol-hydroxyl type (e.g., PMPI); or the like.

在一些实施方案中，胺-巯基型或者NHS酯-马来酰亚胺异双功能交联剂是可以在本文中使用的优选的间隔子。在某些实施方案中，马来酰亚胺羧酸(C_2-8)异双功能交联剂，例如，6-马来酰亚胺己酸和4-马来酰亚胺丁酸是用于构建所需分选酶底物的特别有用的间隔子。马来酰亚胺羧酸(C_2-8)异双功能交联剂，例如，6-马来酰亚胺己酸和4-马来酰亚胺丁酸可以与暴露的巯基(例如，在暴露的半胱氨酸上)发生Michael加成反应，但该反应不会与未暴露的半胱氨酸发生。在一种实施方案中，将6-马来酰亚胺己酸引入本公开的不可逆接头中，得到6-马来酰亚胺己酸-Leu-Pro-Glu-Thr-2-羟基乙酸-Gly，如图9所示。在一些实施方案中，半胱氨酸残基是或者已经加入到物质的C-末端以提供暴露的半胱氨酸。In some embodiments, amine-sulfhydryl type or NHS ester-maleimide heterobifunctional crosslinker is a preferred spacer that can be used in this article. In certain embodiments, maleimide carboxylic acid (C _2-8 ) heterobifunctional crosslinker, for example, 6-maleimidocaproic acid and 4-maleimidobutyric acid are particularly useful spacers for constructing the desired sortase substrate. Maleimide carboxylic acid (C _2-8 ) heterobifunctional crosslinker, for example, 6-maleimidocaproic acid and 4-maleimidobutyric acid can react with exposed sulfhydryl (for example, on exposed cysteine) Michael addition reaction occurs, but the reaction will not occur with unexposed cysteine. In one embodiment, 6-maleimidocaproic acid is introduced into the irreversible linker of the present disclosure to obtain 6-maleimidocaproic acid-Leu-Pro-Glu-Thr-2-hydroxyacetic acid-Gly, as shown in Figure 9. In some embodiments, a cysteine residue is or has been added to the C-terminus of the substance to provide an exposed cysteine.

通过使用本文所述的间隔子，特别是马来酰亚胺羧酸(C_2-8)异双功能交联剂，例如6-马来酰亚胺己酸和4-马来酰亚胺丁酸，发明人成功设计了具有不同结构的接头，包括双叉、三叉和多叉。根据实际需要，例如为获得多模式治疗，这些不同的接头可用于标记细胞如RBC。在一些实施方案的多叉结构设计中，一个或多个间隔子可以连接至N-末端氨基酸的氨基和/或赖氨酸侧链的氨基，并且相同或不同的物质如蛋白质或多肽可以连接至一个或多个间隔子。这项技术可以进一步扩大细胞标记物质如蛋白质的种类，并提高RBC工程改造的效率。By using the spacers described herein, in particular maleimidocarboxylic acid (C _2-8 ) heterobifunctional crosslinkers, such as 6-maleimidocaproic acid and 4-maleimidobutyric acid, the inventors have successfully designed linkers with different structures, including bifurcated, trifurcated and multifurcated. According to actual needs, for example, to obtain multimodal treatment, these different linkers can be used to label cells such as RBCs. In the multifurcated structure design of some embodiments, one or more spacers can be connected to the amino group of the N-terminal amino acid and/or the amino group of the lysine side chain, and the same or different substances such as proteins or polypeptides can be connected to one or more spacers. This technology can further expand the types of cell marker substances such as proteins and improve the efficiency of RBC engineering.

分选酶底物Sortase substrates

可以容易地设计适合分选酶介导的缀合的底物。分选酶底物可包含分选酶识别基序和物质。例如，可以修饰物质如多肽，以在C-末端处或附近包含分选酶识别基序，从而允许其充当分选酶的底物。分选酶识别基序不需要位于底物的最C-末端，但通常应该足以被酶接近以参与分选酶反应。在一些实施方案中，如果在分选酶识别基序的最N-末端氨基酸(例如，L)和多肽的C-末端氨基酸之间存在不超过5、6、7、8、9或10个氨基酸，则认为分选酶识别基序“靠近”C-末端。包含分选酶识别基序的多肽可以通过向其掺入或连接多种部分(例如，肽、蛋白质、化合物、核酸、脂质、小分子和糖)中的任意来修饰。Substrates suitable for sortase-mediated conjugation can be easily designed. A sortase substrate can comprise a sortase recognition motif and a substance. For example, a substance such as a polypeptide can be modified to comprise a sortase recognition motif at or near the C-terminus, thereby allowing it to act as a substrate for a sortase. The sortase recognition motif need not be located at the very C-terminus of the substrate, but should generally be close enough to be accessible to the enzyme to participate in the sortase reaction. In some embodiments, a sortase recognition motif is considered to be "close to" the C-terminus if there are no more than 5, 6, 7, 8, 9, or 10 amino acids between the most N-terminal amino acid (e.g., L) of the sortase recognition motif and the C-terminal amino acid of the polypeptide. A polypeptide comprising a sortase recognition motif can be modified by incorporating or linking any of a variety of moieties (e.g., peptides, proteins, compounds, nucleic acids, lipids, small molecules, and sugars) thereto.

在一些实施方案中，本公开提供了包含结构A-M-L'的分选酶底物，其中A代表物质，M代表一个或多个任选的间隔子，并且L'代表包含本文所述的非天然氨基酸的分选酶识别基序。在一些实施方案中，一种或多种M选自以下类型的交联剂：(1)零长度型；(2)胺-巯基型；(3)同双功能NHS酯型；(4)同双功能亚胺酯型；(5)羰基-巯基型；(6)巯基反应型；和(7)巯基-羟基型；优选地，一种或多种M是马来酰亚胺羧酸(C_2-8)型异双功能交联剂，例如6-马来酰亚胺己酸和4-马来酰亚胺丁酸，并且物质包含暴露的巯基，优选暴露的半胱氨酸，更优选末端半胱氨酸，最优选C-末端半胱氨酸。在一些实施方案中，当存在两个或更多个间隔子时，与间隔子连接的物质可以相同或不同。In some embodiments, the present disclosure provides a sortase substrate comprising the structure AM-L', wherein A represents a substance, M represents one or more optional spacers, and L' represents a sortase recognition motif comprising a non-natural amino acid as described herein. In some embodiments, one or more M is selected from the following types of cross-linkers: (1) zero length type; (2) amine-sulfhydryl type; (3) homobifunctional NHS ester type; (4) homobifunctional imine ester type; (5) carbonyl-sulfhydryl type; (6) sulfhydryl reactive type; and (7) sulfhydryl-hydroxy type; preferably, one or more M is a maleimidocarboxylic acid (C _2-8 ) type heterobifunctional cross-linker, such as 6-maleimidocaproic acid and 4-maleimidobutyric acid, and the substance comprises an exposed sulfhydryl group, preferably an exposed cysteine, more preferably a terminal cysteine, and most preferably a C-terminal cysteine. In some embodiments, when two or more spacers are present, the substance attached to the spacer can be the same or different.

物质substance

在一些实施方案中，物质包含HPV抗原肽。在一些另外的实施方案中，HPV抗原肽与主要组织相容性复合物Ⅰ类(MHC-I)蛋白结合，如HLA-A*02:01、HLA-A*02:02、HLA-A*02:06、HLA-A*02:07和HLA-A*02:11。In some embodiments, the substance comprises an HPV antigen peptide. In some other embodiments, the HPV antigen peptide binds to a major histocompatibility complex class I (MHC-I) protein, such as HLA-A*02:01, HLA-A*02:02, HLA-A*02:06, HLA-A*02:07, and HLA-A*02:11.

本文所用的术语“抗原肽”是指能够与MHC分子(特别是I类MHC分子)的肽结合槽结合的肽，从而与MHC分子形成MHC复合物。在本技术领域众所周知，抗原肽在APC表面的MHC复合物中的展示通常不涉及，例如，整个蛋白质。相反，位于槽中的抗原肽通常是整个蛋白质的一小部分。在一些实施方案中，抗原肽来源于病原体蛋白，例如病毒蛋白。在一些实施方案中，抗原肽是肿瘤新抗原。在一些实施方案中，术语“抗原肽”包括来源于蛋白肿瘤特异性变化的肿瘤抗原肽。肿瘤抗原肽涵盖但不限于由以下原因产生的肿瘤抗原：例如，蛋白质序列中的取代、移码突变、框架内缺失、插入，以及多肽的肿瘤特异性过表达。术语“肿瘤特异性变化”是指在正常非癌细胞中不存在但在癌或肿瘤细胞中发现的变化。The term "antigenic peptide" as used herein refers to a peptide that can bind to the peptide binding groove of an MHC molecule (particularly a class I MHC molecule), thereby forming an MHC complex with the MHC molecule. It is well known in the art that the display of antigenic peptides in the MHC complex on the surface of an APC does not usually involve, for example, the entire protein. On the contrary, the antigenic peptide located in the groove is usually a small part of the entire protein. In some embodiments, the antigenic peptide is derived from a pathogen protein, such as a viral protein. In some embodiments, the antigenic peptide is a tumor neoantigen. In some embodiments, the term "antigenic peptide" includes tumor antigenic peptides derived from tumor-specific changes in proteins. Tumor antigenic peptides cover but are not limited to tumor antigens produced by the following reasons: for example, substitutions in protein sequences, frameshift mutations, in-frame deletions, insertions, and tumor-specific overexpression of polypeptides. The term "tumor-specific changes" refers to changes that are not present in normal non-cancerous cells but are found in cancer or tumor cells.

在一些实施方案中，所述物质是融合蛋白，其从N-末端到C-末端包含抗原肽、任选的第一肽接头和MHC-I。在一些另外的实施方案中，所述物质是融合蛋白，其从N-末端到C-末端包含抗原肽、任选的第一肽接头、β2-微球蛋白(即，MHC-I轻链)、任选的第二肽接头和MHC-I重链(优选缺失跨膜区和胞质区的MHC-I重链)。在一些实施方案中，第一肽接头和第二肽接头是富含甘氨酸和/或丝氨酸的独立接头。In some embodiments, the substance is a fusion protein, which comprises an antigenic peptide, an optional first peptide linker and MHC-I from N-terminus to C-terminus. In some other embodiments, the substance is a fusion protein, which comprises an antigenic peptide, an optional first peptide linker, β2-microglobulin (i.e., MHC-I light chain), an optional second peptide linker and MHC-I heavy chain (preferably MHC-I heavy chain lacking transmembrane region and cytoplasmic region) from N-terminus to C-terminus. In some embodiments, the first peptide linker and the second peptide linker are independent linkers rich in glycine and/or serine.

本文所述的融合蛋白可以被称为“人工MHC单链分子”，是指包含抗原肽、β2-微球蛋白和MHC I类重链的融合蛋白。在一些文献中，这种融合蛋白也被称为“单链三聚体”。通常，在抗原肽和β2-微球蛋白(即，MHC-I轻链)之间有肽接头，并且在β2-微球蛋白和MHC I类重链之间有肽接头。因此，人工MHC单链分子可以从N-末端到C-末端包含抗原肽、第一肽接头、β2-微球蛋白、第二肽接头和MHC I类重链。在一些结构中，人工MHC单链分子还可以在N端包含信号肽(例如，β2-微球蛋白信号肽)。在一些结构中，人工MHC单链分子中的MHC I类重链部分是缺失跨膜区和胞质区的截短版本。The fusion protein described herein can be referred to as "artificial MHC single-chain molecule", which refers to a fusion protein comprising an antigenic peptide, β2-microglobulin and MHC class I heavy chain. In some documents, this fusion protein is also referred to as a "single-chain trimer". Typically, there is a peptide linker between the antigenic peptide and the β2-microglobulin (i.e., MHC-I light chain), and there is a peptide linker between the β2-microglobulin and the MHC class I heavy chain. Therefore, the artificial MHC single-chain molecule can include an antigenic peptide, a first peptide linker, a β2-microglobulin, a second peptide linker and an MHC class I heavy chain from the N-terminus to the C-terminus. In some structures, the artificial MHC single-chain molecule can also include a signal peptide (e.g., a β2-microglobulin signal peptide) at the N-terminus. In some structures, the MHC class I heavy chain portion in the artificial MHC single-chain molecule is a truncated version of the missing transmembrane region and the cytoplasmic region.

在一些实施方案中，抗原肽是HPV抗原肽。在一些另外的实施方案中，术语“HPV抗原肽”是指来源于HPV(如HPV16或HPV18)蛋白肿瘤特异性变化的肿瘤抗原肽。高危型HPV的两种主要致癌蛋白是E6和E7。"E"的命名表明这两种蛋白是早期蛋白(在HPV生命周期中的早期表达)。HPV基因组包含六个早期(E1、E2、E4、E5、E6和E7)开放阅读框架(ORF)。宿主细胞被感染后，病毒早期启动子被激活，并且转录出包含所有六个早期ORF的多顺反子初级RNA。然后，这种多顺反子RNA通过活性RNA剪接生成多种mRNA的异构体。E2的病毒早期转录启动了病毒E2调控，因为高水平的E2抑制转录。HPV基因组通过破坏E2的ORF整合到宿主基因组中，阻止E2抑制E6和E7。因此，病毒基因组整合到宿主DNA基因组中会增加E6和E7的表达，促进细胞增殖和恶性转化的机会。E6和E7表达的程度与最终可能发展的宫颈病变类型相关。在一些实施方案中，HPV抗原肽是HPV16或HPV18的E6或E7抗原肽。在一些实施方案中，HPV抗原肽是HPV16-E6抗原肽、HPV16-E7抗原肽、HPV18-E6抗原肽或HPV18-E7抗原肽，或其组合。在一些实施方案中，HPV抗原肽包含或由选自如下的序列组成：SEQ ID NO:9(YMLDLQPET)(源自人HPV16-E7)、SEQ ID NO:10(LLMGTLGIV)(源自人HPV16-E7)和SEQ ID NO:11(KCLKFYSKI)(源自小鼠HPV16-E6)、SEQ ID NO:12(FHNIRGRWTGR)、SEQ ID NO:13(PYAVCDKCL)、SEQ ID NO:14(EQQYNKPLC)、SEQ ID NO:15(RFHNIRGR)、SEQ ID NO:16(IRGRWTGRCM)、SEQ ID NO:17(LPQLCTELQT)、SEQ ID NO:18(MHQKRTAM)、SEQ ID NO:19(MFQDPQERPR)、SEQ ID NO:20(NKPLCDLLIRC)、SEQ ID NO:21(LRREVYDFAFR)、SEQ ID NO:22(KLPQLCTEL)、SEQ ID NO:23(TIHDIILECV)、SEQ ID NO:24(TIHDIILEC)、SEQ ID NO:25(IVYRDGNPYA)、SEQ ID NO:26(IVYRDGNPYAV)、SEQ ID NO:27(FQDPQERPRKL)、SEQ ID NO:28(YRDGNPYAV)、SEQ ID NO:29(QLCTELQTTI)、SEQ ID NO:30(KISEYRHYC)、SEQ ID NO:31(QQYNKPLCDLL)、SEQ ID NO:32(TLHEYMLDLQ)、SEQ ID NO:33(PAGQAEPD)、SEQ ID NO:34(FCCKCDST)、SEQ ID NO:35(LRLCVQSTH)、SEQ ID NO:36(CVQSTHVD)、SEQ ID NO:37(NIVTFCCK)、SEQ ID NO:38(VCPICSQK)、SEQ ID NO:39(MLDLQPETTD)、SEQ ID NO:40(CYEQLNDSSE)、SEQ ID NO:41(TLHEYMLDL)、SEQ ID NO:42(TLEDLLMGTL)、SEQ ID NO:43(YMLDLQPETT)、SEQ ID NO:44(RTLEDLLMGTL)、SEQ ID NO:45(MLDLQPETTDL)、SEQ ID NO:46(MLDLQPETT)、SEQ ID NO:47(GTLGIVCPI)、SEQ ID NO:48(TLEDLLMGT)和SEQ ID NO:49(RLCVQSTHV)，或与SEQ ID NOs:9-48中选出的序列有至少85％、至少86％、至少87％、至少88％、至少89％、至少90％、至少91％、至少92％、至少93％、至少94％、至少95％、至少96％、至少97％、至少98％或至少99％序列同一性的氨基酸序列。In some embodiments, the antigenic peptide is an HPV antigenic peptide. In some other embodiments, the term "HPV antigenic peptide" refers to a tumor antigenic peptide derived from a tumor-specific change in an HPV (such as HPV16 or HPV18) protein. The two main oncogenic proteins of high-risk HPV are E6 and E7. The naming of "E" indicates that these two proteins are early proteins (expressed early in the HPV life cycle). The HPV genome contains six early (E1, E2, E4, E5, E6 and E7) open reading frames (ORFs). After the host cell is infected, the viral early promoter is activated and a polycistronic primary RNA containing all six early ORFs is transcribed. Then, this polycistronic RNA generates multiple mRNA isomers by active RNA splicing. The early transcription of E2 initiates viral E2 regulation because high levels of E2 inhibit transcription. The HPV genome is integrated into the host genome by destroying the ORF of E2, preventing E2 from inhibiting E6 and E7. Therefore, the integration of the viral genome into the host DNA genome will increase the expression of E6 and E7, promoting cell proliferation and the chance of malignant transformation. The degree of E6 and E7 expression is related to the type of cervical lesions that may eventually develop. In some embodiments, the HPV antigen peptide is an E6 or E7 antigen peptide of HPV16 or HPV18. In some embodiments, the HPV antigen peptide is an HPV16-E6 antigen peptide, an HPV16-E7 antigen peptide, an HPV18-E6 antigen peptide or an HPV18-E7 antigen peptide, or a combination thereof. In some embodiments, the HPV antigen peptide comprises or consists of a sequence selected from the group consisting of SEQ ID NO: 9 (YMLDLQPET) (derived from human HPV16-E7), SEQ ID NO: 10 (LLMGTLGIV) (derived from human HPV16-E7) and SEQ ID NO: 11 (KCLKFYSKI) (derived from mouse HPV16-E6), SEQ ID NO: 12 (FHNIRGRWTGR), SEQ ID NO: 13 (PYAVCDKCL), SEQ ID NO: 14 (EQQYNKPLC), SEQ ID NO: 15 (RFHNIRGR), SEQ ID NO: 16 (IRGRWTGRCM), SEQ ID NO: 17 (LPQLCTELQT), SEQ ID NO: 18 (MHQKRTAM), SEQ ID NO: 19 (MFQDPQERPR), SEQ ID NO: 20 (NKPLCDLLIRC), SEQ ID NO: 21 (LRREVYDFAFR), SEQ ID NO: 22 (KLPQLCTEL), SEQ ID NO: 23 ( NO:23(TIHDIILECV), SEQ ID NO:24(TIHDIILEC), SEQ ID NO:25(IVYRDGNPYA), SEQ ID NO:26(IVYRDGNPYAV), SEQ ID NO:27(FQDPQERPRKL), SEQ ID NO:28(YRDGNPYAV), SEQ ID NO:29(QLCTELQTTI), SEQ ID NO:30(KISEYRHYC ), SEQ ID NO:31(QQYNKPLCDLL), SEQ ID NO:32(TLHEYMLDLQ), SEQ ID NO:33(PAGQAEPD), SEQ ID NO:34(FCCKCDST), SEQ ID NO:35(LRLCVQSTH), SEQ ID NO:36(CVQSTHVD), SEQ ID NO:37(NIVTFCCK), SEQ ID NO:38(VCPICSQK), SEQ ID NO:39(MLDLQPETTD)、SEQ ID NO:40 (CYEQLNDSSE), SEQ ID NO:41 (TLHEYMLDL), SEQ ID NO:42 (TLEDLLMGTL), SEQ ID NO:43 (YMLDLQPETT), SEQ ID NO:44 (RTLEDLLMGTL), SEQ ID NO:45 (MLDLQPETTDL), SEQ ID NO:46 (MLDLQPETT), SEQ ID NO:47 (GTLGIVCPI), SEQ ID NO:48 (TLEDLLMGT) and SEQ ID NO:49 (RLCVQSTHV), or an amino acid sequence having at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% sequence identity to a sequence selected from SEQ ID NOs:9-48.

一方面，本公开提供了连接有一种或多种HPV抗原肽或其变体的红细胞(RBC)。在一些实施方案中，RBC包含一种以上(例如，两、三、四、五或更多种)多肽，每种多肽至少包含一种HPV抗原肽或其变体。在一些实施方案中，本文所述细胞包含一种以上HPV抗原肽类型，其中每种肽包含一种HPV抗原肽，并且HPV抗原肽并不相同(例如，HPV抗原肽可以包含来自不同HPV的不同致癌蛋白的不同类型抗原肽，或者同一类型HPV抗原肽的变体)。例如，在一些实施方案中，RBC可以包含第一HPV抗原肽或其变体，以及第二HPV抗原肽。On the one hand, the disclosure provides red blood cells (RBC) connected with one or more HPV antigen peptides or variants thereof. In some embodiments, RBC comprises more than one (e.g., two, three, four, five or more) polypeptides, and each polypeptide comprises at least one HPV antigen peptide or variants thereof. In some embodiments, cells described herein comprise more than one HPV antigen peptide type, wherein each peptide comprises one HPV antigen peptide, and the HPV antigen peptides are not the same (e.g., HPV antigen peptides can comprise different types of antigen peptides from different oncoproteins of different HPVs, or variants of the same type of HPV antigen peptides). For example, in some embodiments, RBC can comprise a first HPV antigen peptide or variants thereof, and a second HPV antigen peptide.

如本文所使用的术语“主要组织相容性复合物”或“MHC”是一簇基因，它们在控制负责生理免疫反应的细胞相互作用中发挥作用。MHC分子有两种主要类型，即MHC I类(是指“MHC-I”、“MHC I”或“MHC1”)和MHC II类(是指“MHC-II”、“MHC II”或“MHC2”)。在人类中，MHC复合物也被称为人白细胞抗原(HLA)复合物。在一些实施方案中，MHC复合物是HLA I类。在一些实施方案中，HLA I类可以从HLA A0201、HLA A2402、HLA B07、HLA B18、HLA B35或HLA B44中选择，例如，HLA A0201。在一些实施方案中，HPV抗原肽与MHC I类蛋白，或MHC I类蛋白的等位基因变体结合。在一些实施方案中，与MHC I类蛋白相比，等位基因变体具有多达大约10个(例如，大约1、2、3、4、5、6、7、8、9或10个中的任意个)氨基酸取代。在一些实施方案中，等位基因变体与MHC I类蛋白是同一血清型。在一些实施方案中，等位基因变体与MHC I类蛋白是不同血清型。As used herein, the term "major histocompatibility complex" or "MHC" is a cluster of genes that play a role in controlling cellular interactions responsible for physiological immune responses. There are two main types of MHC molecules, namely MHC class I (referring to "MHC-I", "MHC I" or "MHC1") and MHC class II (referring to "MHC-II", "MHC II" or "MHC2"). In humans, the MHC complex is also called the human leukocyte antigen (HLA) complex. In some embodiments, the MHC complex is HLA class I. In some embodiments, HLA class I can be selected from HLA A0201, HLA A2402, HLA B07, HLA B18, HLA B35 or HLA B44, for example, HLA A0201. In some embodiments, the HPV antigen peptide is bound to an MHC class I protein, or an allelic variant of an MHC class I protein. In some embodiments, the allelic variant has up to about 10 (e.g., about any of 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10) amino acid substitutions compared to the MHC class I protein. In some embodiments, the allelic variant and the MHC class I protein are of the same serotype. In some embodiments, the allelic variant and the MHC class I protein are of different serotypes.

在一些实施方案中，HPV抗原肽与MHC I类蛋白(例如HLA-A02，例如HLA-A02:01、HLA-A02:03(GenBank登录号：AAA03604)、HLA-A02:05(GenBank登录号：AAA03603)、HLA-A02:06(GenBank登录号：CCB78868)、HLA-A02:07(GenBank登录号：ACR55712)和HLA-A02:11(GenBank登录号：CAB56609))结合。In some embodiments, the HPV antigen peptide is bound to an MHC class I protein (e.g., HLA-A02, such as HLA-A02:01, HLA-A02:03 (GenBank Accession No.: AAA03604), HLA-A02:05 (GenBank Accession No.: AAA03603), HLA-A02:06 (GenBank Accession No.: CCB78868), HLA-A02:07 (GenBank Accession No.: ACR55712), and HLA-A02:11 (GenBank Accession No.: CAB56609)).

在一些实施方案中，物质包含HPV抗原肽和MHC-I，并且在一些另外的实施方案中，MHC-I与HPV抗原肽的C端结合。In some embodiments, the substance comprises an HPV antigen peptide and an MHC-I, and in some other embodiments, the MHC-I binds to the C-terminus of the HPV antigen peptide.

在一些实施方案中，HLA-A*02:01的氨基酸序列如下SEQ ID NO:50显示：In some embodiments, the amino acid sequence of HLA-A*02:01 is shown as follows in SEQ ID NO:50:

GSHSMRYFFTSVSRPGRGEPRFIAVGYVDDTQFVRFDSDAASQRMEPRAPWIEQEGPGSHSMRYFFTSVSRPGRGEPRFIAVGYVDDTQFVRFDSDAASQRMEPRAPWIEQEGP

EYWDGETRKVKAHSQTHRVDLGTLRGCYNQSEAGSHTVQRMYGCDVGSDWRFLREYWDGETRKVKAHSQTHRVDLGTLRGCYNQSEAGSHTVQRMYGCDVGSDWRFLR

GYHQYAYDGKDYIALKEDLRSWTAADMAAQTTKHKWEAAHVAEQLRAYLEGTCVGYHQYAYDGKDYIALKEDLRSWTAADMAAQTTKHKWEAAHVAEQLRAYLEGTCV

EWLRRYLENGKETLQRTDAPKTHMTHHAVSDHEATLRCWALSFYPAEITLTWQRDEWLRRYLENGKETLQRTDAPKTHMTHHAVSDHEATLRCWALSFYPAEITLTWQRD

GEDQTTELVETRPAGDGTFQKWAAVVVPSGQEQRYTCHVQHEGLPKPLTLRWEMGEDQTTELVETRPAGDGTFQKWAAVVVPSGQEQRYTCHVQHEGLPKPLTLRWEM

(SEQ ID NO:50)(SEQ ID NO:50)

在一些实施方案中，本发明所用的β2-微球蛋白是人β2-微球蛋白。在一些实施方案中，β2-微球蛋白具有如下SEQ ID NO:51显示的氨基酸序列：In some embodiments, the β2-microglobulin used in the present invention is human β2-microglobulin. In some embodiments, the β2-microglobulin has the amino acid sequence shown in SEQ ID NO: 51 as follows:

IQRTPKIQVYSRHPAENGKSNFLNCYVSGFHPSDIEVDLLKNGERIEKVEHSDLSFSKDWSFYLLYYTEFTPTEKDEYACRVNHVTLSQPKIVKWDRDM(SEQ ID NO:51)IQRTPKIQVYSRHPAENGKSNFLNCYVSGFHPSDIEVDLLKNGERIEKVEHSDLSFSKDWSFYLLYYTEFTPTEKDEYACRVNHVTLSQPKIVKWDRDM(SEQ ID NO:51)

修饰细胞的方法Methods of modifying cells

分选酶可以识别特定的分选酶识别基序，如序列LPXTG，并通过转肽反应将一个蛋白质C-末端的LPXTG与另一蛋白质N-末端的G连接起来。该原理可用于修饰感兴趣的物质，使得该物质可连接至包含末端甘氨酸的接头，该末端甘氨酸已连接至细胞(例如RBC)的至少一种膜蛋白。The sortase can recognize a specific sortase recognition motif, such as the sequence LPXTG, and connect the LPXTG at the C-terminus of one protein to the G at the N-terminus of another protein through a transpeptidation reaction. This principle can be used to modify a substance of interest so that it can be linked to a linker containing a terminal glycine that is already linked to at least one membrane protein of a cell (e.g., RBC).

在一个方面，本公开提供了一种用于修饰细胞的方法，包括：a)提供包含A-M-L'结构的分选酶底物，其中A代表物质，L'代表分选酶识别基序，以及M代表交联后双功能交联剂的残余部分；b)(i)提供具有式Gly_mX_n-M'的肽，其中Gly_m代表m个甘氨酸，m优选为1-5，并且X_n代表n个间隔氨基酸，n优选为0-10；并且M'代表双功能交联剂；(ii)在某一条件下，用Gly_mX_n-M'肽处理细胞，使所述肽与细胞的至少一种膜蛋白连接从而生成Gly_mX_n-M-P，其中M代表交联后双功能交联剂的残余部分，P代表所述细胞的至少一种膜蛋白；和c)在一种或多种适合于分选酶通过分选酶介导的反应将分选酶底物缀合至Gly_m的条件下，将处理的细胞与分选酶底物在分选酶存在下进行接触。a)和b)可以同时，或者a)在b)之前，或者a)在b)之后进行。In one aspect, the present disclosure provides a method for modifying a cell, comprising: a) providing a sortase substrate comprising an AM-L' structure, wherein A represents a substance, L' represents a sortase recognition motif, and M represents a residual portion of a bifunctional cross-linking agent after cross-linking; b) (i) providing a peptide having the formula _GlymXn _- M', wherein _Glym represents m glycines, m is preferably 1-5, and _Xn represents n spacer amino acids, n is preferably 0-10; and M' represents a bifunctional cross-linking agent; (ii) treating the cell with the _GlymXn _- M' peptide under conditions such that the peptide is linked to at least one membrane protein of the cell to generate _GlymXn _- MP, wherein M represents a residual portion of a bifunctional cross-linking agent after cross-linking, and P represents at least one membrane protein of the cell; and c) contacting the treated cell with the sortase substrate in the presence of a sortase under one or more conditions suitable for the sortase to conjugate the sortase substrate to _Glym through a sortase-mediated reaction. a) and b) can be performed simultaneously, or a) can be performed before b), or a) can be performed after b).

在一些实施方案中，当巯基是待使用的双功能交联剂的反应基团之一时，在处理步骤之前，进行用还原剂预处理细胞的步骤以形成或增加暴露的巯基的数量。本公开考虑了各种还原剂，只要它们能够还原表面膜蛋白内或表面膜蛋白之间的二硫键以暴露巯基即可。在一些实施方案中，使用对所处理的细胞的活力没有或几乎没有负面影响的还原剂。在一些实施方案中，可以例如在部分或全部还原条件下使用还原剂，例如三(2-羧乙基)膦盐酸盐(TCEP)或二硫苏糖醇(DTT)或β-巯基乙醇。In some embodiments, when sulfhydryl is one of the reactive groups of the bifunctional cross-linking agent to be used, before the treatment step, the step of pre-treating the cells with a reducing agent is performed to form or increase the number of exposed sulfhydryls. The present disclosure contemplates various reducing agents, as long as they can reduce the disulfide bonds within or between surface membrane proteins to expose sulfhydryls. In some embodiments, a reducing agent having no or almost no negative impact on the viability of the treated cells is used. In some embodiments, a reducing agent such as tris (2-carboxyethyl) phosphine hydrochloride (TCEP) or dithiothreitol (DTT) or β-mercaptoethanol can be used, for example, under partial or complete reducing conditions.

应当理解，本领域普通技术人员能够根据分选酶底物的性质、分选酶类型等选择适合分选酶将分选酶底物连接至包含末端甘氨酸的接头的条件(例如，最佳温度、pH、反应时间、浓度)。It should be understood that one of ordinary skill in the art can select conditions (e.g., optimal temperature, pH, reaction time, concentration) suitable for the sortase to connect the sortase substrate to the linker comprising a terminal glycine according to the properties of the sortase substrate, the type of sortase, etc.

还应当理解，本领域普通技术人员能够选择合适的条件(例如，最佳温度、pH、反应时间、浓度)，用于将包含末端甘氨酸的接头连接至细胞(例如，血细胞，例如RBC)的至少一种膜蛋白。It should also be understood that one of ordinary skill in the art can select appropriate conditions (e.g., optimal temperature, pH, reaction time, concentration) for attaching a linker comprising a terminal glycine to at least one membrane protein of a cell (e.g., a blood cell such as an RBC).

用途use

在一些方面，本公开提供了一种在有需要的受试者中治疗或预防与HPV感染相关的疾病的方法，该方法包含向受试者施用如本文所述的红细胞或组合物。In some aspects, the present disclosure provides a method of treating or preventing a disease associated with HPV infection in a subject in need thereof, the method comprising administering to the subject a red blood cell or composition as described herein.

如本文所用的术语“与HPV感染相关的疾病”是指与HPV感染相关或由HPV感染引起的疾病，例如HPV16或HPV18感染引起的疾病。在实施方案中，与HPV感染相关的疾病是癌症。在一些实施方案中，癌症是，例如，鳞状细胞癌、宫颈癌、肛门癌、阴道癌、外阴癌、阴茎癌、头颈部癌或口咽癌。As used herein, the term "disease associated with HPV infection" refers to a disease associated with or caused by HPV infection, such as a disease caused by infection with HPV16 or HPV18. In embodiments, the disease associated with HPV infection is cancer. In some embodiments, the cancer is, for example, squamous cell carcinoma, cervical cancer, anal cancer, vaginal cancer, vulvar cancer, penile cancer, head and neck cancer, or oropharyngeal cancer.

本文所用的“治疗”是指治疗干预，其至少部分改善、消除或减少病原体相关疾病、病症或适应症开始发生后的症状或病理体征。治疗不需要绝对对受试者有益。有益效果可以使用普通技术人员已知的任何方法或标准来确定。As used herein, "treatment" refers to a therapeutic intervention that at least partially ameliorates, eliminates or reduces symptoms or pathological signs after the onset of a pathogen-related disease, disorder or indication. Treatment need not be absolutely beneficial to the subject. Beneficial effects can be determined using any method or criteria known to those of ordinary skill.

如本文所用，“预防”或“防止”是指在被病原体或其分子组分感染或暴露于病原体或其分子组分之前和/或在疾病、病症或适应症的症状或病理体征发作之前开始的一系列行动，以预防感染和/或减轻症状或病理体征。应当理解，这种预防不需要绝对对受试者有益。“预防性”治疗是对不表现出疾病、病症或适应症的体征或仅表现出早期体征的受试者施用的治疗，其目的是降低出现疾病、病症或适应症的症状或病理体征的风险。As used herein, "prevention" or "prevention" refers to a series of actions that begin before infection or exposure to a pathogen or its molecular components and/or before the onset of symptoms or pathological signs of a disease, disorder or indication to prevent infection and/or alleviate symptoms or pathological signs. It should be understood that such prevention does not need to be absolutely beneficial to the subject. "Preventive" treatment is a treatment administered to a subject who does not show signs of a disease, disorder or indication or only shows early signs, with the purpose of reducing the risk of symptoms or pathological signs of a disease, disorder or indication.

在一些实施方案中，本文所述的方法还包括向受试者施用缀合的红细胞，例如通过注射或输注静脉内直接施用到循环系统中。In some embodiments, the methods described herein further comprise administering the conjugated red blood cells to the subject, such as by intravenous administration directly into the circulatory system via injection or infusion.

在一些实施方案中，受试者在治疗过程中接受单剂量的细胞，或接受多剂量的细胞，例如，2至5、10、20或更多个剂量。在一些实施方案中，剂量或总细胞数可以表示为细胞/公斤。例如，剂量可以是约10³、10⁴、10⁵、10⁶、10⁷、或10⁸个细胞/公斤。In some embodiments, the subject receives a single dose of cells during treatment, or receives multiple doses of cells, for example, 2 to 5, 10, 20 or more doses. In some embodiments, the dose or total number of cells can be expressed as cells/kg. For example, the dose can be about 10 ³ , 10 ⁴ , 10 ⁵ , 10 ⁶ , 10 ⁷ , or 10 ⁸ cells/kg.

在一些实施方案中，治疗过程持续约1周至12个月或更长，例如，1、2、3或4周或2、3、4、5或6个月。在一些实施方案中，可以约每2-4周治疗受试者。In some embodiments, the course of treatment lasts from about 1 week to 12 months or longer, for example, 1, 2, 3 or 4 weeks or 2, 3, 4, 5 or 6 months. In some embodiments, the subject can be treated about every 2-4 weeks.

本领域普通技术人员将理解，细胞数、剂量和/或给药间隔可以基于多种因素来选择，例如受试者的体重和/或血容量、正在治疗的适应症、受试者的反应等。所需的确切细胞数可以因受试者而异，取决于多种因素，例如受试者的物种、年龄、体重、性别和一般状况、疾病或病症的严重程度、具体的细胞、与细胞缀合的物质的成分和活性、施用模式、同时疗法，等等。One of ordinary skill in the art will appreciate that the number of cells, dosage, and/or dosing interval can be selected based on a variety of factors, such as the subject's weight and/or blood volume, the indication being treated, the subject's response, etc. The exact number of cells required may vary from subject to subject, depending on a variety of factors, such as the species, age, weight, sex, and general condition of the subject, the severity of the disease or condition, the specific cells, the composition and activity of the substance conjugated to the cells, the mode of administration, concurrent therapy, and the like.

组合物Composition

另一方面，本公开提供了包含本文所述的修饰的细胞和任选地生理学上可接受的载体的组合物，例如呈药物组合物、递送组合物、诊断组合物或物质盒的形式。In another aspect, the present disclosure provides a composition comprising a modified cell described herein and optionally a physiologically acceptable carrier, for example in the form of a pharmaceutical composition, a delivery composition, a diagnostic composition, or a kit of parts.

在一些实施方案中，组合物可以包含多个细胞，例如血细胞，例如，RBC。在一些实施方案中，组合物中至少选定百分比的细胞被修饰，即，通过本文所述的方法使其与物质缀合。例如，在一些实施方案中，至少5％、10％、15％、20％、25％、30％、40％、50％、60％、70％、80％、90％、95％、96％、97％、98％、99％或更多的细胞具有与其缀合的物质。在一些实施方案中，包括与不同物质缀合的两个或更多个红细胞或红细胞群。In some embodiments, the composition can include a plurality of cells, such as blood cells, for example, RBCs. In some embodiments, at least a selected percentage of cells in the composition are modified, i.e., conjugated to a substance by the methods described herein. For example, in some embodiments, at least 5%, 10%, 15%, 20%, 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or more of the cells have a substance conjugated thereto. In some embodiments, two or more red blood cells or red blood cell populations conjugated to different substances are included.

在一些实施方案中，组合物包含本公开的修饰的细胞，例如血红细胞，其中细胞用任何感兴趣的物质修饰。在一些实施方案中，组合物包含有效量的细胞，例如，多至约10¹⁴个细胞，例如，约10、10²、10³、10⁴、10⁵、5×10⁵、10⁶、5×10⁶、10⁷、5×10⁷、10⁸、5×10⁸、10⁹、5×10⁹、10¹⁰、5×10¹⁰、10¹¹、5×10¹¹、10¹²、5×10¹²、10¹³、5×10¹³或10¹⁴个细胞。在一些实施方案中，细胞数可以在前述数量中的任意两个之间。In some embodiments, the composition comprises modified cells of the present disclosure, such as red blood cells, wherein the cells are modified with any substance of interest. In some embodiments, the composition comprises an effective amount of cells, for example, up to about 10 ¹⁴ cells, for example, about 10, 10 ² , 10 ³ , 10 ⁴ , 10 ⁵ , 5×10 ⁵ , 10 ⁶ , 5×10 ⁶ , 10 ⁷ , 5×10 ⁷ , 10 ⁸ , 5×10 ⁸ , 10 ⁹ , 5×10 ⁹ , 10 ¹⁰ , 5×10 ¹⁰ , 10 ¹¹ , 5×10 ¹¹ , 10 ¹² , 5×10 ¹² , 10 ¹³ , 5×10 ¹³ , or 10 ¹⁴ cells. In some embodiments, the number of cells can be between any two of the aforementioned numbers.

如本文所用，术语“有效量”是指足以实现感兴趣的生物反应或效果(例如，减少疾病或适应症的一种或多种症状或表现，或调节免疫反应)的量。在一些实施方案中，向受试者施用的组合物包含多至约10¹⁴个细胞，例如，约10³、10⁴、10⁵、10⁶、10⁷、10⁸、10⁹、10¹⁰、10¹¹、10¹²、10¹³或10¹⁴个细胞，或任何中间的数或范围。As used herein, the term "effective amount" refers to an amount sufficient to achieve a biological response or effect of interest (e.g., reducing one or more symptoms or manifestations of a disease or indication, or modulating an immune response). In some embodiments, the composition administered to a subject comprises up to about 10 ¹⁴ cells, e.g., about 10 ³ , 10 ⁴ , 10 ⁵ , 10 ⁶ , 10 ⁷ , 10 ⁸ , 10 ⁹ , 10 ¹⁰ , 10 ¹¹ , 10 ¹² , 10 ¹³ , or 10 ¹⁴ cells, or any intermediate number or range.

如本文所用，术语“生理学上可接受的载体”是指可以安全地用于全身施用的固体或液体填充剂、稀释剂或包封物质。根据特定的施用途径，可以使用本领域熟知的多种载体、稀释剂和赋形剂。这些可以选自糖、淀粉、纤维素及其衍生物、麦芽、明胶、滑石、硫酸钙、植物油、合成油、多元醇、海藻酸、磷酸盐缓冲溶液、乳化剂、等渗盐水和盐如无机酸盐，包括盐酸盐、溴化物和硫酸盐，有机酸如乙酸盐、丙酸盐和丙二酸盐、水以及无热原水。As used herein, the term "physiologically acceptable carrier" refers to a solid or liquid filler, diluent or encapsulating material that can be safely used for systemic administration. Depending on the specific route of administration, a variety of carriers, diluents and excipients well known in the art can be used. These can be selected from sugars, starches, cellulose and derivatives thereof, malt, gelatin, talc, calcium sulfate, vegetable oils, synthetic oils, polyols, alginic acid, phosphate buffered solutions, emulsifiers, isotonic saline and salts such as inorganic acid salts, including hydrochlorides, bromides and sulfates, organic acids such as acetates, propionates and malonates, water and pyrogen-free water.

本领域技术人员将理解，在不脱离本发明的范围的情况下，也可以实践本文描述的实施方案的其他变型。因此其他修改是可能的。Those skilled in the art will appreciate that other variations of the embodiments described herein may also be practiced without departing from the scope of the present invention.Other modifications are therefore possible.

尽管已经以具有一定程度的特殊性的示例性形式描述和说明了本公开，但是应当注意，仅通过示例的方式进行了描述和说明。可以对构造和组合的细节以及部件和步骤的布置进行许多更改。因此，这样的更改旨在包括在本发明中，本发明的范围由权利要求书限定。Although the present disclosure has been described and illustrated in an exemplary form with a certain degree of particularity, it should be noted that the description and illustration are provided by way of example only. Many changes may be made to the details of construction and combination and the arrangement of parts and steps. Therefore, such changes are intended to be included in the present invention, and the scope of the present invention is defined by the claims.

实施例Example

方法(见图1) Method (see Figure 1)

1.HPV16-MCH1蛋白的表达和纯化 1. Expression and purification of HPV16-MCH1 protein

将HPV16-MHC cDNA克隆到pcDNA3.1载体中。cDNA和电转缓冲液混合后放入电转槽中。依据制造商(Ettabiotech)针对CHO细胞优化的操作规程将载体电转入CHO细胞。7天后，在4℃下以14000g离心40分钟并使用0.22微米滤膜过滤后，收集所有上清液。HPV16-MHC cDNA was cloned into the pcDNA3.1 vector. The cDNA was mixed with electroporation buffer and placed in an electroporation tank. The vector was electroporated into CHO cells according to the manufacturer's (Ettabiotech) optimized protocol for CHO cells. After 7 days, all supernatants were collected after centrifugation at 14,000 g for 40 minutes at 4°C and filtered through a 0.22 μm filter.

在实施例中生成了四种HPV16-MHC1融合蛋白。In the Examples, four HPV16-MHC1 fusion proteins were generated.

融合蛋白1：HPV16-hMHC1-Fc包含人β2M信号肽-HPV16表位(SEQ ID NO:9)-GS接头-人β2M-GS接头-HLA0201-GS接头-人IgG1Fc-Cys的结构。融合蛋白1的氨基酸序列和DNA序列显示在SEQ ID NO:52和53。Fusion protein 1: HPV16-hMHC1-Fc comprises the structure of human β2M signal peptide-HPV16 epitope (SEQ ID NO: 9)-GS linker-human β2M-GS linker-HLA0201-GS linker-human IgG1Fc-Cys. The amino acid sequence and DNA sequence of fusion protein 1 are shown in SEQ ID NOs: 52 and 53.

融合蛋白2：HPV16-hMHC1-Fc包含人β2M信号肽-HPV16表位(SEQ ID NO:10)-GS接头-人β2M-GS接头-HLA0201-GS接头-人IgG1Fc-Cys的结构。融合蛋白2的氨基酸序列和DNA序列显示在SEQ ID NO:54和55。Fusion protein 2: HPV16-hMHC1-Fc comprises the structure of human β2M signal peptide-HPV16 epitope (SEQ ID NO: 10)-GS linker-human β2M-GS linker-HLA0201-GS linker-human IgG1Fc-Cys. The amino acid sequence and DNA sequence of fusion protein 2 are shown in SEQ ID NOs: 54 and 55.

融合蛋白3：HPV16-mMHC1-His包含小鼠β2M信号肽-HPV16表位(SEQ ID NO:11)-GS接头-小鼠β2M-GS接头-小鼠H2KB-GS接头-His₆-AAC的结构。融合蛋白3的氨基酸序列和DNA序列显示在SEQ ID NO:56和57。Fusion protein 3: HPV16-mMHC1-His comprises the structure of mouse β2M signal peptide-HPV16 epitope (SEQ ID NO: 11)-GS linker-mouse β2M-GS linker-mouse H2KB-GS linker-His ₆ -AAC. The amino acid sequence and DNA sequence of fusion protein 3 are shown in SEQ ID NOs: 56 and 57.

融合蛋白4：HPV16-mMHC1-Fc包含小鼠β2M信号肽-HPV16表位(SEQ ID NO:11)-GS接头-小鼠β2M-GS接头-小鼠H2KB-GS接头-人IgG1 Fc-Cys的结构。融合蛋白4的氨基酸序列和DNA序列显示在SEQ ID NO:58和59。Fusion protein 4: HPV16-mMHC1-Fc comprises the structure of mouse β2M signal peptide-HPV16 epitope (SEQ ID NO: 11)-GS linker-mouse β2M-GS linker-mouse H2KB-GS linker-human IgG1 Fc-Cys. The amino acid sequence and DNA sequence of fusion protein 4 are shown in SEQ ID NO: 58 and 59.

通过离心和微滤与细胞分离后，上清液被上样到带有Ni²⁺离子的IMAC BestaroseFF柱(Bestchrom，中国上海)上，所述柱用结合缓冲液(20mM Tris-HCl，500mM NaCl，pH7.6)平衡。用结合缓冲液洗涤柱，然后用洗脱缓冲液1(20mM Tris-HCl，500mM NaCl，30mM咪唑，pH7.6)洗脱，直至280nm处的UV吸光度稳定。用洗脱缓冲液2(20mM Tris-HCl、500mM NaCl、100mM咪唑、pH7.6)收集蛋白质。After separation from cells by centrifugation and microfiltration, the supernatant was loaded onto an IMAC BestaroseFF column (Bestchrom, Shanghai, China) with Ni ²⁺ ions, which was balanced with binding buffer (20 mM Tris-HCl, 500 mM NaCl, pH 7.6). The column was washed with binding buffer and then eluted with elution buffer 1 (20 mM Tris-HCl, 500 mM NaCl, 30 mM imidazole, pH 7.6) until the UV absorbance at 280 nm was stable. Protein was collected with elution buffer 2 (20 mM Tris-HCl, 500 mM NaCl, 100 mM imidazole, pH 7.6).

然后用ddH₂O(1:1)稀释蛋白质组分，并上样到用结合缓冲液(10mM Tris-HCl，250mM NaCl，pH7.6)平衡的Diamond Mix-A柱(Bestchrom，中国上海)上。用结合缓冲液洗涤并用洗脱缓冲液1(13.3mM Tris-HCl,337.5mM NaCl,pH7.6)洗脱后，用洗脱缓冲液2(20mMTris-HCl,2000mM NaCl,pH7.6)洗脱目标蛋白，然后用Amicon Ultra-15离心过滤装置(Millipore，达姆施塔特，德国)浓缩。The protein fraction was then diluted with ddH ₂ O (1:1) and loaded onto a Diamond Mix-A column (Bestchrom, Shanghai, China) equilibrated with binding buffer (10 mM Tris-HCl, 250 mM NaCl, pH 7.6). After washing with binding buffer and eluting with elution buffer 1 (13.3 mM Tris-HCl, 337.5 mM NaCl, pH 7.6), the target protein was eluted with elution buffer 2 (20 mM Tris-HCl, 2000 mM NaCl, pH 7.6) and then concentrated using an Amicon Ultra-15 centrifugal filter device (Millipore, Darmstadt, Germany).

将浓缩蛋白质上样至用PBS平衡的Chromdex 200pg(Bestchrom，中国上海)，并收集目标蛋白质组分。将蛋白质浓缩并储存在-80℃。The concentrated protein was loaded onto Chromdex 200 pg (Bestchrom, Shanghai, China) equilibrated with PBS, and the target protein fractions were collected. The protein was concentrated and stored at -80°C.

2.通过半胱氨酸偶联将不可逆接头缀合至HPV16-MHC1 2. Conjugation of irreversible linkers to HPV16-MHC1 via cysteine coupling

合成不可逆接头，6-马来酰亚胺己酸-Leu-Pro-Glu-Thr-2-羟基乙酸-Gly(6-马来酰亚胺己酸-LPET-(2-羟基乙酸)-G，6-Mal-LPET*G)，纯度超过99％。反应在PBS缓冲液中以1mL的总体积在室温下进行1小时，同时以10rpm的速度旋转。6-Mal-LPET*G和HPV16-MHC1蛋白的浓度分别是2mM和500μM。该方法使用了相比HPV16-MHC1蛋白两倍摩尔量的不可逆接头。反应后，通过透析和超滤去除多余的不可逆接头，收集HPV16-MHC1-6-mal-LPET*G产物。The irreversible linker, 6-maleimidocaproic acid-Leu-Pro-Glu-Thr-2-hydroxyacetic acid-Gly (6-maleimidocaproic acid-LPET-(2-hydroxyacetic acid)-G, 6-Mal-LPET*G), was synthesized with a purity of more than 99%. The reaction was carried out in a total volume of 1 mL in PBS buffer at room temperature for 1 hour while rotating at a speed of 10 rpm. The concentrations of 6-Mal-LPET*G and HPV16-MHC1 protein were 2 mM and 500 μM, respectively. This method used twice the molar amount of irreversible linker compared to HPV16-MHC1 protein. After the reaction, the excess irreversible linker was removed by dialysis and ultrafiltration, and the HPV16-MHC1-6-mal-LPET*G product was collected.

3.Mg SrtA介导的HPV16-MHC1-6-mal-LEPT*G标记 3. Mg SrtA-mediated HPV16-MHC1-6-mal-LEPT*G labeling

3.1RBC预处理3.1 RBC pretreatment

通过密度梯度离心从外周血中分离红细胞。分离的红细胞用PBS洗涤3次。然后将RBC在室温下用1mM TCEP预处理1小时。然后用PBS洗涤预处理的RBC 3次。合成GGGSK-6-MAL接头(由北京中科亚光生物科技有限公司合成)(见图2)，纯度超过99％。在37℃下，将GGGSK-6-MAL溶解在磷酸盐缓冲液中，终浓度为100μM。然后在37℃用50μMGGGSK-6-MAL预处理1×10⁹RBC 30分钟。然后将预处理后的RBC用PBS洗涤3次。预处理后的RBC需要立即使用，并且不能长时间储存。Red blood cells were separated from peripheral blood by density gradient centrifugation. The separated red blood cells were washed 3 times with PBS. The RBCs were then pretreated with 1 mM TCEP for 1 hour at room temperature. The pretreated RBCs were then washed 3 times with PBS. The GGGSK-6-MAL linker (synthesized by Beijing Zhongke Yaguang Biotechnology Co., Ltd.) (see Figure 2) was synthesized with a purity of over 99%. GGGSK-6-MAL was dissolved in phosphate buffer at 37°C to a final concentration of 100 μM. 1×10 ⁹ RBCs were then pretreated with 50 μM GGGSK-6-MAL at 37°C for 30 minutes. The pretreated RBCs were then washed 3 times with PBS. The pretreated RBCs need to be used immediately and cannot be stored for a long time.

3.2通过mg分选酶介导的反应将GGGSK-6-MAL-RBC与HPV16-MHC1-6-mal-LPET*G缀合3.2 Conjugation of GGGSK-6-MAL-RBC to HPV16-MHC1-6-mal-LPET*G via a Mg-sortase-mediated reaction

反应在PBS缓冲液中以200μL的总体积在室温下进行2小时，同时以10rpm的速度旋转。反应中GGGSK-6-MAL-RBC的浓度为1×10⁹/mL。mg SrtA(SEQ ID NO:5)的浓度是10μM，并且HPV16-MHC1-6-mal-LEPT*G底物的浓度是为25μM。反应后，使用Beckman CoulterCytoFLEX LX分析RBC的标记效率。The reaction was carried out in PBS buffer in a total volume of 200 μL at room temperature for 2 hours while rotating at 10 rpm. The concentration of GGGSK-6-MAL-RBC in the reaction was 1×10 ⁹ /mL. The concentration of mg SrtA (SEQ ID NO: 5) was 10 μM, and the concentration of HPV16-MHC1-6-mal-LEPT*G substrate was 25 μM. After the reaction, the labeling efficiency of RBC was analyzed using Beckman Coulter CytoFLEX LX.

4.干扰素γ(IFNγ)ELISpot法 4. Interferon γ (IFN γ ) ELISpot method

根据制造商的说明，使用人IFN-γELISpot试剂盒(Mabtech)。对于离体ELISpot，从HPV16⁺患者中分离的外周血单个核细胞(PBMC)以每孔2×10⁵个细胞的量进行复孔铺板。修饰的RBC以每孔10⁷个细胞的量进行铺板。HLA-A2限制性HPV16 E6/E7衍生肽以20μg/mL的浓度加入到ELISpot孔中。板在37℃下孵育48小时。通过生物素标记的抗IFNγmAb检测分泌的IFNγ，并使用链霉亲和素-HRP和色素试剂BCIP/NBT进行反应显色。计算特定分泌IFNγ的T细胞数量。The human IFN-γ ELISpot kit (Mabtech) was used according to the manufacturer's instructions. For ex vivo ELISpot, peripheral blood mononuclear cells (PBMCs) isolated from HPV16 ⁺ patients were plated in duplicate at 2×10 ⁵ cells per well. Modified RBCs were plated at 10 ⁷ cells per well. HLA-A2-restricted HPV16 E6/E7-derived peptides were added to the ELISpot wells at a concentration of 20 μg/mL. The plates were incubated at 37°C for 48 hours. Secreted IFNγ was detected by biotin-labeled anti-IFNγ mAb, and the reaction was developed using streptavidin-HRP and the pigment reagent BCIP/NBT. The number of T cells specifically secreting IFNγ was calculated.

5.体外HPV16-MHC1-RBC产生免疫反应的细胞毒性潜力 5. Cytotoxic potential of HPV16-MHC1-RBC immune response in vitro

本研究使用HLA-A*0201阳性、HPV-16阳性的人宫颈癌细胞系和HPV16-MC38细胞系(转导以表达HPV-16的E6、E7致癌基因)。This study used HLA-A*0201-positive, HPV-16-positive human cervical cancer cell line and HPV16-MC38 cell line (transduced to express the E6 and E7 oncogenes of HPV-16).

从ATCC获得的CaSki细胞(CRL-1550TM)整合了HPV-16基因组。来自HPV16⁺患者的外周血单个核细胞(PBMC)与HPV16(YMLDLQPET)-MHC1-RBC(PBMC:工程化RBC＝1:50)在37℃孵育72小时。刺激后，使用CD8⁺T细胞分离试剂盒(Stem cell公司)富集患者PBMC中的CD8⁺T细胞。分离出的CD8⁺T细胞与靶细胞CaSki共培养72小时(效应细胞:靶细胞＝100:1)。通过CD8⁺T细胞表面的4-1BB、CD107a和荧光素缀合的MHC四聚体的表达以及CaSki细胞的存活率来检测由HPV16(YMLDLQPET)-MHC1-RBC产生的免疫反应的细胞毒性潜力。The CaSki cells (CRL-1550TM) obtained from ATCC integrated the HPV-16 genome. Peripheral blood mononuclear cells (PBMCs) from HPV16 ⁺ patients were incubated with HPV16 (YMLDLQPET) -MHC1-RBC (PBMC: engineered RBC = 1: 50) at 37 ° C for 72 hours. After stimulation, CD8 ⁺ T cell isolation kit (Stem cell company) was used to enrich CD8 ⁺ T cells in the patient's PBMC. The isolated CD8 ⁺ T cells were co-cultured with target cells CaSki for 72 hours (effector cells: target cells = 100: 1). The cytotoxic potential of the immune response generated by HPV16 (YMLDLQPET) -MHC1-RBC was detected by the expression of 4-1BB, CD107a and fluorescein-conjugated MHC tetramers on the surface of ^CD8 + T cells and the survival rate of CaSki cells.

来自携带HPV16-MC38肿瘤小鼠的脾细胞与HPV16(KCLKFYSKI)-mMHC1-RBC(脾细胞:工程化RBC＝1:50)在37℃孵育72小时。刺激后，使用CD8⁺T细胞分离试剂盒(Stem cell公司)从脾细胞中富集CD8⁺T细胞。分离出的CD8⁺T细胞与靶细胞HPV16-MC38共培养72小时(效应细胞:靶细胞＝100:1)。通过CD8⁺T细胞表面的CD107a和荧光素缀合的MHC四聚体的表达以及HPV16-MC38细胞的存活率来检测由HPV16(KCLKFYSKI)-mMHC1-RBC产生的免疫反应的细胞毒性潜力。Splenocytes from mice carrying HPV16-MC38 tumors were incubated with HPV16 (KCLKFYSKI)-mMHC1-RBC (splenocytes: engineered RBC = 1:50) at 37 ° C for 72 hours. After stimulation, CD8 ⁺ T cells were enriched from splenocytes using a CD8 ⁺ T cell isolation kit (Stem cell company). The isolated CD8 ⁺ T cells were co-cultured with target cells HPV16-MC38 for 72 hours (effector cells: target cells = 100: 1). The cytotoxic potential of the immune response generated by HPV16 (KCLKFYSKI)-mMHC1-RBC was detected by the expression of CD107a and fluorescein ^- conjugated MHC tetramers on the surface of CD8 + T cells and the survival rate of HPV16-MC38 cells.

6.HPV16-MHC1-RBC在实体瘤模型中的治疗 6. HPV16-MHC1-RBC therapy in solid tumor models

在免疫治疗中，在D0天首先在小鼠左侧后腹部皮下(s.c.)注射10⁵个HPV16-MC38细胞。小鼠被分成2组：(1)对照RBC治疗组；(2)HPV16(KCLKFYSKI)-mMHC1-RBC治疗组。在第1、4、7、10、13、16天施用HPV16(KCLKFYSKI)-mMHC1-RBC。每三天测量体重和肿瘤体积。使用经验式V＝1/2×[(最短直径)²×(最长直径)]计算肿瘤体积。治疗4周后，处死小鼠并称重肿瘤且处理后进行IHC分析。In immunotherapy, 10 ⁵ HPV16-MC38 cells were first injected subcutaneously (sc) in the left posterior abdomen of mice on D0 day. The mice were divided into 2 groups: (1) control RBC treatment group; (2) HPV16 (KCLKFYSKI) -mMHC1-RBC treatment group. HPV16 (KCLKFYSKI) -mMHC1-RBC was administered on days 1, 4, 7, 10, 13, and 16. Body weight and tumor volume were measured every three days. Tumor volume was calculated using the empirical formula V = 1/2 × [(shortest diameter) ² × (longest diameter)]. After 4 weeks of treatment, the mice were sacrificed and the tumors were weighed and processed for IHC analysis.

结果result

标记效率Labeling efficiency

我们表征了mg SrtA介导的HPV16(YMLDLQPET)-hMHC1在RBC膜上的标记效率。通过将标记后的RBC与缀合有PE的抗Fc标签抗体共同孵育并使用流式细胞仪进行分析来检测缀合效率。图3的结果显示，大于99％的天然人RBC在体外通过mg SrtA标记了HPV16(YMLDLQPET)-hMHC1。相反，在没有mg SrtA酶的模拟对照组中，在人RBC表面未检测到显著的Fc标签信号。与未经GGGSK-mal预处理RBC上的mg SrtA介导的HPV16(YMLDLQPET)-hMHC1的效率相比，经预处理后的标记效率也显著更高。We characterized the efficiency of mg SrtA-mediated HPV16(YMLDLQPET)-hMHC1 labeling on RBC membranes. The conjugation efficiency was detected by incubating the labeled RBCs with anti-Fc tag antibodies conjugated with PE and analyzing them using flow cytometry. The results in Figure 3 show that more than 99% of natural human RBCs were labeled with HPV16(YMLDLQPET)-hMHC1 by mg SrtA in vitro. In contrast, in the mock control group without mg SrtA enzyme, no significant Fc tag signal was detected on the surface of human RBCs. Compared with the efficiency of mg SrtA-mediated HPV16(YMLDLQPET)-hMHC1 on RBCs without GGGSK-mal pretreatment, the labeling efficiency after pretreatment was also significantly higher.

免疫原性Immunogenicity

通过IFN-γELISpot法在外周血中评估HPV16(YMLDLQPET)-hMHC1-RBC的免疫原性。HPV16(YMLDLQPET)肽、对照RBC和HPV16(YMLDLQPET)-hMHC1-RBC在3名HPV16⁺宫颈癌患者和2名健康捐献者中分析。如图4所示，在患者中检测到对HPV16(YMLDLQPET)-hMHC1-RBC的外周免疫应答反应，但在健康捐献者的外周血中几乎没有检测到对HPV16(YMLDLQPET)-hMHC1-RBC的免疫应答。总之，导致T细胞应答产生的HPV16(YMLDLQPET)-hMHC1-RBC刺激是强烈且特异性的。The immunogenicity of HPV16(YMLDLQPET)-hMHC1-RBC was evaluated in peripheral blood by IFN-γ ELISpot. HPV16(YMLDLQPET) peptide, control RBC and HPV16(YMLDLQPET)-hMHC1-RBC were analyzed in 3 HPV16 ⁺ cervical cancer patients and 2 healthy donors. As shown in Figure 4, peripheral immune responses to HPV16(YMLDLQPET)-hMHC1-RBC were detected in patients, but almost no immune responses to HPV16(YMLDLQPET)-hMHC1-RBC were detected in the peripheral blood of healthy donors. In summary, the HPV16(YMLDLQPET)-hMHC1-RBC stimulation that led to the generation of T cell responses was strong and specific.

细胞毒性潜力Cytotoxic potential

在HPV16(YMLDLQPET)-hMHC1-RBC存在的条件下，测定4-1BB、CD107a和荧光素缀合的MHC四聚体的表面表达来确定外周血中新抗原特异性CD8⁺T细胞的细胞毒性表型。图5显示与HPV16(YMLDLQPET)-hMHC1-RBC共培养的患者CD8+T细胞的4-1BB、CD107a和MHC四聚体的表面表达。In the presence of HPV16(YMLDLQPET)-hMHC1-RBC, the surface expression of 4-1BB, CD107a and fluorescein-conjugated MHC tetramers was measured to determine the cytotoxic phenotype of neoantigen-specific CD8 ⁺ T cells in peripheral blood. Figure 5 shows the surface expression of 4-1BB, CD107a and MHC tetramers of patient CD8+ T cells co-cultured with HPV16(YMLDLQPET)-hMHC1-RBC.

细胞毒性测定是基于用HPV16(YMLDLQPET)-hMHC1-RBC刺激的与预处理患者PBMC共培养的Caski靶细胞的存活情况。图6的数据表明HPV16(YMLDLQPET)-hMHC1-RBC产生杀死癌细胞的细胞毒性T细胞。The cytotoxicity assay is based on the survival of Caski target cells co-cultured with pre-treated patient PBMCs stimulated with HPV16(YMLDLQPET)-hMHC1-RBCs. The data in Figure 6 show that HPV16(YMLDLQPET)-hMHC1-RBCs generate cytotoxic T cells that kill cancer cells.

细胞毒性测定是基于与来自携带HPV16-MC38肿瘤小鼠的预处理脾细胞共培养的HPV16(KCLKFYSKI)-MC38靶细胞的存活情况。图7的数据表明HPV16(KCLKFYSKI)-mMHC1-RBC产生杀死癌细胞的细胞毒性T细胞。The cytotoxicity assay is based on the survival of HPV16(KCLKFYSKI)-MC38 target cells co-cultured with pre-treated splenocytes from HPV16-MC38 tumor-bearing mice. The data in Figure 7 demonstrate that HPV16(KCLKFYSKI)-mMHC1-RBCs generate cytotoxic T cells that kill cancer cells.

肿瘤生长Tumor growth

在治疗性研究中，如图8所示，接受HPV16(KCLKFYSKI)-mMHC1-RBC反复输注的动物肿瘤生长显著延迟(p＜0.01)。In the therapeutic study, as shown in FIG8 , animals that received repeated infusions of HPV16(KCLKFYSKI)-mMHC1-RBCs had a significant delay in tumor growth (p<0.01).

虽然已经说明和描述了本发明的具体实施方案，但并不旨在这些实施方案说明和描述本发明的所有可能形式。相反地，说明书中使用的词语是描述性的而非限制性的词语，并且可以理解为在不脱离本发明的精神和范围的情况下可以进行各种更改。Although specific embodiments of the present invention have been illustrated and described, it is not intended that these embodiments illustrate and describe all possible forms of the present invention. On the contrary, the words used in the description are descriptive rather than restrictive words, and it is understood that various changes can be made without departing from the spirit and scope of the present invention.

Claims

1. A cell having a substance attached thereto, wherein the substance is attached to at least one membrane protein of the cell by a sortase recognition motif, and the substance attached to the at least one membrane protein comprises the structure: A-M-L-Gly _mX_n -M-P, wherein A represents a substance, L represents the remainder of the sortase recognition motif after sortase-mediated reaction, gly _m represents M glycine, wherein M is preferably 1-5, X _n represents n spacer amino acids, wherein n is preferably 0-10, M represents the remainder of the bifunctional cross-linker after cross-linking, and P represents at least one membrane protein of the cell.

2. The cell according to claim 1, wherein the bifunctional crosslinking reagent is an amine-thiol, preferably maleimide carboxylic acid (C _2-8), such as 6-maleimide hexanoic acid and 4-maleimide butanoic acid.

3. Cell according to claim 1 or 2, wherein X _n comprises at least one amino acid having a side chain amino group, such as lysine, and preferably the C-terminal amino acid of X _n is an amino acid having a side chain amino group.

4. The cell of claim 3, wherein M crosslinks the side chain amino group with at least one exposed thiol group of the at least one membrane protein.

5. The cell of any one of claims 1-4, wherein the sortase recognition motif comprises, or consists essentially of, or consists of an amino acid sequence selected from LPXTG, LPXAG, LPXSG, LPXLG, LPXVG, LGXTG, LAXTG, LSXTG, NPXTG, MPXTG, IPXTG, SPXTG, VPXTG, YPXRG, LPXTS and LPXTA, wherein X is any amino acid.

6. The cell of any one of claims 1-4, wherein the sortase recognition motif comprises an unnatural amino acid at position 5 in the N-terminal to C-terminal direction of the sortase recognition motif, wherein the unnatural amino acid is an optionally substituted hydroxycarboxylic acid having the formula CH ₂OH-(CH₂)_n -COOH, where N is an integer from 0to 3, preferably N = 0.

7. The cell of claim 6, wherein the sortase recognition motif comprises, or consists essentially of, or consists of an amino acid sequence selected from LPXT Y, LPXA, Y, LPXS, Y, LPXL, Y, LPXV, Y, LGXT, Y, LAXT, Y, LSXT, Y, NPXT, Y, MPXT, Y, IPXT, Y, SPXT, Y, VPXT, and YPXR, Y, wherein x represents an optionally substituted hydroxycarboxylic acid; and X and Y independently represent any amino acid.

8. The cell of claim 7, wherein the sortase recognition motif comprises, or consists essentially of, or consists of an amino acid sequence selected from LPXT*G、LPXA*G、LPXS*G、LPXL*G、LPXV*G、LGXT*G、LAXT*G、LSXT*G、NPXT*G、MPXT*G、IPXT*G、SPXT*G、VPXT*G、YPXR*G、LPXT*S and LPXT a, wherein M is preferably LPET x G, is 2-hydroxyacetic acid.

9. The cell of any one of claims 1-8, wherein L is selected from LPXT, LPXA, LPXS, LPXL, LPXV, LGXT, LAXT, LSXT, NPXT, MPXT, IPXT, SPXT, VPXT and YPXR and X is any amino acid.

10. The cell according to any one of claims 1-9, wherein the substance a comprises an exposed thiol group, preferably an exposed cysteine, more preferably a terminal cysteine, most preferably a C-terminal cysteine.

11. The cell of any one of claims 1-10, wherein the substance comprises HPV antigenic peptides.

12. The red blood cell of claim 11, wherein the HPV antigenic peptide binds to major histocompatibility complex class I (MHC-I) proteins, such as HLA-A 02:01, HLA-A 02:02, HLA-A 02:06, HLA-A 02:07, and HLA-A 02:11.

13. The red blood cell according to claim 11 or 12, wherein the HPV antigenic peptide is an HPV16 antigenic peptide, preferably comprising or consisting of an amino acid sequence selected from the group consisting of SEQ ID No. 9 (YMLDLQPET), SEQ ID No. 10 (LLMGTLGIV) and SEQ ID No. 11 (KCLKFYSKI), or a functionally equivalent variant thereof.

14. The cell of any one of claims 1-13, wherein the sortase is sortase a (SrtA), e.g., staphylococcus aureus transpeptidase a variant (mgSrtA).

15. The cell according to any one of claims 1-14, wherein the cell is selected from enucleated cells, preferably erythrocytes.

16. A method of modifying a cell, the method comprising:

a) Providing a sortase substrate comprising an a-M-L 'structure, wherein a represents a substance, L' represents a sortase recognition motif, and M represents the remainder of the bifunctional cross-linking agent after cross-linking;

b) (i) providing a peptide having the formula Gly _mX_n -M', wherein Gly _m represents M glycine, M preferably being 1-5; x _n represents n spacer amino acids, n preferably being 0 to 10; m' represents a bifunctional crosslinking agent;

(ii) Treating a cell with Gly _mX_n -M 'peptide under conditions that allow linking of the Gly _mX_n -M' peptide to at least one membrane protein of the cell to produce Gly _mX_n -M-P, wherein M represents the remainder of the bifunctional cross-linking agent after cross-linking and P represents the at least one membrane protein of the cell; and

C) Contacting the treated cells with a sortase substrate in the presence of a sortase under one or more conditions suitable for the sortase to conjugate the sortase substrate to Gly _m by a sortase-mediated reaction;

a) And b) may be carried out simultaneously, either a) before b), or a) after b).

17. The method of claim 16, wherein prior to the treating step, the method further comprises the step of pre-treating the cells with a reducing agent to form exposed sulfhydryl groups.

18. The method according to claim 16 or 17, wherein the bifunctional crosslinking reagent is an amine-thiol, preferably a maleimide carboxylic acid (C _2-8), such as 6-maleimide caproic acid and 4-maleimide butyric acid.

19. The method according to any one of claims 16-18, wherein X _n comprises at least one amino acid having a side chain amino group, such as lysine, and preferably the C-terminal amino acid of X _n is an amino acid having a side chain amino group.

20. The method of claim 19, wherein the bifunctional crosslinking reagent crosslinks the side chain amino groups and at least one exposed thiol group of the at least one membrane protein.

21. The method of any one of claims 16-20, wherein the sortase-mediated reaction forms a substance that is linked to at least one membrane protein of the cell, the substance comprising the structure: A-M-L-Gly _mX_n -M-P, wherein A represents a substance, L represents the remainder of the recognition motif of the sortase after sortase-mediated reaction, gly _m and X _n are as defined in claim 16, M represents the remainder of the bifunctional crosslinking reagent after crosslinking, and P represents at least one membrane protein of the cell.

22. The method of any one of claims 16-21, wherein the sortase recognition motif comprises, or consists essentially of, or consists of an amino acid sequence selected from LPXTG, LPXAG, LPXSG, LPXLG, LPXVG, LGXTG, LAXTG, LSXTG, NPXTG, MPXTG, IPXTG, SPXTG, VPXTG, YPXRG, LPXTS and LPXTA, wherein X is any amino acid.

23. The method of any one of claims 16-21, wherein the sortase recognition motif comprises an unnatural amino acid at position 5 in the N-terminal to C-terminal direction of the sortase recognition motif, wherein the unnatural amino acid is an optionally substituted hydroxycarboxylic acid having the formula CH ₂OH-(CH₂)_n -COOH, where N is an integer from 0 to 3, preferably N = 0.

24. The method of claim 23, wherein the sortase recognition motif comprises, or consists essentially of, or consists of an amino acid sequence selected from LPXT Y, LPXA, Y, LPXS, Y, LPXL, Y, LPXV, Y, LGXT, Y, LAXT, Y, LSXT, Y, NPXT, Y, MPXT, Y, IPXT, Y, SPXT, Y, VPXT, and YPXR, Y, wherein x represents an optionally substituted hydroxycarboxylic acid; and X and Y independently represent any amino acid.

25. The method of claim 24, wherein the sortase recognition motif comprises, or consists essentially of, or consists of an amino acid sequence selected from LPXT*G、LPXA*G、LPXS*G、LPXL*G、LPXV*G、LGXT*G、LAXT*G、LSXT*G、NPXT*G、MPXT*G、IPXT*G、SPXT*G、VPXT*G、YPXR*G、LPXT*S and LPXT a, wherein M is preferably LPET x G, is 2-hydroxyacetic acid.

26. The method of any one of claims 16-25, wherein the substance comprises HPV antigenic peptides.

27. The method of claim 26, wherein the HPV antigenic peptide binds to major histocompatibility complex class I (MHC-I) proteins, such as HLA-A 02:01, HLA-A 02:02, HLA-A 02:06, HLA-A 02:07, and HLA-A 02:11.

28. The method of claim 26 or 27, wherein the HPV antigenic peptide is an HPV16 antigenic peptide, preferably comprising or consisting of an amino acid sequence selected from the group consisting of SEQ ID No. 9 (YMLDLQPET), SEQ ID No. 10 (LLMGTLGIV) and SEQ ID No. 11 (KCLKFYSKI), or a functionally equivalent variant thereof.

29. The method of any one of claims 16-28, wherein the sortase is sortase a (SrtA), e.g., staphylococcus aureus transpeptidase a variant (mgSrtA).

30. The method according to any one of claims 16-28, wherein the cells are selected from enucleated cells, preferably erythrocytes.

31. A cell obtained by the method of any one of claims 16-30.

32. A composition comprising the cell of any one of claims 1-15 and 31 and optionally a physiologically acceptable carrier.

33. A method for diagnosing, treating or preventing a disease associated with HPV infection in a subject in need thereof, the method comprising administering to the subject the cell of any one of claims 1-15 and 31 or the composition of claim 32.

34. The method of claim 33, wherein the disease associated with HPV infection is selected from cervical cancer, anogenital cancer, head and neck cancer, and oropharyngeal cancer.

35. A method of delivering a substance to a subject in need thereof, the method comprising administering to the subject the cell of any one of claims 1-15 and 31 or the composition of claim 32.

36. A method of increasing the circulation time or plasma half-life of a substance in a subject, the method comprising attaching the substance to a cell according to the method of any one of claims 16-30.

37. Use of a cell according to any one of claims 1-15 and 31 or a composition according to claim 32 in the manufacture of a medicament for the diagnosis, treatment or prevention of a disease associated with HPV infection.

38. The use according to claim 37, wherein the disease associated with HPV infection is selected from cervical cancer, anogenital cancer, head and neck cancer and oropharyngeal cancer.

39. The use of claim 37 or 38, wherein the medicament is a vaccine.