[go: up one dir, main page]

CN116590375A - Method for screening drugs for tumor diseases, program product, high-content imaging analysis system, method for analyzing phase separation - Google Patents

Method for screening drugs for tumor diseases, program product, high-content imaging analysis system, method for analyzing phase separation Download PDF

Info

Publication number
CN116590375A
CN116590375A CN202310118308.5A CN202310118308A CN116590375A CN 116590375 A CN116590375 A CN 116590375A CN 202310118308 A CN202310118308 A CN 202310118308A CN 116590375 A CN116590375 A CN 116590375A
Authority
CN
China
Prior art keywords
cells
target
well
aggregates
fusion protein
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310118308.5A
Other languages
Chinese (zh)
Inventor
李婷婷
李丕龙
余春雨
汪源
裴高峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tsinghua University
Peking University
Original Assignee
Tsinghua University
Peking University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tsinghua University, Peking University filed Critical Tsinghua University
Priority to CN202310118308.5A priority Critical patent/CN116590375A/en
Publication of CN116590375A publication Critical patent/CN116590375A/en
Priority to PCT/CN2024/075422 priority patent/WO2024160270A1/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/5005Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells
    • G01N33/5008Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells for testing or evaluating the effect of chemical or biological compounds, e.g. drugs, cosmetics
    • G01N33/5011Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving human or animal cells for testing or evaluating the effect of chemical or biological compounds, e.g. drugs, cosmetics for testing antineoplastic activity
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N21/00Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
    • G01N21/01Arrangements or apparatus for facilitating the optical investigation
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N21/00Investigating or analysing materials by the use of optical means, i.e. using sub-millimetre waves, infrared, visible or ultraviolet light
    • G01N21/84Systems specially adapted for particular applications
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2503/00Use of cells in diagnostics
    • C12N2503/02Drug screening
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N2500/00Screening for compounds of potential therapeutic value
    • G01N2500/10Screening for compounds of potential therapeutic value involving cells

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Immunology (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Pathology (AREA)
  • Biomedical Technology (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Analytical Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Molecular Biology (AREA)
  • Urology & Nephrology (AREA)
  • Hematology (AREA)
  • Microbiology (AREA)
  • Food Science & Technology (AREA)
  • Medicinal Chemistry (AREA)
  • Cell Biology (AREA)
  • Biotechnology (AREA)
  • Tropical Medicine & Parasitology (AREA)
  • Toxicology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

The application relates to a method for screening medicines for tumor diseases, a program product, a high content imaging analysis system and a method for analyzing phase separation conditions. The method includes seeding cells comprising a target fusion protein associated with a tumor condition of interest, comprising PS and DBD and comprising a marker molecule, into an aperture plate comprising an array of apertures, forming an aggregate that is recognizable by imaging. Partitioning a plurality of small molecule compounds into individual wells; with a microscopic imaging system, each well is continuously photographed in several different fields of view for a predetermined period of time to generate time-series images of the several fields of view. The attenuation of the amount of intracellular aggregates in each well was analyzed accordingly. Screening out the pores of which the attenuation condition of the amount of the intracellular aggregates meets the predetermined condition, and using the correspondingly distributed small molecule compounds as candidate drugs for the target tumor disorder. Therefore, the dynamic action of a large number of different small molecules relative to the aggregate can be continuously and comprehensively monitored in a high flux, and candidate medicines can be efficiently and accurately screened.

Description

筛选肿瘤病症的药物的方法、程序产品、高内涵成像分析系 统、分析相分离状况的方法Method for screening drugs for tumor diseases, program products, high-content imaging analysis system, method for analyzing phase separation conditions

技术领域Technical Field

本申请涉及基于药物筛选领域,具体而言,涉及一种基于细胞的显微成像来筛选肿瘤病症的药物的方法、程序产品、高内涵成像分析系统、分析相分离状况的方法。The present application relates to the field of drug screening, and more specifically, to a method for screening drugs for tumor diseases based on cell microscopic imaging, a program product, a high-content imaging analysis system, and a method for analyzing phase separation conditions.

背景技术Background Art

染色体重排是一种由DNA断裂和在错误位置重新连接引起的突变事件,有时会产生融合蛋白。许多融合蛋白,如慢性粒细胞白血病中发现的BCR-ABL,都会导致肿瘤发生。此外,一些与癌变相关联的融合蛋白由来自一种蛋白质的DNA结合结构域(DBD)和来自另一种蛋白质的相分离倾向结构域(PS)组成(可简称为PS-DBD融合蛋白)。相分离倾向结构和DNA结合结构域之间的融合可能导致“超级转录因子”,通过不受上游信号调控的相分离来招募转录工厂的组分。Chromosomal rearrangement is a mutational event caused by DNA breaking and rejoining at the wrong location, which sometimes produces fusion proteins. Many fusion proteins, such as BCR-ABL found in chronic myeloid leukemia, can lead to tumorigenesis. In addition, some fusion proteins associated with carcinogenesis are composed of a DNA binding domain (DBD) from one protein and a phase separation prone domain (PS) from another protein (which can be simply referred to as PS-DBD fusion proteins). The fusion between the phase separation prone structure and the DNA binding domain may lead to a "super transcription factor" that recruits components of the transcription factory through phase separation that is not regulated by upstream signals.

先前的研究中,偶尔提到PS-DBD融合蛋白的相分离能力,如尤因肉瘤中的EWS-FLI1融合蛋白和急性髓细胞白血病(AML)中的NUP98-HOXA9融合蛋白。FET蛋白家族包括三个成员,FUS、EWS和TAF15。它们都包含一个保守的N端IDR,可以驱动液-液相分离(LLPS)。FET蛋白的IDR与ETS(E26转化序列)家族转录因子的DBD融合,形成FET-ETS融合蛋白。这样的融合蛋白可以形成生物分子聚集物来招募RNA聚合酶II,从而导致基因表达异常,增加致癌性。In previous studies, the phase separation ability of PS-DBD fusion proteins has occasionally been mentioned, such as the EWS-FLI1 fusion protein in Ewing sarcoma and the NUP98-HOXA9 fusion protein in acute myeloid leukemia (AML). The FET protein family includes three members, FUS, EWS, and TAF15. They all contain a conserved N-terminal IDR that can drive liquid-liquid phase separation (LLPS). The IDR of FET proteins is fused to the DBD of the ETS (E26 transformation sequence) family transcription factors to form FET-ETS fusion proteins. Such fusion proteins can form biomolecular aggregates to recruit RNA polymerase II, resulting in abnormal gene expression and increased carcinogenicity.

先前的研究推测,一些小分子可能会增加生物分子聚集物的流动性以缓解疾病(Patel,A.et al.A liquid-to-solid phase transition of the ALS protein FUSaccelerated by disease 715mutation.Cell 162,1066-1077(2015))。但目前无法就小分子对聚集物的作用进行准确监测,更缺乏从海量小分子中高效筛选出针对目标肿瘤病症的候选药物的方法。提出本申请以旨在解决现有技术中的以上问题。Previous studies have speculated that some small molecules may increase the fluidity of biomolecular aggregates to alleviate diseases (Patel, A. et al. A liquid-to-solid phase transition of the ALS protein FUS accelerated by disease 715 mutation. Cell 162, 1066-1077 (2015)). However, it is currently impossible to accurately monitor the effects of small molecules on aggregates, and there is a lack of methods for efficiently screening candidate drugs for target tumor diseases from a large number of small molecules. This application is proposed to solve the above problems in the prior art.

发明内容Summary of the invention

需要一种筛选肿瘤病症的药物的方法、程序产品、高内涵成像分析系统、分析相分离状况的方法,其能够使得融合蛋白形成的聚集物可供成像识别,对大量不同小分子相对于聚集物的动态作用进行持续且全面的高通量监测,以针对目标肿瘤病症高效且准确地筛选出候选药物。What is needed is a method for screening drugs for tumor diseases, a program product, a high-content imaging analysis system, and a method for analyzing phase separation conditions, which can make aggregates formed by fusion proteins available for imaging identification, and conduct continuous and comprehensive high-throughput monitoring of the dynamic effects of a large number of different small molecules on the aggregates, so as to efficiently and accurately screen candidate drugs for target tumor diseases.

根据本申请的第一方案,提供一种筛选肿瘤病症的药物的方法。该方法包括如下步骤。将包含靶融合蛋白的细胞接种到包括孔阵列的孔板中,其中,所述细胞包含编码靶融合蛋白的核酸分子,所述靶融合蛋白与目标肿瘤病症相关联,包含相分离倾向结构域(PS)和DNA结合结构域(DBD)且进一步包含标记分子,以便能够形成可供成像识别的聚集物。将多种小分子化合物分配到各个孔中。利用显微成像系统,对各个孔以数个不同的视野在预定时间段内持续拍摄,以生成各个孔的数个视野的时间序列图像。由处理器,基于各个孔的至少一个视野的时间序列图像,来分析各个孔的细胞内聚集物的量衰减状况。由所述处理器,筛选出细胞内聚集物的量的衰减状况满足预定条件的孔,使用向所筛选出的孔分配的小分子化合物作为目标肿瘤病症的候选药物。According to the first scheme of the present application, a method for screening drugs for tumor conditions is provided. The method comprises the following steps. Cells containing a target fusion protein are inoculated into a well plate comprising a well array, wherein the cells contain a nucleic acid molecule encoding a target fusion protein, the target fusion protein is associated with a target tumor condition, contains a phase separation tendency domain (PS) and a DNA binding domain (DBD) and further contains a marker molecule so as to form aggregates that can be imaged and identified. A variety of small molecule compounds are distributed to each well. Using a microscopic imaging system, each well is continuously photographed in a predetermined time period with several different fields of view to generate time series images of several fields of view of each well. The processor analyzes the attenuation of the amount of intracellular aggregates in each well based on the time series images of at least one field of view of each well. The processor screens out the wells whose attenuation of the amount of intracellular aggregates meets the predetermined conditions, and uses the small molecule compounds distributed to the screened wells as candidate drugs for the target tumor condition.

根据本申请的第二方案,提供一种计算机程序产品,其包含能够存储于和/或下载到非暂时性存储介质上的计算机可执行指令。所述计算机可执行指令由处理器执行时,执行一种筛选肿瘤病症的药物的方法,包括如下步骤。获取包括孔阵列的孔板的各个孔的数个视野的时间序列图像。所述时间序列图像通过在执行如下步骤后对各个孔以数个不同的视野在预定时间段内持续拍摄而生成:将包含靶融合蛋白的细胞接种到包括孔阵列的孔板中,其中,所述细胞包含编码靶融合蛋白的核酸分子,所述靶融合蛋白与目标肿瘤病症相关联,包含相分离倾向结构域(PS)和DNA结合结构域(DBD)且进一步包含标记分子,以便能够形成可供成像识别的聚集物;将多种小分子化合物分配到各个孔中。接着,基于各个孔的至少一个视野的时间序列图像,来分析各个孔的细胞内聚集物的量衰减状况。筛选出细胞内聚集物的量的衰减状况满足预定条件的孔,并确定向所筛选出的孔分配的小分子化合物作为目标肿瘤病症的候选药物。According to the second scheme of the present application, a computer program product is provided, which includes computer executable instructions that can be stored and/or downloaded to a non-transitory storage medium. When the computer executable instructions are executed by a processor, a method for screening drugs for tumor conditions is performed, comprising the following steps. Time series images of several fields of view of each well of a well plate including a well array are obtained. The time series images are generated by continuously photographing each well in a predetermined time period with several different fields of view after performing the following steps: cells containing a target fusion protein are inoculated into a well plate including a well array, wherein the cells contain a nucleic acid molecule encoding a target fusion protein, the target fusion protein is associated with a target tumor condition, contains a phase separation tendency domain (PS) and a DNA binding domain (DBD) and further contains a marker molecule so as to form aggregates that can be imaged and identified; a plurality of small molecule compounds are distributed to each well. Then, based on the time series images of at least one field of view of each well, the amount of intracellular aggregates in each well is analyzed. Attenuation conditions. Wells whose attenuation conditions of the amount of intracellular aggregates meet predetermined conditions are screened, and small molecule compounds assigned to the screened wells are determined as candidate drugs for the target tumor disorder.

根据本申请的第三方案,提供一种高内涵成像分析系统,包括自动高速显微成像组件和至少一个处理器。所述自动高速显微成像组件配置为在如下步骤执行后对包括孔阵列的孔板的各个孔以数个不同的视野在预定时间段内持续拍摄,而生成各个孔的数个视野的时间序列图像:将包含靶融合蛋白的细胞接种到包括孔阵列的孔板中,其中,所述细胞包含编码靶融合蛋白的核酸分子,所述靶融合蛋白与目标肿瘤病症相关联,包含相分离倾向结构域(PS)和DNA结合结构域(DBD)且进一步包含标记分子,以便能够形成可供成像识别的聚集物;将多种小分子化合物分配到各个孔中。所述至少一个处理器配置为执行根据本申请各个实施例的筛选肿瘤病症的药物的方法。According to the third scheme of the present application, a high-content imaging analysis system is provided, comprising an automatic high-speed microscopic imaging component and at least one processor. The automatic high-speed microscopic imaging component is configured to continuously shoot each well of a well plate including a well array with several different fields of view within a predetermined time period after the following steps are performed, and generate time series images of several fields of view of each well: inoculating cells containing a target fusion protein into a well plate including a well array, wherein the cells contain a nucleic acid molecule encoding a target fusion protein, the target fusion protein is associated with a target tumor condition, contains a phase separation tendency domain (PS) and a DNA binding domain (DBD) and further contains a marker molecule so as to form aggregates that can be imaged and identified; a plurality of small molecule compounds are distributed into each well. The at least one processor is configured to execute a method for screening drugs for tumor conditions according to various embodiments of the present application.

根据本申请的第四方案,提供一种分析包含相分离倾向结构域(PS)和DNA结合结构域(DBD)的靶融合蛋白的相分离状况的方法,包括如下步骤。采集包含所述靶融合蛋白的细胞的图像,其中,所述细胞包含编码靶融合蛋白的核酸分子,所述靶融合蛋白进一步包含标记分子以便能够形成可供成像识别的聚集物。识别所述图像中的细胞内聚集物的量,并根据所识别的聚集物的量来确定所述PS-DBD融合蛋白在体内和/或体外的相分离状况,聚集物的量越大,则相分离状况越严重。According to the fourth scheme of the present application, a method for analyzing the phase separation state of a target fusion protein comprising a phase separation tendency domain (PS) and a DNA binding domain (DBD) is provided, comprising the following steps. An image of a cell comprising the target fusion protein is collected, wherein the cell comprises a nucleic acid molecule encoding the target fusion protein, and the target fusion protein further comprises a marker molecule so as to form aggregates that can be imaged and identified. The amount of intracellular aggregates in the image is identified, and the phase separation state of the PS-DBD fusion protein in vivo and/or in vitro is determined based on the amount of aggregates identified, and the larger the amount of aggregates, the more severe the phase separation state.

本申请全面调查了致癌融合事件,发现了多达1500多个潜在的PS-DBD融合蛋白,可作为诊断与基因表达异常相关的疾病(例如肿瘤病症)的靶点,具体地确定了数百个新的癌症的靶点。此外,本申请还建立了一种基于显微成像系统的计算机视觉的筛选肿瘤病症的药物的方法(本文中也简称为DropScan)、程序产品、高内涵成像分析系统、分析相分离状况的方法。其能够使得融合蛋白形成的聚集物可供成像识别,对大量不同小分子相对于聚集物的动态衰减调节作用进行持续且全面的高通量监测,以针对目标肿瘤病症高效且准确地筛选出候选药物,该候选药物作为肿瘤病症的临床治疗药物的先导候选者。本申请成功地使用DropScan识别了一些小分子,这些小分子可以部分或完全溶解FET-ETS癌融合蛋白形成的聚集物。特别是LY2835219,它通过上调溶酶体活性来减少聚集物的数量,随后挽救相关的异常基因表达。The present application comprehensively investigated oncogenic fusion events and found more than 1,500 potential PS-DBD fusion proteins that can be used as targets for diagnosing diseases associated with abnormal gene expression (e.g., tumor conditions), and specifically identified hundreds of new cancer targets. In addition, the present application also established a method for screening drugs for tumor conditions based on computer vision of a microscopic imaging system (also referred to herein as DropScan), a program product, a high-content imaging analysis system, and a method for analyzing phase separation conditions. It enables the aggregates formed by the fusion protein to be imaged and identified, and a large number of different small molecules are continuously and comprehensively monitored for dynamic attenuation regulation of the aggregates, so as to efficiently and accurately screen candidate drugs for the target tumor condition, and the candidate drugs are used as lead candidates for clinical treatment drugs for tumor conditions. The present application successfully used DropScan to identify some small molecules that can partially or completely dissolve aggregates formed by FET-ETS cancer fusion proteins. In particular, LY2835219 reduces the number of aggregates by upregulating lysosomal activity, and then rescues the related abnormal gene expression.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

在不一定按比例绘制的附图中,相同的附图标记可以在不同的视图中描述相似的部件。具有字母后缀或不同字母后缀的相同附图标记可以表示相似部件的不同实例。附图大体上通过举例而不是限制的方式示出各种实施例,并且与说明书以及权利要求书一起用于对所公开的实施例进行说明。这样的实施例是例证性的,而并非旨在作为本装置或方法的穷尽或排他实施例。In the drawings, which are not necessarily drawn to scale, the same reference numerals may describe similar components in different views. The same reference numerals with letter suffixes or different letter suffixes may represent different instances of similar components. The accompanying drawings generally illustrate various embodiments by way of example and not limitation, and together with the description and claims, serve to illustrate the disclosed embodiments. Such embodiments are illustrative and are not intended to be exhaustive or exclusive embodiments of the present apparatus or method.

图1示出根据本申请第一实施例的筛选肿瘤病症的药物的方法的流程图。FIG. 1 shows a flow chart of a method for screening drugs for tumor diseases according to a first embodiment of the present application.

图2示出根据本申请第二实施例的从多个数据库获得多达1500多个潜在的PS-DBD融合蛋白的流程示意图。FIG. 2 shows a schematic diagram of a process for obtaining more than 1,500 potential PS-DBD fusion proteins from multiple databases according to the second embodiment of the present application.

图3示出根据本申请第三实施例的PS-DBD融合物的相互关联性的示意图,其中示出了具有10个以上的融合对象的中心蛋白的融合网络。FIG3 is a schematic diagram showing the interconnectedness of PS-DBD fusions according to the third embodiment of the present application, wherein a fusion network of a central protein with more than 10 fusion partners is shown.

图4(a)示出根据本申请第四实施例的筛选肿瘤病症的药物的方法中,将包含靶融合蛋白的细胞-U2OS/mCherry-FUS-ERGmut细胞,接种到孔板中进行持续拍摄的示意图。FIG4( a ) shows a schematic diagram of inoculating cells containing the target fusion protein, U2OS/mCherry-FUS-ERGmut cells, into a well plate for continuous imaging in a method for screening drugs for tumor diseases according to the fourth embodiment of the present application.

图4(b)示出根据本申请第五实施例的筛选肿瘤病症的药物的方法中,对各个图像执行U2OS/mCherry-FUS-ERGmut细胞和聚集物的分割的示意图。FIG4( b ) shows a schematic diagram of performing segmentation of U2OS/mCherry-FUS-ERGmut cells and aggregates on each image in the method for screening drugs for tumor diseases according to the fifth embodiment of the present application.

图4(c)示出根据本申请第六实施例的筛选肿瘤病症的药物的方法中,基于各孔在所述预定时间段内的富含聚集物细胞分数的分布曲线来拟合的指数模型的曲线图示。FIG4( c ) shows a graphical representation of an exponential model fitted based on the distribution curve of the fraction of aggregate-rich cells in each well within the predetermined time period in the method for screening drugs for tumor diseases according to the sixth embodiment of the present application.

图4(d)示出根据本申请第七实施例的筛选肿瘤病症的药物的方法中,为1777种小分子计算的药物效果和作为拟合效果参数的R平方的散点图。FIG4( d ) shows a scatter plot of drug effects calculated for 1777 small molecules and R-squared as a fitting effect parameter in the method for screening drugs for tumor diseases according to the seventh embodiment of the present application.

图4(e)示出根据本申请第七实施例的筛选肿瘤病症的药物的方法中,144个筛选出的阳性小分子的药物效果的排名图示。FIG. 4( e ) shows a ranking diagram of the drug effects of 144 screened positive small molecules in the method for screening drugs for tumor diseases according to the seventh embodiment of the present application.

图5(a)-图5(m)示出根据本申请实施例的FUS-ERG融合蛋白在体内外驱动相分离的示意图,其中:FIG. 5( a) to FIG. 5( m) are schematic diagrams showing the FUS-ERG fusion protein driving phase separation in vivo and in vitro according to an embodiment of the present application, wherein:

图5(a)示出FUS、ERG和FUS-ERG融合蛋白的示意图(上)和无序预测(下),矩形表示ERG中的DBD;FIG5( a ) shows a schematic diagram (top) and disorder prediction (bottom) of FUS, ERG, and FUS-ERG fusion protein, with the rectangle representing the DBD in ERG;

图5(b)示出异源表达mCherry-FUS-ERG的Hela细胞的活细胞图像,DNA用Hoechst33342(蓝色)染色,比例尺,10μm;FIG5( b ) shows a live cell image of Hela cells heterologously expressing mCherry-FUS-ERG, DNA was stained with Hoechst33342 (blue), scale bar, 10 μm;

图5(c)示出OptoIDR方法示意图,OptoIDR元件由目的IDR、mCherry和Cry2olig(这是拟南芥CRY2 PHR结构域的突变形式)组成;FIG5( c ) shows a schematic diagram of the OptoIDR method, wherein the OptoIDR element is composed of the target IDR, mCherry, and Cry2olig (which is a mutant form of the Arabidopsis CRY2 PHR domain);

图5(d)示出表达mCherry-Cry2olig(上)和FUS IDR-mCherry-Cry2olig(下)融合蛋白的U2OS活细胞的代表性图像,用488nm激光刺激细胞;FIG5( d ) shows representative images of U2OS living cells expressing mCherry-Cry2olig (top) and FUS IDR-mCherry-Cry2olig (bottom) fusion proteins, with cells stimulated with 488 nm laser;

图5(e)示出如图5(d)所示刺激后液滴形成动力学的量化,每个构建体n≥30个细胞;Figure 5(e) shows quantification of droplet formation kinetics after stimulation as in Figure 5(d), with n ≥ 30 cells per construct;

图5(f)示出EGFP-FUS-ERG重组蛋白在指定浓度下的共聚焦荧光图像,比例尺,10μm;FIG5( f ) shows confocal fluorescence images of EGFP-FUS-ERG recombinant protein at specified concentrations, scale bar, 10 μm;

图5(g)示出EGFP-FUS-ERG(40μM)形成的聚集物的融合,比例尺,2μm;Fig. 5(g) shows the fusion of aggregates formed by EGFP-FUS-ERG (40 μM), scale bar, 2 μm;

图5(h)示出EGFP-FUS-ERG(40μM)形成的聚集物的FRAP的代表性荧光图像,比例尺,2μm;Figure 5 (h) shows a representative fluorescence image of FRAP of aggregates formed by EGFP-FUS-ERG (40 μM), scale bar, 2 μm;

图5(i)示出EGFP-FUS-ERG(40μM)形成的聚集物的FRAP数据的定量分析(n=9,平均值±标准差);FIG5( i ) shows the quantitative analysis of FRAP data of aggregates formed by EGFP-FUS-ERG (40 μM) (n=9, mean±standard deviation);

图5(j)示出40μM EGFP-FUS-ERG与50nM 306-bp dsDNA混合的成像结果,306-bpdsDNA含有25×GGAA并用Cy5标记,比例尺,10μm;Fig. 5(j) shows the imaging results of 40 μM EGFP-FUS-ERG mixed with 50 nM 306-bp dsDNA, 306-bp dsDNA containing 25×GGAA and labeled with Cy5, scale bar, 10 μm;

图5(k)示出由EGFP-FUS-ERG(40μM)和306-bp dsDNA(50nM,25×GGAA,用Cy5标记)形成的聚合物FRAP的代表性荧光图像,比例尺,2μm;Figure 5(k) shows a representative fluorescence image of FRAP of a polymer formed by EGFP-FUS-ERG (40 μM) and 306-bp dsDNA (50 nM, 25×GGAA, labeled with Cy5), scale bar, 2 μm;

图5(l)示出由EGFP-FUS-ERG(40μM)和306-bp dsDNA(50nM,25×GGAA,用Cy5标记)形成的缩合物的数据的定量分析(n=6,平均值±标准差);FIG5( l ) shows the quantitative analysis of the data of the condensate formed by EGFP-FUS-ERG (40 μM) and 306-bp dsDNA (50 nM, 25×GGAA, labeled with Cy5) (n=6, mean±SD);

图5(m)示出异源表达mCherry-FUS-ERG(图5(b))的DNA结合域突变形式mCherry-FUS-ERGmut的Hela细胞的活细胞图像。DNA被Hoechst 33342(蓝色)染色,比例尺,10μm。Figure 5(m) shows live cell images of Hela cells heterologously expressing the DNA binding domain mutant form of mCherry-FUS-ERG (Figure 5(b)), mCherry-FUS-ERGmut. DNA was stained with Hoechst 33342 (blue), scale bar, 10 μm.

图6(a)-图6(c)示出根据本申请第八实施例的体外纯化DNA和FUS-ERG的分析图示,其中:FIG. 6( a) to FIG. 6( c) show the analysis diagrams of in vitro purified DNA and FUS-ERG according to the eighth embodiment of the present application, wherein:

图6(a)示出根据本申请第八实施例的从大肠杆菌中纯化的MBP-EGFP-FUS-ERG的SDS-PAGE分析;FIG6( a) shows SDS-PAGE analysis of MBP-EGFP-FUS-ERG purified from Escherichia coli according to the eighth embodiment of the present application;

图6(b)示出根据本申请第八实施例的纯化的306-bp 25×GGAA dsDNA(用cy5标记)的琼脂糖电泳分析示意图。FIG6( b ) shows a schematic diagram of agarose electrophoresis analysis of purified 306-bp 25×GGAA dsDNA (labeled with cy5) according to the eighth embodiment of the present application.

图6(c)示出根据本申请第八实施例的306-bp dsDNA的序列,其中25×GGAA碱基突出显示。FIG6( c ) shows the sequence of 306-bp dsDNA according to the eighth embodiment of the present application, in which 25×GGAA bases are highlighted.

图7(a)-图7(y)示出根据本申请第九实施例的BRD9 IDR/COL17A1 IDR/MLLT1IDR/EGFP-SFPQ IDR/EGFP-MED15 IDR融合蛋白在体内外驱动相分离的示意图,其中:FIG. 7( a) to FIG. 7( y ) show schematic diagrams of BRD9 IDR/COL17A1 IDR/MLLT1IDR/EGFP-SFPQ IDR/EGFP-MED15 IDR fusion protein driving phase separation in vivo and in vitro according to the ninth embodiment of the present application, wherein:

图7(a)示出表达指定BRD9 IDR-mCherry-Cry2olig融合蛋白的U2OS细胞的代表性图像,用488nm激光刺激细胞以激活OptoIDR,比例尺,20μm;Figure 7 (a) shows representative images of U2OS cells expressing the indicated BRD9 IDR-mCherry-Cry2olig fusion proteins, and the cells were stimulated with a 488 nm laser to activate OptoIDR, scale bar, 20 μm;

图7(b)示出如图7(a)所示刺激后液滴形成动力学的量化,每个构建体n≥30个细胞;Figure 7(b) shows quantification of droplet formation kinetics after stimulation as in Figure 7(a), with n ≥ 30 cells per construct;

图7(c)示出Alexa 488标记的BRD9 IDR蛋白在指定浓度下的共聚焦荧光图像,比例尺,10μm;FIG7( c ) shows confocal fluorescence images of Alexa 488-labeled BRD9 IDR protein at the indicated concentrations, scale bar, 10 μm;

图7(d)示出由Alexa 488标记的BRD9 IDR蛋白在以下浓度(BRD9 IDR(150μM)下形成的聚集物的FRAP,比例尺,2μm;FIG7( d ) shows FRAP of aggregates formed by Alexa 488-labeled BRD9 IDR protein at the following concentrations (BRD9 IDR (150 μM), scale bar, 2 μm;

图7(e)示出对来自图7(d)的FRAP数据进行定量分析,(n=6-12,平均值±标准差);FIG7( e ) shows the quantitative analysis of the FRAP data from FIG7( d ) (n=6-12, mean±standard deviation);

图7(f)示出表达指定COL17A1 IDR-mCherry-Cry2olig融合蛋白的U2OS细胞的代表性图像,用488nm激光刺激细胞以激活OptoIDR,比例尺,20μm;Figure 7(f) shows representative images of U2OS cells expressing the indicated COL17A1 IDR-mCherry-Cry2olig fusion proteins, and the cells were stimulated with 488 nm laser to activate OptoIDR, scale bar, 20 μm;

图7(g)示出如图7(f)所示刺激后液滴形成动力学的量化,每个构建体n≥30个细胞;Figure 7(g) shows quantification of droplet formation kinetics after stimulation as in Figure 7(f), with n ≥ 30 cells per construct;

图7(h)示出Alexa 488标记的COL17A1 IDR蛋白在指定浓度下的共聚焦荧光图像,比例尺,10μm;FIG7( h ) shows confocal fluorescence images of Alexa 488-labeled COL17A1 IDR protein at the indicated concentrations, scale bar, 10 μm;

图7(i)示出示出由Alexa 488标记的COL17A1 IDR蛋白在以下浓度(COL17A1 IDR(100μM)下形成的聚集物的FRAP,比例尺,2μm;FIG. 7( i ) shows FRAP showing aggregates formed by Alexa 488-labeled COL17A1 IDR protein at the following concentrations (COL17A1 IDR (100 μM), scale bar, 2 μm;

图7(j)示出对来自图7(i)的FRAP数据进行定量分析,(n=6-12,平均值±标准差);FIG7( j ) shows the quantitative analysis of the FRAP data from FIG7( i ) (n=6-12, mean±standard deviation);

图7(k)示出表达指定MLLT1 IDR-mCherry-Cry2olig融合蛋白的U2OS细胞的代表性图像,用488nm激光刺激细胞以激活OptoIDR,比例尺,20μm;Figure 7(k) shows representative images of U2OS cells expressing the indicated MLLT1 IDR-mCherry-Cry2olig fusion proteins, and the cells were stimulated with 488 nm laser to activate OptoIDR, scale bar, 20 μm;

图7(l)示出如图7(k)所示刺激后液滴形成动力学的量化,每个构建体n≥30个细胞;Figure 7(l) shows quantification of droplet formation kinetics after stimulation as shown in Figure 7(k), with n≥30 cells per construct;

图7(m)示出Alexa 488标记的MLLT1 IDR蛋白在指定浓度下的共聚焦荧光图像,比例尺,10μm;FIG7( m ) shows confocal fluorescence images of Alexa 488-labeled MLLT1 IDR protein at the indicated concentrations, scale bar, 10 μm;

图7(n)示出由Alexa 488标记的MLLT1 IDR蛋白在以下浓度(MLLT1 IDR(25μM)下形成的聚集物的FRAP,比例尺,2μm;FIG7( n ) shows FRAP of aggregates formed by Alexa 488-labeled MLLT1 IDR protein at the following concentrations (MLLT1 IDR (25 μM), scale bar, 2 μm;

图7(o)示出对来自图7(n)的FRAP数据进行定量分析,(n=6-12,平均值±标准差);FIG7( o ) shows the quantitative analysis of the FRAP data from FIG7( n ) (n=6-12, mean±standard deviation);

图7(p)示出表达指定EGFP-SFPQ IDR-mCherry-Cry2olig融合蛋白的U2OS细胞的代表性图像,用488nm激光刺激细胞以激活OptoIDR,比例尺,20μm;Fig. 7(p) shows representative images of U2OS cells expressing the indicated EGFP-SFPQ IDR-mCherry-Cry2olig fusion proteins, cells were stimulated with 488 nm laser to activate OptoIDR, scale bar, 20 μm;

图7(q)示出如图7(p)所示刺激后液滴形成动力学的量化,每个构建体n≥30个细胞;Figure 7(q) shows quantification of droplet formation kinetics after stimulation as in Figure 7(p), n ≥ 30 cells per construct;

图7(r)示出Alexa 488标记的EGFP-SFPQ IDR蛋白在指定浓度下的共聚焦荧光图像,比例尺,10μm;FIG7 (r) shows confocal fluorescence images of Alexa 488-labeled EGFP-SFPQ IDR protein at the indicated concentrations, scale bar, 10 μm;

图7(s)示出由Alexa 488标记的EGFP-SFPQ IDR蛋白在以下浓度(FPQ IDR(10μM)下形成的聚集物的FRAP,比例尺,2μm;Figure 7(s) shows FRAP of aggregates formed by Alexa 488-labeled EGFP-SFPQ IDR protein at the following concentrations (FPQ IDR (10 μM), scale bar, 2 μm;

图7(t)示出对来自图7(s)的FRAP数据进行定量分析,(n=6-12,平均值±标准差);FIG7( t ) shows the quantitative analysis of the FRAP data from FIG7( s ) (n=6-12, mean±standard deviation);

图7(u)示出表达指定EGFP-MED15 IDR-mCherry-Cry2olig融合蛋白的U2OS细胞的代表性图像,用488nm激光刺激细胞以激活OptoIDR,比例尺,20μm;Fig. 7(u) shows representative images of U2OS cells expressing the indicated EGFP-MED15 IDR-mCherry-Cry2olig fusion proteins, and the cells were stimulated with 488 nm laser to activate OptoIDR, scale bar, 20 μm;

图7(v)示出如图7(u)所示刺激后液滴形成动力学的量化,每个构建体n≥30个细胞;Figure 7(v) shows quantification of droplet formation kinetics after stimulation as shown in Figure 7(u), with n≥30 cells per construct;

图7(w)示出Alexa 488标记的EGFP-MED15 IDR蛋白在指定浓度下的共聚焦荧光图像,比例尺,10μm;FIG7 ( w ) shows confocal fluorescence images of Alexa 488-labeled EGFP-MED15 IDR protein at the indicated concentrations, scale bar, 10 μm;

图7(x)示出由Alexa 488标记的EGFP-MED15 IDR蛋白在以下浓度(MED15 IDR(10μM)下形成的聚集物的FRAP,比例尺,2μm;FIG7( x ) shows FRAP of aggregates formed by Alexa 488-labeled EGFP-MED15 IDR protein at the following concentrations (MED15 IDR (10 μM), scale bar, 2 μm;

图7(y)示出对来自图7(x)的FRAP数据进行定量分析,(n=6-12,平均值±标准差)。FIG. 7( y ) shows the quantitative analysis of the FRAP data from FIG. 7( x ) (n=6-12, mean±s.d.).

图8(a)-图8(b)示出根据本申请第九实施例的选定的PS-DBD融合蛋白的验证图示,其中:FIG. 8( a)-FIG. 8( b) show a validation diagram of a selected PS-DBD fusion protein according to the ninth embodiment of the present application, wherein:

图8(a)示出根据本申请第九实施例的从大肠杆菌中纯化的指定PS-DBD融合蛋白的SDS-PAGE分析图示;FIG8( a) shows a schematic diagram of SDS-PAGE analysis of a designated PS-DBD fusion protein purified from Escherichia coli according to the ninth embodiment of the present application;

图8(b)示出根据本申请第九实施例的所示PS、DBD和PS-DBD融合蛋白和无序预测的示意图(每个虚线框左侧),矩形表示DBD。FIG8( b ) shows a schematic diagram of the PS, DBD and PS-DBD fusion protein and disorder prediction according to the ninth embodiment of the present application (on the left side of each dotted box), and the rectangle represents DBD.

图9(a)示出根据本申请第十实施例的TCGA数据库中PS-DBD融合肿瘤样本的DBD靶标的GSEA分析图示,对于每个具有PS-DBD融合的样本,选择没有这种融合的相同肿瘤类型的样本作为背景集。Figure 9(a) shows a GSEA analysis diagram of the DBD target of PS-DBD fusion tumor samples in the TCGA database according to the tenth embodiment of the present application. For each sample with PS-DBD fusion, samples of the same tumor type without such fusion are selected as the background set.

图9(b)示出根据本申请第十一实施例的CCLE数据库中EWS-FLI1融合细胞系的FLI1靶标的GSEA分析图示,其中来自没有EWS-FLI1融合的骨细胞系用作背景,且显示了显著增加下游转录的六种具有EWS-FLI1融合的细胞系。Figure 9(b) shows a GSEA analysis diagram of the FLI1 target of the EWS-FLI1 fusion cell lines in the CCLE database according to the eleventh embodiment of the present application, in which a bone cell line without EWS-FLI1 fusion is used as the background, and six cell lines with EWS-FLI1 fusion are shown to have significantly increased downstream transcription.

图10(a)-图10(c)示出根据本申请第十二实施例的指定IDR的相分离分析示意图,其中:FIG. 10( a) to FIG. 10( c) are schematic diagrams showing phase separation analysis of a specified IDR according to the twelfth embodiment of the present application, wherein:

图10(a)示出根据本申请第十二实施例的表达指定IDR-mCherry-Cry2olig融合蛋白的U2OS细胞的代表性图像,其中,用488nm激光刺激细胞以激活OptoIDR,比例尺为20μm;FIG. 10( a ) shows a representative image of U2OS cells expressing a specified IDR-mCherry-Cry2olig fusion protein according to the twelfth embodiment of the present application, wherein the cells are stimulated with a 488 nm laser to activate OptoIDR, and the scale bar is 20 μm;

图10(b)示出根据本申请第十二实施例的在没有488nm激光刺激的情况下,表达指定IDR-mCherry-Cry2olig融合蛋白的U2OS细胞的代表性图像,比例尺为20μm;FIG10( b ) shows a representative image of U2OS cells expressing a specified IDR-mCherry-Cry2olig fusion protein without 488 nm laser stimulation according to the twelfth embodiment of the present application, with a scale bar of 20 μm;

图10(c)示出根据本申请第十二实施例的显示OptoIDR结果汇总的饼状图。FIG10( c ) shows a pie chart showing a summary of OptoIDR results according to the twelfth embodiment of the present application.

图11(a)-图11(j)示出根据本申请第十三实施例的筛选肿瘤病症的药物的方法中,筛选出的肿瘤病症的10种代表性的候选药物的结构式、处理前的图像和处理6小时后的图像,比例尺为50μm。Figures 11(a)-11(j) show the structural formulas, images before treatment, and images 6 hours after treatment of 10 representative candidate drugs for tumor diseases screened out in the method for screening drugs for tumor diseases according to the thirteenth embodiment of the present application, and the scale bar is 50μm.

图12示出根据本申请第十四实施例的筛选肿瘤病症的药物的方法中,分配到ON123300后细胞存活状况劣于预定存活条件的孔的图像。12 shows images of wells in which the cell survival conditions after being dispensed with ON123300 are inferior to the predetermined survival conditions in the method for screening drugs for tumor disorders according to the fourteenth embodiment of the present application.

图13(a)-图13(h)示出根据本申请第十五实施例的LY2835219通过激活溶酶体溶解聚集物的示意图,其中:FIG. 13( a) to FIG. 13( h) show schematic diagrams of LY2835219 dissolving aggregates by activating lysosomes according to the fifteenth embodiment of the present application, wherein:

图13(a)示出根据本申请第十五实施例的未用LY2835219处理的细胞图像及用LY2835219处理6小时之后的细胞图像的对比图示;FIG. 13( a ) is a comparative diagram showing a cell image not treated with LY2835219 and a cell image treated with LY2835219 for 6 hours according to the fifteenth embodiment of the present application;

图13(b)示出根据本申请第十五实施例的LY2835219对指定浓度的U2OS/mCherry-FUS-ERGmut细胞的液滴的动态定量作用的示意图,具体地,示出分配到LY2835219后富有液滴的细胞分数随时间在不同剂量下的下降曲线;FIG13( b) is a schematic diagram showing the dynamic quantitative effect of LY2835219 on droplets of U2OS/mCherry-FUS-ERGmut cells of a specified concentration according to the fifteenth embodiment of the present application, specifically, showing the decreasing curve of the cell fraction rich in droplets at different doses over time after being distributed to LY2835219;

图13(c)示出根据本申请第十五实施例的用7.5μM LY2835219处理4小时后U2OS/mCherry-FUS-ERGmut细胞的Lysotracker green(溶酶体绿色荧光探针)染色图像,其中,DMSO处理组作为阴性对照,比例尺为50μm;FIG13( c ) shows a Lysotracker green (lysosomal green fluorescent probe) staining image of U2OS/mCherry-FUS-ERGmut cells after being treated with 7.5 μM LY2835219 for 4 hours according to the fifteenth embodiment of the present application, wherein the DMSO treatment group is used as a negative control, and the scale bar is 50 μm;

图13(d)示出根据本申请第十五实施例的通过流式细胞术对用7.5μM LY2835219处理0、3和6小时的U2OS/mCherry-FUS-ERGmut细胞中Lysotracker green(溶酶体绿色荧光探针)的相对强度的定量示意图,***P<0.001,对于两样本t检验而言,误差线代表标准差;;FIG13( d ) shows a quantitative schematic diagram of the relative intensity of Lysotracker green (lysosomal green fluorescent probe) in U2OS/mCherry-FUS-ERGmut cells treated with 7.5 μM LY2835219 for 0, 3 and 6 hours by flow cytometry according to the fifteenth embodiment of the present application, ***P<0.001, for two-sample t-test, the error bars represent standard deviations;

图13(e)示出根据本申请第十五实施例的用溶酶体抑制剂Baf-A1(200nM)和/或LY2835219(7.5μM)处理的U2OS/mCherry-FUS-ERGmut细胞的活细胞图像,其中,比例尺为50μm;FIG13( e ) shows a live cell image of U2OS/mCherry-FUS-ERGmut cells treated with the lysosomal inhibitor Baf-A1 (200 nM) and/or LY2835219 (7.5 μM) according to the fifteenth embodiment of the present application, wherein the scale bar is 50 μm;

图13(f)示出根据本申请第十五实施例的转染溶酶体报告质粒YFP-LAMP1,然后用DMSO或7.5μM LY2835219处理指定时间的U2OS/mCherry-FUS-ERGmut细胞的活细胞图像,其中,比例尺为20μm;FIG. 13( f ) shows live cell images of U2OS/mCherry-FUS-ERGmut cells transfected with lysosomal reporter plasmid YFP-LAMP1 and then treated with DMSO or 7.5 μM LY2835219 for a specified time according to the fifteenth embodiment of the present application, wherein the scale bar is 20 μm;

图13(g)示出根据本申请第十五实施例的U2OS/mCherry-EWS-FLI1、U2OS/mCherry-EWS-FLI1mut、U2OS/mCherry-NUP98-HOXD13mut和U2OS/mCherry-NUP98-HOXA9mut用10μM LY2835219处理指定时间的活细胞图像,下图和右图显示了10μM LY2835219对每个细胞系的量化效果,比例尺为50μm;Figure 13(g) shows live cell images of U2OS/mCherry-EWS-FLI1, U2OS/mCherry-EWS-FLI1mut, U2OS/mCherry-NUP98-HOXD13mut and U2OS/mCherry-NUP98-HOXA9mut treated with 10 μM LY2835219 for the indicated time according to the fifteenth example of the present application, the lower and right figures show the quantitative effect of 10 μM LY2835219 on each cell line, and the scale bar is 50 μm;

图13(h)示出根据本申请第十五实施例的显示7.5μM LY2835219对U2OS/mCherry-FUS-ERGmut细胞分别转染了GFP-FUS IDR、GFP-NUP98 IDR或EGFP-TDP43的U2OS细胞处理6小时前后的活细胞图像,比例尺为20μm。13( h ) shows live cell images of U2OS cells transfected with GFP-FUS IDR, GFP-NUP98 IDR or EGFP-TDP43 respectively before and after treatment with 7.5 μM LY2835219 for 6 hours according to the fifteenth embodiment of the present application, and the scale bar is 20 μm.

图14(a)-图14(c)示出根据本申请第十六实施例的LY2835219减少聚集物的量并激活溶酶体的示意图,其中:FIG. 14( a) to FIG. 14( c) show schematic diagrams of LY2835219 reducing the amount of aggregates and activating lysosomes according to the sixteenth embodiment of the present application, wherein:

图14(a)示出根据本申请第十六实施例的用指定浓度的LY2835219处理的U2OS/mCherry-FUS-ERGmut细胞的活细胞图像,比例尺为50μm;FIG. 14( a ) shows a live cell image of U2OS/mCherry-FUS-ERGmut cells treated with LY2835219 at a specified concentration according to the sixteenth example of the present application, with a scale bar of 50 μm;

图14(b)示出根据本申请第十六实施例的用7.5μM LY2835219处理4小时后U2OS细胞的Lysotracker green染色图像,比例尺为50μm;FIG. 14( b ) shows a Lysotracker green staining image of U2OS cells after being treated with 7.5 μM LY2835219 for 4 hours according to the sixteenth embodiment of the present application, with a scale bar of 50 μm;

图14(c)示出根据本申请第十六实施例的用7.5μM LY2835219处理指定时间的U2OS/mCherry-FUS-ERGmut的蛋白质免疫印迹分析图示,其中用抗mCherry抗体检测融合蛋白,且抗GAPDH抗体用作内参。14( c ) shows a diagram of Western immunoblot analysis of U2OS/mCherry-FUS-ERGmut treated with 7.5 μM LY2835219 for the indicated time according to the sixteenth example of the present application, wherein the fusion protein was detected with an anti-mCherry antibody and an anti-GAPDH antibody was used as an internal control.

图15(a)-图15(f)示出根据本申请第十七实施例的LY2835219减少液滴数量并挽救异常基因表达的示意图,其中:FIG. 15( a) to FIG. 15( f) show schematic diagrams of LY2835219 reducing the number of droplets and rescuing abnormal gene expression according to the seventeenth embodiment of the present application, wherein:

图15(a)示出根据本申请第十七实施例的用7.5μM LY2835219处理指定时间的细胞中双荧光素酶报告基因测定的结果,用双荧光素酶报告质粒和EWS-FLI1融合蛋白表达载体转染HEK293T细胞,其中EV表示没有引入融合基因的空载体,***P<0.001,对于两样本t检验而言,误差线代表标准差;FIG15( a ) shows the results of a dual luciferase reporter gene assay in cells treated with 7.5 μM LY2835219 for a specified time according to the seventeenth embodiment of the present application, HEK293T cells were transfected with a dual luciferase reporter plasmid and an EWS-FLI1 fusion protein expression vector, wherein EV represents an empty vector without an introduced fusion gene, ***P<0.001, for a two-sample t-test, the error bars represent standard deviations;

图15(b)示出根据本申请第十七实施例的在公开的扰动数据中识别EWS-FLI1靶标的示意图,其中,EWS-FLI1过表达细胞中EWS-FLI1靶标的表达水平高于FLI1过表达细胞和对照细胞,以黑体突出显示,NC表示无过表达的对照;FIG15( b ) shows a schematic diagram of identifying EWS-FLI1 targets in the disclosed perturbation data according to the seventeenth embodiment of the present application, wherein the expression level of the EWS-FLI1 target in the EWS-FLI1 overexpressing cells is higher than that in the FLI1 overexpressing cells and the control cells, which is highlighted in bold, and NC represents a control without overexpression;

图15(c)示出根据本申请第十七实施例的在用7.5μM LY2835219处理3小时和6小时之前/之后,U2OS/mCherry-EWS-FLI1细胞中差异表达基因的RNA-seq数据火山图;FIG. 15( c ) shows a volcano plot of RNA-seq data of differentially expressed genes in U2OS/mCherry-EWS-FLI1 cells before/after treatment with 7.5 μM LY2835219 for 3 hours and 6 hours according to the seventeenth embodiment of the present application;

图15(d)示出根据本申请第十七实施例的在用7.5μM LY2835219处理之前/之后,U2OS/mCherry-EWS-FLI1细胞中EWS-FLI1靶标的相对表达水平,其中,没有EWS-FLI1融合的野生型U2OS细胞的转录组用作对照样品,以计算表达的倍数变化;FIG15( d ) shows the relative expression level of the EWS-FLI1 target in U2OS/mCherry-EWS-FLI1 cells before/after treatment with 7.5 μM LY2835219 according to the seventeenth embodiment of the present application, wherein the transcriptome of wild-type U2OS cells without EWS-FLI1 fusion was used as a control sample to calculate the fold change of expression;

图15(e)示出根据本申请第十七实施例的用7.5μM LY2835219处理指定时间的U2OS/mCherry-EWS-FLI细胞中EWS-FLI基因表达水平的RT-qPCR结果,对于每个靶基因,mRNA水平用GAPDH标准化,其中,NS表示无显著性,*P<0.05,**P<0.01,***P<0.001,双样本t检验,误差线代表标准差;Figure 15(e) shows the RT-qPCR results of the EWS-FLI gene expression level in U2OS/mCherry-EWS-FLI cells treated with 7.5 μM LY2835219 for a specified time according to the seventeenth embodiment of the present application, for each target gene, the mRNA level was normalized with GAPDH, wherein NS indicates no significance, *P<0.05, **P<0.01, ***P<0.001, two-sample t-test, and the error bars represent standard deviations;

图15(f)示出根据本申请第十七实施例的PS-DBD融合蛋白如何在相关癌症中驱动异常相分离的原理示意图。FIG. 15( f ) is a schematic diagram showing the principle of how the PS-DBD fusion protein according to the seventeenth embodiment of the present application drives abnormal phase separation in relevant cancers.

图16(a)-图16(b)示出根据本申请第十八实施例的LY2835219部分挽救A-673细胞中的异常基因表达的示意图,其中:FIG. 16( a)-FIG. 16( b) show schematic diagrams of LY2835219 partially rescuing abnormal gene expression in A-673 cells according to the eighteenth embodiment of the present application, wherein:

图16(a)示出根据本申请第十八实施例的用7.5μM LY2835219处理3小时和6小时之前/之后A-673细胞中差异表达基因的RNA-seq数据火山图;FIG. 16( a ) shows a volcano plot of RNA-seq data of differentially expressed genes in A-673 cells before/after treatment with 7.5 μM LY2835219 for 3 hours and 6 hours according to the eighteenth embodiment of the present application;

图16(b)示出根据本申请第十八实施例的在用7.5μM LY2835219处理之前/之后,A-673细胞中EWS-FLI1靶标的相对表达水平,其中,来自没有EWS-FLI1融合的CCLE骨细胞系的转录组用作对照样品以计算表达的倍数变化;FIG16( b) shows the relative expression level of the EWS-FLI1 target in A-673 cells before/after treatment with 7.5 μM LY2835219 according to the eighteenth embodiment of the present application, wherein the transcriptome from the CCLE bone cell line without EWS-FLI1 fusion is used as a control sample to calculate the fold change of expression;

图17示出了本申请实施例中使用的骨架质粒pCMV-mCherry-Age I-FUS-ERG-AgeI的结构示意图。FIG. 17 shows a schematic diagram of the structure of the backbone plasmid pCMV-mCherry-Age I-FUS-ERG-Age I used in the examples of the present application.

图18示出了本申请实施例中使用的骨架质粒pHR-Spe I-FUS IDR-Spe I-mCherry-Cry2olig的结构示意图。FIG. 18 shows a schematic diagram of the structure of the backbone plasmid pHR-Spe I-FUS IDR-Spe I-mCherry-Cry2olig used in the examples of the present application.

图19示出了本申请实施例中使用的骨架质粒pETDuet的结构示意图。FIG. 19 shows a schematic diagram of the structure of the backbone plasmid pETDuet used in the examples of the present application.

图20示出了本申请实施例中使用的骨架质粒pRSFDual的结构示意图。Figure 20 shows a schematic diagram of the structure of the backbone plasmid pRSFDual used in the examples of the present application.

具体实施方式DETAILED DESCRIPTION

为使本领域具有普通知识的人员可了解本发明的特点及效果,以下谨就说明书及申请专利范围中提及的术语及用语进行一般性的说明及定义。除非另有指明,否则文中使用的所有技术及科学上的字词,皆具有本领域技术人员对于本申请所了解的通常意义,当有冲突情形时,应以本说明书的定义为准。In order to enable persons with ordinary knowledge in the art to understand the features and effects of the present invention, the following is a general description and definition of the terms and expressions mentioned in the specification and the scope of the patent application. Unless otherwise specified, all technical and scientific words used in the text have the common meanings understood by those skilled in the art for this application. In the event of a conflict, the definition in this specification shall prevail.

筛选肿瘤病症的药物的方法Method for screening drugs for tumor diseases

图1示出根据本申请第一实施例的筛选肿瘤病症的药物的方法的流程图,其中所示的方法本文中也简称为DropScan。如图1所示,该方法包括如下步骤。Fig. 1 shows a flow chart of a method for screening drugs for tumor diseases according to the first embodiment of the present application, wherein the method shown is also referred to as DropScan herein. As shown in Fig. 1 , the method comprises the following steps.

在步骤101,将包含靶融合蛋白的细胞接种到包括孔阵列的孔板中,其中,所述细胞包含编码靶融合蛋白的核酸分子,所述靶融合蛋白与目标肿瘤病症相关联,包含相分离倾向结构域(PS)和DNA结合结构域(DBD)且进一步包含标记分子,以便能够形成可供成像识别的聚集物。In step 101, cells containing a target fusion protein are seeded into a well plate comprising a well array, wherein the cells contain a nucleic acid molecule encoding a target fusion protein, the target fusion protein is associated with a target tumor disorder, contains a phase separation prone domain (PS) and a DNA binding domain (DBD) and further contains a marker molecule so as to be able to form aggregates that can be identified by imaging.

如本文中所使用的,术语“靶融合蛋白”是指肿瘤中发生基因融合事件,由两种带特定结构域的蛋白所在基因产生的融合基因所翻译而成的蛋白,所述融合基因主要由带有相分离倾向结构域(PS)的蛋白所在基因和带有DNA结合结构域(DBD)的蛋白所在基因融合组成。对于由两个基因融合翻译得到的融合蛋白,称这一蛋白由融合基因5’端的基因翻译的部分为头部,由融合基因3’端的基因翻译的部分为尾部。如果融合蛋白的头部带有相分离倾向结构域(PS),尾部带有DNA结合结构域(DBD),或者相反,尾部带有相分离倾向结构域(PS),头部带有DNA结合结构域(DBD),则称其为靶融合蛋白。比如肾癌中的SFPQ-TFE3融合蛋白,由带有相分离倾向结构域的SFPQ蛋白所在基因和带有DNA结合结构域的TFE3蛋白所在基因的融合基因翻译产生;又比如尤因肉瘤中的EWS-FLI1融合蛋白,由带有相分离倾向结构域的EWSR1蛋白所在基因和带有DNA结合结构域的FLI1蛋白所在基因的融合基因翻译产生。As used herein, the term "target fusion protein" refers to a protein translated from a fusion gene produced by two proteins with specific domains in a gene fusion event in a tumor, wherein the fusion gene is mainly composed of a fusion of a protein with a phase separation tendency domain (PS) and a protein with a DNA binding domain (DBD). For a fusion protein obtained by fusion and translation of two genes, the portion of the protein translated from the gene at the 5' end of the fusion gene is called the head, and the portion translated from the gene at the 3' end of the fusion gene is called the tail. If the head of the fusion protein carries a phase separation tendency domain (PS) and the tail carries a DNA binding domain (DBD), or conversely, the tail carries a phase separation tendency domain (PS) and the head carries a DNA binding domain (DBD), it is called a target fusion protein. For example, the SFPQ-TFE3 fusion protein in renal cancer is produced by the translation of the fusion gene of the SFPQ protein with a phase separation tendency domain and the TFE3 protein with a DNA binding domain; another example is the EWS-FLI1 fusion protein in Ewing sarcoma, which is produced by the translation of the fusion gene of the EWSR1 protein with a phase separation tendency domain and the FLI1 protein with a DNA binding domain.

如本文中所使用的,术语“相分离倾向结构域(PS)”是指固有无序区域(IDR)、低复杂度区域(LCR)和朊蛋白样结构域(PLD)中的至少一种区域,其中固有无序区域可由ESpritz工具检测,低复杂度区域可由SEG工具检测,朊蛋白样结构域可由PLAAC工具检测。以人类蛋白EWS为例子(Uniprot ID Q01844),其第1氨基酸-第56氨基酸即为一个固有无需区域,其第182氨基酸-第266氨基酸即为一个低复杂度区域,其第1氨基酸-第306氨基酸即为一个朊蛋白样结构域。As used herein, the term "phase separation prone domain (PS)" refers to at least one of an intrinsic disordered region (IDR), a low complexity region (LCR), and a prion-like domain (PLD), wherein the intrinsic disordered region can be detected by the ESpritz tool, the low complexity region can be detected by the SEG tool, and the prion-like domain can be detected by the PLAAC tool. Taking the human protein EWS as an example (Uniprot ID Q01844), its 1st amino acid to the 56th amino acid is an intrinsic unneeded region, its 182nd amino acid to the 266th amino acid is a low complexity region, and its 1st amino acid to the 306th amino acid is a prion-like domain.

如本文中所使用的,术语“DNA结合结构域(DBD)”是指蛋白中可以和DNA直接或者间接结合的结构域,比如转录因子AHR中的bHLH结构域,转录因子ATF1中的bZIP结构域。As used herein, the term "DNA binding domain (DBD)" refers to a domain in a protein that can bind directly or indirectly to DNA, such as the bHLH domain in the transcription factor AHR and the bZIP domain in the transcription factor ATF1.

例如,融合蛋白的结构可参见附表1。For example, the structure of the fusion protein can be found in Appendix 1.

PS-DBD融合蛋白的获取Acquisition of PS-DBD fusion protein

在一些实施例中,如图2所示,从多个数据库获得多达1500多个潜在的PS-DBD融合蛋白。这些PS-DBD融合蛋白根据与目标肿瘤病症的发生的关联关系(例如伴随发生、伴随后者发生的显著性变化、依赖于后者状况变化的对应显著变化等),或者导致目标肿瘤病症的发生(与目标肿瘤病症存在因果关系),可以作为意图为目标肿瘤病症筛选药物的方法中使用的靶融合蛋白。In some embodiments, as shown in Figure 2, more than 1500 potential PS-DBD fusion proteins are obtained from multiple databases. These PS-DBD fusion proteins can be used as target fusion proteins for use in methods intended to screen drugs for target tumor disorders, based on their association with the occurrence of target tumor disorders (e.g., concomitant occurrence, significant changes associated with the latter occurrence, corresponding significant changes dependent on changes in the latter status, etc.), or causing the occurrence of target tumor disorders (having a causal relationship with the target tumor disorder).

相分离相关区域包括固有无序区域(IDR)、低复杂度区域(LCR)和朊蛋白样结构域(PLD)。IDR是蛋白质中缺乏独特的原始结构模式的柔性片段。LCR是指含有偏向于少数类型的氨基酸组成的蛋白质片段。PLD的特点是含有强烈偏向少数特殊类型氨基酸的区域,如天冬酰胺、谷氨酰胺、酪氨酸和甘氨酸残基。所有这三种序列已被发现在相分离蛋白中富集。Phase separation-associated regions include intrinsically disordered regions (IDRs), low complexity regions (LCRs), and prion-like domains (PLDs). IDRs are flexible segments of proteins that lack unique, native structural patterns. LCRs are segments of proteins that contain a composition that is biased toward a few types of amino acids. PLDs are characterized by regions that contain a strong bias toward a few specific types of amino acids, such as asparagine, glutamine, tyrosine, and glycine residues. All three sequences have been found to be enriched in phase-separating proteins.

为了确定融合蛋白的相分离部分,如图2所示,在FusionGDB数据库、ChimerKB数据库和Mitelman数据库中的融合蛋白的序列(融合基因的基因对)中,搜索IDR结构、LCR结构和PLD结构。为了检测IDR,从网站http://old.protein.bio.unipd.it/espritz/,下载了Espritz-Disprot工具,并使用5%的错误发现率(FDR)作为分界线。为了识别LCR,从http://manpages.ubuntu.com/manpages/bionic/man1/ncbi-seg.1.html,下载了SEG工具,并使用ncbi-seg软件的默认参数作为阈值。对于PLD,从http://plaac.wi.mit.edu/,下载了PLAAC软件,并提取了PLAAC预测的含有PLD核心区域的片段。对于FusionGDB中那些有断点信息的融合蛋白,当其头部或尾部有超过50%的序列被预测为IDR,或有超过40%的序列被预测为LCR,或包括PLD核心区域时,则认为蛋白质含有相分离相关区域。对于那些来自ChimerKB和Mitelman的没有详细断点位置的融合,在评估其相分离相关区域时,全长蛋白被用作融合蛋白的头或尾部分。In order to determine the phase separation part of the fusion protein, as shown in Figure 2, the IDR structure, LCR structure and PLD structure were searched in the sequences of the fusion proteins (gene pairs of fusion genes) in the FusionGDB database, the ChimerKB database and the Mitelman database. In order to detect IDR, the Espritz-Disprot tool was downloaded from the website http://old.protein.bio.unipd.it/espritz/, and a 5% false discovery rate (FDR) was used as the dividing line. In order to identify LCR, the SEG tool was downloaded from http://manpages.ubuntu.com/manpages/bionic/man1/ncbi-seg.1.html, and the default parameters of the ncbi-seg software were used as the threshold. For PLD, the PLAAC software was downloaded from http://plaac.wi.mit.edu/, and the fragment containing the PLD core region predicted by PLAAC was extracted. For those fusion proteins with breakpoint information in FusionGDB, the protein was considered to contain a phase separation-related region when more than 50% of the sequence in the head or tail was predicted to be IDR, or more than 40% of the sequence was predicted to be LCR, or included the PLD core region. For those fusions from ChimerKB and Mitelman without detailed breakpoint positions, the full-length protein was used as the head or tail part of the fusion protein when evaluating its phase separation-related region.

对于DNA结合部分,从核酸-蛋白质相互作用数据库(NPIDB)中获得DNA结合结构域及其相应的PFAM ID。从PFAM数据库中下载了相应条目的HMM profile,并使用HMMER软件的hmmsearch模块在FusionGDB、ChimerKB和Mitelman数据库得出的融合蛋白序列中搜索上述HMM profile。由hmmsearch检测到的E值<10^-5的片段被认为是候选DBD片段。同样,对于那些来自FusionGDB的融合,头部或尾部的序列被用来检测DBD片段。对于那些来自ChimerKB和Mitelman的融合,在检测DBD片段时,全长蛋白被用作融合蛋白的头部或尾部部分。For the DNA binding part, the DNA binding domain and its corresponding PFAM ID were obtained from the Nucleic Acid-Protein Interaction Database (NPIDB). The HMM profiles of the corresponding entries were downloaded from the PFAM database, and the hmmsearch module of the HMMER software was used to search the above HMM profiles in the fusion protein sequences obtained from the FusionGDB, ChimerKB and Mitelman databases. Fragments with E values <10^-5 detected by hmmsearch were considered as candidate DBD fragments. Similarly, for those fusions from FusionGDB, the head or tail sequences were used to detect the DBD fragments. For those fusions from ChimerKB and Mitelman, when detecting the DBD fragments, the full-length protein was used as the head or tail part of the fusion protein.

最终,上述步骤确定的包括PS相关区域和DBD片段,分别在融合蛋白的头部序列和尾部序列中,或者反过来,分别在融合蛋白的尾部序列和头部序列中,该融合蛋白即被定义为PS-DBD融合蛋白(下文中有时也称为PS-DBD融合物)。Finally, the PS-related region and DBD fragment determined in the above steps are respectively in the head sequence and tail sequence of the fusion protein, or conversely, in the tail sequence and head sequence of the fusion protein, and the fusion protein is defined as a PS-DBD fusion protein (hereinafter sometimes also referred to as a PS-DBD fusion).

具体地,从FusionGDB获得了110个潜在的PS-DBD融合蛋白,从ChimerKB和Mitelman数据库获得了1424个潜在的PS-DBD融合蛋白。PS-DBD融合蛋白分为3类,IDR-DBD、LCR-DBD和PLD-DBD。由于IDR、LCR和PLD区域并不相互排斥,这三类融合显示出实质性的重叠。例如,47个融合物属于所有上述三类,利用图2所示的方法,总共获得了1472个独特的PS-DBD融合蛋白,均可以作为靶融合蛋白来为目标肿瘤病症筛选药物。Specifically, 110 potential PS-DBD fusion proteins were obtained from FusionGDB, and 1424 potential PS-DBD fusion proteins were obtained from ChimerKB and Mitelman databases. PS-DBD fusion proteins are divided into three categories, IDR-DBD, LCR-DBD and PLD-DBD. Since the IDR, LCR and PLD regions are not mutually exclusive, these three types of fusions show substantial overlap. For example, 47 fusions belong to all three of the above categories. Using the method shown in Figure 2, a total of 1472 unique PS-DBD fusion proteins were obtained, all of which can be used as target fusion proteins to screen drugs for target tumor diseases.

图3示出根据本申请实施例的PS-DBD融合物的相互关联性的示意图,其中示出了具有10个以上的伙伴的中心蛋白的融合网络。两个节点之间的每条边表示具有相分离潜力的蛋白质和具有DNA结合能力的蛋白质之间的融合事件。一个中心蛋白的子网络与相关癌症类型相关。例如,NUP98的N末端与27种具有DBD的不同蛋白质融合,其中许多融合与AML相关。有趣的是,许多突出显示的中心枢纽是具有相分离潜力的蛋白质,如NUP98、EWS和FUS。此外,尽管不像PS蛋白那样集中,但一些DBD蛋白也经常与多个PS蛋白融合。例如,MiT转录因子家族蛋白TFE3可以与SFPQ、MED15或PRCC融合,并且所有这些融合都与肾细胞癌有关。FIG3 shows a schematic diagram of the interconnectedness of PS-DBD fusions according to an embodiment of the present application, wherein a fusion network of a central protein with more than 10 partners is shown. Each edge between two nodes represents a fusion event between a protein with phase separation potential and a protein with DNA binding ability. A subnetwork of a central protein is associated with a related cancer type. For example, the N-terminus of NUP98 is fused to 27 different proteins with DBD, many of which are associated with AML. Interestingly, many of the highlighted central hubs are proteins with phase separation potential, such as NUP98, EWS, and FUS. In addition, although not as concentrated as PS proteins, some DBD proteins are also often fused to multiple PS proteins. For example, MiT transcription factor family protein TFE3 can be fused to SFPQ, MED15, or PRCC, and all of these fusions are associated with renal cell carcinoma.

一些融合蛋白的异常相分离可通过增强转录激活促进肿瘤发生,也称为致病性融合蛋白。例如,NUP98-HOXA9中的IDR驱动相分离聚集物的形成,这对异常染色质环和癌症的发展至关重要。另一个类似的例子是EWS-FLI1,其中EWS中的PLD驱动相分离聚集物的生成,这对于招募BAF复合物和激活异常转录至关重要。可以预计,以上1472个候选PS-DBD融合蛋白中的许多蛋白都通过异常相分离促进肿瘤发生,从而可以作为其促进发生的肿瘤的药物筛选方法的靶融合蛋白。Abnormal phase separation of some fusion proteins can promote tumorigenesis by enhancing transcriptional activation, also known as pathogenic fusion proteins. For example, the IDR in NUP98-HOXA9 drives the formation of phase-separated aggregates, which is essential for the development of abnormal chromatin loops and cancer. Another similar example is EWS-FLI1, in which the PLD in EWS drives the generation of phase-separated aggregates, which is essential for recruiting the BAF complex and activating abnormal transcription. It can be expected that many of the above 1472 candidate PS-DBD fusion proteins promote tumorigenesis through abnormal phase separation, and thus can serve as target fusion proteins for drug screening methods for the tumors they promote.

在一些实施例中,标记分子包括各种荧光标记,例如但不限于mCherry荧光标记等,以便使得所形成的聚集物可供成像识别。在一些实施例中,包含靶融合蛋白的细胞例如U2OS细胞、Hela细胞等。In some embodiments, the labeling molecule includes various fluorescent labels, such as but not limited to mCherry fluorescent labels, etc., so that the formed aggregates can be imaged and identified. In some embodiments, cells containing the target fusion protein are, for example, U2OS cells, Hela cells, etc.

关于步骤101,例如,如图4(a)所示,可以将稳定表达mCherry-FUS-ERGmut的U2OS细胞接种到384孔阵列的孔板中,该细胞包含编码FUS-ERG作为靶融合蛋白的核酸分子。又例如,也可以将稳定表达mCherry-FUS-ERG或mCherry-EWS-FLI1mut(而非mCherry-FUS-ERGmut)的U2OS细胞以外的其他细胞(例如选自U2OS细胞和293T细胞)接种到384孔阵列的孔板中。FUS-ERG融合蛋白是EWS-FLI1的同源融合蛋白(参见图5(a)),该融合蛋白在急性髓细胞白血病(AML)和尤因肉瘤中发现,与临床不良有关。Regarding step 101, for example, as shown in FIG. 4( a), U2OS cells stably expressing mCherry-FUS-ERGmut can be inoculated into a well plate of a 384-well array, and the cells contain a nucleic acid molecule encoding FUS-ERG as a target fusion protein. For another example, other cells (e.g., selected from U2OS cells and 293T cells) other than U2OS cells stably expressing mCherry-FUS-ERG or mCherry-EWS-FLI1mut (rather than mCherry-FUS-ERGmut) can also be inoculated into a well plate of a 384-well array. The FUS-ERG fusion protein is a homologous fusion protein of EWS-FLI1 (see FIG. 5( a)), which is found in acute myeloid leukemia (AML) and Ewing's sarcoma and is associated with poor clinical outcomes.

mCherry-FUS-ERGmut氨基酸序列如下(SEQ ID NO:1):The amino acid sequence of mCherry-FUS-ERGmut is as follows (SEQ ID NO: 1):

MKVSKGEEDNMAIIKEFMRFKVHMEGSVNGHEFEIEGEGEGRPYEGTQTAKLKVTKGGPLPFAWDILSPQFMYGSKAYVKHPADIPDYLKLSFPEGFKWERVMNFEDGGVVTVTQDSSLQDGEFIYKVKLRGTNFPSDGPVMQKKTMGWEASSERMYPEDGALKGEIKQRLKLKDGGHYDAEVKTTYKAKKPVQLPGAYNVNIKLDITSHNEDYTIVEQYERAEGRHSTGGMDELYKTMKGGSGGSGGSGGSLEMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQQSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGSGQIQLWQFLLELLSDSSNSSCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSDALDYYYDKNIMTKVHGKRYAYKFDFHGIAQALQPHPPESSLYKYPSDLPYMGSYHAHPQKMNFVAPHPPALPVTSSSFFAAPNPYWNSPTGGIYPNTRLPTSHMPSHLGTYY*MKVSKGEEDNMAIIKEFMRFKVHMEGSVNGHEFEIEGEGEGRPYEGTQTAKLKVTKGGPLPFAWDILSPQFMYGSKAYVKHPADIPDYLKLSFPEGFKWERVMNFEDGGVVTVTQDSSLQDGEFIYKVKLRGTNFPSDGPVMQKKTMGWEASSERMYPEDGALKGEIKQRLKLKDGGHYDAEVKTTYKAKKPVQLPGAYNVNIKLDIT SHNEDYTIVEQYERAEGRHTGGMDELYKTMKGGSGGSGGSGGSLEMASNDYTQQATQSYGAYPTQPGQGYSQQSSQPYGQQSYSGYSQSTDTSGYGQSSYSSYGQSQNTGYGTQSTPQGYGSTGGYGSSQSSQSSYGQ QSSYPGYGQQPAPSSTSGSYGSSSQSSSYGQPQSGSYSQQPSYGGQQQSYGQQQSYNPPQGYGQQNQYNSSSGGGGGGGGGGNYGQDQSSMSSGGGSGGGYGNQDQSGGGGSGGYGQQDRGGRGRGGSGGGGGGGGGGYNRSSGGYEPRGRGGGRGGRGGMGGSDRGGFNKFGGSGQIQLWQFLLELLSDSS NSSCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSDALDYYYDKNIMTKVHGKRYAYKFDFHGIAQALQPHPPESSLYKYPSDLPYMGSYHAHPQKMNFVAPHPPALPVTSSSFFAAPNPYWNSPTGGIYPNTRLPTSHMPSHLGTYY*

mCherry-EWS-FLI1mut氨基酸序列如下(SEQ ID NO:2):The amino acid sequence of mCherry-EWS-FLI1mut is as follows (SEQ ID NO: 2):

MKVSKGEEDNMAIIKEFMRFKVHMEGSVNGHEFEIEGEGEGRPYEGTQTAKLKVTKGGPLPFAWDILSPQFMYGSKAYVKHPADIPDYLKLSFPEGFKWERVMNFEDGGVVTVTQDSSLQDGEFIYKVKLRGTNFPSDGPVMQKKTMGWEASSERMYPEDGALKGEIKQRLKLKDGGHYDAEVKTTYKAKKPVQLPGAYNVNIKLDITSHNEDYTIVEQYERAEGRHSTGGMDELYKTMKGGSGGSGGSGGSLEMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQPAYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQNPSYDSVRRGAWGNNMNSGLNKSPPLGGAQTISKNTEQRPQPDPYQILGPTSSRLANPGSGQIQLWQFLLELLSDSANASCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSLALLYYYDKNIMTKVHGKRYAYKFDFHGIAQALQPHPTESSMYKYPSDISYMPSYHAHQQKVNFVPPHPSSMPVTSSSFFGAASQYWTSPTGGIYPNPNVPRHPNTHVPSHLGSYY*MKVSKGEEDNMAIIKEFMRFKVHMEGSVNGHEFEIEGEGEGRPYEGTQTAKLKVTKGGPLPFAWDILSPQFMYGSKAYVKHPADIPDYLKLSFPEGFKWERVMNFEDGGVVTVTQDSSLQDGEFIYKVKLRGTNFPSDGPVMQKKTMGWEASSERMYPEDGALKGEIKQRLKLKDGGHYDAEVKTTYK AKKPVQLPGAYNVNIKLDITSHNEDYTIVEQYERAEGRHTGGMDELYKTMKGGSGGSGGSGGSLEMASTDYSTYSQAAAQQGYSAYTAQPTQGYAQTTQAYSAYTAQPTQGYAQTTQAYGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTTTATVTTTQASYAAQSAYGTQP AYPAYGQQPAATAPTRPQDGNKPTETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQSSYSQQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGSYSQAPSQYSQQSSSYGQQNPSYDSVRRGAWGNNMNSGLNKSPPLGGAQTISKNTEQRPQPDPY QILGPTSSRLANPGSGQIQLWQFLLELLSDSANASCITWEGTNGEFKMTDPDEVARRWGERKSKPNMNYDKLSLALLYYYDKNIMTKVHGKRYAYKFDFHGIAQALQPHPTESSMYKYPSDISYMPSYHAHQQKVNFVPPHPSSMPVTSSSFFGAASQYWTSPTGGIYPNPNVPRHPNTHVPSHLGSYY*

PS-DBD融合蛋白与肿瘤病症状况的关联机制:相分离、聚集物的生成及异常基因Mechanisms of association between PS-DBD fusion proteins and tumor pathology: phase separation, aggregate formation, and abnormal genes 表达Express

以FUS-ERG融合蛋白为例进行说明,其包括FUS作为PS且包括ERG作为DBD,能够驱动相分离的发生。具体说来,FUS的部分经历了相分离,而ERG的部分包含一个识别GGAA微卫星序列的DBD。Taking the FUS-ERG fusion protein as an example, which includes FUS as PS and ERG as DBD, it can drive the occurrence of phase separation. Specifically, the FUS part undergoes phase separation, while the ERG part contains a DBD that recognizes the GGAA microsatellite sequence.

通过在细胞(例如Hela细胞、293T细胞或U2OS细胞)中(稳定或瞬时)表达mCherry-FUS-ERG,发现它与染色质共定位(图5(b)),并且在细胞核中显示出少量聚集物。具体地,FUS-ERG融合蛋白在体内和体外都经历了相分离而形成聚集物,该聚集物结合束缚于染色质,表现出可识别的偏离球形的形态,类似于液体的润湿行为。By expressing mCherry-FUS-ERG (stable or transient) in cells (such as Hela cells, 293T cells or U2OS cells), it was found to co-localize with chromatin (Figure 5(b)) and show a small amount of aggregates in the nucleus. Specifically, the FUS-ERG fusion protein undergoes phase separation both in vivo and in vitro to form aggregates, which are bound to chromatin and show a recognizable morphology that deviates from the spherical shape, similar to the wetting behavior of a liquid.

在一些实施例中,利用OptoIDR系统对FUS-ERG融合蛋白的相分离情况进行了评估和验证。OptoIDR系统由感兴趣的IDR、mCherry荧光标记和拟南芥Cry2蛋白(Cry2olig)光解酶结构域的聚集增强形式的突变体组成。在蓝光刺激下,Cry2olig形成寡聚物,这也导致IDR寡聚并增加IDR的价态。这可能会在低于LLPS临界浓度的浓度下驱动相分离,如图5(c)所示。如图5(d)和图5(e)所示,蓝光刺激导致表达OptoFUS的细胞发生明显的相分离,这表明FUS IDR确实具有发生相分离的潜力。In some embodiments, the phase separation of FUS-ERG fusion proteins was evaluated and verified using the OptoIDR system. The OptoIDR system consists of an IDR of interest, an mCherry fluorescent marker, and a mutant of an aggregation-enhanced form of the photolyase domain of the Arabidopsis Cry2 protein (Cry2olig). Under blue light stimulation, Cry2olig forms oligomers, which also causes IDR oligomerization and increases the valence of IDRs. This may drive phase separation at concentrations below the LLPS critical concentration, as shown in Figure 5(c). As shown in Figures 5(d) and 5(e), blue light stimulation causes obvious phase separation in cells expressing OptoFUS, indicating that FUS IDR does have the potential to undergo phase separation.

接下来,纯化了标记重组增强绿色荧光蛋白(EGFP)的FUS-ERG(图6(a))。在无拥挤试剂的生理缓冲条件下,EGFP-FUS-ERG在亚微摩尔浓度下进行相分离(图5(f))。聚集物融合实验和荧光漂白恢复(FRAP)实验后的荧光回收表明,聚集物具有液体性质(参见图5(g)、图5(h)和图5(i))。通过用EGFP-FUS-ERG孵育含有25个GGAA微卫星序列拷贝的DNA片段(图6(b)和图6(c)),发现DNA底物被招募到聚集物中,这些聚集物变得不规则(图5(j))。FRAP分析表明,这些聚集物仍然保留了一定的流动性(图5(k)和图5(l))。这些聚集物从图像上能够识别出来,例如可以呈现为凝胶状、可流动非规则的液体形态等等。Next, FUS-ERG tagged with recombinant enhanced green fluorescent protein (EGFP) was purified (Figure 6(a)). Under physiological buffer conditions without crowding agents, EGFP-FUS-ERG phase separated at submicromolar concentrations (Figure 5(f)). Aggregate fusion experiments and fluorescence recovery after photobleaching (FRAP) experiments showed that the aggregates had liquid properties (see Figures 5(g), 5(h), and 5(i)). By incubating a DNA fragment containing 25 copies of the GGAA microsatellite sequence with EGFP-FUS-ERG (Figures 6(b) and 6(c)), it was found that the DNA substrate was recruited into the aggregates, and these aggregates became irregular (Figure 5(j)). FRAP analysis showed that these aggregates still retained a certain degree of fluidity (Figures 5(k) and 5(l)). These aggregates can be identified from images, for example, they can appear as gel-like, flowable irregular liquid forms, etc.

以上1472个候选PS-DBD融合蛋白中,除了FUS-ERG融合蛋白以外,很多其他融合蛋白也在体内和体外驱动、发生或经历异常相分离。Among the above 1472 candidate PS-DBD fusion proteins, in addition to the FUS-ERG fusion protein, many other fusion proteins also drive, occur or undergo abnormal phase separation in vivo and in vitro.

例如,BRD9是一种含溴结构域的蛋白质,在染色质重塑和转录调控中发挥重要作用。BRD9在肝细胞癌中是TERT(端粒酶逆转录酶的转录)的融合伙伴。基于OptoIDR系统,BRD9 IDR促进了相分离的发生(见图7(a)和图7(b))。此外,纯化的重组Alexa 488标记BRD9IDR蛋白(图8(a))形成具有流动性的球形液滴(参见图7(c)、图7(d)和图7(e))。因此,BRD9IDR确实在体内和体外推动了相分离。先前的研究表明,在表达另一种TERT融合蛋白(BRD9-TERT)的脂肪肉瘤中,TERT mRNA的表达水平比没有这种融合蛋白的脂肪肉瘤高约100倍。TERT的激活对于肿瘤细胞的无限增殖是必要的。也就是说,BRD9的相分离活性通过融合作用促进TERT活化。For example, BRD9 is a bromodomain-containing protein that plays an important role in chromatin remodeling and transcriptional regulation. BRD9 is a fusion partner of TERT (telomerase reverse transcriptase) in hepatocellular carcinoma. Based on the OptoIDR system, BRD9 IDR promoted the occurrence of phase separation (see Figure 7(a) and Figure 7(b)). In addition, the purified recombinant Alexa 488-labeled BRD9IDR protein (Figure 8(a)) formed spherical droplets with fluidity (see Figure 7(c), Figure 7(d) and Figure 7(e)). Therefore, BRD9IDR does promote phase separation in vivo and in vitro. Previous studies have shown that in liposarcoma expressing another TERT fusion protein (BRD9-TERT), the expression level of TERT mRNA is about 100 times higher than that in liposarcoma without this fusion protein. Activation of TERT is necessary for the unlimited proliferation of tumor cells. That is, the phase separation activity of BRD9 promotes TERT activation through fusion.

COL17A1(胶原类型XVIIα1链)是一种胶原跨膜蛋白,也是表皮锚定复合物的关键成分。COL17Al与胃腺癌中的CST复合物亚单位SNT1融合。CST复合物以序列无关的方式与单链DNA高亲和力结合。COL17A1 IDR还促进了OptoIDR系统中的相分离。此外,纯化的重组COL17A1 IDR蛋白(见图8(a))形成具有凝胶性质的聚集物。因此,COL17A1在体内和体外都能促进相分离。STN1-COL17A1的相分离潜能可能促进诸如胃腺癌的肿瘤发生。COL17A1 (collagen type XVII α1 chain) is a collagen transmembrane protein and a key component of the epidermal anchoring complex. COL17A1 is fused to the CST complex subunit SNT1 in gastric adenocarcinoma. The CST complex binds to single-stranded DNA with high affinity in a sequence-independent manner. COL17A1 IDR also promotes phase separation in the OptoIDR system. In addition, the purified recombinant COL17A1 IDR protein (see Figure 8(a)) forms aggregates with gel properties. Therefore, COL17A1 can promote phase separation both in vivo and in vitro. The phase separation potential of STN1-COL17A1 may promote tumorigenesis such as gastric adenocarcinoma.

MLLT1(也称为ENL)是一种组蛋白乙酰化读取器,对AML中的细胞增殖和致癌转录调控特别重要。在白血病中经常发现MLLT1与MLL(也称为KMT2A)发生融合。MLLT1 IDR促进了OptoIDR系统中的相分离。此外,纯化的MLLT1 IDR蛋白(见图8(a))形成了液体聚集物。也就是说,MLLT1在体内和体外都可能驱动相分离,这表明MLL-MLLT1融合的异常相分离可能在诸如AML的白血病发生中发挥作用。MLLT1 (also known as ENL) is a histone acetylation reader that is particularly important for cell proliferation and oncogenic transcriptional regulation in AML. MLLT1 is often found fused to MLL (also known as KMT2A) in leukemia. MLLT1 IDR promotes phase separation in the OptoIDR system. In addition, purified MLLT1 IDR protein (see Figure 8(a)) formed liquid aggregates. That is, MLLT1 can drive phase separation both in vivo and in vitro, suggesting that abnormal phase separation of MLL-MLLT1 fusions may play a role in leukemogenesis such as AML.

SFPQ是前mRNA剪接因子。涉及SFPQ与TFE3融合的染色体畸变可能是乳头状肾细胞癌的原因。SFPQ IDR促进OptoIDR系统中的相分离。纯化的EGFP-SFPQ IDR蛋白(见图8(a))形成聚集物,并表现出类似液体的性质(未图示)。SFPQ is a pre-mRNA splicing factor. Chromosomal aberrations involving the fusion of SFPQ with TFE3 may be the cause of papillary renal cell carcinoma. SFPQ IDR promotes phase separation in the OptoIDR system. Purified EGFP-SFPQ IDR protein (see Figure 8(a)) forms aggregates and exhibits liquid-like properties (not shown).

MED15是TFE3的另一个融合伙伴(见图8(a)),该融合事件与肾细胞癌的发生相关联。纯化的EGFP-MED15 IDR蛋白能够在体内和体外可形成具有液体性质的聚集物(未图示)。MED15 is another fusion partner of TFE3 (see Figure 8(a)), and this fusion event is associated with the development of renal cell carcinoma. Purified EGFP-MED15 IDR protein can form aggregates with liquid properties in vivo and in vitro (not shown).

如上例示的,大量的PS-DBD融合蛋白在细胞中经历相分离,且所经历的相分离及其导致形成的聚集物与某些肿瘤病症的发生相关联,各种PS-DBD融合蛋白就可以用作相关联的这些肿瘤病症(也就是目标肿瘤病症)的药物筛选方法中的靶融合蛋白。在一些实施例中,所述目标肿瘤病症包括但不限于恶性肿瘤。As exemplified above, a large number of PS-DBD fusion proteins undergo phase separation in cells, and the phase separation and the aggregates formed as a result thereof are associated with the occurrence of certain tumor disorders, and various PS-DBD fusion proteins can be used as target fusion proteins in drug screening methods for these associated tumor disorders (i.e., target tumor disorders). In some embodiments, the target tumor disorders include but are not limited to malignant tumors.

从细胞图像中识别出的聚集物由相分离导致,使得可以通过观测细胞内聚集物量的变化,来判定PS-DBD融合蛋白(作为靶融合蛋白)的相分离状况,和/或判定细胞的目标肿瘤病症的状况的变化。例如,细胞内聚集物的量越大,则所述PS-DBD融合蛋白在体内和/或体外的相分离状况越严重。异常相分离是癌症相关PS-DBD融合的运行机制。本申请提出了一种通过小分子改变相分离行为来对抗相关癌症的新途径,相应地,PS-DBD融合物可能代表新的癌症药物靶点。又例如,细胞内聚集物量的减少,意味着细胞的目标肿瘤病症的状况的缓解,也就是目标肿瘤病症的良好治疗效果。Aggregates identified from cell images are caused by phase separation, so that the phase separation status of the PS-DBD fusion protein (as a target fusion protein) and/or the change in the status of the target tumor disease of the cell can be determined by observing the change in the amount of aggregates in the cell. For example, the greater the amount of aggregates in the cell, the more severe the phase separation status of the PS-DBD fusion protein in vivo and/or in vitro. Abnormal phase separation is the operating mechanism of cancer-related PS-DBD fusion. The present application proposes a new approach to combat related cancers by changing the phase separation behavior through small molecules, and accordingly, PS-DBD fusions may represent new cancer drug targets. For another example, a reduction in the amount of aggregates in the cell means an alleviation of the status of the target tumor disease of the cell, that is, a good therapeutic effect of the target tumor disease.

所述靶融合蛋白所经历的相分离驱动异常基因表达,所述聚集物的量的减少与异常基因表达的逆转相关联。The phase separation experienced by the target fusion protein drives aberrant gene expression, and a reduction in the amount of the aggregates is associated with a reversal of aberrant gene expression.

例如EWS-FLI1和NUP98-HOXA9可改变下游转录并驱动肿瘤发生。为了确定PS-DBD融合导致的下游转录变化是否普遍,对癌症基因组图谱(TCGA)中含有PS-DBD融合物的患者的转录组数据进行了分析。116例含有PS-DBD融合物的患者数据从TCGA库中提取。PS-DBD融合物的直接靶点尚不明确,因此使用DBD的靶基因作为替代。DBD蛋白的靶基因从enrichrlibrarys中收集,以便为462个DBD蛋白构建一个靶基因列表。结合TCGA转录组数据和目标基因列表,获得了27例患者的21个PS-DBD融合物的下游转录水平。使用GSEA算法评估PS-DBD融合的下游表达水平,并将来自相同癌症类型的样本作为背景对照。在27例肿瘤样本中,13例融合事件与下游转录显著改变相关,其中11例上调,2例下调(图9(a))。这一结果表明,PS-DBD融合蛋白确实可以改变肿瘤样本中的下游转录。For example, EWS-FLI1 and NUP98-HOXA9 can alter downstream transcription and drive tumorigenesis. To determine whether downstream transcriptional changes caused by PS-DBD fusions are common, transcriptome data from patients with PS-DBD fusions in The Cancer Genome Atlas (TCGA) were analyzed. Data from 116 patients with PS-DBD fusions were extracted from the TCGA library. The direct targets of PS-DBD fusions are not yet clear, so target genes of DBD were used as a surrogate. Target genes of DBD proteins were collected from enrichrlibraries to construct a target gene list for 462 DBD proteins. Combining TCGA transcriptome data and target gene lists, downstream transcription levels of 21 PS-DBD fusions in 27 patients were obtained. The GSEA algorithm was used to evaluate the downstream expression levels of PS-DBD fusions, and samples from the same cancer type were used as background controls. Among the 27 tumor samples, 13 fusion events were associated with significant changes in downstream transcription, of which 11 were upregulated and 2 were downregulated (Figure 9(a)). This result suggests that PS-DBD fusion protein can indeed alter downstream transcription in tumor samples.

接下来,评估了PS-DBD融合蛋白是否可以改变已建立的癌细胞系的下游表达。通过对CCLE数据库56中的细胞系转录组数据进行分析,获得了10个EWS-FLI1融合细胞系的数据。在所有10个细胞系中,FLI1靶基因的表达水平均显著上调(图9(b))。此外,使用双荧光素酶分析来检测具有FET-ETS融合物的细胞系中的荧光素酶相对活性。荧光素酶报告基因的启动子区域携带25个GGAA拷贝,GGAA是ERG和FLI1的靶序列。FUS-ERG和EWS-FLI1都增强了荧光素酶的相对活性。Next, it was assessed whether the PS-DBD fusion protein could alter downstream expression in established cancer cell lines. Data for 10 EWS-FLI1 fusion cell lines were obtained by analyzing cell line transcriptome data in the CCLE database 56. In all 10 cell lines, the expression levels of FLI1 target genes were significantly upregulated (Figure 9(b)). In addition, dual luciferase analysis was used to detect the relative luciferase activity in cell lines with FET-ETS fusions. The promoter region of the luciferase reporter gene carries 25 copies of GGAA, which is the target sequence of ERG and FLI1. Both FUS-ERG and EWS-FLI1 enhanced the relative activity of luciferase.

如上,通过分析包含PS-DBD融合物信息的患者数据和癌细胞系中的下游表达,以及使用荧光素酶报告分析,验证了PS-DBD融合蛋白确实可以改变靶基因的表达,从而驱动异常基因表达。As above, by analyzing patient data containing PS-DBD fusion information and downstream expression in cancer cell lines, as well as using luciferase reporter analysis, it was verified that PS-DBD fusion proteins can indeed alter the expression of target genes, thereby driving abnormal gene expression.

了解了PS-DBD融合蛋白与目标肿瘤病症的关联机制,包括相分离及其对异常基因表达的驱动,以及聚集物的识别量与相分离状况的正向关联,可以为各种目标肿瘤病症针对性地选择不同的PS-DBD融合蛋白,进而选出对聚集物的量显著衰减的小分子化合物,也就是对相分离状况显著逆转的小分子化合物,作为目标肿瘤病症的候选药物。请注意,本申请提出了一种高效直观的目标肿瘤病症的候选药物的表型筛选方法,不管小分子化合物以何种机制何种方式溶解并衰减了聚集物,不管其以直接物理相互作用还是间接作用来调节聚集物,只要减少了聚集物的量,则认为对目标肿瘤病症具有治疗效用。By understanding the association mechanism between PS-DBD fusion protein and target tumor disease, including phase separation and its driving of abnormal gene expression, and the positive correlation between the amount of aggregate recognition and the phase separation status, different PS-DBD fusion proteins can be selected for various target tumor diseases, and then small molecule compounds that significantly attenuate the amount of aggregates, that is, small molecule compounds that significantly reverse the phase separation status, can be selected as candidate drugs for target tumor diseases. Please note that this application proposes an efficient and intuitive phenotypic screening method for candidate drugs for target tumor diseases. Regardless of the mechanism and method by which the small molecule compound dissolves and attenuates the aggregates, regardless of whether it regulates the aggregates by direct physical interaction or indirect action, as long as the amount of aggregates is reduced, it is considered to have therapeutic efficacy for the target tumor disease.

在一些实施例中,在目标肿瘤病症为与FET融合相关的肿瘤、NUP98或MLL融合相关的血液病、TFE3融合相关的肾细胞癌、和HMGA2融合相关的脂肪瘤的情况下,选择衍生自FET-ETS融合蛋白家族的成员作为所述靶融合蛋白。In some embodiments, when the target tumor disorder is a tumor associated with FET fusion, a blood disease associated with NUP98 or MLL fusion, a TFE3 fusion-associated renal cell carcinoma, and a HMGA2 fusion-associated lipoma, a member derived from the FET-ETS fusion protein family is selected as the target fusion protein.

具体说来,所述靶融合蛋白和目标恶性肿瘤的对应关系包括如下至少一种。所述靶融合蛋白包含NUP98-HOXA9,则对应的目标恶性肿瘤为急性髓细胞白血病。所述靶融合蛋白包含SFPQ-TFE3、MED15-TFE3和/或PRCC-TFE3,则对应的目标恶性肿瘤为肾细胞癌。所述靶融合蛋白包含BRD9-IDR或BRD9-TERT,则对应的目标恶性肿瘤为肝细胞癌。所述靶融合蛋白包含TRIO-TERT,则对应的目标恶性肿瘤为脂肪肉瘤。所述靶融合蛋白包含STN1-COL17A1,则对应的目标恶性肿瘤为胃腺癌。所述靶融合蛋白包含MLL-MLLT1,则对应的目标恶性肿瘤为白血病。所述靶融合蛋白包含SFPQ-TFE3,则对应的目标恶性肿瘤为乳头状肾细胞癌。Specifically, the correspondence between the target fusion protein and the target malignant tumor includes at least one of the following. If the target fusion protein contains NUP98-HOXA9, the corresponding target malignant tumor is acute myeloid leukemia. If the target fusion protein contains SFPQ-TFE3, MED15-TFE3 and/or PRCC-TFE3, the corresponding target malignant tumor is renal cell carcinoma. If the target fusion protein contains BRD9-IDR or BRD9-TERT, the corresponding target malignant tumor is hepatocellular carcinoma. If the target fusion protein contains TRIO-TERT, the corresponding target malignant tumor is liposarcoma. If the target fusion protein contains STN1-COL17A1, the corresponding target malignant tumor is gastric adenocarcinoma. If the target fusion protein contains MLL-MLLT1, the corresponding target malignant tumor is leukemia. If the target fusion protein contains SFPQ-TFE3, the corresponding target malignant tumor is papillary renal cell carcinoma.

在靶融合蛋白的DBD中引入突变Introducing mutations in the DBD of the target fusion protein

在一些实施例中,所述靶融合蛋白的DBD包含突变,可以使得所形成的聚集物的成像识别性相较DBD不包含突变所形成的聚集物的成像识别性更显著。在Hela细胞中瞬时表达PS-DBD融合蛋白,还生成了每个PS-DBD的DBD突变版本(PS-DBDmut)。异位表达的Hela细胞的活细胞图像如图8(b)所示,其中,DNA被Hoechst 33342(蓝色)染色,比例尺为10μm,各个虚线框中的上方的活细胞图像示出表达mCherry-PS-DBD的Hela细胞的图像,而对应虚线框中的下方的活细胞图像则示出表达mCherry-PS-DBDmut(DNA结合结构域引入突变的突变版本)的Hela细胞的图像,可以看出,大部分虚线框中,后者形成的聚集物呈现接近球形(或者球形)的液滴状(见图5(m)),或者汇聚成更大的团块,使得成像识别性相较前者更好。前者形成的聚集物束缚且结合到染色质(或染色体),DNA结合结构域引入突变使得解开束缚,从而聚集物得以释放并汇聚成液滴状。In some embodiments, the DBD of the target fusion protein contains mutations, which can make the imaging recognition of the formed aggregates more significant than the imaging recognition of the aggregates formed by the DBD not containing mutations. The PS-DBD fusion protein was transiently expressed in Hela cells, and a DBD mutant version (PS-DBDmut) of each PS-DBD was also generated. The live cell image of the ectopically expressed Hela cell is shown in Figure 8 (b), wherein the DNA is stained with Hoechst 33342 (blue), the scale bar is 10 μm, and the upper live cell image in each dotted box shows the image of the Hela cell expressing mCherry-PS-DBD, while the lower live cell image in the corresponding dotted box shows the image of the Hela cell expressing mCherry-PS-DBDmut (a mutant version with a mutation introduced into the DNA binding domain). It can be seen that in most of the dotted boxes, the aggregates formed by the latter are nearly spherical (or spherical) droplets (see Figure 5 (m)), or converge into larger clumps, so that the imaging recognition is better than the former. The aggregates formed by the former are bound and attached to chromatin (or chromosomes). The introduction of mutations into the DNA binding domain releases the binding, thereby releasing the aggregates and converging into droplets.

对于TFE3融合蛋白,SFPQ/MED15-TFE3及其DBD突变形式在细胞中产生形态不同的聚集物,后者形态更容易识别(见图8(b))。这些数据表明,在肾细胞癌中发现的PS-TFE3融合蛋白可能在致癌过程中具有相分离依赖性机制。EWS-FLI1融合蛋白形成了一些类似于FUS-ERG的聚集物,而EWS-FLI在细胞中形成了许多液滴(见图8(b))。另外两个EWS融合蛋白EWS-ERG和EWS-ATF1在DBD无突变的情况下,已经均显示出明显的相分离液滴(图8(b))。NUP98融合蛋白在细胞中形成聚集物(图8(b))。也就是说,对于某些PS-DBD融合蛋白而言,只要聚集物的形态识别性足够高,也可以不引入DBD中的突变,突变可以根据需求来选择引入。For TFE3 fusion proteins, SFPQ/MED15-TFE3 and its DBD mutant form produced aggregates with different morphologies in cells, and the latter morphology was easier to identify (see Figure 8(b)). These data suggest that the PS-TFE3 fusion protein found in renal cell carcinoma may have a phase separation-dependent mechanism in the carcinogenic process. The EWS-FLI1 fusion protein formed some aggregates similar to FUS-ERG, while EWS-FLI formed many droplets in cells (see Figure 8(b)). The other two EWS fusion proteins, EWS-ERG and EWS-ATF1, have both shown obvious phase separation droplets in the absence of DBD mutations (Figure 8(b)). The NUP98 fusion protein forms aggregates in cells (Figure 8(b)). In other words, for some PS-DBD fusion proteins, as long as the morphological recognition of the aggregates is high enough, mutations in the DBD may not be introduced, and mutations can be selectively introduced according to needs.

注意到一些PS-DBD融合蛋白,如PRCC-TFE3,没有经历相分离(图8(b))。It was noted that some PS-DBD fusion proteins, such as PRCC-TFE3, did not undergo phase separation ( FIG. 8( b ) ).

利用OptoIDR分析对更多的PS蛋白候选物(图10(a)和图10(b)),发现OptoHES4在蓝光刺激下形成了聚集物。但仍有一些OptoIDR修饰的蛋白质对蓝光没有反应,没有显示出明显的聚集物。推测这些蛋白质具有相对较弱的相分离能力或没有相分离行为,并且刺激强度不足以激活它。对于所有的OptoIDR结果,发现20%的蛋白质在刺激前显示聚集物,40%的OptoIDR修饰蛋白在蓝光刺激下发生相分离并呈现聚集物,40%的OptoIDR修饰蛋白对蓝光刺激没有反应(参见图10(c))。总的来说,这些结果表明,在癌症患者中发现的超过一半的PS-DBD融合蛋白倾向于发生相分离,异常的相分离可能放大或抑制下游信号通路,从而影响细胞生理,与癌症的发生相关联,进而可以用作步骤101中的靶融合蛋白。Using OptoIDR analysis on more PS protein candidates (Figure 10(a) and Figure 10(b)), it was found that OptoHES4 formed aggregates under blue light stimulation. However, there are still some OptoIDR-modified proteins that do not respond to blue light and do not show obvious aggregates. It is speculated that these proteins have relatively weak phase separation ability or no phase separation behavior, and the stimulation intensity is not enough to activate it. For all OptoIDR results, it was found that 20% of the proteins showed aggregates before stimulation, 40% of the OptoIDR-modified proteins phase separated and showed aggregates under blue light stimulation, and 40% of the OptoIDR-modified proteins did not respond to blue light stimulation (see Figure 10(c)). Overall, these results indicate that more than half of the PS-DBD fusion proteins found in cancer patients tend to phase separate, and abnormal phase separation may amplify or inhibit downstream signaling pathways, thereby affecting cell physiology and being associated with the occurrence of cancer, and can be used as target fusion proteins in step 101.

在步骤102,将多种小分子化合物分配到各个孔中。如图4(a)所示,可以将来自抗癌药物库(APExBIO Anti-Cancer Compound Library Plus)的大于1700种的用作抗癌药物的小分子化合物并行分配到五块孔板也就是384*5个孔中,也就是说,利用五块孔板,通过执行图1所示流程,就可以针对目标肿瘤病症并行完成多达1700多种药物的高通量筛选。用于分配到各个孔中以进行后续筛选的小分子化合物可以从小分子化合物库中随机选择,例如可以从治疗肿瘤病症或恶性肿瘤的病症的药物小分子库中选择。In step 102, a plurality of small molecule compounds are distributed to each well. As shown in FIG4(a), more than 1700 small molecule compounds used as anticancer drugs from the APExBIO Anti-Cancer Compound Library Plus can be distributed in parallel to five well plates, that is, 384*5 wells. That is, by using five well plates and executing the process shown in FIG1, high-throughput screening of more than 1700 drugs can be performed in parallel for the target tumor disease. The small molecule compounds for distribution to each well for subsequent screening can be randomly selected from the small molecule compound library, for example, can be selected from a drug small molecule library for treating tumor diseases or malignant tumor diseases.

在一些实施例中,对于目标肿瘤病症,使用治疗除了肿瘤病症或者恶性肿瘤以外的其他病症的药物作为要分配到各个孔中的所述小分子化合物以进行筛选。所述其他病症与所述靶融合蛋白相关联,或者与所述靶融合蛋白的异常相分离相关联。其他病症,如ALS、额颞叶痴呆(FTD)、阿尔茨海默病(AD)、帕金森病(PD)、亨廷顿病(HD)、脊髓小脑共济失调(SCA)和强直性肌营养不良1型(DM1)和2型(DM2),都已知与病理沉积物相关,也可以从这些病症的药物小分子库中的小分子中经由图1所示方法筛选出目标肿瘤病症的候选药物。受益于本申请的高通量处理效率,这种筛选可以是随机筛选。药物的临床实验是一个漫长的过程,通过从已经获批甚至长期使用的其他病症的药物库中,筛选目标肿瘤病症的候选药物,一旦成功筛选出,能够显著降低该选出药物对人体的潜在风险(因为这些药物可能已经长期被使用,副作用等也得到了充分的验证),并显著降低了制造成本。此外,随着人口老龄化的加剧,很多其他病症可能与目标肿瘤病症伴随出现,如此有利于医生做出更准确的联合用药方案。这些其他病症如果也与目标肿瘤病症对应的同个或同源的靶融合蛋白相关联,或者与所述靶融合蛋白的异常相分离相关联,筛选成功的概率较高,也可以执行重点筛选。也就是说,如果其他病症与目标肿瘤病症对应同一个靶点-同个或同源的靶融合蛋白,则前者的药物也可以通过筛选验证而高效复用于目标肿瘤病症。In some embodiments, for the target tumor condition, drugs for treating other conditions other than the tumor condition or malignant tumor are used as the small molecule compounds to be distributed into each well for screening. The other conditions are associated with the target fusion protein, or are associated with the abnormal phase separation of the target fusion protein. Other conditions, such as ALS, frontotemporal dementia (FTD), Alzheimer's disease (AD), Parkinson's disease (PD), Huntington's disease (HD), spinocerebellar ataxia (SCA), and myotonic dystrophy type 1 (DM1) and type 2 (DM2), are all known to be associated with pathological deposits, and candidate drugs for the target tumor condition can also be screened from the small molecules in the drug small molecule library of these conditions via the method shown in Figure 1. Benefiting from the high-throughput processing efficiency of the present application, this screening can be a random screening. The clinical trial of a drug is a long process. By screening candidate drugs for the target tumor disease from the drug library of other diseases that have been approved or even used for a long time, once successfully screened, the potential risk of the selected drug to the human body can be significantly reduced (because these drugs may have been used for a long time, and the side effects have been fully verified), and the manufacturing cost is significantly reduced. In addition, with the aggravation of the aging of the population, many other diseases may occur with the target tumor disease, which is conducive to doctors to make more accurate combination drug regimens. If these other diseases are also associated with the same or homologous target fusion protein corresponding to the target tumor disease, or are associated with abnormal phase separation of the target fusion protein, the probability of successful screening is high, and key screening can also be performed. In other words, if other diseases correspond to the same target point as the target tumor disease - the same or homologous target fusion protein, the former drug can also be efficiently reused in the target tumor disease through screening and verification.

在一些实施例中,对于目标恶性肿瘤病症,使用治疗除了所述目标恶性肿瘤病症以外的其他恶性肿瘤病症的药物作为要分配到各个孔中的所述小分子化合物以进行筛选。例如,所述目标恶性肿瘤包括肉瘤,所述其他恶性肿瘤病症包括乳腺癌、白血病、肾癌、肝癌、胃癌和甲状腺癌中的任何一种。受益于本申请的高通量处理效率,这种筛选可以是随机筛选,类似于从其他病症的药物库中筛选,如此,可以显著降低该选出药物对人体的潜在风险,显著降低药物的制造成本,并有利于医生做出更准确的联合用药方案。例如,如果其他恶性肿瘤病症与所述靶融合蛋白相关联,或者与所述靶融合蛋白的异常相分离相关联,则从其药物库中有较高的概率能筛选出目标恶性肿瘤病症的候选药物,也可以执行重点筛选。也就是说,如果其他恶性肿瘤病症与目标恶性肿瘤病症对应同一个靶点-同个或同源的靶融合蛋白,则前者的药物也可以通过筛选验证而高效复用于目标恶性肿瘤病症。In some embodiments, for the target malignant tumor condition, drugs for treating other malignant tumor conditions other than the target malignant tumor condition are used as the small molecule compounds to be distributed to each well for screening. For example, the target malignant tumor includes sarcoma, and the other malignant tumor conditions include any one of breast cancer, leukemia, kidney cancer, liver cancer, gastric cancer and thyroid cancer. Benefiting from the high-throughput processing efficiency of the present application, this screening can be a random screening, similar to screening from a drug library of other diseases, so that the potential risk of the selected drug to the human body can be significantly reduced, the manufacturing cost of the drug can be significantly reduced, and it is beneficial for doctors to make more accurate combined drug regimens. For example, if other malignant tumor conditions are associated with the target fusion protein, or are associated with the abnormal phase separation of the target fusion protein, there is a higher probability that candidate drugs for the target malignant tumor condition can be screened out from its drug library, and focused screening can also be performed. That is to say, if other malignant tumor conditions correspond to the same target point-the same or homologous target fusion protein as the target malignant tumor condition, the former drug can also be efficiently reused in the target malignant tumor condition through screening verification.

在步骤103,利用显微成像系统,对各个孔以数个不同的视野在预定时间段内持续拍摄,以生成各个孔的数个视野的时间序列图像。该显微成像系统可以例如高内涵成像分析系统配备的自动高速显微成像组件,但不限于此。可以对各孔的细胞进行持续的拍摄成像的显微成像系统即可用作该显微成像系统。在一些实施例中,显微成像系统可以使用显微镜作为成像构件(如同高内涵成像分析系统那样),但不限于此,诸如微流控芯片等其他成像技术的成像构件也可以使用。In step 103, a microscopic imaging system is used to continuously photograph each well with several different fields of view within a predetermined time period to generate time series images of several fields of view of each well. The microscopic imaging system can be, for example, an automatic high-speed microscopic imaging component equipped with a high-content imaging analysis system, but is not limited thereto. A microscopic imaging system that can continuously photograph and image the cells in each well can be used as the microscopic imaging system. In some embodiments, the microscopic imaging system can use a microscope as an imaging component (as in a high-content imaging analysis system), but is not limited thereto, and imaging components of other imaging technologies such as microfluidic chips can also be used.

如图4(a)所示,可以利用高内涵成像分析系统,在将小分子化合物分配到各个孔中之后,对各个孔的多个视野进行成像,且在6小时内以长期成像模式对每个视野进行持续成像。As shown in FIG4(a), a high-content imaging analysis system can be used to image multiple fields of view of each well after small molecule compounds are dispensed into each well, and each field of view can be continuously imaged in a long-term imaging mode for 6 hours.

所述高内涵成像分析系统通常包括自动高速显微成像组件和至少一个处理器,所述至少一个处理器可以配置为执行根据本申请各个实施例的筛选肿瘤病症的药物的方法。高内涵图像分析是指在保持活细胞结构和功能完整性的前提下,在亚显微形态下同时检测不同条件对细胞形态、生长、分化、迁移、凋亡、代谢途径及信号转导等方面的影响,从单一实验中获取大量相关信息,确定其生物活性和潜在毒性。其检测范围包括但不限于:细胞计数、蛋白表达、细胞凋亡、蛋白转位、细胞活力、细胞迁移、受体内吞、细胞毒性、细胞周期和信号转导等。所述至少一个处理器可以进一步配置为执行全自动图像分析和数据管理等处理。The high-content imaging analysis system generally includes an automatic high-speed microscopic imaging component and at least one processor, and the at least one processor can be configured to execute the method for screening drugs for tumor diseases according to the various embodiments of the present application. High-content image analysis refers to the simultaneous detection of the effects of different conditions on cell morphology, growth, differentiation, migration, apoptosis, metabolic pathways and signal transduction under submicroscopic morphology while maintaining the structural and functional integrity of living cells, obtaining a large amount of relevant information from a single experiment to determine its biological activity and potential toxicity. Its detection range includes but is not limited to: cell counting, protein expression, cell apoptosis, protein translocation, cell viability, cell migration, receptor endocytosis, cytotoxicity, cell cycle and signal transduction. The at least one processor can be further configured to perform fully automatic image analysis and data management and other processing.

如此,在整个成像过程中,为各种小分子化合物作用的每个孔都生成了时间序列图像,从而可以监测各个孔中聚集物行为的动力学变化。基于各个孔中聚集物行为的动力学变化的分析,消除了各个采样时间点上的聚集物行为在孔与孔之间的个体差异的影响,从而能够更高效且准确地筛选出候选药物。In this way, during the entire imaging process, time series images are generated for each well where various small molecule compounds act, so that the kinetic changes of aggregate behavior in each well can be monitored. Based on the analysis of the kinetic changes of aggregate behavior in each well, the influence of individual differences in aggregate behavior between wells at each sampling time point is eliminated, so that candidate drugs can be screened more efficiently and accurately.

在步骤104,由处理器,基于各个孔的至少一个视野的时间序列图像,来分析各个孔的细胞内聚集物的量衰减状况。在步骤105,由所述处理器,筛选出细胞内聚集物的量的衰减状况满足预定条件的孔,使用向所筛选出的孔分配的小分子化合物作为目标肿瘤病症的候选药物。根据以上所述的相分离-聚集物的关联机制,对聚集物的量的衰减效果,意味着对相分离状况的逆转,也意味着对目标肿瘤病症的治疗效果。对聚集物的量的衰减状况满足预定条件,则认为对目标肿瘤病症具有治疗效用,进而对应的小分子化合物可以被选出作为其候选药物。在一些实施例中,聚集物的量的衰减状况可以包括相对于同个孔的当前聚集物的量相对于其在参考时间的聚集物的量的衰减比率、同个孔内当前的富含聚集物细胞分数相对于其在参考时间的富含聚集物细胞分数的衰减比率等等。In step 104, the processor analyzes the attenuation of the amount of intracellular aggregates in each well based on the time series images of at least one field of view of each well. In step 105, the processor screens out the wells whose attenuation of the amount of intracellular aggregates meets the predetermined conditions, and uses the small molecule compounds allocated to the screened wells as candidate drugs for the target tumor disease. According to the phase separation-aggregate association mechanism described above, the attenuation effect on the amount of aggregates means the reversal of the phase separation condition, and also means the therapeutic effect on the target tumor disease. If the attenuation of the amount of aggregates meets the predetermined conditions, it is considered to have therapeutic efficacy for the target tumor disease, and then the corresponding small molecule compound can be selected as its candidate drug. In some embodiments, the attenuation of the amount of aggregates may include the attenuation ratio of the amount of the current aggregates in the same well relative to the amount of aggregates at the reference time, the attenuation ratio of the current aggregate-rich cell fraction in the same well relative to the aggregate-rich cell fraction at the reference time, and the like.

通过图1所示的筛选方法,能够使得融合蛋白形成的聚集物可供成像识别,对大量不同小分子相对于聚集物的动态衰减调节作用进行持续且全面的高通量监测。值得注意的是,该筛选方法通过分析图像的时间序列来揭示聚集物变化的动力学,有效且直接地量化了小分子对诸如液滴形态的聚集物的影响,可用于大规模、高度并行的化合物筛选。这种基于时间序列图像-聚集物动力学变化的分析方式,消除了不同孔的不同时间点富含液滴的细胞比例可能差异很大(参见图11(a)-图11(j))的影响,以便该差异很大,依然可以针对目标肿瘤病症高效且准确地筛选出候选药物。该候选药物可以作为肿瘤病症的临床治疗药物的先导候选者。Through the screening method shown in FIG1 , the aggregates formed by the fusion protein can be imaged and identified, and the dynamic attenuation regulation of a large number of different small molecules relative to the aggregates can be continuously and comprehensively monitored at high throughput. It is worth noting that the screening method reveals the dynamics of aggregate changes by analyzing the time series of images, effectively and directly quantifies the effects of small molecules on aggregates such as droplet morphology, and can be used for large-scale, highly parallel compound screening. This analysis method based on time series image-aggregate dynamic changes eliminates the influence of the large difference in the proportion of droplet-rich cells at different time points in different wells (see FIG11 (a)-FIG11 (j)), so that even if the difference is large, candidate drugs can still be efficiently and accurately screened for the target tumor disease. The candidate drug can be used as a lead candidate for clinical treatment drugs for tumor diseases.

在一些实施例中,对各个孔以数个不同的视野在预定时间段内持续拍摄具体包括:将每个孔划分为数个独立的视野;对于每个孔,依序拍摄每个视野的图像,以得到每个视野在预定时间段内的时间序列图像。对于例如6个小时的长拍摄时间,期间活细胞会运动,如果单独拍一个视野,拍摄图像中的细胞分布容易出现偶然性(例如该视野中细胞数量剧烈变化、细胞数量稀少等),从而导致相对于聚集物行为的真实动力学变化产生偏差。以多个视野持续拍摄图像,并对其中的细胞内聚集物的衰减状况进行联合比照分析,例如可以叠加或者求平均,就可以消除分布偶然性的影响,使得对于各个孔中聚集物行为的真实动力学变化的分析更准确,鲁棒性更好。In some embodiments, continuously photographing each well with several different fields of view within a predetermined time period specifically includes: dividing each well into several independent fields of view; for each well, sequentially photographing images of each field of view to obtain time series images of each field of view within a predetermined time period. For a long photographing time of, for example, 6 hours, during which living cells will move, if a single field of view is photographed, the cell distribution in the photographed image is prone to randomness (for example, the number of cells in the field of view changes dramatically, the number of cells is scarce, etc.), thereby causing deviations from the true dynamic changes of the behavior of the aggregates. Continuously photographing images with multiple fields of view, and performing a joint comparative analysis of the attenuation conditions of the intracellular aggregates therein, such as superposition or averaging, can eliminate the influence of distribution randomness, so that the analysis of the true dynamic changes of the behavior of the aggregates in each well is more accurate and more robust.

在一些实施例中,基于各个孔的至少一个视野的时间序列图像,来分析各个孔的细胞内聚集物的量衰减状况具体包括,对于各个孔执行如下步骤。In some embodiments, analyzing the attenuation of the amount of intracellular aggregates in each well based on the time series images of at least one field of view of each well specifically includes performing the following steps for each well.

对于每个视野的所述时间序列图像的各个采样时间点的图像,执行细胞和聚集物的分割,并确定各个细胞的聚集物量。具体说来,可以通过将分割出的聚集物与分割出的细胞相关联,以得到各个细胞的聚集物量。例如,细胞和聚集物的分割可以采用各种图像分割算法,包括但不限于各种分割学习网络(例如卷积神经网络)、基于边缘的分割算法、基于阈值的分割算法等。For the images of each sampling time point of the time series images of each field of view, the segmentation of cells and aggregates is performed, and the amount of aggregates of each cell is determined. Specifically, the amount of aggregates of each cell can be obtained by associating the segmented aggregates with the segmented cells. For example, the segmentation of cells and aggregates can adopt various image segmentation algorithms, including but not limited to various segmentation learning networks (such as convolutional neural networks), edge-based segmentation algorithms, threshold-based segmentation algorithms, etc.

对于每个视野的各个时间点的图像,确定聚集物量超过富集聚集物阈值的细胞所占比例,并基于视野求平均后作为该孔在各个采样时间点的富含聚集物细胞分数,从而得到该孔在所述预定时间段内的富含聚集物细胞分数的分布曲线。对于每个视野确定聚集物量超过富集聚集物阈值的细胞所占比例再基于视野求平均,来表征细胞内聚集物的量的衰减状况,相较直接采用聚集物的量的衰减量,或者先基于视野求聚集物平均量再计算富含聚集物细胞分数,能够消除不同视野中细胞数量不平衡的影响,以免细胞量较多的视野的聚集物状况主导并掩盖了其他视野的聚集物状况,同时又能够消除不同视野中细胞运动导致的分布偶然性的影响,使得对细胞内聚集物的量的衰减状况的表征更全面周到且精准。For the images at each time point of each visual field, the proportion of cells whose aggregate amount exceeds the enriched aggregate threshold is determined, and the average is calculated based on the visual field as the aggregate-rich cell fraction of the well at each sampling time point, thereby obtaining a distribution curve of the aggregate-rich cell fraction of the well within the predetermined time period. For each visual field, the proportion of cells whose aggregate amount exceeds the enriched aggregate threshold is determined and then averaged based on the visual field to characterize the attenuation of the amount of intracellular aggregates. Compared with directly using the attenuation of the amount of aggregates, or first calculating the average amount of aggregates based on the visual field and then calculating the aggregate-rich cell fraction, the influence of the imbalance of cell numbers in different visual fields can be eliminated, so as to prevent the aggregate state of the visual field with a large number of cells from dominating and covering up the aggregate state of other visual fields, and at the same time, the influence of the distribution contingency caused by cell movement in different visual fields can be eliminated, so that the characterization of the attenuation of the amount of intracellular aggregates is more comprehensive, thoughtful and accurate.

基于该孔在所述预定时间段内的富含聚集物细胞分数的分布曲线,拟合指数模型。基于拟合的指数模型,来计算拟合效果参数和富含聚集物细胞分数的衰减比率。本申请人发现,拟合出的指数模型相对于分布曲线上各个时间点的富含聚集物细胞分数能够实现优化的回归并消除局部扰动,不仅据此计算的富含聚集物细胞分数的衰减比率更准确,鲁棒性更高,且还能够通过拟合效果参数来检验分布曲线的衰减比率计算的可靠性。其中,所述预定条件包括富含聚集物细胞分数的衰减比率达到第一预定条件且拟合效果参数优于第二阈值,也就是说,富含聚集物细胞分数的衰减比率的计算结果足够可靠且达到第一预定条件,才认为向所筛选出的孔分配的小分子化合物对于聚集物的量的溶解衰减作用(也意味着对相分离状况的逆转作用)达到治疗效果,如此,可以避免基于不可靠的衰减比率计算结果来指导目标肿瘤病症的候选药物的筛选。Based on the distribution curve of the aggregate-rich cell fraction of the well within the predetermined time period, an exponential model is fitted. Based on the fitted exponential model, the fitting effect parameter and the attenuation ratio of the aggregate-rich cell fraction are calculated. The applicant has found that the fitted exponential model can achieve optimized regression and eliminate local disturbances relative to the aggregate-rich cell fraction at each time point on the distribution curve. Not only is the attenuation ratio of the aggregate-rich cell fraction calculated based on this more accurate and more robust, but the reliability of the attenuation ratio calculation of the distribution curve can also be tested by the fitting effect parameter. Among them, the predetermined condition includes that the attenuation ratio of the aggregate-rich cell fraction reaches the first predetermined condition and the fitting effect parameter is better than the second threshold value, that is, the calculation result of the attenuation ratio of the aggregate-rich cell fraction is sufficiently reliable and reaches the first predetermined condition, and it is considered that the dissolution attenuation effect of the small molecule compound distributed to the screened well on the amount of aggregates (also means the reversal effect on the phase separation state) reaches a therapeutic effect, so that it can avoid guiding the screening of candidate drugs for target tumor diseases based on unreliable attenuation ratio calculation results.

在一些实施例中,如果视野中细胞运动导致的分布偶然性的影响较小,也可以基于单个视野来执行类似的处理。具体说来,对于每个视野的所述时间序列图像在各个采样时间点的图像,执行细胞和聚集物的分割,并确定各个细胞的聚集物量。对于单个视野的各个采样时间点的图像,确定聚集物量超过富集聚集物阈值的细胞所占比例,作为该孔在各个采样时间点的富含聚集物细胞分数,从而得到该孔在所述预定时间段内的富含聚集物细胞分数的分布曲线。基于该孔在所述预定时间段内的富含聚集物细胞分数的分布曲线,拟合指数模型。基于拟合的指数模型,来计算拟合效果参数和富含聚集物细胞分数的衰减比率。其中,所述预定条件包括富含聚集物细胞分数的衰减比率达到第一预定条件且拟合效果参数优于第二阈值。In some embodiments, if the influence of distribution contingency caused by cell movement in the field of view is small, similar processing can also be performed based on a single field of view. Specifically, for the images of the time series images of each field of view at each sampling time point, the segmentation of cells and aggregates is performed, and the amount of aggregates of each cell is determined. For the images of each sampling time point of a single field of view, the proportion of cells whose aggregate amount exceeds the enriched aggregate threshold is determined as the aggregate-rich cell fraction of the hole at each sampling time point, thereby obtaining the distribution curve of the aggregate-rich cell fraction of the hole in the predetermined time period. Based on the distribution curve of the aggregate-rich cell fraction of the hole in the predetermined time period, an exponential model is fitted. Based on the fitted exponential model, the fitting effect parameter and the attenuation ratio of the aggregate-rich cell fraction are calculated. Wherein, the predetermined condition includes that the attenuation ratio of the aggregate-rich cell fraction reaches the first predetermined condition and the fitting effect parameter is better than the second threshold.

在一些实施例中,可以采用如下方式来对各个图像执行细胞和聚集物的分步针对性分割。可以利用第一算法来分割细胞。识别图像中的点状结构,并以聚集物的代表尺寸作为目标尺寸执行增强;再利用第二算法,分割出强度值比背景的代表强度值高出第三阈值的聚集物。一块孔板的6小时拍摄能够产生多达10G甚至50G的图像数据,通过先利用第一算法分割细胞,就能够形成细胞蒙版(二元矩阵),显著提升了后续的图像处理速度并降低了计算负荷。聚集物对应的点状结构是亮的(且分布可能不均匀或因噪声影响而分散),图像背景也是亮的,两者对比度不够高,通过以聚集物的代表尺寸(例如5-10个像素)作为目标尺寸执行增强,就能够将对比度不够高的聚集物点高,通过与背景的代表强度值的简单阈值划分而高效识别出来。In some embodiments, the following method can be used to perform step-by-step targeted segmentation of cells and aggregates on each image. The first algorithm can be used to segment cells. Identify the point structure in the image, and perform enhancement with the representative size of the aggregate as the target size; then use the second algorithm to segment the aggregates whose intensity value is higher than the representative intensity value of the background by a third threshold. A 6-hour shooting of a well plate can generate up to 10G or even 50G of image data. By first using the first algorithm to segment the cells, a cell mask (binary matrix) can be formed, which significantly improves the subsequent image processing speed and reduces the computational load. The point structure corresponding to the aggregate is bright (and the distribution may be uneven or dispersed due to noise), and the image background is also bright. The contrast between the two is not high enough. By performing enhancement with the representative size of the aggregate (e.g., 5-10 pixels) as the target size, the aggregate points with insufficient contrast can be efficiently identified by simple threshold division with the representative intensity value of the background.

例如,所述第一算法自动地确定用于分割细胞前景和细胞背景的强度阈值且据此进行细胞分割,相较诸如利用学习网络或者基于边缘的识别,基于前景和背景的强度阈值的分割计算负荷更小且计算速度更快,并且,对于孔的视野图像中细胞的分割效果也良好。第二算法自动地确定聚集物背景的鲁棒性的代表强度值,所述第三阈值为背景的代表强度值的标准偏差的3-5倍,如此,在分割出细胞后,利用这样的第三阈值设置,也能够将增强后的聚集物高效全面地识别出来。For example, the first algorithm automatically determines the intensity threshold for segmenting the cell foreground and the cell background and performs cell segmentation accordingly. Compared with the use of learning networks or edge-based recognition, the segmentation based on the intensity threshold of the foreground and the background has a smaller computational load and a faster computational speed, and the segmentation effect of the cells in the field of view image of the hole is also good. The second algorithm automatically determines the representative intensity value of the robustness of the aggregate background, and the third threshold is 3-5 times the standard deviation of the representative intensity value of the background. In this way, after the cells are segmented, the enhanced aggregates can be efficiently and comprehensively identified using such a third threshold setting.

参见图4(b),对于高内涵显微镜拍摄的每个图像,可以对进行图像标准化,例如将其每个像素的强度值除以0.05,以生成新图像。对于每个图像,可以首先采用Otsu算法用于分割目标直径大小为70至250像素的细胞。Otsu算法能够迅速自动地确定分割细胞前景和背景的阈值,尤其适用于单幅图像中几十个到上百个量级的细胞的分割,这恰好是各个孔的单个视野内通常的细胞数量。Referring to FIG. 4( b ), for each image captured by the high-content microscope, the image can be normalized, for example, the intensity value of each pixel is divided by 0.05 to generate a new image. For each image, the Otsu algorithm can be first used to segment cells with a target diameter of 70 to 250 pixels. The Otsu algorithm can quickly and automatically determine the threshold for segmenting the foreground and background of cells, and is particularly suitable for the segmentation of dozens to hundreds of cells in a single image, which happens to be the number of cells usually in a single field of view of each well.

参见图4(b),对图像中的点状结构进行了增强,目标直径设置为5像素,也就是对液滴进行了增强。例如,使用白色tophat滤波器对图像进行增强,其增强半径设置为接近液滴代表半径。As shown in Figure 4(b), the dot structure in the image is enhanced, and the target diameter is set to 5 pixels, that is, the droplet is enhanced. For example, a white tophat filter is used to enhance the image, and its enhancement radius is set to be close to the representative radius of the droplet.

然后,参见图4(b),可以使用鲁棒背景算法检测直径为5至10像素的液滴,其中,将强度高于背景的代表性强度值例如4个标准偏差以上的结构识别并分割为液滴。单幅图像中存在的液滴通常是成千上万数量级的,而且不同液滴的亮度不如细胞那么均匀,利用鲁棒背景算法,考虑到结构与周围亮度的相对比较关系来判断该结构是否液滴。具体说来,该鲁棒背景算法可以首先去除图像中最亮和最暗的像素强度,以消除异常值和真实信号,以便所有剩余的像素仅代表背景强度级别。然后,计算背景强度的均值和标准差。然后通过将平均值与n*标准差相加来计算阈值,n=3、4或5,高于阈值的被认为是前景的液滴。该鲁棒背景算法可以在整个图像中执行,也可以按需在图像的任意局部针对性执行。Then, referring to FIG. 4( b ), a robust background algorithm can be used to detect droplets with a diameter of 5 to 10 pixels, wherein structures with an intensity higher than the representative intensity value of the background, such as 4 standard deviations, are identified and segmented as droplets. The droplets present in a single image are usually in the order of tens of thousands, and the brightness of different droplets is not as uniform as that of cells. The robust background algorithm is used to determine whether the structure is a droplet by considering the relative comparison relationship between the structure and the surrounding brightness. Specifically, the robust background algorithm can first remove the brightest and darkest pixel intensities in the image to eliminate outliers and true signals so that all remaining pixels represent only the background intensity level. Then, the mean and standard deviation of the background intensity are calculated. Then, the threshold is calculated by adding the mean to n*standard deviation, n=3, 4 or 5, and those above the threshold are considered to be foreground droplets. The robust background algorithm can be executed in the entire image or in any local area of the image as needed.

接着,将所识别的液滴与之前步骤中识别的细胞相关联,可以得到每个细胞的液滴数。Next, the identified droplets are associated with the cells identified in the previous step to obtain the number of droplets per cell.

对于每个孔板,可以获得初始时间点每个细胞的液滴数。将初始时间点的所有细胞与其液滴数进行排序,将顶部15%位置的液滴数用作定义液滴富集细胞的阈值。然后,对于板中的所有孔,随着时间的推移,监测液滴数超过液滴富集阈值的细胞比例。药物治疗后液滴富集细胞比例的时间序列可以用如下的指数模型来良好地拟合逼近,该指数模型的曲线图如图4(c)所示。For each well plate, the number of droplets per cell at the initial time point can be obtained. All cells at the initial time point are sorted by their droplet number, and the number of droplets at the top 15% is used as the threshold for defining droplet-enriched cells. Then, for all wells in the plate, the proportion of cells with droplet numbers exceeding the droplet enrichment threshold is monitored over time. The time series of the proportion of droplet-enriched cells after drug treatment can be well fitted by the following exponential model, the curve of which is shown in Figure 4(c).

该指数模型由如下的公式(1)来定义,其将液滴富集细胞的分数(也是每个孔中的富含聚集物细胞分数)描述为药物治疗后时间的函数:The exponential model is defined by the following formula (1), which describes the fraction of droplet-enriched cells (also the fraction of aggregate-enriched cells in each well) as a function of time after drug treatment:

其中c0和c1是指数模型中需要回归的常数,t是小分子化合物的作用时间,Y(t)是每个孔中的富含聚集物细胞分数,其可以是基于单个视野的富含聚集物细胞分数,也可以是基于多个视野求平均的富含聚集物细胞分数。Where c0 and c1 are constants that need to be regressed in the exponential model, t is the action time of the small molecule compound, and Y(t) is the fraction of aggregate-rich cells in each well, which can be the fraction of aggregate-rich cells based on a single field of view or the fraction of aggregate-rich cells averaged over multiple fields of view.

在一些实施例中,拟合效果参数为R平方,所述第二阈值为0.2-1.0。具体说来,参见图4(d)所示的1777多种小分子化合物的R平方(拟合效果参数的示例)和log2药物效果(富含聚集物细胞分数的衰减比率的衍生参数示例)的散点图。具体说来,所述第一预定条件为衰减比率达到0.2且按照衰减比率大小顺序排列的前预定数量个,所述衰减比率定义为利用拟合的指数模型计算的衰减后的富含聚集物细胞分数与初始的富含聚集物细胞分数的比率,达到该第一预定条件且拟合效果参数优于第二阈值的孔所分配的小分子化合物可以作为目标肿瘤病症的候选药物。In some embodiments, the fitting effect parameter is R square, and the second threshold is 0.2-1.0. Specifically, see the scatter plot of R square (example of fitting effect parameter) and log 2 drug effect (example of derivative parameter of attenuation ratio of aggregate-rich cell fraction) of more than 1777 small molecule compounds shown in Figure 4 (d). Specifically, the first predetermined condition is the first predetermined number of attenuation ratios reaching 0.2 and arranged in order of attenuation ratio, and the attenuation ratio is defined as the ratio of the attenuated aggregate-rich cell fraction calculated using the fitted exponential model to the initial aggregate-rich cell fraction. The small molecule compound assigned to the well that meets the first predetermined condition and the fitting effect parameter is better than the second threshold can be used as a candidate drug for the target tumor condition.

例如,基于图4(d)的散点图,可以提取出144个R平方大于0.2(拟合效果较好)且药物效应小于0.5(药物作用下的富含液滴的细胞分数的衰减度超过50%)的区域所对应的小分子化合物,作为进一步研究的候选化合物。For example, based on the scatter plot in Figure 4(d), 144 small molecule compounds corresponding to the regions with R-square greater than 0.2 (good fitting effect) and drug effect less than 0.5 (the attenuation of the droplet-rich cell fraction under the action of the drug exceeds 50%) can be extracted as candidate compounds for further study.

在手动检查细胞状态和治疗后的液滴数量后,可以在144种候选药物中选择按照衰减比率大小顺序排列的前10种进行重新测试(参见图4(e))。在一些实施例中,还包括:人为或者自动地识别各个孔中的细胞的存活状况,确定存活状况劣于预定存活条件的孔;将向该孔分配到的小分子化合物从目标肿瘤病症的候选药物中排除。After manually checking the cell status and the number of droplets after treatment, the top 10 drugs ranked in order of attenuation ratio among the 144 candidate drugs can be selected for retesting (see FIG. 4( e)). In some embodiments, the method further includes: manually or automatically identifying the survival status of cells in each well, determining the wells whose survival status is inferior to the predetermined survival condition; and excluding the small molecule compound assigned to the well from the candidate drugs for the target tumor disease.

例如,自动地识别各个孔中的细胞的存活状况具体包括:利用图像处理算法,确定细胞的形态和细胞数量的减少率中的至少一种;在所识别的细胞的形态相较球形的偏离度高于偏离度阈值和/或细胞数量的减少率高于减少率阈值的情况下,将细胞识别为存活状况劣于预定存活条件。如此,可以通过人为筛选或自动筛选,排除有毒性的小分子化合物。For example, automatically identifying the survival status of cells in each well specifically includes: using an image processing algorithm to determine at least one of the cell morphology and the rate of reduction of the number of cells; when the deviation of the identified cell morphology from the spherical shape is higher than a deviation threshold and/or the rate of reduction of the number of cells is higher than a reduction threshold, identifying the cell as having a survival status worse than a predetermined survival condition. In this way, toxic small molecule compounds can be excluded through manual or automatic screening.

图4(e)所示的前10种小分子化合物中,八种化合物的聚集物数量明显减少(参见图11(a)-图11(j))。其中,包括LY2835219(也称为甲磺酸阿贝莫西利),它是一种选择性细胞周期素依赖性激酶(CDK)抑制剂,靶向细胞周期素D1/CDK4和细胞周期素D1/CDK6细胞周期途径。此外,LY2835219是一种有效的泛PIM激酶抑制剂CX-6258,以及两种表皮生长因子受体抑制剂EKB-569和AZD-9291,显示了有效的聚集物溶解(参见图11(a)、图11(i)和图11(b))。此外,一些命中化合物,如UNC 0631,可能通过触发细胞死亡来减少液滴数量(参见图11(h))。总之,使用DropScan,能够以一次接近2000的小分子化合物的筛选通量,识别出一组具有不同有效性的聚集物调节化合物。Of the top 10 small molecule compounds shown in Figure 4(e), eight compounds showed a significant reduction in the number of aggregates (see Figures 11(a)-11(j)). Among them, LY2835219 (also known as abetamoxiclib mesylate) is included, which is a selective cyclin-dependent kinase (CDK) inhibitor that targets the cyclin D1/CDK4 and cyclin D1/CDK6 cell cycle pathways. In addition, LY2835219, a potent pan-PIM kinase inhibitor CX-6258, and two epidermal growth factor receptor inhibitors EKB-569 and AZD-9291 showed effective aggregate dissolution (see Figures 11(a), 11(i), and 11(b)). In addition, some hit compounds, such as UNC 0631, may reduce the number of droplets by triggering cell death (see Figure 11(h)). In summary, using DropScan, we were able to identify a panel of aggregation-modulating compounds with varying effectiveness at a screening throughput of nearly 2,000 small molecules at a time.

目标肿瘤病症可以为与FET-ETS融合蛋白相关的各种肿瘤病症,包括尤文氏肉瘤(与EWS-FLI1融合蛋白相关联)、间皮细胞瘤(与FUS-ATF1融合蛋白相关联)、恶性黑色素瘤(与EWS-ATF1融合蛋白相关联)、促纤维增生性小圆细胞肿瘤(与EWS-ERG融合蛋白相关联)和白血病(与FUS-ERG融合蛋白相关联)的至少一种。相应地,在步骤101中,也可以采用各种FET-ETS融合蛋白作为靶融合蛋白,相应地,通过筛选方法得到的目标肿瘤病症的候选药物包括如下的至少一种:The target tumor disorder can be various tumor disorders associated with FET-ETS fusion proteins, including at least one of Ewing's sarcoma (associated with EWS-FLI1 fusion protein), mesothelioma (associated with FUS-ATF1 fusion protein), malignant melanoma (associated with EWS-ATF1 fusion protein), desmoplastic small round cell tumor (associated with EWS-ERG fusion protein) and leukemia (associated with FUS-ERG fusion protein). Accordingly, in step 101, various FET-ETS fusion proteins can also be used as target fusion proteins. Accordingly, the candidate drugs for the target tumor disorder obtained by the screening method include at least one of the following:

LY2835219(甲磺酸阿贝莫西利),其结构式为 LY2835219 (Abetamoxiclib mesylate), the structural formula of which is

CHIR-124,其结构式为 CHIR-124, whose structural formula is

EMD-1214063,其结构式为 EMD-1214063, whose structural formula is

CX-6258,其结构式为 CX-6258, whose structural formula is

Pelitinib(EKB-569),其结构式为 Pelitinib (EKB-569), whose structural formula is

AZD-9291,其结构式为 AZD-9291, whose structural formula is

GDC-0941,其结构式为 GDC-0941, whose structural formula is

SB743921,其结构式为 SB743921, whose structural formula is

这也是图11(a)-图11(j)中呈现出的使得聚集物数量明显减少的八种小分子化合物,如上所述,图11(a)-图11(j)是通过使用FET-ETS融合蛋白(例如使用U2OS/mCherry-FUS-ERGmut细胞)进行筛选所得到的候选药物。These are also the eight small molecule compounds shown in Figures 11(a) to 11(j) that significantly reduce the number of aggregates. As mentioned above, Figures 11(a) to 11(j) are candidate drugs obtained by screening using FET-ETS fusion proteins (for example, using U2OS/mCherry-FUS-ERGmut cells).

可以得出结论:LY2835219在命中化合物中排名第一(参见图4(e)),LY2835219有效溶解了表达尤因肉瘤融合物的报告细胞系中的聚集物,并部分挽救了靶基因的异常表达,可以作为FET融合相关的尤因肉瘤的候选药物。It can be concluded that LY2835219 ranks first among the hit compounds (see Figure 4(e)). LY2835219 effectively dissolves aggregates in reporter cell lines expressing Ewing sarcoma fusions and partially rescues the abnormal expression of target genes. It can be used as a candidate drug for FET fusion-related Ewing sarcoma.

LY2835219的治疗机制Therapeutic mechanism of LY2835219

考虑到LY2835219是一种CDK4/6抑制剂,在小分子库中检索了其他CDK4/6抑制剂,并发现了四种以上的抑制剂。其中,purvalanol B、palbociclib isesionate和LEE011未能溶解聚集物,而ON123300导致大量细胞死亡(参见图12)。Considering that LY2835219 is a CDK4/6 inhibitor, other CDK4/6 inhibitors were searched in the small molecule library and more than four inhibitors were found. Among them, purvalanol B, palbociclib isesionate and LEE011 failed to dissolve aggregates, while ON123300 caused massive cell death (see Figure 12).

这些数据表明,LY2835219不依赖于CDK4//6抑制剂活性来诱导聚集物溶解。接下来,在1至10μM的浓度范围内测试了LY2835219。聚集物量随时间呈剂量依赖性下降(参见图13(b)以及图14(a)-图14(c))。These data indicate that LY2835219 does not rely on CDK4//6 inhibitor activity to induce aggregate dissolution. Next, LY2835219 was tested in a concentration range of 1 to 10 μM. The amount of aggregates decreased in a dose-dependent manner over time (see Figure 13 (b) and Figure 14 (a)-Figure 14 (c)).

可以注意到,经LY2835219处理的细胞含有许多空泡状结构(参见图13(a))。之前的一项研究报告称,用LY2835219处理过的细胞中,溶酶体膨胀形成液泡结构。为了证实这一点,在LY2835219处理前后用溶酶体追踪法将报告细胞系染色为绿色。与之前的研究一致,液泡结构与溶酶体共定位(参见图13(c),溶酶体的强度显著增强(见图13(c)和图13(d))。这些结果与LY2835219处理导致溶酶体酸化的概念一致。It can be noted that cells treated with LY2835219 contain many vacuolar structures (see Figure 13(a)). A previous study reported that lysosomes swell to form vacuolar structures in cells treated with LY2835219. To confirm this, the reporter cell line was stained green using the lysosome tracking method before and after LY2835219 treatment. Consistent with the previous study, vacuolar structures co-localized with lysosomes (see Figure 13(c)), and the intensity of lysosomes was significantly enhanced (see Figure 13(c) and Figure 13(d)). These results are consistent with the concept that LY2835219 treatment leads to lysosomal acidification.

此外,U2OS细胞对LY2835219的反应也显示出溶酶体强度显著增强(图14(b))。因此,将溶酶体的特异性抑制剂-巴弗洛霉素A1(Baf-A1)与LY2835219共同添加,以评估Baf-A1是否降低LY28305219的活性。在使用LY2835219或Baf-A1单独处理或两者同时处理期间对细胞进行高内涵成像,并注意到在同时使用Baf-A1和LY2875219培养的细胞中没有形成空泡(参见图13(e))。此外,Baf-A1显著降低了LY2835219的溶解速度(参见图13(e))。In addition, U2OS cells also showed a significant increase in lysosomal intensity in response to LY2835219 (Figure 14(b)). Therefore, a specific inhibitor of lysosomes, bafilomycin A1 (Baf-A1), was co-added with LY2835219 to assess whether Baf-A1 reduces the activity of LY28305219. High-content imaging of cells was performed during treatment with LY2835219 or Baf-A1 alone or both, and it was noted that no vacuoles were formed in cells cultured with Baf-A1 and LY2875219 (see Figure 13(e)). In addition, Baf-A1 significantly reduced the dissolution rate of LY2835219 (see Figure 13(e)).

进一步地,将溶酶体报告基因YFP-LAMP1(溶酶体相关膜蛋白1)导入融合细胞系,并在小分子处理下进行长时间成像。在本试验中,发现在小分子处理之前,mCherry-FUS-ERGmut的聚集物与溶酶体高度共定位。LY2835219处理增加了这种共定位,溶解了聚集物,并显著增大了溶酶体(参见图13(f))。综上所述,这些数据表明,LY2835219可能通过激活溶酶体酸化,部分溶解细胞中的融合蛋白聚集物。Further, the lysosomal reporter gene YFP-LAMP1 (lysosomal associated membrane protein 1) was introduced into the fusion cell line and imaged for a long time under small molecule treatment. In this experiment, it was found that before small molecule treatment, the aggregates of mCherry-FUS-ERGmut were highly colocalized with lysosomes. LY2835219 treatment increased this colocalization, dissolved the aggregates, and significantly enlarged the lysosomes (see Figure 13 (f)). In summary, these data indicate that LY2835219 may partially dissolve the fusion protein aggregates in the cell by activating lysosomal acidification.

接下来,测试了LY2835219对许多相分离倾向蛋白:EWS-FLI1、EWS-FLI1mut、NUP98-HOXA9mut、NUP98 HOXD13mut、FUS的N端(aa 1~212)、NUP98的N端(aa1~515)和全长TDP43产生的聚集物的影响。LY2835219处理导致所有四种PS-DBD融合聚集物(图13(g))溶解,但三种非融合蛋白(图13(h))没有溶解。有趣的是,LY2835219在溶解聚集物方面表现出对PS-DBD融合蛋白的特异性。Next, we tested the effect of LY2835219 on aggregates generated by a number of phase separation prone proteins: EWS-FLI1, EWS-FLI1mut, NUP98-HOXA9mut, NUP98 HOXD13mut, the N terminus of FUS (aa 1-212), the N terminus of NUP98 (aa1-515), and full-length TDP43. LY2835219 treatment resulted in the dissolution of all four PS-DBD fusion aggregates (Figure 13(g)), but not the three non-fusion proteins (Figure 13(h)). Interestingly, LY2835219 showed specificity for PS-DBD fusion proteins in dissolving aggregates.

除了能够有效溶解FET-ETS液滴以外,发明人发现,LY2835219介导的液滴溶解能够逆转融合物靶基因的异常表达。In addition to being able to effectively dissolve FET-ETS droplets, the inventors found that LY2835219-mediated droplet dissolution can reverse the aberrant expression of the fusion target gene.

EWS-FLI1融合显著增强了相对荧光素酶的活性,在荧光素酶分析中,EWS-FLI1比FUS-ERG更有效。因此,为了进一步进行功能测试,使用EWS-FLI1和LY2835219处理监测荧光素酶活性。从而发现,相对荧光素酶活性随时间而降低,在处理12小时后降低50%(图15(a))。The EWS-FLI1 fusion significantly enhanced the relative luciferase activity, and in the luciferase assay, EWS-FLI1 was more effective than FUS-ERG. Therefore, for further functional testing, luciferase activity was monitored using EWS-FLI1 and LY2835219 treatment. It was found that the relative luciferase activity decreased over time and decreased by 50% after 12 hours of treatment (Figure 15(a)).

在用LY2835219治疗前后,对U2OS/mCherry-ESW-FLI1细胞进行了RNA测序,来验证LY2835219治疗是否会降低EWS-FLI1靶基因的表达。EWS-FLI1的假定靶基因是从先前发布的转录组数据集中提取的。简而言之,在EWS-FLI过表达细胞中表达水平同时显著高于FLI1过表达细胞和对照细胞的356个基因被视为EWS-FLI1靶基因(图15(b))。通过比较LY2835219处理前后356个EWS-FLI1靶基因的表达水平,发现处理3小时后有26个靶基因显著下调,只有2个靶基因上调(图15(c))。进一步地,计算了LY2835219处理前后表达EWS-FLI1的细胞与野生型细胞的相对表达水平。与上述发现一致,LY2835219处理后EWS-FLI1靶基因的相对表达水平显著降低(参见图15(d))。此外,发明人在尤因肉瘤细胞系A-673上重复了上述实验,并得出了一致的结论。RNA sequencing was performed on U2OS/mCherry-ESW-FLI1 cells before and after treatment with LY2835219 to verify whether LY2835219 treatment would reduce the expression of EWS-FLI1 target genes. Putative target genes of EWS-FLI1 were extracted from a previously published transcriptome dataset. In short, 356 genes whose expression levels in EWS-FLI overexpressing cells were significantly higher than those in FLI1 overexpressing cells and control cells were considered to be EWS-FLI1 target genes (Figure 15(b)). By comparing the expression levels of 356 EWS-FLI1 target genes before and after LY2835219 treatment, it was found that 26 target genes were significantly downregulated after 3 hours of treatment, and only 2 target genes were upregulated (Figure 15(c)). Further, the relative expression levels of cells expressing EWS-FLI1 and wild-type cells before and after LY2835219 treatment were calculated. Consistent with the above findings, the relative expression level of the EWS-FLI1 target gene was significantly reduced after LY2835219 treatment (see FIG. 15( d )). In addition, the inventors repeated the above experiment on the Ewing sarcoma cell line A-673 and reached a consistent conclusion.

进一步地,还进行了实时荧光定量聚合酶链式反应(RT-qPCR),以验证LY2835219处理后EWS-FLI1靶基因的下调模式。LY2835219处理后,大多数受试基因的表达水平降低,且此过程具有时间依赖性(参见图15(e))。总的来说,荧光素酶、RT-qPCR和RNA测序数据联合证实,LY2835219确实可以挽救融合癌蛋白诱导的异常基因表达,是治疗这些FET融合相关尤因肉瘤或其他癌症的候选药物。Furthermore, real-time fluorescence quantitative polymerase chain reaction (RT-qPCR) was performed to verify the downregulation pattern of EWS-FLI1 target genes after LY2835219 treatment. After LY2835219 treatment, the expression levels of most tested genes decreased, and this process was time-dependent (see Figure 15(e)). Overall, the luciferase, RT-qPCR and RNA sequencing data jointly confirmed that LY2835219 can indeed rescue the abnormal gene expression induced by fusion oncoproteins and is a candidate drug for the treatment of these FET fusion-related Ewing sarcomas or other cancers.

在许多癌细胞中,基因组和转录组编码具有PS和DBD的融合蛋白。这些融合蛋白可以在染色质上进行相分离,作为新的转录因子,改变下游靶基因的表达。异常转录组通常与癌症有关。相分离调节药物可以干扰聚集物并逆转异常的转录模式。这些药物有望作为融合物相关癌症的治疗药物,作用机制可以参见图15(f):正常情况下,基因正常表达;当发生染色体重排以产生框内PS-BDB融合时,目标DNA位点的相分离会导致异常的基因表达,这与癌症相关,而溶解相分离聚集物的小分子可以挽救异常表达。In many cancer cells, the genome and transcriptome encode fusion proteins with PS and DBD. These fusion proteins can phase separate on chromatin and act as new transcription factors to alter the expression of downstream target genes. Abnormal transcriptomes are often associated with cancer. Phase separation regulating drugs can interfere with aggregates and reverse abnormal transcription patterns. These drugs are expected to be used as therapeutic drugs for fusion-related cancers. The mechanism of action can be seen in Figure 15(f): Under normal circumstances, genes are expressed normally; when chromosome rearrangements occur to produce in-frame PS-BDB fusions, phase separation of the target DNA site leads to abnormal gene expression, which is associated with cancer, and small molecules that dissolve phase-separated aggregates can rescue abnormal expression.

计算机程序产品Computer program product

根据本申请的实施例,提供一种计算机程序产品,其包含能够存储于和/或下载到非暂时性存储介质上的计算机可执行指令。According to an embodiment of the present application, a computer program product is provided, which includes computer executable instructions that can be stored and/or downloaded to a non-transitory storage medium.

所述计算机可执行指令由处理器执行时,执行根据本申请各个实施例的筛选肿瘤病症的药物的方法。所述处理器可以是包括一个或多个通用处理设备(诸如微处理器、中央处理单元(CPU)、图形处理单元(GPU)等)的处理设备。更具体地说,处理器可以是复杂指令集计算(CISC)微处理器、精简指令集计算(RISC)微处理器、超长指令字(VLIW)微处理器、运行其他指令集的处理器或运行指令集的组合的处理器。处理器也可以是一个或多个专用处理设备,例如专用集成电路(ASIC)、现场可编程门阵列(FPGA)、数字信号处理器(DSP)、片上系统(SoC)等等。如本领域技术人员将理解的,在一些实施例中,处理器可以是专用处理器,而不是通用处理器。When the computer executable instruction is executed by the processor, the method for screening the medicine of tumor disease according to each embodiment of the present application is executed. The processor can be a processing device including one or more general processing devices (such as a microprocessor, a central processing unit (CPU), a graphics processing unit (GPU), etc.). More specifically, the processor can be a complex instruction set computing (CISC) microprocessor, a reduced instruction set computing (RISC) microprocessor, a very long instruction word (VLIW) microprocessor, a processor running other instruction sets, or a processor running a combination of instruction sets. The processor can also be one or more special processing devices, such as an application specific integrated circuit (ASIC), a field programmable gate array (FPGA), a digital signal processor (DSP), a system on chip (SoC), etc. As will be understood by those skilled in the art, in some embodiments, the processor can be a special processor, rather than a general processor.

以下仅对该方法的示例进行简单说明,须知根据本申请各个实施例的筛选肿瘤病症的药物的方法的各种示例均可以结合于此,在此不赘述。The following is a brief description of an example of the method. It should be noted that various examples of the method for screening drugs for tumor diseases according to various embodiments of the present application can be combined here and will not be repeated here.

例如,处理器执行的筛选肿瘤病症的药物的方法可以包括如下步骤。For example, a method for screening drugs for tumor conditions executed by a processor may include the following steps.

获取包括孔阵列的孔板的各个孔的数个视野的时间序列图像,所述时间序列图像通过在执行如下步骤后对各个孔以数个不同的视野在预定时间段内持续拍摄而生成(这些步骤可以不由处理器执行,而是表示获取图像时的孔的操作状态):将包含靶融合蛋白的细胞接种到包括孔阵列的孔板中,其中,所述细胞包含编码靶融合蛋白的核酸分子,所述靶融合蛋白与目标肿瘤病症相关联,包含相分离倾向结构域(PS)和DNA结合结构域(DBD)且进一步包含标记分子,以便能够形成可供成像识别的聚集物;将多种小分子化合物分配到各个孔中。Acquire time series images of several fields of view of each well of a well plate including a well array, wherein the time series images are generated by continuously photographing each well with several different fields of view within a predetermined time period after executing the following steps (these steps may not be executed by a processor, but represent the operating state of the well when the image is acquired): inoculate cells containing a target fusion protein into a well plate including a well array, wherein the cells contain a nucleic acid molecule encoding the target fusion protein, the target fusion protein is associated with a target tumor disorder, contains a phase separation prone domain (PS) and a DNA binding domain (DBD) and further contains a marker molecule so as to be able to form aggregates that can be identified by imaging; distribute a plurality of small molecule compounds into each well.

基于各个孔的至少一个视野的时间序列图像,来分析各个孔的细胞内聚集物的量衰减状况。Based on the time series images of at least one field of view of each well, the amount decay status of the intracellular aggregates in each well is analyzed.

筛选出细胞内聚集物的量的衰减状况满足预定条件的孔,并确定向所筛选出的孔分配的小分子化合物作为目标肿瘤病症的候选药物。Wells whose attenuation conditions of the amount of intracellular aggregates meet predetermined conditions are screened, and small molecule compounds assigned to the screened wells are determined as candidate drugs for the target tumor disorder.

在一些实施例中,基于各个孔的至少一个视野的时间序列图像,来分析各个孔的细胞内聚集物的量衰减状况具体包括,对于各个孔:对于每个视野的所述时间序列图像的各个采样时间点的图像,执行细胞和聚集物的分割,并确定各个细胞的聚集物量;对于每个视野的各个时间点的图像,确定聚集物量超过富集聚集物阈值的细胞所占比例,并基于视野求平均后作为该孔在各个采样时间点的富含聚集物细胞分数,从而得到该孔在所述预定时间段内的富含聚集物细胞分数的分布曲线;基于该孔在所述预定时间段内的富含聚集物细胞分数的分布曲线,拟合指数模型;基于拟合的指数模型,来计算拟合效果参数和富含聚集物细胞分数的衰减比率,其中,所述预定条件包括富含聚集物细胞分数的衰减比率达到第一预定条件且拟合效果参数优于第二阈值。In some embodiments, analyzing the attenuation of the amount of intracellular aggregates in each well based on the time series images of at least one field of view of each well specifically includes: for each well: for the images of each sampling time point of the time series images of each field of view, performing cell and aggregate segmentation, and determining the amount of aggregates of each cell; for the images of each time point of each field of view, determining the proportion of cells whose aggregate amount exceeds the enriched aggregate threshold, and averaging based on the field of view as the aggregate-rich cell fraction of the well at each sampling time point, thereby obtaining a distribution curve of the aggregate-rich cell fraction of the well within the predetermined time period; fitting an exponential model based on the distribution curve of the aggregate-rich cell fraction of the well within the predetermined time period; and calculating the fitting effect parameter and the attenuation ratio of the aggregate-rich cell fraction based on the fitted exponential model, wherein the predetermined condition includes that the attenuation ratio of the aggregate-rich cell fraction reaches a first predetermined condition and the fitting effect parameter is better than a second threshold.

在一些实施例中,拟合效果参数为R平方,且所述第二阈值为0.2-1.0。In some embodiments, the fitting effect parameter is R squared, and the second threshold is 0.2-1.0.

在一些实施例中,所述第一预定条件为衰减比率达到0.2且按照衰减比率大小顺序排列的前预定数量个,所述衰减比率定义为利用拟合的指数模型计算的衰减后的富含聚集物细胞分数与初始的富含聚集物细胞分数的比率。In some embodiments, the first predetermined condition is the first predetermined number of cells whose attenuation ratio reaches 0.2 and are arranged in order of attenuation ratio, and the attenuation ratio is defined as the ratio of the aggregate-rich cell fraction after attenuation calculated using a fitted exponential model to the initial aggregate-rich cell fraction.

在一些实施例中,所述筛选肿瘤病症的药物的方法还包括:自动地识别各个孔中的细胞的存活状况,确定存活状况劣于预定存活条件的孔;将向该孔分配到的小分子化合物从目标肿瘤病症的候选药物中排除。In some embodiments, the method for screening drugs for tumor conditions further comprises: automatically identifying the survival status of cells in each well, determining wells whose survival status is worse than a predetermined survival condition; and excluding the small molecule compound assigned to the well from candidate drugs for the target tumor condition.

在一些实施例中,自动地识别各个孔中的细胞的存活状况具体包括:利用图像处理算法,确定细胞的形态和细胞数量的减少率中的至少一种;在所识别的细胞的形态相较球形的偏离度高于偏离度阈值和/或细胞数量的减少率高于减少率阈值的情况下,将细胞识别为存活状况劣于预定存活条件。In some embodiments, automatically identifying the survival status of cells in each well specifically includes: using an image processing algorithm to determine at least one of the cell morphology and the rate of reduction in cell number; when the deviation of the identified cell morphology from a spherical shape is higher than a deviation threshold and/or the rate of reduction in cell number is higher than a reduction rate threshold, identifying the cell as having a survival status worse than a predetermined survival condition.

在一些实施例中,所述指数模型由如下的公式(1)来定义:In some embodiments, the exponential model is defined by the following formula (1):

其中c0和c1是指数模型中需要回归的常数,t是小分子化合物的作用时间,Y(t)是每个孔中的富含聚集物细胞分数。where c0 and c1 are constants to be regressed in the exponential model, t is the exposure time of the small molecule compound, and Y(t) is the fraction of aggregate-rich cells in each well.

在一些实施例中,对各个图像执行细胞和聚集物的分割具体包括:利用第一算法来分割细胞;识别图像中的点状结构,并以聚集物的代表尺寸作为目标尺寸执行增强;利用第二算法,分割出强度值比背景的代表强度值高出第三阈值的聚集物。In some embodiments, performing segmentation of cells and aggregates on each image specifically includes: using a first algorithm to segment cells; identifying point structures in the image and performing enhancement with the representative size of the aggregates as the target size; using a second algorithm to segment aggregates whose intensity values are higher than the representative intensity value of the background by a third threshold.

在一些实施例中,所述第一算法自动地确定用于分割细胞前景和细胞背景的强度阈值且据此进行细胞分割,所述第二算法自动地确定聚集物背景的鲁棒性的代表强度值,所述第三阈值为背景的代表强度值的标准偏差的3-5倍。In some embodiments, the first algorithm automatically determines an intensity threshold for segmenting cell foreground and cell background and performs cell segmentation accordingly, the second algorithm automatically determines a representative intensity value for the robustness of the aggregate background, and the third threshold is 3-5 times the standard deviation of the representative intensity value of the background.

在一些实施例中,所述第一算法包括Otsu算法,所述第二算法包括鲁棒背景算法。In some embodiments, the first algorithm comprises an Otsu algorithm and the second algorithm comprises a robust background algorithm.

高内涵成像分析系统High-content imaging analysis system

根据本申请的实施例,提供一种高内涵成像分析系统,包括:自动高速显微成像组件,其配置为在如下步骤执行后对包括孔阵列的孔板的各个孔以数个不同的视野在预定时间段内持续拍摄,而生成各个孔的数个视野的时间序列图像:将包含靶融合蛋白的细胞接种到包括孔阵列的孔板中,其中,所述细胞包含编码靶融合蛋白的核酸分子,所述靶融合蛋白与目标肿瘤病症相关联,包含相分离倾向结构域(PS)和DNA结合结构域(DBD)且进一步包含标记分子,以便能够形成可供成像识别的聚集物;将多种小分子化合物分配到各个孔中;以及至少一个处理器,其配置为执行根据本申请各个实施例的筛选肿瘤病症的药物的方法。According to an embodiment of the present application, a high-content imaging analysis system is provided, comprising: an automatic high-speed microscopic imaging component, which is configured to continuously photograph each well of a well plate including a well array with several different fields of view within a predetermined time period to generate time series images of several fields of view of each well after the following steps are performed: inoculating cells containing a target fusion protein into a well plate including a well array, wherein the cells contain a nucleic acid molecule encoding the target fusion protein, the target fusion protein is associated with a target tumor condition, contains a phase separation prone domain (PS) and a DNA binding domain (DBD) and further contains a marker molecule so as to be able to form aggregates that can be identified by imaging; dispensing a plurality of small molecule compounds into each well; and at least one processor, which is configured to execute a method for screening drugs for tumor conditions according to various embodiments of the present application.

根据本申请各个实施例的筛选肿瘤病症的药物的方法的各种示例均可以结合于此,在此不赘述。Various examples of methods for screening drugs for tumor diseases according to various embodiments of the present application can be combined here and will not be described in detail here.

靶融合蛋白的相分离状况的分析方法Method for analyzing the phase separation of target fusion protein

根据本申请的实施例,提供一种靶融合蛋白的相分离状况的分析方法。该分析方法可以包括如下步骤。According to an embodiment of the present application, a method for analyzing the phase separation status of a target fusion protein is provided. The analysis method may include the following steps.

采集包含所述靶融合蛋白的细胞的图像,其中,所述细胞包含编码靶融合蛋白的核酸分子,所述靶融合蛋白进一步包含标记分子以便能够形成可供成像识别的聚集物。An image of a cell containing the target fusion protein is acquired, wherein the cell contains a nucleic acid molecule encoding the target fusion protein, and the target fusion protein further contains a labeling molecule so as to form aggregates that can be identified by imaging.

识别所述图像中的细胞内聚集物的量,并根据所识别的聚集物的量来确定所述PS-DBD融合蛋白在体内和/或体外的相分离状况,聚集物的量越大,则相分离状况越严重。The amount of intracellular aggregates in the image is identified, and the phase separation status of the PS-DBD fusion protein in vivo and/or in vitro is determined based on the amount of the identified aggregates. The larger the amount of aggregates, the more serious the phase separation status.

在一些实施例中,所述聚集物呈现为液滴或者凝胶状。In some embodiments, the aggregates are in the form of droplets or gels.

在一些实施例中,所述靶融合蛋白的DBD包含突变,使得所形成的聚集物的成像识别性相较DBD不包含突变所形成的聚集物的成像识别性更显著。In some embodiments, the DBD of the target fusion protein comprises mutations such that the imaging recognition of the aggregates formed is more significant than the imaging recognition of aggregates formed when the DBD does not comprise the mutations.

关于靶融合蛋白的相分离状况及与聚集物的关联机制,在上文“PS-DBD融合蛋白 与肿瘤病症状况的关联机制:相分离、聚集物的生成及异常基因表达”部分中已经进行了详细说明,在此不赘述。The phase separation status of the target fusion protein and its association mechanism with aggregates have been described in detail in the above section "The association mechanism between PS-DBD fusion protein and tumor disease status: phase separation, aggregate formation and abnormal gene expression " and will not be repeated here.

在一些实施例中,所述突变为点突变,且引入待分析的靶融合蛋白的DBD中具有与DNA结合活性的位点。关于突变在DBD中的引入,在上文“在靶融合蛋白的DBD中引入突变”部分中已经进行了详细说明,在此不赘述。In some embodiments, the mutation is a point mutation, and is introduced into a site having DNA binding activity in the DBD of the target fusion protein to be analyzed. The introduction of the mutation into the DBD has been described in detail in the above section " Introducing mutations into the DBD of the target fusion protein ", which will not be repeated here.

在一些实施例中,待分析的靶融合蛋白的DBD中具有与DNA结合活性的位点根据Swiss-Prot数据库中有关DNA结合活性的信息来确定。具体说来,如果Swiss-Prot数据库中报道目标蛋白的该结构域所在区域中有一个位点的突变影响其与DNA结合的能力,或者其它蛋白的同种结构域中有一个位点的突变影响其与DNA结合的能力,且该位点可以被序列比对到目标蛋白,这样的位点即视为DBD中具有与DNA结合活性的位点。In some embodiments, the site with DNA binding activity in the DBD of the target fusion protein to be analyzed is determined based on the information on DNA binding activity in the Swiss-Prot database. Specifically, if the Swiss-Prot database reports that a mutation at a site in the region where the domain of the target protein is located affects its ability to bind to DNA, or a mutation at a site in the same domain of other proteins affects its ability to bind to DNA, and the site can be sequence aligned to the target protein, such a site is considered to be a site with DNA binding activity in the DBD.

实施例Example

在下文中,将通过实施例详细描述本申请。然而,在此提供的实施例仅用于说明目的,并不用于限制本申请。Hereinafter, the present application will be described in detail by way of examples. However, the examples provided herein are only for illustrative purposes and are not intended to limit the present application.

下述实施例所使用的实验方法如无特殊说明,均为常规方法。Unless otherwise specified, the experimental methods used in the following examples are all conventional methods.

下述实施例所用的材料、试剂等,如无特殊说明,均可从商业途径得到。Unless otherwise specified, the materials and reagents used in the following examples can be obtained from commercial sources.

实施例中涉及的科学仪器如下:The scientific instruments involved in the embodiments are as follows:

移液器(EPPENDORF);Pipette (EPPENDORF);

DNA电泳装置(六一生物公司);DNA electrophoresis device (Six One Biological Company);

DNA凝胶电泳成像系统(六一生物公司);DNA gel electrophoresis imaging system (Liuyi Biotechnology Co., Ltd.);

高温高压灭菌锅(ZEALWAY);High temperature and high pressure sterilizer (ZEALWAY);

电子分析天平(上海佑科公司);Electronic analytical balance (Shanghai Youke Company);

紫外分光光度计(北京君意东方公司);UV spectrophotometer (Beijing Junyi Dongfang Company);

Nanodrop(ThermoFisher Scientific);Nanodrop (ThermoFisher Scientific);

PCR核酸扩增仪(BIO-RAD);PCR nucleic acid amplifier (BIO-RAD);

实时荧光定量PCR仪(BIO-RAD)Real-time fluorescence quantitative PCR instrument (BIO-RAD)

电热恒温水浴箱(北京永光明公司);Electric constant temperature water bath (Beijing Yongguangming Company);

金属浴(大龙兴创公司);Metal bath (Dalong Xingchuang Company);

超净工作台(上海智城公司);Clean bench (Shanghai Zhicheng Company);

台式冷冻离心机(BECKMAN);Tabletop refrigerated centrifuge (BECKMAN);

低温冷冻离心机(EPPENDORF);Low temperature refrigerated centrifuge (EPPENDORF);

超低温冰箱(ThermoFisher Scientific);Ultra-low temperature freezer (ThermoFisher Scientific);

蛋白质电泳装置(BIO-RAD);protein electrophoresis apparatus (BIO-RAD);

蛋白半干转膜仪(BIO-RAD);Protein semi-dry transfer instrument (BIO-RAD);

生物安全柜(ThermoFisher Scientific);Biological safety cabinet (ThermoFisher Scientific);

流式细胞分选仪(BD Biosciences);Flow cytometer (BD Biosciences);

细胞培养箱(ThermoFisher Scientific);Cell culture incubator (ThermoFisher Scientific);

细菌培养箱(知楚仪器有限公司);Bacterial incubator (Zhichu Instrument Co., Ltd.);

超声细胞破碎仪(ASONE);Ultrasonic cell disruptor (ASONE);

高压细胞破碎仪(ATS);High pressure cell disruptor (ATS);

蛋白质纯化系统AKTA(GE Healthcare);Protein purification system AKTA (GE Healthcare);

镍离子柱亲和层析柱Ni-NTA resin(Genscript);Nickel ion affinity chromatography column Ni-NTA resin (Genscript);

HiTrap Q HP(GE Healthcare);HiTrap Q HP (GE Healthcare);

Superdex 200 Increase 10/300层析分子筛柱SD200(GE Healthcare);Superdex 200 Increase 10/300 chromatography molecular sieve column SD200 (GE Healthcare);

荧光倒置显微镜(NIKON);Fluorescence inverted microscope (NIKON);

高分辨共聚焦显微镜(NIKON A1R HD25);High-resolution confocal microscope (NIKON A1R HD25);

超分辨共聚焦显微镜A1/SIM/STORM(NIKON);Super-resolution confocal microscope A1/SIM/STORM (NIKON);

Echo550纳升级声波移液系统(Labcyte);Echo550 nanoliter acoustic pipetting system (Labcyte);

激光共聚焦高内涵成像显微镜Opera Phenix(PerkinElmer);Laser confocal high-content imaging microscope Opera Phenix (PerkinElmer);

多功能酶标仪EnVision(PerkinElmer)。Multifunctional microplate reader EnVision (PerkinElmer).

本实施例所涉及的试剂如下:The reagents involved in this embodiment are as follows:

FastPfu聚合酶(TransGen Biotech),dNTPs溶液(TransGen Biotech),限制性核酸内切酶Age I-HF,Spe I-HF,Nco I-HF,Xho I-HF(NEB),HiFi DNA AssemblyMaster Mix(NEB),琼脂糖(Biowest),DNA分子量Trans DNA Marker(TransGen Biotech),DNA电泳TAE缓冲液(普利莱),DNA电泳上样缓冲液(TransGen Biotech),质粒提取试剂盒(CWBIO),DNA琼脂糖凝胶回收试剂盒(CWBIO),大量DNA产物纯化试剂盒(TIANGENBIOTECH),4S Green核酸染料(Sangon Biotech)。FastPfu polymerase (TransGen Biotech), dNTPs solution (TransGen Biotech), restriction endonucleases Age I-HF, Spe I-HF, Nco I-HF, Xho I-HF (NEB), HiFi DNA Assembly Master Mix (NEB), agarose (Biowest), DNA molecular weight Trans DNA Marker (TransGen Biotech), DNA electrophoresis TAE buffer (Prile), DNA electrophoresis loading buffer (TransGen Biotech), plasmid extraction kit (CWBIO), DNA agarose gel recovery kit (CWBIO), large-scale DNA product purification kit (TIANGENBIOTECH), 4S Green nucleic acid dye (Sangon Biotech).

RNA提取试剂Trizol(ThermoFisher Scientific),反转录试剂One-StepgDNA Removal and cDNA Synthesis SuperMix(TransGen Biotech),qPCR试剂TransgenGreen qPCR SuperMix(TransGen Biotech)。RNA extraction reagent Trizol (ThermoFisher Scientific), reverse transcription reagent One-StepgDNA Removal and cDNA Synthesis SuperMix (TransGen Biotech), qPCR reagent Transgen Green qPCR SuperMix (TransGen Biotech).

NaCl(北京化工),MgCl2(国药集团),KCl(国药集团),磷酸钠(国药集团),盐酸(国药集团),Na2HPO4(国药集团),NaH2PO4(国药集团),磷酸钙(Sigma),Tris(Sigma),甘氨酸Glycine(Sigma),甘油Glycerol(Sigma),咪唑Imidazole(Sigma),HEPES(Sigma),乙醇(Sigma),IPTG(Inalco),EDTA(Sigma),醋酸铵(Sigma),醋酸钠(Amresco),冰醋酸(北京化工厂),异丙醇(北京化工厂),二硫苏糖醇DTT(Inaclo),麦芽糖(Amresco),盐酸胍(MPBiomedicals),3-吗啉丙磺酸MOPS(Solarbio),十二烷基磺酸钠SDS(Amresco),Ni-NTAResin(ThermoFisher Scientific),聚丙烯酰胺预制胶(Genscript),三色预染蛋白质分子量标准Marker(YEASEN),考马斯亮蓝染色液(Genscript),考马斯亮蓝脱色液(Genscript),脱脂奶粉(ThermoFisher Scientific),Tween-20(Amresco),Nonidet P40(Amresco),β-巯基乙醇(Amresco),PMSF蛋白酶抑制剂(Sigma),蛋白酶抑制剂混合物(Beyotime),磷酸缓冲液PBS(HyClone),考马斯亮蓝R250(MACKLIN)。NaCl (Beijing Chemical Industry), MgCl 2 (Sinopharm Group), KCl (Sinopharm Group), sodium phosphate (Sinopharm Group), hydrochloric acid (Sinopharm Group), Na 2 HPO 4 (Sinopharm Group), NaH 2 PO 4 (Sinopharm Group), calcium phosphate (Sigma), Tris (Sigma), glycine (Sigma), glycerol (Sigma), imidazole (Sigma), HEPES (Sigma), ethanol (Sigma), IPTG (Inalco), EDTA (Sigma), ammonium acetate (Sigma), sodium acetate (Amresco), glacial acetic acid (Beijing Chemical Plant), isopropanol (Beijing Chemical Plant), dithiothreitol DTT (Inaclo), maltose (Amresco), guanidine hydrochloride (MPBiomedicals), 3-morpholinepropanesulfonic acid MOPS (Solarbio), sodium dodecyl sulfate SDS (Amresco), Ni-NTA Resin (ThermoFisher Scientific), polyacrylamide precast gel (Genscript), three-color prestained protein molecular weight standard Marker (YEASEN), Coomassie brilliant blue staining solution (Genscript), Coomassie brilliant blue destaining solution (Genscript), skim milk powder (ThermoFisher Scientific), Tween-20 (Amresco), Nonidet P40 (Amresco), β-mercaptoethanol (Amresco), PMSF protease inhibitor (Sigma), protease inhibitor cocktail (Beyotime), phosphate buffered saline (PBS) (HyClone), Coomassie brilliant blue R250 (MACKLIN).

实施例1:稳定的细胞系结构的建立以及相关实验流程Example 1: Establishment of stable cell line structure and related experimental procedures

细胞培养:本文所涉及细胞HEK293T、Hela、U2OS和A-673均使用高糖DMEM培养基(HyClone,SH30243.01),加10%胎牛血清FBS(Gibco,10099-141),1%青霉素链霉素双抗(Gibco,15140122),在37℃恒温保湿培养箱中,5%CO2条件下培养。在进行细胞筛选和培养稳转细胞系时,培养基额外添加终浓度为1μg/ml的嘌呤霉素或对应抗生素。细胞生长至密度为85%时,使用胰蛋白酶Trypsin进行消化传代。所有表达OptoIDR结构的细胞的操作均在红灯照明下进行,以避免环境光照射。Cell culture: The cells HEK293T, Hela, U2OS and A-673 involved in this article all used high-glucose DMEM medium (HyClone, SH30243.01), added with 10% fetal bovine serum FBS (Gibco, 10099-141), 1% penicillin-streptomycin double antibody (Gibco, 15140122), and cultured in a constant temperature and moisturizing incubator at 37°C and 5% CO 2. When performing cell screening and culturing stable cell lines, the culture medium is additionally added with puromycin or corresponding antibiotics at a final concentration of 1 μg/ml. When the cells grow to a density of 85%, trypsin Trypsin is used for digestion and passage. All operations on cells expressing OptoIDR structures were performed under red light to avoid ambient light exposure.

本实施例所涉及的质粒构建原始骨架质粒如下The original backbone plasmid involved in the plasmid construction of this example is as follows

(1)pCMV-mCherry-Age I-FUS-ERG-Age I(1)pCMV-mCherry-Age I-FUS-ERG-Age I

在图17中示出了pCMV-mCherry-Age I-FUS-ERG-Age I的结构。如图17所示,骨架质粒pCMV-mCherry-Age I-FUS-ERG-Age I为慢病毒(Lenti-Virus)表达载体,原核细胞为氨苄青霉素抗性,真核细胞为嘌呤霉素抗性。目的基因可插入内切酶Age I线性化的载体中,载体上带有的mCherry荧光标签可以对目的蛋白进行示踪。该质粒可以用于包装慢病毒,构建稳转细胞系,也可瞬时转染检测目的蛋白表达状态。The structure of pCMV-mCherry-Age I-FUS-ERG-Age I is shown in Figure 17. As shown in Figure 17, the backbone plasmid pCMV-mCherry-Age I-FUS-ERG-Age I is a lentivirus expression vector, prokaryotic cells are ampicillin resistant, and eukaryotic cells are puromycin resistant. The target gene can be inserted into the vector linearized by endonuclease Age I, and the mCherry fluorescent label carried on the vector can trace the target protein. The plasmid can be used to package lentivirus, construct a stable cell line, and can also be transiently transfected to detect the expression status of the target protein.

pCMV-mCherry-Age I-FUS-ERG-Age I的全长序列如SEQ ID NO:3所示。The full-length sequence of pCMV-mCherry-Age I-FUS-ERG-Age I is shown in SEQ ID NO: 3.

(2)pHR-Spe I-FUS IDR-Spe I-mCherry-Cry2olig(2)pHR-Spe I-FUS IDR-Spe I-mCherry-Cry2olig

在图18中示出了pHR-Spe I-FUS IDR-Spe I-mCherry-Cry2olig的结构。如图18所示,骨架质粒pHR-Spe I-FUS IDR-Spe I-mCherry-Cry2olig改造自Addgene质粒#10122(Shin etal.,2017),为真核细胞表达载体。目的基因可以插入内切酶Spe I线性化的载体中,载体上带有的mCherry-Cry2olig元件可以用于OptoIDR实验,通过蓝光刺激,观测融合蛋白的状态评估目的蛋白是否有相分离潜能。The structure of pHR-Spe I-FUS IDR-Spe I-mCherry-Cry2olig is shown in Figure 18. As shown in Figure 18, the backbone plasmid pHR-Spe I-FUS IDR-Spe I-mCherry-Cry2olig is modified from Addgene plasmid #10122 (Shin et al., 2017) and is a eukaryotic cell expression vector. The target gene can be inserted into the vector linearized by endonuclease Spe I, and the mCherry-Cry2olig element carried on the vector can be used in the OptoIDR experiment. Through blue light stimulation, the state of the fusion protein is observed to evaluate whether the target protein has phase separation potential.

pHR-Spe I-FUS IDR-Spe I-mCherry-Cry2olig的全长序列如如SEQ ID NO:4所示。The full-length sequence of pHR-Spe I-FUS IDR-Spe I-mCherry-Cry2olig is shown in SEQ ID NO:4.

(3)pETDuet载体(3) pETDuet vector

在图19中示出了pETDuet载体的结构。如图19所示,骨架质粒pETDuet(His tag-MBP-TEV)为大肠杆菌原核表达质粒,氨苄青霉素抗性,从Novagen公司购得。目的基因可以插入限制性内切酶Nco I和Xho I线性化的载体中,连接在重组烟草蚀纹病毒蛋白酶(Tobacco Etch Virus,TEV)序列之后。该质粒也可以进一步改造,在TEV蛋白酶位点后插入增强型绿色荧光蛋白(Enhanced Green Fluorescent Protein,EGFP),之后的目的基因连接在EGFP之后。质粒在大肠杆菌中经过异丙基-β-D-硫代半乳糖苷(IPTG)诱导表达后可得到融合蛋白,用于体外相分离实验。The structure of pETDuet vector is shown in Figure 19. As shown in Figure 19, the backbone plasmid pETDuet (His tag-MBP-TEV) is an Escherichia coli prokaryotic expression plasmid, ampicillin resistance, purchased from Novagen. The target gene can be inserted into the vector linearized with restriction endonucleases Nco I and Xho I, and connected after the recombinant tobacco etch virus protease (Tobacco Etch Virus, TEV) sequence. The plasmid can also be further transformed by inserting enhanced green fluorescent protein (Enhanced Green Fluorescent Protein, EGFP) after the TEV protease site, and the target gene is then connected after EGFP. The plasmid can obtain a fusion protein after induction expression by isopropyl-β-D-thiogalactoside (IPTG) in Escherichia coli for in vitro phase separation experiments.

(4)pRSFDual载体(4) pRSFDual vector

在图20中示出了pRSFDual载体的结构。如图20所示,骨架质粒pRSFDual(His tag-EGFP)在原始载体pRSFDual改造而来,在组氨酸标签His tag后连接了EGFP。质粒为大肠杆菌原核表达载体,卡那霉素抗性,从Novagen公司购得。目的基因可以插入限制性内切酶XhoI线性化的重组载体中,在大肠杆菌中经过IPTG诱导,表达与His tag-EGFP融合的目的蛋白,用于体外相分离研究。The structure of the pRSFDual vector is shown in Figure 20. As shown in Figure 20, the backbone plasmid pRSFDual (His tag-EGFP) is modified from the original vector pRSFDual, and EGFP is connected after the histidine tag His tag. The plasmid is an E. coli prokaryotic expression vector, kanamycin resistant, purchased from Novagen. The target gene can be inserted into the recombinant vector linearized with the restriction endonuclease XhoI, and the target protein fused with His tag-EGFP can be expressed in E. coli after IPTG induction for in vitro phase separation research.

编码融合蛋白的慢病毒转移载体的构建过程如下:The construction process of the lentiviral transfer vector encoding the fusion protein is as follows:

本研究所涉及到的慢病毒原始载体pCMV-mCherry-Age I-FUS-ERG-Age I以及pHR-Spe I-FUS IDR-Spe I-mCherry-Cry2olig均来自华大学李丕龙教授实验室。涉及的基因及其突变如EWS-FLI1/EWS-FLI1mut或者EWS IDR均由金唯智公司(已被安升达收购)合成。构建过程先用限制性内切酶Age I-HF(NEB,R3552S)线性化pCMV-mCherry-Age I-FUS-ERG-Age I载体,内切酶Spe I-HF(NEB,R3133S)线性化pHR-Spe I-FUS IDR-Spe I-mCherry-Cry2olig载体,凝胶电泳后纯化得到线性化的载体;然后通过PCR获取目的基因片段如EWS-FLI1/EWS-FLI1mut或者EWS IDR,凝胶电泳后纯化得到目的片段;接着根据产品说明书使用重组试剂HiFi DNA Assembly Master Mix(NEB,E2621L)将线性化的载体和目的片段重组,融合基因如EWS-FLI1/EWS-FLI1mut与线性化的载体pCMV-mCherry-AgeI-FUS-ERG-Age I重组,融合基因如EWS IDR与线性化的载体pHR-Spe I-FUS IDR-Spe I-mCherry-Cry2olig重组;最后将重组产物转化感受态Stbl3(康体生命,KTSM110L),复苏培养后均匀的平铺在带有对应抗性的平板上,37℃过夜培养后挑取单克隆测序鉴定。The original lentiviral vectors pCMV-mCherry-Age I-FUS-ERG-Age I and pHR-Spe I-FUS IDR-Spe I-mCherry-Cry2olig involved in this study were from the laboratory of Professor Li Pilong of Huazhong University. The genes involved and their mutations such as EWS-FLI1/EWS-FLI1mut or EWS IDR were synthesized by Genewise (which has been acquired by Ascent). The construction process first linearizes the pCMV-mCherry-Age I-FUS-ERG-Age I vector with restriction endonuclease Age I-HF (NEB, R3552S), and linearizes the pHR-Spe I-FUS IDR-Spe I-mCherry-Cry2olig vector with endonuclease Spe I-HF (NEB, R3133S), and purifies the linearized vector after gel electrophoresis; then, PCR is used to obtain the target gene fragment such as EWS-FLI1/EWS-FLI1mut or EWS IDR, and the target fragment is purified after gel electrophoresis; then, the recombinant reagent is used according to the product instructions. HiFi DNA Assembly Master Mix (NEB, E2621L) was used to recombinate the linearized vector and the target fragment. Fusion genes such as EWS-FLI1/EWS-FLI1mut were recombined with the linearized vector pCMV-mCherry-AgeI-FUS-ERG-Age I. Fusion genes such as EWS IDR were recombined with the linearized vector pHR-Spe I-FUS IDR-Spe I-mCherry-Cry2olig. Finally, the recombinant products were transformed into competent Stbl3 (Kangti Life Science, KTSM110L), and after recovery culture, they were evenly spread on plates with corresponding resistance. After overnight culture at 37°C, single clones were picked for sequencing and identification.

细胞转染具体过程如下:The specific process of cell transfection is as follows:

根据实验需求,细胞接种到不同孔径、材质的培养板或者培养皿中培养,待细胞密度50-80%时,进行转染实验。具体实验方法如下(以Lipo8000转染试剂为例):According to the experimental requirements, cells are inoculated into culture plates or culture dishes with different pore sizes and materials, and transfection experiments are performed when the cell density reaches 50-80%. The specific experimental method is as follows (taking Lipo8000 transfection reagent as an example):

1)在转染前,根据细胞类型、大小和生长速度等将细胞以适宜的密度接种在培养皿(下文以6孔板为例)中,待细胞密度达到75%左右时进行转染实验;1) Before transfection, cells are seeded in a culture dish (a 6-well plate is used as an example below) at an appropriate density according to cell type, size, and growth rate, and transfection experiments are performed when the cell density reaches about 75%;

2)转染前将培养基更换为预热的2ml新鲜培养基;2) Before transfection, replace the culture medium with 2 ml of pre-warmed fresh culture medium;

3)取用洁净无菌的离心管,加入125μL Opti-MEM(Gibco,31985062)培养基,加入2.5μg去除内毒素的质粒DNA,用移液器轻轻的吹打均匀混合;再加入4μL Lipo8000转染试剂,同样用移液器吹打混匀;3) Take a clean and sterile centrifuge tube, add 125 μL Opti-MEM (Gibco, 31985062) medium, add 2.5 μg endotoxin-free plasmid DNA, and mix them evenly with a pipette; then add 4 μL Lipo8000 transfection reagent and mix them evenly with a pipette;

4)(可选)室温放置10-20分钟;4) (Optional) Leave at room temperature for 10-20 minutes;

5)将转染试剂-DNA混合物均匀的滴加到培养皿中,轻轻的混匀即可;5) Add the transfection reagent-DNA mixture evenly into the culture dish and mix gently;

6)继续培养24-48小时,即可用适当的方式检测转染效果,如蛋白质免疫印迹Western blot,流式细胞分选FACS,荧光检测等,或加入适宜的抗生素进行筛选稳转细胞株。6) Continue culturing for 24-48 hours, and then detect the transfection effect by appropriate methods, such as Western blot, flow cytometry FACS, fluorescence detection, etc., or add appropriate antibiotics to screen stable transfected cell lines.

本实施例构建的稳转细胞系均为慢病毒侵染的稳转细胞株,所用质粒为两个骨架质粒pMD2.G(Addgene,#12259)和psPAX2(Addgene,#12260)。The stable cell lines constructed in this example are all stable cell lines infected with lentivirus, and the plasmids used are two backbone plasmids pMD2.G (Addgene, #12259) and psPAX2 (Addgene, #12260).

稳转细胞系的构建方法如下(以转染试剂PEI在6cm培养皿中的操作为例):The method for constructing a stable cell line is as follows (taking the operation of transfection reagent PEI in a 6 cm culture dish as an example):

1)将HEK293T细胞接种至6cm培养皿,待细胞密度为50%左右时开始转染;1) Inoculate HEK293T cells into a 6 cm culture dish and start transfection when the cell density reaches about 50%;

2)转染前将培养基更换为预热的6ml不含抗生素的培养基;2) Before transfection, replace the culture medium with 6 ml of pre-warmed culture medium without antibiotics;

3)配置转染混合液:800μL Opti-MEM中加入4μg含有目的基因的慢病毒质粒,2.4μg psPAX2和1.6μg pMD2.G,混合均匀后,加入18μL转染试剂PEI,温和的混匀,室温孵育20分钟;3) Prepare the transfection mixture: add 4 μg of lentiviral plasmid containing the target gene, 2.4 μg psPAX2 and 1.6 μg pMD2.G to 800 μL Opti-MEM, mix well, add 18 μL transfection reagent PEI, mix gently, and incubate at room temperature for 20 minutes;

4)将转染混合液均匀的滴加到培养皿中,轻轻的混匀后放入细胞培养箱培养;4) Add the transfection mixture evenly to the culture dish, mix gently and place in a cell culture incubator for culture;

5)转染8-12小时后,更换新鲜培养基,培养基中额外添加终浓度为1mM的丙酮酸钠和1mM的非必需氨基酸;5) 8-12 hours after transfection, replace the culture medium with fresh medium and add sodium pyruvate and 1 mM non-essential amino acids at a final concentration of 1 mM;

6)之后每12-18小时,收集培养基上清病毒液,用0.45μm的针头滤器过滤后加入到待侵染的细胞(例如U2OS)(密度50%左右),包装病毒的HEK293T细胞继续用添加了丙酮酸钠和非必需氨基酸的新鲜培养基培养;病毒上清液可连续收集3-5次;6) Every 12-18 hours thereafter, collect the virus liquid from the culture medium supernatant, filter it with a 0.45 μm syringe filter, and add it to the cells to be infected (e.g., U2OS) (density of about 50%). The HEK293T cells that packaged the virus continue to be cultured with fresh culture medium supplemented with sodium pyruvate and non-essential amino acids; the virus supernatant can be collected continuously for 3-5 times;

7)多次收集病毒上清侵染细胞(例如U2OS)后,更换新鲜培养基继续培养24小时;7) After collecting the viral supernatant and infecting the cells (e.g., U2OS) multiple times, replace with fresh culture medium and continue culturing for 24 hours;

8)根据相应抗生素标记或者荧光标记可选用流式分选或者持续的抗生素筛选方式富集稳转细胞株。8) Based on the corresponding antibiotic labeling or fluorescent labeling, flow cytometry or continuous antibiotic screening can be used to enrich stable cell lines.

本实施例所用到的U2OS/mCherry-FUS-ERGmut稳转细胞系为单克隆细胞系,是在得到稳转细胞株pool后经过流式分选192个单克隆细胞,扩大培养后根据成像效果选择的单克隆细胞系。其它所涉及的细胞系均为非单克隆稳转细胞系。具体构建过程在本流程第三步更换目的质粒即可,其余流程一致。The U2OS/mCherry-FUS-ERGmut stable cell line used in this example is a monoclonal cell line. After obtaining a stable cell line pool, 192 monoclonal cells were flow-sorted and the monoclonal cell line was selected based on the imaging effect after expanded culture. The other cell lines involved are all non-monoclonal stable cell lines. The specific construction process is to replace the target plasmid in the third step of this process, and the rest of the process is consistent.

本实施例所涉及的所有插入的基因序列信息由下表1提供。The sequence information of all inserted genes involved in this embodiment is provided in Table 1 below.

表1Table 1

本实施例所涉及的原核表达蛋白纯化及哺乳动物细胞转染相关质粒信息如下表2所示。The plasmid information related to prokaryotic protein expression purification and mammalian cell transfection involved in this embodiment is shown in Table 2 below.

表2本实施例相关质粒信息Table 2 Plasmid information related to this example

本实施例涉及的活细胞染色主要包括Hoechst对细胞核的染色和LysotrackerGreen对溶酶体的染色。The live cell staining involved in this embodiment mainly includes the staining of cell nuclei with Hoechst and the staining of lysosomes with LysotrackerGreen.

在进行活细胞细胞核染色时,可以用Hoechst 33342核酸染料(Invitrogen,H3570)进行染色。Hoechst可以透过细胞膜,与dsDNA结合后发出蓝色荧光。具体实验方法如下:When staining the nuclei of living cells, Hoechst 33342 nucleic acid dye (Invitrogen, H3570) can be used for staining. Hoechst can penetrate the cell membrane and emit blue fluorescence after binding to dsDNA. The specific experimental method is as follows:

1)把细胞接种至适宜的培养皿中培养;1) Inoculate the cells into a suitable culture dish for culture;

2)用PBS 1:2,000稀释Hoechst 33342染色液配置染色工作液;2) Dilute Hoechst 33342 staining solution 1:2,000 with PBS to prepare staining working solution;

3)吸去培养基,加入染色工作液;3) Aspirate the culture medium and add the staining working solution;

4)避光孵育5-10分钟;4) Incubate in dark for 5-10 minutes;

5)吸去染色液,温柔的用PBS漂洗细胞三次;5) Aspirate the staining solution and gently rinse the cells three times with PBS;

6)加入培养基,显微镜下观察细胞染色情况,成像留存;6) Add culture medium, observe cell staining under a microscope, and keep images;

在进行活细胞溶酶体染色时,可以用Lysotracker Green(Invitrogen,L7526)进行染色。Lysotracker Green是一种绿色染料,能够透过细胞膜,可以染色活细胞中的酸性区室,最大激发/发射波长分别为大约504/511nm。具体实验方法如下:When staining lysosomes in living cells, Lysotracker Green (Invitrogen, L7526) can be used for staining. Lysotracker Green is a green dye that can penetrate the cell membrane and can stain the acidic compartments in living cells. The maximum excitation/emission wavelengths are approximately 504/511 nm, respectively. The specific experimental method is as follows:

1)取少量Lysotracker Green按照1:13,333-1:20,000的比例加入到细胞培养基中待用,终浓度为50-75nM,37℃预孵育;1) Take a small amount of Lysotracker Green and add it to the cell culture medium at a ratio of 1:13,333-1:20,000 for use. The final concentration is 50-75nM and pre-incubate at 37°C.

2)去除细胞培养基,加入预孵育的Lysotracker Green染色工作液,与细胞37℃共孵育30分钟;2) Remove the cell culture medium, add the pre-incubated Lysotracker Green staining solution, and incubate with the cells at 37°C for 30 minutes;

3)去除Lysotracker Green染色工作液,用PBS漂洗细胞三次;3) Remove the Lysotracker Green staining solution and rinse the cells three times with PBS;

4)加入新鲜培养基,进行荧光观察,成像留存。4) Add fresh culture medium, perform fluorescence observation, and keep the image.

本实施例涉及的双荧光素报告实验所用试剂为Dual-Reporter AssaySystem 10-Pack(Promega,E6921)。具体实验步骤如下:The reagents used in the dual fluorescein reporter experiment involved in this example are Dual- Reporter Assay System 10-Pack (Promega, E6921). The specific experimental steps are as follows:

1)以24孔板一个孔为例,转染HEK293T细胞,转染试剂为Lipo8000,每孔25μLOpti-MEM+1.2μL Lipo8000+606ng质粒,质粒组分如下:1) Take one well of a 24-well plate as an example, transfect HEK293T cells, the transfection reagent is Lipo8000, 25μL Opti-MEM + 1.2μL Lipo8000 + 606ng plasmid per well, the plasmid components are as follows:

300μg mCherry-EWS-FLI1300 μg mCherry-EWS-FLI1

300μg 25x GGAA-promoter firefly luciferase(从北京大学生命学院齐志实验室获得)300 μg 25x GGAA-promoter firefly luciferase (obtained from Qi Zhi's laboratory, School of Life Sciences, Peking University)

6μg Renilla luciferase---internal reference(从北京大学生命学院齐志实验室获得)6μg Renilla luciferase---internal reference (obtained from Qi Zhi's laboratory, School of Life Sciences, Peking University)

对照组转染不含融合蛋白的质粒;The control group was transfected with a plasmid without fusion protein;

2)转染24小时后,用PBS小心漂洗被转染的细胞;2) 24 hours after transfection, carefully rinse the transfected cells with PBS;

3)配制1x Passive lysis buffer(PLB),用无菌水稀释5x PLB储液;5x PLB储液分装后存于-80冰箱;3) Prepare 1x Passive lysis buffer (PLB) and dilute 5x PLB stock solution with sterile water; store the 5x PLB stock solution in aliquots at -80°C;

4)用100μL 1x PLB加入被漂洗的转染细胞中,充分裂解15分钟左右,收集裂解液,离心去除上清待用;4) Add 100 μL 1x PLB to the washed transfected cells and fully lyse for about 15 minutes. Collect the lysate and centrifuge to remove the supernatant for later use.

5)配制Luciferase Assay Reagent II(LAR II):向冻干的Luciferase AssaySubstrate中加入Luciferase Assay Buffer II,充分溶解后分装存于-80℃冰箱;5) Preparation of Luciferase Assay Reagent II (LAR II): Add Luciferase Assay Buffer II to the freeze-dried Luciferase Assay Substrate, dissolve thoroughly, and store in aliquots at -80°C.

6)分装50x Stop&Substrate(SGS),存于-80℃冰箱;6) Aliquot 50x Stop& Substrate (SGS), stored in -80℃ freezer;

7)分装Stop&Buffer(SGB),存于-80℃冰箱;7) Stop& Buffer (SGB), stored in -80℃ refrigerator;

8)一份体积的SGS加入50倍体积的SGB中,配制Stop&Reagent(SGR);8) Add one volume of SGS to 50 volumes of SGB to prepare Stop & Reagent(SGR);

9)在专用检测96板上加入100μL底物LAR II,之后加入20μL细胞裂解液混匀;9) Add 100 μL of substrate LAR II to the dedicated detection 96 plate, and then add 20 μL of cell lysis buffer and mix well;

10)将检测板放入多功能酶标仪中检测,详细程序为2秒的延迟后,进行两次10秒的读值,此项读数为Firefly荧光素酶读值;10) Place the test plate in a multifunctional microplate reader for testing. The detailed procedure is to perform two 10-second readings after a 2-second delay. This reading is the Firefly luciferase reading;

11)取出检测板,加入100μL底物SGR后充分混匀,再次放入多功能酶标仪中检测,此项读数为Renilla荧光素酶读值。11) Take out the test plate, add 100 μL of substrate SGR and mix thoroughly, then put it into the multifunctional microplate reader for detection again. This reading is the Renilla luciferase reading.

每个样品做三个重复,每次实验重复三次。Each sample was replicated three times and each experiment was repeated three times.

计算时,用Renilla荧光素酶读值去矫正Firefly荧光素酶读值,再跟不转染融合蛋白的对照组进行比较就可以算出比值。这个比值代表了融合蛋白作为转录因子对下游靶基因的转录增强能力。When calculating, the reading of Firefly luciferase is corrected by using the reading of Renilla luciferase, and then compared with the control group without transfection of fusion protein to calculate the ratio. This ratio represents the transcription enhancement ability of the fusion protein as a transcription factor on the downstream target gene.

本实施例涉及的蛋白质表达纯化过程如下:The protein expression and purification process involved in this embodiment is as follows:

(1)蛋白质的原核表达(1) Prokaryotic expression of proteins

1)含有目的基因的载体转化至BL21感受态中,涂板后过夜培养;1) The vector containing the target gene is transformed into BL21 competent cells and cultured overnight after plating;

2)在LB平板上挑取单克隆,加入5ml含有对应抗生素的LB液体培养基中,在37℃摇床中220rpm培养4小时;2) Pick a single clone on the LB plate, add it to 5 ml of LB liquid medium containing the corresponding antibiotic, and culture it in a shaker at 37°C and 220 rpm for 4 hours;

3)将培养的5ml菌液转移至200ml含有抗生素的新鲜LB液体培养基中,37℃200rpm继续培养3小时;3) Transfer 5 ml of the cultured bacterial solution to 200 ml of fresh LB liquid medium containing antibiotics and continue culturing at 37°C and 200 rpm for 3 hours;

4)将培养的200ml菌液转移至800ml含有抗生素的新鲜LB液体培养基中,37℃200rpm继续培养;4) Transfer 200 ml of the cultured bacterial liquid to 800 ml of fresh LB liquid medium containing antibiotics and continue culturing at 37°C and 200 rpm;

5)等到细菌进入到对数生长期(OD600=0.8)时加入诱导剂IPTG,终浓度为0.5mM5) When the bacteria enter the logarithmic growth phase (OD600 = 0.8), add the inducer IPTG to a final concentration of 0.5 mM

6)16℃下200rpm过夜培养16小时。6) Incubate overnight at 16°C and 200 rpm for 16 hours.

(2)收集细菌和裂解(2) Collecting bacteria and lysing

1)在高速离心机中4℃4,392g离心30分钟收集细菌;1) Collect the bacteria by centrifugation at 4,392 g for 30 minutes at 4°C in a high-speed centrifuge;

2)去除上清,每升菌液对应加入30ml细胞裂解缓冲液重悬;2) Remove the supernatant and add 30 ml of cell lysis buffer per liter of bacterial liquid to resuspend;

3)冰浴中超声破碎菌体30分钟,条件为:超声功率50W,超声频率20kHz,周期为2秒声波破碎,4秒冷却。3) Ultrasonic disruption of the bacteria in an ice bath for 30 minutes, under the following conditions: ultrasonic power 50 W, ultrasonic frequency 20 kHz, cycle of 2 seconds for ultrasonic disruption and 4 seconds for cooling.

(3)蛋白Ni-NTA初纯化(3) Protein Ni-NTA initial purification

1)4℃47,850g离心30分钟收集细菌液上清;1) Centrifuge at 47,850 g for 30 minutes at 4°C to collect the bacterial supernatant;

2)用30ml细胞裂解液平衡Ni-NTA resin;2) Equilibrate Ni-NTA resin with 30 ml cell lysate;

3)向平衡后的Ni-NTA resin加入细菌裂解液上清,4℃温柔混匀孵育30分钟使蛋白与Ni充分结合;3) Add bacterial lysate supernatant to the equilibrated Ni-NTA resin, mix gently and incubate at 4°C for 30 minutes to allow the protein to fully bind to Ni;

4)在4℃让裂解液在由重力自然穿过Ni-NTA resin,收集流穿液;4) Allow the lysate to pass through the Ni-NTA resin by gravity at 4°C and collect the flow-through;

5)在4℃用30ml细胞裂解液冲洗Ni-NTA resin洗脱杂质;5) Wash the Ni-NTA resin with 30 ml of cell lysis buffer at 4°C to elute impurities;

6)在4℃用30ml镍离子柱亲和层析冲洗缓冲液冲洗Ni-NTA resin洗脱杂质和非特异性结合的蛋白;6) Wash the Ni-NTA resin with 30 ml of nickel ion column affinity chromatography wash buffer at 4°C to elute impurities and non-specifically bound proteins;

7)在4℃用30ml镍离子柱亲和层析洗脱缓冲液洗脱Ni-NTA resin;7) Elute the Ni-NTA resin with 30 ml of nickel ion column affinity chromatography elution buffer at 4°C;

8)收集洗脱液并用SDS-PAGE检测蛋白表达情况。8) Collect the eluate and use SDS-PAGE to detect protein expression.

(4)蛋白二次纯化(4) Secondary protein purification

1)用120ml蛋白稀释液稀释洗脱液;1) Dilute the eluate with 120 ml of protein diluent;

2)用HiTrap Q HP阴离子交换柱对蛋白进行纯化,使用蛋白稀释液(从100%到0%)和高盐溶液(从0%到100%)进行梯度洗脱;2) Purify the protein using a HiTrap Q HP anion exchange column, using a gradient elution of protein diluent (from 100% to 0%) and a high salt solution (from 0% to 100%);

3)收集洗脱液并用SDS-PAGE对目的蛋白进行鉴定;3) Collect the eluate and identify the target protein using SDS-PAGE;

4)超滤浓缩高纯度(>95%)目的蛋白洗脱液;4) Ultrafiltration and concentration of high purity (>95%) target protein eluate;

5)利用SD200,使用凝胶过滤缓冲液进行凝胶过滤色谱进一步纯化蛋白;5) using SD200 and gel filtration buffer to further purify the protein;

6)收集洗脱液并用SDS-PAGE对目的蛋白进行鉴定;6) Collect the eluate and identify the target protein using SDS-PAGE;

7)超滤浓缩高纯度(>95%)目的蛋白并测量蛋白质浓度;7) Ultrafiltration concentrates the high-purity (>95%) target protein and measures the protein concentration;

8)浓缩后的蛋白用液氮迅速冷冻后存储于-80℃备用。8) The concentrated protein was quickly frozen with liquid nitrogen and stored at -80°C for future use.

本实施例涉及的蛋白质荧光标记如下:The protein fluorescent labeling involved in this embodiment is as follows:

Alexa FluorTM488 NHS酯是一种亮绿色荧光染料,具有水溶性和较强的pH稳定性,可被488nm激发产生信号稳定的荧光。Alexa FluorTM488 NHS酯可以与蛋白质上的氨基形成稳定的酰胺键,主要是伯胺。Alexa Fluor TM 488 NHS Ester is a bright green fluorescent dye that is water-soluble and pH-stable. It can be excited at 488nm to produce stable fluorescence signals. Alexa Fluor TM 488 NHS Ester can form stable amide bonds with amino groups on proteins, mainly primary amines.

小分子染料溶解于DMSO中,浓度为1mg/ml。使用时将小分子染料与目的蛋白质按照摩尔比1:1的比例进行混合,室温孵育1小时。用MicroSpin G-50分离柱5,000rpm离心10分钟,弃掉柱子中预装的平衡液后将标记好的蛋白质加入分离柱中,5,000rpm离心10分钟,标记好的蛋白质溶液在离心管中。The small molecule dye is dissolved in DMSO at a concentration of 1 mg/ml. When used, the small molecule dye and the target protein are mixed at a molar ratio of 1:1 and incubated at room temperature for 1 hour. Centrifuge the MicroSpin G-50 separation column at 5,000 rpm for 10 minutes, discard the pre-loaded equilibrium solution in the column, add the labeled protein to the separation column, and centrifuge at 5,000 rpm for 10 minutes. The labeled protein solution is in a centrifuge tube.

本实施例涉及的DNA荧光标记过程如下:The DNA fluorescent labeling process involved in this embodiment is as follows:

Cy5是一种远红外染料,激发波长为633nm或者647nm。该染料的显著优势是该光谱区域的生物样本的低自发荧光。Cy5 is a far-red dye with an excitation wavelength of 633nm or 647nm. The significant advantage of this dye is its low autofluorescence in biological samples in this spectral region.

本文在合成引物时在正向引物5’端标记修饰上Cy5标记,之后正常PCR扩增,DNA产物纯化得到标记好的DNA片段,稀释至所需浓度即可使用。In this paper, when synthesizing primers, the 5' end of the forward primer is labeled with Cy5, followed by normal PCR amplification, purification of the DNA product to obtain the labeled DNA fragment, and dilution to the required concentration for use.

体外与FUS-ERG融合蛋白孵育的306bp DNA序列(SEQ ID NO:47)如下:CTCGACTAGGTTTTCCTCTTATGCTGAGAATTCCAGGTCCTGGAGAAGAAGAAAAAGAGAAAGAAAGAGAGAGAGAGAAGGAGTGAGAGAGGGAGGGAGGGAGGGAGGGAGGGAGGAAGGAAGGAAGGAAGGAAGGAAGGAAAGGAAGGAAGGAAGGAAGG AAGGAAGGAAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAGGAAAAGAAACAGCAAAAAAAGAAAGAGGGAGGATGGGAGGGAGGGAAAAAGTAAAAATGATTCTGTATCAGCTGGTATATACCAACAACTAGC,划线部分为25x GGAA序列。The 306 bp DNA sequence (SEQ ID NO:47) incubated with the FUS-ERG fusion protein in vitro is as follows: CTCGACTAGGTTTTCCTTATGCTGAGAATTCCAGGTCCTGGAAGAAGAAAAAGAGAAAGAAAGAGAGAGAAGGAGTGAGAGGGAGGGAGGGAGGGAGGGAGGGA GGAAGGAAGGAAGGAAGGAAGGAAGGAAAGGAAGGAAGGAAGGAAGG AAGGAAGGAAAGGAAGGAAGGAAGGA AGGAAGGAAGGAAGGAAGGAAGGAA AAGAAACAGCAAAAAAAGAAAGAGGGAGGATGGGAGGGAGGGAAAAAGTAAAAATGATTCTGTATCAGCTGGTATATACCAACAACTAGC, the underlined part is the 25x GGAA sequence.

本实施例涉及的体外蛋白质相分离实验过程如下:The in vitro protein phase separation experimental process involved in this embodiment is as follows:

蛋白质体外相分离实验均在384孔低吸附的多孔板(Cellvis,P384-1.5H-N)中进行。对于带有MBP标签的蛋白,1μL 1mg/ml的TEV蛋白酶直接加入10μL待检测蛋白溶液中,室温孵育半小时即可。Protein in vitro phase separation experiments were all performed in 384-well low-adsorption multiwell plates (Cellvis, P384-1.5H-N). For proteins with MBP tags, 1 μL of 1 mg/ml TEV protease was directly added to 10 μL of the protein solution to be tested and incubated at room temperature for half an hour.

对于检测FUS-ERG与25x GGAA双链DNA相互作用的实验,在FUS-ERG相分离液滴中加入50nM的DNA片段,孵育5分钟后开始观察。For the experiment to detect the interaction between FUS-ERG and 25x GGAA double-stranded DNA, 50 nM of DNA fragments were added to the FUS-ERG phase-separated droplets, and observation began after incubation for 5 minutes.

蛋白质体外相分离实验成像均使用Nikon A1R HD25高分辨共聚焦显微镜完成。The imaging of protein in vitro phase separation experiments was performed using a Nikon A1R HD25 high-resolution confocal microscope.

本实施例涉及的荧光漂白恢复实验过程如下:The fluorescence photobleaching recovery experiment process involved in this embodiment is as follows:

荧光漂白恢复实验FRAP,使用Nikon A1R HD25高分辨共聚焦显微镜,用和目的蛋白或DNA所带发光基团相同的激发光进行漂白实验,即用488nm漂白带有EGFP标签的蛋白,用647nm漂白Cy5标记的DNA。每组实验至少重复三次,激光强度为50%,漂白时间1秒。根据恢复快慢间隔不同时间采集图像,直至荧光强度曲线不再变化。未漂白区域相变液滴的荧光强度变化用来矫正成像过程自然发生的漂白效应,无相变液滴的背景部分的荧光强度变化用来扣除背景荧光强度。数据处理使用NIS-Elements AR Analysis,ImageJ和GraphPadPrism 8.0。Fluorescence photobleaching recovery experiment FRAP, using Nikon A1R HD25 high-resolution confocal microscope, bleaching experiment with the same excitation light as the luminescent group carried by the target protein or DNA, that is, 488nm was used to bleach the protein with EGFP label, and 647nm was used to bleach the Cy5 labeled DNA. Each group of experiments was repeated at least three times, with a laser intensity of 50% and a bleaching time of 1 second. Images were collected at different intervals according to the recovery speed until the fluorescence intensity curve no longer changed. The fluorescence intensity change of the phase change droplet in the unbleached area was used to correct the bleaching effect that occurred naturally during the imaging process, and the fluorescence intensity change of the background part without phase change droplet was used to subtract the background fluorescence intensity. Data processing was performed using NIS-Elements AR Analysis, ImageJ and GraphPadPrism 8.0.

本实施例涉及的细胞荧光成像过程如下:The cell fluorescence imaging process involved in this embodiment is as follows:

细胞转染24小时后即可进行活细胞成像。使用的成像培养皿为35mm玻璃底的四分皿(In Vitro Scientific,D35C4-20-1-N)。使用NIKON A1R HD25高分辨共聚焦显微镜或NIKON超分辨共聚焦显微镜A1/SIM/STORM,在37℃恒温保湿并保持5%CO2的活细胞工作平台成像。图像处理使用NIS-Elements AR Analysis。Live cell imaging can be performed 24 hours after cell transfection. The imaging culture dish used is a 35mm glass-bottomed quadrant (In Vitro Scientific, D35C4-20-1-N). Use a NIKON A1R HD25 high-resolution confocal microscope or a NIKON super-resolution confocal microscope A1/SIM/STORM, and image the live cell working platform at 37°C with constant temperature and humidity and 5% CO 2. Image processing uses NIS-Elements AR Analysis.

OptoIDR实验优化自文献(Shin et al.,2017)。本实验使用NIKON超分辨共聚焦显微镜A1/SIM/STORM,对于待检测IDR蛋白,先用561nm采集处理前的图像,然后根据检测蛋白的相分离能力强弱,用荧光强度为0.5-3%的488nm激光点扫描刺激一次,然后用561nm每隔3秒采集一张图像,持续3分钟左右。本实验需要注意全程避光,每组蛋白重复至少两次。The OptoIDR experiment was optimized from the literature (Shin et al., 2017). This experiment uses the NIKON super-resolution confocal microscope A1/SIM/STORM. For the IDR protein to be detected, the image before processing is first collected at 561nm, and then stimulated once with a 488nm laser point scan with a fluorescence intensity of 0.5-3% according to the phase separation ability of the detected protein, and then an image is collected every 3 seconds at 561nm for about 3 minutes. This experiment needs to be kept away from light throughout the process, and each group of proteins is repeated at least twice.

定量处理时,使用CellProfiler软件的Identify Primary Objects module识别图像中直径为5-20pixels的相分离液滴。对于识别出的液滴区域,本文使用CellProfiler来测量这个区域的荧光强度以及整个图像的总荧光强度,然后用这两种强度的比值作为图像中蛋白质相分离的度量,以此在一个时间序列里衡量蛋白质的相分离能力。During quantitative processing, the Identify Primary Objects module of CellProfiler software was used to identify phase-separated droplets with a diameter of 5-20 pixels in the image. For the identified droplet area, this paper used CellProfiler to measure the fluorescence intensity of this area and the total fluorescence intensity of the entire image, and then used the ratio of these two intensities as a measure of protein phase separation in the image, thereby measuring the phase separation ability of the protein in a time series.

本实施例涉及的DropScan实验过程如下:The DropScan experimental process involved in this embodiment is as follows:

本申请搭建了DropScan的实验方法,用以发现影响蛋白质相分离的小分子化合物。This application builds a DropScan experimental method to discover small molecule compounds that affect protein phase separation.

具体操作如下:The specific operations are as follows:

将构建好的单克隆细胞系U2OS/mCherry-FUS-ERGmut以每孔7,000-8,000的密度接种到CellCarrier-384Ultra微孔板(PerkinElmer)中,然后过夜培养,未接种细胞的孔内补加等体积的无菌水或者PBS以减缓边缘孔挥发导致体积不准影响实验结果。第二天用Echo550纳升级声波移液系统(Labcyte)将提前准备好的小分子化合物以终浓度10μM的浓度加入细胞中,之后立即利用激光共聚焦高内涵成像显微镜进行高内涵成像,40x水镜下每孔采集两张视野,逐孔拍照,一轮结束后立即开始新的循环,持续拍照6小时左右,对于每张视野而言,就是一个长达6小时的长时程拍照。The constructed monoclonal cell line U2OS/mCherry-FUS-ERGmut was inoculated into the CellCarrier-384Ultra microplate (PerkinElmer) at a density of 7,000-8,000 per well, and then cultured overnight. The wells without cells were supplemented with an equal volume of sterile water or PBS to slow down the evaporation of the edge wells, which may cause inaccurate volume and affect the experimental results. The next day, the prepared small molecule compounds were added to the cells at a final concentration of 10μM using the Echo550 nanoliter acoustic pipetting system (Labcyte), and then high-content imaging was performed immediately using a laser confocal high-content imaging microscope. Two fields of view were collected for each well under a 40x water microscope, and photos were taken one by one. A new cycle was started immediately after one round, and the photos were taken for about 6 hours. For each field of view, it was a long-term photo for up to 6 hours.

小分子信息Small molecule information

本实施例中细胞内筛选使用的小分子库为APExBIO Anti-Cancer-Compound-Library Plus,为清华大学药学实验平台订购和管理。储液小分子浓度为10毫摩尔,溶剂为二甲基亚砜DMSO。实验中小分子如无特殊说明终浓度均为10微摩尔。The small molecule library used for intracellular screening in this example is APExBIO Anti-Cancer-Compound-Library Plus, which is ordered and managed by the Pharmaceutical Experiment Platform of Tsinghua University. The concentration of the small molecule in the stock solution is 10 millimolar, and the solvent is dimethyl sulfoxide (DMSO). Unless otherwise specified, the final concentration of the small molecule in the experiment is 10 micromolar.

在后面小分子验证实验中,十个小分子LY2835219,CX-6258,EMD-1214063,Pelitinib(EKB-569),GDC-0941,AZD-9291,UNC 0631,CHIR-124,SB743921and Ursolicacid均为清华药学平台APExBIO Anti-Cancer-Compound-Library Plus分子库单取使用。In the subsequent small molecule verification experiments, ten small molecules, LY2835219, CX-6258, EMD-1214063, Pelitinib (EKB-569), GDC-0941, AZD-9291, UNC 0631, CHIR-124, SB743921 and Ursolicacid, were all taken from the APExBIO Anti-Cancer-Compound-Library Plus molecular library of Tsinghua University's School of Pharmacy.

在证明LY2835219发挥功能不依赖于CDK4/6抑制剂活性实验中,所涉及的小分子化合物均来于小分子库APExBIO Anti-Cancer-Compound-Library Plus,在展示结果时直接取用了筛选时的小分子处理细胞图像。In the experiment proving that the function of LY2835219 is independent of the activity of CDK4/6 inhibitors, the small molecule compounds involved were all from the small molecule library APExBIO Anti-Cancer-Compound-Library Plus, and the cell images treated with small molecules during screening were directly used to display the results.

在研究LY2835219发挥功能的潜在机制时,使用的溶酶体抑制剂Baf-A1终浓度为200纳摩尔。When investigating the potential mechanism of LY2835219 function, the lysosomal inhibitor Baf-A1 was used at a final concentration of 200 nanomolar.

本实施例涉及的蛋白质免疫印迹实验(Western blot)过程如下:The protein immunoblotting experiment (Western blot) process involved in this embodiment is as follows:

细胞样品用试剂盒MinuteTM Total Protein Extraction Kit for AnimalCultured Cells/Tissues(Invent)提取细胞总蛋白。总蛋白可直接进行下游蛋白免疫印迹Western blot检测。The total protein of cells was extracted using the Minute TM Total Protein Extraction Kit for Animal Cultured Cells/Tissues (Invent). The total protein can be directly detected by downstream Western blot.

Western blot的具体操作步骤如下:The specific steps of Western blot are as follows:

1)电泳:使用预制梯度胶,在蛋白上样后,根据检测蛋白大小,110V电压运行1.5小时左右即可进行免疫印迹检测;1) Electrophoresis: Use pre-made gradient gel. After protein loading, run at 110V for about 1.5 hours to perform immunoblotting detection according to the size of the detected protein.

2)转膜:本实验均为半干转,硝酸纤维素膜(NC膜)和转膜滤纸均需使用新鲜配置的预冷的转膜液浸泡充分,然后由下及上的转膜三明治为滤纸、NC膜、蛋白胶及滤纸。需尽可能除尽空气避免影响转膜效果,转膜条件为恒流250mA,65分钟;2) Transfer: This experiment is semi-dry transfer. Both the nitrocellulose membrane (NC membrane) and the transfer filter paper need to be fully soaked in freshly prepared pre-cooled transfer solution. Then the transfer sandwich from bottom to top is the filter paper, NC membrane, protein gel and filter paper. Air should be removed as much as possible to avoid affecting the transfer effect. The transfer conditions are constant current 250mA, 65 minutes;

3)封闭:转膜结束后,将膜剪裁至适合的大小,放入含有5%脱脂奶粉的PBST溶液中,摇床上室温封闭1.5小时左右;3) Blocking: After the transfer, cut the membrane into a suitable size, put it into PBST solution containing 5% skimmed milk powder, and block it at room temperature on a shaker for about 1.5 hours;

4)一抗:封闭结束后,用PBST溶液在摇床上漂洗NC膜三次,每次5分钟,然后将NC膜放入一抗溶液中,4℃过夜孵育;4) Primary antibody: After blocking, rinse the NC membrane three times with PBST solution on a shaker for 5 minutes each time, then place the NC membrane in the primary antibody solution and incubate overnight at 4°C;

5)二抗:一抗结束后,用PBST溶液在摇床上漂洗NC膜三次,每次5分钟,然后将NC膜放入二抗溶液中,摇床上室温孵育1小时;5) Secondary antibody: After the primary antibody is completed, the NC membrane is rinsed three times with PBST solution on a shaker for 5 minutes each time, and then the NC membrane is placed in the secondary antibody solution and incubated on a shaker at room temperature for 1 hour;

6)显色:二抗结束后,用PBST溶液在摇床上漂洗NC膜三次,每次5分钟,将膜置于成像仪中,均匀的滴加显色液,即可进行成像。6) Color development: After the secondary antibody is completed, rinse the NC membrane with PBST solution on a shaker three times, 5 minutes each time, place the membrane in an imager, evenly add the color development solution, and then start imaging.

本实施例所涉及的RT-qPCR实验过程如下:The RT-qPCR experimental process involved in this embodiment is as follows:

本实施例中RNA提取使用的是Trizol法提取,具体操作如下:In this embodiment, RNA extraction was performed using the Trizol method, and the specific steps were as follows:

1)用预冷的无DNA/RNA酶的PBS冲洗细胞,之后直接在培养皿中加入300μL Trizol裂解细胞并富集RNA,将细胞Trizol裂解液吸取至无DNA/RNA酶的离心管中;1) Rinse the cells with pre-cooled PBS free of DNA/RNase, then add 300 μL Trizol directly to the culture dish to lyse the cells and enrich RNA, and pipette the cell Trizol lysate into a centrifuge tube free of DNA/RNase;

2)向离心管中加入2μL线性丙烯酰胺,终浓度为10-20μg/ml,充分混匀;2) Add 2 μL of linear acrylamide to the centrifuge tube to a final concentration of 10-20 μg/ml and mix thoroughly;

3)室温孵育5分钟;3) Incubate at room temperature for 5 minutes;

4)加入60μL三氯甲烷,充分混匀15秒,室温孵育5分钟;4) Add 60 μL of chloroform, mix thoroughly for 15 seconds, and incubate at room temperature for 5 minutes;

5)4℃,14,000rpm离心15分钟;5) Centrifuge at 14,000 rpm for 15 min at 4°C;

6)小心的转移上清(约160-200μL,大约是Trizol体积的60%)到无DNA/RNA酶的离心管中;6) Carefully transfer the supernatant (about 160-200 μL, about 60% of the volume of Trizol) to a DNase/RNase-free centrifuge tube;

7)加入1倍体积的预冷的异丙醇,充分混匀后将离心管放置在-20℃,至少2小时,过夜为佳;7) Add 1 volume of pre-cooled isopropanol, mix thoroughly and place the centrifuge tube at -20°C for at least 2 hours, preferably overnight;

8)4℃,14,000rpm离心30分钟,小心的去除上清;8) Centrifuge at 14,000 rpm for 30 min at 4°C and carefully remove the supernatant;

9)用300μL 75%的乙醇漂洗沉淀,室温孵育5-10分钟,去除可能残余的胍盐;9) Rinse the precipitate with 300 μL 75% ethanol and incubate at room temperature for 5-10 minutes to remove any residual guanidine salt;

10)4℃,14,000rpm离心10分钟;10) Centrifuge at 14,000 rpm for 10 min at 4°C;

11)小心的去除上清,可以用吸头吸取残余的乙醇,室温风干5-10分钟;11) Carefully remove the supernatant. You can use a pipette to absorb the remaining ethanol and air-dry at room temperature for 5-10 minutes.

12)加入50μL商业化的去RNA酶水,室温放置5分钟,用吸头反复吹吸充分溶解;12) Add 50 μL of commercial RNase-free water, leave at room temperature for 5 minutes, and pipette repeatedly to fully dissolve;

13)用Nanodrop测量RNA浓度,之后保存在-80℃;13) Measure RNA concentration using Nanodrop and store at -80°C;

之后提取的RNA反转录用全式金One-Step gDNA Removal and cDNASynthesis SuperMix(AT311-02),先按下表操作。The extracted RNA was then reverse transcribed using the full-length gold One-Step gDNA Removal and cDNA Synthesis SuperMix (AT311-02), first follow the table below.

然后按如下条件继续操作,如下表:Then continue to operate according to the following conditions, as shown in the following table:

反转录的得到的cDNA稀释至80μL,qPCR所用试剂为Transgen PerfectStartTMGreen qPCR SuperMix(AQ601-01)。10μL体系如下表所示:The cDNA obtained by reverse transcription was diluted to 80 μL, and the reagent used for qPCR was Transgen PerfectStart Green qPCR SuperMix (AQ601-01). The 10 μL system is shown in the following table:

每个样品三个技术性重复,两个实验性重复,样品放入CFX Real-time PCRsystem(BIO-RAD)进行检测。本实施例涉及到的qPCR引物如下表所示:Each sample was tested in three technical replicates and two experimental replicates, and the samples were placed in CFX Real-time PCR system (BIO-RAD) for detection. The qPCR primers involved in this example are shown in the following table:

结果:result:

FUS-ERG融合蛋白在体内外驱动相分离FUS-ERG fusion protein drives phase separation in vivo and in vitro

EWS-FLI1的相分离行为已被广泛研究。本发明人首先将注意力转向EWS-FLI的同源融合蛋白FUS-ERG(图5(a)),该融合蛋白在AML和尤因肉瘤中发现,与临床预后不良相关。据报道,FUS-ERG可增强靶基因表达。本发明人想知道,FUS-ERG是否与EWS-FLl1一样也能驱动相分离的发生。The phase separation behavior of EWS-FLI1 has been extensively studied. The inventors first turned their attention to the homologous fusion protein of EWS-FLI, FUS-ERG (Figure 5(a)), which is found in AML and Ewing's sarcoma and is associated with poor clinical prognosis. FUS-ERG has been reported to enhance target gene expression. The inventors wanted to know whether FUS-ERG, like EWS-FLl1, could also drive the occurrence of phase separation.

众所周知,在FUS-ERG中,FUS的部分经历了相分离,而ERG的部分包含一个识别GGAA微卫星序列的DBD。本发明人在Hela细胞中瞬时表达mCherry-FUS-ERG。发现它与染色质共定位(图5(b)),并且在细胞核中显示出少量聚集物。这一观察结果表明,FUS-ERG要么具有很小的相分离电位,要么FUS-ERG聚集物润湿了染色质,从而失去了它们的球形形态。接下来,本申请试图区分这两种可能性。为此,本发明人利用Brangwynne和同事(Shin,Y.etal.Spatiotemporal control of intracellular phase transitions using light-activated optodroplets.Cell 168,159-171(2017))建立的OptoIDR系统来评估相分离潜力。简而言之,OptoIDR系统由感兴趣的IDR、mCherry荧光标记和拟南芥Cry2蛋白(Cry2olig)光解酶结构域的聚集增强形式组成。在蓝光刺激下,Cry2olig形成寡聚物,这也导致IDR寡聚并增加IDR的价态。这可能会在低于LLPS临界浓度的浓度下驱动相分离(图5(c))。与之前的工作一致(Wei,M.T.et al.Nucleated transcriptional condensatesamplify gene expression.Nat.Cell Biol.22,1187-1196(2020);Shin,Y.etal.Spatiotemporal control of intracellular phase transitions using light-activated optodroplets.Cell 168,159-171(2017)),蓝光刺激导致表达OptoFUS的细胞发生明显的相分离(图5(d),图5(e)),这表明FUS IDR确实具有发生相分离的潜力。It is well known that in FUS-ERG, part of FUS undergoes phase separation, while part of ERG contains a DBD that recognizes the GGAA microsatellite sequence. The inventors transiently expressed mCherry-FUS-ERG in Hela cells. It was found to colocalize with chromatin (Figure 5(b)) and show a small amount of aggregates in the nucleus. This observation suggests that FUS-ERG either has a very small phase separation potential or that FUS-ERG aggregates wet the chromatin, thereby losing their spherical morphology. Next, the present application attempts to distinguish between these two possibilities. To this end, the inventors used the OptoIDR system established by Brangwynne and colleagues (Shin, Y. et al. Spatiotemporal control of intracellular phase transitions using light-activated optodroplets. Cell 168, 159-171 (2017)) to evaluate the phase separation potential. In brief, the OptoIDR system consists of an IDR of interest, an mCherry fluorescent marker, and an aggregation-enhanced form of the photolyase domain of the Arabidopsis Cry2 protein (Cry2olig). Upon stimulation with blue light, Cry2olig forms oligomers, which also leads to IDR oligomerization and increases the valence of IDRs. This may drive phase separation at concentrations below the critical concentration for LLPS (Figure 5(c)). Consistent with previous work (Wei, M.T. et al. Nucleated transcriptional condensates amplify gene expression. Nat. Cell Biol. 22, 1187-1196 (2020); Shin, Y. et al. Spatiotemporal control of intracellular phase transitions using light-activated optodroplets. Cell 168, 159-171 (2017)), blue light stimulation caused obvious phase separation in cells expressing OptoFUS (Figure 5(d), Figure 5(e)), indicating that FUS IDR does have the potential to undergo phase separation.

接下来,本发明人纯化了标记重组增强绿色荧光蛋白(EGFP)的FUS-ERG(图6(c))。在无拥挤试剂的生理缓冲条件下,EGFP-FUS-ERG在亚微摩尔浓度下进行相分离(图5(f))。聚集物融合实验和荧光漂白恢复(FRAP)实验后的荧光回收表明,聚集物具有液体性质(图5(g),图5(h),图5(i))。本申请用EGFP-FUS-ERG孵育含有25个GGAA微卫星序列拷贝的DNA片段(图6(b),图6(c)),发现DNA底物被招募到聚集物中,这些聚集物变得不规则(5(j))。FRAP分析表明,这些聚集物仍然保留了一定的流动性(5(k),5(l))。Next, the inventors purified FUS-ERG labeled with recombinant enhanced green fluorescent protein (EGFP) (Figure 6 (c)). Under physiological buffer conditions without crowding agents, EGFP-FUS-ERG phase separated at submicromolar concentrations (Figure 5 (f)). Aggregate fusion experiments and fluorescence recovery after photobleaching (FRAP) experiments showed that the aggregates had liquid properties (Figure 5 (g), Figure 5 (h), Figure 5 (i)). The present application incubated a DNA fragment containing 25 copies of the GGAA microsatellite sequence with EGFP-FUS-ERG (Figure 6 (b), Figure 6 (c)) and found that the DNA substrate was recruited into the aggregates, which became irregular (5 (j)). FRAP analysis showed that these aggregates still retained a certain fluidity (5 (k), 5 (l)).

考虑到以上观察结果,本发明人想知道FUS-ERG是否真的在细胞中经历了相分离,并且产生的聚集物被限制在染色质上并发生变形。接下来,本发明人将点突变引入ERG,破坏DNA的结合。正如预期的那样,突变融合蛋白(FUS-ERGmut)在细胞中形成了许多球形液滴(5(m))。综上所述,这些结果表明FUS-ERG在体内和体外都经历了相分离。由于染色质结合,体内的FUS-ERG液滴偏离球形形态,类似于液体的润湿行为(Morin,J.A.et al.Surfacecondensation of a pioneer transcription factor on DNA.BioRxiv,311712(2020))。Taking into account the above observations, the inventors wanted to know whether FUS-ERG really undergoes phase separation in cells, and the resulting aggregates are confined to chromatin and deformed. Next, the inventors introduced point mutations into ERG to disrupt DNA binding. As expected, the mutant fusion protein (FUS-ERGmut) formed many spherical droplets in cells (5 (m)). In summary, these results indicate that FUS-ERG undergoes phase separation both in vivo and in vitro. Due to chromatin binding, FUS-ERG droplets in vivo deviate from a spherical morphology, similar to the wetting behavior of liquids (Morin, J.A.et al.Surface condensation of a pioneer transcription factor on DNA.BioRxiv, 311712 (2020)).

PS-DBD融合蛋白中普遍的异常相分离Pervasive anomalous phase separation in PS-DBD fusion proteins

本申请数据显示了超过1000个潜在的PS-DBD融合蛋白(图2)。为了研究这些融合导致的异常相分离是否比目前所了解的更普遍,本发明人随机选择了更多的融合蛋白进行进一步研究。The data of this application show more than 1000 potential PS-DBD fusion proteins (Figure 2). In order to study whether the abnormal phase separation caused by these fusions is more common than currently understood, the inventors randomly selected more fusion proteins for further study.

BRD9是一种含溴结构域的蛋白质,在染色质重塑和转录调控中发挥重要作用。BRD9在肝细胞癌中是TERT(端粒酶逆转录酶的转录)的融合伙伴。基于OptoIDR系统,BRD9IDR促进了相分离的发生(图7(a),图7(b))。此外,纯化的重组Alexa 488标记BRD9IDR蛋白(图8(a))形成具有流动性的球形液滴(图7(c),图7(d),图7(e))。因此,BRD9 IDR确实在体内和体外推动了相分离。BRD9 is a bromodomain-containing protein that plays an important role in chromatin remodeling and transcriptional regulation. BRD9 is a fusion partner of TERT (transcription of telomerase reverse transcriptase) in hepatocellular carcinoma. Based on the OptoIDR system, BRD9IDR promoted the occurrence of phase separation (Figure 7(a), Figure 7(b)). In addition, the purified recombinant Alexa 488-labeled BRD9IDR protein (Figure 8(a)) formed spherical droplets with fluidity (Figure 7(c), Figure 7(d), Figure 7(e)). Therefore, BRD9 IDR does promote phase separation in vivo and in vitro.

COL17A1(胶原类型XVIIα1链)是一种胶原跨膜蛋白,也是表皮锚定复合物的关键成分。COL17Al与胃腺癌中的CST复合物亚单位SNT1融合。CST复合物以序列无关的方式与单链DNA高亲和力结合。COL17A1 IDR还促进了OptoIDR系统中的相分离(图7(f),图7(g))。此外,纯化的重组COL17A1 IDR蛋白(图8(a))形成具有凝胶性质的聚集物(图7(f),图7(i),图7(j))。因此,COL17A1在体内和体外都能促进相分离。STN1-COL17A1的相分离潜能可能有助于其肿瘤发生。COL17A1 (collagen type XVII α1 chain) is a collagen transmembrane protein and a key component of the epidermal anchoring complex. COL17A1 is fused to the CST complex subunit SNT1 in gastric adenocarcinoma. The CST complex binds to single-stranded DNA with high affinity in a sequence-independent manner. COL17A1 IDR also promotes phase separation in the OptoIDR system (Figure 7(f), Figure 7(g)). In addition, the purified recombinant COL17A1 IDR protein (Figure 8(a)) forms aggregates with gel properties (Figure 7(f), Figure 7(i), Figure 7(j)). Therefore, COL17A1 can promote phase separation both in vivo and in vitro. The phase separation potential of STN1-COL17A1 may contribute to its tumorigenesis.

MLLT1(也称为ENL)是一种组蛋白乙酰化读取器,对AML中的细胞增殖和致癌转录调控特别需要。在白血病中经常发现MLLT1与MLL(也称为KMT2A)发生融合。MLLT1 IDR促进了OptoIDR系统中的相分离。此外,纯化的MLLT1 IDR蛋白(图8(a))形成了液体聚集物(图7(m),图7(o))。MLLT1 (also known as ENL) is a histone acetylation reader that is specifically required for cell proliferation and oncogenic transcriptional regulation in AML. MLLT1 is frequently found fused to MLL (also known as KMT2A) in leukemia. MLLT1 IDR promotes phase separation in the OptoIDR system. In addition, purified MLLT1 IDR protein (Figure 8(a)) formed liquid aggregates (Figure 7(m), Figure 7(o)).

SFPQ是前mRNA剪接因子。涉及SFPQ与TFE3融合的染色体畸变可能是乳头状肾细胞癌的原因。SFPQ IDR促进OptoIDR系统中的相分离(图7(p),图7(q))。纯化的EGFP-SFPQ IDR蛋白(图8(a))形成聚集物,并表现出类似液体的性质(图7(r),图7(s),图7(t))。MED15是肾细胞癌中TFE3的另一个融合伙伴。本申请的数据显示,MED15(图8(a))在体内和体外可形成具有液体性质的聚集物。SFPQ is a pre-mRNA splicing factor. Chromosomal aberrations involving the fusion of SFPQ and TFE3 may be the cause of papillary renal cell carcinoma. SFPQ IDR promotes phase separation in the OptoIDR system (Figure 7(p), Figure 7(q)). Purified EGFP-SFPQ IDR protein (Figure 8(a)) forms aggregates and exhibits liquid-like properties (Figure 7(r), Figure 7(s), Figure 7(t)). MED15 is another fusion partner of TFE3 in renal cell carcinoma. The data of this application show that MED15 (Figure 8(a)) can form aggregates with liquid properties in vivo and in vitro.

此外,本申请在Hela细胞中瞬时表达这些融合蛋白(图8(b))。由于DNA结合可能会扭曲相分离聚集物的球形形态,本申请还生成了每个PS-DBD的DBD突变版本(PSDBDmut)。本申请数据表明,这些融合蛋白可以在细胞中形成聚集物。对于TFE3融合蛋白,SFPQ/MED15-TFE3及其DBD突变形式在细胞中产生聚集物(图8(b))。这些数据表明,在肾细胞癌中发现的PS-TFE3融合蛋白可能在致癌过程中具有相分离依赖性机制。EWS-FLI1融合蛋白形成了一些类似于FUS-ERG的聚集物,而EWS-FLI在细胞中形成了许多液滴(图8(b))。另外两个EWS融合蛋白EWS-ERG和EWS-ATF1均显示明显的相分离液滴,且DBD无突变(图8(b))。NUP98融合蛋白在细胞中形成聚集物(图8(b))。In addition, the present application transiently expressed these fusion proteins in Hela cells (Figure 8(b)). Since DNA binding may distort the spherical morphology of phase-separated aggregates, the present application also generated a DBD mutant version (PSDBDmut) of each PS-DBD. The data of the present application show that these fusion proteins can form aggregates in cells. For TFE3 fusion proteins, SFPQ/MED15-TFE3 and its DBD mutant forms produce aggregates in cells (Figure 8(b)). These data suggest that the PS-TFE3 fusion protein found in renal cell carcinoma may have a phase separation-dependent mechanism in carcinogenesis. The EWS-FLI1 fusion protein formed some aggregates similar to FUS-ERG, while EWS-FLI formed many droplets in cells (Figure 8(b)). The other two EWS fusion proteins, EWS-ERG and EWS-ATF1, both showed obvious phase-separated droplets, and the DBD had no mutations (Figure 8(b)). The NUP98 fusion protein formed aggregates in cells (Figure 8(b)).

本发明人注意到一些PS-DBD融合蛋白,如PRCC-TFE3,没有经历相分离(图8(b))。然后,本发明人利用OptoIDR分析评估更多的PS蛋白候选物(图10(a),图10(b))。本发明人发现OptoHES4在蓝光刺激下形成了聚集物。但仍有一些OptoIDR修饰的蛋白质对蓝光没有反应,没有显示出明显的聚集物。本发明人推测这是因为这些蛋白质具有相对较弱的相分离能力或没有相分离行为,并且刺激强度不足以激活它。对于所有的OptoIDR结果,本发明人发现20%的蛋白质在刺激前显示聚集物,40%的OptoIDR修饰蛋白在蓝光刺激下发生相分离,40%的OptoIDR修饰蛋白对蓝光刺激没有反应(图10(c))。总的来说,这些结果表明,在癌症患者中发现的超过一半的PS-DBD蛋白倾向于发生相分离。异常的相分离可能放大或抑制下游信号通路,从而影响细胞生理。The inventors noticed that some PS-DBD fusion proteins, such as PRCC-TFE3, did not undergo phase separation (Figure 8(b)). The inventors then used OptoIDR analysis to evaluate more PS protein candidates (Figure 10(a), Figure 10(b)). The inventors found that OptoHES4 formed aggregates under blue light stimulation. But there are still some OptoIDR-modified proteins that do not respond to blue light and do not show obvious aggregates. The inventors speculate that this is because these proteins have relatively weak phase separation ability or no phase separation behavior, and the stimulation intensity is not enough to activate it. For all OptoIDR results, the inventors found that 20% of the proteins showed aggregates before stimulation, 40% of the OptoIDR-modified proteins phase separated under blue light stimulation, and 40% of the OptoIDR-modified proteins did not respond to blue light stimulation (Figure 10(c)). Overall, these results show that more than half of the PS-DBD proteins found in cancer patients tend to phase separate. Abnormal phase separation may amplify or inhibit downstream signaling pathways, thereby affecting cell physiology.

实施例2:TCGA数据的下游表达分析Example 2: Downstream expression analysis of TCGA data

为了验证PS-DBD融合促进下游靶基因转录,本发明人访问了融合基因包含在FusionGDB数据库中的TCGA患者。从UCSC Xena(Goldman,M.J.et al.Visualizing andinterpreting cancer genomics data via the xena platform.Nat.Biotechnol.38,675-678(2020))中获得所有TCGA肿瘤样本的转录组数据、基因表达TPM矩阵和相应的临床信息。然后,本发明人从enrichr文库中获得462个DBD蛋白的目标基因信息,这些信息来自三个数据源:(1)来自ENCODE和ChEA数据库的ChIP seq数据;(2)来自GEO数据库的经历扰动(过度表达或敲除)的DBD蛋白的转录组数据;以及(3)来自TRANSFAC和JASPAR数据库的DBDs的PWM结合矩阵。To verify that PS-DBD fusion promotes transcription of downstream target genes, the inventors visited TCGA patients whose fusion genes were included in the FusionGDB database. Transcriptome data, gene expression TPM matrix and corresponding clinical information of all TCGA tumor samples were obtained from UCSC Xena (Goldman, M.J. et al. Visualizing and interpreting cancer genomics data via the xena platform. Nat. Biotechnol. 38, 675-678 (2020)). Then, the inventors obtained target gene information of 462 DBD proteins from the enrichr library, which came from three data sources: (1) ChIP seq data from ENCODE and ChEA databases; (2) transcriptome data of DBD proteins that experienced perturbations (overexpression or knockout) from the GEO database; and (3) PWM binding matrices of DBDs from the TRANSFAC and JASPAR databases.

对于这462个DBD蛋白,其中55个与PS-DBD融合基因有关,27个TCGA样本含有这些融合基因。对于每个包含一个PS-DBD融合的样本,本发明人将其作为前景集,并选取没有融合的相同肿瘤类型的样本作为背景集。对于每个基因,本申请通过如下公式计算Z分数来规范其表达水平:Of these 462 DBD proteins, 55 were associated with PS-DBD fusion genes, and 27 TCGA samples contained these fusion genes. For each sample containing a PS-DBD fusion, the inventors used it as a foreground set and selected samples of the same tumor type without fusion as a background set. For each gene, the present application calculated the Z score to normalize its expression level using the following formula:

其中Xj表示含有PS-DBD融合基因的样本中基因j的表达水平。μ(Yj)表示背景样本中基因j表达水平的平均值,以及σ(Yj)表示标准偏差。然后使用GSEA算法评估基于该Z分数的PS-DBD融合基因的下游表达水平。以DBD蛋白的靶点作为基因集。Where Xj represents the expression level of gene j in samples containing PS-DBD fusion gene. μ( Yj ) represents the mean expression level of gene j in background samples, and σ( Yj ) represents the standard deviation. The GSEA algorithm was then used to evaluate the downstream expression level of PS-DBD fusion gene based on the Z score. The target of DBD protein was used as the gene set.

结果:result:

PS-DBD融合改变下游转录PS-DBD fusions alter downstream transcription

已知几种PS-DBD融合,如EWS-FLI1和NUP98-HOXA9,可改变下游转录并驱动肿瘤发生(Chong,S.et al.Imaging dynamic and selective low-complexity domaininteractions that control gene transcription.Science 361,eaar2555(2018);Zuo,L.Y.et al.Loci-specific phase separation of FET fusion oncoproteins promotesgene transcription.Nat.Commun.12,1491(2021);Wei,M.T.et al.Nucleatedtranscriptional condensates amplify gene expression.Nat.Cell Biol.22,1187-1196(2020);Terlecki-Zaniewicz,S.et al.Biomolecular condensation of nup98fusion proteins drives leukemogenic gene expression.Nat.Struct.Mol.Biol.28,190-201(2021))。为了确定PS-DBD融合导致的下游转录变化是否普遍,本申请分析了癌症基因组图谱(TCGA)中含有PS-DBD融合物的患者的转录组数据。116例含有PS-DBD融合物的患者数据从TCGA库中提取。PS-DBD融合物的直接靶点尚不明确。因此,DBD蛋白的靶基因被用来作为替代。DBD蛋白的靶基因从enrichr librarys(Xie,Z.et al.Gene set knowledgediscovery with enrichr.Curr.Protoc.1,e90(2021))中收集。Several PS-DBD fusions, such as EWS-FLI1 and NUP98-HOXA9, are known to alter downstream transcription and drive tumorigenesis (Chong, S. et al. Imaging dynamic and selective low-complexity domain interactions that control gene transcription. Science 361, eaar2555 (2018); Zuo, L. Y. et al. Loci-specific phase separation of FET fusion oncoproteins promotes gene transcription. Nat. Commun. 12, 1491 (2021); Wei, M. T. et al. Nucleated transcriptional condensates amplify gene expression. Nat. Cell Biol. 22, 1187-1196 (2020); Terlecki-Zaniewicz, S. et al. Biomolecular condensation of nup98 fusion proteins drives leukemogenic gene expression. Nat. Struct. Mol. Biol. 28, 190-201 (2021)). In order to determine whether the downstream transcriptional changes caused by PS-DBD fusion are common, this application analyzed the transcriptome data of patients with PS-DBD fusions in the Cancer Genome Atlas (TCGA). Data of 116 patients with PS-DBD fusions were extracted from the TCGA library. The direct target of PS-DBD fusions is not yet clear. Therefore, the target genes of DBD proteins are used as an alternative. The target genes of DBD proteins are collected from enrichr libraries (Xie, Z. et al. Gene set knowledge discovery with enrichr. Curr. Protoc. 1, e90 (2021)).

实施例3:化合物筛选Example 3: Compound Screening

对于体外筛选,使用了Anti-Cancer-Compound-Library Plus library Plus(来源于清华大学药学技术中心)。分析的最终浓度为10μM。对于后续的重新测试分析,从库中提取了10种化合物(LY2835219、CX-6258、EMD-1214063、Pelitinib(EKB-569)、GDC-0941、AZD-9291、UNC 0631、CHIR-124、SB743921和Ursolic acid)。LY2835219从APExBIO购回,作为DMSO中的20mM储备。其他化合物在DMSO中制备为10mM储备。所有化合物分析的最终浓度为10μM。For in vitro screening, the Anti-Cancer-Compound-Library Plus library Plus (from the Pharmaceutical Technology Center of Tsinghua University) was used. The final concentration analyzed was 10 μM. For subsequent retest analysis, 10 compounds were extracted from the library (LY2835219, CX-6258, EMD-1214063, Pelitinib (EKB-569), GDC-0941, AZD-9291, UNC 0631, CHIR-124, SB743921 and Ursolic acid). LY2835219 was purchased back from APExBIO as a 20 mM stock in DMSO. The other compounds were prepared as 10 mM stocks in DMSO. The final concentration of all compounds analyzed was 10 μM.

高内涵成像。使用DropScan分析法在PerkinElmer CellCarrier-384平板中筛选化合物对U2OS/mCherry-FUS-ERGmut细胞的影响。每个孔接种7000-8000个细胞,并培养过夜。然后用Labcyte ECHO 550Liquid Handle将化合物添加到细胞中,最终浓度为10μM。添加小分子后立即拍摄图像。使用配备40×NA 1.1水浸物镜的Opera Phenix高含量筛选系统捕获每个孔的两个视野。成像过程重复至少30次,或分析持续至少6小时。High-content imaging. Compounds were screened for their effects on U2OS/mCherry-FUS-ERGmut cells in PerkinElmer CellCarrier-384 plates using the DropScan assay. 7,000-8,000 cells were seeded per well and incubated overnight. Compounds were then added to the cells using a Labcyte ECHO 550Liquid Handle at a final concentration of 10 μM. Images were captured immediately after the addition of the small molecules. Two fields of view per well were captured using the Opera Phenix High-Content Screening System equipped with a 40× NA 1.1 water immersion objective. The imaging process was repeated at least 30 times or the analysis lasted at least 6 hours.

对于Lysotracker绿色染色图像,用DMSO或化合物处理U2OS和U2OS/mCherry-FUS-ERGmut细胞约4小时,然后用Lysotracer绿色进行染色(Invitrogen,Cat.L7526)。使用配备60×水浸物镜的Opera Phenix高含量筛选系统,捕获了每个孔的多个视野。For Lysotracker green staining images, U2OS and U2OS/mCherry-FUS-ERGmut cells were treated with DMSO or compounds for approximately 4 hours and then stained with Lysotracer green (Invitrogen, Cat. L7526). Multiple fields of view per well were captured using the Opera Phenix High Content Screening System equipped with a 60× water immersion objective.

图像量化和分析。对于高内涵显微镜拍摄的每个图像,本申请将其强度除以0.05,以生成新图像。对于每个图像,CellProfiler中的Otsu算法用于分割目标直径大小为70至250像素的细胞。为了描述液滴的特征,本申请通过CellProfiler中的Enhance Features模块增强了图像中的点状结构,目标直径设置为5像素。然后,使用鲁棒的背景算法检测直径为5至10像素的水滴,信号高于背景4个标准偏差。所识别的液滴与之前步骤中使用CellProfiler中的Relate Objects模块识别的细胞相关联。根据上述结果,得到了每个细胞的液滴数。Image quantification and analysis. For each image taken by high-content microscopy, the application divided its intensity by 0.05 to generate a new image. For each image, the Otsu algorithm in CellProfiler was used to segment cells with a target diameter size of 70 to 250 pixels. To describe the characteristics of droplets, the application enhanced the point structures in the image using the Enhance Features module in CellProfiler, with the target diameter set to 5 pixels. Then, a robust background algorithm was used to detect droplets with a diameter of 5 to 10 pixels, with a signal 4 standard deviations above the background. The identified droplets were associated with the cells identified in the previous step using the Relate Objects module in CellProfiler. Based on the above results, the number of droplets per cell was obtained.

然后,对于每个平板,可以获得初始时间点每个细胞的液滴数。将初始时间点的所有细胞与其液滴数进行排序,将顶部15%位置的液滴数用作定义液滴富集细胞的阈值。然后,对于板中的所有孔,随着时间的推移,监测液滴数超过液滴富集阈值的细胞比例。基于这些数据,推导了药物治疗后液滴富集细胞比例的时间序列。然后,本申请提出了一个指数模型,将液滴富集细胞的分数描述为药物治疗后时间的函数:Then, for each plate, the number of droplets per cell at the initial time point is available. All cells at the initial time point are sorted by their droplet number, and the droplet number at the top 15% position is used as the threshold to define droplet-enriched cells. Then, for all wells in the plate, the proportion of cells with droplet numbers exceeding the droplet enrichment threshold is monitored over time. Based on these data, a time series of the proportion of droplet-enriched cells after drug treatment is derived. Then, this application proposes an exponential model to describe the fraction of droplet-enriched cells as a function of time after drug treatment:

其中c0和c1是指数模型中需要回归的常数,t是小分子化合物的作用时间,Y(t)是每个孔中的富含聚集物细胞分数,其可以是基于单个视野的富含聚集物细胞分数,也可以是基于多个视野求平均的富含聚集物细胞分数。Where c0 and c1 are constants that need to be regressed in the exponential model, t is the action time of the small molecule compound, and Y(t) is the fraction of aggregate-rich cells in each well, which can be the fraction of aggregate-rich cells based on a single field of view or the fraction of aggregate-rich cells averaged over multiple fields of view.

最后,根据拟合模型计算了两个测量值。第一个是药物效应,定义为Y(t_max)/(0)。在这里Y(0)根据t=0的拟合模型计算,Y(t_max)根据t=t_max的拟合模型计算。第二个是R平方,它衡量上述指数模型的拟合优异性。Finally, two measures are calculated from the fitted model. The first is the drug effect, defined as Y(t_max)/(0). Here Y(0) is calculated from the fitted model at t=0, and Y(t_max) is calculated from the fitted model at t=t_max. The second is R-squared, which measures the goodness of fit of the exponential model above.

结果:result:

通过计算视觉辅助的高内涵筛选识别聚集物调节剂化合物Identification of aggregation modulator compounds by computational vision-assisted high-content screening

随着PS-DBD融合蛋白经历相分离并改变目标基因表达,本发明人试图识别调节相分离以逆转异常基因表达的小分子。报道的小分子筛选方案只比较了在任意结束时间点用不同分子处理的细胞图像,包括建立的阳性/阴性对照。使用这些方案时,会丢失有关聚集物变化的动力学信息。此外,孔之间的不均匀性会导致不可避免的不确定性和/或噪音。此外,相关的染色和清洗步骤进行起来很繁琐,常常会导致较大的系统误差。在这里,本申请设计了一种更有效的策略,称为DropScan,用于筛选调节小分子的聚集物。As the PS-DBD fusion protein undergoes phase separation and changes target gene expression, the inventors attempted to identify small molecules that regulate phase separation to reverse abnormal gene expression. Reported small molecule screening protocols only compare cell images treated with different molecules at any end time point, including established positive/negative controls. When using these protocols, kinetic information about aggregate changes is lost. In addition, the heterogeneity between holes can lead to inevitable uncertainty and/or noise. In addition, the related staining and washing steps are cumbersome to perform and often lead to large systematic errors. Here, the present application has designed a more effective strategy, called DropScan, for screening aggregates that regulate small molecules.

在DropScan中,本发明人在长期成像模式下使用高内涵成像筛选(图4(a)至图4(c))。具体而言,将稳定表达mCherry-FUS-ERGmut的U2OS细胞(U2OS/mCherry-FUS-ERGmut)接种到384孔板中。在将小分子化合物分配到每个孔中后,对每个孔的多个场进行成像,并在6小时内重复成像相同的场(图4(a)至图4(c),模型和成像)。通过这种方式,可以在整个成像过程中监测同一组场中细胞内聚集物行为的变化。在本研究中,在五个384孔板中筛选了1777种抗癌药物库(APExBIO Anti-Cancer Compound Library Plus),并为每种药物生成了时间序列图像。In DropScan, the inventors used high-content imaging screening in long-term imaging mode (Figures 4(a) to 4(c)). Specifically, U2OS cells stably expressing mCherry-FUS-ERGmut (U2OS/mCherry-FUS-ERGmut) were inoculated into 384-well plates. After small molecule compounds were distributed into each well, multiple fields of each well were imaged, and the same fields were repeatedly imaged within 6 hours (Figures 4(a) to 4(c), model and imaging). In this way, changes in the behavior of intracellular aggregates in the same set of fields can be monitored throughout the imaging process. In this study, a library of 1,777 anti-cancer drugs (APExBIO Anti-Cancer Compound Library Plus) was screened in five 384-well plates, and time series images were generated for each drug.

对于每个图像,使用Otsu和鲁棒背景算法分割细胞和液滴,并获得每个细胞的液滴数量(图4(a)至图4(c),图像处理和特征提取)。对于每种药物,随着时间的推移监测每个细胞的液滴数,并相应计算液滴数超过富集液滴阈值的细胞比例。然后拟合指数模型,将富含液滴的细胞分数描述为药物治疗时间的函数(图4(a)至图4(c),量化和命中识别)。For each image, cells and droplets were segmented using Otsu and robust background algorithms, and the number of droplets per cell was obtained (Figure 4(a) to Figure 4(c), image processing and feature extraction). For each drug, the number of droplets per cell was monitored over time, and the proportion of cells with droplet numbers exceeding the droplet-enriched threshold was calculated accordingly. An exponential model was then fitted to describe the fraction of cells enriched in droplets as a function of drug treatment time (Figure 4(a) to Figure 4(c), quantification and hit identification).

为了评估药物对液滴溶解的影响,本申请根据上述指数模型计算了两个测量值,药物效应和R平方(详细信息请参见材料和方法部分),并显示了1777种化合物的药物效应与R平方的散点图(图4(d))。其中包括LY2835219(也称为甲磺酸阿贝莫西利),它是一种选择性细胞周期素依赖性激酶(CDK)抑制剂,靶向细胞周期素D1/CDK4和细胞周期素D1/CDK6细胞周期途径。此外,LY2835219是一种有效的泛PIM激酶抑制剂CX-6258,以及两种表皮生长因子受体抑制剂EKB-569和AZD-9291,显示了有效的聚集物溶解(图11(a)至图11(j))。此外,一些命中化合物,如UNC 0631,可能通过触发细胞死亡来减少液滴数量(图11(a)至图11(j))。总之,本申请成功地使用DropScan识别了一组具有不同有效性的聚集物调节化合物。To evaluate the effect of drugs on droplet dissolution, the present application calculated two measurements, drug effect and R-squared, according to the above exponential model (see the Materials and Methods section for details), and displayed a scatter plot of drug effect and R-squared for 1777 compounds (Figure 4(d)). Among them is LY2835219 (also known as abetamoxiclib mesylate), which is a selective cyclin-dependent kinase (CDK) inhibitor that targets the cyclin D1/CDK4 and cyclin D1/CDK6 cell cycle pathways. In addition, LY2835219, a potent pan-PIM kinase inhibitor CX-6258, and two epidermal growth factor receptor inhibitors EKB-569 and AZD-9291, showed effective aggregate dissolution (Figures 11(a) to 11(j)). In addition, some hit compounds, such as UNC 0631, may reduce the number of droplets by triggering cell death (Figures 11(a) to 11(j)). In summary, the present application successfully used DropScan to identify a group of aggregation modulating compounds with different effectiveness.

实施例4:LY2835219的验证Example 4: Verification of LY2835219

荧光素酶分析。萤火虫荧光素酶报告基因在最小启动子中含有25个GGAA模体拷贝(由北京大学齐志实验室惠赠)。293T细胞与FUS-ERG或EWS-FLI1、25×GGAA萤火虫荧光素酶质粒和Renilla荧光素酶质粒(Promega,Cat.E6921)共转染。转染~24小时后,按照制造商关于荧光素酶分析系统的说明(Promega,Cat.E1960),进行双重荧光素酶分析,包括萤火虫荧光素酶和Renilla荧光素酶。对于每个实验,萤火虫荧光素酶的荧光强度由Renilla荧光素酶强度标准化,并与空载体的标准化强度进行比较。萤火虫荧光素酶的相对活性反映了被测TF在GGAA微卫星驱动启动子处的反式激活能力。每个实验条件重复三次。Luciferase assay. The firefly luciferase reporter gene contains 25 copies of the GGAA motif in a minimal promoter (kindly provided by Qi Zhi Laboratory, Peking University). 293T cells were co-transfected with FUS-ERG or EWS-FLI1, 25×GGAA firefly luciferase plasmid, and Renilla luciferase plasmid (Promega, Cat. E6921). After ~24 hours of transfection, a dual luciferase assay including firefly luciferase and Renilla luciferase was performed according to the manufacturer's instructions for the luciferase assay system (Promega, Cat. E1960). For each experiment, the fluorescence intensity of firefly luciferase was normalized by the Renilla luciferase intensity and compared with the normalized intensity of the empty vector. The relative activity of firefly luciferase reflects the transactivation ability of the tested TF at the GGAA microsatellite-driven promoter. Each experimental condition was repeated three times.

Western blotting。本研究中使用的抗体为Anti-mCherry(CST,Cat.43590)、Anti-GAPDH(Easybio,Cat.BE0023),F(ab’)2-Goat anti-Rabbit IgG(H+L)Cross-Adsorbed Secondary antibody,HRP(Invitrogen,Cat.A24537)、Goat anti-Mouse IgG(H+L)Secondary Antibody,HRP(Invitrogen,Cat.62-6520)。用7.5μM LY2835219处理所建立的U2OS/mCherry-FUS-ERGmut细胞,处理时间分别为0小时、2小时、4小时和6小时。使用MinuteTM动物培养细胞/组织总蛋白提取试剂盒从细胞样本中纯化蛋白质。蛋白质通过SDS-PAGE分离并转移到PVDF膜。将细胞膜与一级抗体孵育过夜,然后与相应的HRP标记的IgG孵育。最后,使用增强化学发光试剂对结果进行可视化。Western blotting. The antibodies used in this study were Anti-mCherry (CST, Cat.43590), Anti-GAPDH (Easybio, Cat.BE0023), F(ab')2-Goat anti-Rabbit IgG (H+L) Cross-Adsorbed Secondary antibody, HRP (Invitrogen, Cat.A24537), and Goat anti-Mouse IgG (H+L) Secondary Antibody, HRP (Invitrogen, Cat.62-6520). The established U2OS/mCherry-FUS-ERGmut cells were treated with 7.5 μM LY2835219 for 0 h, 2 h, 4 h, and 6 h, respectively. Proteins were purified from cell samples using the Minute TM Animal Culture Cell/Tissue Total Protein Extraction Kit. Proteins were separated by SDS-PAGE and transferred to PVDF membranes. The membranes were incubated with primary antibodies overnight, followed by incubation with corresponding HRP-labeled IgGs. Finally, the results were visualized using enhanced chemiluminescence reagents.

RT-qPCR。用7.5μM LY2835219处理已建立的U2OS/mCherry-EWS-FLI1细胞0小时、3小时和6小时。使用Trizol(Invitrogen,Cat.A33252)从细胞中纯化总RNA,并用Nanodrop定量。使用一步gDNA去除和cDNA合成SuperMix(TransGen,Cat.A T311-02)将1μg总RNA反转录为cDNA。使用1μl 1:4cDNA稀释液在BIO-RAD CFX实时PCR系统上用Green qPCR SuperMix(TransGen,Cat.AQ601-01)进行定量PCR。每个基因进行三次独立重复,每种治疗进行两次独立重复。使用GAPDH标准化数据,并计算每个目标基因的标准化倍数变化。RT-qPCR. Established U2OS/mCherry-EWS-FLI1 cells were treated with 7.5 μM LY2835219 for 0, 3, and 6 hours. Total RNA was purified from cells using Trizol (Invitrogen, Cat. A33252) and quantified using Nanodrop. One-step gDNA removal and cDNA synthesis SuperMix (TransGen, Cat. A T311-02) was used to reverse transcribe 1 μg of total RNA into cDNA. 1 μl of 1:4 cDNA dilution was used for PCR on the BIO-RAD CFX real-time PCR system. Quantitative PCR was performed using Green qPCR SuperMix (TransGen, Cat. AQ601-01). Three independent replicates were performed for each gene and two independent replicates were performed for each treatment. Data were normalized using GAPDH and the normalized fold change for each target gene was calculated.

RNA-seq。使用RNA测序(RNA-seq)进行全基因组基因表达分析。用7.5μMLY2835219处理已建立的U2OS细胞和A-673细胞,处理时间分别为0小时、3小时和6小时。在Trizol收集未处理的U2OS细胞和处理过的细胞样本,并将其交付给Annoroad GeneTechnology Company(中国北京)。用寡核苷酸(dT)珠富集总mRNA,然后用随机引物进行片段化和反转录。cDNA被纯化,其5'和3'端被修复并用接头连接。通过PCR扩增连接的cDNA,并使用Illumina Novaseq 6000 S4系统(Annoroad)进行150-nt双末端测序。RNA-seq. Whole-genome gene expression analysis was performed using RNA sequencing (RNA-seq). Established U2OS cells and A-673 cells were treated with 7.5 μM LY2835219 for 0 h, 3 h, and 6 h, respectively. Untreated U2OS cells and treated cell samples were collected in Trizol and delivered to Annoroad GeneTechnology Company (Beijing, China). Total mRNA was enriched with oligo(dT) beads, followed by fragmentation and reverse transcription with random primers. cDNA was purified, and its 5' and 3' ends were repaired and ligated with adapters. The ligated cDNA was amplified by PCR and subjected to 150-nt double-end sequencing using the Illumina Novaseq 6000 S4 system (Annoroad).

使用无比对量化软件Kallisto对有效数据文件进行量化,从Kallisto转录组索引网站下载从Ensembl参考转录组构建的预建索引。应用默认参数计算不同转录本的TPM值,并将一个基因的TPM数值计算为该基因所有转录物TPM值的总和。The valid data files were quantified using the unmatched quantification software Kallisto, and the pre-built index constructed from the Ensembl reference transcriptome was downloaded from the Kallisto transcriptome index website. The TPM values of different transcripts were calculated using default parameters, and the TPM value of a gene was calculated as the sum of the TPM values of all transcripts of the gene.

为了验证LY2835219是否抑制EWS-FLI1靶基因的表达,本申请计算了用7.5μMLY2835219处理0小时、3小时和6小时的U2OS细胞系中EWS-FLI靶基因表达水平相比于野生型U2OS细胞的倍数变化。对于A-673细胞,从CCLE数据库中提取了10个骨细胞系的靶基因表达水平作为对照。然后,计算了与对照表达水平相比,7.5μM L Y2835219处理0小时、3小时和6小时的A-673细胞系中EWS-FLI1靶基因表达水平的倍数变化。In order to verify whether LY2835219 inhibits the expression of EWS-FLI1 target genes, the present application calculated the fold change of EWS-FLI target gene expression levels in U2OS cell lines treated with 7.5 μM LY2835219 for 0 hours, 3 hours and 6 hours compared with wild-type U2OS cells. For A-673 cells, the target gene expression levels of 10 bone cell lines were extracted from the CCLE database as controls. Then, the fold change of EWS-FLI1 target gene expression levels in A-673 cell lines treated with 7.5 μM LY2835219 for 0 hours, 3 hours and 6 hours compared with the control expression level was calculated.

在排除LY2835219发挥功能不依赖于CDK4/6抑制剂活性实验中,所涉及的小分子化合物均来与小分子库APExBIO Anti-Cancer-Compound-Library Plus,在展示结果时直接取用了筛选时的小分子处理细胞图像。In the experiment to exclude that the function of LY2835219 is independent of the activity of CDK4/6 inhibitors, the small molecule compounds involved were all from the small molecule library APExBIO Anti-Cancer-Compound-Library Plus, and the cell images treated with small molecules during screening were directly used to display the results.

在LY2835219处理前后用溶酶体追踪法将报告细胞系染色为绿色,具体流程见实施例1中活细胞染色(Lysotracker Green对溶酶体的染色)的具体操作流程。The reporter cell line was stained green by lysosome tracking method before and after LY2835219 treatment. The specific process is shown in the specific operation process of live cell staining (lysosome staining by Lysotracker Green) in Example 1.

在研究LY2835219发挥功能的潜在机制时,使用的溶酶体抑制剂Baf-A1终浓度为200纳摩尔。When investigating the potential mechanism of LY2835219 function, the lysosomal inhibitor Baf-A1 was used at a final concentration of 200 nanomolar.

结果:result:

LY2835219通过激活溶酶体溶解聚集物LY2835219 dissolves aggregates by activating lysosomes

LY2835219在命中化合物中排名第一(图4(e))。本申请选择研究其对聚集物溶解影响的分子机制。考虑到LY2835219是一种CDK4/6抑制剂,本申请在文库中搜索了其他CDK4/6抑制剂,并发现了四种以上的抑制剂。其中,purvalanol B、palbociclibisesionate和LEE011未能溶解聚集物,而ON123300导致大量细胞死亡(图12)。这些数据表明,LY2835219通过CDK4//6独立机制诱导聚集物溶解。接下来,在1至10μM的浓度范围内测试了LY2835219。聚集物量量随时间呈剂量依赖性下降(图13(b)和图14(a))。LY2835219 ranks first among the hit compounds (Figure 4 (e)). This application chooses to study the molecular mechanism of its effect on aggregate dissolution. Considering that LY2835219 is a CDK4/6 inhibitor, this application searched for other CDK4/6 inhibitors in the library and found more than four inhibitors. Among them, purvalanol B, palbociclibisesionate and LEE011 failed to dissolve aggregates, while ON123300 caused a large number of cell deaths (Figure 12). These data indicate that LY2835219 induces aggregate dissolution through a CDK4//6 independent mechanism. Next, LY2835219 was tested in a concentration range of 1 to 10 μM. The amount of aggregates decreased in a dose-dependent manner over time (Figure 13 (b) and Figure 14 (a)).

本发明人注意到,经LY2835219处理的细胞含有许多空泡状结构(图11(a)和图12)。之前的一项研究报告称,用LY2835219处理过的细胞中,溶酶体膨胀形成液泡结构(Hino,H.et al.Abemaciclib induces atypical cell death in cancer cellscharacterized by formation of cytoplasmic vacuoles derived fromlysosomes.Cancer Sci.111,2132-2145(2020))。为了证实这一点,本申请在LY2835219处理前后用溶酶体追踪法将报告细胞系染色为绿色。与之前的研究一致,液泡结构与溶酶体共定位(图13(c)),溶酶体的强度显著增强(图13(c),图13(d))。这些结果与LY2835219处理导致溶酶体酸化的概念一致(Hino,H.et al.Abemaciclib induces atypical cell deathin cancer cells characterized by formation of cytoplasmic vacuoles derivedfrom lysosomes.Cancer Sci.111,2132-2145(2020))。The inventors noted that cells treated with LY2835219 contained many vacuolar structures (Figure 11 (a) and Figure 12). A previous study reported that in cells treated with LY2835219, lysosomes expanded to form vacuoles (Hino, H. et al. Abemaciclib induces atypical cell death in cancer cells characterized by formation of cytoplasmic vacuoles derived from lysosomes. Cancer Sci. 111, 2132-2145 (2020)). To confirm this, the present application used lysosome tracking to stain the reporter cell line green before and after LY2835219 treatment. Consistent with previous studies, vacuolar structures co-localized with lysosomes (Figure 13 (c)), and the intensity of lysosomes was significantly enhanced (Figure 13 (c), Figure 13 (d)). These results are consistent with the concept that LY2835219 treatment leads to lysosomal acidification (Hino, H. et al. Abemaciclib induces atypical cell death in cancer cells characterized by formation of cytoplasmic vacuoles derived from lysosomes. Cancer Sci. 111, 2132-2145 (2020)).

此外,U2OS细胞对LY2835219的反应也显示出溶酶体强度显著增强(图14(b))。因此,将溶酶体的特异性抑制剂巴弗洛霉素A1(Baf-A1)与LY2835219共同添加,以评估Baf-A1是否降低LY28305219的活性。本申请在使用LY2835219或Baf-A1单独处理或两者同时处理期间对细胞进行高内涵成像,并注意到在同时使用Baf-A1和LY2875219培养的细胞中没有形成空泡(图13(e))。此外,Baf-A1显著降低了LY2835219的溶解速度(图13(e))。In addition, the response of U2OS cells to LY2835219 also showed a significant increase in lysosomal intensity (Figure 14 (b)). Therefore, the specific inhibitor of lysosomes, bafilomycin A1 (Baf-A1), was added together with LY2835219 to evaluate whether Baf-A1 reduces the activity of LY28305219. The present application performed high-content imaging of cells during treatment with LY2835219 or Baf-A1 alone or both, and noted that no vacuoles were formed in cells cultured with Baf-A1 and LY2875219 (Figure 13 (e)). In addition, Baf-A1 significantly reduced the dissolution rate of LY2835219 (Figure 13 (e)).

本申请进一步将溶酶体报告基因YFP-LAMP1(溶酶体相关膜蛋白1)导入融合细胞系,并在小分子治疗下进行长期成像。在本试验中,本发明人发现在小分子处理之前,mCherry-FUS-ERGmut聚集物与溶酶体高度共定位。LY2835219增加了这种共定位,溶解了聚集物,并显著扩大了溶酶体(图13(f))。综上所述,这些数据表明,LY2835219可能通过激活溶酶体酸化,部分溶解细胞中的融合蛋白聚集物。The present application further introduces the lysosomal reporter gene YFP-LAMP1 (lysosomal associated membrane protein 1) into the fusion cell line and performs long-term imaging under small molecule treatment. In this experiment, the inventors found that before small molecule treatment, mCherry-FUS-ERGmut aggregates were highly colocalized with lysosomes. LY2835219 increased this colocalization, dissolved aggregates, and significantly enlarged lysosomes (Figure 13 (f)). In summary, these data indicate that LY2835219 may partially dissolve fusion protein aggregates in cells by activating lysosomal acidification.

接下来,评估了LY2835219在溶解聚集物方面的特异性。本申请测试了LY2835219对许多相分离倾向蛋白:EWS-FLI1、EWS-FLI1mut、NUP98-HOXA9mut、NUP98HOXD13mut、FUS的N端(aa 1~212)、NUP98的N端(aa 1~515)和全长TDP43产生的聚集物的影响。LY2835219处理导致所有四种PS-DBD融合聚集物(图13(g))溶解,但三种非融合蛋白(图13(h))没有溶解。Next, the specificity of LY2835219 in dissolving aggregates was evaluated. The present application tested the effect of LY2835219 on aggregates produced by many phase separation prone proteins: EWS-FLI1, EWS-FLI1mut, NUP98-HOXA9mut, NUP98HOXD13mut, the N terminus of FUS (aa 1-212), the N terminus of NUP98 (aa 1-515), and full-length TDP43. LY2835219 treatment resulted in the dissolution of all four PS-DBD fusion aggregates (Figure 13 (g)), but the three non-fusion proteins (Figure 13 (h)) were not dissolved.

逆转异常相分离挽救异常基因表达Reversing abnormal phase separation rescues aberrant gene expression

上述结果表明,LY2835219能有效溶解FET-ETS液滴。本申请进一步探讨了LY2835219介导的液滴溶解是否能够逆转融合物靶基因的异常表达。本申请发现,相对荧光素酶活性随时间而降低,在治疗12小时后降低50%(图15(a))。The above results show that LY2835219 can effectively dissolve FET-ETS droplets. The present application further explores whether LY2835219-mediated droplet dissolution can reverse the abnormal expression of the fusion target gene. The present application found that the relative luciferase activity decreased over time and decreased by 50% after 12 hours of treatment (Figure 15 (a)).

为了了解LY2835219治疗是否会降低EWSFLI1靶基因的表达,本申请在用LY2835219治疗前后对U2OS/mCherry-ESW-FLI1细胞进行了RNA测序。EWS-FLI1的假定靶基因是从先前发布的转录组数据集中提取的(Boulay,G.et al.Cancer-specificretargeting of baf complexes by a prion-like domain.Cell 171,163-178(2017))。简而言之,在EWS-FLI过表达细胞中表达水平同时显著高于FLI1过表达细胞和对照细胞的356个基因被视为EWS-FLI1靶基因(图15(b))。通过比较LY2835219处理前后356个EWS-FLI1靶基因的表达水平,本发明发现处理3小时后有26个靶基因显著下调,只有2个靶基因上调(图15(c))。本申请进一步计算了LY2835219处理前后表达EWS-FLI1的细胞与野生型细胞的相对表达水平。与上述发现一致,LY2835219治疗后EWS-FLI1靶基因的相对表达水平显著降低(图15(d))。In order to understand whether LY2835219 treatment will reduce the expression of EWSFLI1 target genes, the present application performed RNA sequencing on U2OS/mCherry-ESW-FLI1 cells before and after treatment with LY2835219. The putative target genes of EWS-FLI1 were extracted from a previously published transcriptome dataset (Boulay, G. et al. Cancer-specific retargeting of baf complexes by a prion-like domain. Cell 171, 163-178 (2017)). In short, 356 genes whose expression levels in EWS-FLI overexpressing cells were significantly higher than those in FLI1 overexpressing cells and control cells were considered as EWS-FLI1 target genes (Figure 15 (b)). By comparing the expression levels of 356 EWS-FLI1 target genes before and after LY2835219 treatment, the present invention found that 26 target genes were significantly downregulated after 3 hours of treatment, and only 2 target genes were upregulated (Figure 15 (c)). The present application further calculated the relative expression levels of EWS-FLI1 expressing cells and wild-type cells before and after LY2835219 treatment. Consistent with the above findings, the relative expression levels of EWS-FLI1 target genes were significantly reduced after LY2835219 treatment (Figure 15(d)).

本申请还进行了实时荧光定量聚合酶链式反应(RT-qPCR),以验证LY2835219治疗后EWS-FLI1靶基因的下调模式。LY2835219处理后,大多数受试基因的表达水平降低,且此过程具有时间依赖性(图15(e))。总的来说,荧光素酶、RT qPCR和RNA测序数据表明,LY2835219可以挽救融合癌蛋白诱导的异常基因表达。本申请的结果表明,LY2835219可能是治疗这些FET融合相关尤因肉瘤或其他癌症的潜在药物。这与与之前的研究一致,LY2835219在临床前尤因肉瘤模型中显示出多方面的抗肿瘤作用(Dowless,M.etal.Abemaciclib is active in preclinical models of ewing sarcoma viamultipronged regulation of cell cycle,DNA methylation,and interferon pathwaysignaling.Clin.Cancer Res.24,6028-6039(2018))。本申请的数据揭示了LY2835219对尤因肉瘤有益作用的可能分子机制。The present application also performed real-time fluorescence quantitative polymerase chain reaction (RT-qPCR) to verify the downregulation pattern of EWS-FLI1 target genes after LY2835219 treatment. After LY2835219 treatment, the expression levels of most tested genes decreased, and this process was time-dependent (Figure 15(e)). Overall, the luciferase, RT qPCR and RNA sequencing data showed that LY2835219 can rescue abnormal gene expression induced by fusion oncoproteins. The results of the present application suggest that LY2835219 may be a potential drug for the treatment of these FET fusion-related Ewing sarcomas or other cancers. This is consistent with previous studies, LY2835219 showed multifaceted anti-tumor effects in preclinical Ewing sarcoma models (Dowless, M. et al. Abemaciclib is active in preclinical models of ewing sarcoma via multipronged regulation of cell cycle, DNA methylation, and interferon pathway signaling. Clin. Cancer Res. 24, 6028-6039 (2018)). The data of this application reveal the possible molecular mechanism of the beneficial effects of LY2835219 on Ewing sarcoma.

在许多癌细胞中,基因组和转录组编码具有相分离倾向结构域和DBD的融合蛋白。这些融合蛋白可以在染色质上进行相分离,作为新的转录因子,改变下游靶基因的表达。异常转录组通常与癌症有关。相分离调节药物可以干扰聚集物并逆转异常的转录模式。这些药物有望作为融合蛋白相关癌症的治疗药物(图15(f))。In many cancer cells, the genome and transcriptome encode fusion proteins with phase separation-prone domains and DBDs. These fusion proteins can phase separate on chromatin and act as novel transcription factors, altering the expression of downstream target genes. Abnormal transcriptomes are often associated with cancer. Phase separation-regulating drugs can interfere with aggregates and reverse abnormal transcription patterns. These drugs are expected to be therapeutic drugs for fusion protein-related cancers (Figure 15(f)).

附表1Schedule 1

附表1中KMT2A-MLLT1即为前文中MLL-MLLT1;附表1中EWSR1即为前文中EWS。KMT2A-MLLT1 in Appendix 1 is the MLL-MLLT1 mentioned above; EWSR1 in Appendix 1 is the EWS mentioned above.

Claims (47)

1.一种筛选肿瘤病症的药物的方法,其特征在于,包括:1. A method for screening drugs for tumor diseases, comprising: 将包含靶融合蛋白的细胞接种到包括孔阵列的孔板中,其中,所述细胞包含编码靶融合蛋白的核酸分子,所述靶融合蛋白与目标肿瘤病症相关联,包含相分离倾向结构域(PS)和DNA结合结构域(DBD)且进一步包含标记分子,以便能够形成可供成像识别的聚集物;Inoculating cells containing a target fusion protein into a well plate comprising a well array, wherein the cells contain a nucleic acid molecule encoding a target fusion protein, the target fusion protein being associated with a target tumor disorder, containing a phase separation prone domain (PS) and a DNA binding domain (DBD) and further containing a marker molecule so as to be able to form aggregates that can be identified by imaging; 将多种小分子化合物分配到各个孔中;Dispense a variety of small molecule compounds into individual wells; 利用显微成像系统,对各个孔以数个不同的视野在预定时间段内持续拍摄,以生成各个孔的数个视野的时间序列图像;Using a microscopic imaging system, continuously photographing each well in a predetermined period of time with a plurality of different fields of view, so as to generate time series images of the plurality of fields of view of each well; 由处理器,基于各个孔的至少一个视野的时间序列图像,来分析各个孔的细胞内聚集物的量衰减状况;The processor analyzes the attenuation of the amount of intracellular aggregates in each well based on the time series images of at least one field of view of each well; 由所述处理器,筛选出细胞内聚集物的量的衰减状况满足预定条件的孔,使用向所筛选出的孔分配的小分子化合物作为目标肿瘤病症的候选药物。The processor selects wells whose attenuation conditions of the amount of intracellular aggregates meet predetermined conditions, and uses the small molecule compounds allocated to the selected wells as candidate drugs for the target tumor disease. 2.根据权利要求1所述的方法,其特征在于,所述靶融合蛋白与目标肿瘤病症的发生相关联,或者导致目标肿瘤病症的发生。2. The method according to claim 1, characterized in that the target fusion protein is associated with the occurrence of the target tumor condition, or causes the occurrence of the target tumor condition. 3.根据权利要求1所述的方法,其特征在于,所述靶融合蛋白在细胞中经历相分离,且所经历的相分离及其导致形成的聚集物与目标肿瘤病症的发生相关联。3. The method according to claim 1, characterized in that the target fusion protein undergoes phase separation in the cell, and the phase separation and the aggregates formed thereby are associated with the occurrence of the target tumor condition. 4.根据权利要求1所述的方法,其特征在于,所述目标肿瘤病症包括恶性肿瘤。The method of claim 1 , wherein the target tumor condition comprises a malignant tumor. 5.根据权利要求4所述的方法,其特征在于,所述靶融合蛋白所经历的相分离驱动异常基因表达,所述聚集物的量的减少与异常基因表达的逆转相关联。5. The method of claim 4, wherein the phase separation experienced by the target fusion protein drives abnormal gene expression, and a reduction in the amount of the aggregates is associated with a reversal of the abnormal gene expression. 6.根据权利要求1所述的方法,其特征在于,所述靶融合蛋白的DNA结合结构域包含突变,使得所形成的聚集物的成像识别性相较DNA结合结构域不包含突变所形成的聚集物的成像识别性更显著。6. The method according to claim 1 is characterized in that the DNA binding domain of the target fusion protein contains a mutation, so that the imaging recognition of the formed aggregates is more significant than the imaging recognition of the aggregates formed when the DNA binding domain does not contain the mutation. 7.根据权利要求1所述的方法,其特征在于,所述聚集物呈现为液滴或者凝胶状。7. The method according to claim 1, characterized in that the aggregates are in the form of droplets or gel. 8.根据权利要求1所述的方法,其特征在于,对各个孔以数个不同的视野在预定时间段内持续拍摄具体包括:8. The method according to claim 1, characterized in that continuously photographing each hole with a plurality of different fields of view within a predetermined time period specifically comprises: 将每个孔划分为数个独立的视野;Divide each well into several independent fields of view; 对于每个孔,依序拍摄每个视野的图像,以得到每个视野在预定时间段内的时间序列图像。For each well, images of each field of view are taken sequentially to obtain a time series image of each field of view within a predetermined time period. 9.根据权利要求8所述的方法,其特征在于,基于各个孔的至少一个视野的时间序列图像,来分析各个孔的细胞内聚集物的量衰减状况具体包括,对于各个孔:9. The method according to claim 8, characterized in that the step of analyzing the attenuation of the amount of intracellular aggregates in each well based on the time series images of at least one field of view of each well specifically comprises, for each well: 对于每个视野的所述时间序列图像的各个采样时间点的图像,执行细胞和聚集物的分割,并确定各个细胞的聚集物量;For each image of each sampling time point of the time series image of each field of view, performing segmentation of cells and aggregates, and determining the amount of aggregates of each cell; 对于每个视野的各个时间点的图像,确定聚集物量超过富集聚集物阈值的细胞所占比例,并基于视野求平均后作为该孔在各个采样时间点的富含聚集物细胞分数,从而得到该孔在所述预定时间段内的富含聚集物细胞分数的分布曲线;For the images at each time point in each field of view, the proportion of cells whose aggregate amount exceeds the aggregate enrichment threshold is determined, and the fraction of aggregate-rich cells in the well at each sampling time point is averaged based on the field of view, thereby obtaining a distribution curve of the fraction of aggregate-rich cells in the well within the predetermined time period; 基于该孔在所述预定时间段内的富含聚集物细胞分数的分布曲线,拟合指数模型;fitting an exponential model based on the distribution curve of the fraction of aggregate-rich cells for the well over the predetermined time period; 基于拟合的指数模型,来计算拟合效果参数和富含聚集物细胞分数的衰减比率,Based on the fitted exponential model, the fitted effect parameter and the decay ratio of the aggregate-rich cell fraction were calculated. 其中,所述预定条件包括富含聚集物细胞分数的衰减比率达到第一预定条件且拟合效果参数优于第二阈值。The predetermined conditions include that the decay ratio of the aggregate-rich cell fraction reaches a first predetermined condition and the fitting effect parameter is better than a second threshold. 10.根据权利要求8所述的方法,其特征在于,基于各个孔的至少一个视野的时间序列图像,来分析各个孔的细胞内聚集物的量衰减状况具体包括,对于各个孔:10. The method according to claim 8, characterized in that the step of analyzing the attenuation of the amount of intracellular aggregates in each well based on the time series images of at least one field of view of each well specifically comprises, for each well: 对于每个视野的所述时间序列图像在各个采样时间点的图像,执行细胞和聚集物的分割,并确定各个细胞的聚集物量;For the images of the time series images of each field of view at each sampling time point, performing segmentation of cells and aggregates, and determining the amount of aggregates of each cell; 对于单个视野的各个采样时间点的图像,确定聚集物量超过富集聚集物阈值的细胞所占比例,作为该孔在各个采样时间点的富含聚集物细胞分数,从而得到该孔在所述预定时间段内的富含聚集物细胞分数的分布曲线;For the images at each sampling time point of a single field of view, determining the proportion of cells whose aggregate amount exceeds the aggregate enrichment threshold, as the aggregate-rich cell fraction of the well at each sampling time point, thereby obtaining a distribution curve of the aggregate-rich cell fraction of the well within the predetermined time period; 基于该孔在所述预定时间段内的富含聚集物细胞分数的分布曲线,拟合指数模型;fitting an exponential model based on the distribution curve of the fraction of aggregate-rich cells for the well over the predetermined time period; 基于拟合的指数模型,来计算拟合效果参数和富含聚集物细胞分数的衰减比率,Based on the fitted exponential model, the fitted effect parameter and the decay ratio of the aggregate-rich cell fraction were calculated. 其中,所述预定条件包括富含聚集物细胞分数的衰减比率达到第一预定条件且拟合效果参数优于第二阈值。The predetermined conditions include that the decay ratio of the aggregate-rich cell fraction reaches a first predetermined condition and the fitting effect parameter is better than a second threshold. 11.根据权利要求9或10所述的方法,其特征在于,拟合效果参数为R平方。11. The method according to claim 9 or 10, characterized in that the fitting effect parameter is R square. 12.根据权利要求9或10所述的方法,其特征在于,所述第二阈值为0.2-1.0。12. The method according to claim 9 or 10, characterized in that the second threshold is 0.2-1.0. 13.根据权利要求9或10所述的方法,其特征在于,所述第一预定条件为衰减比率达到0.2且按照衰减比率大小顺序排列的前预定数量个,所述衰减比率定义为利用拟合的指数模型计算的衰减后的富含聚集物细胞分数与初始的富含聚集物细胞分数的比率。13. The method according to claim 9 or 10 is characterized in that the first predetermined condition is the first predetermined number of cells whose attenuation ratio reaches 0.2 and are arranged in order of attenuation ratio, and the attenuation ratio is defined as the ratio of the aggregate-rich cell fraction after attenuation calculated using a fitted exponential model to the initial aggregate-rich cell fraction. 14.根据权利要求1-13中任何一项所述的方法,其特征在于,还包括:人为或者自动地识别各个孔中的细胞的存活状况,确定存活状况劣于预定存活条件的孔;将向该孔分配到的小分子化合物从目标肿瘤病症的候选药物中排除。14. The method according to any one of claims 1-13 is characterized in that it also includes: manually or automatically identifying the survival status of the cells in each well, determining the wells whose survival status is worse than the predetermined survival condition; and excluding the small molecule compound allocated to the well from the candidate drugs for the target tumor disease. 15.根据权利要求14所述的方法,其特征在于,自动地识别各个孔中的细胞的存活状况具体包括:15. The method according to claim 14, wherein automatically identifying the survival status of cells in each well specifically comprises: 利用图像处理算法,确定细胞的形态和细胞数量的减少率中的至少一种;Determine at least one of the morphology of the cells and the rate of reduction of the number of cells using an image processing algorithm; 在所识别的细胞的形态相较球形的偏离度高于偏离度阈值和/或细胞数量的减少率高于减少率阈值的情况下,将细胞识别为存活状况劣于预定存活条件。When the deviation of the morphology of the identified cells from a spherical shape is higher than a deviation threshold and/or the reduction rate of the number of cells is higher than a reduction rate threshold, the cells are identified as having a survival condition worse than a predetermined survival condition. 16.根据权利要求9或10所述的方法,其特征在于,所述指数模型由如下的公式(1)来定义:16. The method according to claim 9 or 10, characterized in that the exponential model is defined by the following formula (1): 其中c0和c1是指数模型中需要回归的常数,t是小分子化合物的作用时间,Y(t)是每个孔中的富含聚集物细胞分数。where c0 and c1 are constants to be regressed in the exponential model, t is the exposure time of the small molecule compound, and Y(t) is the fraction of aggregate-rich cells in each well. 17.根据权利要求9或10所述的方法,其特征在于,对各个图像执行细胞和聚集物的分割具体包括:17. The method according to claim 9 or 10, characterized in that performing segmentation of cells and aggregates on each image specifically comprises: 利用第一算法来分割细胞;using a first algorithm to segment cells; 识别图像中的点状结构,并以聚集物的代表尺寸作为目标尺寸执行增强;Identify point structures in the image and perform enhancement using the representative size of the aggregate as the target size; 利用第二算法,分割出强度值比背景的代表强度值高出第三阈值的聚集物。Using the second algorithm, aggregates whose intensity values are higher than the representative intensity value of the background by a third threshold are segmented. 18.根据权利要求17所述的方法,其特征在于,所述第一算法自动地确定用于分割细胞前景和细胞背景的强度阈值且据此进行细胞分割,所述第二算法自动地确定聚合物背景的鲁棒性的代表强度值,所述第三阈值为背景的代表强度值的标准偏差的3-5倍。18. The method according to claim 17 is characterized in that the first algorithm automatically determines the intensity threshold for segmenting cell foreground and cell background and performs cell segmentation based on it, the second algorithm automatically determines the representative intensity value of the robustness of the polymer background, and the third threshold is 3-5 times the standard deviation of the representative intensity value of the background. 19.根据权利要求17所述的方法,其特征在于,所述第一算法包括Otsu算法,所述第二算法包括鲁棒背景算法。19. The method according to claim 17, characterized in that the first algorithm comprises an Otsu algorithm, and the second algorithm comprises a robust background algorithm. 20.根据权利要求17所述的方法,其特征在于,确定各个细胞的聚集物量具体包括:将分割出的聚集物与分割出的细胞相关联,以得到各个细胞的聚集物量。20. The method according to claim 17 is characterized in that determining the amount of aggregates of each cell specifically comprises: associating the segmented aggregates with the segmented cells to obtain the amount of aggregates of each cell. 21.根据权利要求9或10所述的方法,其特征在于,所述拍摄和所述分析经由高内涵细胞成像分析系统来执行。21. The method according to claim 9 or 10, characterized in that the photographing and the analysis are performed via a high-content cell imaging analysis system. 22.根据权利要求1-13中任何一项所述的方法,其特征在于,还包括:在目标肿瘤病症为与FET融合相关的肿瘤、NUP98或MLL融合相关的血液病、TFE3融合相关的肾细胞癌、和HMGA2融合相关的脂肪瘤的情况下,选择衍生自FET-ETS融合蛋白家族的成员作为所述靶融合蛋白。22. The method according to any one of claims 1-13 is characterized in that it also includes: when the target tumor condition is a tumor associated with FET fusion, a blood disease associated with NUP98 or MLL fusion, a renal cell carcinoma associated with TFE3 fusion, and a lipoma associated with HMGA2 fusion, selecting a member derived from the FET-ETS fusion protein family as the target fusion protein. 23.根据权利要求22所述的方法,其特征在于,在目标肿瘤病症为与FET-ETS融合相关的肿瘤病症,包括尤文氏肉瘤(EWS-FLI1)、间皮细胞瘤(FUS-ATF1)、恶性黑色素瘤(EWS-ATF1)、促纤维增生性小圆细胞肿瘤(EWS-ERG)和白血病(FUS-ERG)的至少一种,的情况下,所述目标肿瘤病症的候选药物包括如下的至少一种:23. The method of claim 22, wherein, in the case where the target tumor disorder is a tumor disorder associated with FET-ETS fusion, including at least one of Ewing's sarcoma (EWS-FLI1), mesothelioma (FUS-ATF1), malignant melanoma (EWS-ATF1), desmoplastic small round cell tumor (EWS-ERG) and leukemia (FUS-ERG), the candidate drug for the target tumor disorder comprises at least one of the following: LY2835219(甲磺酸阿贝莫西利),其结构式为CHIR-124,其结构式为EMD-1214063,其结构式为CX-6258,其结构式为Pelitinib(EKB-569),其结构式为LY2835219 (Abetamoxiclib mesylate), the structural formula of which is CHIR-124, whose structural formula is EMD-1214063, whose structural formula is CX-6258, whose structural formula is Pelitinib (EKB-569), whose structural formula is AZD-9291,其结构式为AZD-9291, whose structural formula is GDC-0941,其结构式为GDC-0941, whose structural formula is SB743921,其结构式为 SB743921, whose structural formula is 24.根据权利要求23所述的方法,其特征在于,所述靶融合蛋白包括mCherry-FUS-ERGmut或者mCherry-EWS-FLI1mut,所述细胞选自U2OS细胞和293T细胞,24. The method according to claim 23, characterized in that the target fusion protein comprises mCherry-FUS-ERGmut or mCherry-EWS-FLI1mut, and the cells are selected from U2OS cells and 293T cells, mCherry-FUS-ERGmut的氨基酸序列如SEQ ID NO:1所示,The amino acid sequence of mCherry-FUS-ERGmut is shown in SEQ ID NO: 1. mCherry-EWS-FLI1mut的氨基酸序列如如SEQ ID NO:2所示。The amino acid sequence of mCherry-EWS-FLI1mut is shown in SEQ ID NO:2. 25.根据权利要求23所述的方法,其特征在于,所述LY2835219作为FET融合相关的尤因肉瘤的候选药物。25. The method of claim 23, wherein LY2835219 is used as a drug candidate for FET fusion-associated Ewing sarcoma. 26.根据权利要求4所述的方法,其特征在于,所述靶融合蛋白和目标恶性肿瘤的对应关系包括如下至少一种:26. The method according to claim 4, characterized in that the correspondence between the target fusion protein and the target malignant tumor includes at least one of the following: 所述靶融合蛋白包含NUP98-HOXA9,所述目标恶性肿瘤为急性髓细胞白血病;The target fusion protein comprises NUP98-HOXA9, and the target malignant tumor is acute myeloid leukemia; 所述靶融合蛋白包含SFPQ-TFE3、MED15-TFE3和/或PRCC-TFE3,所述目标恶性肿瘤为肾细胞癌;The target fusion protein comprises SFPQ-TFE3, MED15-TFE3 and/or PRCC-TFE3, and the target malignant tumor is renal cell carcinoma; 所述靶融合蛋白包含BRD9-IDR或BRD9-TERT,所述目标恶性肿瘤为肝细胞癌;The target fusion protein comprises BRD9-IDR or BRD9-TERT, and the target malignant tumor is hepatocellular carcinoma; 所述靶融合蛋白包含TRIO-TERT,所述目标恶性肿瘤为脂肪肉瘤;The target fusion protein comprises TRIO-TERT, and the target malignant tumor is liposarcoma; 所述靶融合蛋白包含STN1-COL17A1,所述目标恶性肿瘤为胃腺癌;The target fusion protein comprises STN1-COL17A1, and the target malignant tumor is gastric adenocarcinoma; 所述靶融合蛋白包含MLL-MLLT1,所述目标恶性肿瘤为白血病;The target fusion protein comprises MLL-MLLT1, and the target malignant tumor is leukemia; 所述靶融合蛋白包含SFPQ-TFE3,所述目标恶性肿瘤为乳头状肾细胞癌。The target fusion protein comprises SFPQ-TFE3, and the target malignant tumor is papillary renal cell carcinoma. 27.根据权利要求1所述的方法,其特征在于,还包括:对于目标肿瘤病症,使用治疗除了肿瘤病症或者恶性肿瘤以外的其他病症的药物作为要分配到各个孔中的所述小分子化合物以进行筛选。27. The method according to claim 1, further comprising: for a target tumor condition, using a drug for treating a condition other than the tumor condition or malignant tumor as the small molecule compound to be distributed into each well for screening. 28.根据权利要求1所述的方法,其特征在于,还包括:对于目标恶性肿瘤病症,使用治疗除了所述目标恶性肿瘤病症以外的其他恶性肿瘤病症的药物作为要分配到各个孔中的所述小分子化合物以进行筛选。28. The method according to claim 1, further comprising: for a target malignant tumor condition, using a drug for treating other malignant tumor conditions other than the target malignant tumor condition as the small molecule compound to be dispensed into each well for screening. 29.根据权利要求28所述的方法,其特征在于,所述目标恶性肿瘤包括肉瘤,所述其他恶性肿瘤病症包括乳腺癌、白血病、肾癌、肝癌、胃癌和甲状腺癌中的任何一种。29. The method of claim 28, wherein the target malignant tumor comprises a sarcoma, and the other malignant tumor conditions comprise any one of breast cancer, leukemia, kidney cancer, liver cancer, gastric cancer, and thyroid cancer. 30.根据权利要求27所述的方法,其特征在于,所述其他病症与所述靶融合蛋白相关联,或者与所述靶融合蛋白的异常相分离相关联。30. The method of claim 27, wherein the other disease is associated with the target fusion protein, or is associated with abnormal phase separation of the target fusion protein. 31.根据权利要求28所述的方法,其特征在于,所述其他恶性肿瘤病症与所述靶融合蛋白相关联,或者与所述靶融合蛋白的异常相分离相关联。31. The method of claim 28, wherein the other malignant tumor condition is associated with the target fusion protein, or is associated with abnormal phase separation of the target fusion protein. 32.一种计算机程序产品,其包含能够存储于和/或下载到非暂时性存储介质上的计算机可执行指令,其特征在于,所述计算机可执行指令由处理器执行时,执行一种筛选肿瘤病症的药物的方法,包括:32. A computer program product comprising computer executable instructions that can be stored and/or downloaded to a non-transitory storage medium, wherein when the computer executable instructions are executed by a processor, a method for screening a drug for a tumor condition is performed, comprising: 获取包括孔阵列的孔板的各个孔的数个视野的时间序列图像,所述时间序列图像通过在执行如下步骤后对各个孔以数个不同的视野在预定时间段内持续拍摄而生成:将包含靶融合蛋白的细胞接种到包括孔阵列的孔板中,其中,所述细胞包含编码靶融合蛋白的核酸分子,所述靶融合蛋白与目标肿瘤病症相关联,包含相分离倾向结构域(PS)和DNA结合结构域(DBD)且进一步包含标记分子,以便能够形成可供成像识别的聚集物;将多种小分子化合物分配到各个孔中;Acquiring time series images of several fields of view of each well of a well plate including a well array, the time series images being generated by continuously photographing each well in several different fields of view within a predetermined time period after performing the following steps: inoculating cells containing a target fusion protein into the well plate including a well array, wherein the cells contain a nucleic acid molecule encoding the target fusion protein, the target fusion protein is associated with a target tumor disorder, contains a phase separation prone domain (PS) and a DNA binding domain (DBD) and further contains a marker molecule so as to be able to form aggregates that can be identified by imaging; dispensing a plurality of small molecule compounds into each well; 基于各个孔的至少一个视野的时间序列图像,来分析各个孔的细胞内聚集物的量衰减状况;Based on the time series images of at least one field of view of each well, analyzing the attenuation of the amount of intracellular aggregates in each well; 筛选出细胞内聚集物的量的衰减状况满足预定条件的孔,并确定向所筛选出的孔分配的小分子化合物作为目标肿瘤病症的候选药物。Wells whose attenuation conditions of the amount of intracellular aggregates meet predetermined conditions are screened, and small molecule compounds assigned to the screened wells are determined as candidate drugs for the target tumor disorder. 33.根据权利要求32所述的计算机程序产品,其特征在于,基于各个孔的至少一个视野的时间序列图像,来分析各个孔的细胞内聚集物的量衰减状况具体包括,对于各个孔:33. The computer program product according to claim 32, wherein the step of analyzing the attenuation of the amount of intracellular aggregates in each well based on the time series images of at least one field of view of each well specifically comprises, for each well: 对于每个视野的所述时间序列图像的各个采样时间点的图像,执行细胞和聚集物的分割,并确定各个细胞的聚集物量;For each image of each sampling time point of the time series image of each field of view, performing segmentation of cells and aggregates, and determining the amount of aggregates of each cell; 对于每个视野的各个时间点的图像,确定聚集物量超过富集聚集物阈值的细胞所占比例,并基于视野求平均后作为该孔在各个采样时间点的富含聚集物细胞分数,从而得到该孔在所述预定时间段内的富含聚集物细胞分数的分布曲线;For the images at each time point in each field of view, the proportion of cells whose aggregate amount exceeds the aggregate enrichment threshold is determined, and the fraction of aggregate-rich cells in the well at each sampling time point is averaged based on the field of view, thereby obtaining a distribution curve of the fraction of aggregate-rich cells in the well within the predetermined time period; 基于该孔在所述预定时间段内的富含聚集物细胞分数的分布曲线,拟合指数模型;fitting an exponential model based on the distribution curve of the fraction of aggregate-rich cells for the well over the predetermined time period; 基于拟合的指数模型,来计算拟合效果参数和富含聚集物细胞分数的衰减比率,Based on the fitted exponential model, the fitted effect parameter and the decay ratio of the aggregate-rich cell fraction were calculated. 其中,所述预定条件包括富含聚集物细胞分数的衰减比率达到第一预定条件且拟合效果参数优于第二阈值。The predetermined conditions include that the decay ratio of the aggregate-rich cell fraction reaches a first predetermined condition and the fitting effect parameter is better than a second threshold. 34.根据权利要求33所述的计算机程序产品,其特征在于,拟合效果参数为R平方,且所述第二阈值为0.2-1.0。34. The computer program product of claim 33, wherein the fitting effect parameter is R squared, and the second threshold is 0.2-1.0. 35.根据权利要求33所述的计算机程序产品,其特征在于,所述第一预定条件为衰减比率达到0.2且按照衰减比率大小顺序排列的前预定数量个,所述衰减比率定义为利用拟合的指数模型计算的衰减后的富含聚集物细胞分数与初始的富含聚集物细胞分数的比率。35. The computer program product according to claim 33 is characterized in that the first predetermined condition is the first predetermined number of cells whose attenuation ratio reaches 0.2 and are arranged in order of attenuation ratio, and the attenuation ratio is defined as the ratio of the aggregate-rich cell fraction after attenuation calculated using a fitted exponential model to the initial aggregate-rich cell fraction. 36.根据权利要求32-35中任何一项所述的计算机程序产品,其特征在于,所述筛选肿瘤病症的药物的方法还包括:自动地识别各个孔中的细胞的存活状况,确定存活状况劣于预定存活条件的孔;将向该孔分配到的小分子化合物从目标肿瘤病症的候选药物中排除。36. The computer program product according to any one of claims 32-35, characterized in that the method for screening drugs for tumor diseases also includes: automatically identifying the survival status of cells in each well, determining the wells whose survival status is worse than the predetermined survival condition; and excluding the small molecule compound assigned to the well from the candidate drugs for the target tumor disease. 37.根据权利要求36所述的计算机程序产品,其特征在于,自动地识别各个孔中的细胞的存活状况具体包括:37. The computer program product according to claim 36, wherein automatically identifying the viability of cells in each well specifically comprises: 利用图像处理算法,确定细胞的形态和细胞数量的减少率中的至少一种;Determine at least one of the morphology of the cells and the rate of reduction of the number of cells using an image processing algorithm; 在所识别的细胞的形态相较球形的偏离度高于偏离度阈值和/或细胞数量的减少率高于减少率阈值的情况下,将细胞识别为存活状况劣于预定存活条件。When the deviation of the morphology of the identified cells from a spherical shape is higher than a deviation threshold and/or the reduction rate of the number of cells is higher than a reduction rate threshold, the cells are identified as having a survival condition worse than a predetermined survival condition. 38.根据权利要求33所述的计算机程序产品,其特征在于,所述指数模型由如下的公式(1)来定义:38. The computer program product of claim 33, wherein the exponential model is defined by the following formula (1): 其中c0和c1是指数模型中需要回归的常数,t是小分子化合物的作用时间,Y(t)是每个孔中的富含聚集物细胞分数。where c0 and c1 are constants to be regressed in the exponential model, t is the exposure time of the small molecule compound, and Y(t) is the fraction of aggregate-rich cells in each well. 39.根据权利要求33所述的计算机程序产品,其特征在于,对各个图像执行细胞和聚集物的分割具体包括:39. The computer program product of claim 33, wherein performing segmentation of cells and aggregates on each image comprises: 利用第一算法来分割细胞;using a first algorithm to segment cells; 识别图像中的点状结构,并以聚集物的代表尺寸作为目标尺寸执行增强;Identify point structures in the image and perform enhancement using the representative size of the aggregate as the target size; 利用第二算法,分割出强度值比背景的代表强度值高出第三阈值的聚集物。Using the second algorithm, aggregates whose intensity values are higher than the representative intensity value of the background by a third threshold are segmented. 40.根据权利要求39所述的计算机程序产品,其特征在于,所述第一算法自动地确定用于分割细胞前景和细胞背景的强度阈值且据此进行细胞分割,所述第二算法自动地确定聚合物背景的鲁棒性的代表强度值,所述第三阈值为背景的代表强度值的标准偏差的3-5倍。40. The computer program product according to claim 39 is characterized in that the first algorithm automatically determines the intensity threshold for segmenting cell foreground and cell background and performs cell segmentation based on it, the second algorithm automatically determines the representative intensity value of the robustness of the polymer background, and the third threshold is 3-5 times the standard deviation of the representative intensity value of the background. 41.根据权利要求39所述的计算机程序产品,其特征在于,所述第一算法包括Otsu算法,所述第二算法包括鲁棒背景算法。41. The computer program product of claim 39, wherein the first algorithm comprises an Otsu algorithm and the second algorithm comprises a robust background algorithm. 42.一种高内涵成像分析系统,包括:42. A high-content imaging analysis system, comprising: 自动高速显微成像组件,其配置为在如下步骤执行后对包括孔阵列的孔板的各个孔以数个不同的视野在预定时间段内持续拍摄,而生成各个孔的数个视野的时间序列图像:将包含靶融合蛋白的细胞接种到包括孔阵列的孔板中,其中,所述细胞包含编码靶融合蛋白的核酸分子,所述靶融合蛋白与目标肿瘤病症相关联,包含相分离倾向结构域(PS)和DNA结合结构域(DBD)且进一步包含标记分子,以便能够形成可供成像识别的聚集物;将多种小分子化合物分配到各个孔中;以及An automatic high-speed microscopic imaging component is configured to continuously photograph each well of a well plate including a well array with several different fields of view within a predetermined time period to generate time series images of several fields of view of each well after the following steps are performed: inoculating cells containing a target fusion protein into a well plate including a well array, wherein the cells contain a nucleic acid molecule encoding the target fusion protein, the target fusion protein is associated with a target tumor disorder, contains a phase separation prone domain (PS) and a DNA binding domain (DBD) and further contains a marker molecule so as to be able to form aggregates that can be identified by imaging; dispensing a plurality of small molecule compounds into each well; and 至少一个处理器,其配置为执行根据权利要求32-41中任何一项所述的筛选肿瘤病症的药物的方法。At least one processor configured to execute the method for screening drugs for tumor conditions according to any one of claims 32-41. 43.一种分析包含相分离倾向结构域(PS)和DNA结合结构域(DBD)的靶融合蛋白的相分离状况的方法,其特征在于,包括:43. A method for analyzing the phase separation state of a target fusion protein comprising a phase separation prone domain (PS) and a DNA binding domain (DBD), comprising: 采集包含所述靶融合蛋白的细胞的图像,其中,所述细胞包含编码靶融合蛋白的核酸分子,所述靶融合蛋白进一步包含标记分子以便能够形成可供成像识别的聚集物;以及Acquiring an image of a cell comprising the target fusion protein, wherein the cell comprises a nucleic acid molecule encoding the target fusion protein, and the target fusion protein further comprises a labeling molecule so as to form aggregates that can be identified by imaging; and 识别所述图像中的细胞内聚集物的量,并根据所识别的聚集物的量来确定所述PS-DBD融合蛋白在体内和/或体外的相分离状况,聚集物的量越大,则相分离状况越严重。The amount of intracellular aggregates in the image is identified, and the phase separation status of the PS-DBD fusion protein in vivo and/or in vitro is determined based on the amount of the identified aggregates. The larger the amount of aggregates, the more serious the phase separation status. 44.根据权利要求43所述的方法,其特征在于,所述聚集物呈现为液滴或者凝胶状。44. The method according to claim 43, characterized in that the aggregates are in the form of droplets or gel. 45.根据权利要求43所述的方法,其特征在于,所述靶融合蛋白的DNA结合结构域包含突变,使得所形成的聚集物的成像识别性相较DNA结合结构域不包含突变所形成的聚集物的成像识别性更显著。45. The method according to claim 43 is characterized in that the DNA binding domain of the target fusion protein contains a mutation, so that the imaging recognition of the formed aggregates is more significant than the imaging recognition of the aggregates formed when the DNA binding domain does not contain the mutation. 46.根据权利要求45所述的方法,其特征在于,所述突变为点突变,且引入待分析的靶融合蛋白的DBD中具有与DNA结合活性的位点。46. The method according to claim 45, characterized in that the mutation is a point mutation, and is introduced into a site having DNA binding activity in the DBD of the target fusion protein to be analyzed. 47.根据权利要求46所述的方法,其特征在于,待分析的靶融合蛋白的DBD中具有与DNA结合活性的位点根据Swiss-Prot数据库中有关DNA结合活性的信息来确定。47. The method according to claim 46 is characterized in that the site with DNA binding activity in the DBD of the target fusion protein to be analyzed is determined based on the information on DNA binding activity in the Swiss-Prot database.
CN202310118308.5A 2023-02-03 2023-02-03 Method for screening drugs for tumor diseases, program product, high-content imaging analysis system, method for analyzing phase separation Pending CN116590375A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202310118308.5A CN116590375A (en) 2023-02-03 2023-02-03 Method for screening drugs for tumor diseases, program product, high-content imaging analysis system, method for analyzing phase separation
PCT/CN2024/075422 WO2024160270A1 (en) 2023-02-03 2024-02-02 Method for screening drug for tumor disease, program product, high-content imaging analysis system, and method for analyzing phase separation conditions

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310118308.5A CN116590375A (en) 2023-02-03 2023-02-03 Method for screening drugs for tumor diseases, program product, high-content imaging analysis system, method for analyzing phase separation

Publications (1)

Publication Number Publication Date
CN116590375A true CN116590375A (en) 2023-08-15

Family

ID=87597844

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310118308.5A Pending CN116590375A (en) 2023-02-03 2023-02-03 Method for screening drugs for tumor diseases, program product, high-content imaging analysis system, method for analyzing phase separation

Country Status (2)

Country Link
CN (1) CN116590375A (en)
WO (1) WO2024160270A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2024160270A1 (en) * 2023-02-03 2024-08-08 北京大学 Method for screening drug for tumor disease, program product, high-content imaging analysis system, and method for analyzing phase separation conditions

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10533167B2 (en) * 2016-06-09 2020-01-14 The Trustees Of Princeton University Optogenetic tool for rapid and reversible clustering of proteins
EP3853242B1 (en) * 2018-09-20 2024-05-15 The Trustees Of Princeton University High throughput method and system for mapping intracellular phase diagrams
CN111175514B (en) * 2018-11-13 2023-07-14 中国科学院上海有机化学研究所 Methods for Screening and Characterizing Proteins with Phase Separation Capability
CN113631707A (en) * 2019-02-19 2021-11-09 普林斯顿大学理事会 Systems and methods for light-regulated oligomerization and phase separation of folding domains and RNA particle-associated protein domains
CN116590375A (en) * 2023-02-03 2023-08-15 北京大学 Method for screening drugs for tumor diseases, program product, high-content imaging analysis system, method for analyzing phase separation

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2024160270A1 (en) * 2023-02-03 2024-08-08 北京大学 Method for screening drug for tumor disease, program product, high-content imaging analysis system, and method for analyzing phase separation conditions

Also Published As

Publication number Publication date
WO2024160270A1 (en) 2024-08-08

Similar Documents

Publication Publication Date Title
Aharonov et al. ERBB2 drives YAP activation and EMT-like processes during cardiac regeneration
Shtutman et al. Cellular model of p21-induced senescence
Drost et al. BRD7 is a candidate tumour suppressor gene required for p53 function
Gal et al. Nuclear localization sequence of FUS and induction of stress granules by ALS mutants
Song et al. Specific interaction of KIF11 with ZBP1 regulates the transport of β-actin mRNA and cell motility
Chaturvedi et al. Molecular dissection of the mechanism by which EWS/FLI expression compromises actin cytoskeletal integrity and cell adhesion in Ewing sarcoma
Ducat et al. Regulation of microtubule assembly and organization in mitosis by the AAA+ ATPase Pontin
EP3093348B1 (en) Method for distinguishing desired cell type using expression of mirna as indicator
Monderer-Rothkoff et al. AUTS2 isoforms control neuronal differentiation
Chen et al. DDX3 modulates neurite development via translationally activating an RNA regulon involved in Rac1 activation
Moissoglu et al. Translational regulation of protrusion-localized RNAs involves silencing and clustering after transport
Sidor et al. Mask family proteins ANKHD1 and ANKRD17 regulate YAP nuclear import and stability
Shi et al. BRD4-targeting PROTAC as a unique tool to study biomolecular condensates
JP6873089B2 (en) Methods and systems for identifying patient-specific driver mutations
Jain et al. Rudhira/BCAS3 is a cytoskeletal protein that controls Cdc42 activation and directional cell migration during angiogenesis
Hsiao et al. Gli2 modulates cell cycle re-entry through autophagy-mediated regulation of the length of primary cilia
WO2024160270A1 (en) Method for screening drug for tumor disease, program product, high-content imaging analysis system, and method for analyzing phase separation conditions
Raynard et al. NF‐κB‐dependent secretome of senescent cells can trigger neuroendocrine transdifferentiation of breast cancer cells
Shea et al. Localization of YAP activity in developing skeletal rudiments is responsive to mechanical stimulation
Gelev et al. A new paradigm for transcription factor TFIIB functionality
Ariss et al. Amalgam regulates the receptor tyrosine kinase pathway through Sprouty in glial cell development in the Drosophila larval brain
Libé-Philippot et al. Human synaptic neoteny requires species-specific balancing of SRGAP2-SYNGAP1 cross-inhibition
Krauss et al. Downregulation of protein 4.1 R, a mature centriole protein, disrupts centrosomes, alters cell cycle progression, and perturbs mitotic spindles and anaphase
Libé-Philippot et al. Synaptic neoteny of human cortical neurons requires species-specific balancing of SRGAP2-SYNGAP1 cross-inhibition
Petsalaki et al. The abscission checkpoint senses chromatin bridges through Top2α recruitment to DNA knots

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination