IT201900000130A1 - METHOD FOR IDENTIFYING SUBJECTS RESISTANT TO TREATMENT WITH PLATINUM-BASED DRUGS AND RELATIVE KIT - Google Patents
METHOD FOR IDENTIFYING SUBJECTS RESISTANT TO TREATMENT WITH PLATINUM-BASED DRUGS AND RELATIVE KIT Download PDFInfo
- Publication number
- IT201900000130A1 IT201900000130A1 IT102019000000130A IT201900000130A IT201900000130A1 IT 201900000130 A1 IT201900000130 A1 IT 201900000130A1 IT 102019000000130 A IT102019000000130 A IT 102019000000130A IT 201900000130 A IT201900000130 A IT 201900000130A IT 201900000130 A1 IT201900000130 A1 IT 201900000130A1
- Authority
- IT
- Italy
- Prior art keywords
- genes
- seq
- treatment
- container
- expression
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims description 42
- 238000011282 treatment Methods 0.000 title claims description 40
- 239000003814 drug Substances 0.000 title claims description 19
- 229940079593 drug Drugs 0.000 title claims description 8
- 108090000623 proteins and genes Proteins 0.000 claims description 79
- 206010028980 Neoplasm Diseases 0.000 claims description 35
- 210000004027 cell Anatomy 0.000 claims description 31
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 21
- 238000004458 analytical method Methods 0.000 claims description 21
- 239000000523 sample Substances 0.000 claims description 20
- 230000014509 gene expression Effects 0.000 claims description 19
- 108020004999 messenger RNA Proteins 0.000 claims description 19
- 201000011510 cancer Diseases 0.000 claims description 17
- 230000004083 survival effect Effects 0.000 claims description 17
- 241000282414 Homo sapiens Species 0.000 claims description 16
- 150000003057 platinum Chemical class 0.000 claims description 14
- 229940124597 therapeutic agent Drugs 0.000 claims description 11
- 238000006243 chemical reaction Methods 0.000 claims description 7
- 239000012472 biological sample Substances 0.000 claims description 5
- 230000002018 overexpression Effects 0.000 claims description 5
- 239000000243 solution Substances 0.000 claims description 5
- 238000003556 assay Methods 0.000 claims description 4
- 239000013068 control sample Substances 0.000 claims description 4
- 238000003757 reverse transcription PCR Methods 0.000 claims description 4
- 239000000758 substrate Substances 0.000 claims description 4
- 210000004881 tumor cell Anatomy 0.000 claims description 4
- 102000006382 Ribonucleases Human genes 0.000 claims description 3
- 108010083644 Ribonucleases Proteins 0.000 claims description 3
- 238000002493 microarray Methods 0.000 claims description 3
- 238000012163 sequencing technique Methods 0.000 claims description 3
- 230000003827 upregulation Effects 0.000 claims description 3
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 2
- 102100038449 Claudin-6 Human genes 0.000 claims description 2
- 102100031611 Collagen alpha-1(III) chain Human genes 0.000 claims description 2
- 102100033825 Collagen alpha-1(XI) chain Human genes 0.000 claims description 2
- 102100036213 Collagen alpha-2(I) chain Human genes 0.000 claims description 2
- 102100030074 Dickkopf-related protein 1 Human genes 0.000 claims description 2
- 102100030091 Dickkopf-related protein 2 Human genes 0.000 claims description 2
- 102100028463 Galactose-3-O-sulfotransferase 3 Human genes 0.000 claims description 2
- 101000882898 Homo sapiens Claudin-6 Proteins 0.000 claims description 2
- 101000993285 Homo sapiens Collagen alpha-1(III) chain Proteins 0.000 claims description 2
- 101000710623 Homo sapiens Collagen alpha-1(XI) chain Proteins 0.000 claims description 2
- 101000875067 Homo sapiens Collagen alpha-2(I) chain Proteins 0.000 claims description 2
- 101000864646 Homo sapiens Dickkopf-related protein 1 Proteins 0.000 claims description 2
- 101000864647 Homo sapiens Dickkopf-related protein 2 Proteins 0.000 claims description 2
- 101001061351 Homo sapiens Galactose-3-O-sulfotransferase 3 Proteins 0.000 claims description 2
- 101000840566 Homo sapiens Insulin-like growth factor-binding protein 5 Proteins 0.000 claims description 2
- 101001055087 Homo sapiens MAP3K7 C-terminal-like protein Proteins 0.000 claims description 2
- 101000703681 Homo sapiens Single-minded homolog 1 Proteins 0.000 claims description 2
- 102100029225 Insulin-like growth factor-binding protein 5 Human genes 0.000 claims description 2
- 102100026906 MAP3K7 C-terminal-like protein Human genes 0.000 claims description 2
- 238000000636 Northern blotting Methods 0.000 claims description 2
- 102100031980 Single-minded homolog 1 Human genes 0.000 claims description 2
- 239000003085 diluting agent Substances 0.000 claims description 2
- 238000003364 immunohistochemistry Methods 0.000 claims description 2
- 238000007901 in situ hybridization Methods 0.000 claims description 2
- 238000011534 incubation Methods 0.000 claims description 2
- 239000000126 substance Substances 0.000 claims description 2
- 108700026220 vif Genes Proteins 0.000 claims description 2
- 239000011534 wash buffer Substances 0.000 claims description 2
- 238000005406 washing Methods 0.000 claims description 2
- 238000005303 weighing Methods 0.000 claims description 2
- BASFCYQUMIYNBI-UHFFFAOYSA-N platinum Chemical compound [Pt] BASFCYQUMIYNBI-UHFFFAOYSA-N 0.000 description 22
- 108020004414 DNA Proteins 0.000 description 18
- 229910052697 platinum Inorganic materials 0.000 description 11
- 210000001519 tissue Anatomy 0.000 description 11
- 238000001514 detection method Methods 0.000 description 10
- 238000011156 evaluation Methods 0.000 description 10
- 101000607909 Homo sapiens Ubiquitin carboxyl-terminal hydrolase 1 Proteins 0.000 description 9
- 102100039865 Ubiquitin carboxyl-terminal hydrolase 1 Human genes 0.000 description 9
- 238000004393 prognosis Methods 0.000 description 9
- 241000237858 Gastropoda Species 0.000 description 8
- 150000007523 nucleic acids Chemical class 0.000 description 8
- 230000000771 oncological effect Effects 0.000 description 8
- 238000002512 chemotherapy Methods 0.000 description 7
- 230000000694 effects Effects 0.000 description 7
- 102000039446 nucleic acids Human genes 0.000 description 7
- 108020004707 nucleic acids Proteins 0.000 description 7
- 108091034117 Oligonucleotide Proteins 0.000 description 6
- 102000004169 proteins and genes Human genes 0.000 description 5
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- 102100034343 Integrase Human genes 0.000 description 4
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 4
- 238000011529 RT qPCR Methods 0.000 description 4
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 4
- 239000002299 complementary DNA Substances 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 229940088598 enzyme Drugs 0.000 description 4
- 230000007705 epithelial mesenchymal transition Effects 0.000 description 4
- 230000012010 growth Effects 0.000 description 4
- 239000003112 inhibitor Substances 0.000 description 4
- 238000011835 investigation Methods 0.000 description 4
- 238000001356 surgical procedure Methods 0.000 description 4
- 108091093088 Amplicon Proteins 0.000 description 3
- 206010027476 Metastases Diseases 0.000 description 3
- 206010033128 Ovarian cancer Diseases 0.000 description 3
- 108010006785 Taq Polymerase Proteins 0.000 description 3
- 239000000090 biomarker Substances 0.000 description 3
- 230000001413 cellular effect Effects 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 230000000875 corresponding effect Effects 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- 239000012634 fragment Substances 0.000 description 3
- 238000009396 hybridization Methods 0.000 description 3
- 206010061289 metastatic neoplasm Diseases 0.000 description 3
- 102000040430 polynucleotide Human genes 0.000 description 3
- 108091033319 polynucleotide Proteins 0.000 description 3
- 239000002157 polynucleotide Substances 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 230000002285 radioactive effect Effects 0.000 description 3
- GUAHPAJOXVYFON-ZETCQYMHSA-N (8S)-8-amino-7-oxononanoic acid zwitterion Chemical compound C[C@H](N)C(=O)CCCCCC(O)=O GUAHPAJOXVYFON-ZETCQYMHSA-N 0.000 description 2
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 2
- 206010003445 Ascites Diseases 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 206010059866 Drug resistance Diseases 0.000 description 2
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 2
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 2
- 241000204031 Mycoplasma Species 0.000 description 2
- 108091028043 Nucleic acid sequence Proteins 0.000 description 2
- 208000007571 Ovarian Epithelial Carcinoma Diseases 0.000 description 2
- 206010061535 Ovarian neoplasm Diseases 0.000 description 2
- 208000003721 Triple Negative Breast Neoplasms Diseases 0.000 description 2
- 230000004075 alteration Effects 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 239000013060 biological fluid Substances 0.000 description 2
- 238000001574 biopsy Methods 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 235000020958 biotin Nutrition 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 238000003745 diagnosis Methods 0.000 description 2
- 210000000981 epithelium Anatomy 0.000 description 2
- 239000012091 fetal bovine serum Substances 0.000 description 2
- 210000004408 hybridoma Anatomy 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 201000005202 lung cancer Diseases 0.000 description 2
- 208000020816 lung neoplasm Diseases 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 230000002611 ovarian Effects 0.000 description 2
- 239000012188 paraffin wax Substances 0.000 description 2
- 230000007170 pathology Effects 0.000 description 2
- 238000010837 poor prognosis Methods 0.000 description 2
- 239000000092 prognostic biomarker Substances 0.000 description 2
- 230000028617 response to DNA damage stimulus Effects 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 208000022679 triple-negative breast carcinoma Diseases 0.000 description 2
- IJJWOSAXNHWBPR-HUBLWGQQSA-N 5-[(3as,4s,6ar)-2-oxo-1,3,3a,4,6,6a-hexahydrothieno[3,4-d]imidazol-4-yl]-n-(6-hydrazinyl-6-oxohexyl)pentanamide Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)NCCCCCC(=O)NN)SC[C@@H]21 IJJWOSAXNHWBPR-HUBLWGQQSA-N 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 201000004384 Alopecia Diseases 0.000 description 1
- 241000713838 Avian myeloblastosis virus Species 0.000 description 1
- 206010006187 Breast cancer Diseases 0.000 description 1
- 208000026310 Breast neoplasm Diseases 0.000 description 1
- 206010048610 Cardiotoxicity Diseases 0.000 description 1
- 102000029816 Collagenase Human genes 0.000 description 1
- 108060005980 Collagenase Proteins 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 238000000018 DNA microarray Methods 0.000 description 1
- 239000003298 DNA probe Substances 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 206010012735 Diarrhoea Diseases 0.000 description 1
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 1
- 208000010201 Exanthema Diseases 0.000 description 1
- 206010019233 Headaches Diseases 0.000 description 1
- 101001012157 Homo sapiens Receptor tyrosine-protein kinase erbB-2 Proteins 0.000 description 1
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 1
- 208000030070 Malignant epithelial tumor of ovary Diseases 0.000 description 1
- 108091092878 Microsatellite Proteins 0.000 description 1
- 241000551546 Minerva Species 0.000 description 1
- 241000713869 Moloney murine leukemia virus Species 0.000 description 1
- 206010029350 Neurotoxicity Diseases 0.000 description 1
- 102100030569 Nuclear receptor corepressor 2 Human genes 0.000 description 1
- 101710153660 Nuclear receptor corepressor 2 Proteins 0.000 description 1
- 208000035327 Oestrogen receptor positive breast cancer Diseases 0.000 description 1
- 206010061328 Ovarian epithelial cancer Diseases 0.000 description 1
- 238000009004 PCR Kit Methods 0.000 description 1
- 208000002193 Pain Diseases 0.000 description 1
- 206010061902 Pancreatic neoplasm Diseases 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 102000004245 Proteasome Endopeptidase Complex Human genes 0.000 description 1
- 108090000708 Proteasome Endopeptidase Complex Proteins 0.000 description 1
- 208000003251 Pruritus Diseases 0.000 description 1
- 238000002123 RNA extraction Methods 0.000 description 1
- 238000010802 RNA extraction kit Methods 0.000 description 1
- 239000013614 RNA sample Substances 0.000 description 1
- 102100030086 Receptor tyrosine-protein kinase erbB-2 Human genes 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- XUIMIQQOPSSXEZ-UHFFFAOYSA-N Silicon Chemical compound [Si] XUIMIQQOPSSXEZ-UHFFFAOYSA-N 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- 206010044221 Toxic encephalopathy Diseases 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 108010020713 Tth polymerase Proteins 0.000 description 1
- 102000006275 Ubiquitin-Protein Ligases Human genes 0.000 description 1
- 108010083111 Ubiquitin-Protein Ligases Proteins 0.000 description 1
- 208000025865 Ulcer Diseases 0.000 description 1
- 206010047700 Vomiting Diseases 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 239000002246 antineoplastic agent Substances 0.000 description 1
- 229940041181 antineoplastic drug Drugs 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 210000003567 ascitic fluid Anatomy 0.000 description 1
- 206010003549 asthenia Diseases 0.000 description 1
- 238000000376 autoradiography Methods 0.000 description 1
- KLNFSAOEKUDMFA-UHFFFAOYSA-N azanide;2-hydroxyacetic acid;platinum(2+) Chemical compound [NH2-].[NH2-].[Pt+2].OCC(O)=O KLNFSAOEKUDMFA-UHFFFAOYSA-N 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 239000010839 body fluid Substances 0.000 description 1
- 210000000481 breast Anatomy 0.000 description 1
- AIYUHDOJVYHVIT-UHFFFAOYSA-M caesium chloride Chemical compound [Cl-].[Cs+] AIYUHDOJVYHVIT-UHFFFAOYSA-M 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 229910002091 carbon monoxide Inorganic materials 0.000 description 1
- 229960004562 carboplatin Drugs 0.000 description 1
- 190000008236 carboplatin Chemical compound 0.000 description 1
- 231100000259 cardiotoxicity Toxicity 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000024245 cell differentiation Effects 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 125000003636 chemical group Chemical group 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 229960004316 cisplatin Drugs 0.000 description 1
- DQLATGHUWYMOKM-UHFFFAOYSA-L cisplatin Chemical compound N[Pt](N)(Cl)Cl DQLATGHUWYMOKM-UHFFFAOYSA-L 0.000 description 1
- 238000011281 clinical therapy Methods 0.000 description 1
- 229960002424 collagenase Drugs 0.000 description 1
- 210000001072 colon Anatomy 0.000 description 1
- 238000007398 colorimetric assay Methods 0.000 description 1
- 238000004440 column chromatography Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 230000002498 deadly effect Effects 0.000 description 1
- 230000007850 degeneration Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 239000003398 denaturant Substances 0.000 description 1
- 238000000432 density-gradient centrifugation Methods 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000002405 diagnostic procedure Methods 0.000 description 1
- 208000002173 dizziness Diseases 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 210000003743 erythrocyte Anatomy 0.000 description 1
- 102000015694 estrogen receptors Human genes 0.000 description 1
- 108010038795 estrogen receptors Proteins 0.000 description 1
- 201000007281 estrogen-receptor positive breast cancer Diseases 0.000 description 1
- 201000005884 exanthem Diseases 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 238000001917 fluorescence detection Methods 0.000 description 1
- 102000034356 gene-regulatory proteins Human genes 0.000 description 1
- 108091006104 gene-regulatory proteins Proteins 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 229960000789 guanidine hydrochloride Drugs 0.000 description 1
- PJJJBBJSCAKJQF-UHFFFAOYSA-N guanidinium chloride Chemical compound [Cl-].NC(N)=[NH2+] PJJJBBJSCAKJQF-UHFFFAOYSA-N 0.000 description 1
- ZJYYHGLJYGJLLN-UHFFFAOYSA-N guanidinium thiocyanate Chemical compound SC#N.NC(N)=N ZJYYHGLJYGJLLN-UHFFFAOYSA-N 0.000 description 1
- 208000024963 hair loss Diseases 0.000 description 1
- 230000003676 hair loss Effects 0.000 description 1
- 231100000869 headache Toxicity 0.000 description 1
- 229910001385 heavy metal Inorganic materials 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 238000012151 immunohistochemical method Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 230000007803 itching Effects 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 230000003907 kidney function Effects 0.000 description 1
- 231100000518 lethal Toxicity 0.000 description 1
- 230000001665 lethal effect Effects 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 210000004072 lung Anatomy 0.000 description 1
- 230000001926 lymphatic effect Effects 0.000 description 1
- 238000010841 mRNA extraction Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000000873 masking effect Effects 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 230000001394 metastastic effect Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 210000000214 mouth Anatomy 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 229950007221 nedaplatin Drugs 0.000 description 1
- 231100000228 neurotoxicity Toxicity 0.000 description 1
- 230000007135 neurotoxicity Effects 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 210000002747 omentum Anatomy 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 229960001756 oxaliplatin Drugs 0.000 description 1
- DWAFYCQODLXJNR-BNTLRKBRSA-L oxaliplatin Chemical compound O1C(=O)C(=O)O[Pt]11N[C@@H]2CCCC[C@H]2N1 DWAFYCQODLXJNR-BNTLRKBRSA-L 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 239000013610 patient sample Substances 0.000 description 1
- 229940049954 penicillin Drugs 0.000 description 1
- 210000003200 peritoneal cavity Anatomy 0.000 description 1
- 210000004303 peritoneum Anatomy 0.000 description 1
- 238000006552 photochemical reaction Methods 0.000 description 1
- 229950005566 picoplatin Drugs 0.000 description 1
- IIMIOEBMYPRQGU-UHFFFAOYSA-L picoplatin Chemical compound N.[Cl-].[Cl-].[Pt+2].CC1=CC=CC=N1 IIMIOEBMYPRQGU-UHFFFAOYSA-L 0.000 description 1
- 210000002381 plasma Anatomy 0.000 description 1
- 239000002985 plastic film Substances 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 230000037452 priming Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 210000002307 prostate Anatomy 0.000 description 1
- 238000011158 quantitative evaluation Methods 0.000 description 1
- 238000001959 radiotherapy Methods 0.000 description 1
- 206010037844 rash Diseases 0.000 description 1
- 238000003753 real-time PCR Methods 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 239000013074 reference sample Substances 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 229960005399 satraplatin Drugs 0.000 description 1
- 190014017285 satraplatin Chemical compound 0.000 description 1
- 210000000582 semen Anatomy 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 229910052710 silicon Inorganic materials 0.000 description 1
- 239000010703 silicon Substances 0.000 description 1
- 231100000046 skin rash Toxicity 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 210000002784 stomach Anatomy 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 238000007671 third-generation sequencing Methods 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 238000011269 treatment regimen Methods 0.000 description 1
- 230000004614 tumor growth Effects 0.000 description 1
- 230000034512 ubiquitination Effects 0.000 description 1
- 238000010798 ubiquitination Methods 0.000 description 1
- 231100000397 ulcer Toxicity 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 231100000925 very toxic Toxicity 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
- C12Q1/6886—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/106—Pharmacogenomics, i.e. genetic variability in individual responses to drugs and drug metabolism
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/158—Expression markers
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Engineering & Computer Science (AREA)
- Immunology (AREA)
- Pathology (AREA)
- Analytical Chemistry (AREA)
- Zoology (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- Physics & Mathematics (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Hospice & Palliative Care (AREA)
- Biophysics (AREA)
- Oncology (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Pharmaceuticals Containing Other Organic And Inorganic Compounds (AREA)
Description
Domanda di brevetto per invenzione industriale avente per titolo: Patent application for industrial invention entitled:
“METODO PER IDENTIFICARE SOGGETTI RESISTENTI AL TRATTAMENTO CON FARMACI A BASE DI PLATINO E RELATIVO KIT" "METHOD FOR IDENTIFYING SUBJECTS RESISTANT TO TREATMENT WITH PLATINUM-BASED DRUGS AND RELATIVE KIT"
SETTORE TECNICO DELL’INVENZIONE TECHNICAL FIELD OF THE INVENTION
[0001] La presente invenzione si riferisce ad un metodo per identificare soggetti oncologici resistenti al trattamento con farmaci della categoria dei platino complessi. In particolare, l’invenzione è relativa ad un metodo che consente di fornire al medico uno strumento utile a valutare se un determinato soggetto oncologico sarà in grado di rispondere positivamente al trattamento con farmaci antitumorali a base di platino complessi o se svilupperà una resistenza, nonché di predire l’efficacia di tale trattamento. Inoltre, l’invenzione riguarda un kit per l’identificazione di detti soggetti. [0001] The present invention relates to a method for identifying oncological subjects resistant to treatment with drugs of the category of platinum complexes. In particular, the invention relates to a method that allows the physician to be provided with a useful tool to assess whether a specific cancer subject will be able to respond positively to treatment with complex platinum-based anticancer drugs or whether he will develop resistance, as well as to predict the effectiveness of such treatment. Furthermore, the invention relates to a kit for the identification of said subjects.
STATO ANTERIORE DELLA TECNICA PRIOR STATE OF THE ART
[0002] II platino complessi sono una categoria di farmaci che comprendono il cisplatino, il carboplatino, l’oxaliplatino, satraplatino, picoplatino, nedaplatino, triplatino, lipoplatino e costituiscono terapie cliniche di prima linea nel trattamento di molti tipi di tumori solidi (Kang et al., 2015). Sebbene la maggior parte dei pazienti inizialmente risponde molto bene al trattamento con questi composti, lo sviluppo di cellule tumorali resistenti al trattamento con platino (PT) rappresenta un evento molto frequente (Kang et al., 2015; Siddik, 2003). [0002] Platinum complexes are a category of drugs that include cisplatin, carboplatin, oxaliplatin, satraplatin, picoplatin, nedaplatin, triplatin, lipoplatin and constitute first-line clinical therapies in the treatment of many types of solid tumors (Kang et al., 2015). Although most patients initially respond very well to treatment with these compounds, the development of platinum-resistant (PT) tumor cells is a very frequent occurrence (Kang et al., 2015; Siddik, 2003).
[0003] Inoltre, è noto che il trattamento con platino causa effetti collaterali piuttosto pesanti per il soggetto che vi è sottoposto. In altre parole, questi trattamenti causano una sostanziale tossicità con comparsa di più o meno severa eruzione cutanea accompagnata da prurito, rialzo termico, brividi, senso di vertigini, cefalea, astenia, nausea e vomito, alterazione della funzione renale, caduta dei capelli, in associazione a fenomeni meno frequenti quali dolore e ulcere del cavo orale, diarrea, cardio tossicità, neuro tossicità. Furthermore, it is known that platinum treatment causes rather severe side effects for the subject subjected to it. In other words, these treatments cause substantial toxicity with the appearance of more or less severe skin rash accompanied by itching, temperature rise, chills, dizziness, headache, asthenia, nausea and vomiting, alteration of renal function, hair loss, in association with less frequent phenomena such as pain and ulcers of the oral cavity, diarrhea, cardio toxicity, neuro toxicity.
[0004] Pertanto, è molto sentita l’esigenza da parte dei clinici di avere a disposizione uno strumento valido per valutare con grande cura l’opportunità di trattare soggetti oncologi con farmaci potenzialmente molto tossici. Allo stesso tempo, tale strumento dovrebbe fornire una diagnosi ed una prognosi sul trattamento con platino complessi in modo che il medico possa avere ulteriori importanti informazioni per tale valutazione e per una programmazione mirata dei dosaggi e tempi di somministrazione. [0004] Therefore, there is a strong need on the part of clinicians to have a valid tool available to carefully evaluate the opportunity to treat oncologists with potentially very toxic drugs. At the same time, this tool should provide a diagnosis and prognosis on the treatment with platinum complexes so that the physician can have further important information for this evaluation and for a targeted planning of dosages and administration times.
[0005] A tal proposito, sono stati proposti alcuni metodi di diagnosi, prognosi e trattamento di tumori primari e metastatici tramite l’impiego di biomarcatori. Tra questi biomarcatori sono state proposte specificatamente varie molecole di natura oligonucleotidica e centinaia di geni le cui attività influenzano il comportamento e lo stato di differenziamento cellulare delle cellule tumorali nonché la loro sensibilità o resistenza ad agenti terapeutici quali i suddetti platino complessi. In ogni caso, i metodi proposti si sono rivelati scarsamente utili nel fornire ai clinici strumenti utili per la valutazione dello stato di soggetti oncologici da trattare. [0005] In this regard, some methods of diagnosis, prognosis and treatment of primary and metastatic tumors have been proposed through the use of biomarkers. Among these biomarkers various molecules of an oligonucleotide nature and hundreds of genes have been specifically proposed, whose activities influence the behavior and the state of cellular differentiation of cancer cells as well as their sensitivity or resistance to therapeutic agents such as the aforementioned platinum complexes. In any case, the proposed methods have proved to be of little use in providing clinicians with useful tools for assessing the status of cancer subjects to be treated.
[0006] Infatti, è stato riscontrato che tali metodi si concentrano sull’impiego di singoli analiti senza tenere in considerazione almeno le note dozzine di interazioni tra differenti marcatori tumorali. Inoltre, i noti metodi diagnostici che si basano sull’immunoistochimica non sono standardizzati. [0006] In fact, it has been found that these methods focus on the use of single analytes without taking into account at least the known dozens of interactions between different tumor markers. Furthermore, the known diagnostic methods that are based on immunohistochemistry are not standardized.
RIASSUNTO DELL’INVENZIONE SUMMARY OF THE INVENTION
[0007] Il problema tecnico affrontato dalla presente invenzione è quindi quello di fornire un metodo per identificare soggetti oncologici sensibili al trattamento di farmaci a base di platino in modo efficace, semplice ed affidabile. [0007] The technical problem faced by the present invention is therefore that of providing a method for identifying oncological subjects sensitive to the treatment of platinum-based drugs in an effective, simple and reliable way.
[0008] Tale problema è risolto da un metodo comprendente il rilevamento di un gruppo di dieci geni specifici e selezionati quale firma molecolare identificativa della presenza di farmaco resistenza ai platino complessi da parte di soggetti oncologici. [0008] This problem is solved by a method comprising the detection of a group of ten specific and selected genes as molecular signature identifying the presence of drug resistance to platinum complexes by oncological subjects.
[0009] Pertanto, un primo oggetto dell’invenzione è un metodo per identificare soggetti oncologici resistenti al trattamento con un agente terapeutico a base di o che include platino, comprendente una fase di rilevamento contemporaneo dell’espressione di dieci geni specifici. [0009] Therefore, a first object of the invention is a method for identifying oncological subjects resistant to treatment with a therapeutic agent based on or including platinum, comprising a phase of simultaneous detection of the expression of ten specific genes.
[0010] Un secondo oggetto dell’invenzione è un metodo per la prognosi di soggetti oncologici da sottoporre al trattamento con un agente terapeutico a base di platino. [0010] A second object of the invention is a method for the prognosis of oncological subjects to be subjected to treatment with a platinum-based therapeutic agent.
[0011] Un terzo oggetto è un metodo come in precedenza in cui l’identificazione dei geni è realizzata per mezzo di sonde mirate contro detti geni. [0011] A third object is a method as previously in which the identification of genes is carried out by means of probes targeted against said genes.
[0012] Un ulteriore oggetto è un metodo come in precedenza in cui viene realizzata una fase di “pesatura” dei singoli geni. [0012] A further object is a method as above in which a "weighing" step of the single genes is carried out.
[0013] Un ulteriore oggetto è un kit per l’identificazione di soggetto oncologico resistente ad agenti terapeutici a base di platino comprendente sonde mirate contro l’mRNA di detti dieci geni. [0013] A further object is a kit for the identification of an oncological subject resistant to platinum-based therapeutic agents including probes targeted against the mRNA of said ten genes.
BREVE DESCRIZIONE DELLE FIGURE BRIEF DESCRIPTION OF THE FIGURES
[0014] Le ulteriori caratteristiche e i vantaggi del metodo e del kit secondo la presente invenzione diverranno più evidenti nella descrizione seguente relativa ad una forma preferita di realizzazione data a puro titolo esemplificativo, non limitativo, con riferimento alle seguenti figure in cui: [0014] The further characteristics and advantages of the method and of the kit according to the present invention will become more evident in the following description relating to a preferred embodiment given purely by way of non-limiting example, with reference to the following figures in which:
- la Fig. 1 mostra una tabella che riporta i dieci geni più sovra-espressi (upregulated) nelle cellule tumorali che rispondono al trattamento combinato con platino complessi ed inibitori di USP1 modulando l’espressione della proteina Snail; - Fig. 1 shows a table showing the ten most over-expressed (upregulated) genes in cancer cells that respond to the combined treatment with platinum complexes and USP1 inhibitors by modulating the expression of the Snail protein;
- la Fig.2 è un grafico che mostra una curva di Kaplan-Mayer di valutazione della sopravvivenza libera da progressione per le pazienti con EOC utilizzando i dieci geni sovra-espressi elencati nella figura 1 per discriminare le pazienti con prognosi migliore e peggiore; - Fig. 2 is a graph showing a Kaplan-Mayer curve for the evaluation of progression-free survival for patients with EOC using the ten over-expressed genes listed in figure 1 to discriminate the patients with better and worse prognosis;
- la Fig.3 è un grafico che mostra una curva di Kaplan-Mayer di valutazione della sopravvivenza libera da progressione per le pazienti con TNBC utilizzando i dieci geni sovra-espressi elencati nella figura 1 per discriminare le pazienti con prognosi migliore e peggiore; - Fig. 3 is a graph showing a Kaplan-Mayer curve for the evaluation of progression-free survival for patients with TNBC using the ten over-expressed genes listed in figure 1 to discriminate the patients with better and worse prognosis;
- la Fig.4 è un grafico che mostra una curva di Kaplan-Mayer di valutazione della sopravvivenza libera da progressione per i pazienti con LC utilizzando i dieci geni sovra-espressi elencati nella figura 1 per discriminare le pazienti con prognosi migliore e peggiore; - Fig. 4 is a graph showing a Kaplan-Mayer curve for the evaluation of progression-free survival for patients with LC using the ten over-expressed genes listed in figure 1 to discriminate the patients with the best and worst prognosis;
- la Fig.5 è un grafico che mostra una curva di Kaplan-Mayer di valutazione della sopravvivenza libera da progressione per le pazienti ER+BC utilizzando i dieci geni sovra-espressi elencati nella figura 1 per discriminare le pazienti con prognosi migliore e peggiore. - Fig. 5 is a graph showing a Kaplan-Mayer curve for the evaluation of progression-free survival for ER + BC patients using the ten over-expressed genes listed in figure 1 to discriminate the patients with better and worse prognosis.
DESCRIZIONE DETTAGLIATA DELL’INVENZIONE DETAILED DESCRIPTION OF THE INVENTION
[0015] Il cancro dell’epitelio ovarico (EOC, Epithelial Ovarian Cancer) è un prototipo di patologia resistente al trattamento con PT, associata ad una prognosi molto scarsa (Jayson et al., 2014). EOC ricorrente resistente cresce come patologia metastatica che invade il peritoneo, l’omento e gli organi localizzati nella cavità peritoneale (Thibault et al., 2014; Weidle et al., 2016). [0015] Ovarian epithelial cancer (EOC, Epithelial Ovarian Cancer) is a prototype of a pathology resistant to PT treatment, associated with a very poor prognosis (Jayson et al., 2014). Resistant recurrent EOC grows as a metastatic disease that invades the peritoneum, omentum and organs located in the peritoneal cavity (Thibault et al., 2014; Weidle et al., 2016).
[0016] EOC è purtroppo una patologia ad alta mortalità che affligge migliaia di donne ogni anno. Sebbene il tasso di sopravvivenza nei primi cinque anni sia molto alto per i tumori localizzati, tale tasso precipita drasticamente dopo metastasi distali. Inoltre, sfortunatamente solo una piccola percentuale (15%) di casi vengono diagnosticati quando il tumore è ancora nello stadio primario, mentre la grande maggioranza delle diagnosi di EOC viene fatto in seguito a metastasi distale già avvenuta. Pertanto, EOC è considerata una patologia mortale. [0016] EOC is unfortunately a high mortality disease that afflicts thousands of women every year. Although the survival rate in the first five years is very high for localized tumors, this rate plummets after distal metastases. Furthermore, unfortunately only a small percentage (15%) of cases are diagnosed when the cancer is still in the primary stage, while the vast majority of EOC diagnoses are made following distal metastases that have already occurred. Therefore, EOC is considered a deadly pathology.
[0017] La necessità, quindi, di una comprensione migliore dei procedimenti e percorsi che portano allo sviluppo dei tumori nonché alla resistenza ai trattamenti più comuni a disposizione dei clinici è di primaria importanza. The need, therefore, for a better understanding of the procedures and pathways leading to the development of tumors as well as resistance to the most common treatments available to clinicians is of primary importance.
[0018] Per raggiungere questo obiettivo, l’identificazione di biomarcatori è senza dubbio una strada importante della ricerca medica soprattutto per la valutazione del trattamento con determinati agenti terapeutici quali i platino complessi. [0018] To achieve this goal, the identification of biomarkers is undoubtedly an important avenue of medical research especially for the evaluation of treatment with certain therapeutic agents such as platinum complexes.
[0019] Inoltre, sia la resistenza ai farmaci che la diffusione dei tumori sono state correlate con la Transizione Epiteliale-Mesenchimale (EMT, Epithelial-to-Mesenchymal Transition), un programma molecolare e cellulare complesso tramite il quale le cellule tumorali acquisiscono caratteristiche aggressive, incluso la plasticità cellulare, l’invasività e la resistenza all’apoptosi (Polyak e Weinberg, 2009). Se e come questo fenotipo sia collegato con un’alterazione della Risposta al Danno del DNA (DDR, DNA Damage Response) attivato dalla chemioterapia è ancora oscuro. Furthermore, both drug resistance and tumor spread have been correlated with Epithelial-to-Mesenchymal Transition (EMT), a complex molecular and cellular program by which cancer cells acquire aggressive characteristics , including cellular plasticity, invasiveness and resistance to apoptosis (Polyak and Weinberg, 2009). Whether and how this phenotype is connected with an alteration of the DNA Damage Response (DDR, DNA Damage Response) activated by chemotherapy is still unclear.
[0020] L’idea alla base della presente invenzione è stata quindi quella di partire dall’indagine di proteine mediatrici e regolatrici collegate con EMT e DDR. In particolare, tra le proteine che inducono EMT, è noto che il fattore di trascrizione Snail può contribuire alla PT-resistenza in EOC tramite mediazione della sopravvivenza cellulare e l’acquisizione di caratteristiche simil-staminali (Haslehurst et al., 2012; Kurrey et al., 2009). Snail è una proteina a vita breve in quanto viene rapidamente poliubiquitinata e degradata dal complesso del proteosoma (Zhou et al., 2004). Diverse ligasi dell’ubiquitina sono state identificate per l’ubiquitinazione e la degradazione di Snail, ma nessun altro fattore è correlato con la stabilità di Snail in modo da consentirgli di entrare in gioco nella PT-resistenza. [0020] The idea behind the present invention was therefore to start from the investigation of mediating and regulatory proteins connected with EMT and DDR. In particular, among the EMT-inducing proteins, it is known that the transcription factor Snail can contribute to PT-resistance in EOC by mediating cell survival and the acquisition of stem-like characteristics (Haslehurst et al., 2012; Kurrey et al. al., 2009). Snail is a short-lived protein as it is rapidly polyubiquitinated and degraded by the proteasome complex (Zhou et al., 2004). Several ubiquitin ligases have been identified for the ubiquitination and degradation of Snail, but no other factor correlates with Snail's stability in order to allow it to come into play in PT-resistance.
[0021] In seguito a numerose indagini, è stato trovato che la proteina USP1 (Proteasi Ubiquitina-Specifica 1) è in grado di de-ubiquitinare e stabilizzare Snail, quindi provocando la PT-resistenza e la disseminazione metastatica nelle cellule EOC. Following numerous investigations, it was found that the USP1 protein (Ubiquitin-Specific Protease 1) is capable of de-ubiquitinating and stabilizing Snail, thereby causing PT-resistance and metastatic dissemination in EOC cells.
[0022] Di conseguenza, sono state messe a punto metodiche atte a verificare la risposta all’inibizione di USP1 in linee cellulari di EOC trattate con PT-complessi in presenza o meno di un inibitore di USP1. Successivamente, è stata verificata l’espressione della proteina Snail. Il risultato di dette indagini ha messo in evidenza che in otto su dieci linee cellulari testate, denominate cellule “responsive” rispetto a quelle “non responsive”, l’inibitore di USP1 impedisce l’incremento (upregulation) di Snail dopo PT-trattamento. [0022] Consequently, methods have been developed to verify the response to the inhibition of USP1 in EOC cell lines treated with PT-complexes in the presence or absence of a USP1 inhibitor. Subsequently, the expression of the Snail protein was verified. The result of these investigations showed that in eight out of ten cell lines tested, called "responsive" cells compared to "non-responsive" cells, the USP1 inhibitor prevents the increase (upregulation) of Snail after PT-treatment.
[0023] Una volta dimostrata l’attività di USP1, si è pensato di verificare anche se la firma molecolare dell’mRNA associata a detta attività potesse in qualche modo predire la prognosi dei pazienti EOC. [0023] Once the activity of USP1 was demonstrated, it was decided to also verify whether the molecular signature of the mRNA associated with said activity could somehow predict the prognosis of EOC patients.
[0024] A tal riguardo, è stato osservato che circa cinquecento geni sono upregolati nelle cellule “responsive”. Questi geni sono stati selezionati usando i profili di espressione genica disponibili in letteratura (Kreuzinger et al., 2015). In particolare, durante l’intervento di riduzione della massa tumorale, i tessuti tumorali sono stati prelevati e direttamente analizzati per conferma del tipo istologico. In seguito, da tale materiale sono state ottenute sette diverse linee cellulari da pazienti con HGSOC. In this regard, it has been observed that about five hundred genes are upregulated in "responsive" cells. These genes were selected using gene expression profiles available in the literature (Kreuzinger et al., 2015). In particular, during the tumor mass reduction surgery, the tumor tissues were taken and directly analyzed for confirmation of the histological type. Subsequently, seven different cell lines were obtained from this material from patients with HGSOC.
[0025] Di queste sette linee cellulari, nella presente invenzione sono state prese in considerazione le seguenti: 8587, 13699 e 13914 tutte responsive dopo sei mesi di trattamento, 14433 non responsive dopo sei mesi di trattamento. La linea 8587 deriva da tessuto tumorale, le altre invece derivano da asciti. Le asciti sono state centrifugate e in seguito gli eritrociti eliminati grazie ad una fase di centrifugazione con Histopaque 1077 (Sigma-Aldrich, St. Louis, USA). I tessuti tumorali sono stati tagliati in piccoli pezzi e digeriti con la collagenasi (1 mg/mL, 1453 CDU/mg, Sigma-Aldrich) a 37 °C per 1 h. Le cellule sono state coltivate in terreno DMEM, con 10% fetal bovine serum (FBS), 100 units/mL penicillina and 100 µg/mL streptomicina (PS; tutto della Gibco by Life Technologies, CA, USA) a 37 °C e 5% CO 2. Per controllare contaminazioni da mycoplasma è stato usato VenorGeM Classic Mycoplasma Detection Kit for conventional PCR (Minerva Biolabs, Berlin, Germany). Of these seven cell lines, the following have been considered in the present invention: 8587, 13699 and 13914 all responsive after six months of treatment, 14433 non-responsive after six months of treatment. Line 8587 derives from tumor tissue, while the others derive from ascites. The ascites were centrifuged and then the erythrocytes eliminated thanks to a centrifugation step with Histopaque 1077 (Sigma-Aldrich, St. Louis, USA). Tumor tissues were cut into small pieces and digested with collagenase (1 mg / mL, 1453 CDU / mg, Sigma-Aldrich) at 37 ° C for 1 h. Cells were cultured in DMEM medium, with 10% fetal bovine serum (FBS), 100 units / mL penicillin and 100 µg / mL streptomycin (PS; all from Gibco by Life Technologies, CA, USA) at 37 ° C and 5 % CO 2. VenorGeM Classic Mycoplasma Detection Kit for conventional PCR (Minerva Biolabs, Berlin, Germany) was used to control mycoplasma contamination.
[0026] È stata effettuata inoltre un’analisi Short Tandem Repeats di 7 marcatori (TPOX, vWA, CSF180, D16S539, D7S820, D13S317, D5S818, Applied Biosystems Life Technologies). L’analisi è stata effettuata utilizzando ABI Prism 310 Genetic Analyzer (Applied Biosystems, Life Technologies) in modo da autenticare le linee cellulari. [0026] A Short Tandem Repeats analysis of 7 markers (TPOX, vWA, CSF180, D16S539, D7S820, D13S317, D5S818, Applied Biosystems Life Technologies) was also carried out. The analysis was carried out using ABI Prism 310 Genetic Analyzer (Applied Biosystems, Life Technologies) in order to authenticate the cell lines.
[0027] In seguito, il DNA e l’RNA sono stati isolati da lisato cellulare di tessuto tumorale congelato fresco (Mikro-Dismembrator U; B.Braun Biotech International, Melsungen, Germany) grazie all’utilizzo di AllPrep DNA/RNA Mini Kit (Qiagen, Hilden, Germany). La concentrazione degli acidi nucleici è stata misurata con BioPhotometer (Eppendorf, Hamburg, Germany). [0027] Subsequently, the DNA and RNA were isolated from fresh frozen tumor tissue cell lysate (Mikro-Dismembrator U; B.Braun Biotech International, Melsungen, Germany) using the AllPrep DNA / RNA Mini Kit (Qiagen, Hilden, Germany). Nucleic acid concentration was measured with BioPhotometer (Eppendorf, Hamburg, Germany).
[0028] A questo punto, librerie genomiche complete Shot-gun sono state preparate utilizzando il kit di preparazione di librerie KAPA (KAPA Biosystems). Seguendo le istruzioni date dal produttore, sono state quantificate e sequenziate grazie ad un HiSeq2000 (Illumina) con un coverage basso generando delle letture (reads) da 50bp. Le sequenze grezze sono state poi mappate al genoma umano di riferimento (NCBI37/hg19) con Burrows–Wheeler Aligner (BWA v0.5.8a). At this point, complete Shot-gun genomic libraries were prepared using the KAPA Library Preparation Kit (KAPA Biosystems). Following the instructions given by the manufacturer, they were quantified and sequenced thanks to a HiSeq2000 (Illumina) with low coverage generating reads of 50bp. The raw sequences were then mapped to the reference human genome (NCBI37 / hg19) with Burrows – Wheeler Aligner (BWA v0.5.8a).
[0029] Le librerie a RNA, utili a quantificare le mappate dei geni, sono state create utilizzando Illumina TruSeq RNA sample preparation kit V2 seguendo le indicazioni del produttore e sono state sequenziate successivamente con HiSeq2000 (Illumina) usando un flowcell V3 che genera 1×50bp reads. [0029] The RNA libraries, useful for quantifying the gene mappings, were created using Illumina TruSeq RNA sample preparation kit V2 following the manufacturer's instructions and were subsequently sequenced with HiSeq2000 (Illumina) using a V3 flowcell generating 1 × 50bp reads.
[0030] Le sequenze grezze sono poi state mappate al trascrittoma e al genoma umano di riferimento (NCBI37/hg19) usando gli strumenti TopHat 2.0 e Bowtie 2.0.32 859 670 letture sono state poi associate ai rispettivi geni grazie all’ HTSeq software package e sono state normalizzate con EDASeq. [0030] The raw sequences were then mapped to the transcriptome and the human reference genome (NCBI37 / hg19) using the tools TopHat 2.0 and Bowtie 2.0.32 859 670 readings were then associated to the respective genes thanks to the HTSeq software package and have been normalized with EDASeq.
[0031] Dall’analisi appena descritta è stato quindi possibile costruire la firma molecolare dell’mRNA associato all’attività di USP1. Pertanto, per verificare la possibilità di predire la prognosi, si è proceduto a selezionare i geni sovraespressi in maniera arbitraria ponendo un cut off di fold change superiore a 100. Sono stati quindi scelti i primi 10 geni, qui di seguito riportati, che risultano almeno 100 volte più espressi nelle cellule “responsive”, cioè rispondenti al trattamento con platino complessi, rispetto alle cellule “non responsive”, cioè non rispondenti al trattamento con platino complessi: [0031] From the analysis just described it was therefore possible to construct the molecular signature of the mRNA associated with the activity of USP1. Therefore, to verify the possibility of predicting the prognosis, we proceeded to select the overexpressed genes in an arbitrary manner by placing a fold change cut off greater than 100. The first 10 genes, shown below, were then chosen, which are at least 100 times more expressed in "responsive" cells, that is, responding to treatment with platinum complexes, compared to "non-responsive" cells, that is, not responding to treatment with platinum complexes:
CLDN6 (SEQ ID NO:1), COL11A1 (SEQ ID NO:2), SIM1 (SEQ ID NO:3), DKK1 (SEQ ID NO:4), GAL3ST3 (SEQ IDE NO:5), IGFBP5 (SEQ ID NO:6), DKK2 (SEQ ID NO:7), COL1A2 (SEQ ID NO:8), COL3A1 (SEQ ID NO:9), MAP3K7CL (SEQ ID NO:10). CLDN6 (SEQ ID NO: 1), COL11A1 (SEQ ID NO: 2), SIM1 (SEQ ID NO: 3), DKK1 (SEQ ID NO: 4), GAL3ST3 (SEQ IDE NO: 5), IGFBP5 (SEQ ID NO : 6), DKK2 (SEQ ID NO: 7), COL1A2 (SEQ ID NO: 8), COL3A1 (SEQ ID NO: 9), MAP3K7CL (SEQ ID NO: 10).
[0032] In base alle suddette indagini, è stato pensato di impiegare questa firma molecolare dei suddetti dieci geni per la realizzazione di un metodo per identificare soggetti oncologici resistenti al trattamento con agenti terapeutici a base di o che includono platino complessi, comprendente le fasi di: [0032] On the basis of the aforementioned investigations, it was thought to employ this molecular signature of the aforementioned ten genes for the realization of a method for identifying oncological subjects resistant to treatment with therapeutic agents based on or including platinum complexes, comprising the steps of :
- mettere a disposizione un campione biologico isolato dal corpo umano di un paziente oncologico; - make available a biological sample isolated from the human body of a cancer patient;
- estrarre l’RNA da detto campione; - extract the RNA from said sample;
- identificare la sovra-espressione (up-regulation) contemporanea dei suddetti dieci geni. - identify the contemporary up-regulation of the above ten genes.
[0033] Con il termine “campione biologico isolato dal corpo umano” si intende un frammento di tessuto fresco o conservato ottenuto da un tumore, sia tramite biopsia che con ago aspirato, oppure qualsiasi fluido biologico contenente cellule tumorali prelevato dal corpo umano. Tra i fluidi biologici vi sono ad esempio sangue intero, plasma, siero, liquido peritoneale, liquido linfatico, urina, liquido seminale. The term "biological sample isolated from the human body" means a fresh or preserved tissue fragment obtained from a tumor, either by biopsy or by needle aspiration, or any biological fluid containing tumor cells taken from the human body. Biological fluids include, for example, whole blood, plasma, serum, peritoneal fluid, lymphatic fluid, urine, seminal fluid.
[0034] Le “cellule tumorali” o cellule cancerose possono essere di origine da tumore o cancro primario di varia natura o da metastasi, come ad esempio cancro dell’epitelio ovarico (EOC), cancro del pancreas, dello stomaco, del rene, colon-rettale, del fegato, dei polmoni, della prostata o del seno. In particolare, per il carcinoma ovarico, il tumore più frequente è quello sieroso, considerata la forma il più aggressiva e letale degli EOC. [0034] The "tumor cells" or cancerous cells can be of origin from tumor or primary cancer of various kinds or from metastases, such as cancer of the ovarian epithelium (EOC), cancer of the pancreas, stomach, kidney, colon -rectal, liver, lung, prostate or breast. In particular, for ovarian cancer, the most frequent tumor is the serous one, considered the most aggressive and lethal form of EOC.
[0035] Con il termine “acidi nucleici” si intendono nucleotidi sia naturali che artificiali includenti polinucleotidi od oligonucleotidi, generalmente ribonucleotidi (RNA) o deossiribonucleotidi (DNA) sia modificati che non modificati. Pertanto, il suddetto termine comprende RNA o DNA a singolo o doppio filamento, RNA o DNA con zone a singolo o doppio filamento, molecole ibride RNA-DNA, cDNA, mRNA, tRNA, mi-RNA, sh-RNA. Acidi nucleici purificati sono costituiti da molecole sostanzialmente libere da altro materiale cellulare o mezzi colturali quando prodotti sinteticamente; sostanzialmente liberi significa puri al 60%, 70%, 80%, 90% e anche oltre il 99%. Il grado di purezza è determinato attraverso convenzionali metodiche di purificazione come la cromatografia a colonna, elettroforesi in gel di poliacrilammide o HPLC. The term "nucleic acids" means both natural and artificial nucleotides including polynucleotides or oligonucleotides, generally ribonucleotides (RNA) or deoxyribonucleotides (DNA), both modified and unmodified. Therefore, the above term includes single or double stranded RNA or DNA, single or double stranded RNA or DNA, RNA-DNA hybrid molecules, cDNA, mRNA, tRNA, mi-RNA, sh-RNA. Purified nucleic acids consist of molecules substantially free from other cellular material or culture media when synthetically produced; essentially free means 60%, 70%, 80%, 90% and even over 99% pure. The degree of purity is determined through conventional purification methods such as column chromatography, polyacrylamide gel electrophoresis or HPLC.
[0036] Con il termine “sonde” si intendono acidi nucleici senso o anti-senso impiegate in convenzionali tecniche di ibridizzazione e provviste di, cioè etichettate, molecole rilevabili, ossia una molecola segnale cioè una molecola capace di essere rilevata tramite convenzionali metodiche analitiche, quale ad esempio un fluoroforo, un radioisotopo, un enzima, un metallo pesante. [0036] The term "probes" refers to sense or anti-sense nucleic acids used in conventional hybridization techniques and provided with, ie labeled, detectable molecules, ie a signal molecule, ie a molecule capable of being detected through conventional analytical methods, such as for example a fluorophore, a radioisotope, an enzyme, a heavy metal.
[0037] Con il termine “firma molecolare” si intende un qualsiasi gruppo di marcatori molecolari biologici, in particolare una sequenza di acidi nucleici, specificatamente una sequenza di mRNA. Le sequenze sono caratterizzate dall’ avere almeno il 60%, 70%, 80%, 90% di identità o complementarietà con le sequenze geniche SEQ ID NO: 1-10 in accordo con la presente invenzione, tenendo presente varianti, mutazioni non funzionali e la degenerazione del codice. By the term "molecular signature" is meant any group of biological molecular markers, in particular a nucleic acid sequence, specifically an mRNA sequence. The sequences are characterized by having at least 60%, 70%, 80%, 90% identity or complementarity with the SEQ ID NO: 1-10 gene sequences according to the present invention, taking into account variants, non-functional mutations and the degeneration of the code.
[0038] Il termine “profili di espressione genica” indicano una specifica condizione di una cellula. In pratica, sulla base della valutazione di più geni simultaneamente, può essere prodotto un profilo genico univoco dello stato di una cellula. In altre parole, un tessuto normale può essere distinto da un tessuto malato (canceroso), ed all’interno dello stesso tessuto canceroso possono essere distinte differenti prognosi (buone o scarse prospettive di sopravvivenza a lungo termine). Ad esempio, il confronto tra sequenze che sono espresse in maniera differente tra cellule normali o cancerose ed anche tra cellule cancerose e cellule cancerose può aiutare ad individuare differenti regimi di trattamento chemioterapico. The term "gene expression profiles" indicates a specific condition of a cell. In practice, on the basis of the evaluation of several genes simultaneously, a unique gene profile of the state of a cell can be produced. In other words, a normal tissue can be distinguished from a diseased (cancerous) tissue, and within the same cancerous tissue different prognoses (good or poor long-term survival prospects) can be distinguished. For example, the comparison between sequences that are expressed differently between normal or cancerous cells and also between cancer cells and cancer cells can help to identify different chemotherapy treatment regimens.
[0039] Una “prognosi scarsa” indica un decremento del tasso di sopravvivenza libero da progressione. La sopravvivenza libera da progressione è il periodo di tempo in cui la malattia non progredisce e che intercorre tra la data di randomizzazione nello studio clinico e la data di progressione valutata ad esempio mediante TAC o RMN. A "poor prognosis" indicates a decrease in the progression-free survival rate. Progression-free survival is the period of time in which the disease does not progress and that elapses between the date of randomization in the clinical study and the date of progression assessed for example by CT or MRI.
[0040] “Amplificazione genica” indica un procedimento tramite il quale copie multiple di un gene o frammenti di gene sono formati in una particolare cellula o linea cellulare o attraverso PCR. La regione duplicata è generalmente chiamata “amplicone”. "Gene amplification" means a process by which multiple copies of a gene or gene fragments are formed in a particular cell or cell line or by PCR. The duplicated region is generally called the "amplicon".
[0041] Il termine “responsivo” riferito ad una condizione tumorale rispetto al trattamento con chemioterapici significa che il grado di crescita del tumore è inibito rispetto al grado di crescita senza trattamento di chemioterapici. “Non responsivo”, al contrario, indica un grado di crescita del tumore non inibito o con un tasso di crescita scarsamente ridotto o con un grado di crescita statisticamente non significativo. The term "responsive" referring to a tumor condition with respect to treatment with chemotherapy means that the degree of growth of the tumor is inhibited compared to the degree of growth without treatment with chemotherapy. "Unresponsive", on the other hand, indicates a tumor growth rate that is not inhibited or with a poorly reduced growth rate or with a statistically insignificant rate of growth.
[0042] Nel metodo secondo la presente invenzione, il campione biologico viene prelevato secondo convenzionali sistemi durante ad esempio un intervento chirurgico di asportazione della massa tumorale, prelievo di una piccola quantità di tessuto, ago aspirato. Alternativamente, il prelievo avviene attraverso siringa, spatola o altro noto dispositivo in grado di ottenere una quantità di un fluido corporeo da trattare per estrarre gli acidi nucleici. [0042] In the method according to the present invention, the biological sample is taken according to conventional systems during, for example, a surgical procedure for the removal of the tumor mass, removal of a small amount of tissue, needle aspiration. Alternatively, the withdrawal takes place through a syringe, spatula or other known device capable of obtaining a quantity of a body fluid to be treated in order to extract the nucleic acids.
[0043] L’estrazione degli acidi nucleici da detto campione avviene tramite altrettante convenzionali metodiche. Ad esempio, l’mRNA viene estratto da un campione di tessuto che è stato fissato tramite inglobamento in paraffina e fissato tramite formalina. In ogni caso, metodi generali di estrazione di mRNA sono ben noti nel settore e descritti in libri e manuali di biologia molecolare quali Ausubel et al., Current Protocols of Molecular Biology, John Wiley and Sons (1997), Rupp and Locker, Lab Invest.56: A67 (1987), De Andrés et al., BioTechniques 18:42044 (1995), Molecular Cloning – a Laboratory Manual, Sambrook and Russel. Molti metodi di estrazione di RNA impiegano forti denaturanti quali idrocloruro di guanidina o tiocianato di guanidina per rompere le cellule, solubilizzare i suoi componenti e denaturare RNAsi endogene. In particolare, l’isolamento di RNA viene realizzato tramite kit di purificazione, buffer set e proteasi commercialmente disponibili. Ad esempio, l’RNA totale da cellule in coltura può essere estratto usando mini-colonne Qiagen RNeasy, kit di purificazione MasterPure<TM >Complete DNA ed RNA (EPICENTRE<®>, Madison, WI), kit di isolamento Paraffin Block RNA (Ambion, inc.), RNA Stat-60 (Tel-Test) usato per RNA totale da campioni tissutali, centrifugazione su gradiente di densità in cloruro di cesio. [0043] The extraction of nucleic acids from said sample takes place through as many conventional methods. For example, mRNA is extracted from a tissue sample that has been fixed by embedding in paraffin and fixed with formalin. However, general mRNA extraction methods are well known in the field and described in molecular biology books and manuals such as Ausubel et al., Current Protocols of Molecular Biology, John Wiley and Sons (1997), Rupp and Locker, Lab Invest .56: A67 (1987), De Andrés et al., BioTechniques 18: 42044 (1995), Molecular Cloning - a Laboratory Manual, Sambrook and Russel. Many RNA extraction methods employ strong denaturants such as guanidine hydrochloride or guanidine thiocyanate to break cells, solubilize its components and denature endogenous RNase. In particular, the isolation of RNA is carried out using commercially available purification kits, buffer sets and proteases. For example, total RNA from cultured cells can be extracted using Qiagen RNeasy mini-columns, MasterPure <TM> Complete DNA and RNA purification kit (EPICENTRE <®>, Madison, WI), Paraffin Block RNA isolation kit ( Ambion, inc.), Stat-60 RNA (Tel-Test) used for total RNA from tissue samples, density gradient centrifugation in cesium chloride.
Preferibilmente, vengono impiegati i reagenti TRI (Molecular Reserach Center), TriZol (Ambion), Isogen (Nippon Gene) per estrarre l’RNA totale. I protocolli sono ampiamente noti e prevedono in generale la preparazione di reagenti nei quali omogenizzare i campioni, separare le fasi proteiche e DNA da quella dell’RNA, precipitare l’RNA e lavarlo dai residui dei reagenti con opportune soluzioni standardizzate. Preferita è la separazione di mRNA per mezzo di cromatografia ad affinità su colonna di oligo(dT)-cellulosa o, nel caso di molti campioni da processare o piccole quantità, cromatografia in bach su identica colonna. Preferably, the TRI (Molecular Reserach Center), TriZol (Ambion), Isogen (Nippon Gene) reagents are used to extract the total RNA. The protocols are widely known and generally provide for the preparation of reagents in which to homogenize the samples, separate the protein and DNA phases from that of the RNA, precipitate the RNA and wash it from the reagent residues with appropriate standardized solutions. Preferred is the separation of mRNA by means of affinity chromatography on an oligo (dT) -cellulose column or, in the case of many samples to be processed or small quantities, bach chromatography on an identical column.
[0044] Per la determinazione della firma molecolare di mRNA di interesse dall’RNA estratto ed isolato come in precedenza, è possibile impiegare due generali metodiche note che si basano sull’analisi di ibridizzazione dei polinucleotidi e sul sequenziamento dei polinucleotidi. Tra queste, le più usate includono northern blotting; ibridizzazione in situ; saggi di protezione dall’RNAsi; reazione a catena della polimerasi a trascrizione inversa (RT-PCR). Alternativamente, possono essere impiegati anticorpi monoclonali o policlonali che riconoscono specifici duplex di RNA, DNA-RNA, RNA-proteina. In particolare, molto utili sono le metodiche di RT-PCR quantitativa real-time (qRT-PCR). [0044] For the determination of the molecular signature of mRNA of interest from the extracted and isolated RNA as previously, it is possible to use two general known methods which are based on the hybridization analysis of polynucleotides and on the sequencing of polynucleotides. Among these, the most used include northern blotting; in situ hybridization; RNase protection assays; reverse transcription polymerase chain reaction (RT-PCR). Alternatively, monoclonal or polyclonal antibodies that recognize specific duplexes of RNA, DNA-RNA, RNA-protein can be used. In particular, the methods of quantitative real-time RT-PCR (qRT-PCR) are very useful.
[0045] Tra le suddette differenti tecniche, la più sensibile e flessibile è la qRT-PCR che infatti può essere usata per confrontare i livelli di mRNA in differenti popolazioni di campioni, sia normali che tumorali, con o senza trattamento chemioterapico. In questo modo, è possibile discriminare tra loro mRNA strutturalmente molto vicini. [0045] Among the aforementioned different techniques, the most sensitive and flexible is qRT-PCR which in fact can be used to compare the mRNA levels in different populations of samples, both normal and tumor, with or without chemotherapy treatment. In this way, it is possible to discriminate between structurally very close mRNAs.
[0046] Nella determinazione di un profilo di espressione genica tramite RT-PCR, la prima fase consiste solitamente nel retro-trascrivere l’RNA in cDNA seguito dalla sua amplificazione nella reazione PCR. A tal scopo, vengono impiegate note trascrittasi inverse che catalizzano tale reazione: tra le più usate si possono citare la trascrittasi inversa del virus della mieloblastosi aviare e la trascrittasi inversa del virus della leucemia murino di Moloney. Le trascrittasi tipicamente necessitano di specifici primer, esameri o nonameri random oppure oligo-dT primers. Esempi di kit in tal senso sono il kit GeneAmp RNA PCR (Perkin Elmer, CA, USA) ed il kit di trascrittasi inversa AMW (Promega). [0046] In determining a gene expression profile by RT-PCR, the first step usually consists in retro-transcribing the RNA into cDNA followed by its amplification in the PCR reaction. For this purpose, known reverse transcriptases are used which catalyze this reaction: among the most used we can mention the reverse transcriptase of the avian myeloblastosis virus and the reverse transcriptase of the Moloney murine leukemia virus. Transcriptases typically require specific primers, random hexamers or nonamers, or oligo-dT primers. Examples of such kits are the GeneAmp RNA PCR kit (Perkin Elmer, CA, USA) and the AMW reverse transcriptase kit (Promega).
[0047] Numerose varietà di DNA polimerasi termostabili DNA-dipendenti possono essere impiegate per la successiva fase della PCR. Tipicamente, può essere impiegata la Taq DNA polimerasi; ad esempio la TaqMan® PCR sfrutta l’attività 5’-nucleasica della polimerasi Taq o Tth per idrolizzare un legame di ibridizzazione di una sonda per il suo amplicone target, ma qualsiasi enzima equivalente con attività 5’-nucleasica può essere usato. Due oligonucleotidi primer sono usati per generare l’amplicone tipico di una reazione PCR. Un terzo oligonucleotide, o sonda, viene progettato per rilevare sequenze nucleotidiche localizzate tra i due primer. La sonda viene marcata con un colorante reporter fluorescente ed un colorante quencher fluorescente. Durante la reazione di amplificazione, la Taq polimerasi scinde la sonda liberando il colorante reporter dall’effetto mascherante del colorante quencher. Pertanto, una molecola di colorante reporter viene liberata, e quindi identificabile, per ogni molecola sintetizzata, consentendo perciò la valutazione quantitativa del dato. Numerous varieties of DNA-dependent thermostable DNA polymerases can be used for the next step of the PCR. Typically, Taq DNA polymerase can be employed; for example, TaqMan® PCR exploits the 5-nuclease activity of the Taq or Tth polymerase to hydrolyze a hybridization bond of a probe to its target amplicon, but any equivalent enzyme with 5'-nuclease activity can be used. Two primer oligonucleotides are used to generate the typical amplicon of a PCR reaction. A third oligonucleotide, or probe, is designed to detect nucleotide sequences located between the two primers. The probe is labeled with a fluorescent reporter dye and a fluorescent quencher dye. During the amplification reaction, the Taq polymerase cleaves the probe, releasing the reporter dye from the masking effect of the quencher dye. Therefore, a reporter dye molecule is released, and therefore identifiable, for each synthesized molecule, thus allowing the quantitative evaluation of the data.
[0048] Apparecchiature solitamente impiegate per questo tipo di PCR sono ad esempio ABI PRISM 7700TM Sequence Detection System TM, GeneAmp 5700 Sequence Detection System (Perkin-Elmer Biosystems), LightCycler (Roche), MyiQ2 Two Color Real-time PCR Detection System (Biorad). [0048] Devices usually used for this type of PCR are for example ABI PRISM 7700TM Sequence Detection System TM, GeneAmp 5700 Sequence Detection System (Perkin-Elmer Biosystems), LightCycler (Roche), MyiQ2 Two Color Real-time PCR Detection System (Biorad ).
[0049] Un differente approccio per la valutazione di un profilo di espressione genica è rappresentato dalla tecnica del microarray, ossia chip a DNA. Una serie di sequenze di interesse (oligonucleotidiche) vengono attaccate sulla superficie di un substrato quale un vetrino, una lamina di plastica, chip di silicio, in modo da dare origine ad un array o matrice di sequenze. Queste sequenze sono poi ibridizzate con sonde specifiche di DNA. In particolare, gli inserti di PCR amplificati di cloni di cDNA vengono fissati su un substrato. Sonde specifiche di cDNA con etichette fluorescenti ibridizzano in punti ben precisi sulla matrice con detti inserti. Dopo rimozione delle sonde che non hanno ibridizzato, il supporto viene analizzato con un’opportuna apparecchiatura di rilevamento della fluorescenza. [0049] A different approach for the evaluation of a gene expression profile is represented by the microarray technique, ie DNA chip. A series of sequences of interest (oligonucleotides) are attached to the surface of a substrate such as a slide, a plastic sheet, a silicon chip, in order to give rise to an array or matrix of sequences. These sequences are then hybridized with specific DNA probes. In particular, the amplified PCR inserts of cDNA clones are fixed on a substrate. Specific cDNA probes with fluorescent labels hybridize at precise points on the matrix with said inserts. After removing the probes that have not hybridized, the support is analyzed with an appropriate fluorescence detection equipment.
[0050] Ulteriori sistemi di rilevamento dell’mRNA comprendono le tecnologie di sequenziamento di seconda e terza generazione come il sequenziamento secondo Illumina, Ion Torrent, SMRT, nanopore, SeqLL. [0050] Further mRNA detection systems include second and third generation sequencing technologies such as sequencing according to Illumina, Ion Torrent, SMRT, nanopore, SeqLL.
[0051] L’impiego di metodi immunoistochimici sono anche adatti a rilevare i livelli di espressione dei marcatori. Come è noto, con questi metodi, vengono prodotti anticorpi monoclonali o policlonali mirati contro i marcatori di interesse (Antibodies, “A laboratory Manual”, Cold Spring Harbor Laboratory Press, Second Edition, 2014; “Critical Steps in the Production of Polyclonal and Monoclonal Antibodies: Evaluation and Recommendations”, Marlies Leenaars, Coenraad F. M. Hendriksen (ILAR Journal, Volume 46, Issue 3, 1 January 2005, pagine 269–279; M. Schreier et al. “Hybridoma Techniques”, 1980; Hammerling et al., “Monoclonal Antibodies and T-cell Hybridomas”,1981; Kennet et al., Monoclonal Antibodies, 1980; Harlow et al.). Gli anticorpi possono essere rilevati tramite etichette ad essi legate quali elementi radioattivi, composti fluorescenti, apteni come la biotina o enzimi come la perossidasi del rafano o la fosfatasi alcalina. Alternativamente, può essere usato un anticorpo primario non marcato ed un anticorpo secondario marcato. Tramite opportuni rilevatori quali contatori di radioattività, scintillatori, spettrofotometri, colorimetri, fluorimetri, luminometri e densitometri. [0051] The use of immunohistochemical methods are also suitable for detecting the expression levels of markers. As is known, with these methods, monoclonal or polyclonal antibodies are produced targeted against the markers of interest (Antibodies, "A laboratory Manual", Cold Spring Harbor Laboratory Press, Second Edition, 2014; "Critical Steps in the Production of Polyclonal and Monoclonal Antibodies: Evaluation and Recommendations ", Marlies Leenaars, Coenraad F. M. Hendriksen (ILAR Journal, Volume 46, Issue 3, 1 January 2005, pages 269-279; M. Schreier et al." Hybridoma Techniques ", 1980; Hammerling et al., "Monoclonal Antibodies and T-cell Hybridomas", 1981; Kennet et al., Monoclonal Antibodies, 1980; Harlow et al.). Antibodies can be detected by labels attached to them such as radioactive elements, fluorescent compounds, haptens such as biotin or enzymes such as horseradish peroxidase or alkaline phosphatase. Alternatively, an unlabeled primary antibody and a labeled secondary antibody can be used. By means of suitable detectors such as radioactivity counters, scin tillers, spectrophotometers, colorimeters, fluorimeters, luminometers and densitometers.
[0052] Le sonde usate per rilevare nei campioni di interesse la quantità di mRNA della firma molecolare secondo l’invenzione sono prodotte in accordo con metodi convenzionali tra cui il più diffuso è la PCR, ma possono essere usati anche priming con oligonucleotidi random, nick-translation. In ogni caso, le sonde sono piccoli frammenti di oligonicleotidi sintetizzati in modo da risultare complementare, e quindi ibridizzare, con sequenze specifiche corte di un bersaglio quale una porzione di un gene o di un mRNA. Le sonde sono costruite in modo da portare elementi radioattivi come il <32>P in modo da essere rilevabili tramite autoradiografia, oppure elementi o gruppi chimici coniugati con gli acidi nucleici tramite via enzimatica, fotochimica o reazione chimica. Un esempio di tale metodica grandemente impiegato è costituito da biotina che consente il rilevamento tramite interazione con streptavidina coniugata ad un enzima reporter. Un altro sistema di rilevamento non radioattivo è rappresentato da DIG. [0052] The probes used to detect in the samples of interest the quantity of mRNA of the molecular signature according to the invention are produced in accordance with conventional methods among which the most widespread is PCR, but priming with random, nick oligonucleotides can also be used -translation. In any case, the probes are small fragments of oligonicleotides synthesized in such a way as to complement, and thus hybridize, with specific short sequences of a target such as a portion of a gene or an mRNA. The probes are built to carry radioactive elements such as <32> P so as to be detectable by autoradiography, or elements or chemical groups conjugated with nucleic acids by enzymatic, photochemical or chemical reaction. An example of this widely used method is constituted by biotin which allows detection by interaction with streptavidin conjugated to a reporter enzyme. Another non-radioactive detection system is represented by DIG.
[0053] I sistemi di rilevamento delle sonde sono generalmente i saggi colorimetrici, la chemioluminescenza e i saggi fluorescenti, ampiamente noti al tecnico del settore. [0053] The detection systems of the probes are generally the colorimetric assays, the chemoluminescence and the fluorescent assays, widely known to the skilled in the art.
[0054] In particolare è stato visto che l’indice di sovra espressione dei dieci geni dell’invenzione è determinato quando in base alle suddette analisi condotte su almeno dieci campioni si trova una quantità di mRNA corrispondente superiore al valore mediano dei dieci campioni considerati. Questo valore soglia per stabilire la presenza di sovra espressione è stato trovato per esempio impiegando l’analisi di qRT-PCR. Tuttavia, è da tenere presente che variando il metodo analitico potrebbero variare anche le quantità trovate. In ogni caso, impiegando la procedura descritta qui di seguito è possibile comunque correlare quantità di mRNA con una situazione di sovra espressione. [0054] In particular, it has been seen that the over-expression index of the ten genes of the invention is determined when, based on the aforementioned analyzes conducted on at least ten samples, there is a corresponding amount of mRNA higher than the median value of the ten samples considered. This threshold value for establishing the presence of overexpression was found for example using qRT-PCR analysis. However, it should be kept in mind that by varying the analytical method, the quantities found could also vary. In any case, using the procedure described below it is still possible to correlate quantities of mRNA with a situation of over-expression.
[0055] Nel metodo secondo la presente invenzione, la fase di determinazione della sovra-espressione dei suddetti dieci geni selezionati comprende una lista dei geni differenzialmente espressi nelle linee responsive al trattamento combinato platino+inibitore di USP1 e sono stati selezionati secondo la metodologia descritta in Kreuzinger et al., Cancer letters July 1, 2015: 362 (2), 218–228. Per ogni gene il valore corrisponde alla media di tre replicati tecnici per ogni linea cellulare e la fold change (FC) viene calcolata secondo la formula FC= mean(C1+0,5, C2+0,5, C3+0,5)/mean(C4+0,5, C5+0,5), dove C1-C3 sono le linee cellulari n.8587, 13914_1 e 13699, mentre C4-5: 14333 e 12370 (0,5 viene aggiunto a tutti i valori per evitare il caso N/0). I valori C1-C5 corrispondono al numero di letture (“reads”) per ogni gene per ogni linea cellulare ottenuto dal saggio di RNA seq. I geni con FC<4 (in entrambe le direzioni) sono stati scartati. [0055] In the method according to the present invention, the step of determining the overexpression of the aforementioned ten selected genes comprises a list of genes differentially expressed in the lines responsive to the combined platinum + USP1 inhibitor treatment and were selected according to the methodology described in Kreuzinger et al., Cancer letters July 1, 2015: 362 (2), 218–228. For each gene the value corresponds to the average of three technical replicates for each cell line and the fold change (FC) is calculated according to the formula FC = mean (C1 + 0.5, C2 + 0.5, C3 + 0.5) / mean (C4 + 0.5, C5 + 0.5), where C1-C3 are cell lines # 8587, 13914_1 and 13699, while C4-5: 14333 and 12370 (0.5 is added to all values to avoid case N / 0). The C1-C5 values correspond to the number of reads for each gene for each cell line obtained from the RNA seq assay. Genes with FC <4 (in both directions) were discarded.
[0056] In accordo con una forma di realizzazione preferita dell’invenzione, i dieci geni vengono pesati. Con il termine “peso” si intende la grandezza assoluta di un elemento in un calcolo matematico. Pertanto, il peso di ognuno dei dieci geni dell’invenzione nella firma molecolare può essere determinato su un data set di campioni di pazienti utilizzando metodi di apprendimento noti nel settore. [0056] In accordance with a preferred embodiment of the invention, the ten genes are weighed. The term "weight" refers to the absolute size of an element in a mathematical calculation. Therefore, the weight of each of the ten genes of the invention in the molecular signature can be determined on a data set of patient samples using learning methods known in the field.
[0057] In particolare, per ogni tipo di tumore i pesi per ogni gene sono stati scelti in maniera “arbitraria’” dopo aver condotto un’analisi sulla curva di sopravvivenza considerando ogni gene singolarmente. I geni che separavano meglio le due curve (geni la cui bassa espressione correla con una maggiore sopravvivenza) hanno avuto un valore più alto (max 3,5), I geni che invece non separavano molto le due curve hanno avuto il peso più basso (min 0). I geni che andavano al contrario (la loro alta espressione era indice di una maggiore sopravvivenza) hanno avuto un peso “inverted”. [0057] In particular, for each type of tumor the weights for each gene were chosen in an "arbitrary" manner after having conducted an analysis on the survival curve considering each gene individually. The genes that separated the two curves better (genes whose low expression correlates with greater survival) had a higher value (max 3.5), the genes that did not separate the two curves very much had the lowest weight ( min 0). The genes that went the other way around (their high expression was indicative of a higher survival) had an "inverted" weight.
[0058] Nella Tabella 1 seguente, vengono riportati i valori dell’HR (Hazard Ratio), come prodotto dal programma KM Plotter disponibile online, dei dieci geni secondo l’invenzione. [0058] Table 1 below shows the HR (Hazard Ratio) values, as produced by the KM Plotter program available online, of the ten genes according to the invention.
Tabella 1 Table 1
[0059] Una volta ottenuti i suddetti valori, sono stati definiti i suddetti pesi “arbitrari” secondo quanto riportato nella seguente Tabella 2. [0059] Once the above-mentioned values have been obtained, the above-mentioned "arbitrary" weights have been defined as reported in the following Table 2.
Tabella 2 Table 2
[0060] Pertanto, usando i valori dei suddetti pesi della tabella 2 e correlandoli con ciascuno dei dieci geni, si posso ottenere i pesi necessari ad ottenere le curve in accordo con il programma KM Plotter, come spiegato più in dettaglio con riferimento agli esempi specifici di realizzazione. Questi pesi sono riportati nella seguente Tabella 3. [0060] Therefore, using the values of the aforementioned weights of table 2 and correlating them with each of the ten genes, it is possible to obtain the weights necessary to obtain the curves in accordance with the KM Plotter program, as explained in more detail with reference to the specific examples of realization. These weights are shown in the following Table 3.
Tabella 3 Table 3
[0061] E’ stato quindi stabilito che per valori del peso compresi tra 0.1 e 0.5 si ha un peso del singolo gene basso; valori tra 0,6 e 1,5 si ha un peso medio; valori tra 1,6 e 2 si ha un peso alto e tra 2,1 e 3,5 si ha un peso molto alto. Ne deriva che la corrispondente espressione genica diventa variamente importante, a seconda del tipo di cancro, nella valutazione della somministrazione della terapia. [0061] It has therefore been established that for weight values between 0.1 and 0.5 there is a low single gene weight; values between 0.6 and 1.5 have an average weight; values between 1.6 and 2 have a high weight and between 2.1 and 3.5 there is a very high weight. It follows that the corresponding gene expression becomes variously important, depending on the type of cancer, in the evaluation of the administration of the therapy.
[0062] In accordo con una ulteriore forma di realizzazione, il metodo comprende una fase di valutazione della previsione di un trattamento con agenti terapeutici a base di o che includono platino complessi, in cui detta fase prevede la valutazione dell’espressione di ogni gene utilizzando il “Best cut off” come soglia per distinguere un gene ad alta espressione o a bassa espressione. [0062] According to a further embodiment, the method comprises a step of evaluating the prediction of a treatment with therapeutic agents based on or including platinum complexes, in which said step involves evaluating the expression of each gene using the "Best cut off" as a threshold to distinguish a gene with high or low expression.
[0063] Un ulteriore oggetto dell’invenzione è un kit per l’identificazione di soggetti oncologici resistenti al trattamento di agenti terapeutici a base di platino complessi, comprendente preferibilmente una micropiastra i cui pozzetti sono rivestiti da un anticorpo o da sonde costruite come in precedenza, un contenitore per un campione di controllo comprendente le sequenze dell’mRNA dei suddetti dieci geni, un [0063] A further object of the invention is a kit for the identification of oncological subjects resistant to the treatment of complex platinum-based therapeutic agents, preferably comprising a microplate whose wells are coated with an antibody or by probes constructed as previously , a container for a control sample comprising the mRNA sequences of the aforementioned ten genes, a
contenitore per una sostanza diluente per il campione di controllo ed uno per il campione da testare, un contenitore per un tampone, un contenitore per una soluzione substrato, un contenitore per una soluzione di terminazione della reazione, contenitore per un tampone di lavaggio. In aggiunta, il kit comprende un foglietto illustrativo con le spiegazioni di come preparare i campioni, tempi di incubazione e lavaggi, condizioni di lettura del segnale rilevato con fotometro o apparecchiatura alternativa a seconda del segnale da rilevare. container for a diluent substance for the control sample and one for the sample to be tested, a container for a buffer, a container for a substrate solution, a container for a reaction termination solution, container for a wash buffer. In addition, the kit includes a package insert with explanations of how to prepare the samples, incubation and washing times, reading conditions of the signal detected with a photometer or alternative equipment depending on the signal to be detected.
[0064] In accordo con una forma preferita dell’invenzione, il kit può comprendere un campione di riferimento o una tabella di dati prestabiliti di confronto tra valori e/o tipologie di molecole rilevate e corrispondenti quantità riferite ai valori soglia indicativi della presenza di resistenza al trattamento con platino complessi. [0064] In accordance with a preferred form of the invention, the kit can comprise a reference sample or a table of predetermined data for comparing the values and / or types of molecules detected and corresponding quantities referred to the threshold values indicative of the presence of resistance to complex platinum treatment.
[0065] Verranno ora descritte alcune realizzazioni dell’invenzione date a puro titolo di esempio non limitativo. [0065] Some embodiments of the invention will now be described, given purely by way of non-limiting example.
[0066] L’mRNA è stato estratto da una biopsia in accordo con la procedura descritta in Kreuzinger et al.2015, riportato in precedenza. [0066] The mRNA was extracted from a biopsy in accordance with the procedure described in Kreuzinger et al. 2015, previously reported.
[0067] Per confermare il ruolo dei biomarker prognostici e predittivi, è usato un database online che contiene i dati di espressione da microarray per il carcinoma dell’ovaio. Tale database contiene i dati clinici di questi pazienti ed è anche possibile estrarre i dati sull’Overall Survival e disease free survival dei pazienti. Tale tool, il KmPlotter è capace di disegnare curve di sopravvivenza di una coorte di pazienti per geni individuali. Attraverso il tool online KM Plotter (http://kmplot.com) i nomi dei vari geni della firma possono essere inseriti nella casella del Gene symbol (scegliendo tra sonde possibili quella con l’indicazione verde in quanto considerate dal programma la più affidabile) e selezionando l’opzione “Use multigene classifier”. Gli stessi passaggi sono stati effettuati per il carcinoma mammario triplo negativo, carcinoma mammario positivo al recettore dell’estrogeno e per il tumore al polmone. [0067] To confirm the role of prognostic and predictive biomarkers, an online database containing microarray expression data for ovarian cancer is used. This database contains the clinical data of these patients and it is also possible to extract data on the patients' Overall Survival and disease free survival. This tool, the KmPlotter is capable of drawing survival curves of a cohort of patients for individual genes. Through the KM Plotter online tool (http://kmplot.com) the names of the various signature genes can be entered in the Gene symbol box (choosing the one with the green indication among possible probes as they are considered the most reliable by the program) and selecting the option “Use multigene classifier”. The same steps were carried out for triple negative breast cancer, estrogen receptor positive breast cancer and for lung cancer.
MODELLO DEL CANCRO DELL’EPITELIO OVARICO (EOC) MODEL OF OVARIAN EPITHELIUM CANCER (EOC)
[0068] Nella schermata principale del programma KM Plotter, si inseriscono i nomi dei geni relativi alla firma molecolare secondo l’invenzione nello spazio “Affy id/Gene symbol” usando l’opzione “Use multi gene classifier”, si seleziona "Auto select best cutoff” nella casella “Split patients by”, nelle caselle “Restrict analysis to subtypes” dove vengono scelte le opzioni relative allo stadio, grado, istotipo e stato di p53 dei pazienti da prendere in considerazione viene scelta per tutte le voci l’opzione “all”. Infine nelle caselle “Restrict analysis to treatment groups” dove vengono scelte le opzioni relative al tipo di chirurgia e chemioterapia, anche in questo caso viene scelta per tutte le voci l’opzione “all”. A questo punto si lancia l’analisi cliccando su “Draw Kaplan-Meier plot”. Il risultato è il diagramma di figura 2 in cui il valore di HR (Hazard Ratio) calcolato dal programma è la media dei singoli HR di ognuno dei dieci geni secondo la presente invenzione. [0068] In the main screen of the KM Plotter program, enter the names of the genes relating to the molecular signature according to the invention in the "Affy id / Gene symbol" space using the "Use multi gene classifier" option, select "Auto select best cutoff "in the" Split patients by "box, in the" Restrict analysis to subtypes "boxes where the options relating to the stage, grade, histotype and p53 status of the patients to be considered are chosen, the option is chosen for all items Finally, in the boxes "Restrict analysis to treatment groups" where the options relating to the type of surgery and chemotherapy are chosen, also in this case the option "all" is chosen for all items. analysis by clicking on “Draw Kaplan-Meier plot.” The result is the diagram of figure 2 in which the HR (Hazard Ratio) value calculated by the program is the average of the individual HRs of each of the ten genes according to the present invention.
CARCINOMA MAMMARIO TRIPLO NEGATIVO TRIPLE NEGATIVE BREAST CANCER
[0069] Nella schermata principale del programma KM Plotter, si inseriscono i nomi dei geni relativi alla firma molecolare nello spazio “Affy id/Gene symbol” usando l’opzione “Use multi gene classifier”, si seleziona "Auto select best cutoff” nella casella “Split patients by”, nelle caselle “Restrict analysis to subtypes” vengono selzionate solo le pazienti con ER status, PR status e HER2 status negativi, mentre per le altre voci viene scelta per tutte l’opzione “all”. Infine nelle caselle “Restrict analysis to selected cohorts” dove non viene selezionata alcuna opzione. A questo punto si lancia l’analisi cliccando su “Draw Kaplan-Meier plot”. Il risultato è il diagramma di figura 3 in cui il valore di HR (Hazard Ratio) calcolato dal programma è la media dei singoli HR di ognuno dei dieci geni secondo la presente invenzione. [0069] In the main screen of the KM Plotter program, enter the names of the genes relating to the molecular signature in the "Affy id / Gene symbol" space using the "Use multi gene classifier" option, select "Auto select best cutoff" in the "Split patients by" box, in the "Restrict analysis to subtypes" boxes only patients with negative ER status, PR status and HER2 status are selected, while for the other items the "all" option is selected for all. "Restrict analysis to selected cohorts" where no option is selected. At this point the analysis is launched by clicking on "Draw Kaplan-Meier plot". The result is the diagram of figure 3 in which the HR (Hazard Ratio) value calculated by the program is the average of the individual HRs of each of the ten genes according to the present invention.
CANCRO POLMONARE LUNG CANCER
[0070] Nella schermata principale del programma KM Plotter, si inseriscono i nomi dei geni relativi alla firma molecolare nello spazio “Affy id/Gene symbol” usando l’opzione “Use multi gene classifier”, si seleziona "Auto select best cutoff” nella casella “Split patients by”, nelle caselle “Restrict analysis to subtypes” dove vengono scelte le opzioni relative allo stadio, grado, istotipo del tumore, e il sesso dei pazienti da prendere in considerazione viene scelta per tutte le voci l’opzione “all”. Infine nelle caselle “Restrict analysis to treatment groups” dove vengono scelte le opzioni relative al tipo di chirurgia, chemioterapia e radioterapia, anche in questo caso viene scelta per tutte le voci l’opzione “all”. A questo punto si lancia l’analisi cliccando su “Draw Kaplan-Meier plot”. Il risultato è il diagramma di figura 4 in cui il valore di HR (Hazard Ratio) calcolato dal programma è la media dei singoli HR di ognuno dei dieci geni secondo la presente invenzione. [0070] In the main screen of the KM Plotter program, enter the names of the genes relating to the molecular signature in the "Affy id / Gene symbol" space using the "Use multi gene classifier" option, select "Auto select best cutoff" in the "Split patients by" box, in the "Restrict analysis to subtypes" boxes where the options relating to the stage, grade, histotype of the tumor, and the sex of the patients to be taken into consideration are chosen, the option "all Finally, in the boxes "Restrict analysis to treatment groups" where the options relating to the type of surgery, chemotherapy and radiotherapy are chosen, also in this case the option "all" is chosen for all items. analysis by clicking on “Draw Kaplan-Meier plot.” The result is the diagram of figure 4 in which the HR (Hazard Ratio) value calculated by the program is the average of the individual HRs of each of the ten genes according to the present invention.
CARCINOMA MAMMARIO POSITIVO AL RECETTORE DELL’ESTROGENO BREAST CANCER POSITIVE TO ESTROGEN RECEPTOR
[0071] Nella schermata principale del programma KM Plotter, si inseriscono i nomi dei geni relativi alla firma molecolare nello spazio “Affy id/Gene symbol” usando l’opzione “Use multi gene classifier”, si seleziona "Auto select best cutoff” nella casella “Split patients by”, nelle caselle “Restrict analysis to subtypes” vengono selzionate solo le pazienti con ER status positivo, mentre per le altre voci viene scelta per tutte l’opzione “all”. Infine nelle caselle “Restrict analysis to selected cohorts” dove non viene selezionata alcuna opzione. A questo punto si lancia l’analisi cliccando su “Draw Kaplan-Meier plot”. Il risultato è il diagramma di figura 2 in cui il valore di HR (Hazard Ratio) calcolato dal programma è la media dei singoli HR di ognuno dei dieci geni secondo la presente invenzione. [0071] In the main screen of the KM Plotter program, enter the names of the genes relating to the molecular signature in the "Affy id / Gene symbol" space using the "Use multi gene classifier" option, select "Auto select best cutoff" in the "Split patients by" box, in the "Restrict analysis to subtypes" boxes only patients with positive ER status are selected, while for the other items the "all" option is selected for all. Finally, in the "Restrict analysis to selected cohorts" boxes "Where no option is selected. At this point the analysis is launched by clicking on" Draw Kaplan-Meier plot ". The result is the diagram in figure 2 in which the HR (Hazard Ratio) value calculated by the program is the average of the individual HRs of each of the ten genes according to the present invention.
LISTATO DELLE SEQUENZE LIST OF SEQUENCES
<110> Centro di Riferimento Oncologico <110> Oncology Reference Center
<120> Metodo per identificare soggetti resistenti al trattamento con farmaci a base di platino e relativo kit <120> Method to identify subjects resistant to treatment with platinum-based drugs and related kit
<130> B18-143 <130> B18-143
<170> PatentIn version 3.5 <170> PatentIn version 3.5
<210> 1 <210> 1
<211> 1389 <211> 1389
<212> DNA <212> DNA
<213> Homo sapiens <213> Homo sapiens
<400> 1 <400> 1
cgacactcgg cctaggaatt tcccttatct ccttcgcagt gcagctcctt caacctcgcc 60 atggcctctg ccggaatgca gatcctggga gtcgtcctga cactgctggg ctgggtgaat 120 ggcctggtct cctgtgccct gcccatgtgg aaggtgaccg ctttcatcgg caacagcatc 180 gtggtggccc aggtggtgtg ggagggcctg tggatgtcct gcgtggtgca gagcaccggc 240 cagatgcagt gcaaggtgta cgactcactg ctggcgctgc cacaggacct gcaggctgca 300 cgtgccctct gtgtcatcgc cctccttgtg gccctgttcg gcttgctggt ctaccttgct 360 ggggccaagt gtaccacctg tgtggaggag aaggattcca aggcccgcct ggtgctcacc 420 tctgggattg tctttgtcat ctcaggggtc ctgacgctaa tccccgtgtg ctggacggcg 480 catgccatca tccgggactt ctataacccc ctggtggctg aggcccaaaa gcgggagctg 540 ggggcctccc tctacttggg ctgggcggcc tcaggccttt tgttgctggg tggggggttg 600 ctgtgctgca cttgcccctc gggggggtcc cagggcccca gccattacat ggcccgctac 660 tcaacatctg cccctgccat ctctcggggg ccctctgagt accctaccaa gaattacgtc 720 tgacgtggag gggaatgggg gctccgctgg cgctagagcc atccagaagt ggcagtgccc 780 aacagctttg ggatgggttc gtaccttttg tttctgcctc ctgctatttt tcttttgact 840 gaggatattt aaaattcatt tgaaaactga gccaaggtgt tgactcagac tctcacttag 900 gctctgctgt ttctcaccct tggatgatgg agccaaagag gggatgcttt gagattctgg 960 atcttgacat gcccatctta gaagccagtc aagctatgga actaatgcgg aggctgcttg 1020 ctgtgctggc tttgcaacaa gacagactgt ccccaagagt tcctgctgct gctgggggct 1080 gggcttccct agatgtcact ggacagctgc cccccatcct actcaggtct ctggagctcc 1140 tctcttcacc cctggaaaaa caaatgatct gttaacaaag gactgcccac ctccggaact 1200 tctgacctct gtttcctccg tcctgataag acgtccaccc cccagggcca ggtcccagct 1260 atgtagaccc ccgcccccac ctccaacact gcacccttct gccctgcccc cctcgtctca 1320 ccccctttac actcacattt ttatcaaata aagcatgttt tgttagtgca aaaaaaaaaa 1380 aaaaaaaaa 1389 cgacactcgg cctaggaatt tcccttatct ccttcgcagt gcagctcctt caacctcgcc 60 atggcctctg ccggaatgca gatcctggga gtcgtcctga cactgctggg ctgggtgaat 120 ggcctggtct cctgtgccct gcccatgtgg aaggtgaccg ctttcatcgg caacagcatc 180 gtggtggccc aggtggtgtg ggagggcctg tggatgtcct gcgtggtgca gagcaccggc 240 cagatgcagt gcaaggtgta cgactcactg ctggcgctgc cacaggacct gcaggctgca 300 cgtgccctct gtgtcatcgc cctccttgtg gccctgttcg gcttgctggt ctaccttgct 360 ggggccaagt gtaccacctg tgtggaggag aaggattcca aggcccgcct ggtgctcacc 420 tctgggattg tctttgtcat ctcaggggtc ctgacgctaa tccccgtgtg ctggacggcg 480 catgccatca tccgggactt ctataacccc ctggtggctg aggcccaaaa gcgggagctg 540 ggggcctccc tctacttggg ctgggcggcc tcaggccttt tgttgctggg tggggggttg 600 ctgtgctgca cttgcccctc gggggggtcc cagggcccca gccattacat ggcccgctac 660 tcaacatctg cccctgccat ctctcggggg ccctctgagt accctaccaa gaattacgtc 720 tgacgtggag gggaatgggg gctccgctgg cgctagagcc atccagaagt ggcagtgccc 780 aacagctttg ggatgggttc gtaccttttg tttctgcctc ctgctatttt tcttttgact 840 gaggatattt aaaattcatt tgaaaactga gccaaggtgt tgactcagac tctcacttag 900 gctctgctgt ttctcaccct tggatgatgg agccaaagag gggatgcttt gagattctgg 960 atcttgacat gcccatctta gaagccagtc aagctatgga actaatgcgg aggctgcttg 1020 ctgtgctggc tttgcaacaa gacagactgt ccccaagagt tcctgctgct gctgggggct 1080 gggcttccct agatgtcact ggacagctgc cccccatcct actcaggtct ctggagctcc 1140 tctcttcacc cctggaaaaa caaatgatct gttaacaaag gactgcccac ctccggaact 1200 tctgacctct gtttcctccg tcctgataag acgtccaccc cccagggcca ggtcccagct 1260 atgtagaccc ccgcccccac ctccaacact gcacccttct gccctgcccc cctcgtctca 1320 ccccctttac actcacattt ttatcaaata aagcatgttt tgttagtgca aaaaaaaaaa 1380 aaaaaaaaa 1389
<210> 2 <210> 2
<211> 7291 <211> 7291
<212> DNA <212> DNA
<213> Homo sapiens <213> Homo sapiens
<400> 2 <400> 2
acacagtact ctcagcttgt tggtggaagc ccctcatctg ccttcattct gaaggcaggg 60 cccggcagag gaaggatcag agggtcgcgg ccggagggtc ccggccggtg gggccaactc 120 agagggagag gaaagggcta gagacacgaa gaacgcaaac catcaaattt agaagaaaaa 180 gccctttgac tttttccccc tctccctccc caatggctgt gtagcaaaca tccctggcga 240 taccttggaa aggacgaagt tggtctgcag tcgcaatttc gtgggttgag ttcacagttg 300 tgagtgcggg gctcggagat ggagccgtgg tcctctaggt ggaaaacgaa acggtggctc 360 tgggatttca ccgtaacaac cctcgcattg accttcctct tccaagctag agaggtcaga 420 ggagctgctc cagttgatgt actaaaagca ctagattttc acaattctcc agagggaata 480 tcaaaaacaa cgggattttg cacaaacaga aagaattcta aaggctcaga tactgcttac 540 agagtttcaa agcaagcaca actcagtgcc ccaacaaaac agttatttcc aggtggaact 600 ttcccagaag acttttcaat actatttaca gtaaaaccaa aaaaaggaat tcagtctttc 660 cttttatcta tatataatga gcatggtatt cagcaaattg gtgttgaggt tgggagatca 720 cctgtttttc tgtttgaaga ccacactgga aaacctgccc cagaagacta tcccctcttc 780 agaactgtta acatcgctga cgggaagtgg catcgggtag caatcagcgt ggagaagaaa 840 actgtgacaa tgattgttga ttgtaagaag aaaaccacga aaccacttga tagaagtgag 900 agagcaattg ttgataccaa tggaatcacg gtttttggaa caaggatttt ggatgaagaa 960 gtttttgagg gggacattca gcagtttttg atcacaggtg atcccaaggc agcatatgac 1020 tactgtgagc attatagtcc agactgtgac tcttcagcac ccaaggctgc tcaagctcag 1080 gaacctcaga tagatgagta tgcaccagag gatataatcg aatatgacta tgagtatggg 1140 gaagcagagt ataaagaggc tgaaagtgta acagagggac ccactgtaac tgaggagaca 1200 atagcacaga cggaggcaaa catcgttgat gattttcaag aatacaacta tggaacaatg 1260 gaaagttacc agacagaagc tcctaggcat gtttctggga caaatgagcc aaatccagtt 1320 gaagaaatat ttactgaaga atatctaacg ggagaggatt atgattccca gaggaaaaat 1380 tctgaggata cactatatga aaacaaagaa atagacggca gggattctga tcttctggta 1440 gatggagatt taggcgaata tgatttttat gaatataaag aatatgaaga taaaccaaca 1500 agccccccta atgaagaatt tggtccaggt gtaccagcag aaactgatat tacagaaaca 1560 agcataaatg gccatggtgc atatggagag aaaggacaga aaggagaacc agcagtggtt 1620 gagcctggta tgcttgtcga aggaccacca ggaccagcag gacctgcagg tattatgggt 1680 cctccaggtc tacaaggccc cactggaccc cctggtgacc ctggcgatag gggcccccca 1740 ggacgtcctg gcttaccagg ggctgatggt ctacctggtc ctcctggtac tatgttgatg 1800 ttaccgttcc gttatggtgg tgatggttcc aaaggaccaa ccatctctgc tcaggaagct 1860 caggctcaag ctattcttca gcaggctcgg attgctctga gaggcccacc tggcccaatg 1920 ggtctaactg gaagaccagg tcctgtgggg gggcctggtt catctggggc caaaggtgag 1980 agtggtgatc caggtcctca gggccctcga ggcgtccagg gtccccctgg tccaacggga 2040 aaacctggaa aaaggggtcg tccaggtgca gatggaggaa gaggaatgcc aggagaacct 2100 ggggcaaagg gagatcgagg gtttgatgga cttccgggtc tgccaggtga caaaggtcac 2160 aggggtgaac gaggtcctca aggtcctcca ggtcctcctg gtgatgatgg aatgagggga 2220 gaagatggag aaattggacc aagaggtctt ccaggtgaag ctggcccacg aggtttgctg 2280 ggtccaaggg gaactccagg agctccaggg cagcctggta tggcaggtgt agatggcccc 2340 ccaggaccaa aagggaacat gggtccccaa ggggagcctg ggcctccagg tcaacaaggg 2400 aatccaggac ctcagggtct tcctggtcca caaggtccaa ttggtcctcc tggtgaaaaa 2460 ggaccacaag gaaaaccagg acttgctgga cttcctggtg ctgatgggcc tcctggtcat 2520 cctgggaaag aaggccagtc tggagaaaag ggggctctgg gtccccctgg tccacaaggt 2580 cctattggat acccgggccc ccggggagta aagggagcag atggtgtcag aggtctcaag 2640 ggatctaaag gtgaaaaggg tgaagatggt tttccaggat tcaaaggtga catgggtcta 2700 aaaggtgaca gaggagaagt tggtcaaatt ggcccaagag gggaagatgg ccctgaagga 2760 cccaaaggtc gagcaggccc aactggagac ccaggtcctt caggtcaagc aggagaaaag 2820 ggaaaacttg gagttccagg attaccagga tatccaggaa gacaaggtcc aaagggttcc 2880 actggattcc ctgggtttcc aggtgccaat ggagagaaag gtgcacgggg agtagctggc 2940 aaaccaggcc ctcggggtca gcgtggtcca acgggtcctc gaggttcaag aggtgcaaga 3000 ggtcccactg ggaaacctgg gccaaagggc acttcaggtg gcgatggccc tcctggccct 3060 ccaggtgaaa gaggtcctca aggacctcag ggtccagttg gattccctgg accaaaaggc 3120 cctcctggac cacctgggaa ggatgggctg ccaggacacc ctgggcaacg tggggagact 3180 ggatttcaag gcaagaccgg ccctcctggg ccagggggag tggttggacc acagggacca 3240 accggtgaga ctggtccaat aggggaacgt gggcatcctg gccctcctgg ccctcctggt 3300 gagcaaggtc ttcctggtgc tgcaggaaaa gaaggtgcaa agggtgatcc aggtcctcaa 3360 ggtatctcag ggaaagatgg accagcagga ttacgtggtt tcccagggga aagaggtctt 3420 cctggagctc agggtgcacc tggactgaaa ggaggggaag gtccccaggg cccaccaggt 3480 ccagttggct caccaggaga acgtgggtca gcaggtacag ctggcccaat tggtttacca 3540 gggcgcccgg gacctcaggg tcctcctggt ccagctggag agaaaggtgc tcctggagaa 3600 aaaggtcccc aagggcctgc agggagagat ggagttcaag gtcctgttgg tctcccaggg 3660 ccagctggtc ctgccggctc ccctggggaa gacggagaca agggtgaaat tggtgagccg 3720 ggacaaaaag gcagcaaggg tgacaaggga gaaaatggcc ctcccggtcc cccaggtctt 3780 caaggaccag ttggtgcccc tggaattgct ggaggtgatg gtgaaccagg tcctagagga 3840 cagcagggga tgtttgggca aaaaggtgat gagggtgcca gaggcttccc tggacctcct 3900 ggtccaatag gtcttcaggg tctgccaggc ccacctggtg aaaaaggtga aaatggggat 3960 gttggtccca tggggccacc tggtcctcca ggcccaagag gccctcaagg tcccaatgga 4020 gctgatggac cacaaggacc cccagggtct gttggttcag ttggtggtgt tggagaaaag 4080 ggtgaacctg gagaagcagg gaacccaggg cctcctgggg aagcaggtgt aggcggtccc 4140 aaaggagaaa gaggagagaa aggggaagct ggtccacctg gagctgctgg acctccaggt 4200 gccaaggggc caccaggtga tgatggccct aagggtaacc cgggtcctgt tggttttcct 4260 ggagatcctg gtcctcctgg ggaacctggc cctgcaggtc aagatggtgt tggtggtgac 4320 aagggtgaag atggagatcc tggtcaaccg ggtcctcctg gcccatctgg tgaggctggc 4380 ccaccaggtc ctcctggaaa acgaggtcct cctggagctg caggtgcaga gggaagacaa 4440 ggtgaaaaag gtgctaaggg ggaagcaggt gcagaaggtc ctcctggaaa aaccggccca 4500 gtcggtcctc agggacctgc aggaaagcct ggtccagaag gtcttcgggg catccctggt 4560 cctgtgggag aacaaggtct ccctggagct gcaggccaag atggaccacc tggtcctatg 4620 ggacctcctg gcttacctgg tctcaaaggt gaccctggct ccaagggtga aaagggacat 4680 cctggtttaa ttggcctgat tggtcctcca ggagaacaag gggaaaaagg tgaccgaggg 4740 ctccctggaa ctcaaggatc tccaggagca aaaggggatg ggggaattcc tggtcctgct 4800 ggtcccttag gtccacctgg tcctccaggt ttaccaggtc ctcaaggccc aaagggtaac 4860 aaaggctcta ctggacccgc tggccagaaa ggtgacagtg gtcttccagg gcctcctggg 4920 tctccaggtc cacctggtga agtcattcag cctttaccaa tcttgtcctc caaaaaaacg 4980 agaagacata ctgaaggcat gcaagcagat gcagatgata atattcttga ttactcggat 5040 ggaatggaag aaatatttgg ttccctcaat tccctgaaac aagacattga gcatatgaaa 5100 tttccaatgg gtactcagac caatccagcc cgaacttgta aagacctgca actcagccat 5160 cctgacttcc cagatggtga atattggatt gatcctaacc aaggttgctc aggagattcc 5220 ttcaaagttt actgtaattt cacatctggt ggtgagactt gcatttatcc agacaaaaaa 5280 tctgagggag taagaatttc atcatggcca aaggagaaac caggaagttg gtttagtgaa 5340 tttaagaggg gaaaactgct ttcatactta gatgttgaag gaaattccat caatatggtg 5400 caaatgacat tcctgaaact tctgactgcc tctgctcggc aaaatttcac ctaccactgt 5460 catcagtcag cagcctggta tgatgtgtca tcaggaagtt atgacaaagc acttcgcttc 5520 ctgggatcaa atgatgagga gatgtcctat gacaataatc cttttatcaa aacactgtat 5580 gatggttgtg cgtccagaaa aggctatgaa aagactgtca ttgaaatcaa tacaccaaaa 5640 attgatcaag tacctattgt tgatgtcatg atcaatgact ttggtgatca gaatcagaag 5700 ttcggatttg aagttggtcc tgtttgtttt cttggctaag attaagacaa agaacatatc 5760 aaatcaacag aaaatatacc ttggtgccac caacccattt tgtgccacat gcaagttttg 5820 aataaggatg gtatagaaaa caacgctgca tatacaggta ccatttagga aataccgatg 5880 cctttgtggg ggcagaatca catggcaaaa gctttgaaaa tcataaagat ataagttggt 5940 gtggctaaga tggaaacagg gctgattctt gattcccaat tctcaactct ccttttccta 6000 tttgaatttc tttggtgctg tagaaaacaa aaaaagaaaa atatatattc ataaaaaata 6060 tggtgctcat tctcatccat ccaggatgta ctaaaacagt gtgtttaata aattgtaatt 6120 attttgtgta cagttctata ctgttatctg tgtccatttc caaaacttgc acgtgtccct 6180 gaattccatc tgactctaat tttatgagaa ttgcagaact ctgatggcaa taaatatatg 6240 tattatgaaa aaataaagtt gtaatttctg atgactctaa gtccctttct ttggttaata 6300 ataaaatgcc tttgtatata ttgatgttga agagttcaat tatttgatgt cgccaacaaa 6360 attctcagag ggcaaaaatc tggaagactt ttggaagcac actctgatca actcttctct 6420 gccgacagtc attttgctga atttcagcca aaaatattat gcattttgat gctttattca 6480 aggctatacc tcaaactttt tcttctcaga atccaggatt tcacaggata cttgtatata 6540 tggaaaacaa gcaagtttat atttttggac agggaaatgt gtgtaagaaa gtatattaac 6600 aaatcaatgc ctccgtcaag caaacaatca tatgtatact ttttttctac gttatctcat 6660 ctccttgttt tcagtgtgct tcaataatgc aggttaatat taaagatgga aattaagcaa 6720 ttatttatga atttgtgcaa tgttagattt tcttatcaat caagttcttg aatttgattc 6780 taagttgcat attataacag tctcgaaaat tattttactt gcccaacaaa tattactttt 6840 ttcctttcaa gataatttta taaatcattt gacctaccta attgctaaat gaataacata 6900 tggtggactg ttattaagag tatttgtttt aagtcattca ggaaaatcta aacttttttt 6960 tccactaagg tatttacttt aaggtagctt gaaatagcaa tacaatttaa aaattaaaaa 7020 ctgaattttg tatctatttt aagtaatata tgtaagactt gaaaataaat gttttatttc 7080 ttatataaag tgttaaatta attgatacca gatttcactg gaacagtttc aactgataat 7140 ttatgacaaa agaacatacc tgtaatattg aaattaaaaa gtgaaatttg tcataaagaa 7200 tttcttttat ttttgaaatc gagtttgtaa atgtcctttt aagaagggag atatgaatcc 7260 aataaataaa ctcaagtctt ggctacctgg a 7291 acacagtact ctcagcttgt tggtggaagc ccctcatctg ccttcattct gaaggcaggg 60 cccggcagag gaaggatcag agggtcgcgg ccggagggtc ccggccggtg gggccaactc 120 agagggagag gaaagggcta gagacacgaa gaacgcaaac catcaaattt agaagaaaaa 180 gccctttgac tttttccccc tctccctccc caatggctgt gtagcaaaca tccctggcga 240 taccttggaa aggacgaagt tggtctgcag tcgcaatttc gtgggttgag ttcacagttg 300 tgagtgcggg gctcggagat ggagccgtgg tcctctaggt ggaaaacgaa acggtggctc 360 tgggatttca ccgtaacaac cctcgcattg accttcctct tccaagctag agaggtcaga 420 ggagctgctc cagttgatgt actaaaagca ctagattttc acaattctcc agagggaata 480 tcaaaaacaa cgggattttg cacaaacaga aagaattcta aaggctcaga tactgcttac 540 agagtttcaa agcaagcaca actcagtgcc ccaacaaaac agttatttcc aggtggaact 600 ttcccagaag acttttcaat actatttaca gtaaaaccaa aaaaaggaat tcagtctttc 660 cttttatcta tatataatga gcatggtatt cagcaaattg gtgttgaggt tgggagatca 720 cctgtttttc tgtttgaaga ccacactgga aaacctgccc cagaagacta tcccctcttc 780 agaactgtta acatcgctga cgggaagtgg catcgggtag caatcagcgt ggagaagaaa 840 actgtgacaa tgattgttga ttgtaagaag aaaaccacga aaccacttga tagaagtgag 900 agagcaattg ttgataccaa tggaatcacg gtttttggaa caaggatttt ggatgaagaa 960 gtttttgagg gggacattca gcagtttttg atcacaggtg atcccaaggc agcatatgac 1020 tactgtgagc attatagtcc agactgtgac tcttcagcac ccaaggctgc tcaagctcag 1080 gaacctcaga tagatgagta tgcaccagag gatataatcg aatatgacta tgagtatggg 1140 gaagcagagt ataaagaggc tgaaagtgta acagagggac ccactgtaac tgaggagaca 1200 atagcacaga cggaggcaaa catcgttgat gattttcaag aatacaacta tggaacaatg 1260 gaaagttacc agacagaagc tcctaggcat gtttctggga caaatgagcc aaatccagtt 1320 gaagaaatat ttactgaaga atatctaacg ggagaggatt atgattccca gaggaaaaat 1380 tctgaggata cactatatga aaacaaagaa atagacggca gggattctga tcttctggta 1440 gatggagatt taggcgaata tgatttttat gaatataaag aatatgaaga taaaccaaca 1500 agccccccta atgaagaatt tggtccaggt gtaccagcag aaactgatat tacagaaaca 1560 agcataaatg gccatggtgc atatggagag aaaggacaga aaggagaacc agcagtggtt 1620 gagcctggta tgcttgtcga aggaccacca ggaccagcag gacctgcagg tattatgggt 1680 cctccaggtc tacaaggccc cactgga ccc cctggtgacc ctggcgatag gggcccccca 1740 ggacgtcctg gcttaccagg ggctgatggt ctacctggtc ctcctggtac tatgttgatg 1800 ttaccgttcc gttatggtgg tgatggttcc aaaggaccaa ccatctctgc tcaggaagct 1860 caggctcaag ctattcttca gcaggctcgg attgctctga gaggcccacc tggcccaatg 1920 ggtctaactg gaagaccagg tcctgtgggg gggcctggtt catctggggc caaaggtgag 1980 agtggtgatc caggtcctca gggccctcga ggcgtccagg gtccccctgg tccaacggga 2040 aaacctggaa aaaggggtcg tccaggtgca gatggaggaa gaggaatgcc aggagaacct 2100 ggggcaaagg gagatcgagg gtttgatgga cttccgggtc tgccaggtga caaaggtcac 2160 aggggtgaac gaggtcctca aggtcctcca ggtcctcctg gtgatgatgg aatgagggga 2220 gaagatggag aaattggacc aagaggtctt ccaggtgaag ctggcccacg aggtttgctg 2280 ggtccaaggg gaactccagg agctccaggg cagcctggta tggcaggtgt agatggcccc 2340 ccaggaccaa aagggaacat gggtccccaa ggggagcctg ggcctccagg tcaacaaggg 2400 aatccaggac ctcagggtct tcctggtcca caaggtccaa ttggtcctcc tggtgaaaaa 2460 ggaccacaag gaaaaccagg acttgctgga cttcctggtg ctgatgggcc tcctggtcat 2520 cctgggaaag aaggccagtc tggagaaaag gg ggctctgg gtccccctgg tccacaaggt 2580 cctattggat acccgggccc ccggggagta aagggagcag atggtgtcag aggtctcaag 2640 ggatctaaag gtgaaaaggg tgaagatggt tttccaggat tcaaaggtga catgggtcta 2700 aaaggtgaca gaggagaagt tggtcaaatt ggcccaagag gggaagatgg ccctgaagga 2760 cccaaaggtc gagcaggccc aactggagac ccaggtcctt caggtcaagc aggagaaaag 2820 ggaaaacttg gagttccagg attaccagga tatccaggaa gacaaggtcc aaagggttcc 2880 actggattcc ctgggtttcc aggtgccaat ggagagaaag gtgcacgggg agtagctggc 2940 aaaccaggcc ctcggggtca gcgtggtcca acgggtcctc gaggttcaag aggtgcaaga 3000 ggtcccactg ggaaacctgg gccaaagggc acttcaggtg gcgatggccc tcctggccct 3060 ccaggtgaaa gaggtcctca aggacctcag ggtccagttg gattccctgg accaaaaggc 3120 cctcctggac cacctgggaa ggatgggctg ccaggacacc ctgggcaacg tggggagact 3180 ggatttcaag gcaagaccgg ccctcctggg ccagggggag tggttggacc acagggacca 3240 accggtgaga ctggtccaat aggggaacgt gggcatcctg gccctcctgg ccctcctggt 3300 gagcaaggtc ttcctggtgc tgcaggaaaa gaaggtgcaa agggtgatcc aggtcctcaa 3360 ggtatctcag ggaaagatgg accagcagga ttacgtgg tt tcccagggga aagaggtctt 3420 cctggagctc agggtgcacc tggactgaaa ggaggggaag gtccccaggg cccaccaggt 3480 ccagttggct caccaggaga acgtgggtca gcaggtacag ctggcccaat tggtttacca 3540 gggcgcccgg gacctcaggg tcctcctggt ccagctggag agaaaggtgc tcctggagaa 3600 aaaggtcccc aagggcctgc agggagagat ggagttcaag gtcctgttgg tctcccaggg 3660 ccagctggtc ctgccggctc ccctggggaa gacggagaca agggtgaaat tggtgagccg 3720 ggacaaaaag gcagcaaggg tgacaaggga gaaaatggcc ctcccggtcc cccaggtctt 3780 caaggaccag ttggtgcccc tggaattgct ggaggtgatg gtgaaccagg tcctagagga 3840 cagcagggga tgtttgggca aaaaggtgat gagggtgcca gaggcttccc tggacctcct 3900 ggtccaatag gtcttcaggg tctgccaggc ccacctggtg aaaaaggtga aaatggggat 3960 gttggtccca tggggccacc tggtcctcca ggcccaagag gccctcaagg tcccaatgga 4020 gctgatggac cacaaggacc cccagggtct gttggttcag ttggtggtgt tggagaaaag 4080 ggtgaacctg gagaagcagg gaacccaggg cctcctgggg aagcaggtgt aggcggtccc 4140 aaaggagaaa gaggagagaa aggggaagct ggtccacctg gagctgctgg acctccaggt 4200 gccaaggggc caccaggtga tgatggccct aagggtaacc cgg gtcctgt tggttttcct 4260 ggagatcctg gtcctcctgg ggaacctggc cctgcaggtc aagatggtgt tggtggtgac 4320 aagggtgaag atggagatcc tggtcaaccg ggtcctcctg gcccatctgg tgaggctggc 4380 ccaccaggtc ctcctggaaa acgaggtcct cctggagctg caggtgcaga gggaagacaa 4440 ggtgaaaaag gtgctaaggg ggaagcaggt gcagaaggtc ctcctggaaa aaccggccca 4500 gtcggtcctc agggacctgc aggaaagcct ggtccagaag gtcttcgggg catccctggt 4560 cctgtgggag aacaaggtct ccctggagct gcaggccaag atggaccacc tggtcctatg 4620 ggacctcctg gcttacctgg tctcaaaggt gaccctggct ccaagggtga aaagggacat 4680 cctggtttaa ttggcctgat tggtcctcca ggagaacaag gggaaaaagg tgaccgaggg 4740 ctccctggaa ctcaaggatc tccaggagca aaaggggatg ggggaattcc tggtcctgct 4800 ggtcccttag gtccacctgg tcctccaggt ttaccaggtc ctcaaggccc aaagggtaac 4860 aaaggctcta ctggacccgc tggccagaaa ggtgacagtg gtcttccagg gcctcctggg 4920 tctccaggtc cacctggtga agtcattcag cctttaccaa tcttgtcctc caaaaaaacg 4980 agaagacata ctgaaggcat gcaagcagat gcagatgata atattcttga ttactcggat 5040 ggaatggaag aaatatttgg ttccctcaat tccctgaaac aagacattg a gcatatgaaa 5100 tttccaatgg gtactcagac caatccagcc cgaacttgta aagacctgca actcagccat 5160 cctgacttcc cagatggtga atattggatt gatcctaacc aaggttgctc aggagattcc 5220 ttcaaagttt actgtaattt cacatctggt ggtgagactt gcatttatcc agacaaaaaa 5280 tctgagggag taagaatttc atcatggcca aaggagaaac caggaagttg gtttagtgaa 5340 tttaagaggg gaaaactgct ttcatactta gatgttgaag gaaattccat caatatggtg 5400 caaatgacat tcctgaaact tctgactgcc tctgctcggc aaaatttcac ctaccactgt 5460 catcagtcag cagcctggta tgatgtgtca tcaggaagtt atgacaaagc acttcgcttc 5520 ctgggatcaa atgatgagga gatgtcctat gacaataatc cttttatcaa aacactgtat 5580 gatggttgtg cgtccagaaa aggctatgaa aagactgtca ttgaaatcaa tacaccaaaa 5640 attgatcaag tacctattgt tgatgtcatg atcaatgact ttggtgatca gaatcagaag 5700 ttcggatttg aagttggtcc tgtttgtttt cttggctaag attaagacaa agaacatatc 5760 aaatcaacag aaaatatacc ttggtgccac caacccattt tgtgccacat gcaagttttg 5820 aataaggatg gtatagaaaa caacgctgca tatacaggta ccatttagga aataccgatg 5880 cctttgtggg ggcagaatca catggcaaaa gctttgaaaa tcataaagat ataa gttggt 5940 gtggctaaga tggaaacagg gctgattctt gattcccaat tctcaactct ccttttccta 6000 tttgaatttc tttggtgctg tagaaaacaa aaaaagaaaa atatatattc ataaaaaata 6060 tggtgctcat tctcatccat ccaggatgta ctaaaacagt gtgtttaata aattgtaatt 6120 attttgtgta cagttctata ctgttatctg tgtccatttc caaaacttgc acgtgtccct 6180 gaattccatc tgactctaat tttatgagaa ttgcagaact ctgatggcaa taaatatatg 6240 tattatgaaa aaataaagtt gtaatttctg atgactctaa gtccctttct ttggttaata 6300 ataaaatgcc tttgtatata ttgatgttga agagttcaat tatttgatgt cgccaacaaa 6360 attctcagag ggcaaaaatc tggaagactt ttggaagcac actctgatca actcttctct 6420 gccgacagtc attttgctga atttcagcca aaaatattat gcattttgat gctttattca 6480 aggctatacc tcaaactttt tcttctcaga atccaggatt tcacaggata cttgtatata 6540 tggaaaacaa gcaagtttat atttttggac agggaaatgt gtgtaagaaa gtatattaac 6600 aaatcaatgc ctccgtcaag caaacaatca tatgtatact ttttttctac gttatctcat 6660 ctccttgttt tcagtgtgct tcaataatgc aggttaatat taaagatgga aattaagcaa 6720 ttatttatga atttgtgcaa tgttagattt tcttatcaat caagttcttg aatttgattc 6780 taagttgcat attataacag tctcgaaaat tattttactt gcccaacaaa tattactttt 6840 ttcctttcaa gataatttta taaatcattt gacctaccta attgctaaat gaataacata 6900 tggtggactg ttattaagag tatttgtttt aagtcattca ggaaaatcta aacttttttt 6960 tccactaagg tatttacttt aaggtagctt gaaatagcaa tacaatttaa aaattaaaaa 7020 ctgaattttg tatctatttt aagtaatata tgtaagactt gaaaataaat gttttatttc 7080 ttatataaag tgttaaatta attgatacca gatttcactg gaacagtttc aactgataat 7140 ttatgacaaa agaacatacc tgtaatattg aaattaaaaa gtgaaatttg tcataaagaa 7200 tttcttttat ttttgaaatc gagtttgtaa atgtcctttt aagaagggag atatgaatcc 7260 aataaataaa ctcaagtctt ggctacctgg a 7291
<210> 3 <210> 3
<211> 3995 <211> 3995
<212> DNA <212> DNA
<213> Homo sapiens <213> Homo sapiens
<400> 3 <400> 3
cgaattttca aagaacatat tttccgttca cccccgctgg tcttttactg ccatcaatac 60 actgttcttg gtgcaaatac ttcagcctct ttattcaaag tatgttttat gtttttgcca 120 aatatgatct ctaattgaaa gtttattttt ggttttggat gaatctgcgg agcttaagtt 180 gtgagaagaa agggggaaca agacacaatg aaagaaaagt ccaaaaatgc tgcgcggact 240 aggagggaga aggaaaacag tgaattttat gaactggcta aattactgcc tttgccctcg 300 gctatcacct cgcagctgga caaagcatcc ataatcagac tcacgaccag ctatctcaaa 360 atgagagtgg tgttcccaga agggctcggc gaggcgtggg gccactcaag tcggaccagc 420 cccctggaca acgttggccg agaactgggc tcccatctgc tccagaccct ggatggcttc 480 atcttcgtgg tagccccaga tgggaagatc atgtacatct cagagacagc ctcagtccac 540 ttgggtcttt ctcaggtaga gctgaccgga aacagcattt atgaatacat tcacccggca 600 gaccacgacg agatgacggc ggtgctcacc gcccatcaac cctaccactc tcacttcgtg 660 caggagtatg agatcgagcg ctccttcttc ctgaggatga agtgcgtctt ggccaagcgt 720 aacgccggcc tcacctgtgg cggctacaag gtcatccact gcagcggcta cttgaagatc 780 cgccagtaca gcctggacat gtcccccttc gacggctgct accaaaacgt gggcctggtg 840 gccgtgggcc actcgctgcc tcccagcgcc gtcacggaga tcaagctaca cagcaatatg 900 tttatgttcc gcgccagcct ggacatgaag ctcatctttc tggactccag ggtggcggag 960 ctgacggggt acgaacctca ggacctgatt gagaagactc tgtaccacca tgtgcacggc 1020 tgcgacacct tccacctgcg ctgcgcgcac catttgctgc tggtgaaggg acaggtgacc 1080 accaagtact acaggttcct ggcgaaacac ggcggctggg tatgggtgca gagctacgcg 1140 accatcgtgc acaacagtcg ctcctccagg ccacactgta tcgtcagcgt caactatgtc 1200 ctcacagaca cagaatacaa agggctgcag ctctccctgg atcagatctc agcctccaaa 1260 ccagccttct cctataccag cagctccacc cccaccatga ctgacaacag aaagggggcc 1320 aaatcccggc tctccagctc aaagtcaaaa tccaggactt ccccataccc tcagtattcg 1380 ggatttcaca cagaaagatc ggaatctgat catgacagcc agtggggcgg aagtcccttg 1440 accgacacgg cctctccgca gcttctggac cccgccgata ggcctggctc ccagcacgac 1500 gcatcgtgcg cctacagaca gttttcggac cgcagctctc tctgctatgg ctttgcgctt 1560 gaccactcga ggctggtgga agagaggcat ttccataccc aggcctgtga aggaggccga 1620 tgtgaggcag gcaggtactt cctgggaacg ccgcaggccg ggagggagcc ctggtggggc 1680 tctcgcgcag ccttgcccct gacaaaggcc tccccagaaa gcagagaagc ctatgaaaac 1740 agcatgcctc acatcgcttc agtccacagg atccatgggc gaggtcattg ggatgaagat 1800 agtgtggtca gttctccaga ccctgggtcg gccagtgaat caggtgaccg atatcgtact 1860 gagcagtatc aaagtagccc acatgaaccc agcaaaattg aaactcttat aagagccact 1920 cagcaaatga ttaaagaaga agagaacaga ttacagctaa ggaaagcccc ctcagaccaa 1980 ctggcttcca ttaatggggc tgggaaaaaa cactccctgt gttttgcaaa ctaccaacag 2040 cccccaccaa caggtgaagt ctgccatggc tctgctcttg ccaacacttc accatgtgac 2100 catatccagc agagagaggg aaaaatgttg agcccccatg aaaatgacta tgacaacagt 2160 cccaccgcac tatctcggat aagtagtccc aattcggatc gcatttcaaa atccagtttg 2220 atcctagcta aagactatct gcattcggat atatctcctc atcagacagc aggagaccac 2280 cctactgtct ctccaaactg ctttggctct caccggcagt attttgacaa gcatgcttac 2340 acattaactg gatatgccct ggagcactta tatgacagcg aaaccattag aaactattcc 2400 ttgggctgta atggctcaca ctttgatgta acttcccatc tgaggatgca accagaccca 2460 gcacaaggac acaagggaac atctgttata ataaccaacg gaagctgatg ttttgctgaa 2520 atattttgtt ctttaaggat ctctgaaaca tatttatagt ttaatacccc attaccagca 2580 tttactatgc cacagattgt tagagagtat aacttaagtt actgggtatt tgatacgtgt 2640 tcctataaaa tcaaagaaaa catagcacta gcattcaggg ttatacacag aaaagggagc 2700 taaattgaat acacaaattt cccctctaat tatatgggaa ccagaataga taaattttga 2760 cttgaaaaat attcatgtag atcaagtgtg catatatact acatgagagg actgatgaat 2820 gacaacattg cattgtgact atccagtgat cctcaaacac acaaactatt acttacaaac 2880 tgcggtatac attttacata tggaaatata ggctatgtaa tgtaaataca tcaaaaatgg 2940 gtaattttct ttgactctgt cacactaaac ttcttaacga aatttccatt cccaaaataa 3000 ctgagaaaga gagagataca tcttataaac tgacttcttt gtggtttcaa atcagccagc 3060 tcatttggtt caggcataaa ttagagaaat ggttctggat atggtgcaaa aatgagtttt 3120 cacctggtat ccattataaa caatcaggaa gaggtaattt ttcaccttgc ttttcagtta 3180 gacaaggacc aggattgcac tgacatggcg ctgagggttt ttctaagtaa gaacactgag 3240 atattgggac acacatcaaa aacctggagt gctcaattgg aagtagttct atgaatatgg 3300 aaaggccaga ggcagagtga aataaaatgc tatctcaaag tttaacacaa tttaagggct 3360 cagcataagt aaacaacata tttggggttt gcttgtaaaa ccaactaaat aaaaaattca 3420 aaccaattca cccagaaaaa agaccaatag gtgcaaaaat aaaaggaaaa ccagtgaagt 3480 gccacatgac agcagtgtta agtgtttgaa aacgtttcaa agcacatatg tgccaatgtg 3540 acaacatgtg gaaagcctca ggagagagtc taagataaaa gcttaggctg atagacaagt 3600 agttaagagc taagagcagt actctgaagg aataggcaaa atgtttattt tccttattgt 3660 ttgtaaacaa caaacttggt cttacatctg tgtggtatag tagaaaggcc agctgactag 3720 atctctggat tctaattttg gccctacctg taacttaatt ttgtgaccac agttgtacca 3780 ttcaccgtgc ctgggctcta gtttcctggt ttgtaaggca gccccagcgt tcatgttctg 3840 tgatagagca gaactgaact tattacctaa ttaactctct gctatgagtt gtcaagactg 3900 atcattctgt tttttctgta cacagaagtt tagatgcttt gtgacttaag caggtgtgtg 3960 ggctccttta ggcaggttac agttaacttt ctaga 3995 cgaattttca aagaacatat tttccgttca cccccgctgg tcttttactg ccatcaatac 60 actgttcttg gtgcaaatac ttcagcctct ttattcaaag tatgttttat gtttttgcca 120 aatatgatct ctaattgaaa gtttattttt ggttttggat gaatctgcgg agcttaagtt 180 gtgagaagaa agggggaaca agacacaatg aaagaaaagt ccaaaaatgc tgcgcggact 240 aggagggaga aggaaaacag tgaattttat gaactggcta aattactgcc tttgccctcg 300 gctatcacct cgcagctgga caaagcatcc ataatcagac tcacgaccag ctatctcaaa 360 atgagagtgg tgttcccaga agggctcggc gaggcgtggg gccactcaag tcggaccagc 420 cccctggaca acgttggccg agaactgggc tcccatctgc tccagaccct ggatggcttc 480 atcttcgtgg tagccccaga tgggaagatc atgtacatct cagagacagc ctcagtccac 540 ttgggtcttt ctcaggtaga gctgaccgga aacagcattt atgaatacat tcacccggca 600 gaccacgacg agatgacggc ggtgctcacc gcccatcaac cctaccactc tcacttcgtg 660 caggagtatg agatcgagcg ctccttcttc ctgaggatga agtgcgtctt ggccaagcgt 720 aacgccggcc tcacctgtgg cggctacaag gtcatccact gcagcggcta cttgaagatc 780 cgccagtaca gcctggacat gtcccccttc gacggctgct accaaaacgt gggcctggtg 840 gccgtgggcc actcgctgcc tcccagcgcc gtcacggaga tcaagctaca cagcaatatg 900 tttatgttcc gcgccagcct ggacatgaag ctcatctttc tggactccag ggtggcggag 960 ctgacggggt acgaacctca ggacctgatt gagaagactc tgtaccacca tgtgcacggc 1020 tgcgacacct tccacctgcg ctgcgcgcac catttgctgc tggtgaaggg acaggtgacc 1080 accaagtact acaggttcct ggcgaaacac ggcggctggg tatgggtgca gagctacgcg 1140 accatcgtgc acaacagtcg ctcctccagg ccacactgta tcgtcagcgt caactatgtc 1200 ctcacagaca cagaatacaa agggctgcag ctctccctgg atcagatctc agcctccaaa 1260 ccagccttct cctataccag cagctccacc cccaccatga ctgacaacag aaagggggcc 1320 aaatcccggc tctccagctc aaagtcaaaa tccaggactt ccccataccc tcagtattcg 1380 ggatttcaca cagaaagatc ggaatctgat catgacagcc agtggggcgg aagtcccttg 1440 accgacacgg cctctccgca gcttctggac cccgccgata ggcctggctc ccagcacgac 1500 gcatcgtgcg cctacagaca gttttcggac cgcagctctc tctgctatgg ctttgcgctt 1560 gaccactcga ggctggtgga agagaggcat ttccataccc aggcctgtga aggaggccga 1620 tgtgaggcag gcaggtactt cctgggaacg ccgcaggccg ggagggagcc ctggtggggc 1680 tctcgcgcag ccttgcccct gacaaag gcc tccccagaaa gcagagaagc ctatgaaaac 1740 agcatgcctc acatcgcttc agtccacagg atccatgggc gaggtcattg ggatgaagat 1800 agtgtggtca gttctccaga ccctgggtcg gccagtgaat caggtgaccg atatcgtact 1860 gagcagtatc aaagtagccc acatgaaccc agcaaaattg aaactcttat aagagccact 1920 cagcaaatga ttaaagaaga agagaacaga ttacagctaa ggaaagcccc ctcagaccaa 1980 ctggcttcca ttaatggggc tgggaaaaaa cactccctgt gttttgcaaa ctaccaacag 2040 cccccaccaa caggtgaagt ctgccatggc tctgctcttg ccaacacttc accatgtgac 2100 catatccagc agagagaggg aaaaatgttg agcccccatg aaaatgacta tgacaacagt 2160 cccaccgcac tatctcggat aagtagtccc aattcggatc gcatttcaaa atccagtttg 2220 atcctagcta aagactatct gcattcggat atatctcctc atcagacagc aggagaccac 2280 cctactgtct ctccaaactg ctttggctct caccggcagt attttgacaa gcatgcttac 2340 acattaactg gatatgccct ggagcactta tatgacagcg aaaccattag aaactattcc 2400 ttgggctgta atggctcaca ctttgatgta acttcccatc tgaggatgca accagaccca 2460 gcacaaggac acaagggaac atctgttata ataaccaacg gaagctgatg ttttgctgaa 2520 atattttgtt ctttaaggat ctctgaaaca ta tttatagt ttaatacccc attaccagca 2580 tttactatgc cacagattgt tagagagtat aacttaagtt actgggtatt tgatacgtgt 2640 tcctataaaa tcaaagaaaa catagcacta gcattcaggg ttatacacag aaaagggagc 2700 taaattgaat acacaaattt cccctctaat tatatgggaa ccagaataga taaattttga 2760 cttgaaaaat attcatgtag atcaagtgtg catatatact acatgagagg actgatgaat 2820 gacaacattg cattgtgact atccagtgat cctcaaacac acaaactatt acttacaaac 2880 tgcggtatac attttacata tggaaatata ggctatgtaa tgtaaataca tcaaaaatgg 2940 gtaattttct ttgactctgt cacactaaac ttcttaacga aatttccatt cccaaaataa 3000 ctgagaaaga gagagataca tcttataaac tgacttcttt gtggtttcaa atcagccagc 3060 tcatttggtt caggcataaa ttagagaaat ggttctggat atggtgcaaa aatgagtttt 3120 cacctggtat ccattataaa caatcaggaa gaggtaattt ttcaccttgc ttttcagtta 3180 gacaaggacc aggattgcac tgacatggcg ctgagggttt ttctaagtaa gaacactgag 3240 atattgggac acacatcaaa aacctggagt gctcaattgg aagtagttct atgaatatgg 3300 aaaggccaga ggcagagtga aataaaatgc tatctcaaag tttaacacaa tttaagggct 3360 cagcataagt aaacaacata tttggggttt gcttgtaa aa ccaactaaat aaaaaattca 3420 aaccaattca cccagaaaaa agaccaatag gtgcaaaaat aaaaggaaaa ccagtgaagt 3480 gccacatgac agcagtgtta agtgtttgaa aacgtttcaa agcacatatg tgccaatgtg 3540 acaacatgtg gaaagcctca ggagagagtc taagataaaa gcttaggctg atagacaagt 3600 agttaagagc taagagcagt actctgaagg aataggcaaa atgtttattt tccttattgt 3660 ttgtaaacaa caaacttggt cttacatctg tgtggtatag tagaaaggcc agctgactag 3720 atctctggat tctaattttg gccctacctg taacttaatt ttgtgaccac agttgtacca 3780 ttcaccgtgc ctgggctcta gtttcctggt ttgtaaggca gccccagcgt tcatgttctg 3840 tgatagagca gaactgaact tattacctaa ttaactctct gctatgagtt gtcaagactg 3900 atcattctgt tttttctgta cacagaagtt tagatgcttt gtgacttaag caggtgtgtgttg 3960 ggctccttacta ggtagataggc 3960 ggctccttacta ggtaggc
<210> 4 <210> 4
<212> DNA <212> DNA
<213> Homo sapiens <213> Homo sapiens
<400> 4 <400> 4
gcgctttgaa atcccatccc ggctttgttg tctccctccc aaggggccgg aatgctccgg 60 gcccgcggta taaaggcagc cgcggtggcg gtggcggcgc agagctctgt gctccctgca 120 gtcaggactc tgggaccgca gggggctccc ggaccctgac tctgcagccg aaccggcacg 180 gtttcgtggg gacccaggct tgcaaagtga cggtcatttt ctctttcttt ctccctcttg 240 agtccttctg agatgatggc tctgggcgca gcgggagcta cccgggtctt tgtcgcgatg 300 gtagcggcgg ctctcggcgg ccaccctctg ctgggagtga gcgccacctt gaactcggtt 360 ctcaattcca acgctatcaa gaacctgccc ccaccgctgg gcggcgctgc ggggcaccca 420 ggctctgcag tcagcgccgc gccgggaatc ctgtacccgg gcgggaataa gtaccagacc 480 attgacaact accagccgta cccgtgcgca gaggacgagg agtgcggcac tgatgagtac 540 tgcgctagtc ccacccgcgg aggggacgca ggcgtgcaaa tctgtctcgc ctgcaggaag 600 cgccgaaaac gctgcatgcg tcacgctatg tgctgccccg ggaattactg caaaaatgga 660 atatgtgtgt cttctgatca aaatcatttc cgaggagaaa ttgaggaaac catcactgaa 720 agctttggta atgatcatag caccttggat gggtattcca gaagaaccac cttgtcttca 780 aaaatgtatc acaccaaagg acaagaaggt tctgtttgtc tccggtcatc agactgtgcc 840 tcaggattgt gttgtgctag acacttctgg tccaagatct gtaaacctgt cctgaaagaa 900 ggtcaagtgt gtaccaagca taggagaaaa ggctctcatg gactagaaat attccagcgt 960 tgttactgtg gagaaggtct gtcttgccgg atacagaaag atcaccatca agccagtaat 1020 tcttctaggc ttcacacttg tcagagacac taaaccagct atccaaatgc agtgaactcc 1080 ttttatataa tagatgctat gaaaaccttt tatgaccttc atcaactcaa tcctaaggat 1140 atacaagttc tgtggtttca gttaagcatt ccaataacac cttccaaaaa cctggagtgt 1200 aagagctttg tttctttatg gaactcccct gtgattgcag taaattactg tattgtaaat 1260 tctcagtgtg gcacttacct gtaaatgcaa tgaaactttt aattattttt ctaaaggtgc 1320 tgcactgcct atttttcctc ttgttatgta aatttttgta cacattgatt gttatcttga 1380 ctgacaaata ttctatattg aactgaagta aatcatttca gcttatagtt cttaaaagca 1440 taacccttta ccccatttaa ttctagagtc tagaacgcaa ggatctcttg gaatgacaaa 1500 tgataggtac ctaaaatgta acatgaaaat actagcttat tttctgaaat gtactatctt 1560 aatgcttaaa ttatatttcc ctttaggctg tgatagtttt tgaaataaaa tttaacattt 1620 aatatcatga aatgttataa gtagacatac attttgggat tgtgatctta gaggtttgtg 1680 tgtgtgtacg tatgtgtgtg ttctacaaga acggaagtgt gatatgttta aagatgatca 1740 gagaaaagac agtgtctaaa tataagacaa tattgatcag ctctagaata actttaaaga 1800 aagacgtgtt ctgcattgat aaactcaaat gatcatggca gaatgagagt gaatcttaca 1860 ttactacttt caaaaatagt ttccaataaa ttaataatac ctaaaaaaaa aaa 1913 gcgctttgaa atcccatccc ggctttgttg tctccctccc aaggggccgg aatgctccgg 60 gcccgcggta taaaggcagc cgcggtggcg gtggcggcgc agagctctgt gctccctgca 120 gtcaggactc tgggaccgca gggggctccc ggaccctgac tctgcagccg aaccggcacg 180 gtttcgtggg gacccaggct tgcaaagtga cggtcatttt ctctttcttt ctccctcttg 240 agtccttctg agatgatggc tctgggcgca gcgggagcta cccgggtctt tgtcgcgatg 300 gtagcggcgg ctctcggcgg ccaccctctg ctgggagtga gcgccacctt gaactcggtt 360 ctcaattcca acgctatcaa gaacctgccc ccaccgctgg gcggcgctgc ggggcaccca 420 ggctctgcag tcagcgccgc gccgggaatc ctgtacccgg gcgggaataa gtaccagacc 480 attgacaact accagccgta cccgtgcgca gaggacgagg agtgcggcac tgatgagtac 540 tgcgctagtc ccacccgcgg aggggacgca ggcgtgcaaa tctgtctcgc ctgcaggaag 600 cgccgaaaac gctgcatgcg tcacgctatg tgctgccccg ggaattactg caaaaatgga 660 atatgtgtgt cttctgatca aaatcatttc cgaggagaaa ttgaggaaac catcactgaa 720 agctttggta atgatcatag caccttggat gggtattcca gaagaaccac cttgtcttca 780 aaaatgtatc acaccaaagg acaagaaggt tctgtttgtc tccggtcatc agactgtgcc 840 tcaggattgt gttgtgctag acacttctgg tccaagatct gtaaacctgt cctgaaagaa 900 ggtcaagtgt gtaccaagca taggagaaaa ggctctcatg gactagaaat attccagcgt 960 tgttactgtg gagaaggtct gtcttgccgg atacagaaag atcaccatca agccagtaat 1020 tcttctaggc ttcacacttg tcagagacac taaaccagct atccaaatgc agtgaactcc 1080 ttttatataa tagatgctat gaaaaccttt tatgaccttc atcaactcaa tcctaaggat 1140 atacaagttc tgtggtttca gttaagcatt ccaataacac cttccaaaaa cctggagtgt 1200 aagagctttg tttctttatg gaactcccct gtgattgcag taaattactg tattgtaaat 1260 tctcagtgtg gcacttacct gtaaatgcaa tgaaactttt aattattttt ctaaaggtgc 1320 tgcactgcct atttttcctc ttgttatgta aatttttgta cacattgatt gttatcttga 1380 ctgacaaata ttctatattg aactgaagta aatcatttca gcttatagtt cttaaaagca 1440 taacccttta ccccatttaa ttctagagtc tagaacgcaa ggatctcttg gaatgacaaa 1500 tgataggtac ctaaaatgta acatgaaaat actagcttat tttctgaaat gtactatctt 1560 aatgcttaaa ttatatttcc ctttaggctg tgatagtttt tgaaataaaa tttaacattt 1620 aatatcatga aatgttataa gtagacatac attttgggat tgtgatctta gaggtttgtg 1680 tgtgtgtacg tatgtgtgtg ttctaca aga acggaagtgt gatatgttta aagatgatca 1740 gagaaaagac agtgtctaaa tataagacaa tattgatcag ctctagaata actttaaaga 1800 aagacgtgtt ctgcattgat aaactcaaat gatcatggca gaatgagttagt aaatcttataa ctctacataa tataaacta 1860
<210> 5 <210> 5
<211> 3335 <211> 3335
<212> DNA <212> DNA
<213> Homo sapiens <213> Homo sapiens
<400> 5 <400> 5
ctagcggacg ctgccacctg atcccatcca ggcgctgaga ggcaagccac cggcactcgg 60 cgcgcagggc cccgggcacc ctcggacccc gtgcggattt cgagacaagg gtccagagat 120 agcggcggcc gcccccaacc cgggccagag gcagagccca ggcgtgcggt gcttctcggg 180 aacactggcc ctgctaccag agccgcagga tcccccatga gtgcctgtga agggagtcct 240 ggatagtgag gccctggctg gtccaggggt gggtactccg ccatgccacc catcctccag 300 cgcctgcagc aggccaccaa gatgatgagc cgccggaaaa tcctgctgct ggtgctaggg 360 tgcagcaccg taagccttct catccaccag ggggcgcagc tcagctggta ccccaagctg 420 ttccccttga gctgccctcc tctgcggaac tcgccgccgc gccccaagca catgactgtg 480 gccttcctga agactcacaa gacggcaggc acgacggtgc agaacatcct gtttcgcttt 540 gccgagcgcc acaacctgac ggtggccctg ccgcacccga gctgcgagca ccagttctgc 600 tacccccgca acttctcggc gcacttcgtg cacccggcca cgcggccgcc gcacgtgctg 660 gccagccacc tgcgcttcga ccgtgcggag ctggagcgcc tcatgccgcc cagcaccgtc 720 tatgtcacca tcctgcgcga gccggccgcc atgttcgagt cgctcttcag ctactacaac 780 cagtactgcc cggccttccg gcgcgtaccc aacgcctcgc tcgaggcctt cctgcgcgcg 840 cccgaggcat actaccgcgc tggcgagcac ttcgccatgt tcgcacacaa cacgctggcc 900 tacgacctgg gcggcgacaa tgagcgcagc ccgcgcgacg acgccgccta cctggcgggc 960 ctcatccgcc aggtggagga ggttttctcg ctcgtcatga tcgccgagta cttcgacgag 1020 tcgctagtgc tgctgcggcg cctactggcc tgggacctgg acgacgtgct ctacgccaag 1080 ctcaacgcgc gcgccgccag ctcgcgcctg gccgccatcc ccgcggcgct ggcgcgggcg 1140 gcgcgcacct ggaacgccct ggacgccggc ctctacgacc acttcaacgc caccttctgg 1200 cgccacgtgg cgcgcgcggg ccgcgcgtgc gtggagcgcg aggcgcgcga gctgcgcgag 1260 gcccgccagc gcctactgcg gcgctgcttc ggggacgagc cactgctgcg gcctgccgcg 1320 cagatccgca ccaagcagct gcagccgtgg cagcccagcc gcaaggtgga cattatgggc 1380 tatgacctgc ccggcggcgg cgccggcccg gccaccgagg cctgcctcaa gctggccatg 1440 cccgaggtcc agtactcgaa ctacctgttg cgcaagcaga agcgccgggg cggtgcgcgg 1500 gctcggcccg agcccgtcct ggacaatccc ccgcctcggc ccatccgagt gctgcctcgc 1560 ggccctcaag gtccctgagc tccgcaggcc tccaggagta tgggtcctgc cctgccctcc 1620 caggagtcca gccaggggct gcctccatgt cctgccccca tgagagccct ttcttttcaa 1680 ggggcttgag tccctcgctg aactttgggg tacgcctccc tatctgagcc cccgctcccg 1740 atcgtgggct gaaccttgcc ttcgggcacg gcgctgatag gctggatcca gcgctgggcc 1800 accctctcct tcccagctgg ttggtgattt tgacaagagg cctgagggct gtccagaact 1860 gctcaccctg cctttttctg ggggctgagc ccccacaacc tgtcttttcc aagtggataa 1920 gcccaggccc catggagaga ggctcagccc tagggaaagg ctgctcttgc tctccctccc 1980 agggcttgag gctccctata cttttctgtc ttccaggttc taccctcccc acagacctaa 2040 gctgctccca gtctcccaga accaagagtt cttgggttgg gatttttgtt tggttggttt 2100 ttcttttttt taataaaaca tctttaaaat gtgcgagttg tatgtcagtc atagaagccg 2160 gattccccac atttgaggat gaaacggatg cccaggagga gctgtggctg gacccatgag 2220 gatgggggcc tggactcaag gtcccccgcc acctggcata ggcctacaac cctcctcaac 2280 cgcttgaccg actgggcccc caacacagac tggccacctt agccccactc actttccaca 2340 tcgttctttt tcacaaaatt ctgcgacatt tactgagcat catctctgta acttggggat 2400 acaggtgaac cagatagata gggccttatt ttcatggagc ttatggggaa acattggaac 2460 aggagcaaga caggtaagca aagaggaagg actggagggg ttagagcaca acacctcttt 2520 gggtggaagg aggatcattt tcagataagg agctgcctaa ctcagaattt gacctcctcc 2580 tgtgaatcac tgacttcact aatggcctgc tttcctcaca agaagacaac ttggactcag 2640 atgaagacca gtccctttct gcagtttcct ttaacacagc aagatagcca gcacttccag 2700 gtgaaatctc ccttaagact agcagagaat tggaactttt tctgttactt tcttcagagc 2760 tatatcgacg tatctggaca agtgataaaa aacaatagct attatggtct acggtgtgcc 2820 agccactgtt taagggcttt atgtatgtta actttggaga tgggtgccgt tttccaggtg 2880 agggaactga aacttggggg aaattaagtc agggtcacac agccagaaag tggccactag 2940 gatctcaacc aaggcagtgt ggcttcagag tttgtgctca caagtgctgt ctgtagtgcc 3000 attcaaataa aaaaggtcac atgccacaag ctaggctgat tcccagcacc ttcccccaaa 3060 agaatgtaaa gggaagaacc aggacactgc tatcagcaac tctttatcca gtttttcatg 3120 gagacaacgg ttgacctgtt tggacaatgg acattacagt gattgattgg caggatgcat 3180 gacacgctgg gcttgaaaag ggactgtttg tatggcaccc cactttttac cagcagagct 3240 ctgaaaaagg caataattag aatcatgcaa aagcataatt aagtttctga cattaagcag 3300 cagtaaagaa ttgtaccaga aaaaaaaaaa aaaaa 3335 ctagcggacg ctgccacctg atcccatcca ggcgctgaga ggcaagccac cggcactcgg 60 cgcgcagggc cccgggcacc ctcggacccc gtgcggattt cgagacaagg gtccagagat 120 agcggcggcc gcccccaacc cgggccagag gcagagccca ggcgtgcggt gcttctcggg 180 aacactggcc ctgctaccag agccgcagga tcccccatga gtgcctgtga agggagtcct 240 ggatagtgag gccctggctg gtccaggggt gggtactccg ccatgccacc catcctccag 300 cgcctgcagc aggccaccaa gatgatgagc cgccggaaaa tcctgctgct ggtgctaggg 360 tgcagcaccg taagccttct catccaccag ggggcgcagc tcagctggta ccccaagctg 420 ttccccttga gctgccctcc tctgcggaac tcgccgccgc gccccaagca catgactgtg 480 gccttcctga agactcacaa gacggcaggc acgacggtgc agaacatcct gtttcgcttt 540 gccgagcgcc acaacctgac ggtggccctg ccgcacccga gctgcgagca ccagttctgc 600 tacccccgca acttctcggc gcacttcgtg cacccggcca cgcggccgcc gcacgtgctg 660 gccagccacc tgcgcttcga ccgtgcggag ctggagcgcc tcatgccgcc cagcaccgtc 720 tatgtcacca tcctgcgcga gccggccgcc atgttcgagt cgctcttcag ctactacaac 780 cagtactgcc cggccttccg gcgcgtaccc aacgcctcgc tcgaggcctt cctgcgcgcg 840 cccgaggcat actaccgcgc tggcgagcac ttcgccatgt tcgcacacaa cacgctggcc 900 tacgacctgg gcggcgacaa tgagcgcagc ccgcgcgacg acgccgccta cctggcgggc 960 ctcatccgcc aggtggagga ggttttctcg ctcgtcatga tcgccgagta cttcgacgag 1020 tcgctagtgc tgctgcggcg cctactggcc tgggacctgg acgacgtgct ctacgccaag 1080 ctcaacgcgc gcgccgccag ctcgcgcctg gccgccatcc ccgcggcgct ggcgcgggcg 1140 gcgcgcacct ggaacgccct ggacgccggc ctctacgacc acttcaacgc caccttctgg 1200 cgccacgtgg cgcgcgcggg ccgcgcgtgc gtggagcgcg aggcgcgcga gctgcgcgag 1260 gcccgccagc gcctactgcg gcgctgcttc ggggacgagc cactgctgcg gcctgccgcg 1320 cagatccgca ccaagcagct gcagccgtgg cagcccagcc gcaaggtgga cattatgggc 1380 tatgacctgc ccggcggcgg cgccggcccg gccaccgagg cctgcctcaa gctggccatg 1440 cccgaggtcc agtactcgaa ctacctgttg cgcaagcaga agcgccgggg cggtgcgcgg 1500 gctcggcccg agcccgtcct ggacaatccc ccgcctcggc ccatccgagt gctgcctcgc 1560 ggccctcaag gtccctgagc tccgcaggcc tccaggagta tgggtcctgc cctgccctcc 1620 caggagtcca gccaggggct gcctccatgt cctgccccca tgagagccct ttcttttcaa 1680 ggggcttgag tccctcgctg aactttg ggg tacgcctccc tatctgagcc cccgctcccg 1740 atcgtgggct gaaccttgcc ttcgggcacg gcgctgatag gctggatcca gcgctgggcc 1800 accctctcct tcccagctgg ttggtgattt tgacaagagg cctgagggct gtccagaact 1860 gctcaccctg cctttttctg ggggctgagc ccccacaacc tgtcttttcc aagtggataa 1920 gcccaggccc catggagaga ggctcagccc tagggaaagg ctgctcttgc tctccctccc 1980 agggcttgag gctccctata cttttctgtc ttccaggttc taccctcccc acagacctaa 2040 gctgctccca gtctcccaga accaagagtt cttgggttgg gatttttgtt tggttggttt 2100 ttcttttttt taataaaaca tctttaaaat gtgcgagttg tatgtcagtc atagaagccg 2160 gattccccac atttgaggat gaaacggatg cccaggagga gctgtggctg gacccatgag 2220 gatgggggcc tggactcaag gtcccccgcc acctggcata ggcctacaac cctcctcaac 2280 cgcttgaccg actgggcccc caacacagac tggccacctt agccccactc actttccaca 2340 tcgttctttt tcacaaaatt ctgcgacatt tactgagcat catctctgta acttggggat 2400 acaggtgaac cagatagata gggccttatt ttcatggagc ttatggggaa acattggaac 2460 aggagcaaga caggtaagca aagaggaagg actggagggg ttagagcaca acacctcttt 2520 gggtggaagg aggatcattt tcagataagg ag ctgcctaa ctcagaattt gacctcctcc 2580 tgtgaatcac tgacttcact aatggcctgc tttcctcaca agaagacaac ttggactcag 2640 atgaagacca gtccctttct gcagtttcct ttaacacagc aagatagcca gcacttccag 2700 gtgaaatctc ccttaagact agcagagaat tggaactttt tctgttactt tcttcagagc 2760 tatatcgacg tatctggaca agtgataaaa aacaatagct attatggtct acggtgtgcc 2820 agccactgtt taagggcttt atgtatgtta actttggaga tgggtgccgt tttccaggtg 2880 agggaactga aacttggggg aaattaagtc agggtcacac agccagaaag tggccactag 2940 gatctcaacc aaggcagtgt ggcttcagag tttgtgctca caagtgctgt ctgtagtgcc 3000 attcaaataa aaaaggtcac atgccacaag ctaggctgat tcccagcacc ttcccccaaa 3060 agaatgtaaa gggaagaacc aggacactgc tatcagcaac tctttatcca gtttttcatg 3120 gagacaacgg ttgacctgtt tggacaatgg acattacagt gattgattgg caggatgcat 3180 gacacgctgg gcttgaaaag ggactgtttg tatggcaccc cactttttac cagcagagct 3240 ctgaaaaagg caataattag aatcatgcaa aagcataatt aagtttctga cattaagcag 3300 cagtaaagaa ttgtaccaga aaaaaaaaaa aaaaa 3335
<210> 6 <210> 6
<211> 6333 <211> 6333
<212> DNA <212> DNA
<213> Homo sapiens <213> Homo sapiens
<400> 6 <400> 6
gtagtctctt tggaaacttc tgcaggggaa aagagctagg aaagagctgc aaagcagtgt 60 gggctttttc ccttttttgc tccttttcat tacccctcct ccgttttcac ccttctccgg 120 acttcgcgta gaacctgcga atttcgaaga ggaggtggca aagtgggaga aaagaggtgt 180 tagggtttgg ggtttttttg tttttgtttt tgttttttaa tttcttgatt tcaacatttt 240 gtagtctctt tggaaacttc tgcaggggaa aagagctagg aaagagctgc aaagcagtgt 60 gggctttttc ccttttttgc tccttttcat tacccctcct ccgttttcac ccttctccgg 120 acttcgcgta gaacctgttttttga atttcga taga tcaggtaggt 180
ctcccaccct ctcggctgca gccaacgcct cttacctgtt ctgcggcgcc gcgcaccgct 300 ggcagctgag ggttagaaag cggggtgtat tttagatttt aagcaaaaat tttaaagata 360 aatccatttt tctctcccac ccccaacgcc atctccactg catccgatct cattatttcg 420 gtggttgctt gggggtgaac aattttgtgg ctttttttcc cctataattc tgacccgctc 480 aggcttgagg gtttctccgg cctccgctca ctgcgtgcac ctggcgctgc cctgcttccc 540 ccaacctgtt gcaaggcttt aattcttgca actgggacct gctcgcaggc accccagccc 600 tccacctctc tctacatttt tgcaagtgtc tgggggaggg cacctgctct acctgccaga 660 aattttaaaa caaaaacaaa aacaaaaaaa tctccggggg ccctcttggc ccctttatcc 720 ctgcactctc gctctcctgc cccaccccga ggtaaagggg gcgactaaga gaagatggtg 780 ttgctcaccg cggtcctcct gctgctggcc gcctatgcgg ggccggccca gagcctgggc 840 tccttcgtgc actgcgagcc ctgcgacgag aaagccctct ccatgtgccc ccccagcccc 900 ctgggctgcg agctggtcaa ggagccgggc tgcggctgct gcatgacctg cgccctggcc 960 gaggggcagt cgtgcggcgt ctacaccgag cgctgcgccc aggggctgcg ctgcctcccc 1020 cggcaggacg aggagaagcc gctgcacgcc ctgctgcacg gccgcggggt ttgcctcaac 1080 gaaaagagct accgcgagca agtcaagatc gagagagact cccgtgagca cgaggagccc 1140 accacctctg agatggccga ggagacctac tcccccaaga tcttccggcc caaacacacc 1200 cgcatctccg agctgaaggc tgaagcagtg aagaaggacc gcagaaagaa gctgacccag 1260 tccaagtttg tcgggggagc cgagaacact gcccaccccc ggatcatctc tgcacctgag 1320 atgagacagg agtctgagca gggcccctgc cgcagacaca tggaggcttc cctgcaggag 1380 ctcaaagcca gcccacgcat ggtgccccgt gctgtgtacc tgcccaattg tgaccgcaaa 1440 ggattctaca agagaaagca gtgcaaacct tcccgtggcc gcaagcgtgg catctgctgg 1500 tgcgtggaca agtacgggat gaagctgcca ggcatggagt acgttgacgg ggactttcag 1560 tgccacacct tcgacagcag caacgttgag tgatgcgtcc ccccccaacc tttccctcac 1620 cccctcccac ccccagcccc gactccagcc agcgcctccc tccaccccag gacgccactc 1680 atttcatctc atttaaggga aaaatatata tctatctatt tgaggaaact gaggacctcg 1740 gaatctctag caagggctca acttcgaaaa tggcaacaac agagatgcaa aaagctaaaa 1800 agacaccccc cccctttaaa tggttttctt tttgaggcaa gttggatgaa cagagaaggg 1860 aagagaggaa gaacgagagg aagagaaggg aaggaagtgt ttgtgtagaa gagagagaaa 1920 gacgaataga gttaggaaaa ggaagacaag caggtgggca ggaaggacat gcaccgagac 1980 caggcagggg cccaactttc acgtccagcc ctggcctggg gtcgggagag gtgggcgcta 2040 gaagatgcag cccaggatgt ggcaatcaat gacactattg gggtttccca ggatggattg 2100 gtcaggggga gaaaggaaaa ggcaaaacac tccaggacct ctcccggatc tgtctcctcc 2160 tctagccagc agtatggaca gctggacccc tgaacttcct ctcctcttac ctgggcagag 2220 tgttgtctct ccccaaattt ataaaaacta aaatgcattc cattcctctg aaagcaaaac 2280 aaattcataa ttgagtgata ttaaatagag aggttttcgg aagcagatct gtgaatatga 2340 aatacatgtg catatttcat tccccaggca gacatttttt agaaatcaat acatgcccca 2400 atattggaaa gacttgttct tccacggtga ctacagtaca tgctgaagcg tgccgtttca 2460 gccctcattt aattcaattt gtaagtagcg cagcagcctc tgtgggggag gataggctga 2520 aaaaaaaaag tgggctcgta tttatctaca ggactccata tagtcatata taggcatata 2580 aatctattct ttttctttgt ttttttcttt cttcctttct ttcaaaggtt tgcattaact 2640 ctcccaccct ctcggctgca gccaacgcct cttacctgtt ctgcggcgcc gcgcaccgct 300 ggcagctgag ggttagaaag cggggtgtat tttagatttt aagcaaaaat tttaaagata 360 aatccatttt tctctcccac ccccaacgcc atctccactg catccgatct cattatttcg 420 gtggttgctt gggggtgaac aattttgtgg ctttttttcc cctataattc tgacccgctc 480 aggcttgagg gtttctccgg cctccgctca ctgcgtgcac ctggcgctgc cctgcttccc 540 ccaacctgtt gcaaggcttt aattcttgca actgggacct gctcgcaggc accccagccc 600 tccacctctc tctacatttt tgcaagtgtc tgggggaggg cacctgctct acctgccaga 660 aattttaaaa caaaaacaaa aacaaaaaaa tctccggggg ccctcttggc ccctttatcc 720 ctgcactctc gctctcctgc cccaccccga ggtaaagggg gcgactaaga gaagatggtg 780 ttgctcaccg cggtcctcct gctgctggcc gcctatgcgg ggccggccca gagcctgggc 840 tccttcgtgc actgcgagcc ctgcgacgag aaagccctct ccatgtgccc ccccagcccc 900 ctgggctgcg agctggtcaa ggagccgggc tgcggctgct gcatgacctg cgccctggcc 960 gaggggcagt cgtgcggcgt ctacaccgag cgctgcgccc aggggctgcg ctgcctcccc 1020 cggcaggacg aggagaagcc gctgcacgcc ctgctgcacg gccgcggggt ttgcctcaac 1080 gaaaagagct accgcga gca agtcaagatc gagagagact cccgtgagca cgaggagccc 1140 accacctctg agatggccga ggagacctac tcccccaaga tcttccggcc caaacacacc 1200 cgcatctccg agctgaaggc tgaagcagtg aagaaggacc gcagaaagaa gctgacccag 1260 tccaagtttg tcgggggagc cgagaacact gcccaccccc ggatcatctc tgcacctgag 1320 atgagacagg agtctgagca gggcccctgc cgcagacaca tggaggcttc cctgcaggag 1380 ctcaaagcca gcccacgcat ggtgccccgt gctgtgtacc tgcccaattg tgaccgcaaa 1440 ggattctaca agagaaagca gtgcaaacct tcccgtggcc gcaagcgtgg catctgctgg 1500 tgcgtggaca agtacgggat gaagctgcca ggcatggagt acgttgacgg ggactttcag 1560 tgccacacct tcgacagcag caacgttgag tgatgcgtcc ccccccaacc tttccctcac 1620 cccctcccac ccccagcccc gactccagcc agcgcctccc tccaccccag gacgccactc 1680 atttcatctc atttaaggga aaaatatata tctatctatt tgaggaaact gaggacctcg 1740 gaatctctag caagggctca acttcgaaaa tggcaacaac agagatgcaa aaagctaaaa 1800 agacaccccc cccctttaaa tggttttctt tttgaggcaa gttggatgaa cagagaaggg 1860 aagagaggaa gaacgagagg aagagaaggg aaggaagtgt ttgtgtagaa gagagagaaa 1920 gacgaataga gttaggaaaa gg aagacaag caggtgggca ggaaggacat gcaccgagac 1980 caggcagggg cccaactttc acgtccagcc ctggcctggg gtcgggagag gtgggcgcta 2040 gaagatgcag cccaggatgt ggcaatcaat gacactattg gggtttccca ggatggattg 2100 gtcaggggga gaaaggaaaa ggcaaaacac tccaggacct ctcccggatc tgtctcctcc 2160 tctagccagc agtatggaca gctggacccc tgaacttcct ctcctcttac ctgggcagag 2220 tgttgtctct ccccaaattt ataaaaacta aaatgcattc cattcctctg aaagcaaaac 2280 aaattcataa ttgagtgata ttaaatagag aggttttcgg aagcagatct gtgaatatga 2340 aatacatgtg catatttcat tccccaggca gacatttttt agaaatcaat acatgcccca 2400 atattggaaa gacttgttct tccacggtga ctacagtaca tgctgaagcg tgccgtttca 2460 gccctcattt aattcaattt gtaagtagcg cagcagcctc tgtgggggag gataggctga 2520 aaaaaaaaag tgggctcgta tttatctaca ggactccata tagtcatata taggcatata 2580 aatctattct ttttctttgt ttttttcttt cttcctttct ttcaaaggtt tgcattaact 2640
tttcaaagta gttcctatag gggcattgag gagcttcctc attctgggaa aactgagaaa 2700 acccatattc tcctaataca acccgtaata gcatttttgc ctgcctcgag gcagagtttc 2760 ccgtgagcaa taaactcagc ttttttgtgg ggcacagtac tggatttgac agtgattccc 2820 cacgtgtgtt catctgcacc caccgagcca ggcagaggcc agccctccgt ggtgcacaca 2880 gcacgcgcct cagtccatcc cattttagtc tttaaaccct caggaagtca cagtctccgg 2940 acaccacacc acatgagccc aacaggtcca cgatggatcc accagtccca ccccagcctt 3000 ttcctttcat ctgaacagaa tgtgcatttt tggaagcctc cctcactctc catgctggca 3060 gagcaggagg gagactgaag taagagatgg cagagggaga tggtggcaaa aaggtttaga 3120 tgcaggagaa cagtaagatg gatggttccg gccagagtcg atgtggggag gaacagaggg 3180 ctgaagggag agggggctga ctgttccatt ctagctttgg cacaaagcag cagaaagggg 3240 gaaaagccaa tagaaatttc cttagcttcc ccaccatatg tattttctag gatttgagag 3300 gaaagagagg aaaatggggg aatgggttgc aaaatagaaa tgagcttaat ccaggccgca 3360 gagccaggga aggtgagtaa ctttaggagg gtgctagact ttagaagcca gataggaaga 3420 atcagtctaa actggccatg ctttggaagg gacaagacta tgtgctccgc tgcccacctt 3480 cagcctgcaa tgagggactg aggcccacga gtctttccag ctcttcctcc attctggcca 3540 gtccctgcat cctccctggg gtggaggatg gaaggaaagc tgggacaagc agggaacgca 3600 tgattcaggg atgctgtcac tcggcagcca gattccgaaa ctcccattct ccaatgactt 3660 cctcaaccaa tgggtggcct tgtgactgtt ctttaaggct gaagatatcc aggaaagggg 3720 gcttggacac tggccaagga gaccccttcg tgctgtggac acagctctct tcactctttg 3780 ctcatggcat gacacagcgg agaccgcctc caacaacgaa tttggggcta cgaagaggaa 3840 tagcgaaaaa gcaaatctgt ttcaactgat gggaacccta tagctataga acttgggggc 3900 tatctcctat gcccctggac aggacagttg gctggggaca ggagaagtgc tcaatcttca 3960 tgagacaaag gggcccgata gggccagcag ccacaaggcc ttgacctgcc gagtcagcat 4020 gccccatctc tctgcacagc tgtcccctaa acccaactca cgtttctgta tgtcttaggc 4080 cagtatccca aacctcttcc acgtcactgt tctttccacc cattctccct ttgcatcttg 4140 agcagttatc caactaggat ctgccaagtg gatactgggg tgccactccc ctgagaaaag 4200 actgagccag gaactacaag ctccccccac attcctccca gcctggacct aattcttgag 4260 aggggctctc tcttcacgga ctgtgtctgg actttgagca ggcttctgcc ccttgcgttg 4320 gctctttgct gccagccatc aggtggggga ttagagcctg gtgtaagtgc gccagactct 4380 tccggtttcc aaagttcgtg cctgcgaacc caaacctgtg agtctcttct gcatgcagga 4440 gtttctcctg ggcagctggt cactccccag agaagctggg ccttcatgga cacatggaac 4500 taagcctccc aaatgggagt tctggctgag cccagggtgg ggagatcctg ggaagggagg 4560 cactggagga agacggcacc tcttccccca tggcagggtg tgagggaggc aggtttggaa 4620 tggtgcgagt atggcaatct aagcaggggt ctggtctctt tgactccagg ctggcctttg 4680 gccgactgtc tgctcaccca gagaccttgg actccggact atccatggct ccgaatctaa 4740 gtgctgccca ctcccatgct cacacccaca gaaggtcttc ccatcccctt tagattcgtg 4800 cctcactcca ccagtgagga agatgcctct gtctttccca cgactgccag gagataggga 4860 agcccagcca ggactgaccc tccttcctcc agcctgccct gacccacctg gcaaagcagg 4920 gcacatgggg aggaagagac tggaaccttt ctttgacagc caggcctaga cagacaggcc 4980 tggggacact ggccccatga ggggaggaag gcaggcgcac gaggtccagg gaggcccttt 5040 tctgatcatg ccccttctct cccaccccat ctccccacca ccacctctgt ggcctccatg 5100 gtacccccac agggctggcc tcccctagag ggtgggcctc aaccacctgc tcccgccacg 5160 caccggttag tgagacaggg ctgccacggc aaccgccaag cccccctcaa ggtgggacag 5220 taccccggac ccatccactc actcctgaga gggctccggc ccagaatggg aacctcagag 5280 aagagctcta aggagaagaa accccatagc gtcagagagg atatgtctgg cttccaagag 5340 aaaggaggct ccgttttgca aagtggagga gggacgaggg acaggggttt caccagccag 5400 caacctgggc cttgtactgt ctgtgttttt aaaaccacta aagtgcaaga attacattgc 5460 actgtttctc cactttttat tttctcttag gcttttgttt ctatttcaaa catactttct 5520 tttcaaagta gttcctatag gggcattgag gagcttcctc attctgggaa aactgagaaa 2700 acccatattc tcctaataca acccgtaata gcatttttgc ctgcctcgag gcagagtttc 2760 ccgtgagcaa taaactcagc ttttttgtgg ggcacagtac tggatttgac agtgattccc 2820 cacgtgtgtt catctgcacc caccgagcca ggcagaggcc agccctccgt ggtgcacaca 2880 gcacgcgcct cagtccatcc cattttagtc tttaaaccct caggaagtca cagtctccgg 2940 acaccacacc acatgagccc aacaggtcca cgatggatcc accagtccca ccccagcctt 3000 ttcctttcat ctgaacagaa tgtgcatttt tggaagcctc cctcactctc catgctggca 3060 gagcaggagg gagactgaag taagagatgg cagagggaga tggtggcaaa aaggtttaga 3120 tgcaggagaa cagtaagatg gatggttccg gccagagtcg atgtggggag gaacagaggg 3180 ctgaagggag agggggctga ctgttccatt ctagctttgg cacaaagcag cagaaagggg 3240 gaaaagccaa tagaaatttc cttagcttcc ccaccatatg tattttctag gatttgagag 3300 gaaagagagg aaaatggggg aatgggttgc aaaatagaaa tgagcttaat ccaggccgca 3360 gagccaggga aggtgagtaa ctttaggagg gtgctagact ttagaagcca gataggaaga 3420 atcagtctaa actggccatg ctttggaagg gacaagacta tgtgctccgc tgcccacctt 3480 cagcct gcaa tgagggactg aggcccacga gtctttccag ctcttcctcc attctggcca 3540 gtccctgcat cctccctggg gtggaggatg gaaggaaagc tgggacaagc agggaacgca 3600 tgattcaggg atgctgtcac tcggcagcca gattccgaaa ctcccattct ccaatgactt 3660 cctcaaccaa tgggtggcct tgtgactgtt ctttaaggct gaagatatcc aggaaagggg 3720 gcttggacac tggccaagga gaccccttcg tgctgtggac acagctctct tcactctttg 3780 ctcatggcat gacacagcgg agaccgcctc caacaacgaa tttggggcta cgaagaggaa 3840 tagcgaaaaa gcaaatctgt ttcaactgat gggaacccta tagctataga acttgggggc 3900 tatctcctat gcccctggac aggacagttg gctggggaca ggagaagtgc tcaatcttca 3960 tgagacaaag gggcccgata gggccagcag ccacaaggcc ttgacctgcc gagtcagcat 4020 gccccatctc tctgcacagc tgtcccctaa acccaactca cgtttctgta tgtcttaggc 4080 cagtatccca aacctcttcc acgtcactgt tctttccacc cattctccct ttgcatcttg 4140 agcagttatc caactaggat ctgccaagtg gatactgggg tgccactccc ctgagaaaag 4200 actgagccag gaactacaag ctccccccac attcctccca gcctggacct aattcttgag 4260 aggggctctc tcttcacgga ctgtgtctgg actttgagca ggcttctgcc ccttgcgttg 4320 gctctttgct g ccagccatc aggtggggga ttagagcctg gtgtaagtgc gccagactct 4380 tccggtttcc aaagttcgtg cctgcgaacc caaacctgtg agtctcttct gcatgcagga 4440 gtttctcctg ggcagctggt cactccccag agaagctggg ccttcatgga cacatggaac 4500 taagcctccc aaatgggagt tctggctgag cccagggtgg ggagatcctg ggaagggagg 4560 cactggagga agacggcacc tcttccccca tggcagggtg tgagggaggc aggtttggaa 4620 tggtgcgagt atggcaatct aagcaggggt ctggtctctt tgactccagg ctggcctttg 4680 gccgactgtc tgctcaccca gagaccttgg actccggact atccatggct ccgaatctaa 4740 gtgctgccca ctcccatgct cacacccaca gaaggtcttc ccatcccctt tagattcgtg 4800 cctcactcca ccagtgagga agatgcctct gtctttccca cgactgccag gagataggga 4860 agcccagcca ggactgaccc tccttcctcc agcctgccct gacccacctg gcaaagcagg 4920 gcacatgggg aggaagagac tggaaccttt ctttgacagc caggcctaga cagacaggcc 4980 tggggacact ggccccatga ggggaggaag gcaggcgcac gaggtccagg gaggcccttt 5040 tctgatcatg ccccttctct cccaccccat ctccccacca ccacctctgt ggcctccatg 5100 gtacccccac agggctggcc tcccctagag ggtgggcctc aaccacctgc tcccgccacg 5160 caccggttag tgagaca ggg ctgccacggc aaccgccaag cccccctcaa ggtgggacag 5220 taccccggac ccatccactc actcctgaga gggctccggc ccagaatggg aacctcagag 5280 aagagctcta aggagaagaa accccatagc gtcagagagg atatgtctgg cttccaagag 5340 aaaggaggct ccgttttgca aagtggagga gggacgaggg acaggggttt caccagccag 5400 caacctgggc cttgtactgt ctgtgttttt aaaaccacta aagtgcaaga attacattgc 5460 actgtttctc cactttttat tttctcttag gcttttgttt ctatttcaaa catactttct 5520
tggttttcta atggagtata tagtttagtc atttcacaga ctctggcctc ctctcctgaa 5580 atccttttgg atggggaaag ggaaggtggg gagggtccga ggggaagggg accccagctt 5640 ccctgtgccc gctcacccca ctccaccagt ccccggtcgc cagccggagt ctcctctcta 5700 ccgccactgt cacaccgtag cccacatgga tagcacagtt gtcagacaag attccttcag 5760 attccgagtt gcctaccggt tgttttcgtt gttgttgttg ttgtttttct ttttcttttt 5820 tggttttcta atggagtata tagtttagtc atttcacaga ctctggcctc ctctcctgaa 5580 atccttttgg atggggaaag ggaaggtggg gagggtccga ggggaagggg accccagctt 5640 ccctgtgccc gctcacccca ctccaccagt ccccggtcgc cagccggagt ctcctctcta 5700 ccgccactgt cacaccgtag cccacatgga tagcacagtt gtcagacaag attccttcag 5760 attccgagtt gcctaccggt tgttttcgtt gttgttgttg ttgtttttct ttttcttttt 5820
ttttttgaag acagcaataa ccacagtaca tattactgta gttctctata gttttacata 5880 cattcatacc ataactctgt tctctcctct tttttgtttt caactttaaa aacaaaaata 5940 aacgatgata atctttactg gtgaaaagga tggaaaaata aatcaacaaa tgcaaccagt 6000 ttgtgagaaa aaaaaaaaaa agccgaaaaa aaaaaaaaaa acacctgaat gcggaagagc 6060 tcggctcccg tttagcattt tgtacttaag gaaataaaaa accaacaaag gatctcacat 6120 tttcttaaaa agtgaagatt gctgtatact atttattcaa cttataattt atgttactcc 6180 ttttttgaag acagcaataa ccacagtaca tattactgta gttctctata gttttacata 5880 cattcatacc ataactctgt tctctcctct tttttgtttt caactttaaa aacaaaaata 5940 aacgatgata atctttactg gtgaaaagga tggaaaaata aatcaacaaa tgcaaccagt 6000 ttgtgagaaa aaaaaaaaaa agccgaaaaa aaaaaaaaaa acacctgaat gcggaagagc 6060 tcggctcccg tttagcattt tgtacttaag gaaataaaaa accaacaaag gatctcacat 6120 tttcttaaaa agtgaagatt gctgtatact atttattcaa cttataattt atgttactcc 6180
ttgatctttg tcttttgtca tgacaaagca tttatttaat aaagttatgc attcagttaa 6240 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 6300 ttgatctttg tcttttgtca tgacaaagca tttatttaat aaagttatgc attcagttaa 6240 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaa aaaaaaaaaa aaaaaaaaaa 6300
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 6333 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 6333
<210> 7 <210> 7
<211> 3659 <211> 3659
<212> DNA <212> DNA
<213> Homo sapiens <213> Homo sapiens
<400> 7 <400> 7
cgggagcccg cggcgagcgt agcgcaagtc cgctccctag gcatcgctgc gctggcagcg 60 attcgctgtc tcttgtgagt caggggacaa cgcttcgggg caactgtgag tgcgcgtgtg 120 ggggacctcg attctcttca gatctcgagg attcggtccg gggacgtctc ctgatcccct 180 actaaagcgc ctgctaactt tgaaaaggag cactgtgtcc tgcaaagttt gacacataaa 240 ggataggaaa agagaggaga gaaaagcaac tgagttgaag gagaaggagc tgatgcgggc 300 ctcctgatca attaagagga gagttaaacc gccgagatcc cggcgggacc aaggaggtgc 360 ggggcaagaa ggaacggaag cggtgcgatc cacagggctg ggttttcttg caccttgggt 420 cacgcctcct tggcgagaaa gcgcctcgca tttgattgct tccagttatt gcagaacttc 480 ctgtcctggt ggagaagcgg gtctcgcttg ggttccgcta atttctgtcc tgaggcgtga 540 gactgagttc atagggtcct gggtccccga accaggaagg gttgagggaa cacaatctgc 600 aagcccccgc gacccaagtg aggggccccg tgttggggtc ctccctccct ttgcattccc 660 acccctccgg gctttgcgtc ttcctgggga ccccctcgcc gggagatggc cgcgttgatg 720 cggagcaagg attcgtcctg ctgcctgctc ctactggccg cggtgctgat ggtggagagc 780 tcacagatcg gcagttcgcg ggccaaactc aactccatca agtcctctct gggcggggag 840 acgcctggtc aggccgccaa tcgatctgcg ggcatgtacc aaggactggc attcggcggc 900 agtaagaagg gcaaaaacct ggggcaggcc tacccttgta gcagtgataa ggagtgtgaa 960 gttgggaggt attgccacag tccccaccaa ggatcatcgg cctgcatggt gtgtcggaga 1020 aaaaagaagc gctgccaccg agatggcatg tgctgcccca gtacccgctg caataatggc 1080 atctgtatcc cagttactga aagcatctta acccctcaca tcccggctct ggatggtact 1140 cggcacagag atcgaaacca cggtcattac tcaaaccatg acttgggatg gcagaatcta 1200 ggaagaccac acactaagat gtcacatata aaagggcatg aaggagaccc ctgcctacga 1260 tcatcagact gcattgaagg gttttgctgt gctcgtcatt tctggaccaa aatctgcaaa 1320 ccagtgctcc atcaggggga agtctgtacc aaacaacgca agaagggttc tcatgggctg 1380 gaaattttcc agcgttgcga ctgtgcgaag ggcctgtctt gcaaagtatg gaaagatgcc 1440 acctactcct ccaaagccag actccatgtg tgtcagaaaa tttgatcacc attgaggaac 1500 atcatcaatt gcagactgtg aagttgtgta tttaatgcat tatagcatgg tggaaaataa 1560 ggttcagatg cagaagaatg gctaaaataa gaaacgtgat aagaatatag atgatcacaa 1620 aaagggagaa agaaaacatg aactgaatag attagaatgg gtgacaaatg cagtgcagcc 1680 agtgtttcca ttatgcaact tgtctatgta aataatgtac acatttgtgg aaaatgctat 1740 tattaagaga acaagcacac agtggaaatt actgatgagt agcatgtgac tttccaagag 1800 tttaggttgt gctggaggag aggtttcctt cagattgctg attgcttata caaataacct 1860 acatgccaga tttctattca acgttagagt ttaacaaaat actcctagaa taacttgtta 1920 tacaataggt tctaaaaata aaattgctaa acaagaaatg aaaacatgga gcattgttaa 1980 tttacaacag aaaattacct tttgatttgt aacactactt ctgctgttca atcaagagtc 2040 ttggtagata agaaaaaaat cagtcaatat ttccaaataa ttgcaaaata atggccagtt 2100 gtttaggaag gcctttagga agacaaataa ataacaaaca aacagccaca aatacttttt 2160 tttcaaaatt ttagttttac ctgtaattaa taagaactga tacaagacaa aaacagttcc 2220 ttcagattct acggaatgac agtatatctc tctttatcct atgtgattcc tgctctgaat 2280 gcattatatt ttccaaacta tacccataaa ttgtgactag taaaatactt acacagagca 2340 gaattttcac agatggcaaa aaaatttaaa gatgtccaat atatgtggga aaagagctaa 2400 cagagagatc attatttctt aaagattggc cataacctgt attttgatag aattagattg 2460 gtaaatacat gtattcatac atactctgtg gtaatagaga cttgagctgg atctgtactg 2520 cactggagta agcaagaaaa ttgggaaaac tttttcgttt gttcaggttt tggcaacaca 2580 tagatcatat gtctgaggca caagttggct gttcatcttt gaaaccaggg gatgcacagt 2640 ctaaatgaat atctgcatgg gatttgctat cataatattt actatgcaga tgaattcagt 2700 gtgaggtcct gtgtccgtac tatcctcaaa ttatttattt tatagtgctg agatcctcaa 2760 ataatctcaa tttcaggagg tttcacaaaa tggactcctg aagtagacag agtagtgagg 2820 tttcattgcc ctctataagc ttctgactag ccaatggcat catccaattt tcttcccaaa 2880 cctctgcagc atctgcttta ttgccaaagg gctagtttcg gttttctgca gccattgcgg 2940 ttaaaaaata taagtaggat aacttgtaaa acctgcatat tgctaatcta tagacaccac 3000 agtttctaaa ttctttgaaa ccactttact acttttttta aacttaactc agttctaaat 3060 actttgtctg gagcacaaaa caataaaagg ttatcttata gtcgtgactt taaacttttg 3120 tagaccacaa ttcacttttt agttttcttt tacttaaatc ccatctgcag tctcaaattt 3180 aagttctccc agtagagatt gagtttgagc ctgtatatct attaaaaatt tcaacttccc 3240 acatatattt actaagatga ttaagactta cattttctgc acaggtctgc aaaaacaaaa 3300 attataaact agtccatcca agaaccaaag tttgtataaa caggttgcta taagcttggt 3360 gaaatgaaaa tggaacattt caatcaaaca tttcctatat aacaattatt atatttacaa 3420 tttggtttct gcaatatttt tcttatgtcc acccttttaa aaattattat ttgaagtaat 3480 ttatttacag gaaatgttaa tgagatgtat tttcttatag agatatttct tacagaaagc 3540 tttgtagcag aatatatttg cagctattga ctttgtaatt taggaaaaat gtataataag 3600 ataaaatcta ttaaattttt ctcctctaaa aactgaaaaa aaaaaaaaaa aaaaaaaaa 3659 cgggagcccg cggcgagcgt agcgcaagtc cgctccctag gcatcgctgc gctggcagcg 60 attcgctgtc tcttgtgagt caggggacaa cgcttcgggg caactgtgag tgcgcgtgtg 120 ggggacctcg attctcttca gatctcgagg attcggtccg gggacgtctc ctgatcccct 180 actaaagcgc ctgctaactt tgaaaaggag cactgtgtcc tgcaaagttt gacacataaa 240 ggataggaaa agagaggaga gaaaagcaac tgagttgaag gagaaggagc tgatgcgggc 300 ctcctgatca attaagagga gagttaaacc gccgagatcc cggcgggacc aaggaggtgc 360 ggggcaagaa ggaacggaag cggtgcgatc cacagggctg ggttttcttg caccttgggt 420 cacgcctcct tggcgagaaa gcgcctcgca tttgattgct tccagttatt gcagaacttc 480 ctgtcctggt ggagaagcgg gtctcgcttg ggttccgcta atttctgtcc tgaggcgtga 540 gactgagttc atagggtcct gggtccccga accaggaagg gttgagggaa cacaatctgc 600 aagcccccgc gacccaagtg aggggccccg tgttggggtc ctccctccct ttgcattccc 660 acccctccgg gctttgcgtc ttcctgggga ccccctcgcc gggagatggc cgcgttgatg 720 cggagcaagg attcgtcctg ctgcctgctc ctactggccg cggtgctgat ggtggagagc 780 tcacagatcg gcagttcgcg ggccaaactc aactccatca agtcctctct gggcggggag 840 acgcctggtc aggccgccaa tcgatctgcg ggcatgtacc aaggactggc attcggcggc 900 agtaagaagg gcaaaaacct ggggcaggcc tacccttgta gcagtgataa ggagtgtgaa 960 gttgggaggt attgccacag tccccaccaa ggatcatcgg cctgcatggt gtgtcggaga 1020 aaaaagaagc gctgccaccg agatggcatg tgctgcccca gtacccgctg caataatggc 1080 atctgtatcc cagttactga aagcatctta acccctcaca tcccggctct ggatggtact 1140 cggcacagag atcgaaacca cggtcattac tcaaaccatg acttgggatg gcagaatcta 1200 ggaagaccac acactaagat gtcacatata aaagggcatg aaggagaccc ctgcctacga 1260 tcatcagact gcattgaagg gttttgctgt gctcgtcatt tctggaccaa aatctgcaaa 1320 ccagtgctcc atcaggggga agtctgtacc aaacaacgca agaagggttc tcatgggctg 1380 gaaattttcc agcgttgcga ctgtgcgaag ggcctgtctt gcaaagtatg gaaagatgcc 1440 acctactcct ccaaagccag actccatgtg tgtcagaaaa tttgatcacc attgaggaac 1500 atcatcaatt gcagactgtg aagttgtgta tttaatgcat tatagcatgg tggaaaataa 1560 ggttcagatg cagaagaatg gctaaaataa gaaacgtgat aagaatatag atgatcacaa 1620 aaagggagaa agaaaacatg aactgaatag attagaatgg gtgacaaatg cagtgcagcc 1680 agtgtttcca ttatgcaact tgtctat gta aataatgtac acatttgtgg aaaatgctat 1740 tattaagaga acaagcacac agtggaaatt actgatgagt agcatgtgac tttccaagag 1800 tttaggttgt gctggaggag aggtttcctt cagattgctg attgcttata caaataacct 1860 acatgccaga tttctattca acgttagagt ttaacaaaat actcctagaa taacttgtta 1920 tacaataggt tctaaaaata aaattgctaa acaagaaatg aaaacatgga gcattgttaa 1980 tttacaacag aaaattacct tttgatttgt aacactactt ctgctgttca atcaagagtc 2040 ttggtagata agaaaaaaat cagtcaatat ttccaaataa ttgcaaaata atggccagtt 2100 gtttaggaag gcctttagga agacaaataa ataacaaaca aacagccaca aatacttttt 2160 tttcaaaatt ttagttttac ctgtaattaa taagaactga tacaagacaa aaacagttcc 2220 ttcagattct acggaatgac agtatatctc tctttatcct atgtgattcc tgctctgaat 2280 gcattatatt ttccaaacta tacccataaa ttgtgactag taaaatactt acacagagca 2340 gaattttcac agatggcaaa aaaatttaaa gatgtccaat atatgtggga aaagagctaa 2400 cagagagatc attatttctt aaagattggc cataacctgt attttgatag aattagattg 2460 gtaaatacat gtattcatac atactctgtg gtaatagaga cttgagctgg atctgtactg 2520 cactggagta agcaagaaaa ttgggaaaac tt tttcgttt gttcaggttt tggcaacaca 2580 tagatcatat gtctgaggca caagttggct gttcatcttt gaaaccaggg gatgcacagt 2640 ctaaatgaat atctgcatgg gatttgctat cataatattt actatgcaga tgaattcagt 2700 gtgaggtcct gtgtccgtac tatcctcaaa ttatttattt tatagtgctg agatcctcaa 2760 ataatctcaa tttcaggagg tttcacaaaa tggactcctg aagtagacag agtagtgagg 2820 tttcattgcc ctctataagc ttctgactag ccaatggcat catccaattt tcttcccaaa 2880 cctctgcagc atctgcttta ttgccaaagg gctagtttcg gttttctgca gccattgcgg 2940 ttaaaaaata taagtaggat aacttgtaaa acctgcatat tgctaatcta tagacaccac 3000 agtttctaaa ttctttgaaa ccactttact acttttttta aacttaactc agttctaaat 3060 actttgtctg gagcacaaaa caataaaagg ttatcttata gtcgtgactt taaacttttg 3120 tagaccacaa ttcacttttt agttttcttt tacttaaatc ccatctgcag tctcaaattt 3180 aagttctccc agtagagatt gagtttgagc ctgtatatct attaaaaatt tcaacttccc 3240 acatatattt actaagatga ttaagactta cattttctgc acaggtctgc aaaaacaaaa 3300 attataaact agtccatcca agaaccaaag tttgtataaa caggttgcta taagcttggt 3360 gaaatgaaaa tggaacattt caatcaaaca tttcctat at aacaattatt atatttacaa 3420 tttggtttct gcaatatttt tcttatgtcc acccttttaa aaattattat ttgaagtaat 3480 ttatttacag gaaatgttaa tgagatgtat tttcttatag agatatttct tacagaaagc 3540 tttgtagcag aatatatttg cagctattga ctttgtaatt taggaaaaat gtataataag 3600 ataaaatcta ttaaattttt ctcctctaaa aactgaaaaa aaaaaaaaaa aaaaaaaaa 3659
<210> 8 <210> 8
<211> 5411 <211> 5411
<212> DNA <212> DNA
<213> Homo sapiens <213> Homo sapiens
<400> 8 <400> 8
gtgtcccata gtgtttccaa acttggaaag ggcgggggag ggcgggagga tgcggagggc 60 ggaggtatgc agacaacgag tcagagtttc cccttgaaag cctcaaaagt gtccacgtcc 120 tcaaaaagaa tggaaccaat ttaagaagcc agccccgtgg ccacgtccct tcccccattc 180 gctccctcct ctgcgccccc gcaggctcct cccagctgtg gctgcccggg cccccagccc 240 cagccctccc attggtggag gcccttttgg aggcacccta gggccaggga aacttttgcc 300 gtataaatag ggcagatccg ggctttatta ttttagcacc acggcagcag gaggtttcgg 360 ctaagttgga ggtactggcc acgactgcat gcccgcgccc gccaggtgat acctccgccg 420 gtgacccagg ggctctgcga cacaaggagt ctgcatgtct aagtgctaga catgctcagc 480 tttgtggata cgcggacttt gttgctgctt gcagtaacct tatgcctagc aacatgccaa 540 tctttacaag aggaaactgt aagaaagggc ccagccggag atagaggacc acgtggagaa 600 aggggtccac caggcccccc aggcagagat ggtgaagatg gtcccacagg ccctcctggt 660 ccacctggtc ctcctggccc ccctggtctc ggtgggaact ttgctgctca gtatgatgga 720 aaaggagttg gacttggccc tggaccaatg ggcttaatgg gacctagagg cccacctggt 780 gcagctggag ccccaggccc tcaaggtttc caaggacctg ctggtgagcc tggtgaacct 840 ggtcaaactg gtcctgcagg tgctcgtggt ccagctggcc ctcctggcaa ggctggtgaa 900 gatggtcacc ctggaaaacc cggacgacct ggtgagagag gagttgttgg accacagggt 960 gctcgtggtt tccctggaac tcctggactt cctggcttca aaggcattag gggacacaat 1020 ggtctggatg gattgaaggg acagcccggt gctcctggtg tgaagggtga acctggtgcc 1080 cctggtgaaa atggaactcc aggtcaaaca ggagcccgtg ggcttcctgg tgagagagga 1140 cgtgttggtg cccctggccc agctggtgcc cgtggcagtg atggaagtgt gggtcccgtg 1200 ggtcctgctg gtcccattgg gtctgctggc cctccaggct tcccaggtgc ccctggcccc 1260 aagggtgaaa ttggagctgt tggtaacgct ggtcctgctg gtcccgccgg tccccgtggt 1320 gaagtgggtc ttccaggcct ctccggcccc gttggacctc ctggtaatcc tggagcaaac 1380 ggccttactg gtgccaaggg tgctgctggc cttcccggcg ttgctggggc tcccggcctc 1440 cctggacccc gcggtattcc tggccctgtt ggtgctgccg gtgctactgg tgccagagga 1500 cttgttggtg agcctggtcc agctggctcc aaaggagaga gcggtaacaa gggtgagccc 1560 ggctctgctg ggccccaagg tcctcctggt cccagtggtg aagaaggaaa gagaggccct 1620 aatggggaag ctggatctgc cggccctcca ggacctcctg ggctgagagg tagtcctggt 1680 tctcgtggtc ttcctggagc tgatggcaga gctggcgtca tgggccctcc tggtagtcgt 1740 ggtgcaagtg gccctgctgg agtccgagga cctaatggag atgctggtcg ccctggggag 1800 cctggtctca tgggacccag aggtcttcct ggttcccctg gaaatatcgg ccccgctgga 1860 aaagaaggtc ctgtcggcct ccctggcatc gacggcaggc ctggcccaat tggcccagct 1920 ggagcaagag gagagcctgg caacattgga ttccctggac ccaaaggccc cactggtgat 1980 cctggcaaaa acggtgataa aggtcatgct ggtcttgctg gtgctcgggg tgctccaggt 2040 cctgatggaa acaatggtgc tcagggacct cctggaccac agggtgttca aggtggaaaa 2100 ggtgaacagg gtccccctgg tcctccaggc ttccagggtc tgcctggccc ctcaggtccc 2160 gctggtgaag ttggcaaacc aggagaaagg ggtctccatg gtgagtttgg tctccctggt 2220 cctgctggtc caagagggga acgcggtccc ccaggtgaga gtggtgctgc cggtcctact 2280 ggtcctattg gaagccgagg tccttctgga cccccagggc ctgatggaaa caagggtgaa 2340 cctggtgtgg ttggtgctgt gggcactgct ggtccatctg gtcctagtgg actcccagga 2400 gagaggggtg ctgctggcat acctggaggc aagggagaaa agggtgaacc tggtctcaga 2460 ggtgaaattg gtaaccctgg cagagatggt gctcgtggtg ctcctggtgc tgtaggtgcc 2520 cctggtcctg ctggagccac aggtgaccgg ggcgaagctg gggctgctgg tcctgctggt 2580 cctgctggtc ctcggggaag ccctggtgaa cgtggtgagg tcggtcctgc tggccccaat 2640 ggatttgctg gtcctgctgg tgctgctggt caacctggtg ctaaaggaga aagaggagcc 2700 aaagggccta agggtgaaaa cggtgttgtt ggtcccacag gccccgttgg agctgctggc 2760 ccagctggtc caaatggtcc ccccggtcct gctggaagtc gtggtgatgg aggcccccct 2820 ggtatgactg gtttccctgg tgctgctgga cggactggtc ccccaggacc ctctggtatt 2880 tctggccctc ctggtccccc tggtcctgct gggaaagaag ggcttcgtgg tcctcgtggt 2940 gaccaaggtc cagttggccg aactggagaa gtaggtgcag ttggtccccc tggcttcgct 3000 ggtgagaagg gtccctctgg agaggctggt actgctggac ctcctggcac tccaggtcct 3060 cagggtcttc ttggtgctcc tggtattctg ggtctccctg gctcgagagg tgaacgtggt 3120 ctaccaggtg ttgctggtgc tgtgggtgaa cctggtcctc ttggcattgc cggccctcct 3180 ggggcccgtg gtcctcctgg tgctgtgggt agtcctggag tcaacggtgc tcctggtgaa 3240 gctggtcgtg atggcaaccc tgggaacgat ggtcccccag gtcgcgatgg tcaacccgga 3300 cacaagggag agcgcggtta ccctggcaat attggtcccg ttggtgctgc aggtgcacct 3360 ggtcctcatg gccccgtggg tcctgctggc aaacatggaa accgtggtga aactggtcct 3420 tctggtcctg ttggtcctgc tggtgctgtt ggcccaagag gtcctagtgg cccacaaggc 3480 attcgtggcg ataagggaga gcccggtgaa aaggggccca gaggtcttcc tggcttaaag 3540 ggacacaatg gattgcaagg tctgcctggt atcgctggtc accatggtga tcaaggtgct 3600 cctggctccg tgggtcctgc tggtcctagg ggccctgctg gtccttctgg ccctgctgga 3660 aaagatggtc gcactggaca tcctggtaca gttggacctg ctggcattcg aggccctcag 3720 ggtcaccaag gccctgctgg cccccctggt ccccctggcc ctcctggacc tccaggtgta 3780 agcggtggtg gttatgactt tggttacgat ggagacttct acagggctga ccagcctcgc 3840 tcagcacctt ctctcagacc caaggactat gaagttgatg ctactctgaa gtctctcaac 3900 aaccagattg agacccttct tactcctgaa ggctctagaa agaacccagc tcgcacatgc 3960 cgtgacttga gactcagcca cccagagtgg agcagtggtt actactggat tgaccctaac 4020 caaggatgca ctatggatgc tatcaaagta tactgtgatt tctctactgg cgaaacctgt 4080 atccgggccc aacctgaaaa catcccagcc aagaactggt ataggagctc caaggacaag 4140 aaacacgtct ggctaggaga aactatcaat gctggcagcc agtttgaata taatgtagaa 4200 ggagtgactt ccaaggaaat ggctacccaa cttgccttca tgcgcctgct ggccaactat 4260 gcctctcaga acatcaccta ccactgcaag aacagcattg catacatgga tgaggagact 4320 ggcaacctga aaaaggctgt cattctacag ggctctaatg atgttgaact tgttgctgag 4380 ggcaacagca ggttcactta cactgttctt gtagatggct gctctaaaaa gacaaatgaa 4440 tggggaaaga caatcattga atacaaaaca aataagccat cacgcctgcc cttccttgat 4500 attgcacctt tggacatcgg tggtgctgac caggaattct ttgtggacat tggcccagtc 4560 tgtttcaaat aaatgaactc aatctaaatt aaaaaagaaa gaaatttgaa aaaactttct 4620 ctttgccatt tcttcttctt cttttttaac tgaaagctga atccttccat ttcttctgca 4680 catctacttg cttaaattgt gggcaaaaga gaaaaagaag gattgatcag agcattgtgc 4740 aatacagttt cattaactcc ttcccccgct cccccaaaaa tttgaatttt tttttcaaca 4800 ctcttacacc tgttatggaa aatgtcaacc tttgtaagaa aaccaaaata aaaattgaaa 4860 aataaaaacc ataaacattt gcaccacttg tggcttttga atatcttcca cagagggaag 4920 tttaaaaccc aaacttccaa aggtttaaac tacctcaaaa cactttccca tgagtgtgat 4980 ccacattgtt aggtgctgac ctagacagag atgaactgag gtccttgttt tgttttgttc 5040 ataatacaaa ggtgctaatt aatagtattt cagatacttg aagaatgttg atggtgctag 5100 aagaatttga gaagaaatac tcctgtattg agttgtatcg tgtggtgtat tttttaaaaa 5160 atttgattta gcattcatat tttccatctt attcccaatt aaaagtatgc agattatttg 5220 cccaaatctt cttcagattc agcatttgtt ctttgccagt ctcattttca tcttcttcca 5280 tggttccaca gaagctttgt ttcttgggca agcagaaaaa ttaaattgta cctattttgt 5340 atatgtgaga tgtttaaata aattgtgaaa aaaatgaaat aaagcatgtt tggttttcca 5400 aaagaacata t 5411 gtgtcccata gtgtttccaa acttggaaag ggcgggggag ggcgggagga tgcggagggc 60 ggaggtatgc agacaacgag tcagagtttc cccttgaaag cctcaaaagt gtccacgtcc 120 tcaaaaagaa tggaaccaat ttaagaagcc agccccgtgg ccacgtccct tcccccattc 180 gctccctcct ctgcgccccc gcaggctcct cccagctgtg gctgcccggg cccccagccc 240 cagccctccc attggtggag gcccttttgg aggcacccta gggccaggga aacttttgcc 300 gtataaatag ggcagatccg ggctttatta ttttagcacc acggcagcag gaggtttcgg 360 ctaagttgga ggtactggcc acgactgcat gcccgcgccc gccaggtgat acctccgccg 420 gtgacccagg ggctctgcga cacaaggagt ctgcatgtct aagtgctaga catgctcagc 480 tttgtggata cgcggacttt gttgctgctt gcagtaacct tatgcctagc aacatgccaa 540 tctttacaag aggaaactgt aagaaagggc ccagccggag atagaggacc acgtggagaa 600 aggggtccac caggcccccc aggcagagat ggtgaagatg gtcccacagg ccctcctggt 660 ccacctggtc ctcctggccc ccctggtctc ggtgggaact ttgctgctca gtatgatgga 720 aaaggagttg gacttggccc tggaccaatg ggcttaatgg gacctagagg cccacctggt 780 gcagctggag ccccaggccc tcaaggtttc caaggacctg ctggtgagcc tggtgaacct 840 ggtcaaactg gtcctgcagg tgctcgtggt ccagctggcc ctcctggcaa ggctggtgaa 900 gatggtcacc ctggaaaacc cggacgacct ggtgagagag gagttgttgg accacagggt 960 gctcgtggtt tccctggaac tcctggactt cctggcttca aaggcattag gggacacaat 1020 ggtctggatg gattgaaggg acagcccggt gctcctggtg tgaagggtga acctggtgcc 1080 cctggtgaaa atggaactcc aggtcaaaca ggagcccgtg ggcttcctgg tgagagagga 1140 cgtgttggtg cccctggccc agctggtgcc cgtggcagtg atggaagtgt gggtcccgtg 1200 ggtcctgctg gtcccattgg gtctgctggc cctccaggct tcccaggtgc ccctggcccc 1260 aagggtgaaa ttggagctgt tggtaacgct ggtcctgctg gtcccgccgg tccccgtggt 1320 gaagtgggtc ttccaggcct ctccggcccc gttggacctc ctggtaatcc tggagcaaac 1380 ggccttactg gtgccaaggg tgctgctggc cttcccggcg ttgctggggc tcccggcctc 1440 cctggacccc gcggtattcc tggccctgtt ggtgctgccg gtgctactgg tgccagagga 1500 cttgttggtg agcctggtcc agctggctcc aaaggagaga gcggtaacaa gggtgagccc 1560 ggctctgctg ggccccaagg tcctcctggt cccagtggtg aagaaggaaa gagaggccct 1620 aatggggaag ctggatctgc cggccctcca ggacctcctg ggctgagagg tagtcctggt 1680 tctcgtggtc ttcctggagc tgatggc aga gctggcgtca tgggccctcc tggtagtcgt 1740 ggtgcaagtg gccctgctgg agtccgagga cctaatggag atgctggtcg ccctggggag 1800 cctggtctca tgggacccag aggtcttcct ggttcccctg gaaatatcgg ccccgctgga 1860 aaagaaggtc ctgtcggcct ccctggcatc gacggcaggc ctggcccaat tggcccagct 1920 ggagcaagag gagagcctgg caacattgga ttccctggac ccaaaggccc cactggtgat 1980 cctggcaaaa acggtgataa aggtcatgct ggtcttgctg gtgctcgggg tgctccaggt 2040 cctgatggaa acaatggtgc tcagggacct cctggaccac agggtgttca aggtggaaaa 2100 ggtgaacagg gtccccctgg tcctccaggc ttccagggtc tgcctggccc ctcaggtccc 2160 gctggtgaag ttggcaaacc aggagaaagg ggtctccatg gtgagtttgg tctccctggt 2220 cctgctggtc caagagggga acgcggtccc ccaggtgaga gtggtgctgc cggtcctact 2280 ggtcctattg gaagccgagg tccttctgga cccccagggc ctgatggaaa caagggtgaa 2340 cctggtgtgg ttggtgctgt gggcactgct ggtccatctg gtcctagtgg actcccagga 2400 gagaggggtg ctgctggcat acctggaggc aagggagaaa agggtgaacc tggtctcaga 2460 ggtgaaattg gtaaccctgg cagagatggt gctcgtggtg ctcctggtgc tgtaggtgcc 2520 cctggtcctg ctggagccac aggtgaccgg gg cgaagctg gggctgctgg tcctgctggt 2580 cctgctggtc ctcggggaag ccctggtgaa cgtggtgagg tcggtcctgc tggccccaat 2640 ggatttgctg gtcctgctgg tgctgctggt caacctggtg ctaaaggaga aagaggagcc 2700 aaagggccta agggtgaaaa cggtgttgtt ggtcccacag gccccgttgg agctgctggc 2760 ccagctggtc caaatggtcc ccccggtcct gctggaagtc gtggtgatgg aggcccccct 2820 ggtatgactg gtttccctgg tgctgctgga cggactggtc ccccaggacc ctctggtatt 2880 tctggccctc ctggtccccc tggtcctgct gggaaagaag ggcttcgtgg tcctcgtggt 2940 gaccaaggtc cagttggccg aactggagaa gtaggtgcag ttggtccccc tggcttcgct 3000 ggtgagaagg gtccctctgg agaggctggt actgctggac ctcctggcac tccaggtcct 3060 cagggtcttc ttggtgctcc tggtattctg ggtctccctg gctcgagagg tgaacgtggt 3120 ctaccaggtg ttgctggtgc tgtgggtgaa cctggtcctc ttggcattgc cggccctcct 3180 ggggcccgtg gtcctcctgg tgctgtgggt agtcctggag tcaacggtgc tcctggtgaa 3240 gctggtcgtg atggcaaccc tgggaacgat ggtcccccag gtcgcgatgg tcaacccgga 3300 cacaagggag agcgcggtta ccctggcaat attggtcccg ttggtgctgc aggtgcacct 3360 ggtcctcatg gccccgtggg tcctgctggc aaacatgg aa accgtggtga aactggtcct 3420 tctggtcctg ttggtcctgc tggtgctgtt ggcccaagag gtcctagtgg cccacaaggc 3480 attcgtggcg ataagggaga gcccggtgaa aaggggccca gaggtcttcc tggcttaaag 3540 ggacacaatg gattgcaagg tctgcctggt atcgctggtc accatggtga tcaaggtgct 3600 cctggctccg tgggtcctgc tggtcctagg ggccctgctg gtccttctgg ccctgctgga 3660 aaagatggtc gcactggaca tcctggtaca gttggacctg ctggcattcg aggccctcag 3720 ggtcaccaag gccctgctgg cccccctggt ccccctggcc ctcctggacc tccaggtgta 3780 agcggtggtg gttatgactt tggttacgat ggagacttct acagggctga ccagcctcgc 3840 tcagcacctt ctctcagacc caaggactat gaagttgatg ctactctgaa gtctctcaac 3900 aaccagattg agacccttct tactcctgaa ggctctagaa agaacccagc tcgcacatgc 3960 cgtgacttga gactcagcca cccagagtgg agcagtggtt actactggat tgaccctaac 4020 caaggatgca ctatggatgc tatcaaagta tactgtgatt tctctactgg cgaaacctgt 4080 atccgggccc aacctgaaaa catcccagcc aagaactggt ataggagctc caaggacaag 4140 aaacacgtct ggctaggaga aactatcaat gctggcagcc agtttgaata taatgtagaa 4200 ggagtgactt ccaaggaaat ggctacccaa cttgccttca tgc gcctgct ggccaactat 4260 gcctctcaga acatcaccta ccactgcaag aacagcattg catacatgga tgaggagact 4320 ggcaacctga aaaaggctgt cattctacag ggctctaatg atgttgaact tgttgctgag 4380 ggcaacagca ggttcactta cactgttctt gtagatggct gctctaaaaa gacaaatgaa 4440 tggggaaaga caatcattga atacaaaaca aataagccat cacgcctgcc cttccttgat 4500 attgcacctt tggacatcgg tggtgctgac caggaattct ttgtggacat tggcccagtc 4560 tgtttcaaat aaatgaactc aatctaaatt aaaaaagaaa gaaatttgaa aaaactttct 4620 ctttgccatt tcttcttctt cttttttaac tgaaagctga atccttccat ttcttctgca 4680 catctacttg cttaaattgt gggcaaaaga gaaaaagaag gattgatcag agcattgtgc 4740 aatacagttt cattaactcc ttcccccgct cccccaaaaa tttgaatttt tttttcaaca 4800 ctcttacacc tgttatggaa aatgtcaacc tttgtaagaa aaccaaaata aaaattgaaa 4860 aataaaaacc ataaacattt gcaccacttg tggcttttga atatcttcca cagagggaag 4920 tttaaaaccc aaacttccaa aggtttaaac tacctcaaaa cactttccca tgagtgtgat 4980 ccacattgtt aggtgctgac ctagacagag atgaactgag gtccttgttt tgttttgttc 5040 ataatacaaa ggtgctaatt aatagtattt cagatacttg aagaatgtt g atggtgctag 5100 aagaatttga gaagaaatac tcctgtattg agttgtatcg tgtggtgtat tttttaaaaa 5160 atttgattta gcattcatat tttccatctt attcccaatt aaaagtatgc agattatttg 5220 cccaaatctt cttcagattc agcatttgtt ctttgccagt ctcattttca tcttcttcca 5280 tggttccaca gaagctttgt ttcttgggca agcagaaaaa ttaaattgta cctattttgt 5340 atatgtgaga tgtttaaata aattgtgaaa aaaatgaaat aaagcatgtt tggttttcca 5400 aaagaacata t 5411
<210> 9 <210> 9
<211> 5490 <211> 5490
<212> DNA <212> DNA
<213> Homo sapiens <213> Homo sapiens
<400> 9 <400> 9
ggctgagttt tatgacgggc ccggtgctga agggcaggga acaacttgat ggtgctactt 60 tgaactgctt ttcttttctc ctttttgcac aaagagtctc atgtctgata tttagacatg 120 atgagctttg tgcaaaaggg gagctggcta cttctcgctc tgcttcatcc cactattatt 180 ttggcacaac aggaagctgt tgaaggagga tgttcccatc ttggtcagtc ctatgcggat 240 agagatgtct ggaagccaga accatgccaa atatgtgtct gtgactcagg atccgttctc 300 tgcgatgaca taatatgtga cgatcaagaa ttagactgcc ccaacccaga aattccattt 360 ggagaatgtt gtgcagtttg cccacagcct ccaactgctc ctactcgccc tcctaatggt 420 caaggacctc aaggccccaa gggagatcca ggccctcctg gtattcctgg gagaaatggt 480 gaccctggta ttccaggaca accagggtcc cctggttctc ctggcccccc tggaatctgt 540 gaatcatgcc ctactggtcc tcagaactat tctccccagt atgattcata tgatgtcaag 600 tctggagtag cagtaggagg actcgcaggc tatcctggac cagctggccc cccaggccct 660 cccggtcccc ctggtacatc tggtcatcct ggttcccctg gatctccagg ataccaagga 720 ccccctggtg aacctgggca agctggtcct tcaggccctc caggacctcc tggtgctata 780 ggtccatctg gtcctgctgg aaaagatgga gaatcaggta gacccggacg acctggagag 840 cgaggattgc ctggacctcc aggtatcaaa ggtccagctg ggatacctgg attccctggt 900 atgaaaggac acagaggctt cgatggacga aatggagaaa agggtgaaac aggtgctcct 960 ggattaaagg gtgaaaatgg tcttccaggc gaaaatggag ctcctggacc catgggtcca 1020 agaggggctc ctggtgagcg aggacggcca ggacttcctg gggctgcagg tgctcggggt 1080 aatgacggtg ctcgaggcag tgatggtcaa ccaggccctc ctggtcctcc tggaactgcc 1140 ggattccctg gatcccctgg tgctaagggt gaagttggac ctgcagggtc tcctggttca 1200 aatggtgccc ctggacaaag aggagaacct ggacctcagg gacacgctgg tgctcaaggt 1260 cctcctggcc ctcctgggat taatggtagt cctggtggta aaggcgaaat gggtcccgct 1320 ggcattcctg gagctcctgg actgatggga gcccggggtc ctccaggacc agccggtgct 1380 aatggtgctc ctggactgcg aggtggtgca ggtgagcctg gtaagaatgg tgccaaagga 1440 gagcccggac cacgtggtga acgcggtgag gctggtattc caggtgttcc aggagctaaa 1500 ggcgaagatg gcaaggatgg atcacctgga gaacctggtg caaatgggct tccaggagct 1560 gcaggagaaa ggggtgcccc tgggttccga ggacctgctg gaccaaatgg catcccagga 1620 gaaaagggtc ctgctggaga gcgtggtgct ccaggccctg cagggcccag aggagctgct 1680 ggagaacctg gcagagatgg cgtccctgga ggtccaggaa tgaggggcat gcccggaagt 1740 ccaggaggac caggaagtga tgggaaacca gggcctcccg gaagtcaagg agaaagtggt 1800 cgaccaggtc ctcctgggcc atctggtccc cgaggtcagc ctggtgtcat gggcttcccc 1860 ggtcctaaag gaaatgatgg tgctcctggt aagaatggag aacgaggtgg ccctggagga 1920 cctggccctc agggtcctcc tggaaagaat ggtgaaactg gacctcaggg acccccaggg 1980 cctactgggc ctggtggtga caaaggagac acaggacccc ctggtccaca aggattacaa 2040 ggcttgcctg gtacaggtgg tcctccagga gaaaatggaa aacctgggga accaggtcca 2100 aagggtgatg ccggtgcacc tggagctcca ggaggcaagg gtgatgctgg tgcccctggt 2160 gaacgtggac ctcctggatt ggcaggggcc ccaggactta gaggtggagc tggtccccct 2220 ggtcccgaag gaggaaaggg tgctgctggt cctcctgggc cacctggtgc tgctggtact 2280 cctggtctgc aaggaatgcc tggagaaaga ggaggtcttg gaagtcctgg tccaaagggt 2340 gacaagggtg aaccaggcgg tccaggtgct gatggtgtcc cagggaaaga tggcccaagg 2400 ggtcctactg gtcctattgg tcctcctggc ccagctggcc agcctggaga taagggtgaa 2460 ggtggtgccc ccggacttcc aggtatagct ggacctcgtg gtagccctgg tgagagaggt 2520 gaaactggcc ctccaggacc tgctggtttc cctggtgctc ctggacagaa tggtgaacct 2580 ggtggtaaag gagaaagagg ggctccgggt gagaaaggtg aaggaggccc tcctggagtt 2640 gcaggacccc ctggaggttc tggacctgct ggtcctcctg gtccccaagg tgtcaaaggt 2700 gaacgtggca gtcctggtgg acctggtgct gctggcttcc ctggtgctcg tggtcttcct 2760 ggtcctcctg gtagtaatgg taacccagga cccccaggtc ccagcggttc tccaggcaag 2820 gatgggcccc caggtcctgc gggtaacact ggtgctcctg gcagccctgg agtgtctgga 2880 ccaaaaggtg atgctggcca accaggagag aagggatcgc ctggtgccca gggcccacca 2940 ggagctccag gcccacttgg gattgctggg atcactggag cacggggtct tgcaggacca 3000 ccaggcatgc caggtcctag gggaagccct ggccctcagg gtgtcaaggg tgaaagtggg 3060 aaaccaggag ctaacggtct cagtggagaa cgtggtcccc ctggacccca gggtcttcct 3120 ggtctggctg gtacagctgg tgaacctgga agagatggaa accctggatc agatggtctt 3180 ccaggccgag atggatctcc tggtggcaag ggtgatcgtg gtgaaaatgg ctctcctggt 3240 gcccctggcg ctcctggtca tccaggccca cctggtcctg tcggtccagc tggaaagagt 3300 ggtgacagag gagaaagtgg ccctgctggc cctgctggtg ctcccggtcc tgctggttcc 3360 cgaggtgctc ctggtcctca aggcccacgt ggtgacaaag gtgaaacagg tgaacgtgga 3420 gctgctggca tcaaaggaca tcgaggattc cctggtaatc caggtgcccc aggttctcca 3480 ggccctgctg gtcagcaggg tgcaatcggc agtccaggac ctgcaggccc cagaggacct 3540 gttggaccca gtggacctcc tggcaaagat ggaaccagtg gacatccagg tcccattgga 3600 ccaccagggc ctcgaggtaa cagaggtgaa agaggatctg agggctcccc aggccaccca 3660 gggcaaccag gccctcctgg acctcctggt gcccctggtc cttgctgtgg tggtgttgga 3720 gccgctgcca ttgctgggat tggaggtgaa aaagctggcg gttttgcccc gtattatgga 3780 gatgaaccaa tggatttcaa aatcaacacc gatgagatta tgacttcact caagtctgtt 3840 aatggacaaa tagaaagcct cattagtcct gatggttctc gtaaaaaccc cgctagaaac 3900 tgcagagacc tgaaattctg ccatcctgaa ctcaagagtg gagaatactg ggttgaccct 3960 aaccaaggat gcaaattgga tgctatcaag gtattctgta atatggaaac tggggaaaca 4020 tgcataagtg ccaatccttt gaatgttcca cggaaacact ggtggacaga ttctagtgct 4080 gagaagaaac acgtttggtt tggagagtcc atggatggtg gttttcagtt tagctacggc 4140 aatcctgaac ttcctgaaga tgtccttgat gtgcagctgg cattccttcg acttctctcc 4200 agccgagctt cccagaacat cacatatcac tgcaaaaata gcattgcata catggatcag 4260 gccagtggaa atgtaaagaa ggccctgaag ctgatggggt caaatgaagg tgaattcaag 4320 gctgaaggaa atagcaaatt cacctacaca gttctggagg atggttgcac gaaacacact 4380 ggggaatgga gcaaaacagt ctttgaatat cgaacacgca aggctgtgag actacctatt 4440 gtagatattg caccctatga cattggtggt cctgatcaag aatttggtgt ggacgttggc 4500 cctgtttgct ttttataaac caaactctat ctgaaatccc aacaaaaaaa atttaactcc 4560 atatgtgttc ctcttgttct aatcttgtca accagtgcaa gtgaccgaca aaattccagt 4620 tatttatttc caaaatgttt ggaaacagta taatttgaca aagaaaaatg atacttctct 4680 ttttttgctg ttccaccaaa tacaattcaa atgctttttg ttttattttt ttaccaattc 4740 caatttcaaa atgtctcaat ggtgctataa taaataaact tcaacactct ttatgataac 4800 aacactgtgt tatattcttt gaatcctagc ccatctgcag agcaatgact gtgctcacca 4860 gtaaaagata acctttcttt ctgaaatagt caaatacgaa attagaaaag ccctccctat 4920 tttaactacc tcaactggtc agaaacacag attgtattct atgagtccca gaagatgaaa 4980 aaaattttat acgttgataa aacttataaa tttcattgat taatctcctg gaagattggt 5040 ttaaaaagaa aagtgtaatg caagaattta aagaaatatt tttaaagcca caattatttt 5100 aatattggat atcaactgct tgtaaaggtg ctcctctttt ttcttgtcat tgctggtcaa 5160 gattactaat atttgggaag gctttaaaga cgcatgttat ggtgctaatg tactttcact 5220 tttaaactct agatcagaat tgttgacttg cattcagaac ataaatgcac aaaatctgta 5280 catgtctccc atcagaaaga ttcattggca tgccacaggg gattctcctc cttcatcctg 5340 taaaggtcaa caataaaaac caaattatgg ggctgctttt gtcacactag catagagaat 5400 gtgttgaaat ttaactttgt aagcttgtat gtggttgttg atcttttttt tccttacaga 5460 cacccataat aaaatatcat attaaaattc 5490 ggctgagttt tatgacgggc ccggtgctga agggcaggga acaacttgat ggtgctactt 60 tgaactgctt ttcttttctc ctttttgcac aaagagtctc atgtctgata tttagacatg 120 atgagctttg tgcaaaaggg gagctggcta cttctcgctc tgcttcatcc cactattatt 180 ttggcacaac aggaagctgt tgaaggagga tgttcccatc ttggtcagtc ctatgcggat 240 agagatgtct ggaagccaga accatgccaa atatgtgtct gtgactcagg atccgttctc 300 tgcgatgaca taatatgtga cgatcaagaa ttagactgcc ccaacccaga aattccattt 360 ggagaatgtt gtgcagtttg cccacagcct ccaactgctc ctactcgccc tcctaatggt 420 caaggacctc aaggccccaa gggagatcca ggccctcctg gtattcctgg gagaaatggt 480 gaccctggta ttccaggaca accagggtcc cctggttctc ctggcccccc tggaatctgt 540 gaatcatgcc ctactggtcc tcagaactat tctccccagt atgattcata tgatgtcaag 600 tctggagtag cagtaggagg actcgcaggc tatcctggac cagctggccc cccaggccct 660 cccggtcccc ctggtacatc tggtcatcct ggttcccctg gatctccagg ataccaagga 720 ccccctggtg aacctgggca agctggtcct tcaggccctc caggacctcc tggtgctata 780 ggtccatctg gtcctgctgg aaaagatgga gaatcaggta gacccggacg acctggagag 840 cgaggattgc ctggacctcc aggtatcaaa ggtccagctg ggatacctgg attccctggt 900 atgaaaggac acagaggctt cgatggacga aatggagaaa agggtgaaac aggtgctcct 960 ggattaaagg gtgaaaatgg tcttccaggc gaaaatggag ctcctggacc catgggtcca 1020 agaggggctc ctggtgagcg aggacggcca ggacttcctg gggctgcagg tgctcggggt 1080 aatgacggtg ctcgaggcag tgatggtcaa ccaggccctc ctggtcctcc tggaactgcc 1140 ggattccctg gatcccctgg tgctaagggt gaagttggac ctgcagggtc tcctggttca 1200 aatggtgccc ctggacaaag aggagaacct ggacctcagg gacacgctgg tgctcaaggt 1260 cctcctggcc ctcctgggat taatggtagt cctggtggta aaggcgaaat gggtcccgct 1320 ggcattcctg gagctcctgg actgatggga gcccggggtc ctccaggacc agccggtgct 1380 aatggtgctc ctggactgcg aggtggtgca ggtgagcctg gtaagaatgg tgccaaagga 1440 gagcccggac cacgtggtga acgcggtgag gctggtattc caggtgttcc aggagctaaa 1500 ggcgaagatg gcaaggatgg atcacctgga gaacctggtg caaatgggct tccaggagct 1560 gcaggagaaa ggggtgcccc tgggttccga ggacctgctg gaccaaatgg catcccagga 1620 gaaaagggtc ctgctggaga gcgtggtgct ccaggccctg cagggcccag aggagctgct 1680 ggagaacctg gcagagatgg cgtccct gga ggtccaggaa tgaggggcat gcccggaagt 1740 ccaggaggac caggaagtga tgggaaacca gggcctcccg gaagtcaagg agaaagtggt 1800 cgaccaggtc ctcctgggcc atctggtccc cgaggtcagc ctggtgtcat gggcttcccc 1860 ggtcctaaag gaaatgatgg tgctcctggt aagaatggag aacgaggtgg ccctggagga 1920 cctggccctc agggtcctcc tggaaagaat ggtgaaactg gacctcaggg acccccaggg 1980 cctactgggc ctggtggtga caaaggagac acaggacccc ctggtccaca aggattacaa 2040 ggcttgcctg gtacaggtgg tcctccagga gaaaatggaa aacctgggga accaggtcca 2100 aagggtgatg ccggtgcacc tggagctcca ggaggcaagg gtgatgctgg tgcccctggt 2160 gaacgtggac ctcctggatt ggcaggggcc ccaggactta gaggtggagc tggtccccct 2220 ggtcccgaag gaggaaaggg tgctgctggt cctcctgggc cacctggtgc tgctggtact 2280 cctggtctgc aaggaatgcc tggagaaaga ggaggtcttg gaagtcctgg tccaaagggt 2340 gacaagggtg aaccaggcgg tccaggtgct gatggtgtcc cagggaaaga tggcccaagg 2400 ggtcctactg gtcctattgg tcctcctggc ccagctggcc agcctggaga taagggtgaa 2460 ggtggtgccc ccggacttcc aggtatagct ggacctcgtg gtagccctgg tgagagaggt 2520 gaaactggcc ctccaggacc tgctggtttc cc tggtgctc ctggacagaa tggtgaacct 2580 ggtggtaaag gagaaagagg ggctccgggt gagaaaggtg aaggaggccc tcctggagtt 2640 gcaggacccc ctggaggttc tggacctgct ggtcctcctg gtccccaagg tgtcaaaggt 2700 gaacgtggca gtcctggtgg acctggtgct gctggcttcc ctggtgctcg tggtcttcct 2760 ggtcctcctg gtagtaatgg taacccagga cccccaggtc ccagcggttc tccaggcaag 2820 gatgggcccc caggtcctgc gggtaacact ggtgctcctg gcagccctgg agtgtctgga 2880 ccaaaaggtg atgctggcca accaggagag aagggatcgc ctggtgccca gggcccacca 2940 ggagctccag gcccacttgg gattgctggg atcactggag cacggggtct tgcaggacca 3000 ccaggcatgc caggtcctag gggaagccct ggccctcagg gtgtcaaggg tgaaagtggg 3060 aaaccaggag ctaacggtct cagtggagaa cgtggtcccc ctggacccca gggtcttcct 3120 ggtctggctg gtacagctgg tgaacctgga agagatggaa accctggatc agatggtctt 3180 ccaggccgag atggatctcc tggtggcaag ggtgatcgtg gtgaaaatgg ctctcctggt 3240 gcccctggcg ctcctggtca tccaggccca cctggtcctg tcggtccagc tggaaagagt 3300 ggtgacagag gagaaagtgg ccctgctggc cctgctggtg ctcccggtcc tgctggttcc 3360 cgaggtgctc ctggtcctca aggcccacgt ggtgacaa ag gtgaaacagg tgaacgtgga 3420 gctgctggca tcaaaggaca tcgaggattc cctggtaatc caggtgcccc aggttctcca 3480 ggccctgctg gtcagcaggg tgcaatcggc agtccaggac ctgcaggccc cagaggacct 3540 gttggaccca gtggacctcc tggcaaagat ggaaccagtg gacatccagg tcccattgga 3600 ccaccagggc ctcgaggtaa cagaggtgaa agaggatctg agggctcccc aggccaccca 3660 gggcaaccag gccctcctgg acctcctggt gcccctggtc cttgctgtgg tggtgttgga 3720 gccgctgcca ttgctgggat tggaggtgaa aaagctggcg gttttgcccc gtattatgga 3780 gatgaaccaa tggatttcaa aatcaacacc gatgagatta tgacttcact caagtctgtt 3840 aatggacaaa tagaaagcct cattagtcct gatggttctc gtaaaaaccc cgctagaaac 3900 tgcagagacc tgaaattctg ccatcctgaa ctcaagagtg gagaatactg ggttgaccct 3960 aaccaaggat gcaaattgga tgctatcaag gtattctgta atatggaaac tggggaaaca 4020 tgcataagtg ccaatccttt gaatgttcca cggaaacact ggtggacaga ttctagtgct 4080 gagaagaaac acgtttggtt tggagagtcc atggatggtg gttttcagtt tagctacggc 4140 aatcctgaac ttcctgaaga tgtccttgat gtgcagctgg cattccttcg acttctctcc 4200 agccgagctt cccagaacat cacatatcac tgcaaaaata gca ttgcata catggatcag 4260 gccagtggaa atgtaaagaa ggccctgaag ctgatggggt caaatgaagg tgaattcaag 4320 gctgaaggaa atagcaaatt cacctacaca gttctggagg atggttgcac gaaacacact 4380 ggggaatgga gcaaaacagt ctttgaatat cgaacacgca aggctgtgag actacctatt 4440 gtagatattg caccctatga cattggtggt cctgatcaag aatttggtgt ggacgttggc 4500 cctgtttgct ttttataaac caaactctat ctgaaatccc aacaaaaaaa atttaactcc 4560 atatgtgttc ctcttgttct aatcttgtca accagtgcaa gtgaccgaca aaattccagt 4620 tatttatttc caaaatgttt ggaaacagta taatttgaca aagaaaaatg atacttctct 4680 ttttttgctg ttccaccaaa tacaattcaa atgctttttg ttttattttt ttaccaattc 4740 caatttcaaa atgtctcaat ggtgctataa taaataaact tcaacactct ttatgataac 4800 aacactgtgt tatattcttt gaatcctagc ccatctgcag agcaatgact gtgctcacca 4860 gtaaaagata acctttcttt ctgaaatagt caaatacgaa attagaaaag ccctccctat 4920 tttaactacc tcaactggtc agaaacacag attgtattct atgagtccca gaagatgaaa 4980 aaaattttat acgttgataa aacttataaa tttcattgat taatctcctg gaagattggt 5040 ttaaaaagaa aagtgtaatg caagaattta aagaaatatt tttaaagcc a caattatttt 5100 aatattggat atcaactgct tgtaaaggtg ctcctctttt ttcttgtcat tgctggtcaa 5160 gattactaat atttgggaag gctttaaaga cgcatgttat ggtgctaatg tactttcact 5220 tttaaactct agatcagaat tgttgacttg cattcagaac ataaatgcac aaaatctgta 5280 catgtctccc atcagaaaga ttcattggca tgccacaggg gattctcctc cttcatcctg 5340 taaaggtcaa caataaaaac caaattatgg ggctgctttt gtcacactag catagagaat 5400 gtgttgaaat ttaactttgt aagcttgtat gtggttgttg atcttttttt tccttacaga 5460 cacccataat aaaatatcat attaaaattc 5490
<210> 10 <210> 10
<211> 2269 <211> 2269
<212> DNA <212> DNA
<213> Homo sapiens <213> Homo sapiens
<400> 10 <400> 10
cgcagccccg gttcctgccc gcacctctcc ctccacacct ccccgcaagc tgagggagcc 60 ggctccggcc tcggccagcc caggaaggcg ctcccacagc gcagtggtgg gctgaagggc 120 tcctcaagtg ccgccaaagt gggagcccag gcagaggagg cgccgagagc gagggagggc 180 tgtgaggact gccagcacgc tgtcacctct caatagcagc ccaaacagat taagacatgg 240 gaggtgaaag acaacttgag tggttaaatt actgtcatgc aaagcgacta gatggttcag 300 ctgattgcac ctttagaagt tatgtggaac gaggcagcag atcttaagcc ccttgctctg 360 tcacgcaggc tggaatgcag tggtggaatc atggctcact acagccctga cctcctgggc 420 ccagagatgg agtctcgcta ttttgcccag gttggtcttg aacacctggc ttcaagcagt 480 cctcctgctt ttggcttctt gaagtgcttg gattacagta tttcagtttt atgctctgca 540 acaagtttgg ccatgttgga ggacaatcca aaggtcagca agttggctac tggcgattgg 600 atgctcactc tgaagccaaa gtctattact gtgcccgtgg aaatccccag ctcccctctg 660 gattgtcagt ggctgctatg cagcaggtgc agcctggtct ctcactgagt ctctactcca 720 caaaggcaac gactggccaa ggcagtggct ggctctgggt tacacaagtg cagacactca 780 actaagtgag ctggaagacc caggagaagg cggaggctca ggtgcccaca tgatcagcac 840 agccagggta cctgctgaca agcctgtacg catcgccttt agcctcaatg acgcctcaga 900 tgatacaccc cctgaagact ccattccttt ggtctttcca gaattagacc agcagctaca 960 gcccctgccg ccttgtcatg actccgagga atccatggag gtgttcaaac agcactgcca 1020 aatagcagaa gaataccatg aggtcaaaaa ggaaatcacc ctgcttgagc aaaggaagaa 1080 ggagctcatt gccaagttag atcaggcaga aaaggagaag gtggatgctg ctgagctggt 1140 tcgggaattc gaggctctga cggaggagaa tcggacgttg aggttggccc agtctcaatg 1200 tgtggaacaa ctggagaaac ttcgaataca gtatcagaag aggcagggct cgtcctaact 1260 ttaaattttt cagtgtgagc atacgaggct gatgactgcc ctgtgctggc caaaagattt 1320 ttattttaaa tgaatagtga gtcagatcta ttgcttctct gtattaccca catgacaact 1380 gtctataatg agtttactgc ttgccagctt ctagcttgag agaagggata ttttaaatga 1440 gatcattaac gtgaaactat tactagtata tgtttttgga gatcagaatt cttttccaaa 1500 gatatatgtt tttttctttt ttaggaagat atgatcatgc tgtacaacag ggtagaaaat 1560 gataaaaata gactattgac tgacccagct aagaatcgtg ggctgagcag agttaaacca 1620 tgggacaaac ccataacatg ttcaccatag tttcacgtat gtgtattttt aaatttcatg 1680 cctttaatat ttcaaatatg ctcaaattta aactgtcaga aacttctgtg catgtattta 1740 tatttgccag agtataaact tttatactct gatttttatc cttcaatgat tgattatact 1800 aagaataaat ggtcacatat cctaaaagct tcttcatgaa attattagca gaaaccatgt 1860 ttgtaaccaa agcacatttg ccaatgctaa ctggctgttg taataataaa cagataaggc 1920 tgcatttgct tcatgccatg tgacctcaca gtaaacatct ctgcctttgc ctgtgtgtgt 1980 tctgggggag gggggacatg gaaaaatatt gtttggacat tacttgggtg agtgcccatg 2040 aaaacatcag tgaacttgta actattgttt tgttttggat ttaaggagat gttttagatc 2100 agtaacagct aataggaata tgcgagtaaa ttcagaattg aaacaatttc tccttgttct 2160 acctatcacc acattttctc aaattgaact ctttgttata tgtccatttc tattcatgta 2220 acttcttttt cattaaacat ggatcaaaac tgacaaaaaa aaaaaaaaa 2269 cgcagccccg gttcctgccc gcacctctcc ctccacacct ccccgcaagc tgagggagcc 60 ggctccggcc tcggccagcc caggaaggcg ctcccacagc gcagtggtgg gctgaagggc 120 tcctcaagtg ccgccaaagt gggagcccag gcagaggagg cgccgagagc gagggagggc 180 tgtgaggact gccagcacgc tgtcacctct caatagcagc ccaaacagat taagacatgg 240 gaggtgaaag acaacttgag tggttaaatt actgtcatgc aaagcgacta gatggttcag 300 ctgattgcac ctttagaagt tatgtggaac gaggcagcag atcttaagcc ccttgctctg 360 tcacgcaggc tggaatgcag tggtggaatc atggctcact acagccctga cctcctgggc 420 ccagagatgg agtctcgcta ttttgcccag gttggtcttg aacacctggc ttcaagcagt 480 cctcctgctt ttggcttctt gaagtgcttg gattacagta tttcagtttt atgctctgca 540 acaagtttgg ccatgttgga ggacaatcca aaggtcagca agttggctac tggcgattgg 600 atgctcactc tgaagccaaa gtctattact gtgcccgtgg aaatccccag ctcccctctg 660 gattgtcagt ggctgctatg cagcaggtgc agcctggtct ctcactgagt ctctactcca 720 caaaggcaac gactggccaa ggcagtggct ggctctgggt tacacaagtg cagacactca 780 actaagtgag ctggaagacc caggagaagg cggaggctca ggtgcccaca tgatcagcac 840 agccagggta cctgctgaca agcctgtacg catcgccttt agcctcaatg acgcctcaga 900 tgatacaccc cctgaagact ccattccttt ggtctttcca gaattagacc agcagctaca 960 gcccctgccg ccttgtcatg actccgagga atccatggag gtgttcaaac agcactgcca 1020 aatagcagaa gaataccatg aggtcaaaaa ggaaatcacc ctgcttgagc aaaggaagaa 1080 ggagctcatt gccaagttag atcaggcaga aaaggagaag gtggatgctg ctgagctggt 1140 tcgggaattc gaggctctga cggaggagaa tcggacgttg aggttggccc agtctcaatg 1200 tgtggaacaa ctggagaaac ttcgaataca gtatcagaag aggcagggct cgtcctaact 1260 ttaaattttt cagtgtgagc atacgaggct gatgactgcc ctgtgctggc caaaagattt 1320 ttattttaaa tgaatagtga gtcagatcta ttgcttctct gtattaccca catgacaact 1380 gtctataatg agtttactgc ttgccagctt ctagcttgag agaagggata ttttaaatga 1440 gatcattaac gtgaaactat tactagtata tgtttttgga gatcagaatt cttttccaaa 1500 gatatatgtt tttttctttt ttaggaagat atgatcatgc tgtacaacag ggtagaaaat 1560 gataaaaata gactattgac tgacccagct aagaatcgtg ggctgagcag agttaaacca 1620 tgggacaaac ccataacatg ttcaccatag tttcacgtat gtgtattttt aaatttcatg 1680 cctttaatat ttcaaatatg ctcaaat tta aactgtcaga aacttctgtg catgtattta 1740 tatttgccag agtataaact tttatactct gatttttatc cttcaatgat tgattatact 1800 aagaataaat ggtcacatat cctaaaagct tcttcatgaa attattagca gaaaccatgt 1860 ttgtaaccaa agcacatttg ccaatgctaa ctggctgttg taataataaa cagataaggc 1920 tgcatttgct tcatgccatg tgacctcaca gtaaacatct ctgcctttgc ctgtgtgtgt 1980 tctgggggag gggggacatg gaaaaatatt gtttggacat tacttgggtg agtgcccatg 2040 aaaacatcag tgaacttgta actattgttt tgttttggat ttaaggagat gttttagatc 2100 agtaacagct aataggaata tgcgagtaaa ttcagaattg aaacaatttc tccttgttct 2160 acctatcacc acattttctc aaattgaact ctttgttata tgtccatttc tattcatgta 2220 acttcttttt cattaaacat ggatcaaaac tgacaaaaaa aaaaaaaaa 2269
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
IT102019000000130A IT201900000130A1 (en) | 2019-01-07 | 2019-01-07 | METHOD FOR IDENTIFYING SUBJECTS RESISTANT TO TREATMENT WITH PLATINUM-BASED DRUGS AND RELATIVE KIT |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
IT102019000000130A IT201900000130A1 (en) | 2019-01-07 | 2019-01-07 | METHOD FOR IDENTIFYING SUBJECTS RESISTANT TO TREATMENT WITH PLATINUM-BASED DRUGS AND RELATIVE KIT |
Publications (1)
Publication Number | Publication Date |
---|---|
IT201900000130A1 true IT201900000130A1 (en) | 2020-07-07 |
Family
ID=66286574
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
IT102019000000130A IT201900000130A1 (en) | 2019-01-07 | 2019-01-07 | METHOD FOR IDENTIFYING SUBJECTS RESISTANT TO TREATMENT WITH PLATINUM-BASED DRUGS AND RELATIVE KIT |
Country Status (1)
Country | Link |
---|---|
IT (1) | IT201900000130A1 (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007038792A2 (en) * | 2005-09-28 | 2007-04-05 | H. Lee Moffitt Cancer Center | Individualized cancer treatments |
WO2017158358A1 (en) * | 2016-03-15 | 2017-09-21 | Almac Diagnostics Limited | Gene signatures for cancer detection and treatment |
-
2019
- 2019-01-07 IT IT102019000000130A patent/IT201900000130A1/en unknown
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007038792A2 (en) * | 2005-09-28 | 2007-04-05 | H. Lee Moffitt Cancer Center | Individualized cancer treatments |
WO2017158358A1 (en) * | 2016-03-15 | 2017-09-21 | Almac Diagnostics Limited | Gene signatures for cancer detection and treatment |
Non-Patent Citations (11)
Title |
---|
"Antibodies, A Laboratory Manual", 2014, COLD SPRING HARBOR LABORATORY PRESS |
AUSUBEL ET AL.: "Current Protocols of Molecular Biology", 1997, JOHN WILEY AND SONS |
DE ANDRES ET AL., BIOTECHNIQUES, vol. 6, 1995, pages 42044 |
HAMMERLING ET AL., MONOCLONAL ANTIBODIES AND T-CELL HYBRIDOMAS, 1981 |
IRAIA GARCÍA-SANTISTEBAN ET AL: "USP1 deubiquitinase: cellular functions, regulatory mechanisms and emerging potential as target in cancer therapy", MOLECULAR CANCER, vol. 12, 10 August 2013 (2013-08-10), pages 91 - 102, XP055590499 * |
KENNET ET AL., MONOCLONAL ANTIBODIES, 1980 |
KREUZINGER ET AL., CANCER LETTERS, vol. 362, no. 2, 1 July 2015 (2015-07-01), pages 218 - 228 |
M. SCHREIER ET AL., HYBRIDOMA TECHNIQUES, 1980 |
MARLIES LEENAARSCOENRAAD F. M. HENDRIKSEN: "remote thermostats steps in the Production of Polyclonal and Monoclonal Antibodies: Evaluation and Recommendations", ILAR JOURNAL, vol. 46, no. 3, 1 January 2005 (2005-01-01), pages 269 - 279 |
RUPPLOCKER, LAB INVEST., vol. 56, 1987, pages A67 |
SAMBROOKRUSSEL, MOLECULAR CLONING - A LABORATORY MANUAL |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102754524B1 (en) | Diagnosis and Treatment Methods for Breast Cancer Treatment | |
DK2456889T3 (en) | Markers of endometrial cancer | |
AU2012345789B2 (en) | Methods of treating breast cancer with taxane therapy | |
CN113383085A (en) | Method and system for analyzing nucleic acid molecules | |
BRPI0616090A2 (en) | methods and materials for identifying the origin of a carcinoma of unknown primary origin | |
IL205635A (en) | Method for measuring resistance or sensitivity to docetaxel | |
US11118232B2 (en) | Methods of detecting DDR2 mutations | |
EP3546591A1 (en) | Method for predicting sensitivity of cancer to treatment with pd-1 immune checkpoint inhibitor | |
US10519507B2 (en) | Method for detecting T-cell lymphoma | |
US20080182250A1 (en) | Method for detecting multiple myeloma and method for inhibiting the same | |
KR20240005018A (en) | Methods and systems for analyzing nucleic acid molecules | |
KR20060048684A (en) | How to diagnose and treat cancer | |
AU2018304242B2 (en) | Methods for detection of plasma cell dyscrasia | |
IT201900000130A1 (en) | METHOD FOR IDENTIFYING SUBJECTS RESISTANT TO TREATMENT WITH PLATINUM-BASED DRUGS AND RELATIVE KIT | |
CN101784674A (en) | Predictive markers for EGFR inhibitor therapy | |
KR102202120B1 (en) | Use of Ube2h for Diagnosis or Treatment of Alzheimer's Disease | |
HK40060652A (en) | Methods and systems for analyzing nucleic acid molecules | |
KR20230045895A (en) | Novel biomarker for predicting prognosis of peripheral T-cell lymphoma and uses thereof | |
CN117580963A (en) | Methods and systems for analyzing nucleic acid molecules |