CN114729320B - 用于将细胞重新程序化为能够呈递抗原的树突细胞2型的组合物、方法及其用途 - Google Patents
用于将细胞重新程序化为能够呈递抗原的树突细胞2型的组合物、方法及其用途 Download PDFInfo
- Publication number
- CN114729320B CN114729320B CN202080078541.XA CN202080078541A CN114729320B CN 114729320 B CN114729320 B CN 114729320B CN 202080078541 A CN202080078541 A CN 202080078541A CN 114729320 B CN114729320 B CN 114729320B
- Authority
- CN
- China
- Prior art keywords
- seq
- cells
- pro
- leu
- irf4
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 210000004027 cell Anatomy 0.000 title claims abstract description 319
- 238000000034 method Methods 0.000 title claims abstract description 152
- 239000000203 mixture Substances 0.000 title claims abstract description 142
- 230000008672 reprogramming Effects 0.000 title claims abstract description 60
- 210000004443 dendritic cell Anatomy 0.000 title claims abstract description 45
- 239000000427 antigen Substances 0.000 title abstract description 38
- 102000036639 antigens Human genes 0.000 title abstract description 35
- 108091007433 antigens Proteins 0.000 title abstract description 35
- 108091023040 Transcription factor Proteins 0.000 claims abstract description 122
- 102000040945 Transcription factor Human genes 0.000 claims abstract description 121
- 210000001778 pluripotent stem cell Anatomy 0.000 claims abstract description 32
- 210000002894 multi-fate stem cell Anatomy 0.000 claims abstract 4
- 102100027654 Transcription factor PU.1 Human genes 0.000 claims description 183
- 108010008929 proto-oncogene protein Spi-1 Proteins 0.000 claims description 182
- 102100030126 Interferon regulatory factor 4 Human genes 0.000 claims description 178
- 239000013598 vector Substances 0.000 claims description 116
- 101000596925 Homo sapiens Homeobox protein TGIF1 Proteins 0.000 claims description 72
- 241000282414 Homo sapiens Species 0.000 claims description 69
- 101001000773 Homo sapiens POU domain, class 2, transcription factor 2 Proteins 0.000 claims description 69
- 102100035081 Homeobox protein TGIF1 Human genes 0.000 claims description 68
- 102100035591 POU domain, class 2, transcription factor 2 Human genes 0.000 claims description 65
- 150000007523 nucleic acids Chemical group 0.000 claims description 60
- 108020004999 messenger RNA Proteins 0.000 claims description 53
- 206010028980 Neoplasm Diseases 0.000 claims description 49
- 230000001939 inductive effect Effects 0.000 claims description 48
- 101000584743 Homo sapiens Recombining binding protein suppressor of hairless Proteins 0.000 claims description 41
- 102100030000 Recombining binding protein suppressor of hairless Human genes 0.000 claims description 38
- 201000011510 cancer Diseases 0.000 claims description 33
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 27
- 210000000130 stem cell Anatomy 0.000 claims description 26
- 210000002950 fibroblast Anatomy 0.000 claims description 22
- 238000011282 treatment Methods 0.000 claims description 21
- 102000004127 Cytokines Human genes 0.000 claims description 19
- 108090000695 Cytokines Proteins 0.000 claims description 19
- 101001011441 Homo sapiens Interferon regulatory factor 4 Proteins 0.000 claims description 17
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 17
- 208000015181 infectious disease Diseases 0.000 claims description 17
- 239000013603 viral vector Substances 0.000 claims description 16
- 210000003958 hematopoietic stem cell Anatomy 0.000 claims description 15
- 210000000612 antigen-presenting cell Anatomy 0.000 claims description 14
- 208000023275 Autoimmune disease Diseases 0.000 claims description 12
- 238000012258 culturing Methods 0.000 claims description 11
- 102000040430 polynucleotide Human genes 0.000 claims description 11
- 108091033319 polynucleotide Proteins 0.000 claims description 11
- 239000002157 polynucleotide Substances 0.000 claims description 11
- 230000001177 retroviral effect Effects 0.000 claims description 11
- 239000003795 chemical substances by application Substances 0.000 claims description 9
- 239000003814 drug Substances 0.000 claims description 9
- 210000004881 tumor cell Anatomy 0.000 claims description 9
- 210000005260 human cell Anatomy 0.000 claims description 8
- 238000000338 in vitro Methods 0.000 claims description 8
- 208000035473 Communicable disease Diseases 0.000 claims description 7
- 108010017213 Granulocyte-Macrophage Colony-Stimulating Factor Proteins 0.000 claims description 7
- 230000003115 biocidal effect Effects 0.000 claims description 7
- 239000006143 cell culture medium Substances 0.000 claims description 7
- 238000002347 injection Methods 0.000 claims description 7
- 239000007924 injection Substances 0.000 claims description 7
- 230000002463 transducing effect Effects 0.000 claims description 7
- 241000710929 Alphavirus Species 0.000 claims description 6
- 241000710831 Flavivirus Species 0.000 claims description 5
- 108090000978 Interleukin-4 Proteins 0.000 claims description 5
- 239000003242 anti bacterial agent Substances 0.000 claims description 5
- 230000012010 growth Effects 0.000 claims description 5
- 239000007972 injectable composition Substances 0.000 claims description 5
- 238000004519 manufacturing process Methods 0.000 claims description 5
- 102100037850 Interferon gamma Human genes 0.000 claims description 4
- 108010074328 Interferon-gamma Proteins 0.000 claims description 4
- 102000013462 Interleukin-12 Human genes 0.000 claims description 4
- 108010065805 Interleukin-12 Proteins 0.000 claims description 4
- 230000003308 immunostimulating effect Effects 0.000 claims description 4
- 238000011065 in-situ storage Methods 0.000 claims description 4
- 229960005486 vaccine Drugs 0.000 claims description 4
- 210000004700 fetal blood Anatomy 0.000 claims description 3
- 210000004962 mammalian cell Anatomy 0.000 claims description 3
- 230000008685 targeting Effects 0.000 claims description 3
- 239000000730 antalgic agent Substances 0.000 claims description 2
- 239000002260 anti-inflammatory agent Substances 0.000 claims description 2
- 229940121363 anti-inflammatory agent Drugs 0.000 claims description 2
- 239000002246 antineoplastic agent Substances 0.000 claims description 2
- 239000003443 antiviral agent Substances 0.000 claims description 2
- 229940127089 cytotoxic agent Drugs 0.000 claims description 2
- 239000007884 disintegrant Substances 0.000 claims description 2
- 239000002934 diuretic Substances 0.000 claims description 2
- 239000000945 filler Substances 0.000 claims description 2
- 210000004966 intestinal stem cell Anatomy 0.000 claims description 2
- 239000000314 lubricant Substances 0.000 claims description 2
- 210000002901 mesenchymal stem cell Anatomy 0.000 claims description 2
- 239000000546 pharmaceutical excipient Substances 0.000 claims description 2
- 230000003439 radiotherapeutic effect Effects 0.000 claims description 2
- 108010009975 Positive Regulatory Domain I-Binding Factor 1 Proteins 0.000 claims 8
- 102000009844 Positive Regulatory Domain I-Binding Factor 1 Human genes 0.000 claims 8
- 241000175212 Herpesvirales Species 0.000 claims 2
- 102000004457 Granulocyte-Macrophage Colony-Stimulating Factor Human genes 0.000 claims 1
- 239000000853 adhesive Substances 0.000 claims 1
- 230000001070 adhesive effect Effects 0.000 claims 1
- 230000001882 diuretic effect Effects 0.000 claims 1
- 238000011161 development Methods 0.000 abstract description 10
- 230000018109 developmental process Effects 0.000 abstract description 10
- 210000003700 cd11b-positive dendritic cell Anatomy 0.000 abstract description 8
- 210000003068 cdc Anatomy 0.000 abstract description 2
- 108010051920 interferon regulatory factor-4 Proteins 0.000 description 167
- 101000687344 Homo sapiens PR domain zinc finger protein 1 Proteins 0.000 description 100
- 102100024894 PR domain zinc finger protein 1 Human genes 0.000 description 97
- 230000014509 gene expression Effects 0.000 description 82
- 102000004344 Interferon regulatory factor 2 Human genes 0.000 description 78
- 108090000908 Interferon regulatory factor 2 Proteins 0.000 description 78
- 241000699660 Mus musculus Species 0.000 description 47
- 108020004414 DNA Proteins 0.000 description 37
- 102000039446 nucleic acids Human genes 0.000 description 37
- 108020004707 nucleic acids Proteins 0.000 description 37
- 210000001744 T-lymphocyte Anatomy 0.000 description 34
- 108090000623 proteins and genes Proteins 0.000 description 27
- 101001046686 Homo sapiens Integrin alpha-M Proteins 0.000 description 26
- 102100022338 Integrin alpha-M Human genes 0.000 description 26
- 101150035137 Clec9a gene Proteins 0.000 description 24
- -1 Sirpα Proteins 0.000 description 24
- 230000004913 activation Effects 0.000 description 24
- 241000699666 Mus <mouse, genus> Species 0.000 description 23
- 230000003394 haemopoietic effect Effects 0.000 description 23
- 101000716102 Homo sapiens T-cell surface glycoprotein CD4 Proteins 0.000 description 20
- 125000003729 nucleotide group Chemical group 0.000 description 20
- 108010054624 red fluorescent protein Proteins 0.000 description 20
- 230000006698 induction Effects 0.000 description 19
- 101150009018 SPI-1 gene Proteins 0.000 description 18
- 241000700605 Viruses Species 0.000 description 18
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 18
- 239000003550 marker Substances 0.000 description 18
- 210000001082 somatic cell Anatomy 0.000 description 18
- 238000010361 transduction Methods 0.000 description 17
- 230000026683 transduction Effects 0.000 description 17
- 108020004705 Codon Proteins 0.000 description 16
- 239000002773 nucleotide Substances 0.000 description 16
- 206010020751 Hypersensitivity Diseases 0.000 description 15
- 230000028993 immune response Effects 0.000 description 15
- 238000009169 immunotherapy Methods 0.000 description 15
- 108010057821 leucylproline Proteins 0.000 description 15
- 238000011002 quantification Methods 0.000 description 15
- 241000894007 species Species 0.000 description 15
- 102100023013 Basic leucine zipper transcriptional factor ATF-like 3 Human genes 0.000 description 14
- 101000903609 Homo sapiens Basic leucine zipper transcriptional factor ATF-like 3 Proteins 0.000 description 14
- 101001032345 Homo sapiens Interferon regulatory factor 8 Proteins 0.000 description 14
- 102100038069 Interferon regulatory factor 8 Human genes 0.000 description 14
- 108010087924 alanylproline Proteins 0.000 description 14
- 210000001185 bone marrow Anatomy 0.000 description 14
- 244000052769 pathogen Species 0.000 description 14
- 108010026333 seryl-proline Proteins 0.000 description 14
- 238000002560 therapeutic procedure Methods 0.000 description 14
- 108010061238 threonyl-glycine Proteins 0.000 description 14
- 108010009298 lysylglutamic acid Proteins 0.000 description 13
- 108090000765 processed proteins & peptides Proteins 0.000 description 13
- 102000004169 proteins and genes Human genes 0.000 description 13
- 230000004044 response Effects 0.000 description 13
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 12
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 12
- 102000000852 Tumor Necrosis Factor-alpha Human genes 0.000 description 12
- 108010093581 aspartyl-proline Proteins 0.000 description 12
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 12
- 239000012091 fetal bovine serum Substances 0.000 description 12
- 108010050848 glycylleucine Proteins 0.000 description 12
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 11
- 239000013604 expression vector Substances 0.000 description 11
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 11
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 11
- 108010025306 histidylleucine Proteins 0.000 description 11
- 108010034529 leucyl-lysine Proteins 0.000 description 11
- 102000004196 processed proteins & peptides Human genes 0.000 description 11
- 210000001519 tissue Anatomy 0.000 description 11
- 241000701161 unidentified adenovirus Species 0.000 description 11
- 241000699670 Mus sp. Species 0.000 description 10
- 108010005233 alanylglutamic acid Proteins 0.000 description 10
- 150000001413 amino acids Chemical group 0.000 description 10
- 108010092854 aspartyllysine Proteins 0.000 description 10
- 108010015792 glycyllysine Proteins 0.000 description 10
- 108010018006 histidylserine Proteins 0.000 description 10
- 229920001184 polypeptide Polymers 0.000 description 10
- 101000738771 Homo sapiens Receptor-type tyrosine-protein phosphatase C Proteins 0.000 description 9
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 9
- 102100037422 Receptor-type tyrosine-protein phosphatase C Human genes 0.000 description 9
- 230000030741 antigen processing and presentation Effects 0.000 description 9
- 108010060035 arginylproline Proteins 0.000 description 9
- 108010077245 asparaginyl-proline Proteins 0.000 description 9
- 238000000684 flow cytometry Methods 0.000 description 9
- 230000006870 function Effects 0.000 description 9
- 239000001963 growth medium Substances 0.000 description 9
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 9
- 230000000770 proinflammatory effect Effects 0.000 description 9
- 108010077112 prolyl-proline Proteins 0.000 description 9
- 230000003612 virological effect Effects 0.000 description 9
- 108010065920 Insulin Lispro Proteins 0.000 description 8
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 8
- 241000880493 Leptailurus serval Species 0.000 description 8
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 8
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 8
- 102000001708 Protein Isoforms Human genes 0.000 description 8
- 108010029485 Protein Isoforms Proteins 0.000 description 8
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 8
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 8
- 108010008355 arginyl-glutamine Proteins 0.000 description 8
- 108010085325 histidylproline Proteins 0.000 description 8
- 108010053725 prolylvaline Proteins 0.000 description 8
- 238000012216 screening Methods 0.000 description 8
- 230000028327 secretion Effects 0.000 description 8
- 101100028791 Caenorhabditis elegans pbs-5 gene Proteins 0.000 description 7
- 102000014914 Carrier Proteins Human genes 0.000 description 7
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 7
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 7
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 7
- 108010044940 alanylglutamine Proteins 0.000 description 7
- 108010047857 aspartylglycine Proteins 0.000 description 7
- 210000003719 b-lymphocyte Anatomy 0.000 description 7
- 108091008324 binding proteins Proteins 0.000 description 7
- 230000001413 cellular effect Effects 0.000 description 7
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 7
- 201000010099 disease Diseases 0.000 description 7
- 230000000694 effects Effects 0.000 description 7
- 239000002158 endotoxin Substances 0.000 description 7
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 7
- 230000002458 infectious effect Effects 0.000 description 7
- 229920006008 lipopolysaccharide Polymers 0.000 description 7
- 210000004698 lymphocyte Anatomy 0.000 description 7
- 230000004048 modification Effects 0.000 description 7
- 238000012986 modification Methods 0.000 description 7
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 7
- 108010029020 prolylglycine Proteins 0.000 description 7
- 230000000638 stimulation Effects 0.000 description 7
- 102100033215 DNA nucleotidylexotransferase Human genes 0.000 description 6
- 102100039620 Granulocyte-macrophage colony-stimulating factor Human genes 0.000 description 6
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 6
- 108090000143 Mouse Proteins Proteins 0.000 description 6
- 101100273832 Mus musculus Cds1 gene Proteins 0.000 description 6
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 6
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 6
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 6
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 6
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 6
- 208000026935 allergic disease Diseases 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 6
- 239000003153 chemical reaction reagent Substances 0.000 description 6
- 238000003745 diagnosis Methods 0.000 description 6
- 108010078144 glutaminyl-glycine Proteins 0.000 description 6
- 108010049041 glutamylalanine Proteins 0.000 description 6
- 108010092114 histidylphenylalanine Proteins 0.000 description 6
- 238000001727 in vivo Methods 0.000 description 6
- 108010064235 lysylglycine Proteins 0.000 description 6
- 210000002540 macrophage Anatomy 0.000 description 6
- 239000002609 medium Substances 0.000 description 6
- 210000001616 monocyte Anatomy 0.000 description 6
- 239000000047 product Substances 0.000 description 6
- 230000035755 proliferation Effects 0.000 description 6
- 108010090894 prolylleucine Proteins 0.000 description 6
- 229950010550 resiquimod Drugs 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- 230000020382 suppression by virus of host antigen processing and presentation of peptide antigen via MHC class I Effects 0.000 description 6
- 241001430294 unidentified retrovirus Species 0.000 description 6
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 5
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 5
- MZOFCQQQCNRIBI-VMXHOPILSA-N (3s)-4-[[(2s)-1-[[(2s)-1-[[(1s)-1-carboxy-2-hydroxyethyl]amino]-4-methyl-1-oxopentan-2-yl]amino]-5-(diaminomethylideneamino)-1-oxopentan-2-yl]amino]-3-[[2-[[(2s)-2,6-diaminohexanoyl]amino]acetyl]amino]-4-oxobutanoic acid Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN MZOFCQQQCNRIBI-VMXHOPILSA-N 0.000 description 5
- 229940046168 CpG oligodeoxynucleotide Drugs 0.000 description 5
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 5
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 5
- 102000018713 Histocompatibility Antigens Class II Human genes 0.000 description 5
- 108010027412 Histocompatibility Antigens Class II Proteins 0.000 description 5
- 108060003951 Immunoglobulin Proteins 0.000 description 5
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 5
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 5
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 5
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 5
- 108091036414 Polyinosinic:polycytidylic acid Proteins 0.000 description 5
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 5
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 5
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 5
- 230000007815 allergy Effects 0.000 description 5
- 108010068380 arginylarginine Proteins 0.000 description 5
- 238000003556 assay Methods 0.000 description 5
- 239000000872 buffer Substances 0.000 description 5
- 108010060199 cysteinylproline Proteins 0.000 description 5
- 210000004544 dc2 Anatomy 0.000 description 5
- 230000001419 dependent effect Effects 0.000 description 5
- 230000004069 differentiation Effects 0.000 description 5
- 210000003743 erythrocyte Anatomy 0.000 description 5
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 5
- 230000009610 hypersensitivity Effects 0.000 description 5
- 102000018358 immunoglobulin Human genes 0.000 description 5
- 108010054155 lysyllysine Proteins 0.000 description 5
- 239000002777 nucleoside Substances 0.000 description 5
- 150000003833 nucleoside derivatives Chemical class 0.000 description 5
- 230000002018 overexpression Effects 0.000 description 5
- 244000045947 parasite Species 0.000 description 5
- 229940115272 polyinosinic:polycytidylic acid Drugs 0.000 description 5
- 239000002243 precursor Substances 0.000 description 5
- 108010031719 prolyl-serine Proteins 0.000 description 5
- 108010079317 prolyl-tyrosine Proteins 0.000 description 5
- 230000010076 replication Effects 0.000 description 5
- BXNMTOQRYBFHNZ-UHFFFAOYSA-N resiquimod Chemical compound C1=CC=CC2=C(N(C(COCC)=N3)CC(C)(C)O)C3=C(N)N=C21 BXNMTOQRYBFHNZ-UHFFFAOYSA-N 0.000 description 5
- 238000001890 transfection Methods 0.000 description 5
- 108010020532 tyrosyl-proline Proteins 0.000 description 5
- 102100027824 3'(2'),5'-bisphosphate nucleotidase 1 Human genes 0.000 description 4
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 4
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 4
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 4
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 4
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 4
- 241000233866 Fungi Species 0.000 description 4
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 4
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 4
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 4
- CQIIXEHDSZUSAG-QWRGUYRKSA-N Gly-His-His Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 CQIIXEHDSZUSAG-QWRGUYRKSA-N 0.000 description 4
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 4
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 4
- 108010017080 Granulocyte Colony-Stimulating Factor Proteins 0.000 description 4
- 102000004269 Granulocyte Colony-Stimulating Factor Human genes 0.000 description 4
- 108010061414 Hepatocyte Nuclear Factor 1-beta Proteins 0.000 description 4
- 102100022123 Hepatocyte nuclear factor 1-beta Human genes 0.000 description 4
- 241000282412 Homo Species 0.000 description 4
- AKQFLPNANHNTLP-VKOGCVSHSA-N Ile-Pro-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N AKQFLPNANHNTLP-VKOGCVSHSA-N 0.000 description 4
- 206010061598 Immunodeficiency Diseases 0.000 description 4
- 208000029462 Immunodeficiency disease Diseases 0.000 description 4
- 102100022297 Integrin alpha-X Human genes 0.000 description 4
- 108010014726 Interferon Type I Proteins 0.000 description 4
- 102000002227 Interferon Type I Human genes 0.000 description 4
- 108090000174 Interleukin-10 Proteins 0.000 description 4
- 102000004388 Interleukin-4 Human genes 0.000 description 4
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 4
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 4
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 4
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 4
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 4
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- 108700020796 Oncogene Proteins 0.000 description 4
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 4
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 4
- RPLMFKUKFZOTER-AVGNSLFASA-N Pro-Met-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 RPLMFKUKFZOTER-AVGNSLFASA-N 0.000 description 4
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 4
- SVXXJYJCRNKDDE-AVGNSLFASA-N Pro-Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CN=CN1 SVXXJYJCRNKDDE-AVGNSLFASA-N 0.000 description 4
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 4
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 4
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 4
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 4
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 4
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 4
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 4
- 241000713880 Spleen focus-forming virus Species 0.000 description 4
- 230000005867 T cell response Effects 0.000 description 4
- LZRWTJSPTJSWDN-FKBYEOEOSA-N Val-Trp-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N LZRWTJSPTJSWDN-FKBYEOEOSA-N 0.000 description 4
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 4
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 4
- 208000036142 Viral infection Diseases 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- 230000003110 anti-inflammatory effect Effects 0.000 description 4
- 108010013835 arginine glutamate Proteins 0.000 description 4
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 4
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 4
- 230000008901 benefit Effects 0.000 description 4
- 230000027455 binding Effects 0.000 description 4
- 210000004369 blood Anatomy 0.000 description 4
- 239000008280 blood Substances 0.000 description 4
- 230000001684 chronic effect Effects 0.000 description 4
- 150000001875 compounds Chemical class 0.000 description 4
- 108010016616 cysteinylglycine Proteins 0.000 description 4
- 230000002950 deficient Effects 0.000 description 4
- 229960003722 doxycycline Drugs 0.000 description 4
- XQTWDDCIUJNLTR-CVHRZJFOSA-N doxycycline monohydrate Chemical compound O.O=C1C2=C(O)C=CC=C2[C@H](C)[C@@H]2C1=C(O)[C@]1(O)C(=O)C(C(N)=O)=C(O)[C@@H](N(C)C)[C@@H]1[C@H]2O XQTWDDCIUJNLTR-CVHRZJFOSA-N 0.000 description 4
- 238000001415 gene therapy Methods 0.000 description 4
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 4
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 4
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 4
- 108010010147 glycylglutamine Proteins 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- 108010037850 glycylvaline Proteins 0.000 description 4
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 4
- 108010036413 histidylglycine Proteins 0.000 description 4
- 230000036039 immunity Effects 0.000 description 4
- 230000007813 immunodeficiency Effects 0.000 description 4
- 210000004263 induced pluripotent stem cell Anatomy 0.000 description 4
- 239000000411 inducer Substances 0.000 description 4
- 230000002757 inflammatory effect Effects 0.000 description 4
- 230000010354 integration Effects 0.000 description 4
- 108010027338 isoleucylcysteine Proteins 0.000 description 4
- 210000004185 liver Anatomy 0.000 description 4
- 210000005210 lymphoid organ Anatomy 0.000 description 4
- 210000003563 lymphoid tissue Anatomy 0.000 description 4
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 4
- 108010038320 lysylphenylalanine Proteins 0.000 description 4
- 108010017391 lysylvaline Proteins 0.000 description 4
- 230000007246 mechanism Effects 0.000 description 4
- 230000001404 mediated effect Effects 0.000 description 4
- 108010005942 methionylglycine Proteins 0.000 description 4
- 125000003835 nucleoside group Chemical group 0.000 description 4
- 238000004806 packaging method and process Methods 0.000 description 4
- 230000001717 pathogenic effect Effects 0.000 description 4
- 210000005259 peripheral blood Anatomy 0.000 description 4
- 239000011886 peripheral blood Substances 0.000 description 4
- 108010051242 phenylalanylserine Proteins 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 4
- 108010070643 prolylglutamic acid Proteins 0.000 description 4
- 230000001105 regulatory effect Effects 0.000 description 4
- 108010071207 serylmethionine Proteins 0.000 description 4
- 239000007787 solid Substances 0.000 description 4
- 210000000952 spleen Anatomy 0.000 description 4
- 239000000725 suspension Substances 0.000 description 4
- 230000003614 tolerogenic effect Effects 0.000 description 4
- 241001529453 unidentified herpesvirus Species 0.000 description 4
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 4
- 108010073969 valyllysine Proteins 0.000 description 4
- 230000009385 viral infection Effects 0.000 description 4
- LRSASMSXMSNRBT-UHFFFAOYSA-N 5-methylcytosine Chemical compound CC1=CNC(=O)N=C1N LRSASMSXMSNRBT-UHFFFAOYSA-N 0.000 description 3
- ATAKEVCGTRZKLI-UWJYBYFXSA-N Ala-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 ATAKEVCGTRZKLI-UWJYBYFXSA-N 0.000 description 3
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 3
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 3
- XAXHGSOBFPIRFG-LSJOCFKGSA-N Ala-Pro-His Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XAXHGSOBFPIRFG-LSJOCFKGSA-N 0.000 description 3
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 3
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 3
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 3
- KIJLEFNHWSXHRU-NUMRIWBASA-N Asp-Gln-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KIJLEFNHWSXHRU-NUMRIWBASA-N 0.000 description 3
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 3
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 3
- 101150014003 Batf3 gene Proteins 0.000 description 3
- 102100039521 C-type lectin domain family 9 member A Human genes 0.000 description 3
- 241000991587 Enterovirus C Species 0.000 description 3
- 241000710198 Foot-and-mouth disease virus Species 0.000 description 3
- 108010010803 Gelatin Proteins 0.000 description 3
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 3
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 3
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 3
- KEBACWCLVOXFNC-DCAQKATOSA-N Glu-Arg-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KEBACWCLVOXFNC-DCAQKATOSA-N 0.000 description 3
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 3
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 3
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 3
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 3
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 3
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 3
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 3
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 3
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 3
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 3
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 3
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 3
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 3
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 3
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 3
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 3
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 3
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 3
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 3
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 3
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 3
- 241000711549 Hepacivirus C Species 0.000 description 3
- 241000700721 Hepatitis B virus Species 0.000 description 3
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 3
- 101000888548 Homo sapiens C-type lectin domain family 9 member A Proteins 0.000 description 3
- 101001011393 Homo sapiens Interferon regulatory factor 2 Proteins 0.000 description 3
- 101000651211 Homo sapiens Transcription factor PU.1 Proteins 0.000 description 3
- 108090000144 Human Proteins Proteins 0.000 description 3
- 102000003839 Human Proteins Human genes 0.000 description 3
- 241000725303 Human immunodeficiency virus Species 0.000 description 3
- 102000043138 IRF family Human genes 0.000 description 3
- 108091054729 IRF family Proteins 0.000 description 3
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 3
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 3
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 3
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 3
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 3
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 3
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 3
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 3
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 3
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 3
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 3
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 3
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 3
- IRRZDAIFYHNIIN-JYJNAYRXSA-N Lys-Gln-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IRRZDAIFYHNIIN-JYJNAYRXSA-N 0.000 description 3
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 3
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 3
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 3
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 3
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 3
- 241000124008 Mammalia Species 0.000 description 3
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 3
- YYEIFXZOBZVDPH-DCAQKATOSA-N Met-Lys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O YYEIFXZOBZVDPH-DCAQKATOSA-N 0.000 description 3
- 101100097134 Mus musculus Rbpj gene Proteins 0.000 description 3
- 108010066427 N-valyltryptophan Proteins 0.000 description 3
- 108010058846 Ovalbumin Proteins 0.000 description 3
- 208000030852 Parasitic disease Diseases 0.000 description 3
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 3
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 3
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 3
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 3
- 241000709664 Picornaviridae Species 0.000 description 3
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 3
- WPQKSRHDTMRSJM-CIUDSAMLSA-N Pro-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 WPQKSRHDTMRSJM-CIUDSAMLSA-N 0.000 description 3
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 3
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 3
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 3
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 3
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 3
- IALSFJSONJZBKB-HRCADAONSA-N Pro-Tyr-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N3CCC[C@@H]3C(=O)O IALSFJSONJZBKB-HRCADAONSA-N 0.000 description 3
- 108010003201 RGH 0205 Proteins 0.000 description 3
- 241000315672 SARS coronavirus Species 0.000 description 3
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 3
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 3
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 3
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 3
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 3
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 3
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 3
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 3
- AABIBDJHSKIMJK-FXQIFTODSA-N Ser-Ser-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O AABIBDJHSKIMJK-FXQIFTODSA-N 0.000 description 3
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 3
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 3
- 108091008874 T cell receptors Proteins 0.000 description 3
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 description 3
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 3
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 3
- 102000002689 Toll-like receptor Human genes 0.000 description 3
- 108020000411 Toll-like receptor Proteins 0.000 description 3
- DZKFGCNKEVMXFA-JUKXBJQTSA-N Tyr-Ile-His Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O DZKFGCNKEVMXFA-JUKXBJQTSA-N 0.000 description 3
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 3
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 3
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 3
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 3
- UJVUMTUBMCYKBK-BNOPZSDTSA-N [(2r)-2-hexadecanoyloxy-3-[hydroxy-[(2r,3r,5s,6r)-2,3,5,6-tetrahydroxy-4-phosphonooxycyclohexyl]oxyphosphoryl]oxypropyl] hexadecanoate Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@@H](OC(=O)CCCCCCCCCCCCCCC)COP(O)(=O)OC1[C@H](O)[C@H](O)C(OP(O)(O)=O)[C@H](O)[C@H]1O UJVUMTUBMCYKBK-BNOPZSDTSA-N 0.000 description 3
- 210000005006 adaptive immune system Anatomy 0.000 description 3
- 230000004721 adaptive immunity Effects 0.000 description 3
- 230000032683 aging Effects 0.000 description 3
- 108010070944 alanylhistidine Proteins 0.000 description 3
- 108010011559 alanylphenylalanine Proteins 0.000 description 3
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 3
- 108010062796 arginyllysine Proteins 0.000 description 3
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 3
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 210000000988 bone and bone Anatomy 0.000 description 3
- 230000008668 cellular reprogramming Effects 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 230000000139 costimulatory effect Effects 0.000 description 3
- 229940104302 cytosine Drugs 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 239000003085 diluting agent Substances 0.000 description 3
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 3
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 3
- 208000035475 disorder Diseases 0.000 description 3
- 239000003937 drug carrier Substances 0.000 description 3
- 210000003162 effector t lymphocyte Anatomy 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 108091006047 fluorescent proteins Proteins 0.000 description 3
- 102000034287 fluorescent proteins Human genes 0.000 description 3
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 3
- 229920000159 gelatin Polymers 0.000 description 3
- 239000008273 gelatin Substances 0.000 description 3
- 235000019322 gelatine Nutrition 0.000 description 3
- 235000011852 gelatine desserts Nutrition 0.000 description 3
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 3
- 108010079547 glutamylmethionine Proteins 0.000 description 3
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 3
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 210000003714 granulocyte Anatomy 0.000 description 3
- 208000014829 head and neck neoplasm Diseases 0.000 description 3
- 201000005787 hematologic cancer Diseases 0.000 description 3
- 208000024200 hematopoietic and lymphoid system neoplasm Diseases 0.000 description 3
- 208000006454 hepatitis Diseases 0.000 description 3
- 231100000283 hepatitis Toxicity 0.000 description 3
- 230000002519 immonomodulatory effect Effects 0.000 description 3
- 210000002865 immune cell Anatomy 0.000 description 3
- 230000006058 immune tolerance Effects 0.000 description 3
- 238000002513 implantation Methods 0.000 description 3
- 239000012678 infectious agent Substances 0.000 description 3
- 230000000670 limiting effect Effects 0.000 description 3
- 210000001165 lymph node Anatomy 0.000 description 3
- 210000003738 lymphoid progenitor cell Anatomy 0.000 description 3
- 230000003211 malignant effect Effects 0.000 description 3
- 108010056582 methionylglutamic acid Proteins 0.000 description 3
- 230000004770 neurodegeneration Effects 0.000 description 3
- 208000015122 neurodegenerative disease Diseases 0.000 description 3
- 230000000626 neurodegenerative effect Effects 0.000 description 3
- 238000001543 one-way ANOVA Methods 0.000 description 3
- 239000002245 particle Substances 0.000 description 3
- 108010024607 phenylalanylalanine Proteins 0.000 description 3
- 108010012581 phenylalanylglutamate Proteins 0.000 description 3
- 239000002953 phosphate buffered saline Substances 0.000 description 3
- 239000013612 plasmid Substances 0.000 description 3
- 230000037452 priming Effects 0.000 description 3
- 102000005962 receptors Human genes 0.000 description 3
- 108020003175 receptors Proteins 0.000 description 3
- 230000002829 reductive effect Effects 0.000 description 3
- 210000003289 regulatory T cell Anatomy 0.000 description 3
- 230000035945 sensitivity Effects 0.000 description 3
- 238000002864 sequence alignment Methods 0.000 description 3
- 230000011664 signaling Effects 0.000 description 3
- 150000003384 small molecules Chemical class 0.000 description 3
- 230000000392 somatic effect Effects 0.000 description 3
- 238000007619 statistical method Methods 0.000 description 3
- 201000000596 systemic lupus erythematosus Diseases 0.000 description 3
- 210000001541 thymus gland Anatomy 0.000 description 3
- 230000002103 transcriptional effect Effects 0.000 description 3
- 241000712461 unidentified influenza virus Species 0.000 description 3
- SXUXMRMBWZCMEN-UHFFFAOYSA-N 2'-O-methyl uridine Natural products COC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 SXUXMRMBWZCMEN-UHFFFAOYSA-N 0.000 description 2
- GJTBSTBJLVYKAU-XVFCMESISA-N 2-thiouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=S)NC(=O)C=C1 GJTBSTBJLVYKAU-XVFCMESISA-N 0.000 description 2
- 208000031261 Acute myeloid leukaemia Diseases 0.000 description 2
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 2
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 2
- KVWLTGNCJYDJET-LSJOCFKGSA-N Ala-Arg-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KVWLTGNCJYDJET-LSJOCFKGSA-N 0.000 description 2
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 2
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 2
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 2
- XAGIMRPOEJSYER-CIUDSAMLSA-N Ala-Cys-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XAGIMRPOEJSYER-CIUDSAMLSA-N 0.000 description 2
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 2
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 2
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 2
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 2
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 2
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 2
- 108010076441 Ala-His-His Proteins 0.000 description 2
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 2
- QPBSRMDNJOTFAL-AICCOOGYSA-N Ala-Leu-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QPBSRMDNJOTFAL-AICCOOGYSA-N 0.000 description 2
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 2
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- RAAWHFXHAACDFT-FXQIFTODSA-N Ala-Met-Asn Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CC(N)=O)C(O)=O RAAWHFXHAACDFT-FXQIFTODSA-N 0.000 description 2
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 2
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 2
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 2
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 2
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 2
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 2
- UCDOXFBTMLKASE-HERUPUMHSA-N Ala-Ser-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N UCDOXFBTMLKASE-HERUPUMHSA-N 0.000 description 2
- ZVWXMTTZJKBJCI-BHDSKKPTSA-N Ala-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 ZVWXMTTZJKBJCI-BHDSKKPTSA-N 0.000 description 2
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 2
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 2
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 2
- DPNHSNLIULPOBH-GUBZILKMSA-N Arg-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DPNHSNLIULPOBH-GUBZILKMSA-N 0.000 description 2
- RCAUJZASOAFTAJ-FXQIFTODSA-N Arg-Asp-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N RCAUJZASOAFTAJ-FXQIFTODSA-N 0.000 description 2
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 2
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 2
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 2
- ZZZWQALDSQQBEW-STQMWFEESA-N Arg-Gly-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZZZWQALDSQQBEW-STQMWFEESA-N 0.000 description 2
- GFMWTFHOZGLTLC-AVGNSLFASA-N Arg-His-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O GFMWTFHOZGLTLC-AVGNSLFASA-N 0.000 description 2
- HCIUUZGFTDTEGM-NAKRPEOUSA-N Arg-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HCIUUZGFTDTEGM-NAKRPEOUSA-N 0.000 description 2
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 2
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 2
- FNXCAFKDGBROCU-STECZYCISA-N Arg-Ile-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FNXCAFKDGBROCU-STECZYCISA-N 0.000 description 2
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 2
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 2
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 2
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 2
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 2
- OMKZPCPZEFMBIT-SRVKXCTJSA-N Arg-Met-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OMKZPCPZEFMBIT-SRVKXCTJSA-N 0.000 description 2
- STHNZYKCJHWULY-AVGNSLFASA-N Arg-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O STHNZYKCJHWULY-AVGNSLFASA-N 0.000 description 2
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 2
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 2
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 2
- ZCSHHTFOZULVLN-SZMVWBNQSA-N Arg-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 ZCSHHTFOZULVLN-SZMVWBNQSA-N 0.000 description 2
- PJOPLXOCKACMLK-KKUMJFAQSA-N Arg-Tyr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PJOPLXOCKACMLK-KKUMJFAQSA-N 0.000 description 2
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 2
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 2
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 2
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 2
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 2
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 2
- JZRLLSOWDYUKOK-SRVKXCTJSA-N Asn-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N JZRLLSOWDYUKOK-SRVKXCTJSA-N 0.000 description 2
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 2
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 2
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 2
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 2
- PLVAAIPKSGUXDV-WHFBIAKZSA-N Asn-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)N PLVAAIPKSGUXDV-WHFBIAKZSA-N 0.000 description 2
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 2
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 2
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 2
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 2
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 2
- HGGIYWURFPGLIU-FXQIFTODSA-N Asn-Met-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(N)=O HGGIYWURFPGLIU-FXQIFTODSA-N 0.000 description 2
- RLHANKIRBONJBK-IHRRRGAJSA-N Asn-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N RLHANKIRBONJBK-IHRRRGAJSA-N 0.000 description 2
- OSZBYGVKAFZWKC-FXQIFTODSA-N Asn-Pro-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(O)=O OSZBYGVKAFZWKC-FXQIFTODSA-N 0.000 description 2
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 2
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 2
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 2
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 2
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 2
- FMWHSNJMHUNLAG-FXQIFTODSA-N Asp-Cys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FMWHSNJMHUNLAG-FXQIFTODSA-N 0.000 description 2
- DZQKLNLLWFQONU-LKXGYXEUSA-N Asp-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)O DZQKLNLLWFQONU-LKXGYXEUSA-N 0.000 description 2
- LJRPYAZQQWHEEV-FXQIFTODSA-N Asp-Gln-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O LJRPYAZQQWHEEV-FXQIFTODSA-N 0.000 description 2
- BIVYLQMZPHDUIH-WHFBIAKZSA-N Asp-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)O BIVYLQMZPHDUIH-WHFBIAKZSA-N 0.000 description 2
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 2
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 2
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 2
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 2
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 2
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 2
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 2
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 2
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 2
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 2
- QTIZKMMLNUMHHU-DCAQKATOSA-N Asp-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QTIZKMMLNUMHHU-DCAQKATOSA-N 0.000 description 2
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 2
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 2
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 2
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 2
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 2
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 2
- 208000032791 BCR-ABL1 positive chronic myelogenous leukemia Diseases 0.000 description 2
- 206010005003 Bladder cancer Diseases 0.000 description 2
- 208000003174 Brain Neoplasms Diseases 0.000 description 2
- 206010006187 Breast cancer Diseases 0.000 description 2
- 208000026310 Breast neoplasm Diseases 0.000 description 2
- 238000011740 C57BL/6 mouse Methods 0.000 description 2
- 206010008342 Cervix carcinoma Diseases 0.000 description 2
- 241000498849 Chlamydiales Species 0.000 description 2
- 208000010833 Chronic myeloid leukaemia Diseases 0.000 description 2
- 241000710777 Classical swine fever virus Species 0.000 description 2
- 101150000943 Clec10a gene Proteins 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 206010009944 Colon cancer Diseases 0.000 description 2
- 241000711573 Coronaviridae Species 0.000 description 2
- 241000710127 Cricket paralysis virus Species 0.000 description 2
- QDFBJJABJKOLTD-FXQIFTODSA-N Cys-Asn-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QDFBJJABJKOLTD-FXQIFTODSA-N 0.000 description 2
- HHABWQIFXZPZCK-ACZMJKKPSA-N Cys-Gln-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N HHABWQIFXZPZCK-ACZMJKKPSA-N 0.000 description 2
- GCDLPNRHPWBKJJ-WDSKDSINSA-N Cys-Gly-Glu Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GCDLPNRHPWBKJJ-WDSKDSINSA-N 0.000 description 2
- HAYVLBZZBDCKRA-SRVKXCTJSA-N Cys-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N HAYVLBZZBDCKRA-SRVKXCTJSA-N 0.000 description 2
- IDFVDSBJNMPBSX-SRVKXCTJSA-N Cys-Lys-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O IDFVDSBJNMPBSX-SRVKXCTJSA-N 0.000 description 2
- ALTQTAKGRFLRLR-GUBZILKMSA-N Cys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N ALTQTAKGRFLRLR-GUBZILKMSA-N 0.000 description 2
- 241000701022 Cytomegalovirus Species 0.000 description 2
- 241000702421 Dependoparvovirus Species 0.000 description 2
- 201000004624 Dermatitis Diseases 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 241000710188 Encephalomyocarditis virus Species 0.000 description 2
- 102100038591 Endothelial cell-selective adhesion molecule Human genes 0.000 description 2
- 241000194032 Enterococcus faecalis Species 0.000 description 2
- 241000709661 Enterovirus Species 0.000 description 2
- 241000192125 Firmicutes Species 0.000 description 2
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 2
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 2
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 2
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 2
- JFOKLAPFYCTNHW-SRVKXCTJSA-N Gln-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N JFOKLAPFYCTNHW-SRVKXCTJSA-N 0.000 description 2
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 2
- QFTRCUPCARNIPZ-XHNCKOQMSA-N Gln-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N)C(=O)O QFTRCUPCARNIPZ-XHNCKOQMSA-N 0.000 description 2
- LVNILKSSFHCSJZ-IHRRRGAJSA-N Gln-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LVNILKSSFHCSJZ-IHRRRGAJSA-N 0.000 description 2
- RBWKVOSARCFSQQ-FXQIFTODSA-N Gln-Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O RBWKVOSARCFSQQ-FXQIFTODSA-N 0.000 description 2
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 2
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 2
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 2
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 2
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 2
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 2
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 2
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 2
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 2
- XUZQMPGBGFQJMY-SRVKXCTJSA-N Gln-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N XUZQMPGBGFQJMY-SRVKXCTJSA-N 0.000 description 2
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 2
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 2
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 2
- NPMFDZGLKBNFOO-SRVKXCTJSA-N Gln-Pro-His Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NPMFDZGLKBNFOO-SRVKXCTJSA-N 0.000 description 2
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 2
- KVQOVQVGVKDZNW-GUBZILKMSA-N Gln-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KVQOVQVGVKDZNW-GUBZILKMSA-N 0.000 description 2
- SYTFJIQPBRJSOK-NKIYYHGXSA-N Gln-Thr-His Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 SYTFJIQPBRJSOK-NKIYYHGXSA-N 0.000 description 2
- IIMZHVKZBGSEKZ-SZMVWBNQSA-N Gln-Trp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O IIMZHVKZBGSEKZ-SZMVWBNQSA-N 0.000 description 2
- KHHDJQRWIFHXHS-NRPADANISA-N Gln-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHHDJQRWIFHXHS-NRPADANISA-N 0.000 description 2
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 2
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 2
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 2
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 2
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 2
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 2
- GGJOGFJIPPGNRK-JSGCOSHPSA-N Glu-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 GGJOGFJIPPGNRK-JSGCOSHPSA-N 0.000 description 2
- WDTAKCUOIKHCTB-NKIYYHGXSA-N Glu-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N)O WDTAKCUOIKHCTB-NKIYYHGXSA-N 0.000 description 2
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 2
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 2
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 2
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 2
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 2
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 2
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 2
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 2
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 2
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 2
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 2
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 2
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 2
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 2
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 2
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 2
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 2
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 2
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 2
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 2
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 2
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 2
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 2
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 2
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 2
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 2
- YZACQYVWLCQWBT-BQBZGAKWSA-N Gly-Cys-Arg Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YZACQYVWLCQWBT-BQBZGAKWSA-N 0.000 description 2
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 2
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 2
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 2
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 2
- QSVMIMFAAZPCAQ-PMVVWTBXSA-N Gly-His-Thr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QSVMIMFAAZPCAQ-PMVVWTBXSA-N 0.000 description 2
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 2
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 2
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 2
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 2
- OMOZPGCHVWOXHN-BQBZGAKWSA-N Gly-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)CN OMOZPGCHVWOXHN-BQBZGAKWSA-N 0.000 description 2
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 2
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 2
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 2
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 2
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 2
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 2
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 2
- GULGDABMYTYMJZ-STQMWFEESA-N Gly-Trp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O GULGDABMYTYMJZ-STQMWFEESA-N 0.000 description 2
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 2
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- 208000030836 Hashimoto thyroiditis Diseases 0.000 description 2
- 241000590002 Helicobacter pylori Species 0.000 description 2
- 206010019799 Hepatitis viral Diseases 0.000 description 2
- 208000009889 Herpes Simplex Diseases 0.000 description 2
- VOKCBYNCZVSILJ-KKUMJFAQSA-N His-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)O VOKCBYNCZVSILJ-KKUMJFAQSA-N 0.000 description 2
- VYMGAXSNYUFVCK-GUBZILKMSA-N His-Gln-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N VYMGAXSNYUFVCK-GUBZILKMSA-N 0.000 description 2
- NELVFWFDOKRTOR-SDDRHHMPSA-N His-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O NELVFWFDOKRTOR-SDDRHHMPSA-N 0.000 description 2
- QMUHTRISZMFKAY-MXAVVETBSA-N His-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N QMUHTRISZMFKAY-MXAVVETBSA-N 0.000 description 2
- SKYULSWNBYAQMG-IHRRRGAJSA-N His-Leu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SKYULSWNBYAQMG-IHRRRGAJSA-N 0.000 description 2
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 2
- SVVULKPWDBIPCO-BZSNNMDCSA-N His-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SVVULKPWDBIPCO-BZSNNMDCSA-N 0.000 description 2
- AJTBOTWDSRSUDV-ULQDDVLXSA-N His-Phe-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O AJTBOTWDSRSUDV-ULQDDVLXSA-N 0.000 description 2
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 2
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 2
- SYPULFZAGBBIOM-GVXVVHGQSA-N His-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SYPULFZAGBBIOM-GVXVVHGQSA-N 0.000 description 2
- 108700005087 Homeobox Genes Proteins 0.000 description 2
- 101710097363 Homeobox protein TGIF1 Proteins 0.000 description 2
- 101000882622 Homo sapiens Endothelial cell-selective adhesion molecule Proteins 0.000 description 2
- 101000708741 Homo sapiens Transcription factor RelB Proteins 0.000 description 2
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 2
- 206010020850 Hyperthyroidism Diseases 0.000 description 2
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 2
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 2
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 2
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 2
- FHCNLXMTQJNJNH-KBIXCLLPSA-N Ile-Cys-Gln Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)O FHCNLXMTQJNJNH-KBIXCLLPSA-N 0.000 description 2
- LJKDGRWXYUTRSH-YVNDNENWSA-N Ile-Gln-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LJKDGRWXYUTRSH-YVNDNENWSA-N 0.000 description 2
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 2
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 2
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 2
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 2
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 2
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 2
- VOCZPDONPURUHV-QEWYBTABSA-N Ile-Phe-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VOCZPDONPURUHV-QEWYBTABSA-N 0.000 description 2
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 2
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 2
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 2
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 2
- 102100026720 Interferon beta Human genes 0.000 description 2
- 108090000467 Interferon-beta Proteins 0.000 description 2
- 108010002386 Interleukin-3 Proteins 0.000 description 2
- 102000000646 Interleukin-3 Human genes 0.000 description 2
- 108090001005 Interleukin-6 Proteins 0.000 description 2
- 102000004889 Interleukin-6 Human genes 0.000 description 2
- 108010002335 Interleukin-9 Proteins 0.000 description 2
- 241000588915 Klebsiella aerogenes Species 0.000 description 2
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- 241000713666 Lentivirus Species 0.000 description 2
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 2
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 2
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 2
- CNNQBZRGQATKNY-DCAQKATOSA-N Leu-Arg-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N CNNQBZRGQATKNY-DCAQKATOSA-N 0.000 description 2
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 2
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 2
- ZDSNOSQHMJBRQN-SRVKXCTJSA-N Leu-Asp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZDSNOSQHMJBRQN-SRVKXCTJSA-N 0.000 description 2
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 2
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 2
- YORLGJINWYYIMX-KKUMJFAQSA-N Leu-Cys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YORLGJINWYYIMX-KKUMJFAQSA-N 0.000 description 2
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 2
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 2
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 2
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 2
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 2
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 2
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- XQXGNBFMAXWIGI-MXAVVETBSA-N Leu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 XQXGNBFMAXWIGI-MXAVVETBSA-N 0.000 description 2
- MPSBSKHOWJQHBS-IHRRRGAJSA-N Leu-His-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N MPSBSKHOWJQHBS-IHRRRGAJSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 2
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 2
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 2
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 2
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 2
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 2
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 2
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 2
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 2
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 2
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 2
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 2
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 2
- UIIMIKFNIYPDJF-WDSOQIARSA-N Leu-Trp-Met Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCSC)C(O)=O)NC(=O)[C@@H](N)CC(C)C)=CNC2=C1 UIIMIKFNIYPDJF-WDSOQIARSA-N 0.000 description 2
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 2
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 2
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 2
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 2
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 2
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 2
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 2
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 2
- NTXYXFDMIHXTHE-WDSOQIARSA-N Leu-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 NTXYXFDMIHXTHE-WDSOQIARSA-N 0.000 description 2
- 206010025323 Lymphomas Diseases 0.000 description 2
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 2
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 2
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 2
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 2
- ZAWOJFFMBANLGE-CIUDSAMLSA-N Lys-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N ZAWOJFFMBANLGE-CIUDSAMLSA-N 0.000 description 2
- SSYOBDBNBQBSQE-SRVKXCTJSA-N Lys-Cys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O SSYOBDBNBQBSQE-SRVKXCTJSA-N 0.000 description 2
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 2
- KZJQUYFDSCFSCO-DLOVCJGASA-N Lys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N KZJQUYFDSCFSCO-DLOVCJGASA-N 0.000 description 2
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 2
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 2
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 2
- SKUOQDYMJFUMOE-ULQDDVLXSA-N Lys-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N SKUOQDYMJFUMOE-ULQDDVLXSA-N 0.000 description 2
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 2
- ZVXSESPJMKNIQA-YXMSTPNBSA-N Lys-Thr-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 ZVXSESPJMKNIQA-YXMSTPNBSA-N 0.000 description 2
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 2
- KXYLFJIQDIMURW-IHPCNDPISA-N Lys-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCCN)=CNC2=C1 KXYLFJIQDIMURW-IHPCNDPISA-N 0.000 description 2
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 2
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 2
- BWECSLVQIWEMSC-IHRRRGAJSA-N Lys-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BWECSLVQIWEMSC-IHRRRGAJSA-N 0.000 description 2
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 2
- 108010046938 Macrophage Colony-Stimulating Factor Proteins 0.000 description 2
- 102000007651 Macrophage Colony-Stimulating Factor Human genes 0.000 description 2
- 108700018351 Major Histocompatibility Complex Proteins 0.000 description 2
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 2
- OBVHKUFUDCPZDW-JYJNAYRXSA-N Met-Arg-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OBVHKUFUDCPZDW-JYJNAYRXSA-N 0.000 description 2
- JYCQGAGDJQYEDB-GUBZILKMSA-N Met-Gln-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O JYCQGAGDJQYEDB-GUBZILKMSA-N 0.000 description 2
- KQBJYJXPZBNEIK-DCAQKATOSA-N Met-Glu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQBJYJXPZBNEIK-DCAQKATOSA-N 0.000 description 2
- MTBVQFFQMXHCPC-CIUDSAMLSA-N Met-Glu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MTBVQFFQMXHCPC-CIUDSAMLSA-N 0.000 description 2
- YORIKIDJCPKBON-YUMQZZPRSA-N Met-Glu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YORIKIDJCPKBON-YUMQZZPRSA-N 0.000 description 2
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 2
- JACAKCWAOHKQBV-UWVGGRQHSA-N Met-Gly-Lys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN JACAKCWAOHKQBV-UWVGGRQHSA-N 0.000 description 2
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 2
- CUICVBQQHMKBRJ-LSJOCFKGSA-N Met-His-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O CUICVBQQHMKBRJ-LSJOCFKGSA-N 0.000 description 2
- ZIIMORLEZLVRIP-SRVKXCTJSA-N Met-Leu-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZIIMORLEZLVRIP-SRVKXCTJSA-N 0.000 description 2
- XDGFFEZAZHRZFR-RHYQMDGZSA-N Met-Leu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDGFFEZAZHRZFR-RHYQMDGZSA-N 0.000 description 2
- OBPCXINRFKHSRY-SDDRHHMPSA-N Met-Met-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N OBPCXINRFKHSRY-SDDRHHMPSA-N 0.000 description 2
- BJPQKNHZHUCQNQ-SRVKXCTJSA-N Met-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)N BJPQKNHZHUCQNQ-SRVKXCTJSA-N 0.000 description 2
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 2
- VOAKKHOIAFKOQZ-JYJNAYRXSA-N Met-Tyr-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=C(O)C=C1 VOAKKHOIAFKOQZ-JYJNAYRXSA-N 0.000 description 2
- VEKRTVRZDMUOQN-AVGNSLFASA-N Met-Val-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 VEKRTVRZDMUOQN-AVGNSLFASA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 208000003250 Mixed connective tissue disease Diseases 0.000 description 2
- 241000714177 Murine leukemia virus Species 0.000 description 2
- 101100496087 Mus musculus Clec12a gene Proteins 0.000 description 2
- 101100335079 Mus musculus Flt3lg gene Proteins 0.000 description 2
- 101001011440 Mus musculus Interferon regulatory factor 4 Proteins 0.000 description 2
- 241000186367 Mycobacterium avium Species 0.000 description 2
- 241000186364 Mycobacterium intracellulare Species 0.000 description 2
- 241000186363 Mycobacterium kansasii Species 0.000 description 2
- 208000033761 Myelogenous Chronic BCR-ABL Positive Leukemia Diseases 0.000 description 2
- 208000033776 Myeloid Acute Leukemia Diseases 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- 108010062618 Oncogene Proteins v-rel Proteins 0.000 description 2
- 206010033128 Ovarian cancer Diseases 0.000 description 2
- 206010061535 Ovarian neoplasm Diseases 0.000 description 2
- 206010061902 Pancreatic neoplasm Diseases 0.000 description 2
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 2
- YMORXCKTSSGYIG-IHRRRGAJSA-N Phe-Arg-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N YMORXCKTSSGYIG-IHRRRGAJSA-N 0.000 description 2
- MPGJIHFJCXTVEX-KKUMJFAQSA-N Phe-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O MPGJIHFJCXTVEX-KKUMJFAQSA-N 0.000 description 2
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 2
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 2
- VLZGUAUYZGQKPM-DRZSPHRISA-N Phe-Gln-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VLZGUAUYZGQKPM-DRZSPHRISA-N 0.000 description 2
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 2
- MFQXSDWKUXTOPZ-DZKIICNBSA-N Phe-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N MFQXSDWKUXTOPZ-DZKIICNBSA-N 0.000 description 2
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 2
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 2
- ODGNUUUDJONJSC-UFYCRDLUSA-N Phe-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O ODGNUUUDJONJSC-UFYCRDLUSA-N 0.000 description 2
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 2
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 2
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 2
- BSTPNLNKHKBONJ-HTUGSXCWSA-N Phe-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O BSTPNLNKHKBONJ-HTUGSXCWSA-N 0.000 description 2
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 2
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 2
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 2
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- 206010035226 Plasma cell myeloma Diseases 0.000 description 2
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 2
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 2
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 2
- QBFONMUYNSNKIX-AVGNSLFASA-N Pro-Arg-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QBFONMUYNSNKIX-AVGNSLFASA-N 0.000 description 2
- XZGWNSIRZIUHHP-SRVKXCTJSA-N Pro-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 XZGWNSIRZIUHHP-SRVKXCTJSA-N 0.000 description 2
- INXAPZFIOVGHSV-CIUDSAMLSA-N Pro-Asn-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 INXAPZFIOVGHSV-CIUDSAMLSA-N 0.000 description 2
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 2
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 2
- XJROSHJRQTXWAE-XGEHTFHBSA-N Pro-Cys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XJROSHJRQTXWAE-XGEHTFHBSA-N 0.000 description 2
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 2
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 2
- ODPIUQVTULPQEP-CIUDSAMLSA-N Pro-Gln-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ODPIUQVTULPQEP-CIUDSAMLSA-N 0.000 description 2
- FISHYTLIMUYTQY-GUBZILKMSA-N Pro-Gln-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 FISHYTLIMUYTQY-GUBZILKMSA-N 0.000 description 2
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 2
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 2
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 2
- AQSMZTIEJMZQEC-DCAQKATOSA-N Pro-His-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O AQSMZTIEJMZQEC-DCAQKATOSA-N 0.000 description 2
- GSPPWVHVBBSPSY-FHWLQOOXSA-N Pro-His-Trp Chemical compound OC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](Cc1cnc[nH]1)NC(=O)[C@@H]1CCCN1 GSPPWVHVBBSPSY-FHWLQOOXSA-N 0.000 description 2
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 2
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 2
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 2
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 2
- AUYKOPJPKUCYHE-SRVKXCTJSA-N Pro-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 AUYKOPJPKUCYHE-SRVKXCTJSA-N 0.000 description 2
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 2
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 2
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 2
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 2
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 2
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 2
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 2
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 2
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 2
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 2
- YIPFBJGBRCJJJD-FHWLQOOXSA-N Pro-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 YIPFBJGBRCJJJD-FHWLQOOXSA-N 0.000 description 2
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 2
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 2
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 2
- PGSWNLRYYONGPE-JYJNAYRXSA-N Pro-Val-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PGSWNLRYYONGPE-JYJNAYRXSA-N 0.000 description 2
- 206010060862 Prostate cancer Diseases 0.000 description 2
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 2
- 238000003559 RNA-seq method Methods 0.000 description 2
- 206010039710 Scleroderma Diseases 0.000 description 2
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 2
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 2
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 2
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 2
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 2
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 2
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 2
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 2
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 2
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 2
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 2
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 2
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 2
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 2
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 2
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 2
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 2
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 2
- FKZSXTKZLPPHQU-GQGQLFGLSA-N Ser-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CO)N FKZSXTKZLPPHQU-GQGQLFGLSA-N 0.000 description 2
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 2
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 2
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 2
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 2
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 2
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 2
- XVWDJUROVRQKAE-KKUMJFAQSA-N Ser-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=CC=C1 XVWDJUROVRQKAE-KKUMJFAQSA-N 0.000 description 2
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 2
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 2
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 2
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 2
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 2
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 2
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 2
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 2
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 2
- WMZVVNLPHFSUPA-BPUTZDHNSA-N Ser-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 WMZVVNLPHFSUPA-BPUTZDHNSA-N 0.000 description 2
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 2
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- 241000700584 Simplexvirus Species 0.000 description 2
- 108010039445 Stem Cell Factor Proteins 0.000 description 2
- 102000015215 Stem Cell Factor Human genes 0.000 description 2
- 241000194017 Streptococcus Species 0.000 description 2
- 241000194049 Streptococcus equinus Species 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 2
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 2
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 2
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 2
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 2
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 2
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 2
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 2
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 2
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 2
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 2
- NYQIZWROIMIQSL-VEVYYDQMSA-N Thr-Pro-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O NYQIZWROIMIQSL-VEVYYDQMSA-N 0.000 description 2
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 2
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 2
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 2
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 2
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 2
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 2
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 2
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 2
- 102000004380 Transcription factor RelB Human genes 0.000 description 2
- 108090000952 Transcription factor RelB Proteins 0.000 description 2
- 108090001012 Transforming Growth Factor beta Proteins 0.000 description 2
- 102000004887 Transforming Growth Factor beta Human genes 0.000 description 2
- 206010052779 Transplant rejections Diseases 0.000 description 2
- 241000243777 Trichinella spiralis Species 0.000 description 2
- MJBBMTOGSOSAKJ-HJXMPXNTSA-N Trp-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MJBBMTOGSOSAKJ-HJXMPXNTSA-N 0.000 description 2
- WPSYJHFHZYJXMW-JSGCOSHPSA-N Trp-Gln-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O WPSYJHFHZYJXMW-JSGCOSHPSA-N 0.000 description 2
- OENGVSDBQHHGBU-QEJZJMRPSA-N Trp-Glu-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OENGVSDBQHHGBU-QEJZJMRPSA-N 0.000 description 2
- XGFGVFMXDXALEV-XIRDDKMYSA-N Trp-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N XGFGVFMXDXALEV-XIRDDKMYSA-N 0.000 description 2
- NLLARHRWSFNEMH-NUTKFTJISA-N Trp-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NLLARHRWSFNEMH-NUTKFTJISA-N 0.000 description 2
- KRCPXGSWDOGHAM-XIRDDKMYSA-N Trp-Lys-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O KRCPXGSWDOGHAM-XIRDDKMYSA-N 0.000 description 2
- UUIYFDAWNBSWPG-IHPCNDPISA-N Trp-Lys-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N UUIYFDAWNBSWPG-IHPCNDPISA-N 0.000 description 2
- WKQNLTQSCYXKQK-VFAJRCTISA-N Trp-Lys-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WKQNLTQSCYXKQK-VFAJRCTISA-N 0.000 description 2
- LWHUUHCERHMOAB-WDSOQIARSA-N Trp-Pro-His Chemical compound N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O LWHUUHCERHMOAB-WDSOQIARSA-N 0.000 description 2
- QUIXRGCMQOXUSV-SZMVWBNQSA-N Trp-Pro-Pro Chemical compound O=C([C@@H]1CCCN1C(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)N1CCC[C@H]1C(O)=O QUIXRGCMQOXUSV-SZMVWBNQSA-N 0.000 description 2
- 241000223109 Trypanosoma cruzi Species 0.000 description 2
- 206010067584 Type 1 diabetes mellitus Diseases 0.000 description 2
- QJBWZNTWJSZUOY-UWJYBYFXSA-N Tyr-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QJBWZNTWJSZUOY-UWJYBYFXSA-N 0.000 description 2
- KDGFPPHLXCEQRN-STECZYCISA-N Tyr-Arg-Ile Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDGFPPHLXCEQRN-STECZYCISA-N 0.000 description 2
- JBBYKPZAPOLCPK-JYJNAYRXSA-N Tyr-Arg-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O JBBYKPZAPOLCPK-JYJNAYRXSA-N 0.000 description 2
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 2
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 2
- KLGFILUOTCBNLJ-IHRRRGAJSA-N Tyr-Cys-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O KLGFILUOTCBNLJ-IHRRRGAJSA-N 0.000 description 2
- UMXSDHPSMROQRB-YJRXYDGGSA-N Tyr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UMXSDHPSMROQRB-YJRXYDGGSA-N 0.000 description 2
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 2
- JWGXUKHIKXZWNG-RYUDHWBXSA-N Tyr-Gly-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JWGXUKHIKXZWNG-RYUDHWBXSA-N 0.000 description 2
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 2
- DAOREBHZAKCOEN-ULQDDVLXSA-N Tyr-Leu-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O DAOREBHZAKCOEN-ULQDDVLXSA-N 0.000 description 2
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 2
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 2
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 2
- YYLHVUCSTXXKBS-IHRRRGAJSA-N Tyr-Pro-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YYLHVUCSTXXKBS-IHRRRGAJSA-N 0.000 description 2
- GPLTZEMVOCZVAV-UFYCRDLUSA-N Tyr-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 GPLTZEMVOCZVAV-UFYCRDLUSA-N 0.000 description 2
- 208000007097 Urinary Bladder Neoplasms Diseases 0.000 description 2
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 description 2
- 108010064997 VPY tripeptide Proteins 0.000 description 2
- 241000700618 Vaccinia virus Species 0.000 description 2
- SMKXLHVZIFKQRB-GUBZILKMSA-N Val-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N SMKXLHVZIFKQRB-GUBZILKMSA-N 0.000 description 2
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 2
- IVXJODPZRWHCCR-JYJNAYRXSA-N Val-Arg-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IVXJODPZRWHCCR-JYJNAYRXSA-N 0.000 description 2
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 2
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 2
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 2
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 2
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 2
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 2
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 2
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 2
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 2
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 2
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 2
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 2
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 2
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 2
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 2
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 2
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 2
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 2
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 2
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 2
- VBTFUDNTMCHPII-UHFFFAOYSA-N Val-Trp-Tyr Natural products C=1NC2=CC=CC=C2C=1CC(NC(=O)C(N)C(C)C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 VBTFUDNTMCHPII-UHFFFAOYSA-N 0.000 description 2
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 2
- 108010066829 alanyl-glutamyl-aspartylprolyine Proteins 0.000 description 2
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- 229940088710 antibiotic agent Drugs 0.000 description 2
- 230000014102 antigen processing and presentation of exogenous peptide antigen via MHC class I Effects 0.000 description 2
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 2
- 108010027234 aspartyl-glycyl-glutamyl-alanine Proteins 0.000 description 2
- 208000010668 atopic eczema Diseases 0.000 description 2
- 230000001363 autoimmune Effects 0.000 description 2
- 230000005784 autoimmunity Effects 0.000 description 2
- 238000009566 cancer vaccine Methods 0.000 description 2
- 229940022399 cancer vaccine Drugs 0.000 description 2
- 230000010261 cell growth Effects 0.000 description 2
- 239000002458 cell surface marker Substances 0.000 description 2
- 238000002659 cell therapy Methods 0.000 description 2
- 201000010881 cervical cancer Diseases 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- OSASVXMJTNOKOY-UHFFFAOYSA-N chlorobutanol Chemical compound CC(C)(O)C(Cl)(Cl)Cl OSASVXMJTNOKOY-UHFFFAOYSA-N 0.000 description 2
- 238000003501 co-culture Methods 0.000 description 2
- 208000029742 colonic neoplasm Diseases 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- FPUGCISOLXNPPC-IOSLPCCCSA-N cordysinin B Chemical compound CO[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(N)=C2N=C1 FPUGCISOLXNPPC-IOSLPCCCSA-N 0.000 description 2
- 230000002596 correlated effect Effects 0.000 description 2
- 206010012601 diabetes mellitus Diseases 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- 231100000676 disease causative agent Toxicity 0.000 description 2
- 238000007877 drug screening Methods 0.000 description 2
- 239000012636 effector Substances 0.000 description 2
- 210000002257 embryonic structure Anatomy 0.000 description 2
- 229940092559 enterobacter aerogenes Drugs 0.000 description 2
- 230000001973 epigenetic effect Effects 0.000 description 2
- 210000002919 epithelial cell Anatomy 0.000 description 2
- 108700014844 flt3 ligand Proteins 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 238000000799 fluorescence microscopy Methods 0.000 description 2
- 230000002538 fungal effect Effects 0.000 description 2
- 238000001476 gene delivery Methods 0.000 description 2
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 2
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 2
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 238000003306 harvesting Methods 0.000 description 2
- 210000000777 hematopoietic system Anatomy 0.000 description 2
- 230000011132 hemopoiesis Effects 0.000 description 2
- 206010073071 hepatocellular carcinoma Diseases 0.000 description 2
- 210000000987 immune system Anatomy 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 208000027866 inflammatory disease Diseases 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 210000005007 innate immune system Anatomy 0.000 description 2
- 239000000543 intermediate Substances 0.000 description 2
- 210000000936 intestine Anatomy 0.000 description 2
- 244000000056 intracellular parasite Species 0.000 description 2
- 238000010253 intravenous injection Methods 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 2
- 108010091871 leucylmethionine Proteins 0.000 description 2
- 210000004072 lung Anatomy 0.000 description 2
- 208000015486 malignant pancreatic neoplasm Diseases 0.000 description 2
- 210000001161 mammalian embryo Anatomy 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 239000012577 media supplement Substances 0.000 description 2
- 210000003593 megakaryocyte Anatomy 0.000 description 2
- 201000001441 melanoma Diseases 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 201000006417 multiple sclerosis Diseases 0.000 description 2
- 206010028417 myasthenia gravis Diseases 0.000 description 2
- 210000003643 myeloid progenitor cell Anatomy 0.000 description 2
- 210000000822 natural killer cell Anatomy 0.000 description 2
- 230000009826 neoplastic cell growth Effects 0.000 description 2
- 210000002569 neuron Anatomy 0.000 description 2
- 230000005868 ontogenesis Effects 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 229940092253 ovalbumin Drugs 0.000 description 2
- 201000002528 pancreatic cancer Diseases 0.000 description 2
- 208000008443 pancreatic carcinoma Diseases 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 210000001948 pro-b lymphocyte Anatomy 0.000 description 2
- 230000001566 pro-viral effect Effects 0.000 description 2
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 230000001737 promoting effect Effects 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 230000001172 regenerating effect Effects 0.000 description 2
- 230000003362 replicative effect Effects 0.000 description 2
- 206010039073 rheumatoid arthritis Diseases 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 238000012174 single-cell RNA sequencing Methods 0.000 description 2
- 210000003491 skin Anatomy 0.000 description 2
- 230000003393 splenic effect Effects 0.000 description 2
- 108010005652 splenotritin Proteins 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 238000004114 suspension culture Methods 0.000 description 2
- 208000011580 syndromic disease Diseases 0.000 description 2
- 230000009885 systemic effect Effects 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- ZRKFYGHZFMAOKI-QMGMOQQFSA-N tgfbeta Chemical compound C([C@H](NC(=O)[C@H](C(C)C)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC)C(C)C)[C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O)C1=CC=C(O)C=C1 ZRKFYGHZFMAOKI-QMGMOQQFSA-N 0.000 description 2
- 230000001225 therapeutic effect Effects 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 108091008023 transcriptional regulators Proteins 0.000 description 2
- 230000014616 translation Effects 0.000 description 2
- 229940096911 trichinella spiralis Drugs 0.000 description 2
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 2
- 208000035408 type 1 diabetes mellitus 1 Diseases 0.000 description 2
- 108010079202 tyrosyl-alanyl-cysteine Proteins 0.000 description 2
- 108010071635 tyrosyl-prolyl-arginine Proteins 0.000 description 2
- 229940035893 uracil Drugs 0.000 description 2
- 201000005112 urinary bladder cancer Diseases 0.000 description 2
- 108010072695 valyl-valyl-tyrosyl-proline Proteins 0.000 description 2
- 230000035899 viability Effects 0.000 description 2
- 201000001862 viral hepatitis Diseases 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- GRYSXUXXBDSYRT-WOUKDFQISA-N (2r,3r,4r,5r)-2-(hydroxymethyl)-4-methoxy-5-[6-(methylamino)purin-9-yl]oxolan-3-ol Chemical compound C1=NC=2C(NC)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1OC GRYSXUXXBDSYRT-WOUKDFQISA-N 0.000 description 1
- NTUPOKHATNSWCY-PMPSAXMXSA-N (2s)-2-[[(2s)-1-[(2r)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C([C@@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=CC=C1 NTUPOKHATNSWCY-PMPSAXMXSA-N 0.000 description 1
- KPDTZVSUQCBOAE-HTFCKZLJSA-N (2s)-2-[[(2s)-1-[(2s)-2-[[(2s)-2-[[(2s)-2-aminopropanoyl]amino]propanoyl]amino]propanoyl]pyrrolidine-2-carbonyl]amino]propanoic acid Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O KPDTZVSUQCBOAE-HTFCKZLJSA-N 0.000 description 1
- PCDUALPXEOKZPE-DXCABUDRSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxypropanoic acid Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O PCDUALPXEOKZPE-DXCABUDRSA-N 0.000 description 1
- OTFGHFBGGZEXEU-PEBGCTIMSA-N 1-[(2r,3r,4r,5r)-4-hydroxy-5-(hydroxymethyl)-3-methoxyoxolan-2-yl]-3-methylpyrimidine-2,4-dione Chemical compound CO[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)N(C)C(=O)C=C1 OTFGHFBGGZEXEU-PEBGCTIMSA-N 0.000 description 1
- QPHRQMAYYMYWFW-FJGDRVTGSA-N 1-[(2r,3s,4r,5r)-3-fluoro-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]pyrimidine-2,4-dione Chemical compound O[C@]1(F)[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 QPHRQMAYYMYWFW-FJGDRVTGSA-N 0.000 description 1
- FPUGCISOLXNPPC-UHFFFAOYSA-N 2'-O-Methyladenosine Natural products COC1C(O)C(CO)OC1N1C2=NC=NC(N)=C2N=C1 FPUGCISOLXNPPC-UHFFFAOYSA-N 0.000 description 1
- RFCQJGFZUQFYRF-UHFFFAOYSA-N 2'-O-Methylcytidine Natural products COC1C(O)C(CO)OC1N1C(=O)N=C(N)C=C1 RFCQJGFZUQFYRF-UHFFFAOYSA-N 0.000 description 1
- OVYNGSFVYRPRCG-UHFFFAOYSA-N 2'-O-Methylguanosine Natural products COC1C(O)C(CO)OC1N1C(NC(N)=NC2=O)=C2N=C1 OVYNGSFVYRPRCG-UHFFFAOYSA-N 0.000 description 1
- RFCQJGFZUQFYRF-ZOQUXTDFSA-N 2'-O-methylcytidine Chemical compound CO[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)N=C(N)C=C1 RFCQJGFZUQFYRF-ZOQUXTDFSA-N 0.000 description 1
- OVYNGSFVYRPRCG-KQYNXXCUSA-N 2'-O-methylguanosine Chemical compound CO[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(N=C(N)NC2=O)=C2N=C1 OVYNGSFVYRPRCG-KQYNXXCUSA-N 0.000 description 1
- 101800001779 2'-O-methyltransferase Proteins 0.000 description 1
- SXUXMRMBWZCMEN-ZOQUXTDFSA-N 2'-O-methyluridine Chemical compound CO[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 SXUXMRMBWZCMEN-ZOQUXTDFSA-N 0.000 description 1
- MXHRCPNRJAMMIM-SHYZEUOFSA-N 2'-deoxyuridine Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 MXHRCPNRJAMMIM-SHYZEUOFSA-N 0.000 description 1
- WOJJIRYPFAZEPF-YFKPBYRVSA-N 2-[[(2s)-2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]propanoyl]amino]acetate Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)CNC(=O)CN WOJJIRYPFAZEPF-YFKPBYRVSA-N 0.000 description 1
- XJFPXLWGZWAWRQ-UHFFFAOYSA-N 2-[[2-[[2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(O)=O XJFPXLWGZWAWRQ-UHFFFAOYSA-N 0.000 description 1
- 108020005345 3' Untranslated Regions Proteins 0.000 description 1
- 238000010600 3H thymidine incorporation assay Methods 0.000 description 1
- ZLOIGESWDJYCTF-UHFFFAOYSA-N 4-Thiouridine Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=S)C=C1 ZLOIGESWDJYCTF-UHFFFAOYSA-N 0.000 description 1
- ZLOIGESWDJYCTF-XVFCMESISA-N 4-thiouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=S)C=C1 ZLOIGESWDJYCTF-XVFCMESISA-N 0.000 description 1
- 108020003589 5' Untranslated Regions Proteins 0.000 description 1
- IPRQAJTUSRLECG-UHFFFAOYSA-N 5-[6-(dimethylamino)purin-9-yl]-2-(hydroxymethyl)-4-methoxyoxolan-3-ol Chemical compound COC1C(O)C(CO)OC1N1C2=NC=NC(N(C)C)=C2N=C1 IPRQAJTUSRLECG-UHFFFAOYSA-N 0.000 description 1
- SQDAZGGFXASXDW-UHFFFAOYSA-N 5-bromo-2-(trifluoromethoxy)pyridine Chemical compound FC(F)(F)OC1=CC=C(Br)C=N1 SQDAZGGFXASXDW-UHFFFAOYSA-N 0.000 description 1
- OGHAROSJZRTIOK-KQYNXXCUSA-O 7-methylguanosine Chemical compound C1=2N=C(N)NC(=O)C=2[N+](C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OGHAROSJZRTIOK-KQYNXXCUSA-O 0.000 description 1
- 241000186041 Actinomyces israelii Species 0.000 description 1
- 208000024893 Acute lymphoblastic leukemia Diseases 0.000 description 1
- 208000014697 Acute lymphocytic leukaemia Diseases 0.000 description 1
- 208000026872 Addison Disease Diseases 0.000 description 1
- 241000701242 Adenoviridae Species 0.000 description 1
- 208000006468 Adrenal Cortex Neoplasms Diseases 0.000 description 1
- 208000009746 Adult T-Cell Leukemia-Lymphoma Diseases 0.000 description 1
- 208000016683 Adult T-cell leukemia/lymphoma Diseases 0.000 description 1
- 241000701386 African swine fever virus Species 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- ZFXQNADNEBRERM-BJDJZHNGSA-N Ala-Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 ZFXQNADNEBRERM-BJDJZHNGSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- FOWHQTWRLFTELJ-FXQIFTODSA-N Ala-Asp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N FOWHQTWRLFTELJ-FXQIFTODSA-N 0.000 description 1
- FRFDXQWNDZMREB-ACZMJKKPSA-N Ala-Cys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRFDXQWNDZMREB-ACZMJKKPSA-N 0.000 description 1
- WCBVQNZTOKJWJS-ACZMJKKPSA-N Ala-Cys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O WCBVQNZTOKJWJS-ACZMJKKPSA-N 0.000 description 1
- CXZFXHGJJPVUJE-CIUDSAMLSA-N Ala-Cys-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O)N CXZFXHGJJPVUJE-CIUDSAMLSA-N 0.000 description 1
- OILNWMNBLIHXQK-ZLUOBGJFSA-N Ala-Cys-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O OILNWMNBLIHXQK-ZLUOBGJFSA-N 0.000 description 1
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 1
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- ZPXCNXMJEZKRLU-LSJOCFKGSA-N Ala-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 ZPXCNXMJEZKRLU-LSJOCFKGSA-N 0.000 description 1
- LBFXVAXPDOBRKU-LKTVYLICSA-N Ala-His-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LBFXVAXPDOBRKU-LKTVYLICSA-N 0.000 description 1
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- VGMNWQOPSFBBBG-XUXIUFHCSA-N Ala-Leu-Leu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VGMNWQOPSFBBBG-XUXIUFHCSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 1
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 1
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- OMCKWYSDUQBYCN-FXQIFTODSA-N Ala-Ser-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O OMCKWYSDUQBYCN-FXQIFTODSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 1
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- 208000035285 Allergic Seasonal Rhinitis Diseases 0.000 description 1
- 206010001742 Allergy to animal Diseases 0.000 description 1
- 208000003343 Antiphospholipid Syndrome Diseases 0.000 description 1
- 241000712892 Arenaviridae Species 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- XPSGESXVBSQZPL-SRVKXCTJSA-N Arg-Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XPSGESXVBSQZPL-SRVKXCTJSA-N 0.000 description 1
- JGDGLDNAQJJGJI-AVGNSLFASA-N Arg-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N JGDGLDNAQJJGJI-AVGNSLFASA-N 0.000 description 1
- HJWQFFYRVFEWRM-SRVKXCTJSA-N Arg-Arg-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O HJWQFFYRVFEWRM-SRVKXCTJSA-N 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- YFBGNGASPGRWEM-DCAQKATOSA-N Arg-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFBGNGASPGRWEM-DCAQKATOSA-N 0.000 description 1
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 1
- ZEAYJGRKRUBDOB-GARJFASQSA-N Arg-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZEAYJGRKRUBDOB-GARJFASQSA-N 0.000 description 1
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 1
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 1
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- SYAUZLVLXCDRSH-IUCAKERBSA-N Arg-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N SYAUZLVLXCDRSH-IUCAKERBSA-N 0.000 description 1
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- CRCCTGPNZUCAHE-DCAQKATOSA-N Arg-His-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 CRCCTGPNZUCAHE-DCAQKATOSA-N 0.000 description 1
- FRMQITGHXMUNDF-GMOBBJLQSA-N Arg-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FRMQITGHXMUNDF-GMOBBJLQSA-N 0.000 description 1
- NIUDXSFNLBIWOB-DCAQKATOSA-N Arg-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NIUDXSFNLBIWOB-DCAQKATOSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 1
- QBQVKUNBCAFXSV-ULQDDVLXSA-N Arg-Lys-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QBQVKUNBCAFXSV-ULQDDVLXSA-N 0.000 description 1
- YLVGUOGAFAJMKP-JYJNAYRXSA-N Arg-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YLVGUOGAFAJMKP-JYJNAYRXSA-N 0.000 description 1
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- YFHATWYGAAXQCF-JYJNAYRXSA-N Arg-Pro-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YFHATWYGAAXQCF-JYJNAYRXSA-N 0.000 description 1
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- UGJLILSJKSBVIR-ZFWWWQNUSA-N Arg-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)NCC(O)=O)=CNC2=C1 UGJLILSJKSBVIR-ZFWWWQNUSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- BWMMKQPATDUYKB-IHRRRGAJSA-N Arg-Tyr-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=C(O)C=C1 BWMMKQPATDUYKB-IHRRRGAJSA-N 0.000 description 1
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 1
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 1
- 241000244185 Ascaris lumbricoides Species 0.000 description 1
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 1
- NXVGBGZQQFDUTM-XVYDVKMFSA-N Asn-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N NXVGBGZQQFDUTM-XVYDVKMFSA-N 0.000 description 1
- DQTIWTULBGLJBL-DCAQKATOSA-N Asn-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N DQTIWTULBGLJBL-DCAQKATOSA-N 0.000 description 1
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 1
- WVCJSDCHTUTONA-FXQIFTODSA-N Asn-Asp-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WVCJSDCHTUTONA-FXQIFTODSA-N 0.000 description 1
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 1
- HUAOKVVEVHACHR-CIUDSAMLSA-N Asn-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N HUAOKVVEVHACHR-CIUDSAMLSA-N 0.000 description 1
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 1
- HJRBIWRXULGMOA-ACZMJKKPSA-N Asn-Gln-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJRBIWRXULGMOA-ACZMJKKPSA-N 0.000 description 1
- COUZKSSMBFADSB-AVGNSLFASA-N Asn-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N COUZKSSMBFADSB-AVGNSLFASA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- OWUCNXMFJRFOFI-BQBZGAKWSA-N Asn-Gly-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OWUCNXMFJRFOFI-BQBZGAKWSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- MOHUTCNYQLMARY-GUBZILKMSA-N Asn-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MOHUTCNYQLMARY-GUBZILKMSA-N 0.000 description 1
- SUEIIIFUBHDCCS-PBCZWWQYSA-N Asn-His-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUEIIIFUBHDCCS-PBCZWWQYSA-N 0.000 description 1
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 1
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 1
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 1
- YUUIAUXBNOHFRJ-IHRRRGAJSA-N Asn-Phe-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O YUUIAUXBNOHFRJ-IHRRRGAJSA-N 0.000 description 1
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 1
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 1
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 1
- LUJQEUOZJUWRRX-BPUTZDHNSA-N Asn-Trp-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(O)=O LUJQEUOZJUWRRX-BPUTZDHNSA-N 0.000 description 1
- CPYHLXSGDBDULY-IHPCNDPISA-N Asn-Trp-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CPYHLXSGDBDULY-IHPCNDPISA-N 0.000 description 1
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 1
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 1
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- OGTCOKZFOJIZFG-CIUDSAMLSA-N Asp-His-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OGTCOKZFOJIZFG-CIUDSAMLSA-N 0.000 description 1
- SWTQDYFZVOJVLL-KKUMJFAQSA-N Asp-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)O SWTQDYFZVOJVLL-KKUMJFAQSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 1
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- VMVUDJUXJKDGNR-FXQIFTODSA-N Asp-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N VMVUDJUXJKDGNR-FXQIFTODSA-N 0.000 description 1
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 1
- RRUWMFBLFLUZSI-LPEHRKFASA-N Asp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N RRUWMFBLFLUZSI-LPEHRKFASA-N 0.000 description 1
- LKVKODXGSAFOFY-VEVYYDQMSA-N Asp-Met-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LKVKODXGSAFOFY-VEVYYDQMSA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 1
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 1
- FIAKNCXQFFKSSI-ZLUOBGJFSA-N Asp-Ser-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O FIAKNCXQFFKSSI-ZLUOBGJFSA-N 0.000 description 1
- OFYVKOXTTDCUIL-FXQIFTODSA-N Asp-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OFYVKOXTTDCUIL-FXQIFTODSA-N 0.000 description 1
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 1
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 1
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 1
- IQCJOIHDVFJQFV-LKXGYXEUSA-N Asp-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O IQCJOIHDVFJQFV-LKXGYXEUSA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 1
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 description 1
- ZQFZEBRNAMXXJV-KKUMJFAQSA-N Asp-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O ZQFZEBRNAMXXJV-KKUMJFAQSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 1
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 1
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 1
- XQFLFQWOBXPMHW-NHCYSSNCSA-N Asp-Val-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XQFLFQWOBXPMHW-NHCYSSNCSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 1
- 208000004300 Atrophic Gastritis Diseases 0.000 description 1
- 241000271566 Aves Species 0.000 description 1
- 208000010839 B-cell chronic lymphocytic leukemia Diseases 0.000 description 1
- 241001455947 Babesia divergens Species 0.000 description 1
- 241000223848 Babesia microti Species 0.000 description 1
- 241000193738 Bacillus anthracis Species 0.000 description 1
- 241001148536 Bacteroides sp. Species 0.000 description 1
- 208000023328 Basedow disease Diseases 0.000 description 1
- 208000027496 Behcet disease Diseases 0.000 description 1
- 208000009137 Behcet syndrome Diseases 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- 208000008439 Biliary Liver Cirrhosis Diseases 0.000 description 1
- 208000033222 Biliary cirrhosis primary Diseases 0.000 description 1
- 241000228405 Blastomyces dermatitidis Species 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 241000244038 Brugia malayi Species 0.000 description 1
- 241000143302 Brugia timori Species 0.000 description 1
- 208000011691 Burkitt lymphomas Diseases 0.000 description 1
- 101150077194 CAP1 gene Proteins 0.000 description 1
- 101150013553 CD40 gene Proteins 0.000 description 1
- 241000714198 Caliciviridae Species 0.000 description 1
- 241000589876 Campylobacter Species 0.000 description 1
- 241000222122 Candida albicans Species 0.000 description 1
- 241000606153 Chlamydia trachomatis Species 0.000 description 1
- 229920001287 Chondroitin sulfate Polymers 0.000 description 1
- 208000006332 Choriocarcinoma Diseases 0.000 description 1
- 206010008909 Chronic Hepatitis Diseases 0.000 description 1
- 241000193403 Clostridium Species 0.000 description 1
- 241000193449 Clostridium tetani Species 0.000 description 1
- 241000223205 Coccidioides immitis Species 0.000 description 1
- 208000015943 Coeliac disease Diseases 0.000 description 1
- 206010009900 Colitis ulcerative Diseases 0.000 description 1
- 102000012422 Collagen Type I Human genes 0.000 description 1
- 108010022452 Collagen Type I Proteins 0.000 description 1
- 102000000503 Collagen Type II Human genes 0.000 description 1
- 108010041390 Collagen Type II Proteins 0.000 description 1
- 241000186227 Corynebacterium diphtheriae Species 0.000 description 1
- 241000186249 Corynebacterium sp. Species 0.000 description 1
- 241000709687 Coxsackievirus Species 0.000 description 1
- 208000011231 Crohn disease Diseases 0.000 description 1
- 201000007336 Cryptococcosis Diseases 0.000 description 1
- 241000221204 Cryptococcus neoformans Species 0.000 description 1
- DCJNIJAWIRPPBB-CIUDSAMLSA-N Cys-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N DCJNIJAWIRPPBB-CIUDSAMLSA-N 0.000 description 1
- ZOLXQKZHYOHHMD-DLOVCJGASA-N Cys-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N ZOLXQKZHYOHHMD-DLOVCJGASA-N 0.000 description 1
- XGIAHEUULGOZHH-GUBZILKMSA-N Cys-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N XGIAHEUULGOZHH-GUBZILKMSA-N 0.000 description 1
- ZWNFOZNJYNDNGM-UBHSHLNASA-N Cys-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N ZWNFOZNJYNDNGM-UBHSHLNASA-N 0.000 description 1
- NDUSUIGBMZCOIL-ZKWXMUAHSA-N Cys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N NDUSUIGBMZCOIL-ZKWXMUAHSA-N 0.000 description 1
- UCMIKRLLIOVDRJ-XKBZYTNZSA-N Cys-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)O UCMIKRLLIOVDRJ-XKBZYTNZSA-N 0.000 description 1
- SFRQEQGPRTVDPO-NRPADANISA-N Cys-Gln-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O SFRQEQGPRTVDPO-NRPADANISA-N 0.000 description 1
- UXUSHQYYQCZWET-WDSKDSINSA-N Cys-Glu-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O UXUSHQYYQCZWET-WDSKDSINSA-N 0.000 description 1
- UQHYQYXOLIYNSR-CUJWVEQBSA-N Cys-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N)O UQHYQYXOLIYNSR-CUJWVEQBSA-N 0.000 description 1
- HKALUUKHYNEDRS-GUBZILKMSA-N Cys-Leu-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HKALUUKHYNEDRS-GUBZILKMSA-N 0.000 description 1
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 1
- NIXHTNJAGGFBAW-CIUDSAMLSA-N Cys-Lys-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N NIXHTNJAGGFBAW-CIUDSAMLSA-N 0.000 description 1
- BBQIWFFTTQTNOC-AVGNSLFASA-N Cys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N BBQIWFFTTQTNOC-AVGNSLFASA-N 0.000 description 1
- GFMJUESGWILPEN-MELADBBJSA-N Cys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CS)N)C(=O)O GFMJUESGWILPEN-MELADBBJSA-N 0.000 description 1
- SMEYEQDCCBHTEF-FXQIFTODSA-N Cys-Pro-Ala Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O SMEYEQDCCBHTEF-FXQIFTODSA-N 0.000 description 1
- ZOKPRHVIFAUJPV-GUBZILKMSA-N Cys-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O ZOKPRHVIFAUJPV-GUBZILKMSA-N 0.000 description 1
- DQUWSUWXPWGTQT-DCAQKATOSA-N Cys-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CS DQUWSUWXPWGTQT-DCAQKATOSA-N 0.000 description 1
- KSMSFCBQBQPFAD-GUBZILKMSA-N Cys-Pro-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 KSMSFCBQBQPFAD-GUBZILKMSA-N 0.000 description 1
- CMYVIUWVYHOLRD-ZLUOBGJFSA-N Cys-Ser-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CMYVIUWVYHOLRD-ZLUOBGJFSA-N 0.000 description 1
- ZGERHCJBLPQPGV-ACZMJKKPSA-N Cys-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N ZGERHCJBLPQPGV-ACZMJKKPSA-N 0.000 description 1
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 1
- IXPSSIBVVKSOIE-SRVKXCTJSA-N Cys-Ser-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O IXPSSIBVVKSOIE-SRVKXCTJSA-N 0.000 description 1
- DRXOWZZHCSBUOI-YJRXYDGGSA-N Cys-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N)O DRXOWZZHCSBUOI-YJRXYDGGSA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 208000016192 Demyelinating disease Diseases 0.000 description 1
- 241000725619 Dengue virus Species 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- 206010012468 Dermatitis herpetiformis Diseases 0.000 description 1
- 208000032131 Diabetic Neuropathies Diseases 0.000 description 1
- 241001115402 Ebolavirus Species 0.000 description 1
- 241001466953 Echovirus Species 0.000 description 1
- 206010014733 Endometrial cancer Diseases 0.000 description 1
- 206010014759 Endometrial neoplasm Diseases 0.000 description 1
- 101000889900 Enterobacteria phage T4 Intron-associated endonuclease 1 Proteins 0.000 description 1
- 241001495410 Enterococcus sp. Species 0.000 description 1
- 206010014950 Eosinophilia Diseases 0.000 description 1
- 208000000832 Equine Encephalomyelitis Diseases 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- 241000186810 Erysipelothrix rhusiopathiae Species 0.000 description 1
- 101150049580 Esam gene Proteins 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 208000000461 Esophageal Neoplasms Diseases 0.000 description 1
- 208000032027 Essential Thrombocythemia Diseases 0.000 description 1
- 108010037362 Extracellular Matrix Proteins Proteins 0.000 description 1
- 102000010834 Extracellular Matrix Proteins Human genes 0.000 description 1
- 108010067306 Fibronectins Proteins 0.000 description 1
- 102000016359 Fibronectins Human genes 0.000 description 1
- 241000711950 Filoviridae Species 0.000 description 1
- 241000710781 Flaviviridae Species 0.000 description 1
- 208000004262 Food Hypersensitivity Diseases 0.000 description 1
- 206010016946 Food allergy Diseases 0.000 description 1
- 241000605986 Fusobacterium nucleatum Species 0.000 description 1
- 101150005295 GATA2 gene Proteins 0.000 description 1
- 208000036495 Gastritis atrophic Diseases 0.000 description 1
- 208000005577 Gastroenteritis Diseases 0.000 description 1
- 208000032612 Glial tumor Diseases 0.000 description 1
- 206010018338 Glioma Diseases 0.000 description 1
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 1
- DLOHWQXXGMEZDW-CIUDSAMLSA-N Gln-Arg-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DLOHWQXXGMEZDW-CIUDSAMLSA-N 0.000 description 1
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 1
- XOKGKOQWADCLFQ-GARJFASQSA-N Gln-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XOKGKOQWADCLFQ-GARJFASQSA-N 0.000 description 1
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 1
- GMGKDVVBSVVKCT-NUMRIWBASA-N Gln-Asn-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GMGKDVVBSVVKCT-NUMRIWBASA-N 0.000 description 1
- ZRXBYKAOFHLTDN-GUBZILKMSA-N Gln-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N ZRXBYKAOFHLTDN-GUBZILKMSA-N 0.000 description 1
- VNCLJDOTEPPBBD-GUBZILKMSA-N Gln-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N VNCLJDOTEPPBBD-GUBZILKMSA-N 0.000 description 1
- ZDJZEGYVKANKED-NRPADANISA-N Gln-Cys-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O ZDJZEGYVKANKED-NRPADANISA-N 0.000 description 1
- QFJPFPCSXOXMKI-BPUTZDHNSA-N Gln-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N QFJPFPCSXOXMKI-BPUTZDHNSA-N 0.000 description 1
- OWOFCNWTMWOOJJ-WDSKDSINSA-N Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OWOFCNWTMWOOJJ-WDSKDSINSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 1
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 1
- WEAVZFWWIPIANL-SRVKXCTJSA-N Gln-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N WEAVZFWWIPIANL-SRVKXCTJSA-N 0.000 description 1
- KLKYKPXITJBSNI-CIUDSAMLSA-N Gln-Met-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O KLKYKPXITJBSNI-CIUDSAMLSA-N 0.000 description 1
- QMVCEWKHIUHTSD-GUBZILKMSA-N Gln-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QMVCEWKHIUHTSD-GUBZILKMSA-N 0.000 description 1
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 1
- HHRAEXBUNGTOGZ-IHRRRGAJSA-N Gln-Phe-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O HHRAEXBUNGTOGZ-IHRRRGAJSA-N 0.000 description 1
- DRNMNLKUUKKPIA-HTUGSXCWSA-N Gln-Phe-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)CCC(N)=O)C(O)=O DRNMNLKUUKKPIA-HTUGSXCWSA-N 0.000 description 1
- MFORDNZDKAVNSR-SRVKXCTJSA-N Gln-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O MFORDNZDKAVNSR-SRVKXCTJSA-N 0.000 description 1
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 1
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 1
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 1
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 1
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 1
- RCCDHXSRMWCOOY-GUBZILKMSA-N Glu-Arg-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCCDHXSRMWCOOY-GUBZILKMSA-N 0.000 description 1
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 1
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- SVZIKUHLRKVZIF-GUBZILKMSA-N Glu-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N SVZIKUHLRKVZIF-GUBZILKMSA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 1
- ALCAUWPAMLVUDB-FXQIFTODSA-N Glu-Gln-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ALCAUWPAMLVUDB-FXQIFTODSA-N 0.000 description 1
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 1
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 1
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 1
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- VOORMNJKNBGYGK-YUMQZZPRSA-N Glu-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N VOORMNJKNBGYGK-YUMQZZPRSA-N 0.000 description 1
- BRKUZSLQMPNVFN-SRVKXCTJSA-N Glu-His-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BRKUZSLQMPNVFN-SRVKXCTJSA-N 0.000 description 1
- VGOFRWOTSXVPAU-SDDRHHMPSA-N Glu-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VGOFRWOTSXVPAU-SDDRHHMPSA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 1
- LZMQSTPFYJLVJB-GUBZILKMSA-N Glu-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LZMQSTPFYJLVJB-GUBZILKMSA-N 0.000 description 1
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 1
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 1
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 1
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 1
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- ZAPFAWQHBOHWLL-GUBZILKMSA-N Glu-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N ZAPFAWQHBOHWLL-GUBZILKMSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- VXEFAWJTFAUDJK-AVGNSLFASA-N Glu-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O VXEFAWJTFAUDJK-AVGNSLFASA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- JPXNYFOHTHSREU-UWVGGRQHSA-N Gly-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN JPXNYFOHTHSREU-UWVGGRQHSA-N 0.000 description 1
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 1
- DJTXYXZNNDDEOU-WHFBIAKZSA-N Gly-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)C(=O)N DJTXYXZNNDDEOU-WHFBIAKZSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 1
- BULIVUZUDBHKKZ-WDSKDSINSA-N Gly-Gln-Asn Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BULIVUZUDBHKKZ-WDSKDSINSA-N 0.000 description 1
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 1
- YZPVGIVFMZLQMM-YUMQZZPRSA-N Gly-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN YZPVGIVFMZLQMM-YUMQZZPRSA-N 0.000 description 1
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 1
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- ZKLYPEGLWFVRGF-IUCAKERBSA-N Gly-His-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZKLYPEGLWFVRGF-IUCAKERBSA-N 0.000 description 1
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 1
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 1
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- ZOTGXWMKUFSKEU-QXEWZRGKSA-N Gly-Ile-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O ZOTGXWMKUFSKEU-QXEWZRGKSA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 1
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 1
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 1
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 1
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 1
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 1
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 1
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 1
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 1
- DKJWUIYLMLUBDX-XPUUQOCRSA-N Gly-Val-Cys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O DKJWUIYLMLUBDX-XPUUQOCRSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 208000024869 Goodpasture syndrome Diseases 0.000 description 1
- 208000009329 Graft vs Host Disease Diseases 0.000 description 1
- 208000015023 Graves' disease Diseases 0.000 description 1
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 1
- 241000606790 Haemophilus Species 0.000 description 1
- 206010061192 Haemorrhagic fever Diseases 0.000 description 1
- 208000001204 Hashimoto Disease Diseases 0.000 description 1
- 208000035186 Hemolytic Autoimmune Anemia Diseases 0.000 description 1
- 241000700739 Hepadnaviridae Species 0.000 description 1
- 208000005176 Hepatitis C Diseases 0.000 description 1
- 208000005331 Hepatitis D Diseases 0.000 description 1
- 206010019755 Hepatitis chronic active Diseases 0.000 description 1
- 241000709721 Hepatovirus A Species 0.000 description 1
- 208000007514 Herpes zoster Diseases 0.000 description 1
- 229920000209 Hexadimethrine bromide Polymers 0.000 description 1
- VCDNHBNNPCDBKV-DLOVCJGASA-N His-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VCDNHBNNPCDBKV-DLOVCJGASA-N 0.000 description 1
- DZMVESFTHXSSPZ-XVYDVKMFSA-N His-Ala-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DZMVESFTHXSSPZ-XVYDVKMFSA-N 0.000 description 1
- SVHKVHBPTOMLTO-DCAQKATOSA-N His-Arg-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SVHKVHBPTOMLTO-DCAQKATOSA-N 0.000 description 1
- UCDWNBFOZCZSNV-AVGNSLFASA-N His-Arg-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O UCDWNBFOZCZSNV-AVGNSLFASA-N 0.000 description 1
- LBHOVGUGOBINDL-KKUMJFAQSA-N His-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O LBHOVGUGOBINDL-KKUMJFAQSA-N 0.000 description 1
- ZNNNYCXPCKACHX-DCAQKATOSA-N His-Gln-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNNNYCXPCKACHX-DCAQKATOSA-N 0.000 description 1
- YTKOTXRIWQHSAZ-GUBZILKMSA-N His-Glu-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N YTKOTXRIWQHSAZ-GUBZILKMSA-N 0.000 description 1
- FYTCLUIYTYFGPT-YUMQZZPRSA-N His-Gly-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FYTCLUIYTYFGPT-YUMQZZPRSA-N 0.000 description 1
- OZBDSFBWIDPVDA-BZSNNMDCSA-N His-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CN=CN3)N OZBDSFBWIDPVDA-BZSNNMDCSA-N 0.000 description 1
- ORERHHPZDDEMSC-VGDYDELISA-N His-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ORERHHPZDDEMSC-VGDYDELISA-N 0.000 description 1
- LJUIEESLIAZSFR-SRVKXCTJSA-N His-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LJUIEESLIAZSFR-SRVKXCTJSA-N 0.000 description 1
- UROVZOUMHNXPLZ-AVGNSLFASA-N His-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 UROVZOUMHNXPLZ-AVGNSLFASA-N 0.000 description 1
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 1
- LVWIJITYHRZHBO-IXOXFDKPSA-N His-Leu-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LVWIJITYHRZHBO-IXOXFDKPSA-N 0.000 description 1
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 1
- GJMHMDKCJPQJOI-IHRRRGAJSA-N His-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 GJMHMDKCJPQJOI-IHRRRGAJSA-N 0.000 description 1
- LYDKQVYYCMYNMC-SRVKXCTJSA-N His-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LYDKQVYYCMYNMC-SRVKXCTJSA-N 0.000 description 1
- YXXKBPJEIYFGOD-MGHWNKPDSA-N His-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N YXXKBPJEIYFGOD-MGHWNKPDSA-N 0.000 description 1
- XIGFLVCAVQQGNS-IHRRRGAJSA-N His-Pro-His Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 XIGFLVCAVQQGNS-IHRRRGAJSA-N 0.000 description 1
- AVQNTYBAFBKMDL-WDSOQIARSA-N His-Pro-Trp Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O AVQNTYBAFBKMDL-WDSOQIARSA-N 0.000 description 1
- WKEABZIITNXXQZ-CIUDSAMLSA-N His-Ser-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N WKEABZIITNXXQZ-CIUDSAMLSA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- FCPSGEVYIVXPPO-QTKMDUPCSA-N His-Thr-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FCPSGEVYIVXPPO-QTKMDUPCSA-N 0.000 description 1
- FRDFAWHTPDKRHG-ULQDDVLXSA-N His-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 FRDFAWHTPDKRHG-ULQDDVLXSA-N 0.000 description 1
- DAKSMIWQZPHRIB-BZSNNMDCSA-N His-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DAKSMIWQZPHRIB-BZSNNMDCSA-N 0.000 description 1
- ISQOVWDWRUONJH-YESZJQIVSA-N His-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O ISQOVWDWRUONJH-YESZJQIVSA-N 0.000 description 1
- MUENHEQLLUDKSC-PMVMPFDFSA-N His-Tyr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CNC=N1 MUENHEQLLUDKSC-PMVMPFDFSA-N 0.000 description 1
- CKONPJHGMIDMJP-IHRRRGAJSA-N His-Val-His Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 CKONPJHGMIDMJP-IHRRRGAJSA-N 0.000 description 1
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 1
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 1
- DRKZDEFADVYTLU-AVGNSLFASA-N His-Val-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DRKZDEFADVYTLU-AVGNSLFASA-N 0.000 description 1
- 241000228404 Histoplasma capsulatum Species 0.000 description 1
- 208000017604 Hodgkin disease Diseases 0.000 description 1
- 208000010747 Hodgkins lymphoma Diseases 0.000 description 1
- 101000914484 Homo sapiens T-lymphocyte activation antigen CD80 Proteins 0.000 description 1
- 101000831496 Homo sapiens Toll-like receptor 3 Proteins 0.000 description 1
- 101000669447 Homo sapiens Toll-like receptor 4 Proteins 0.000 description 1
- 101000669402 Homo sapiens Toll-like receptor 7 Proteins 0.000 description 1
- 101000800483 Homo sapiens Toll-like receptor 8 Proteins 0.000 description 1
- 241000701085 Human alphaherpesvirus 3 Species 0.000 description 1
- 241000722343 Human papillomavirus types Species 0.000 description 1
- 208000037147 Hypercalcaemia Diseases 0.000 description 1
- XQFRJNBWHJMXHO-RRKCRQDMSA-N IDUR Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(I)=C1 XQFRJNBWHJMXHO-RRKCRQDMSA-N 0.000 description 1
- 206010021245 Idiopathic thrombocytopenic purpura Diseases 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 1
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 1
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 1
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- SJIGTGZVQGLMGG-NAKRPEOUSA-N Ile-Cys-Arg Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)O SJIGTGZVQGLMGG-NAKRPEOUSA-N 0.000 description 1
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 1
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 1
- OONBGFHNQVSUBF-KBIXCLLPSA-N Ile-Gln-Cys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(O)=O OONBGFHNQVSUBF-KBIXCLLPSA-N 0.000 description 1
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 1
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 1
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 1
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 1
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 1
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 1
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 1
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 1
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 1
- PKGGWLOLRLOPGK-XUXIUFHCSA-N Ile-Leu-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PKGGWLOLRLOPGK-XUXIUFHCSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 1
- FFJQAEYLAQMGDL-MGHWNKPDSA-N Ile-Lys-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FFJQAEYLAQMGDL-MGHWNKPDSA-N 0.000 description 1
- UYNXBNHVWFNVIN-HJWJTTGWSA-N Ile-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 UYNXBNHVWFNVIN-HJWJTTGWSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 1
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 102300041584 Interferon regulatory factor 4 isoform 1 Human genes 0.000 description 1
- 108010047761 Interferon-alpha Proteins 0.000 description 1
- 102000006992 Interferon-alpha Human genes 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- 102000014150 Interferons Human genes 0.000 description 1
- 108090000176 Interleukin-13 Proteins 0.000 description 1
- 108090000171 Interleukin-18 Proteins 0.000 description 1
- 108050009288 Interleukin-19 Proteins 0.000 description 1
- 108010002350 Interleukin-2 Proteins 0.000 description 1
- 108010002616 Interleukin-5 Proteins 0.000 description 1
- 108090001007 Interleukin-8 Proteins 0.000 description 1
- 102000015696 Interleukins Human genes 0.000 description 1
- 108010063738 Interleukins Proteins 0.000 description 1
- 241000701377 Iridoviridae Species 0.000 description 1
- 239000007760 Iscove's Modified Dulbecco's Medium Substances 0.000 description 1
- 208000007766 Kaposi sarcoma Diseases 0.000 description 1
- 241000588747 Klebsiella pneumoniae Species 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- 229930182816 L-glutamine Natural products 0.000 description 1
- 208000007811 Latex Hypersensitivity Diseases 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- 241000589248 Legionella Species 0.000 description 1
- 208000007764 Legionnaires' Disease Diseases 0.000 description 1
- 241000222740 Leishmania braziliensis Species 0.000 description 1
- 241000222727 Leishmania donovani Species 0.000 description 1
- 241000222732 Leishmania major Species 0.000 description 1
- 241000222736 Leishmania tropica Species 0.000 description 1
- 241000589902 Leptospira Species 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 1
- QKIBIXAQKAFZGL-GUBZILKMSA-N Leu-Cys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QKIBIXAQKAFZGL-GUBZILKMSA-N 0.000 description 1
- PPTAQBNUFKTJKA-BJDJZHNGSA-N Leu-Cys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PPTAQBNUFKTJKA-BJDJZHNGSA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 1
- WRLPVDVHNWSSCL-MELADBBJSA-N Leu-His-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N WRLPVDVHNWSSCL-MELADBBJSA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 1
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 1
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- IBSGMIPRBMPMHE-IHRRRGAJSA-N Leu-Met-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O IBSGMIPRBMPMHE-IHRRRGAJSA-N 0.000 description 1
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- 241000186779 Listeria monocytogenes Species 0.000 description 1
- 208000019693 Lung disease Diseases 0.000 description 1
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 1
- 208000031422 Lymphocytic Chronic B-Cell Leukemia Diseases 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- ALSRJRIWBNENFY-DCAQKATOSA-N Lys-Arg-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O ALSRJRIWBNENFY-DCAQKATOSA-N 0.000 description 1
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 1
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 1
- MKBIVWXCFINCLE-SRVKXCTJSA-N Lys-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N MKBIVWXCFINCLE-SRVKXCTJSA-N 0.000 description 1
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 1
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 1
- MLLKLNYPZRDIQG-GUBZILKMSA-N Lys-Cys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N MLLKLNYPZRDIQG-GUBZILKMSA-N 0.000 description 1
- XFBBBRDEQIPGNR-KATARQTJSA-N Lys-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O XFBBBRDEQIPGNR-KATARQTJSA-N 0.000 description 1
- WTZUSCUIVPVCRH-SRVKXCTJSA-N Lys-Gln-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WTZUSCUIVPVCRH-SRVKXCTJSA-N 0.000 description 1
- HEWWNLVEWBJBKA-WDCWCFNPSA-N Lys-Gln-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN HEWWNLVEWBJBKA-WDCWCFNPSA-N 0.000 description 1
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 1
- FGMHXLULNHTPID-KKUMJFAQSA-N Lys-His-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CN=CN1 FGMHXLULNHTPID-KKUMJFAQSA-N 0.000 description 1
- OIYWBDBHEGAVST-BZSNNMDCSA-N Lys-His-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OIYWBDBHEGAVST-BZSNNMDCSA-N 0.000 description 1
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 1
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 1
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 1
- GZGWILAQHOVXTD-DCAQKATOSA-N Lys-Met-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O GZGWILAQHOVXTD-DCAQKATOSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 1
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 1
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 1
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 1
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 1
- QFSYGUMEANRNJE-DCAQKATOSA-N Lys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N QFSYGUMEANRNJE-DCAQKATOSA-N 0.000 description 1
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 1
- 102000043129 MHC class I family Human genes 0.000 description 1
- 108091054437 MHC class I family Proteins 0.000 description 1
- 102000043131 MHC class II family Human genes 0.000 description 1
- 108091054438 MHC class II family Proteins 0.000 description 1
- PWHULOQIROXLJO-UHFFFAOYSA-N Manganese Chemical compound [Mn] PWHULOQIROXLJO-UHFFFAOYSA-N 0.000 description 1
- 241000712079 Measles morbillivirus Species 0.000 description 1
- 210000002361 Megakaryocyte Progenitor Cell Anatomy 0.000 description 1
- 102000018697 Membrane Proteins Human genes 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- BXNZDLVLGYYFIB-FXQIFTODSA-N Met-Asn-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N BXNZDLVLGYYFIB-FXQIFTODSA-N 0.000 description 1
- QXEVZBXTDTVPCP-GMOBBJLQSA-N Met-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCSC)N QXEVZBXTDTVPCP-GMOBBJLQSA-N 0.000 description 1
- OHMKUHXCDSCOMT-QXEWZRGKSA-N Met-Asn-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHMKUHXCDSCOMT-QXEWZRGKSA-N 0.000 description 1
- FJVJLMZUIGMFFU-BQBZGAKWSA-N Met-Asp-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FJVJLMZUIGMFFU-BQBZGAKWSA-N 0.000 description 1
- FBQMBZLJHOQAIH-GUBZILKMSA-N Met-Asp-Met Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O FBQMBZLJHOQAIH-GUBZILKMSA-N 0.000 description 1
- DNDVVILEHVMWIS-LPEHRKFASA-N Met-Asp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DNDVVILEHVMWIS-LPEHRKFASA-N 0.000 description 1
- RAAVFTFEAUAVIY-DCAQKATOSA-N Met-Glu-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N RAAVFTFEAUAVIY-DCAQKATOSA-N 0.000 description 1
- WWWGMQHQSAUXBU-BQBZGAKWSA-N Met-Gly-Asn Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O WWWGMQHQSAUXBU-BQBZGAKWSA-N 0.000 description 1
- XKJUFUPCHARJKX-UWVGGRQHSA-N Met-Gly-His Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 XKJUFUPCHARJKX-UWVGGRQHSA-N 0.000 description 1
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 1
- VAGCEUUEMMXFEX-GUBZILKMSA-N Met-Met-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O VAGCEUUEMMXFEX-GUBZILKMSA-N 0.000 description 1
- PHKBGZKVOJCIMZ-SRVKXCTJSA-N Met-Pro-Arg Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PHKBGZKVOJCIMZ-SRVKXCTJSA-N 0.000 description 1
- NHXXGBXJTLRGJI-GUBZILKMSA-N Met-Pro-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O NHXXGBXJTLRGJI-GUBZILKMSA-N 0.000 description 1
- LHXFNWBNRBWMNV-DCAQKATOSA-N Met-Ser-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LHXFNWBNRBWMNV-DCAQKATOSA-N 0.000 description 1
- GMMLGMFBYCFCCX-KZVJFYERSA-N Met-Thr-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMMLGMFBYCFCCX-KZVJFYERSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- YIGCDRZMZNDENK-UNQGMJICSA-N Met-Thr-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YIGCDRZMZNDENK-UNQGMJICSA-N 0.000 description 1
- NDJSSFWDYDUQID-YTWAJWBKSA-N Met-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N)O NDJSSFWDYDUQID-YTWAJWBKSA-N 0.000 description 1
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 1
- NSMXRFMGZYTFEX-KJEVXHAQSA-N Met-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCSC)N)O NSMXRFMGZYTFEX-KJEVXHAQSA-N 0.000 description 1
- 208000034578 Multiple myelomas Diseases 0.000 description 1
- 241000711386 Mumps virus Species 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 101001011398 Mus musculus Interferon regulatory factor 2 Proteins 0.000 description 1
- 101100245221 Mus musculus Prss8 gene Proteins 0.000 description 1
- 101100101250 Mus musculus Th gene Proteins 0.000 description 1
- 241000186359 Mycobacterium Species 0.000 description 1
- 241000187484 Mycobacterium gordonae Species 0.000 description 1
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 1
- 201000002481 Myositis Diseases 0.000 description 1
- 206010028665 Myxoedema Diseases 0.000 description 1
- IYYIBFCJILKPCO-WOUKDFQISA-O N(2),N(2),N(7)-trimethylguanosine Chemical compound C1=2NC(N(C)C)=NC(=O)C=2N(C)C=[N+]1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O IYYIBFCJILKPCO-WOUKDFQISA-O 0.000 description 1
- ZBYRSRLCXTUFLJ-IOSLPCCCSA-O N(2),N(7)-dimethylguanosine Chemical compound CNC=1NC(C=2[N+](=CN([C@H]3[C@H](O)[C@H](O)[C@@H](CO)O3)C=2N=1)C)=O ZBYRSRLCXTUFLJ-IOSLPCCCSA-O 0.000 description 1
- VQAYFKKCNSOZKM-IOSLPCCCSA-N N(6)-methyladenosine Chemical compound C1=NC=2C(NC)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O VQAYFKKCNSOZKM-IOSLPCCCSA-N 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 102000003945 NF-kappa B Human genes 0.000 description 1
- 108010057466 NF-kappa B Proteins 0.000 description 1
- 208000002454 Nasopharyngeal Carcinoma Diseases 0.000 description 1
- 206010061306 Nasopharyngeal cancer Diseases 0.000 description 1
- 241000588652 Neisseria gonorrhoeae Species 0.000 description 1
- 241000588650 Neisseria meningitidis Species 0.000 description 1
- 206010061309 Neoplasm progression Diseases 0.000 description 1
- 206010029260 Neuroblastoma Diseases 0.000 description 1
- 108010065395 Neuropep-1 Proteins 0.000 description 1
- 208000015914 Non-Hodgkin lymphomas Diseases 0.000 description 1
- 108010019759 OVA 323-339 Proteins 0.000 description 1
- 206010030155 Oesophageal carcinoma Diseases 0.000 description 1
- 241000243985 Onchocerca volvulus Species 0.000 description 1
- 241000150452 Orthohantavirus Species 0.000 description 1
- 241000712464 Orthomyxoviridae Species 0.000 description 1
- 241000150218 Orthonairovirus Species 0.000 description 1
- 101710117757 PR domain zinc finger protein 1 Proteins 0.000 description 1
- 102300039519 PR domain zinc finger protein 1 isoform 1 Human genes 0.000 description 1
- 208000002606 Paramyxoviridae Infections Diseases 0.000 description 1
- 241000701945 Parvoviridae Species 0.000 description 1
- 241000606856 Pasteurella multocida Species 0.000 description 1
- 241000606580 Pasteurella sp. Species 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- 241000150350 Peribunyaviridae Species 0.000 description 1
- 208000031845 Pernicious anaemia Diseases 0.000 description 1
- 206010057249 Phagocytosis Diseases 0.000 description 1
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 1
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 description 1
- HTKNPQZCMLBOTQ-XVSYOHENSA-N Phe-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O HTKNPQZCMLBOTQ-XVSYOHENSA-N 0.000 description 1
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 1
- VJEZWOSKRCLHRP-MELADBBJSA-N Phe-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O VJEZWOSKRCLHRP-MELADBBJSA-N 0.000 description 1
- SXJGROGVINAYSH-AVGNSLFASA-N Phe-Gln-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SXJGROGVINAYSH-AVGNSLFASA-N 0.000 description 1
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 1
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 1
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 description 1
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 1
- HTXVATDVCRFORF-MGHWNKPDSA-N Phe-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N HTXVATDVCRFORF-MGHWNKPDSA-N 0.000 description 1
- ONORAGIFHNAADN-LLLHUVSDSA-N Phe-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N ONORAGIFHNAADN-LLLHUVSDSA-N 0.000 description 1
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 1
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 1
- SRILZRSXIKRGBF-HRCADAONSA-N Phe-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N SRILZRSXIKRGBF-HRCADAONSA-N 0.000 description 1
- WKLMCMXFMQEKCX-SLFFLAALSA-N Phe-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O WKLMCMXFMQEKCX-SLFFLAALSA-N 0.000 description 1
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- KCIKTPHTEYBXMG-BVSLBCMMSA-N Phe-Trp-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCIKTPHTEYBXMG-BVSLBCMMSA-N 0.000 description 1
- AGTHXWTYCLLYMC-FHWLQOOXSA-N Phe-Tyr-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 AGTHXWTYCLLYMC-FHWLQOOXSA-N 0.000 description 1
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 1
- MHNBYYFXWDUGBW-RPTUDFQQSA-N Phe-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O MHNBYYFXWDUGBW-RPTUDFQQSA-N 0.000 description 1
- FXEKNHAJIMHRFJ-ULQDDVLXSA-N Phe-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N FXEKNHAJIMHRFJ-ULQDDVLXSA-N 0.000 description 1
- 241000223960 Plasmodium falciparum Species 0.000 description 1
- 241000223801 Plasmodium knowlesi Species 0.000 description 1
- 241000223821 Plasmodium malariae Species 0.000 description 1
- 241001505293 Plasmodium ovale Species 0.000 description 1
- 241000223810 Plasmodium vivax Species 0.000 description 1
- 108091036407 Polyadenylation Proteins 0.000 description 1
- 208000008601 Polycythemia Diseases 0.000 description 1
- 241001505332 Polyomavirus sp. Species 0.000 description 1
- 241000700625 Poxviridae Species 0.000 description 1
- 208000006664 Precursor Cell Lymphoblastic Leukemia-Lymphoma Diseases 0.000 description 1
- 208000012654 Primary biliary cholangitis Diseases 0.000 description 1
- 208000024777 Prion disease Diseases 0.000 description 1
- FCCBQBZXIAZNIG-LSJOCFKGSA-N Pro-Ala-His Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O FCCBQBZXIAZNIG-LSJOCFKGSA-N 0.000 description 1
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 1
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 1
- OZAPWFHRPINHND-GUBZILKMSA-N Pro-Cys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OZAPWFHRPINHND-GUBZILKMSA-N 0.000 description 1
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 1
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 1
- VPFGPKIWSDVTOY-SRVKXCTJSA-N Pro-Glu-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O VPFGPKIWSDVTOY-SRVKXCTJSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- PTLOFJZJADCNCD-DCAQKATOSA-N Pro-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 PTLOFJZJADCNCD-DCAQKATOSA-N 0.000 description 1
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 1
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- GBRUQFBAJOKCTF-DCAQKATOSA-N Pro-His-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O GBRUQFBAJOKCTF-DCAQKATOSA-N 0.000 description 1
- DTQIXTOJHKVEOH-DCAQKATOSA-N Pro-His-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CS)C(=O)O DTQIXTOJHKVEOH-DCAQKATOSA-N 0.000 description 1
- AJCRQOHDLCBHFA-SRVKXCTJSA-N Pro-His-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AJCRQOHDLCBHFA-SRVKXCTJSA-N 0.000 description 1
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 1
- JRQCDSNPRNGWRG-AVGNSLFASA-N Pro-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2 JRQCDSNPRNGWRG-AVGNSLFASA-N 0.000 description 1
- YTWNSIDWAFSEEI-RWMBFGLXSA-N Pro-His-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N3CCC[C@@H]3C(=O)O YTWNSIDWAFSEEI-RWMBFGLXSA-N 0.000 description 1
- LPGSNRSLPHRNBW-AVGNSLFASA-N Pro-His-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 LPGSNRSLPHRNBW-AVGNSLFASA-N 0.000 description 1
- KWMUAKQOVYCQJQ-ZPFDUUQYSA-N Pro-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 KWMUAKQOVYCQJQ-ZPFDUUQYSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 1
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 1
- JUJCUYWRJMFJJF-AVGNSLFASA-N Pro-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 JUJCUYWRJMFJJF-AVGNSLFASA-N 0.000 description 1
- YAZNFQUKPUASKB-DCAQKATOSA-N Pro-Lys-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O YAZNFQUKPUASKB-DCAQKATOSA-N 0.000 description 1
- SMFQZMGHCODUPQ-ULQDDVLXSA-N Pro-Lys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SMFQZMGHCODUPQ-ULQDDVLXSA-N 0.000 description 1
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 1
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 1
- NTXFLJULRHQMDC-GUBZILKMSA-N Pro-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 NTXFLJULRHQMDC-GUBZILKMSA-N 0.000 description 1
- ANESFYPBAJPYNJ-SDDRHHMPSA-N Pro-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ANESFYPBAJPYNJ-SDDRHHMPSA-N 0.000 description 1
- GNADVDLLGVSXLS-ULQDDVLXSA-N Pro-Phe-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GNADVDLLGVSXLS-ULQDDVLXSA-N 0.000 description 1
- SWRNSCMUXRLHCR-ULQDDVLXSA-N Pro-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 SWRNSCMUXRLHCR-ULQDDVLXSA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- HOTVCUAVDQHUDB-UFYCRDLUSA-N Pro-Phe-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 HOTVCUAVDQHUDB-UFYCRDLUSA-N 0.000 description 1
- HWLKHNDRXWTFTN-GUBZILKMSA-N Pro-Pro-Cys Chemical compound C1C[C@H](NC1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CS)C(=O)O HWLKHNDRXWTFTN-GUBZILKMSA-N 0.000 description 1
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 1
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 1
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 1
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 1
- DMNANGOFEUVBRV-GJZGRUSLSA-N Pro-Trp-Gly Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(=O)O)C(=O)[C@@H]1CCCN1 DMNANGOFEUVBRV-GJZGRUSLSA-N 0.000 description 1
- CWZUFLWPEFHWEI-IHRRRGAJSA-N Pro-Tyr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O CWZUFLWPEFHWEI-IHRRRGAJSA-N 0.000 description 1
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 1
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 description 1
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 1
- UCTIUWKCVNGEFH-OBJOEFQTSA-N Pro-Val-Gly-Pro Chemical compound N([C@@H](C(C)C)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 UCTIUWKCVNGEFH-OBJOEFQTSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 108700020978 Proto-Oncogene Proteins 0.000 description 1
- 102000052575 Proto-Oncogene Human genes 0.000 description 1
- 241000125945 Protoparvovirus Species 0.000 description 1
- 241000589774 Pseudomonas sp. Species 0.000 description 1
- 229930185560 Pseudouridine Natural products 0.000 description 1
- PTJWIQPHWPFNBW-UHFFFAOYSA-N Pseudouridine C Natural products OC1C(O)C(CO)OC1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-UHFFFAOYSA-N 0.000 description 1
- 201000004681 Psoriasis Diseases 0.000 description 1
- 239000012979 RPMI medium Substances 0.000 description 1
- 239000012980 RPMI-1640 medium Substances 0.000 description 1
- 238000011529 RT qPCR Methods 0.000 description 1
- 241000711798 Rabies lyssavirus Species 0.000 description 1
- 101100237460 Rattus norvegicus Mgll gene Proteins 0.000 description 1
- 208000006265 Renal cell carcinoma Diseases 0.000 description 1
- 241000702263 Reovirus sp. Species 0.000 description 1
- 241000725643 Respiratory syncytial virus Species 0.000 description 1
- 241000712909 Reticuloendotheliosis virus Species 0.000 description 1
- 201000000582 Retinoblastoma Diseases 0.000 description 1
- 241000712907 Retroviridae Species 0.000 description 1
- 241000711931 Rhabdoviridae Species 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 241000606701 Rickettsia Species 0.000 description 1
- 239000006146 Roswell Park Memorial Institute medium Substances 0.000 description 1
- 241000702670 Rotavirus Species 0.000 description 1
- 206010039251 Rubber sensitivity Diseases 0.000 description 1
- 241000710799 Rubella virus Species 0.000 description 1
- 101150036449 SIRPA gene Proteins 0.000 description 1
- 241000607149 Salmonella sp. Species 0.000 description 1
- 206010039491 Sarcoma Diseases 0.000 description 1
- 241000242680 Schistosoma mansoni Species 0.000 description 1
- 206010040070 Septic Shock Diseases 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 1
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- WTPKKLMBNBCCNL-ACZMJKKPSA-N Ser-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N WTPKKLMBNBCCNL-ACZMJKKPSA-N 0.000 description 1
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 1
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- SVWQEIRZHHNBIO-WHFBIAKZSA-N Ser-Gly-Cys Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CS)C(O)=O SVWQEIRZHHNBIO-WHFBIAKZSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- ZFVFHHZBCVNLGD-GUBZILKMSA-N Ser-His-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFVFHHZBCVNLGD-GUBZILKMSA-N 0.000 description 1
- MLSQXWSRHURDMF-GARJFASQSA-N Ser-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N)C(=O)O MLSQXWSRHURDMF-GARJFASQSA-N 0.000 description 1
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 1
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- SRKMDKACHDVPMD-SRVKXCTJSA-N Ser-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N SRKMDKACHDVPMD-SRVKXCTJSA-N 0.000 description 1
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- KJKQUQXDEKMPDK-FXQIFTODSA-N Ser-Met-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O KJKQUQXDEKMPDK-FXQIFTODSA-N 0.000 description 1
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 1
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 1
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- UYLKOSODXYSWMQ-XGEHTFHBSA-N Ser-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N)O UYLKOSODXYSWMQ-XGEHTFHBSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- NERYDXBVARJIQS-JYBASQMISA-N Ser-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N)O NERYDXBVARJIQS-JYBASQMISA-N 0.000 description 1
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 1
- VEVYMLNYMULSMS-AVGNSLFASA-N Ser-Tyr-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEVYMLNYMULSMS-AVGNSLFASA-N 0.000 description 1
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 1
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 1
- YXGCIEUDOHILKR-IHRRRGAJSA-N Ser-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CO)N YXGCIEUDOHILKR-IHRRRGAJSA-N 0.000 description 1
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 1
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 1
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 1
- 241000713311 Simian immunodeficiency virus Species 0.000 description 1
- 208000000453 Skin Neoplasms Diseases 0.000 description 1
- 206010041067 Small cell lung cancer Diseases 0.000 description 1
- 108020004459 Small interfering RNA Proteins 0.000 description 1
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 1
- 208000021712 Soft tissue sarcoma Diseases 0.000 description 1
- 241000295644 Staphylococcaceae Species 0.000 description 1
- 241000191967 Staphylococcus aureus Species 0.000 description 1
- 206010072148 Stiff-Person syndrome Diseases 0.000 description 1
- 208000005718 Stomach Neoplasms Diseases 0.000 description 1
- 241001478880 Streptobacillus moniliformis Species 0.000 description 1
- 241000193985 Streptococcus agalactiae Species 0.000 description 1
- 241000193998 Streptococcus pneumoniae Species 0.000 description 1
- 241000193996 Streptococcus pyogenes Species 0.000 description 1
- 241000194022 Streptococcus sp. Species 0.000 description 1
- 241000193990 Streptococcus sp. 'group B' Species 0.000 description 1
- 206010042742 Sympathetic ophthalmia Diseases 0.000 description 1
- 230000006044 T cell activation Effects 0.000 description 1
- 230000024932 T cell mediated immunity Effects 0.000 description 1
- 230000006052 T cell proliferation Effects 0.000 description 1
- 102100027222 T-lymphocyte activation antigen CD80 Human genes 0.000 description 1
- 208000024313 Testicular Neoplasms Diseases 0.000 description 1
- 206010057644 Testis cancer Diseases 0.000 description 1
- 210000000068 Th17 cell Anatomy 0.000 description 1
- 210000004241 Th2 cell Anatomy 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 1
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 1
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 1
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 1
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 1
- ODSAPYVQSLDRSR-LKXGYXEUSA-N Thr-Cys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O ODSAPYVQSLDRSR-LKXGYXEUSA-N 0.000 description 1
- MMTOHPRBJKEZHT-BWBBJGPYSA-N Thr-Cys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O MMTOHPRBJKEZHT-BWBBJGPYSA-N 0.000 description 1
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 1
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 1
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 1
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 1
- WPSDXXQRIVKBAY-NKIYYHGXSA-N Thr-His-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O WPSDXXQRIVKBAY-NKIYYHGXSA-N 0.000 description 1
- KRGDDWVBBDLPSJ-CUJWVEQBSA-N Thr-His-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O KRGDDWVBBDLPSJ-CUJWVEQBSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- ISLDRLHVPXABBC-IEGACIPQSA-N Thr-Leu-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISLDRLHVPXABBC-IEGACIPQSA-N 0.000 description 1
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 1
- PCMDGXKXVMBIFP-VEVYYDQMSA-N Thr-Met-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMDGXKXVMBIFP-VEVYYDQMSA-N 0.000 description 1
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 1
- KDGBLMDAPJTQIW-RHYQMDGZSA-N Thr-Met-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N)O KDGBLMDAPJTQIW-RHYQMDGZSA-N 0.000 description 1
- PZSDPRBZINDEJV-HTUGSXCWSA-N Thr-Phe-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PZSDPRBZINDEJV-HTUGSXCWSA-N 0.000 description 1
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 1
- MEBDIIKMUUNBSB-RPTUDFQQSA-N Thr-Phe-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MEBDIIKMUUNBSB-RPTUDFQQSA-N 0.000 description 1
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 1
- DNCUODYZAMHLCV-XGEHTFHBSA-N Thr-Pro-Cys Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N)O DNCUODYZAMHLCV-XGEHTFHBSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- RPECVQBNONKZAT-WZLNRYEVSA-N Thr-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H]([C@@H](C)O)N RPECVQBNONKZAT-WZLNRYEVSA-N 0.000 description 1
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 1
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 208000031981 Thrombocytopenic Idiopathic Purpura Diseases 0.000 description 1
- 102000002938 Thrombospondin Human genes 0.000 description 1
- 108060008245 Thrombospondin Proteins 0.000 description 1
- 208000024770 Thyroid neoplasm Diseases 0.000 description 1
- 241000710924 Togaviridae Species 0.000 description 1
- 102000008235 Toll-Like Receptor 9 Human genes 0.000 description 1
- 108010060818 Toll-Like Receptor 9 Proteins 0.000 description 1
- 102100024324 Toll-like receptor 3 Human genes 0.000 description 1
- 102100039360 Toll-like receptor 4 Human genes 0.000 description 1
- 102100039390 Toll-like receptor 7 Human genes 0.000 description 1
- 102100033110 Toll-like receptor 8 Human genes 0.000 description 1
- 241000223997 Toxoplasma gondii Species 0.000 description 1
- 102300034786 Transcription factor PU.1 isoform 1 Human genes 0.000 description 1
- 102300034812 Transcription factor PU.1 isoform 2 Human genes 0.000 description 1
- 102100032727 Transcription factor RelB Human genes 0.000 description 1
- 241000589904 Treponema pallidum subsp. pertenue Species 0.000 description 1
- 239000007984 Tris EDTA buffer Substances 0.000 description 1
- NXAPHBHZCMQORW-FDARSICLSA-N Trp-Arg-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NXAPHBHZCMQORW-FDARSICLSA-N 0.000 description 1
- ZCPCXVJOMUPIDD-IHPCNDPISA-N Trp-Asp-Phe Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 ZCPCXVJOMUPIDD-IHPCNDPISA-N 0.000 description 1
- LTLBNCDNXQCOLB-UBHSHLNASA-N Trp-Asp-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 LTLBNCDNXQCOLB-UBHSHLNASA-N 0.000 description 1
- VMBBTANKMSRJSS-JSGCOSHPSA-N Trp-Glu-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VMBBTANKMSRJSS-JSGCOSHPSA-N 0.000 description 1
- DNUJCLUFRGGSDJ-YLVFBTJISA-N Trp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N DNUJCLUFRGGSDJ-YLVFBTJISA-N 0.000 description 1
- SNWIAPVRCNYFNI-SZMVWBNQSA-N Trp-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SNWIAPVRCNYFNI-SZMVWBNQSA-N 0.000 description 1
- IQIRAJGHFRVFEL-UBHSHLNASA-N Trp-Ser-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N IQIRAJGHFRVFEL-UBHSHLNASA-N 0.000 description 1
- DTPWXZXGFAHEKL-NWLDYVSISA-N Trp-Thr-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DTPWXZXGFAHEKL-NWLDYVSISA-N 0.000 description 1
- RKISDJMICOREEL-QRTARXTBSA-N Trp-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RKISDJMICOREEL-QRTARXTBSA-N 0.000 description 1
- GLNADSQYFUSGOU-GPTZEZBUSA-J Trypan blue Chemical compound [Na+].[Na+].[Na+].[Na+].C1=C(S([O-])(=O)=O)C=C2C=C(S([O-])(=O)=O)C(/N=N/C3=CC=C(C=C3C)C=3C=C(C(=CC=3)\N=N\C=3C(=CC4=CC(=CC(N)=C4C=3O)S([O-])(=O)=O)S([O-])(=O)=O)C)=C(O)C2=C1N GLNADSQYFUSGOU-GPTZEZBUSA-J 0.000 description 1
- 241000223104 Trypanosoma Species 0.000 description 1
- 241001442399 Trypanosoma brucei gambiense Species 0.000 description 1
- 241001442397 Trypanosoma brucei rhodesiense Species 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- 102100040245 Tumor necrosis factor receptor superfamily member 5 Human genes 0.000 description 1
- OOEUVMFKKZYSRX-LEWSCRJBSA-N Tyr-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OOEUVMFKKZYSRX-LEWSCRJBSA-N 0.000 description 1
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 1
- HTHCZRWCFXMENJ-KKUMJFAQSA-N Tyr-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HTHCZRWCFXMENJ-KKUMJFAQSA-N 0.000 description 1
- IIJWXEUNETVJPV-IHRRRGAJSA-N Tyr-Arg-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N)O IIJWXEUNETVJPV-IHRRRGAJSA-N 0.000 description 1
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 1
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- ARPONUQDNWLXOZ-KKUMJFAQSA-N Tyr-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ARPONUQDNWLXOZ-KKUMJFAQSA-N 0.000 description 1
- CWQZAUYFWRLITN-AVGNSLFASA-N Tyr-Gln-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O CWQZAUYFWRLITN-AVGNSLFASA-N 0.000 description 1
- MPKPIWFFDWVJGC-IRIUXVKKSA-N Tyr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O MPKPIWFFDWVJGC-IRIUXVKKSA-N 0.000 description 1
- IWRMTNJCCMEBEX-AVGNSLFASA-N Tyr-Glu-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O IWRMTNJCCMEBEX-AVGNSLFASA-N 0.000 description 1
- LMLBOGIOLHZXOT-JYJNAYRXSA-N Tyr-Glu-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O LMLBOGIOLHZXOT-JYJNAYRXSA-N 0.000 description 1
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 1
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 1
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 1
- ADECJAKCRKPSOR-ULQDDVLXSA-N Tyr-His-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O ADECJAKCRKPSOR-ULQDDVLXSA-N 0.000 description 1
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 1
- OHOVFPKXPZODHS-SJWGOKEGSA-N Tyr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OHOVFPKXPZODHS-SJWGOKEGSA-N 0.000 description 1
- FJBCEFPCVPHPPM-STECZYCISA-N Tyr-Ile-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O FJBCEFPCVPHPPM-STECZYCISA-N 0.000 description 1
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 1
- PHKQVWWHRYUCJL-HJOGWXRNSA-N Tyr-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PHKQVWWHRYUCJL-HJOGWXRNSA-N 0.000 description 1
- QKXAEWMHAAVVGS-KKUMJFAQSA-N Tyr-Pro-Glu Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O QKXAEWMHAAVVGS-KKUMJFAQSA-N 0.000 description 1
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 1
- MNWINJDPGBNOED-ULQDDVLXSA-N Tyr-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 MNWINJDPGBNOED-ULQDDVLXSA-N 0.000 description 1
- VPEFOFYNHBWFNQ-UFYCRDLUSA-N Tyr-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 VPEFOFYNHBWFNQ-UFYCRDLUSA-N 0.000 description 1
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 1
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- KHPLUFDSWGDRHD-SLFFLAALSA-N Tyr-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O KHPLUFDSWGDRHD-SLFFLAALSA-N 0.000 description 1
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 1
- 201000006704 Ulcerative Colitis Diseases 0.000 description 1
- 208000008385 Urogenital Neoplasms Diseases 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 1
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 1
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 1
- DLYOEFGPYTZVSP-AEJSXWLSSA-N Val-Cys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N DLYOEFGPYTZVSP-AEJSXWLSSA-N 0.000 description 1
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 1
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- OPGWZDIYEYJVRX-AVGNSLFASA-N Val-His-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OPGWZDIYEYJVRX-AVGNSLFASA-N 0.000 description 1
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 1
- YTUABZMPYKCWCQ-XQQFMLRXSA-N Val-His-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N YTUABZMPYKCWCQ-XQQFMLRXSA-N 0.000 description 1
- HQYVQDRYODWONX-DCAQKATOSA-N Val-His-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N HQYVQDRYODWONX-DCAQKATOSA-N 0.000 description 1
- PYPZMFDMCCWNST-NAKRPEOUSA-N Val-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N PYPZMFDMCCWNST-NAKRPEOUSA-N 0.000 description 1
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 1
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 1
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- 241000700647 Variola virus Species 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 241000711975 Vesicular stomatitis virus Species 0.000 description 1
- 108010087302 Viral Structural Proteins Proteins 0.000 description 1
- 108010031318 Vitronectin Proteins 0.000 description 1
- 102100035140 Vitronectin Human genes 0.000 description 1
- 208000033559 Waldenström macroglobulinemia Diseases 0.000 description 1
- 241000244005 Wuchereria bancrofti Species 0.000 description 1
- 241000710772 Yellow fever virus Species 0.000 description 1
- 241000607479 Yersinia pestis Species 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 238000007792 addition Methods 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 108700010877 adenoviridae proteins Proteins 0.000 description 1
- 210000001789 adipocyte Anatomy 0.000 description 1
- 201000005188 adrenal gland cancer Diseases 0.000 description 1
- 208000024447 adrenal gland neoplasm Diseases 0.000 description 1
- 201000006966 adult T-cell leukemia Diseases 0.000 description 1
- 239000013567 aeroallergen Substances 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 238000001261 affinity purification Methods 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 1
- 239000013566 allergen Substances 0.000 description 1
- 230000000172 allergic effect Effects 0.000 description 1
- 208000028004 allergic respiratory disease Diseases 0.000 description 1
- 230000003466 anti-cipated effect Effects 0.000 description 1
- 230000002141 anti-parasite Effects 0.000 description 1
- 230000000259 anti-tumor effect Effects 0.000 description 1
- 230000005809 anti-tumor immunity Effects 0.000 description 1
- 230000006023 anti-tumor response Effects 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 239000003096 antiparasitic agent Substances 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 1
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 1
- 229960005070 ascorbic acid Drugs 0.000 description 1
- 235000010323 ascorbic acid Nutrition 0.000 description 1
- 239000011668 ascorbic acid Substances 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 208000006673 asthma Diseases 0.000 description 1
- 244000309743 astrovirus Species 0.000 description 1
- 201000000448 autoimmune hemolytic anemia Diseases 0.000 description 1
- 201000003710 autoimmune thrombocytopenic purpura Diseases 0.000 description 1
- 229940065181 bacillus anthracis Drugs 0.000 description 1
- 239000007640 basal medium Substances 0.000 description 1
- 210000003651 basophil Anatomy 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- WGDUUQDYDIIBKT-UHFFFAOYSA-N beta-Pseudouridine Natural products OC1OC(CN2C=CC(=O)NC2=O)C(O)C1O WGDUUQDYDIIBKT-UHFFFAOYSA-N 0.000 description 1
- 239000011230 binding agent Substances 0.000 description 1
- 230000007321 biological mechanism Effects 0.000 description 1
- 210000000601 blood cell Anatomy 0.000 description 1
- 210000002798 bone marrow cell Anatomy 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 239000008366 buffered solution Substances 0.000 description 1
- 229940095731 candida albicans Drugs 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 208000002458 carcinoid tumor Diseases 0.000 description 1
- 210000004413 cardiac myocyte Anatomy 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 230000010307 cell transformation Effects 0.000 description 1
- 210000003169 central nervous system Anatomy 0.000 description 1
- 208000015114 central nervous system disease Diseases 0.000 description 1
- 210000002230 centromere Anatomy 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 229940038705 chlamydia trachomatis Drugs 0.000 description 1
- 229960004926 chlorobutanol Drugs 0.000 description 1
- 229940059329 chondroitin sulfate Drugs 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 208000016644 chronic atrophic gastritis Diseases 0.000 description 1
- 208000032852 chronic lymphocytic leukemia Diseases 0.000 description 1
- 208000025302 chronic primary adrenal insufficiency Diseases 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 230000003021 clonogenic effect Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000009073 conformational modification Effects 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000005260 corrosion Methods 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- 208000035250 cutaneous malignant susceptibility to 1 melanoma Diseases 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 238000007822 cytometric assay Methods 0.000 description 1
- 238000004163 cytometry Methods 0.000 description 1
- 210000001151 cytotoxic T lymphocyte Anatomy 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 238000000586 desensitisation Methods 0.000 description 1
- MXHRCPNRJAMMIM-UHFFFAOYSA-N desoxyuridine Natural products C1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 MXHRCPNRJAMMIM-UHFFFAOYSA-N 0.000 description 1
- 238000009792 diffusion process Methods 0.000 description 1
- 206010013023 diphtheria Diseases 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 239000002612 dispersion medium Substances 0.000 description 1
- 210000003515 double negative t cell Anatomy 0.000 description 1
- 230000003828 downregulation Effects 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 210000003981 ectoderm Anatomy 0.000 description 1
- 210000001671 embryonic stem cell Anatomy 0.000 description 1
- 206010014599 encephalitis Diseases 0.000 description 1
- 230000002124 endocrine Effects 0.000 description 1
- 210000001900 endoderm Anatomy 0.000 description 1
- 210000002889 endothelial cell Anatomy 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 210000003979 eosinophil Anatomy 0.000 description 1
- 201000004101 esophageal cancer Diseases 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- 210000002744 extracellular matrix Anatomy 0.000 description 1
- 210000001508 eye Anatomy 0.000 description 1
- 208000030533 eye disease Diseases 0.000 description 1
- 230000001605 fetal effect Effects 0.000 description 1
- 210000003754 fetus Anatomy 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 235000020932 food allergy Nutrition 0.000 description 1
- 210000003918 fraction a Anatomy 0.000 description 1
- 239000012737 fresh medium Substances 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 238000002825 functional assay Methods 0.000 description 1
- 206010017758 gastric cancer Diseases 0.000 description 1
- 230000002496 gastric effect Effects 0.000 description 1
- 201000011587 gastric lymphoma Diseases 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 208000005017 glioblastoma Diseases 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 208000024908 graft versus host disease Diseases 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- 201000009277 hairy cell leukemia Diseases 0.000 description 1
- 201000010536 head and neck cancer Diseases 0.000 description 1
- 210000002216 heart Anatomy 0.000 description 1
- 210000002064 heart cell Anatomy 0.000 description 1
- 231100000844 hepatocellular carcinoma Toxicity 0.000 description 1
- 210000003494 hepatocyte Anatomy 0.000 description 1
- 210000003630 histaminocyte Anatomy 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 230000000148 hypercalcaemia Effects 0.000 description 1
- 208000030915 hypercalcemia disease Diseases 0.000 description 1
- 206010020718 hyperplasia Diseases 0.000 description 1
- 230000002989 hypothyroidism Effects 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 230000036737 immune function Effects 0.000 description 1
- 230000037451 immune surveillance Effects 0.000 description 1
- 208000026278 immune system disease Diseases 0.000 description 1
- 238000003365 immunocytochemistry Methods 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 229940072221 immunoglobulins Drugs 0.000 description 1
- 230000008975 immunomodulatory function Effects 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 230000015788 innate immune response Effects 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 238000002743 insertional mutagenesis Methods 0.000 description 1
- 229940047124 interferons Drugs 0.000 description 1
- 108090000681 interleukin 20 Proteins 0.000 description 1
- 229940047122 interleukins Drugs 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 208000002551 irritable bowel syndrome Diseases 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 210000003292 kidney cell Anatomy 0.000 description 1
- 210000001821 langerhans cell Anatomy 0.000 description 1
- 201000005391 latex allergy Diseases 0.000 description 1
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 208000032839 leukemia Diseases 0.000 description 1
- 201000002364 leukopenia Diseases 0.000 description 1
- 231100001022 leukopenia Toxicity 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 201000007270 liver cancer Diseases 0.000 description 1
- 210000005229 liver cell Anatomy 0.000 description 1
- 208000019423 liver disease Diseases 0.000 description 1
- 208000014018 liver neoplasm Diseases 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 201000005202 lung cancer Diseases 0.000 description 1
- 210000005265 lung cell Anatomy 0.000 description 1
- 208000020816 lung neoplasm Diseases 0.000 description 1
- 210000002751 lymph Anatomy 0.000 description 1
- 239000008176 lyophilized powder Substances 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 201000000564 macroglobulinemia Diseases 0.000 description 1
- 238000007885 magnetic separation Methods 0.000 description 1
- 238000002826 magnetic-activated cell sorting Methods 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 208000026037 malignant tumor of neck Diseases 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 210000004379 membrane Anatomy 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 210000003716 mesoderm Anatomy 0.000 description 1
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 1
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 238000001823 molecular biology technique Methods 0.000 description 1
- 239000004570 mortar (masonry) Substances 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 201000005962 mycosis fungoides Diseases 0.000 description 1
- 201000000050 myeloid neoplasm Diseases 0.000 description 1
- 210000000107 myocyte Anatomy 0.000 description 1
- 208000003786 myxedema Diseases 0.000 description 1
- OHDXDNUPVVYWOV-UHFFFAOYSA-N n-methyl-1-(2-naphthalen-1-ylsulfanylphenyl)methanamine Chemical compound CNCC1=CC=CC=C1SC1=CC=CC2=CC=CC=C12 OHDXDNUPVVYWOV-UHFFFAOYSA-N 0.000 description 1
- 201000011216 nasopharynx carcinoma Diseases 0.000 description 1
- 210000000581 natural killer T-cell Anatomy 0.000 description 1
- 230000001338 necrotic effect Effects 0.000 description 1
- 210000000441 neoplastic stem cell Anatomy 0.000 description 1
- 210000000440 neutrophil Anatomy 0.000 description 1
- 208000002154 non-small cell lung carcinoma Diseases 0.000 description 1
- 230000009871 nonspecific binding Effects 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- 231100000590 oncogenic Toxicity 0.000 description 1
- 230000002246 oncogenic effect Effects 0.000 description 1
- 201000008968 osteosarcoma Diseases 0.000 description 1
- GSSMIHQEWAQUPM-AOLPDKKJSA-N ovalbumin peptide Chemical compound C([C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)[C@@H](C)CC)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C1=CN=CN1 GSSMIHQEWAQUPM-AOLPDKKJSA-N 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 208000021255 pancreatic insulinoma Diseases 0.000 description 1
- 208000003154 papilloma Diseases 0.000 description 1
- 230000003071 parasitic effect Effects 0.000 description 1
- 229940051027 pasteurella multocida Drugs 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 229940049954 penicillin Drugs 0.000 description 1
- 210000001428 peripheral nervous system Anatomy 0.000 description 1
- 208000027232 peripheral nervous system disease Diseases 0.000 description 1
- 230000002085 persistent effect Effects 0.000 description 1
- 230000008782 phagocytosis Effects 0.000 description 1
- 239000008194 pharmaceutical composition Substances 0.000 description 1
- 229960003742 phenol Drugs 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 229940118768 plasmodium malariae Drugs 0.000 description 1
- 238000007747 plating Methods 0.000 description 1
- 229920000729 poly(L-lysine) polymer Polymers 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 208000017805 post-transplant lymphoproliferative disease Diseases 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 210000004765 promyelocyte Anatomy 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- PTJWIQPHWPFNBW-GBNDHIKLSA-N pseudouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-GBNDHIKLSA-N 0.000 description 1
- 230000002685 pulmonary effect Effects 0.000 description 1
- 239000002213 purine nucleotide Substances 0.000 description 1
- 150000003212 purines Chemical class 0.000 description 1
- 150000003230 pyrimidines Chemical class 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000008844 regulatory mechanism Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 201000004335 respiratory allergy Diseases 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 210000001995 reticulocyte Anatomy 0.000 description 1
- 230000003307 reticuloendothelial effect Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 201000009410 rhabdomyosarcoma Diseases 0.000 description 1
- 239000002342 ribonucleoside Substances 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- DWRXFEITVBNRMK-JXOAFFINSA-N ribothymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 DWRXFEITVBNRMK-JXOAFFINSA-N 0.000 description 1
- 239000004576 sand Substances 0.000 description 1
- 230000001932 seasonal effect Effects 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 230000036303 septic shock Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 239000004017 serum-free culture medium Substances 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010007375 seryl-seryl-seryl-arginine Proteins 0.000 description 1
- 201000000849 skin cancer Diseases 0.000 description 1
- 208000000587 small cell lung carcinoma Diseases 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 210000004989 spleen cell Anatomy 0.000 description 1
- 210000004988 splenocyte Anatomy 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 210000002784 stomach Anatomy 0.000 description 1
- 201000011549 stomach cancer Diseases 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 229940031000 streptococcus pneumoniae Drugs 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 230000002195 synergetic effect Effects 0.000 description 1
- 201000003120 testicular cancer Diseases 0.000 description 1
- RTKIYNMVFMVABJ-UHFFFAOYSA-L thimerosal Chemical compound [Na+].CC[Hg]SC1=CC=CC=C1C([O-])=O RTKIYNMVFMVABJ-UHFFFAOYSA-L 0.000 description 1
- 229940033663 thimerosal Drugs 0.000 description 1
- 201000002510 thyroid cancer Diseases 0.000 description 1
- 206010043778 thyroiditis Diseases 0.000 description 1
- 208000005057 thyrotoxicosis Diseases 0.000 description 1
- 210000002303 tibia Anatomy 0.000 description 1
- 230000000699 topical effect Effects 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000010474 transient expression Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 238000002054 transplantation Methods 0.000 description 1
- 239000001226 triphosphate Substances 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 108010045269 tryptophyltryptophan Proteins 0.000 description 1
- 201000008827 tuberculosis Diseases 0.000 description 1
- 230000005751 tumor progression Effects 0.000 description 1
- 208000029729 tumor suppressor gene on chromosome 11 Diseases 0.000 description 1
- 210000004981 tumor-associated macrophage Anatomy 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 241000724775 unclassified viruses Species 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 210000002444 unipotent stem cell Anatomy 0.000 description 1
- 238000011870 unpaired t-test Methods 0.000 description 1
- 210000000689 upper leg Anatomy 0.000 description 1
- 210000004291 uterus Anatomy 0.000 description 1
- 238000002255 vaccination Methods 0.000 description 1
- 108010027510 vaccinia virus capping enzyme Proteins 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 244000052613 viral pathogen Species 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
- 210000001835 viscera Anatomy 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 229940051021 yellow-fever virus Drugs 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N5/00—Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
- C12N5/06—Animal cells or tissues; Human cells or tissues
- C12N5/0602—Vertebrate cells
- C12N5/0634—Cells from the blood or the immune system
- C12N5/0639—Dendritic cells, e.g. Langherhans cells in the epidermis
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K40/00—Cellular immunotherapy
- A61K40/40—Cellular immunotherapy characterised by antigens that are targeted or presented by cells of the immune system
- A61K40/41—Vertebrate antigens
- A61K40/42—Cancer antigens
- A61K40/4242—Transcription factors, e.g. SOX or c-MYC
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K35/00—Medicinal preparations containing materials or reaction products thereof with undetermined constitution
- A61K35/12—Materials from mammals; Compositions comprising non-specified tissues or cells; Compositions comprising non-embryonic stem cells; Genetically modified cells
- A61K35/28—Bone marrow; Haematopoietic stem cells; Mesenchymal stem cells of any origin, e.g. adipose-derived stem cells
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K35/00—Medicinal preparations containing materials or reaction products thereof with undetermined constitution
- A61K35/12—Materials from mammals; Compositions comprising non-specified tissues or cells; Compositions comprising non-embryonic stem cells; Genetically modified cells
- A61K35/37—Digestive system
- A61K35/38—Stomach; Intestine; Goblet cells; Oral mucosa; Saliva
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K35/00—Medicinal preparations containing materials or reaction products thereof with undetermined constitution
- A61K35/12—Materials from mammals; Compositions comprising non-specified tissues or cells; Compositions comprising non-embryonic stem cells; Genetically modified cells
- A61K35/48—Reproductive organs
- A61K35/54—Ovaries; Ova; Ovules; Embryos; Foetal cells; Germ cells
- A61K35/545—Embryonic stem cells; Pluripotent stem cells; Induced pluripotent stem cells; Uncharacterised stem cells
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K40/00—Cellular immunotherapy
- A61K40/10—Cellular immunotherapy characterised by the cell type used
- A61K40/19—Dendritic cells
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K40/00—Cellular immunotherapy
- A61K40/20—Cellular immunotherapy characterised by the effect or the function of the cells
- A61K40/24—Antigen-presenting cells [APC]
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K40/00—Cellular immunotherapy
- A61K40/40—Cellular immunotherapy characterised by antigens that are targeted or presented by cells of the immune system
- A61K40/41—Vertebrate antigens
- A61K40/42—Cancer antigens
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
- C07K14/4701—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
- C07K14/4702—Regulators; Modulating activity
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K2121/00—Preparations for use in therapy
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K2300/00—Mixtures or combinations of active ingredients, wherein at least one active ingredient is fully defined in groups A61K31/00 - A61K41/00
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K9/00—Medicinal preparations characterised by special physical form
- A61K9/0012—Galenical forms characterised by the site of application
- A61K9/0019—Injectable compositions; Intramuscular, intravenous, arterial, subcutaneous administration; Compositions to be administered through the skin in an invasive manner
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2501/00—Active agents used in cell culture processes, e.g. differentation
- C12N2501/60—Transcription factors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2506/00—Differentiation of animal cells from one lineage to another; Differentiation of pluripotent cells
- C12N2506/13—Differentiation of animal cells from one lineage to another; Differentiation of pluripotent cells from connective tissue cells, from mesenchymal cells
- C12N2506/1307—Differentiation of animal cells from one lineage to another; Differentiation of pluripotent cells from connective tissue cells, from mesenchymal cells from adult fibroblasts
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2506/00—Differentiation of animal cells from one lineage to another; Differentiation of pluripotent cells
- C12N2506/30—Differentiation of animal cells from one lineage to another; Differentiation of pluripotent cells from cancer cells, e.g. reversion of tumour cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2506/00—Differentiation of animal cells from one lineage to another; Differentiation of pluripotent cells
- C12N2506/45—Differentiation of animal cells from one lineage to another; Differentiation of pluripotent cells from artificially induced pluripotent stem cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2780/00—Naked RNA viruses
- C12N2780/00011—Details
- C12N2780/00041—Use of virus, viral particle or viral elements as a vector
- C12N2780/00043—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Biomedical Technology (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Biotechnology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Animal Behavior & Ethology (AREA)
- Epidemiology (AREA)
- Organic Chemistry (AREA)
- Cell Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Immunology (AREA)
- General Engineering & Computer Science (AREA)
- Developmental Biology & Embryology (AREA)
- Virology (AREA)
- Medicinal Chemistry (AREA)
- Biochemistry (AREA)
- Pharmacology & Pharmacy (AREA)
- Microbiology (AREA)
- Hematology (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Reproductive Health (AREA)
- Dermatology (AREA)
- Physiology (AREA)
- Nutrition Science (AREA)
- Gynecology & Obstetrics (AREA)
- Toxicology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Gastroenterology & Hepatology (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- General Chemical & Material Sciences (AREA)
Abstract
本公开内容涉及用于将细胞重新程序化为常规树突细胞(cDC)、特别是cDC 2型(在下文中被称作“cDC2”或“CD11b阳性树突细胞”)的组合物、其方法和用途。本公开内容涉及通过引入和表达分离的/合成的转录因子从分化的、多潜能或多能干细胞制备具有抗原呈递能力的常规树突细胞的方法的开发。更具体地,本公开内容提供了通过直接细胞重新程序化和令人惊讶地使用特定转录因子的组合来获得常规树突细胞(cDC)、特别是cDC 2型或CD11b阳性树突细胞的方法。
Description
技术领域
本公开内容涉及用于将细胞重新程序化为常规树突细胞(cDC)、特别是cDC 2型(在下文中被称作“cDC2”或“CD11b阳性树突细胞”)的组合物、方法及其用途。
本公开内容涉及通过引入和表达分离的/合成的转录因子,从分化的、多潜能或多能干细胞制备具有抗原呈递能力的常规树突细胞的方法的开发。更具体地,本公开内容提供了通过直接细胞重新程序化和令人惊讶地使用特定转录因子的组合来获得常规树突细胞(cDC)、特别是cDC 2型或CD11b阳性树突细胞的方法。
背景
细胞重新程序化依赖于将一种细胞状态的表观遗传和转录网络重新连接到不同细胞类型的表观遗传和转录网络。转录因子(TF)-过表达实验已强调了成体体细胞或分化的细胞的可塑性,从而提供了生成任何期望细胞类型的新技术。通过TF的强制表达,可能将体细胞或分化的细胞重新程序化为与胚胎干细胞非常相似的诱导的多能干细胞(iPSC)(Takahashi等人,2007;Takahashi和Yamanaka,2006)。可替换地,还可以将体细胞直接转化为另一种特化细胞类型(Pereira,Lemischka,和Moore,2012)。使用规定靶细胞身份的TF,已证明直接谱系转化成功地将小鼠和人成纤维细胞重新程序化为几种细胞类型,诸如神经元、心肌细胞和肝细胞(Xu,Du,和Deng,2015)。还在造血系统中证实了直接细胞转化,其中TF的强制表达诱导了B细胞和成纤维细胞中的巨噬细胞命运(Xie,Ye,Feng,和Graf,2004),并且用Gata2、Gfi1b、cFos和Etv6实现了将小鼠成纤维细胞直接重新程序化为克隆原性的造血祖细胞(Pereira等人,2013)。这四种TF诱导动态的多阶段造血过程,其通过内皮样中间体进展,在体外重演发育造血作用(Pereira等人,2016)。
重新程序化的细胞对于再生医学而言是非常有前途的治疗工具,并且已在临床研究中测试了通过iPSC的分化获得的细胞。
细胞重新程序化策略已经强调了细胞命运的灵活性,可能使用细胞类型特异性的TF将体细胞转化为多能性。一种分化的细胞类型向另一种的直接谱系转化也已被证明和探索,以阐明细胞生物学机制和用于再生医学目的。近年来,已经证实,抗原呈递树突细胞可以通过TF的小组合从不相关的细胞类型重新程序化。经典地,有人认为,骨髓DC定向祖细胞产生在功能上不同的DC子集:常规DC(cDC)(其为专业抗原呈递细胞(APC))和浆细胞样DC(pDC)。cDC驱动抗原特异性免疫应答,而pDC是在病毒感染期间I型干扰素的专业生产者。但是,在DC发育期间调节不同子集的分歧的时机和确切机制仍有待确定。
DC是一类源自淋巴-骨髓造血作用的骨髓衍生细胞,其扫描生物体中的病原体,在先天性免疫系统和适应性免疫的活化之间形成必要的界面。DC充当专业的APC,其能够通过在表面展示与主要组织相容性复合物(MHC)复合的肽抗原以及所有必要的可溶性的和膜相关的共刺激分子来活化T细胞应答。DC诱导初级免疫应答,增强先前引发的T淋巴细胞的效应功能,并协调先天性免疫和适应性免疫之间的交流。DC存在于大多数组织中,它们在那里不断地对抗原环境进行采样并使用几种类型的受体来监测入侵的病原体。在稳定状态下,并且在检测到病原体后以增加的速率,非淋巴样组织中的前哨DC迁移到淋巴样器官,它们在那里将它们已经收集和处理的抗原呈递给T细胞。T细胞获得的表型取决于抗原呈递的背景。如果抗原来源于病原体,或者自身受损,DC将收到危险信号,变得活化并随后刺激T细胞成为效应子,这是提供保护性免疫所必需的。
免疫应答控制的一个重要方面是存在几种不同类型的DC,每一种专门对特定病原体作出应答并与T细胞的特定子集相互作用。在该背景下,出现了三种主要的DC子集:浆细胞样DC(pDC)、骨髓/常规DC1(cDC1)和骨髓/常规DC2(cDC2)。这扩大了免疫系统对宽范围的不同病原体和危险信号作出适当反应的灵活性。
cDC2以CD11b表面表达为特征,并专门用于MHC-II呈递,其将原初CD4+ T细胞极化导向辅助Th2和Th17(Plantinga等人,2013)。尽管Th2-相关的细胞因子(IL-4、IL-5、IL-9和IL-13)介导与针对细胞外寄生虫的保护相关的应答(Mosmann和Coffman,1989)并诱导变态反应和超敏反应(Kopf等人,1993;Zhu和Paul,2008),但是Th17与针对细胞外细菌和真菌的免疫应答有关,还诱发许多自身免疫性疾病(Weaver,Harrington,Mangan,Gavrieli,和Murphy,2006)。在肿瘤中,已知cDC2通过参与MHC-II上的抗原呈递至肿瘤-引流淋巴结中的CD4+ T细胞来补充cDC1(Merad,Sathe,Helft,Miller,和Mortha,2013)。在一项研究中,证明肿瘤衍生的cDC2的抗原呈递驱动肿瘤相关巨噬细胞向Th17依赖性物质上的抗肿瘤表型的转变(Laoui等人,2016)。cDC2还通过引发调节性T细胞(Treg)促进效应T细胞的下调,所述调节性T细胞(Treg)通过破坏自体反应性CD4+ T细胞(Merad等人,2013)和负调节免疫应答(Sakaguchi,2004)对维持自身耐受性至关重要。
另外的cDC2特征性标志物包括CD11b、Sirpα、CD4和ESAM。由于cDC2固有的异质性,一些特定的表面标志物表征了特定的子集。最近的发现已经鉴定了由不同的转录调节剂和独特的免疫功能定义的两个不同的cDC2子集(Brown等人,2019)。虽然cDC2A是被定义为Tbet依赖性的并且以Esam和Clec4a4的表面表达为特征的抗炎子集,但cDC2B由相当促炎的RORγt定义的表达Clec10a和Clec12a标志物的子集组成。
DC诱导适应性免疫的能力已经促进了对细菌、病毒和寄生虫病原体的DC疫苗接种策略和免疫疗法(即在癌症中)的研究。事实上,正在进行临床试验,其利用DC介导的免疫疗法以治疗多种肿瘤类型,包括实体瘤和血液肿瘤(Datta等人,2014)。但是,临床结果并不一致,可能与体外产生的DC的可变效率有关:自体单核细胞可以在体外分化成效率较低的DC,并且以非常低的数目分离出造血祖细胞。此外,这些前体细胞通常在患有癌症的患者中受到损害,导致产生功能失调的DC(Datta等人,2014;Subklewe等人,2014)。癌症逃避机制也可能是基于DC的免疫疗法中缺乏一致的治疗优势的基础。在肿瘤进展期间,癌细胞利用几种免疫过程来逃避免疫监视。这些适应与癌症抗原异质性一起,阻止了免疫系统对肿瘤抗原的识别,并从而导致肿瘤细胞和当前免疫疗法的免疫原性降低。
通过直接重新程序化来产生APC为更好地理解DC规格和细胞身份提供了新的机会,有助于使用自体工程改造的细胞更有效地控制免疫应答。
文件EP 3 385 373涉及用于细胞诱导或将细胞重新程序化为DC状态或APC状态的组合物、核酸构建体、方法和其试剂盒,其部分基于允许诱导或重新程序化分化的或未分化的细胞为DC或APC的TF的新用途和组合的惊人效果。
在文件EP3385373中描述的生成的重新程序化细胞主要重演了DC的cDC1子集特有的表面标志物表达、抗原呈递、细胞因子释放和T-细胞活化特征。未描述其它DC子集的表型特征。
抗原呈递细胞(APC)是一组异质的免疫细胞,其通过加工和呈递抗原以供某些淋巴细胞(诸如T细胞)识别来介导细胞免疫应答。经典的APC包括树突细胞、巨噬细胞、朗格汉斯细胞和B细胞。
DC通过其捕获、处理和呈递抗原给T细胞的能力,将它们靶向至不同类型的免疫应答或诱导耐受性应答,从而在外部环境和适应性免疫系统之间提供至关重要的联系。
表型标准允许将小鼠DC分类为不同的亚群,其特征在于不同表面标志物的表达。在淋巴样组织中的常规DC(cDC)传统上细分为cDC1和cDC2亚群。不同的DC子集参与某些病原体的特异性识别和/或调节不同的免疫应答。虽然cDC1与Th1应答的启动有关,对促进肿瘤清除而言是重要的,但cDC2子集已经与Th1、Th2、Th17(免疫)和Treg(耐受性)应答有关。
文件EP 3 385 373涉及用于细胞诱导或将细胞重新程序化为DC状态或APC状态的组合物、核酸构建体、方法及其试剂盒,其部分基于允许诱导或重新程序化分化的或未分化的细胞为DC或APC(更具体地cDC1)的TF的新用途和组合的惊人效果。
目前,基于DC的免疫疗法依赖于自体DC前体:与效率较低的DC的产生相关的单核细胞,或以非常低的数目分离的造血祖细胞。此外,这些前体细胞通常在患有癌症的患者中受到损害,导致功能失调的DC的产生。另一方面,非造血细胞类型(诸如成纤维细胞)通常不受影响。鉴于DC作为桥接先天性免疫系统和适应性免疫系统的APC的基本作用,临床仍需要寻找替代策略来产生功能性DC以引发抗原特异性免疫应答。
发明概述
通过本公开内容的直接重新程序化产生的诱导的DC或APC出人意料地重演了关于DC的cDC2子集的MHC-II分子中的表面标志物表达、细胞因子分泌和抗原呈递的表型。
为了举例说明本公开内容所解决的技术问题而公开这些事实。
本文主题鉴定了几种分离的或合成的TF,其出人意料地在体外、离体或在体内将分化的细胞、多潜能或多能干细胞重新程序化或诱导为抗原呈递树突细胞,更具体地是cDC2。
在一个方面,本公开内容包含一种组合物,其包含由分离的或合成的序列编码的至少两种转录因子的组合,所述序列与以下序列具有至少90%同一性且选自:PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)、PRDM1(SEQ.ID.13,SEQ.ID.14,SEQ.ID.16,SEQ.ID.17)、IRF2(SEQ.ID.19,SEQ.ID.20,SEQ.ID.22,SEQ.ID.23)、POU2F2(SEQ.ID.25,SEQ.ID.26,SEQ.ID.28,SEQ.ID.29)、TGIF1(SEQ.ID.31,SEQ.ID.32,SEQ.ID.34,SEQ.ID.35);或至少两种分离的或合成的转录因子的组合,所述转录因子与选自以下的序列具有至少90%同一性:PU.1(SEQ.ID.3,SEQ.ID.6)、IRF4(SEQ.ID.9,SEQ.ID.12)、PRDM1(SEQ.ID.15,SEQ.ID.18)、IRF2(SEQ.ID.21,SEQ.ID.24)、POU2F2(SEQ.ID.27,SEQ.ID.30)、TGIF1(SEQ.ID.33,SEQ.ID.36)、及其混合物;
所述组合物用于将干细胞或分化的细胞或其混合物重新程序化为常规树突细胞2型(cDC2)或CD11b阳性树突细胞。
在一个方面,本公开内容包含一种组合物,其包含至少三种分离的或合成的转录因子的组合,第一种和第二种是分离的和合成的PU.1和IRF4转录因子,其与PU.1(SEQ.ID.3,SEQ.ID.6)和IRF4(SEQ.ID.9,SEQ.ID.12)的序列具有至少90%同一性,且第三种是分离的或合成的转录因子,其与选自以下的序列具有至少90%同一性:PRDM1(SEQ.ID.15,SEQ.ID.18)、IRF2(SEQ.ID.21,SEQ.ID.24)、POU2F2(SEQ.ID.27,SEQ.ID.30)、TGIF1(SEQ.ID.33,SEQ.ID.36)、RBPJ(SEQ.ID.45,SEQ.ID.48)和RELB(SEQ.ID.39,SEQ.ID.42);
所述组合物用于将干细胞或分化的细胞或其混合物重新程序化为常规树突细胞2型(cDC2)或CD11b阳性树突细胞。
本文中使用的变体是指与本公开内容的DNA编码序列具有60%、61%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%或至少99%总体序列同一性的序列。
在另一个实施方案中,本公开内容包含一种组合物,其中至少两种转录因子由分离的或合成的序列编码,所述序列与以下序列具有至少90%同一性且选自:PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)、PRDM1(SEQ.ID.13,SEQ.ID.14,SEQ.ID.16,SEQ.ID.17)、IRF2(SEQ.ID.19,SEQ.ID.20,SEQ.ID.22,SEQ.ID.23)、POU2F2(SEQ.ID.25,SEQ.ID.26,SEQ.ID.28,SEQ.ID.29)、TGIF1(SEQ.ID.31,SEQ.ID.32,SEQ.ID.34,SEQ.ID.35),或至少两种分离的或合成的转录因子与选自以下的序列具有至少90%同一性:PU.1(SEQ.ID.3,SEQ.ID.6)、IRF4(SEQ.ID.9,SEQ.ID.12)、PRDM1(SEQ.ID.15,SEQ.ID.18)、IRF2(SEQ.ID.21,SEQ.ID.24)、POU2F2(SEQ.ID.27,SEQ.ID.30)、TGIF1(SEQ.ID.33,SEQ.ID.36)及其混合物;
所述组合物用于将干细胞或分化的细胞或其混合物重新程序化为常规树突细胞2型(cDC2),
前提条件是,不包括由PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)组成的至少两种分离的或合成的转录因子的组合。
在一个实施方案中,本公开内容包含一种组合物,其中至少两种转录因子由分离的或合成的序列编码,所述序列与以下序列具有至少90%同一性且选自:PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)、PRDM1(SEQ.ID.13,SEQ.ID.14,SEQ.ID.16,SEQ.ID.17)、IRF2(SEQ.ID.19,SEQ.ID.20,SEQ.ID.22,SEQ.ID.23)、POU2F2(SEQ.ID.25,SEQ.ID.26,SEQ.ID.28,SEQ.ID.29)、TGIF1(SEQ.ID.31,SEQ.ID.32,SEQ.ID.34,SEQ.ID.35)、RBPJ(SEQ.ID.43,SEQ.ID.44,SEQ.ID.46,SEQ.ID.47)和RELB(SEQ.ID.37,SEQ.ID.38,SEQ.ID.40,SEQ.ID.41),或至少两种分离的或合成的转录因子与选自以下的序列具有至少90%同一性:PU.1(SEQ.ID.3,SEQ.ID.6)、IRF4(SEQ.ID.9,SEQ.ID.12)、PRDM1(SEQ.ID.15,SEQ.ID.18)、IRF2(SEQ.ID.21,SEQ.ID.24)、POU2F2(SEQ.ID.27,SEQ.ID.30)、TGIF1(SEQ.ID.33,SEQ.ID.36))、RBPJ(SEQ.ID.45,SEQ.ID.48)和RELB(SEQ.ID.39,SEQ.ID.42),及其混合物;
所述组合物用于将干细胞或分化的细胞重新程序化为常规树突细胞2型(cDC2)或CD11b阳性树突细胞。
在一个实施方案中,本公开内容包含用于如本文前述的用途的组合物,其中所述转录因子分别由与下述序列具有至少90%同一性的多核苷酸编码:PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)、PRDM1(SEQ.ID.13,SEQ.ID.14,SEQ.ID.16,SEQ.ID.17)、IRF2(SEQ.ID.19,SEQ.ID.20,SEQ.ID.22,SEQ.ID.23)、POU2F2(SEQ.ID.25,SEQ.ID.26,SEQ.ID.28,SEQ.ID.29)、TGIF1(SEQ.ID.31,SEQ.ID.32,SEQ.ID.34,SEQ.ID.35)、RBPJ(SEQ.ID.43,SEQ.ID.44,SEQ.ID.46,SEQ.ID.47)、RELB(SEQ.ID.37,SEQ.ID.38,SEQ.ID.40,SEQ.ID.41)。
在一个实施方案中,本公开内容包含由分离的或合成的序列编码的至少两种转录因子的组合,所述序列与以下序列具有至少95%同一性且选自:PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)、PRDM1(SEQ.ID.13,SEQ.ID.14,SEQ.ID.16,SEQ.ID.17)、IRF2(SEQ.ID.19,SEQ.ID.20,SEQ.ID.22,SEQ.ID.23)、POU2F2(SEQ.ID.25,SEQ.ID.26,SEQ.ID.28,SEQ.ID.29)、TGIF1(SEQ.ID.31,SEQ.ID.32,SEQ.ID.34,SEQ.ID.35),或与选自以下的序列具有至少95%同一性的至少两种分离的或合成的转录因子:PU.1(SEQ.ID.3,SEQ.ID.6)、IRF4(SEQ.ID.9,SEQ.ID.12)、PRDM1(SEQ.ID.15,SEQ.ID.18)、IRF2(SEQ.ID.21,SEQ.ID.24)、POU2F2(SEQ.ID.27,SEQ.ID.30)、TGIF1(SEQ.ID.33,SEQ.ID.36)及其混合物。
在一个实施方案中,本公开内容包含用于如本文前述的用途的组合物,其中所述转录因子分别由与下述序列具有至少95%同一性的多核苷酸编码:PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)、PRDM1(SEQ.ID.13,SEQ.ID.14,SEQ.ID.16,SEQ.ID.17)、IRF2(SEQ.ID.19,SEQ.ID.20,SEQ.ID.22,SEQ.ID.23)、POU2F2(SEQ.ID.25,SEQ.ID.26,SEQ.ID.28,SEQ.ID.29)、TGIF1(SEQ.ID.31,SEQ.ID.32,SEQ.ID.34,SEQ.ID.35)、RBPJ(SEQ.ID.43,SEQ.ID.44,SEQ.ID.46,SEQ.ID.47)、RELB(SEQ.ID.37,SEQ.ID.38,SEQ.ID.40,SEQ.ID.41),或所述分离的或合成的转录因子分别与下述序列具有至少95%同一性:PU.1(SEQ.ID.3,SEQ.ID.6)、IRF4(SEQ.ID.9,SEQ.ID.12)、PRDM1(SEQ.ID.15,SEQ.ID.18)、IRF2(SEQ.ID.21,SEQ.ID.24)、POU2F2(SEQ.ID.27,SEQ.ID.30)、TGIF1(SEQ.ID.33,SEQ.ID.36)、RBPJ(SEQ.ID.45,SEQ.ID.48)、RELB(SEQ.ID.39,SEQ.ID.42)。
在另一个实施方案中,本公开内容包含至少两种转录因子的组合,所述转录因子选自以下分离的或合成的编码的组合或选自以下蛋白:
PU.1(SEQ.ID.1-SEQ.ID.6)和IRF4(SEQ.ID.7-SEQ.ID.12);
PU.1(SEQ.ID.1-SEQ.ID.6)和PRDM1(SEQ.ID.13-SEQ.ID.18);
IRF4(SEQ.ID.7-SEQ.ID.12)和PRDM1(SEQ.ID.13-SEQ.ID.18);
PU.1(SEQ.ID.1-SEQ.ID.6)和IRF2(SEQ.ID.19-SEQ.ID.24);
PU.1(SEQ.ID.1-SEQ.ID.6)和POU2F2(SEQ.ID.25-SEQ.ID.30);
PU.1(SEQ.ID.1-SEQ.ID.6)和TGIF1(SEQ.ID.31-SEQ.ID.36);
IRF4(SEQ.ID.7-SEQ.ID.12)和IRF2(SEQ.ID.19-SEQ.ID.24);
IRF4(SEQ.ID.7-SEQ.ID.12)和POU2F2(SEQ.ID.25-SEQ.ID.30);
IRF4(SEQ.ID.7-SEQ.ID.12)和TGIF1(SEQ.ID.31-SEQ.ID.36);
PRDM1(SEQ.ID.13-SEQ.ID.18)和IRF2(SEQ.ID.19-SEQ.ID.24);
PRDM1(SEQ.ID.13-SEQ.ID.18)和POU2F2(SEQ.ID.25-SEQ.ID.30);
PRDM1(SEQ.ID.13-SEQ.ID.18)和TGIF1(SEQ.ID.31-SEQ.ID.36);
PU.1(SEQ.ID.1-SEQ.ID.6)和IRF4(SEQ.ID.7-SEQ.ID.12);
IRF2(SEQ.ID.19-SEQ.ID.24)和POU2F2(SEQ.ID.25-SEQ.ID.30);
IRF2(SEQ.ID.19-SEQ.ID.24)和TGIF1(SEQ.ID.31-SEQ.ID.36);
PU.1(SEQ.ID.1-SEQ.ID.6)和POU2F2(SEQ.ID.25-SEQ.ID.30);
POU2F2(SEQ.ID.25-SEQ.ID.30)和TGIF1(SEQ.ID.31-SEQ.ID.36);
PU.1(SEQ.ID.1-SEQ.ID.6)和TGIF1(SEQ.ID.31-SEQ.ID.36);
PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)、PRDM1(SEQ.ID.13-SEQ.ID.18)、IRF2(SEQ.ID.19-SEQ.ID.24)、POU2F2(SEQ.ID.25-SEQ.ID.30)和TGIF1(SEQ.ID.31-SEQ.ID.36);
PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)和RBPJ(SEQ.ID.43-SEQ.ID.48);
PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)和RELB(SEQ.ID.37-SEQ.ID.42);
或其混合物。
在一个实施方案中,本公开内容包含用于前述用途的组合物,其中所述转录因子的组合选自以下组合:
PU.1、IRF4和PRDM1;
PU.1、IRF4和IRF2;
PU.1、IRF4和POU2F2;
PU.1、IRF4和TGIF1;
PU.1、IRF4和RBPJ;和
PU.1、IRF4和RELB。
在一个实施方案中,本公开内容的组合物可以包含由分离的或合成的序列编码的至少三种转录因子,所述序列与选自由以下成员组成的集合或选自以下蛋白的序列具有至少90%同一性:PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)、PRDM1(SEQ.ID.13-SEQ.ID.18)、IRF2(SEQ.ID.19-SEQ.ID.24)、POU2F2(SEQ.ID.25-SEQ.ID.30)、TGIF1(SEQ.ID.31-SEQ.ID.36)及其混合物。
在一个实施方案中,本公开内容涉及如本文所述的组合物,其中所述转录因子的组合是:PU.1、IRF4、PRDM1,或PU.1、IRF4和IRF2。
在另一个实施方案中,本公开内容的组合物可以包含转录因子的组合,所述转录因子选自以下分离的或合成的蛋白或选自以下分离的或合成的编码组合:
PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)和PRDM1(SEQ.ID.13-SEQ.ID.18);
PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)和IRF2(SEQ.ID.19-SEQ.ID.24);
PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)和POU2F2(SEQ.ID.25-SEQ.ID.30);
PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)和TGIF1(SEQ.ID.31-、SEQ.ID.36);
PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)、PRDM1(SEQ.ID.13-SEQ.ID.18);
IRF2(SEQ.ID.19-SEQ.ID.24)、POU2F2(SEQ.ID.25-SEQ.ID.30)和TGIF1(SEQ.ID.31-SEQ.ID.36);
PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)PRDM1(SEQ.ID.13-SEQ.ID.18)、IRF2(SEQ.ID.19-SEQ.ID.24)、POU2F2(SEQ.ID.25-SEQ.ID.30)和TGIF1(SEQ.ID.31-SEQ.ID.36);
PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)和RBPJ(SEQ.ID.43-SEQ.ID.48);
PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)和RELB(SEQ.ID.37-SEQ.ID.42);
或其混合物。
在一个实施方案中,所述分离的或合成的转录因子的组合是:PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)和PRDM1(SEQ.ID.13-SEQ.ID.18)。
在一个实施方案中,所述分离的或合成的转录因子的组合是:PU.1、IRF4和PRDM1。
在另一个实施方案中,所述分离的或合成的转录因子的组合是:PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)和PRDM1(SEQ.ID.13-SEQ.ID.18),或PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)和IRF2(SEQ.ID.19-SEQ.ID.24)。
在另一个实施方案中,所述分离的或合成的转录因子的组合是:PU.1、IRF4和PRDM1,或PU.1、IRF4和IRF2。
在一个实施方案中,本公开内容的组合物可以包含选自以下的干细胞或分化的细胞:多能干细胞、多潜能干细胞、分化的细胞、成纤维细胞、肿瘤细胞、癌细胞、及其混合物。
在一个实施方案中,细胞可以选自:多能干细胞、多潜能干细胞、分化的细胞、成纤维细胞、肿瘤细胞、癌细胞、及其混合物。
在另一个实施方案中,所述细胞可以选自:肿瘤细胞、癌细胞、及其混合物。
在一个实施方案中,所述抗原可以是癌症抗原、自体抗原、变应原、来自病原性和/或感染性生物体的抗原。
在一个实施方案中,本公开内容的组合物可以用在兽医或人药物中,尤其是在免疫疗法中,或在自身免疫性疾病、免疫缺陷中,或在神经变性或老化疾病中,或在癌症中,或在感染性疾病中,或作为药物筛选。
在另一个实施方案中,所述多能干细胞、多潜能干细胞或分化的细胞是哺乳动物多能干细胞、多潜能干细胞或分化的细胞,尤其是小鼠或人细胞。
本公开内容的一个方面涉及一种构建体或载体,其至少编码本公开内容的两种分离的或合成的转录因子的组合,优选地编码分离的或合成的三种转录因子的组合。
本公开内容的另一个方面涉及一种构建体或载体,其编码如本文所述的转录因子的组合。
在另一个实施方案中,本公开内容包含构建体或载体,其中三种分离的或合成的转录因子的组合从5’至3’遵循以下序列顺序:
PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)和PRDM1(SEQ.ID.13,SEQ.ID.14,SEQ.ID.16,SEQ.ID.17);PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)和IRF2(SEQ.ID.19,SEQ.ID.20,SEQ.ID.22,SEQ.ID.23)
PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)和POU2F2(SEQ.ID.25,SEQ.ID.26,SEQ.ID.28,SEQ.ID.29);
PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)和TGIF1(SEQ.ID.31,SEQ.ID.32,SEQ.ID.34,SEQ.ID.35);
PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)、PRDM1(SEQ.ID.13,SEQ.ID.14,SEQ.ID.16,SEQ.ID.17)、IRF2(SEQ.ID.19,SEQ.ID.20,SEQ.ID.22,SEQ.ID.23)、POU2F2(SEQ.ID.25,SEQ.ID.26,SEQ.ID.28,SEQ.ID.29)和TGIF1(SEQ.ID.31,SEQ.ID.32,SEQ.ID.34,SEQ.ID.35);
PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)和RBPJ(SEQ.ID.43,SEQ.ID.44,SEQ.ID.46,SEQ.ID.47);
PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)和RELB(SEQ.ID.37,SEQ.ID.38,SEQ.ID.40,SEQ.ID.41)。
在另一个实施方案中,本公开内容包含构建体或载体,其中编码的转录因子的组合从5’至3’遵循以下序列顺序:
PU.1、IRF4和PRDM1;
PU.1、IRF4和IRF2;
PU.1、IRF4和POU2F2;
PU.1、IRF4和TGIF1;
PU.1、IRF4和RBPJ;或
PU.1、IRF4和RELB。
在另一个实施方案中,本公开内容包含载体,其中所述载体是病毒载体;尤其是逆转录病毒、腺病毒、慢病毒、疱疹病毒、痘病毒或腺相关病毒载体。
在一个实施方案中,所述载体或构建体是合成的mRNA、裸甲病毒RNA复制子或裸黄病毒RNA复制子。
在本公开内容的一个方面,本公开内容涉及一种或多种载体,其包含编码至少三种转录因子的至少三种多核苷酸序列,其中第一种和第二种转录因子是PU.1和IRF4,且第三种转录因子选自:PRDM1、IRF2、POU2F2、RBPJ、RELB和TGIF1,所述载体用于将干细胞或分化的细胞重新程序化为常规树突细胞2型(cDC2)或CD11b阳性树突细胞。
在一个实施方案中,本公开内容涉及一种或多种载体,其中所述转录因子分别由多核苷酸编码,所述多核苷酸与选自以下的序列具有至少90%同一性:PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)、PRDM1(SEQ.ID.13,SEQ.ID.14,SEQ.ID.16,SEQ.ID.17)、IRF2(SEQ.ID.19,SEQ.ID.20,SEQ.ID.22,SEQ.ID.23)、POU2F2(SEQ.ID.25,SEQ.ID.26,SEQ.ID.28,SEQ.ID.29)、TGIF1(SEQ.ID.31,SEQ.ID.32,SEQ.ID.34,SEQ.ID.35)、RELB(SEQ.ID.37,SEQ.ID.38,SEQ.ID.40,SEQ.ID.41)和RBPJ(SEQ.ID.43,SEQ.ID.44,SEQ.ID.46,SEQ.ID.47)。
在另一个实施方案中,本公开内容包含一种或多种载体,其中编码的转录因子的组合选自以下组合:
PU.1、IRF4和PRDM1;
PU.1、IRF4和IRF2;
PU.1、IRF4和POU2F2;
PU.1、IRF4和TGIF1;
PU.1、IRF4和RBPJ;或
PU.1、IRF4和RELB。
在一个方面,本公开内容涉及一种或多种载体,其包含编码至少三种转录因子的至少三种多核苷酸序列,其中所述转录因子是PU.1、IRF4和PRDM1。
在一个实施方案中,本公开内容包含一种或多种载体,其中所述一种或多种载体是病毒载体;尤其是逆转录病毒、腺病毒、慢病毒、疱疹病毒、痘病毒、副粘病毒、棒状病毒、甲病毒、黄病毒或腺相关病毒载体。
在一个实施方案中,本公开内容包含一种或多种载体,其中所述一种或多种载体是合成的mRNA、裸甲病毒RNA复制子或裸黄病毒RNA复制子。
在一个实施方案中,本公开内容包含一种或多种载体,其中所述细胞选自:多能干细胞、多潜能干细胞、分化的细胞、肿瘤细胞、癌细胞及其混合物。
在一个实施方案中,本公开内容包含一种或多种载体,其用于用在兽医或人药物中,特别是在免疫疗法中,或在神经变性疾病的治疗或疗法中,或在自身免疫性疾病、免疫缺陷中,或在癌症的治疗或疗法中,或在感染性疾病的治疗或疗法中;真皮内和透皮疗法;在免疫疗法中,或在神经变性或老化疾病中,或在癌症中,或在感染性疾病中,作为药物筛选;或用于以下疾病的治疗、疗法或诊断中:中枢和周围神经系统病症,瘤形成,尤其是癌症,即实体肿瘤或血液学肿瘤,免疫学疾病,尤其是自身免疫性疾病、超敏反应或免疫缺陷;真菌、病毒、衣原体、细菌、纳米细菌或寄生虫感染性疾病;HIV,被SARS冠状病毒、亚洲流感病毒、单纯疱疹、带状疱疹、肝炎或病毒性肝炎感染。
本公开内容的另一个方面涉及一种将干细胞或分化的细胞重新程序化或诱导为常规树突细胞2型的方法,所述方法包括以下步骤:
用一种或多种载体转导细胞,所述细胞选自:干细胞或分化的细胞、及其混合物,
所述载体包含至少两种核酸序列,所述核酸序列编码与选自以下的序列具有至少90%同一性、优选至少95%同一性的序列:PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)、PRDM1(SEQ.ID.13,SEQ.ID.14,SEQ.ID.16,SEQ.ID.17)、IRF2(SEQ.ID.19,SEQ.ID.20,SEQ.ID.22,SEQ.ID.23)、POU2F2(SEQ.ID.25,SEQ.ID.26,SEQ.ID.28,SEQ.ID.29)、TGIF1(SEQ.ID.31,SEQ.ID.32,SEQ.ID.34,SEQ.ID.35)、RBPJ(SEQ.ID.43,SEQ.ID.44,SEQ.ID.46,SEQ.ID.47)和RELB(SEQ.ID.37,SEQ.ID.38,SEQ.ID.40,SEQ.ID.41)、及其混合物;
在支持树突细胞或抗原呈递细胞的生长的细胞培养基中培养所述转导的细胞。
本公开内容的另一个方面涉及一种将干细胞或分化的细胞重新程序化或诱导为常规树突细胞2型的方法,所述方法包括以下步骤:
用一种或多种载体转导细胞,所述细胞选自:干细胞或分化的细胞、及其混合物,
所述载体编码至少三种转录因子,第一种和第二种是PU.1和IRF4且第三种选自PRDM1、IRF2、POU2F2、TGIF1、RELB和RBPJ;及其混合物;
在支持树突细胞或抗原呈递细胞的生长的细胞培养基中培养所述转导的细胞。
在一个实施方案中,本公开内容涉及如本文所述的方法,其中将所述转导的细胞培养至少2天,优选至少5天,更优选至少8天,甚至更优选至少9天,更优选至少10天。
在另一个实施方案中,所述序列的组合可以是:
PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)和PRDM1(SEQ.ID.13,SEQ.ID.14,SEQ.ID.16,SEQ.ID.17);
PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)和IRF2(SEQ.ID.19,SEQ.ID.20,SEQ.ID.22,SEQ.ID.23);
PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)和POU2F2(SEQ.ID.25,SEQ.ID.26,SEQ.ID.28,SEQ.ID.29);
PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)和TGIF1(SEQ.ID.31,SEQ.ID.32,SEQ.ID.34,SEQ.ID.35);
PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)和RBPJ(SEQ.ID.43,SEQ.ID.44,SEQ.ID.46,SEQ.ID.47);
PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)和RELB(SEQ.ID.37,SEQ.ID.38,SEQ.ID.40,SEQ.ID.41)。
在一个实施方案中,本公开内容包含本公开内容的构建体或载体,其中选自所述组的序列是PU.1和IRF4的组合。
在一个方面,本公开内容包含一种用于将干细胞或分化的细胞重新程序化或诱导为常规树突细胞2型的方法,所述方法包括以下步骤:用一种或多种载体转导细胞,所述细胞选自:干细胞或分化的细胞、及其混合物,所述载体包含至少两种核酸序列,所述核酸序列编码与选自以下的序列具有至少90%同一性、优选至少95%同一性的序列:PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)、PRDM1(SEQ.ID.13,SEQ.ID.14,SEQ.ID.16,SEQ.ID.17)、IRF2(SEQ.ID.19,SEQ.ID.20,SEQ.ID.22,SEQ.ID.23)、POU2F2(SEQ.ID.25,SEQ.ID.26,SEQ.ID.28,SEQ.ID.29)、TGIF1(SEQ.ID.31,SEQ.ID.32,SEQ.ID.34,SEQ.ID.35)、RBPJ(SEQ.ID.43,SEQ.ID.44,SEQ.ID.46,SEQ.ID.47)和RELB(SEQ.ID.37,SEQ.ID.38,SEQ.ID.40,SEQ.ID.41);及其混合物;在支持树突细胞或抗原呈递细胞的生长的细胞培养基中培养所述转导的细胞。
在一个实施方案中,三种分离的和合成的转录因子的组合从5’至3’遵循以下序列顺序:PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5))、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)、PRDM1(SEQ.ID.13,SEQ.ID.14,SEQ.ID.16,SEQ.ID.17)、IRF2(SEQ.ID.19,SEQ.ID.20,SEQ.ID.22,SEQ.ID.23)、POU2F2(SEQ.ID.25,SEQ.ID.26,SEQ.ID.28,SEQ.ID.29)、TGIF1(SEQ.ID.31,SEQ.ID.32,SEQ.ID.34,SEQ.ID.35)。所述方法包括、但不限于将用多种分离的和合成的转录因子转导的细胞培养至少2天,优选至少5天,更优选至少8天,甚至更优选至少9天,甚至更优选至少10天。
在另一个实施方案中,本公开内容包括一种方法,其中所述转导步骤进一步包含至少一种载体,其选自:编码IL-12的核酸序列;编码IL-4的核酸序列;编码IFN-α的核酸序列;编码IFN-β的核酸序列;编码IFN-γ的核酸序列;编码TNF的核酸序列;编码GM-CSF的核酸序列;编码靶向IL-10RNA的siRNA的核酸序列,及其混合物。
在一个实施方案中,本公开内容包括一种方法,其中转导步骤进一步包含至少一种载体,其包含编码免疫刺激性细胞因子的核酸。
在另一个实施方案中,本公开内容包括一种方法,其中所述细胞选自:多能干细胞,或多潜能干细胞,或分化的细胞,及其混合物。
在另一个实施方案中,本公开内容包括一种方法,其中所述细胞是哺乳动物细胞。
在另一个实施方案中,本公开内容包括一种方法,其中所述多能干细胞、多潜能干细胞或分化的细胞选自:内胚层衍生的细胞,中胚层衍生的细胞,或外胚层衍生的细胞,多潜能干细胞,包括间质干细胞、造血干细胞、肠干细胞、多能干细胞和细胞系。
在一个实施方案中,本公开内容包括一种方法,其中所述细胞是非人细胞。
在另一个实施方案中,本公开内容包括一种方法,其中所述细胞是小鼠细胞。
在一个实施方案中,本公开内容包括一种方法,其中所述细胞是人细胞。
在另一个实施方案中,本公开内容包括一种方法,其中所述细胞是人或小鼠成纤维细胞或哺乳动物脐带血干细胞。
本公开内容的另一个方面涉及通过本公开内容的方法获得的诱导的树突细胞。
本公开内容的另一个方面涉及用如本文所述的构建体或载体或如本文所述的一种或多种载体转导的诱导的树突细胞。
在另一个实施方案中,本公开内容涉及治疗有效量的可通过本公开内容的方法获得的诱导的树突细胞和药学上可接受的赋形剂。
在另一个实施方案中,本公开内容包含一种用于用在兽医或人药物中的方法。
在另一个实施方案中,本公开内容包含一种方法,其用于用在免疫疗法中,或在神经变性疾病的治疗或疗法中,或在自身免疫性疾病、免疫缺陷中,或在癌症的治疗或疗法中,或在感染性疾病的治疗或疗法中。
在一个实施方案中,本公开内容包括一种方法,其进一步包含抗病毒剂、镇痛剂、抗炎剂、化疗剂、放疗剂、抗生素、利尿剂或其混合物。
在另一个实施方案中,本公开内容包含一种组合物,其进一步包含填充剂、粘合剂、崩解剂或润滑剂或其混合物。
在另一个实施方案中,本公开内容包含一种用于真皮内和透皮疗法中的组合物。
在另一个实施方案中,本公开内容包含一种可注射制剂,尤其是原位注射剂。
在一个实施方案中,本公开内容包含一种组合物,其用于用在兽医或人药物中,尤其是在免疫疗法中,或在神经变性或老化疾病中,或在癌症中,或在感染性疾病中,作为药物筛选。
在另一个实施方案中,本公开内容包含一种组合物,其用于中枢和周围神经系统病症的治疗、疗法或诊断。
在另一个实施方案中,本公开内容包含一种组合物,其用于瘤形成、尤其是癌症(即实体肿瘤或血液学肿瘤)的治疗、疗法或诊断中。
在另一个实施方案中,本公开内容包含一种组合物,其用于癌症或免疫学疾病(即自身免疫性疾病、超敏反应或免疫缺陷)的治疗、诊断或疗法中。
在一个实施方案中,本公开内容包含一种组合物,其用于真菌、病毒、衣原体、细菌、纳米细菌或寄生虫感染性疾病的治疗、疗法或诊断中。
在另一个实施方案中,本公开内容包含一种组合物,其用于HIV、被SARS冠状病毒、亚洲流感病毒、单纯疱疹、带状疱疹、肝炎或病毒性肝炎感染的治疗、疗法或诊断中。
在另一个实施方案中,本公开内容包含癌症疫苗,其包含如在前述权利要求中的任一项中所述的组合物或本公开内容的诱导的树突细胞或其混合物。
在一个方面,本公开内容涉及癌症的疫苗或可注射制剂,尤其是原位注射剂,其包含如本文所述的组合物或如本文所述的诱导的树突细胞或其混合物。
在另一个实施方案中,本公开内容包含试剂盒,其包含下述组分中的至少一种:
本公开内容的诱导的树突细胞;
如本公开内容中所述的组合物;
本公开内容的载体或构建体;
或其混合物。
令人惊讶地,通过本公开内容中所述的重新程序化产生的诱导的DC表现出常规树突细胞2型(CD11b)的固有表面标志物表型、以及细胞因子分泌和在MHC-II分子中的抗原呈递。
本公开内容涉及组合物,其包含至少两种分离的转录因子的组合作为细胞的重新程序化或诱导因子,所述转录因子由与选自以下的序列具有90%同一性的序列编码:PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5))、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)、PRDM1(SEQ.ID.13,SEQ.ID.14,SEQ.ID.16,SEQ.ID.17)、IRF2(SEQ.ID.19,SEQ.ID.20,SEQ.ID.22,SEQ.ID.23)、POU2F2(SEQ.ID.25,SEQ.ID.26,SEQ.ID.28,SEQ.ID.29)、TGIF1(SEQ.ID.31,SEQ.ID.32,SEQ.ID.34,SEQ.ID.35)、RBPJ(SEQ.ID.43,SEQ.ID.44,SEQ.ID.46,SEQ.ID.47)、RELB(SEQ.ID.37,SEQ.ID.38,SEQ.ID.40,SEQ.ID.41),所述细胞选自:干细胞或分化的细胞或其混合物。
与由提及的序列(SEQ.ID.1-36)编码的参考多肽具有相同或类似的活性的多肽变体或家族成员可以用在本文所述的组合物、方法和试剂盒中。通常,用于本文所述的组合物、方法和试剂盒中的编码DC诱导因子的特定多肽的变体将与该特定参考多核苷酸或多肽具有至少约90%、至少约91%、至少约92%、至少约93%、至少约94%、至少约95%、至少约96%、至少约97%、至少约98%、至少约99%或更多序列同一性,如通过本文所述的和本领域技术人员已知的序列比对算法和参数所确定。
用于比较的序列比对方法包括GAP、BESTFIT、BLAST、FASTA和TFASTA。GAP使用Needleman和Wunsch的算法((1970)J Mol Biol 48:443-453)以寻找两个序列的总体(在整个序列中)比对,其使匹配数最大化并使缺口数最小化。BLAST算法(Altschul等人(1990)JMol Biol 215:403-10)计算序列同一性百分比并执行两个序列之间的相似性的统计学分析。用于执行BLAST分析的软件可以通过国立生物技术信息中心(National Centre forBiotechnology Information,NCBI)公开获得。使用MatGAT软件包中可得到的方法之一,也可以确定相似性和同一性的总体百分比(Campanella等人,BMC Bioinformatics.2003年7月10日;4:29.MatGAT:an application that generates similarity/identity matricesusing protein or DNA sequences)。如本领域技术人员显而易见的,可以进行少量手动编辑以优化保守基序之间的比对。使用BLAST和默认参数在整个氨基酸序列上确定在本发明主题中作为百分比指示的序列同一性值。
在一个实施方案中,本公开内容的任何DNA编码序列可以被改变、置换或修饰以包含一个或多个、优选0、1、2、3、4、5、6个不同的脱氧核糖核苷酸碱基。
在一个实施方案中,本公开内容验证了在Clec9a报告小鼠中,大部分的cDC2被tdTomato荧光蛋白标记,使得该模型适用于筛选cDC2诱导因子。PU.1已被描述为在DC发育中发挥关键作用,且IRF4已被描述为确保cDC2规范。此外,PU.1和IRF4两者都在cDC2子集上高表达。因此,本公开内容将PU.1和IRF4与另外33种cDC2诱导候选物组合,并在Clec9a报告小鼠胚胎成纤维细胞(MEF)中进行加性筛选。
在一个实施方案中,与IRF4和PRDM1组合的PU.1足以诱导Clec9a报告物活化和cDC2表面标志物CD11b的表面表达。此外,与IRF4和PRDM1组合的PU.1诱导对于DC功能性而言重要的主要组织相容性复合物(MHC)II类分子的表达。
编码PU.1和随后的IRF4和PRDM1的多顺反子构建体增加了Clec9a报告物活化的效率。由此产生的tdTomato+细胞表现出在TLR刺激后分泌促炎性TNF-α并将负载在MHC-II上的抗原呈递给CD4+ T细胞从而诱导它们的增殖和活化的能力。
在一个实施方案中,与IRF4和IRF2组合的PU.1诱导显著的报告物活化。此外,与IRF4和IRF2组合的PU.1导致增加的tdT+CD11b+双阳性细胞群体。
在一个实施方案中,与IRF4和POU2F2组合的PU.1,或与IRF4和TGIF1组合的PU.1,导致增加的CD11b(一种cDC2-特异性的表面标志物)的表达。
在一个实施方案中,与IRF4和RBPJ组合的PU.1,或与IRF4和RELB组合的PU.1,导致增加的CD11b(一种cDC2-特异性的表面标志物)的表达。
总之,我们显示,当与PRDM1、IRF2、RBPJ、RELB、POU2F2或TGIF1组合时,PU.1和IRF4在成纤维细胞中诱导cDC2表型。这些发现提供了对cDC2异质性规范的洞察。通过直接重新程序化产生的下一代cDC2为使用自体工程改造的细胞诱导免疫促进和耐受性应答开辟了可能性。
在一个实施方案中,分离的转录因子的组合可以是:
PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)和PRDM1(SEQ.ID.13-SEQ.ID.18);
PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)和IRF2(SEQ.ID.19-SEQ.ID.24);
或PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)和POU2F2(SEQ.ID.25-SEQ.ID.30);
或PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)和TGIF1(SEQ.ID.31-SEQ.ID.36);
或PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)和RBPJ(SEQ.ID.43-SEQ.ID.48);
或PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)和RELB(SEQ.ID.37-SEQ.ID.42)。
本公开内容的另一个方面是至少两个序列的组合的应用,所述序列来自:PU.1(SEQ.ID.1-SEQ.ID.6)、IRF4(SEQ.ID.7-SEQ.ID.12)、PRDM1(SEQ.ID.13-SEQ.ID.18)、IRF2(SEQ.ID.19-SEQ.ID.24)、POU2F2(SEQ.ID.25-SEQ.ID.30)和TGIF1(SEQ.ID.31-SEQ.ID.36)。所述分离的转录因子可以包括以下组合:
PU.1(SEQ.ID.1-SEQ.ID.6)和IRF4(SEQ.ID.7-SEQ.ID.12);或
PU.1(SEQ.ID.1-SEQ.ID.6)和PRDM1(SEQ.ID.13-SEQ.ID.18);或
IRF4(SEQ.ID.7-SEQ.ID.12)和PRDM1(SEQ.ID.13-SEQ.ID.18);或
PU.1(SEQ.ID.1-SEQ.ID.6)和IRF2(SEQ.ID.19-SEQ.ID.24);或
PU.1(SEQ.ID.1-SEQ.ID.6)和POU2F2(SEQ.ID.25-SEQ.ID.30);
或PU.1(SEQ.ID.1-SEQ.ID.6)和TGIF1(SEQ.ID.31-SEQ.ID.36);
或IRF4(SEQ.ID.7-SEQ.ID.12)和IRF2(SEQ.ID.19-SEQ.ID.24);
或IRF4(SEQ.ID.7-SEQ.ID.12)和POU2F2(SEQ.ID.25-SEQ.ID.30);
或IRF4(SEQ.ID.7-SEQ.ID.12)和TGIF1(SEQ.ID.31-SEQ.ID.36);
或PRDM1(SEQ.ID.13-SEQ.ID.18)和IRF2(SEQ.ID.19-SEQ.ID.24);
或PRDM1(SEQ.ID.13-SEQ.ID.18)和POU2F2(SEQ.ID.25-SEQ.ID.30);
或PRDM1(SEQ.ID.13-SEQ.ID.18)和TGIF1(SEQ.ID.31-SEQ.ID.36);
或PU.1(SEQ.ID.1-SEQ.ID.6)和IRF2(SEQ.ID.19-SEQ.ID.24);
或IRF2(SEQ.ID.19-SEQ.ID.24)和POU2F2(SEQ.ID.25-SEQ.ID.30);
或IRF2(SEQ.ID.19-SEQ.ID.24)和TGIF1(SEQ.ID.31-SEQ.ID.36);
或PU.1(SEQ.ID.1-SEQ.ID.6)和POU2F2(SEQ.ID.25-SEQ.ID.30);
或POU2F2(SEQ.ID.25-SEQ.ID.30)和TGIF1(SEQ.ID.31-SEQ.ID.36);
或PU.1(SEQ.ID.1-SEQ.ID.6)和TGIF1(SEQ.ID.31-SEQ.ID.36)。
在一个实施方案中,为了更好的结果,所述细胞可以选自:多能干细胞,多潜能干细胞,分化的细胞,肿瘤细胞,癌细胞,及其混合物。尤其是哺乳动物细胞,更尤其是小鼠或人细胞。
在一个实施方案中,本公开内容的分离的转录因子可以用于兽医或人药物应用,尤其是在感染性疾病或病毒性疾病或病毒诱导的疾病或神经变性疾病中,或在癌症中,或在糖尿病中,或在免疫疗法中,或在自身免疫性疾病中,或在超敏反应疾病中。
在一个实施方案中,为了更好的结果,本公开内容的分离的转录因子可以用作特定细胞向树突细胞或干扰素产生细胞的重新程序化或诱导因子,所述特定细胞选自:多能干细胞,或多潜能干细胞,或分化的细胞,及其混合物。
附图描述
以下附图提供了用于说明本公开内容的优选实施方案并且不应被视为限制本发明的范围。
图1-3个主要DC子集的个体发育。DC来自骨髓中的共同DC前体(CDP),所述前体可以发育成不同的DC子集:cDC1,其主要进行抗原交叉呈递以促进Th1应答和细胞毒性的T-细胞应答;pDC,其在病毒感染后充当干扰素I型产生细胞;cDC2,主要执行MHC-II抗原呈递并促进Th2、Th17和Treg应答的DC子集。
图2-直接重新程序化的cDC2的应用的示意图。从患者获得的成纤维细胞将被重新程序化为可以应用于个性化免疫疗法的cDC2细胞。诱导的cDC2可以用于诱导针对寄生虫、针对细胞外病原体的免疫,促进抗肿瘤应答,或在自身免疫或超敏反应的情况下诱导对自体抗原的免疫耐受。
图3-脾cDC2表达高水平的由Clec9a-tdTomato报告物驱动的tdTomato蛋白。(A)从Clec9a-tdTomato小鼠分离的脾cDC1(MHC-II+CD11c+CD8a+)和cDC2(MHC-II+CD11c+CD11b+)群体中的tdTomato表达的流式细胞计量术分析。(B)在cDC1(CD8α+)和cDC2(CD8α-CD11b+)中的tdTomato+细胞的定量。
图4-Clec9a-报告物活化和基因表达模式令人惊讶地适合鉴定因子的cDC2指令。(A)筛选cDC2诱导转录因子(TF)的实验策略。PU.1、IRF8和BATF3(PIB)组合诱导Clec9a-tdTomato(Clec9a-tdT)小鼠胚胎成纤维细胞(MEF)向cDC1-样诱导的细胞的重新程序化。该组合将被修改以保留cDC的基本TF并鉴定用于cDC2重新程序化的组合。(B)在cDC1和cDC2中的Spi1、Irf8和Batf3的表达的比较(GSE15907)。在括号中描述了基因表达的倍数变化。(C和D)在用PU.1+BATF3与IRF家族的不同成员(IRF1-IRF9)的组合或PU.1+IRF4转导后第6天tdTomato+细胞的定量。(E)在cDC1和cDC2中的Irf基因家族的表达(GSE15907)。在括号中描述了倍数变化。*P<0.05,**P<0.01,***P<0.001,****P<0.00001,不成对t检验和单因素方差分析。
图5-鉴定cDC2诱导转录因子的策略。(A)cDC2重新程序化组合的筛选策略的示意图。将PU.1和IRF4与另外的各种候选TF一起过表达。在重新程序化的第6天评估Clec9a报告物活化和cDC2表面标志物CD11b的表达。(B)当与cDC1和pDC群体相比时,候选TF在cDC2中高度富集。PU.1、IRF4和33种候选物在cDC1、cDC2和pDC群体中的表达的热图(GSE15907)。
图6-基于Clec9a报告物的关于cDC2诱导TF的筛选鉴定了cDC2重新程序化的新调节剂。在用与单独另外候选物组合的PU.1+IRF4转导Clec9a报告物MEF后第6天的流式细胞计量术分析代表图(A)和tdTomato阳性(tdTomato+)细胞的定量(B)(平均值±SD;每种条件2-7个重复的筛选数据和统计数据)。包括用M2rtTA、PU.1+IRF8+BATF3和PU.1+IRF4+BATF3转导的MEF作为对照。*P<0.05,**P<0.01,单因素方差分析。
图7-基于CD11b表达的关于cDC2诱导TF的筛选鉴定了cDC2重新程序化的新调节剂。在用与单独另外候选物组合的PU.1+IRF4转导Clec9a报告物MEF后第6天的流式细胞计量术分析代表图(A)和CD11b+细胞的定量(B)(平均值±SD;每种条件2-7个重复的筛选数据和统计数据)。包括用M2rtTA、PU.1+IRF8+BATF3和PU.1+IRF4+BATF3转导的MEF作为对照。*P<0.05,**P<0.01,单因素方差分析。
图8–用三种转录因子的组合从小鼠成纤维细胞诱导cDC2-样细胞。在用与单独另外候选物组合的PU.1+IRF4转导后第6天的流式细胞计量术分析代表图(A)和双阳性tdTomato+CD11b+细胞的定量(B)(平均值±SD,由每种条件2个重复组成的筛选数据)。包括M2rtTA转导的MEF作为对照。
图9-PU.1、IRF4和PRDM1足以cDC2重新程序化且是cDC2重新程序化所需要的。在第6天在PU.1+IRF4+PRDM1转导和从3种TF合并物单独除去TF或单独TF表达以后,tdT+细胞(A)和在tdT+群体内门控的CD11b+细胞(B)的定量(n=2,平均值±SD)。包括M2rtTA转导的MEF作为对照。
图10-PU.1、IRF4和PRDM1在cDC2细胞中富集。(A)在DC群体(pDC、cDC1和cDC2)中Spi1、Irf4和Prdm1的基因表达。Spi1、Irf4和Prdm1更多地在cDC2中表达,且Prdm1特别更多地在cDC2中表达。(B)Spi1、Irf4和Prdm1的组合在96种小鼠组织和细胞类型中主要富集在CD8a-DC中。基因表达数据(GeneAtlas MOE430)对每个基因进行对数转换并针对0-1范围归一化,然后搜索Spi1+Irf4+Prdm1的最高平均表达。
图11-PU.1、IRF4和PRDM1诱导CD45和MHC-II表面表达。在第6天在M2rtTA、PU.1+IRF8+BATF3和PU.1+IRF4+PRDM1转导的MEF中的(A)代表性流式细胞计量术图和(B)MHC-II+细胞的定量(n=2,平均值±SD)。(C)在第6天在M2rtTA、PU.1+IRF8+BATF3和PU.1+IRF4+PRDM1转导的MEF中在tdT+内的MHC-II+细胞的定量(n=2,平均值±SD)。(D)在第9天在PU.1+IRF4+PRDM1转导的MEF中在tdT-和tdT+群体内的CD45和MHC-II表达。
图12-PU.1和IRF4的组合足以Clec9a报告物活化。在用与单独另外候选物组合的PU.1转导Clec9a报告物MEF后第6天tdTomato阳性(tdTomato+)细胞的定量(平均值±SD;每种条件2个重复的筛选数据)。包括M2rtTA转导的MEF作为对照。
图13-DC2诱导TF的组合诱导逐渐的Clec9a报告物活化。PU.1+IRF4+PRDM1(P+I4+P)和PU.1+IRF4+IRF2(P+I4+I2)组合的Clec9a-tdTomato报告物活化的动力学。包括M2rtTA转导的MEF作为对照。
图14-PU.1、IRF4和IRF2是独立于PRDM1而诱导DC表型的最小且足够的组合。(A)在第6天在用PU.1+IRF4+IRF2转导和从3种TF合并物单独除去TF或单独TF表达后tdTomato+细胞的定量(n=2,平均值±SD)。(B)Spi1、Irf4和Irf2的组合在96种小鼠组织和细胞类型中主要富集在CD8a-DC中。基因表达数据(GeneAtlas MOE430)对每个基因进行对数转换并针对0-1范围归一化,然后搜索Spi1+Irf4+Irf2的最高平均表达。(C)在重新程序化的第6天,用与PRDM1、IRF2或PRDM1+IRF2组合的PU.1+IRF4转导后TdTomato+细胞的定量。(D)在pFUW-TetO质粒中编码Spi1和随后的Irf4和Irf2(被自切割肽P2A和T2A隔开)的多顺反子构建体(PI4I2Poly)的示意图。(E)在重新程序化的第6天,用M2rtTA、单独的PU.1、IRF4和IRF2因子(P+I4+I2)和PI4I2Poly构建体转导后TdTomato+细胞的定量。
图15-多顺反子PI4P载体(PI4PPoly)增加重新程序化效率。(A)插入在pFUW-TetO质粒中的编码Spi1和随后的Irf4和Prdm1(被自切割肽P2A和T2A隔开)的多顺反子构建体的示意图(PI4PPoly)。在重新程序化的第6天在用M2rtTA、单独载体编码的PU.1、IRF4和PRDM1(P+I4+P)、多顺反子PU.1、IRF8和BATF3组合(PI8BPoly)和多顺反子PU.1、IRF4和PRDM1(PI4PPoly)构建体转导后的(B)代表性流式细胞计量术图和(C)TdT+细胞的定量(平均值±SD,n=2)。(D)在重新程序化的第9天M2rtTA、PI8BPoly和PI4PPoly转导的MEF的荧光显微术比较,代表了tdT+细胞形态学(白色箭头)。
图16-PIP诱导的细胞分泌TNF-α促炎细胞因子。在重新程序化的第9天在FACS分选的tdTomato+细胞的上清液中的TNF-α和IL-10浓度的定量,其在用LPS、聚I:C(PiC)、R848和CpG ODN 1585进行过夜TLR刺激之前(-)或之后由PU.1、IRF4和PRDM1多顺反子载体(PI4PPoly)诱导。
图17-PIP表达诱导了以抗原特异性方式将MHC-II分子中的抗原呈递给CD4+ T-细胞的能力。在MEF、分选的PIP-TdT+细胞(第9天)和预先加载或没有加载OVA肽(323-339)的骨髓DC(BM-DC)的共培养后和在以下不同刺激条件下OVA-特异性的OT-II Rag2KO CD4+ T细胞(CD4+ TCRb+)的CTV稀释的定量:无刺激(-),LPS,PiC,R848和CpG ODN 1585。
详细描述
本公开内容涉及组合物、核酸构建体、方法及其试剂盒,其用于将细胞重新程序化为常规树突细胞,特别是常规树突细胞2型(cDC2),它们方法和用途,特别是通过引入和表达分离的/合成的转录因子从分化的、多潜能或多能干细胞制备cDC、特别是cDC2的方法的开发。更具体地,本公开内容提供了用特定转录因子的组合的惊人有益应用进行直接细胞重新程序化获得cDC、特别是cDC2的方法。这样的组合物、核酸构建体、方法和试剂盒可以用于在体外、离体或在体内诱导树突细胞,并且这些诱导的DC或APC可以用于免疫疗法应用。
天然DC是接种在所有组织中的骨髓衍生的细胞。DC准备好对环境进行采样并将收集到的信息传输给适应性免疫系统的细胞(T细胞和B细胞)。在抗原吞噬后,DC通过将处理的抗原(以肽-主要组织相容性复合物(MHC)分子复合物的形式)呈递给淋巴样组织中的原初(也就是说,未经历过抗原的)T细胞来启动免疫应答。活化后,DC通常过表达共刺激分子和MHC分子,此外还分泌负责启动和/或增强许多T和B淋巴细胞应答的各种细胞因子,即I型干扰素、肿瘤坏死因子(TNF)-α、IFN-γ、IL-12和IL-6。因此,通常通过其主要组织相容性复合物II类分子(MHC-II)、共刺激分子(诸如CD80/86和CD40)和整联蛋白CD11c的高表达以及它们的分泌炎性细胞因子并从非淋巴样器官迁移到淋巴样器官并刺激原初T细胞的优异能力来鉴定DC。在小鼠和人中,不同的DC子集可以通过表型、个体发育和功能来不同地定义(图1)。它们包括在淋巴样器官中发现的常规DC子集1(cDC1,也被称作CD8α+DC子集),其展现对MHC I类交叉呈递的能力,并触发针对传染原或肿瘤的CTL应答。pDC通过响应于病毒感染产生大量I型干扰素而发挥作用。另一方面,cDC2在导致Th2和Th17 T细胞应答的MHC-II呈递方面表现出色。除了引发T细胞外,cDC2还已经通过引发Treg或通过促进胸腺中自体反应性T细胞的负选择来参与对抗原的自身耐受性的建立。DNGR-1,也被称作CLEC9A,是坏死细胞的受体,其有利于小鼠中针对死细胞相关抗原的CTL的交叉引发。DNGR-1由小鼠cDC1DC、cDC2 DC和pDC选择性地高水平表达。近年来,显示Clec9a的表达允许鉴定在淋巴样组织中定向为一个和多个常规或浆细胞样DC谱系及其后代的DC前体(CDP)。
如本文所述,能够将分化的细胞重新程序化为诱导的DC、特别是cDC2的DC诱导因子的成功鉴定,可以以多种方式促进我们对cDC2生物学和异质性的基本理解。这项工作将提供对cDC2转录网络的透彻洞察。此外,DC诱导因子的鉴定为理解DC状态如何建立以及关键调节机制如何实施提供了前所未有的机会。
转录因子在发育期间在所有细胞类型的规范中发挥着关键作用。使用转录因子介导的重新程序化的直接重新程序化策略的成功指示,使用这样的因子将多能ES/iPS细胞或多潜能干细胞的分化引导至特定命运是同样似乎合理的。因此,使用本文鉴定的DC诱导因子,可以实现通过富含DC的转录因子的表达将ES/iPS细胞定向分化为确定DC命运。此外,使用本文鉴定的DC诱导因子,可以实现通过富含DC的转录因子的表达将多潜能造血干细胞和祖细胞定向分化为确定DC命运。
本公开内容的一个方面是使用TF或使用TF的组合来产生可以呈递自体抗原以产生耐受性应答的细胞。该方法代表了在自身免疫和超敏反应病症的背景下致耐受性免疫疗法的可行策略。
成纤维细胞可以从人来源获得,并且然后重新程序化为cDC2用于免疫调节目的(图2)。根据已知的cDC2的免疫作用,这些生成的cDC2可以应用于促进抗寄生虫免疫、针对细胞外病原体的免疫、在自身免疫或超敏反应的背景下对自体抗原的免疫耐受,或者甚至当与cDC1组合时促进抗肿瘤免疫。
使用病毒载体或不使用病毒载体,通过一次或重复转染,将编码DC诱导因子的核酸(例如,DNA或RNA或其构建体)引入细胞,并且基因产物的表达和/或RNA分子的翻译产生在形态学上、生物化学上和功能上与cDC2相似的细胞,如本文中所述。这些诱导的cDC2表达cDC2表面标志物CD11b。
在一个实施方案中,为了通过细胞重新程序化筛选cDC2诱导TF和cDC2诱导TF组合的效果,使用了携带DC特异性报告物(Clec9a-Cre X R26-stop-tdTomato)的小鼠胚胎成纤维细胞(MEF),其中报告物的活化用于显示cDC2诱导TF。在Clec9a-tomato报告小鼠中,tdTomato荧光蛋白仅由CDP、前-DC、cDC和pDC表达。培养中的巨噬细胞、其它免疫谱系或单核细胞衍生的DC不表达Clec9a,且因此也不表达tdTomato蛋白。分析了从Clec9a报告物小鼠中分离的脾细胞,证实78.9%的cDC2细胞(在CD11c+MHC-II+CD8a-CD11b+中门控)表达tdTomato荧光蛋白(图3)。
将双转基因Clec9a-tdTomato报告物MEF从E13.5胚胎中分离,并通过使用荧光活化细胞分选(FACS)从已经定向至造血谱系的任何污染tdTomato+或CD45+细胞中排除。
最近通过PU.1、IRF8和BATF3(PIB)的组合过表达证明了成纤维细胞向cDC1-样细胞的重新程序化。通过采用在cDC1、cDC2和pDC中表达的Clec9a-Cre X R26-stop-tdT(Clec9a-tdT)小鼠进行筛选,鉴定了这种组合(Rosa等人,2018)。在这里,将这种相同的DC报告物用来鉴定cDC2转录因子,所述转录因子通过修改报告的TF组合来指示cDC2谱系(图4A)。
通过比较cDC群体之间Spi1、Irf8和Batf3的表达,显示当与cDC1相比时,Spi1在cDC2中高表达,而Irf8和Batf3表达较少(图4B)。此外,根据功能缺失研究,PU.1丧失损害整个DC谱系的规范,表明其是DC发育的持续要求。总之,这些数据支持用于cDC2重新程序化的PU.1维持,并提示,IRF8和BATF3可以被其它TF替代。
由于已知TF的IRF家族对于DC发育、成熟和功能作用是至关重要的(Gabriele和Ozato,2007),因此关于Clec9a报告物的活化测试了与PU.1和BATF3组合的其它IRF蛋白对IRF8的替换。IRF4导致显著的tdT表达(图4C),从而提示,IRF4能够在DC重新程序化中替代IRF8。IRF4被认为是cDC2发育所必需的。另一方面,IRF8和BATF3仅被认为对cDC1发育至关重要,提示它们对于cDC2重新程序化而言可以忽略不计。但是,PU.1和IRF4的组合过表达没有导致显著的Clec9a报告物活化(0.21%,图4C和D),提示这两种TF可能是诱导cDC2重新程序化所需要的,但不足以诱导cDC2重新程序化。此外,在比较cDC1和cDC2群体之间所有Irf基因家族成员的表达时,Irf4在cDC2中明显更多地表达(图4E),进一步证明了它对cDC2重新程序化的重要性。事实上,在整个IRF家族中,只有Irf4(2.6倍)和较低程度的Irf2(1.3倍)在cDC2细胞中过多表现。
在一个实施方案中,为了筛选cDC2重新程序化TF组合,将候选TF单独地与PU.1和IRF4组合并评估Clec9a报告物活化和cDC2表面标志物CD11b的表达(图5A)。
在一个实施方案中,选择了33种cDC2诱导候选TF,因为当与cDC1和pDC相比时,它们在cDC2中特异性富集的基因表达(图5B)。将这33种候选TF与PU.1和IRF4一起单独克隆到重新程序化证明的多西环素(Dox)诱导型慢病毒载体中。
在一个实施方案中,为了筛选cDC2重新程序化TF组合,首先将候选TF与PU.1单独组合并评估Clec9a报告物活化(图12)。从这个筛选中,只有PU.1+IRF4组合导致TdT+细胞的显著百分比,这巩固了该组合作为进一步cDC2诱导组合的基线。
在一个实施方案中,候选TF的筛选鉴定出,与PU.1和IRF4组合的PRDM1或IRF2导致显著的Clec9a报告物活化(图6)和增加的tdT+CD11b+双重群体(图8)。与PU.1和IRF4组合的PRDM1也导致cDC2表面标志物CD11b的增加表达。
在一个实施方案中,与PU.1和IRF4组合的PRDM1、RBPJ、RELB、POU2F2或TGIF1导致cDC2表面标志物CD11b的增加表达(图7)。总的来说,这些数据将PRDM1、RBPJ、RELB、POU2F2和TGIF1鉴定为额外的cDC2指导TF,可能指示不同cDC2细胞状态的诱导,反映在cDC2子集中的固有多样性。
在一个实施方案中,为了评价PU.1+IRF4+PRDM1是否代表用于重新程序化的最小网络,从三个TF合并物中单独除去每种因子。这些单独TF中的每一种的除去都减少Clec9a报告物活化和CD11b表达,并且每种TF的单独表达产生低tdT和CD11b表达(图9A和B)。总的来说,该数据指示PU.1+IRF4+PRDM1是足以实现Clec9a-tdT报告物活化和CD11b表面表达的TF组合。
比较Spi1、Irf4和Prdm1单独表达,揭示了所有这三种TF在cDC2中的富集,其中当与其它DC群体相比时,PRDM1在cDC2中表现出最高表达(图10A)。Spi1、Irf4和Prdm1的组合表达也与CD8a-DC高度相关(图10B)。
进一步分析细胞表面标志物表达,13.62%的PU.1+IRF4+PRDM1转导的细胞表达MHC-II,与之相比,在PU.1+IRF8+BATF3产生的DC中的表达为14.43%(图11A和B)。在tdT+隔室内,35.58%的PU.1+IRF4+PRDM1诱导的细胞表达表面MHC-II(图11C)。另外,20.10%的tdT+细胞共表达表面MHC-II和CD45,而只有1.36%的细胞在tdT-隔室内是MHC-II+CD45+(图11D)。这些数据进一步支持PU.1+IRF4+PRDM1诱导造血和APC表型以及抗原呈递机制的获得。
还进一步评估了PU.1+IRF4+IRF2组合在DC重新程序化中的作用。每种单独TF的除去消除了Clec9a报告物活化,且每种TF的单独表达导致低TdT表达(图14A)。Spi1、Irf4和Irf2的组合表达也与CD8a-DC高度相关(图14B)。鉴于PRDM1和IRF2向PU.1和IRF4的单独添加导致产出性的Clec9a报告物活化,我们研究了这4种TF的共表达以实现潜在的协同效应。但是,PU.1、IRF4、IRF2和PRDM1的过表达消除了Clec9a报告物活化,表明PRDM1和IRF2在DC重新程序化中的交叉抑制作用(图14C)。总的来说,该数据表明PU.1+IRF4+IRF2是独立于PRDM1诱导DC表型的最小且足够的组合,表明不同的cDC2子集的诱导。
在用PU.1、IRF4和PRDM1或者PU.1、IRF4和IRF2转导后,对于两种组合,在第2天开始检测到Clec9a报告物的活化。在第9天(PU.1+IRF4+IRF2)和第10天(PU.1、IRF4和PRDM1)之间达到TdT表达的峰值(图13)。
编码转录因子组合的多顺反子构建体已被用于增加重新程序化的效率。当与单独表达(P+I4+I2)(图14E)相比时,用编码Spi1和随后的Irf4和Irf2的多顺反子构建体(PI4I2Poly)对MEF的转导导致重新程序化效率的增加(图14D)。另外,产生了编码Spi1和随后的Irf4和Prdm1(被自切割肽P2A和T2A隔开)的多顺反子构建体(PI4PPoly)(图15A)。与单独表达(P+I4+P)相比,PI4PPoly导致重新程序化效率的增加,达到12.20%,该百分比与先前描述的用于cDC1-样重新程序化的多顺反子构建体PU.1+IRF8+BATF3(PI8BPoly)相当(图15B和15C)。荧光显微术突出了由PI8BPoly和PI4PPoly组合产生的tdT+细胞的树突细胞形态学(图15C)。
DC的一个典型免疫调节特征依赖于它们分泌细胞因子的能力。促炎性cDC2已被描述为响应于TLR刺激而分泌TNF-α。另一方面,抗炎性cDC2的特征是IL-10的分泌,这进一步介导它们的免疫调节功能。在toll-样受体TLR3(PiC-聚肌苷酸:聚胞苷酸)、TLR4(LPS-脂多糖)、TLR7/TLR8(R848-瑞喹莫德)和TLR9(CpG ODN 1585)的刺激后,进行分选的PI4PPoly产生的TdT+细胞的细胞因子分泌。PU.1、IRF8和PRDM1的过表达诱导分泌促炎性肿瘤坏死因子-α(TNF-α)(其在LPS攻击后增加了2.2倍)的能力(图16)。相比之下,没有检测到抗炎细胞因子IL-10。这些结果提示,PI4P诱导细胞促炎性DC2。
为了表征生成的细胞促进CD4+ T细胞的抗原特异性增殖的功能能力,将第9天分选的PI4PPoly生成的TdT+细胞、MEF和骨髓衍生的DC(BM-DC)与OT-II CD4+ T细胞共培养,所述OT-II CD4+ T细胞表达对卵白蛋白(OVA)肽323-329特异性的T细胞受体,其在MHC-II分子的背景下呈递(图16)。当先前装载OVA肽323-339时,PIP诱导的细胞获得了诱导10.67±1.16%OT-II CD4+ T细胞的增殖(CTVlow)的能力。在LPS和R848存在的情况下,PIP诱导的细胞诱导OT-II CD4+ T细胞的略高增殖,分别为12.14±1.33%和12.23±0.54%。这些数据支持PI4P诱导的细胞在驱动CD4+ T细胞应答的MHC-II分子上加载和呈递抗原的能力(图17)。
最近关于DC异质性的更新已经鉴定了cDC2的两个不同子集,所述子集由不同的转录调节剂和不同的抗炎和促炎功能定义(Brown等人,2019年)。巧合的是,在该报告中,对这些新鉴定的子集中的每一个的首要定义性TF基因的分析强调了PRDM1是与cDC2B相关的顶级TF,cDC2B是以其促炎表型为特征的子集。因此,这些数据进一步支持在本公开内容中报道的PU.1+IRF4+PRDM1所产生的DC的促炎表型,因此类似于cDC2B表型。
在一些实施方案中,与由序列表中提供的序列所编码的参考多肽具有相同或类似活性的多肽变体或家族成员可以用在本文所述的组合物、方法和试剂盒中。通常,用于本文所述的组合物、方法和试剂盒中的编码cDC2诱导因子的特定多肽的变体将与该特定参考多核苷酸或多肽具有至少约95%、至少约96%、至少约97%、至少约98%、至少约99%或更多序列同一性,如通过本文所述的和本领域技术人员已知的序列比对程序和参数所确定的。
在一个实施方案中,智人PU.1转录因子(PU.1)、mRNA(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)和优化的密码子,或编码相同氨基酸的不同密码子,自然也考虑被对如本文所述的核酸的提及涵盖。
在一个实施方案中,智人干扰素调节因子4(IRF4)、mRNA(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)和优化的密码子,或编码相同氨基酸的不同密码子,自然也考虑被对如本文所述的核酸的提及涵盖。
在一个实施方案中,智人PR结构域锌指蛋白1(PRDM1)、mRNA(SEQ.ID.13,SEQ.ID.14,SEQ.ID.16,SEQ.ID.17)和优化的密码子,或编码相同氨基酸的不同密码子,自然也考虑被对如本文所述的核酸的提及涵盖。
在一个实施方案中,智人干扰素调节因子2(IRF2)、mRNA(SEQ.ID.19,SEQ.ID.20,SEQ.ID.22,SEQ.ID.23)和优化的密码子,或编码相同氨基酸的不同密码子,自然也考虑被对如本文所述的核酸的提及涵盖。
在一个实施方案中,智人POU 2类同源框2(POU2F2)、mRNA(SEQ.ID.25,SEQ.ID.26,SEQ.ID.28,SEQ.ID.29)和优化的密码子,或编码相同氨基酸的不同密码子,自然也考虑被对如本文所述的核酸的提及涵盖。
在一个实施方案中,智人同源框蛋白TGIF1(TGIF1)、mRNA(SEQ.ID.31,SEQ.ID.32,SEQ.ID.34,SEQ.ID.35)和优化的密码子,或编码相同氨基酸的不同密码子,自然也考虑被对如本文所述的核酸的提及涵盖。
在一个实施方案中,智人重组无毛结合蛋白抑制因子(RBPJ)、mRNA(SEQ.ID.43,SEQ.ID.44,SEQ.ID.46,SEQ.ID.47)和优化的密码子,或编码相同氨基酸的不同密码子,自然也考虑被对如本文所述的核酸的提及涵盖。
在一个实施方案中,智人转录因子RelB(RELB)、mRNA(SEQ.ID.37,SEQ.ID.38,SEQ.ID.40,SEQ.ID.41)和优化的密码子,或编码相同氨基酸的不同密码子,自然也考虑被对如本文所述的核酸的提及涵盖。
在本文提供的组合物、构建体、载体、方法和试剂盒的一些实施方案中,被使用或被选择从起始体细胞(诸如成纤维细胞或造血谱系细胞、多潜能干细胞、诱导的多能干细胞、癌症或肿瘤细胞)产生诱导的cDC2的cDC2诱导因子的数目是至少两种。在一些实施方案中,被使用或被选择的cDC2诱导因子的数目是至少3种、至少4种、至少5种、至少6种、至少7种、至少8种、至少9种、至少10种、至少11种、至少12种、至少13种、至少14种、至少15种、至少16种、至少17种、至少18种、至少19种、至少20种、至少30种、至少33种、至少35种、至少40种或更多种。
在本文所述的组合物、构建体、载体、方法和试剂盒的一些实施方案中,将编码cDC2诱导因子(诸如PU.1、IRF4、PRDM1、IRF2、RBPJ、RELB、POU2F2和TGIF1)的核酸序列或构建体插入或可操作地连接到合适的表达载体中,用于使用标准的分子生物学技术转染细胞。本文中使用的“载体”表示核酸分子,诸如为插入的核苷酸序列提供有用的生物学或生化特性的dsDNA分子,诸如本文所述的核酸构建体或替换盒。例子包括质粒、噬菌体、自主复制序列(ARS)、着丝粒和能够在体外或在宿主细胞中复制或被复制、或者将期望的核酸区段运送到宿主细胞内期望位置的其它序列。载体可以具有一个或多个限制性内切核酸酶识别位点(无论是I、II或IIs型),在所述位点处可以以可确定的方式切割序列,而不丧失载体的基本生物学功能,并且可以向其中剪接或插入核酸片段以实现其复制和克隆。载体还可以包含一个或多个重组位点,其允许两个核酸分子之间的核酸序列的交换。载体可以进一步提供引物位点(例如用于PCR)、转录和/或翻译起始和/或调节位点、重组信号、复制子、另外的选择标志物等。载体可以进一步包含一种或多种选择标志物,其适用于鉴定用该载体转化的细胞。
在本文所述的组合物、方法、构建体、载体和试剂盒的一些实施方案中,所述表达载体是病毒载体。一些病毒介导的表达方法采用逆转录病毒、腺病毒、慢病毒、疱疹病毒、痘病毒和腺相关病毒(AAV)载体,且这样的表达方法已用于基因递送并且是本领域众所周知的。
在本文所述的组合物、构建体、载体、方法和试剂盒的一些实施方案中,所述病毒载体是逆转录病毒。逆转录病毒为基因递送提供了方便平台。使用本领域已知的技术,可以将选择的基因插入载体中并包装在逆转录病毒颗粒中。然后可以将重组病毒分离并在体内或离体递送给受试者的靶细胞。已经描述了许多逆转录病毒系统。参见,例如,美国专利号5,219,740;Miller和Rosman(1989)BioTechniques 7:980-90;Miller,A.D.(1990)HumanGene Therapy 1:5-14;Scarpa等人(1991)Virology 180:849-52;Burns等人(1993)Proc.Natl.Acad.Sci.USA 90:8033-37;Boris-Lawrie和Temin(1993)Curr.Opin.Genet.Develop.3:102-09。在本文所述的组合物、方法和试剂盒的一些实施方案中,所述逆转录病毒是复制缺陷的。逆转录病毒载体系统利用以下事实:含有5′和3′LTR和包装信号的最小载体足以允许载体包装、感染和整合到靶细胞中,前提条件是,病毒结构蛋白在包装细胞系中以反式供给。用于基因转移的逆转录病毒载体的基本优势包括在大多数细胞类型中的有效感染和基因表达、精确的单个拷贝载体向靶细胞染色体DNA中的整合以及逆转录病毒基因组的易于操作。
在本文所述的组合物、构建体、载体、方法和试剂盒的一些实施方案中,所述病毒载体是基于腺病毒的表达载体。与整合到宿主基因组中的逆转录病毒不同,腺病毒在染色体外持续存在,因此将与插入诱变相关的风险降至最低(Haj-Ahmad和Graham(1986)J.Virol.57:267-74;Bett等人(1993)J.Virol.67:5911-21;Mittereder等人(1994)HumanGene Therapy 5:717-29;Seth等人(1994)J.Virol.68:933-40;Barr等人(1994)GeneTherapy 1:51-58;Berkner,K.L.(1988)BioTechniques 6:616-29;和Rich等人(1993)Human Gene Therapy 4:461-76)。腺病毒载体感染多种细胞,具有广宿主范围,表现出高感染效率,指导异源基因的高水平表达,并实现那些基因在体内的长期表达。该病毒作为无细胞病毒粒子具有完全感染性,因此不必需注射生产细胞系。就安全性而言,腺病毒与严重的人病理学无关,并且源自该病毒的重组载体可以通过病毒基因组的早期区域1(“E1”)中的缺失而使得复制缺陷。也可以相对容易地大量生产腺病毒。用于本文所述的组合物、方法和试剂盒中的腺病毒载体可以衍生自各种腺病毒血清型中的任一种,包括、但不限于腺病毒的超过40种血清型毒株(诸如血清型2、5、12、40和41)中的任一种。本文使用的腺病毒载体优选地是复制缺陷的并且含有可操作地连接至合适启动子的感兴趣的cDC2诱导因子。
在本文所述的组合物、构建体、载体、方法和试剂盒的一些实施方案中,使用一种或多种诱导型慢病毒载体引入或递送编码cDC2诱导因子诸如PU.1、IRF4、PRDM1、IRF2、RBPJ、RELB、POU2F2和TGIF1的核酸序列。在一些实施方案中,通过使细胞(所述细胞在处于诱导型启动子的控制下或可操作地连接至诱导型启动子的表达载体中具有至少一种DC诱导因子)与调节剂(例如,多西环素)或其它诱导剂接触,可以实现使用一种或多种诱导型慢病毒载体递送的cDC2诱导因子的表达的控制。当使用一些类型的诱导型慢病毒载体时,使这样的细胞与诱导剂接触诱导cDC2诱导因子的表达,而调节剂的撤去抑制表达。当使用其它类型的诱导型慢病毒载体时,调节剂的存在抑制表达,而调节剂的除去允许表达。本文中使用的术语“表达的诱导”表示,在例如诱导剂存在的情况下,或在一种或多种造成基因在细胞中的内源性表达的试剂或因子存在的情况下,基因的表达,诸如由诱导型病毒载体编码的cDC2诱导因子的表达。
在本文描述的方面的一些实施方案中,使用多西环素(Dox)诱导型慢病毒系统。与逆转录病毒不同,慢病毒能够转导静止细胞,使其适合转导更广泛种类的造血细胞类型。例如,pFUW-tetO慢病毒系统已被显示以高效地转导原代造血祖细胞。
在本文所述的方法的一些实施方案中,使用非整合载体(例如,腺病毒)引入或递送编码cDC2诱导因子、诸如PU.1(SEQ.ID.1、SEQ.ID.2、SEQ.ID.4、SEQ.ID.5)、IRF4(SEQ.ID.7、SEQ.ID.8、SEQ.ID.10、SEQ.ID.11)、PRDM1(SEQ.ID.13、SEQ.ID.14、SEQ.ID.16、SEQ.ID.17)、IRF2(SEQ.ID.19、SEQ.ID.20、SEQ.ID.22、SEQ.ID.23)、POU2F2(SEQ.ID.25、SEQ.ID.26、SEQ.ID.28、SEQ.ID.29)、TGIF1(SEQ.ID.31、SEQ.ID.32、SEQ.ID.34、SEQ.ID.35)、RBPJ(SEQ.ID.43、SEQ.ID.44、SEQ.ID.46、SEQ.ID.47)和RELB(SEQ.ID.37、SEQ.ID.38、SEQ.ID.40、SEQ.ID.41)的核酸序列。虽然整合载体(诸如逆转录病毒载体)整合到宿主细胞基因组中并可能破坏正常基因功能,但是非整合载体通过染色体外转录控制基因产物的表达。由于非整合载体不会成为宿主基因组的一部分,因此非整合载体倾向于在细胞群中瞬时表达核酸。这部分地由于非整合载体经常使得复制缺陷的事实。因此,非整合载体具有胜过逆转录病毒载体的几个优点,包括、但不限于:(1)不会破坏宿主基因组,和(2)瞬时表达,以及(3)没有剩余的病毒整合产物。用于与本文所述方法一起使用的非整合载体的一些非限制性例子包括腺病毒、杆状病毒、甲病毒、小RNA病毒和痘苗病毒。在本文所述的方法的一些实施方案中,所述非整合病毒载体是腺病毒。非整合病毒载体的其它优点包括以高滴度生产它们的能力、它们在体内的稳定性以及它们对宿主细胞的有效感染。
在一些实施方案中,用于在本文所述的组合物、方法和试剂盒中产生诱导的cDC2的核酸构建体和载体可以进一步包含一种或多种编码用于细胞的阳性和阴性选择的选择标志物的序列。这样的选择标志物序列通常可以提供对抗生素的抗性或敏感性特性,所述特性在没有引入核酸构建体的情况下通常不会在细胞中发现。选择标志物可以与选择剂诸如抗生素结合使用,以在培养物中选择表达插入的核酸构建体的细胞。编码阳性选择标志物的序列通常提供抗生素抗性,即,当阳性选择标志物序列存在于细胞的基因组中时,所述细胞对抗生素或试剂是敏感的。编码阴性选择标志物的序列通常提供对抗生素或试剂的敏感性,即,当阴性选择标志物存在于细胞的基因组中时,细胞对抗生素或试剂是敏感的。
在一些实施方案中,用于制备在本文所述的组合物、方法、及其试剂盒中的诱导的cDC2的核酸构建体和载体可以进一步包含用于构建体或其它载体遗传元件的调节、表达、稳定化的其它核酸元件,例如,本领域普通技术人员已知的启动子、增强子、TATA盒、核糖体结合位点、IRES。
在本文所述的组合物、构建体、载体、方法和试剂盒的一些实施方案中,将DC诱导因子、诸如PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)、PRDM1(SEQ.ID.13,SEQ.ID.14,SEQ.ID.16,SEQ.ID.17)、IRF2(SEQ.ID.19,SEQ.ID.20,SEQ.ID.22,SEQ.ID.23)、POU2F2(SEQ.ID.25,SEQ.ID.26,SEQ.ID.28,SEQ.ID.29)、TGIF1(SEQ.ID.31,SEQ.ID.32,SEQ.ID.34,SEQ.ID.35)、RBPJ(SEQ.ID.43,SEQ.ID.44,SEQ.ID.46,SEQ.ID.47)和RELB(SEQ.ID.37,SEQ.ID.38,SEQ.ID.40,SEQ.ID.41)作为合成的、修饰的RNA提供,或者作为合成的、修饰的RNA引入或递送到细胞中,如美国专利公开2012-0046346-A1中所述,其内容通过引用整体并入本文。在其中根据本文所述的方法使用合成的、修饰的RNA将细胞重新程序化为诱导的cDC2的那些实施方案中,所述方法可以涉及细胞的重复接触或涉及编码DC诱导因子的合成的、修饰的RNA的重复转染,诸如例如,至少2次、至少3次、至少4次、至少5次、至少6次、至少7次、至少8次、至少9次、至少10次、至少11次、至少12次、至少13次、至少14次、至少15次、至少16次、至少17次、至少18次、至少19次、至少20次、至少25次、至少30次或更多次转染。
除了一种或多种修饰的核苷之外,用于本文所述的组合物、构建体、载体、方法和试剂盒中的修饰的mRNA还可以包含本领域技术人员已知的和如在美国专利公开US 2012/0046346 A1和US 2012/0251618 A1以及PCT公开WO 2012/019168中描述的任何另外修饰。这样的其它组分包括,例如,5'帽(例如,抗反向帽类似物(ARCA)帽,其含有5′-5′-三磷酸鸟嘌呤-鸟嘌呤键,其中一个鸟嘌呤含有N7甲基基团以及3′-O-甲基基团;使用重组痘苗病毒加帽酶和重组2′-O-甲基转移酶产生的帽,其可以在mRNA的最5′侧核苷酸和鸟嘌呤核苷酸之间产生规范的5′-5′-三磷酸键,其中鸟嘌呤含有N7甲基化,且最终的5′-核苷酸含有2′-O-甲基,从而产生Cap1结构);聚腺苷酸尾巴(例如,大于30个核苷酸长度、大于35个核苷酸长度、至少40个核苷酸、至少45个核苷酸、至少55个核苷酸、至少60个核苷酸、至少70个核苷酸、至少80个核苷酸、至少90个核苷酸、至少100个核苷酸、至少200个核苷酸、至少300个核苷酸、至少400个核苷酸、至少500个核苷酸、至少600个核苷酸、至少700个核苷酸、至少800个核苷酸、至少900个核苷酸、至少1000个核苷酸或更长的聚-A尾巴);Kozak序列;3′非翻译区(3′UTR);5′非翻译区(5′UTR);一种或多种能够从核酸中切除的内含子核苷酸序列,或它们的任意组合。
在一个实施方案中,用于本文所述的组合物、构建体、载体、方法和试剂盒中的修饰的mRNA可以进一步包含内部核糖体进入位点(IRES)。IRES可以充当唯一的核糖体结合位点,或可以充当mRNA的多个核糖体结合位点之一。含有多于一个功能性核糖体结合位点的mRNA可以编码几种肽或多肽,诸如本文所述的cDC2诱导因子,它们由核糖体(“多顺反子mRNA”)独立翻译。当给核酸提供IRES时,进一步任选地提供第二可翻译区。可以根据本公开内容使用的IRES序列的例子包括、但不限于来自小RNA病毒(例如FMDV)、害虫病毒(CFFV)、脊髓灰质炎病毒(PV)、脑心肌炎病毒(ECMV)、口蹄疫病毒(FMDV)、丙型肝炎病毒(HCV)、经典猪瘟病毒(CSFV)、鼠白血病病毒(MLV)、猿猴免疫缺陷病毒(SW)或蟋蟀麻痹病毒(CrPV)的那些。
在本文所述的组合物、构建体、载体、方法和试剂盒的一些实施方案中,所述合成的、修饰的RNA分子包含至少一种修饰的核苷。在本文所述的组合物、方法和试剂盒的一些实施方案中,所述合成的、修饰的RNA分子包含至少两种修饰的核苷。
在本文所述的组合物、构建体、载体、方法和试剂盒的一些实施方案中,所述修饰的核苷选自:5-甲基胞嘧啶(5mC)、N6-甲基腺苷(m6A)、3,2′-O-二甲基尿苷(m4U)、2-硫代尿苷(s2U)、2′氟代尿苷、假尿苷、2′-O-甲基尿苷(Um)、2′脱氧尿苷(2′dU)、4-硫代尿苷(s4U)、5-甲基尿苷(m5U)、2′-O-甲基腺苷(m6A)、N6,2′-O-二甲基腺苷(m6Am)、N6,N6,2′-O-三甲基腺苷(m62Am)、2′-O-甲基胞苷(Cm)、7-甲基鸟苷(m7G)、2′-O-甲基鸟苷(Gm)、N2,7-二甲基鸟苷(m2,7G)、N2,N2,7-三甲基鸟苷(m2,2,7G)和肌苷(I)。在一些实施方案中,所述修饰的核苷是5-甲基胞嘧啶(5mC)、假尿嘧啶或它们的组合。
修饰的mRNA不需要沿分子的整个长度进行均匀修饰。不同的核苷酸修饰和/或主链结构可以存在于核酸中的不同位置。本领域普通技术人员将理解,核苷酸类似物或其它修饰可以位于核酸的任何位置,使得核酸的功能不会显著降低。修饰也可以是5'或3'末端修饰。核酸可以含有最少一个且最多100%的修饰的核苷酸,或任何中间百分比,诸如至少50%修饰的核苷酸、至少80%修饰的核苷酸或至少90%修饰的核苷酸。
在一些实施方案中,优选但并非绝对必要的是,给定的核苷在分子中的每次出现都被修饰(例如,每个胞嘧啶是修饰的胞嘧啶,例如5-甲基胞嘧啶,每个尿嘧啶是修饰的尿嘧啶,例如,假尿嘧啶,等)。例如,修饰的mRNA可以包含修饰的嘧啶诸如尿嘧啶或胞嘧啶。在一些实施方案中,核酸中至少25%、至少50%、至少80%、至少90%或100%的尿嘧啶被修饰的尿嘧啶替代。还考虑到相同核苷的不同出现可以在给定的合成的、修饰的RNA分子中以不同的方式修饰。修饰的尿嘧啶可以被具有单一独特结构的化合物替代,或者可以被具有不同结构(例如,2、3、4种或更多种独特结构)的多种化合物替代。在一些实施方案中,核酸中至少25%、至少50%、至少80%、至少90%或100%的胞嘧啶可以被修饰的胞嘧啶替代。修饰的胞嘧啶可以被具有单一独特结构的化合物替代,或者可以被具有不同结构(例如,2、3、4种或更多种独特结构)的多种化合物替代(例如,一些胞嘧啶修饰为5mC,其它修饰为2′-O-甲基胞嘧啶或其它胞嘧啶类似物)。这样的多修饰的合成RNA分子可以通过使用包含所有期望的修饰核苷的核糖核苷共混物或混合物来产生,使得当合成RNA分子时,仅期望的修饰核苷被掺入所得编码cDC2诱导因子的RNA分子中。
在某些实施方案中,期望的是,在细胞内降解引入细胞中的修饰核酸,例如如果期望精确的蛋白产生时机。因此,在本文所述的组合物、方法和试剂盒的一些实施方案中,本文提供了修饰的核酸,其包含能够在细胞内以定向方式被起作用的降解结构域。
虽然应理解诱导的cDC2可以通过以核酸(DNA或RNA)或氨基酸序列的形式递送cDC2诱导因子来产生,但是在本文所述的组合物、构建体、载体、方法和试剂盒的一些实施方案中,诱导的cDC2可以使用其它方法来诱导,例如,通过用试剂(诸如小分子或小分子混合物,其诱导一种或多种cDC2诱导因子的表达)处理细胞。
使用本文所述的组合物、构建体、载体、方法和试剂盒检测引入细胞中或在细胞群中诱导的cDC2诱导因子的表达,可以通过本领域技术人员已知的几种技术中的任一种来实现,所述技术包括、例如Western印迹分析、免疫细胞化学和荧光介导的检测。
为了区分DC诱导因子的给定组合是否已产生诱导的cDC2,可以测量一种或多种DC活性或参数,诸如,在一些实施方案中,表面抗原的差异表达。使用本文所述的组合物、方法和试剂盒产生诱导的DC,例如优选造成内源性cDC2特有的细胞表面表型(诸如CD45、MHC-II、CD11b、Sirpa、CD4、ESAM、Clec4a4、Clec10a、Clec12a和Mgl2)的出现。
DC通过其功能行为最可靠地与其它免疫细胞区分开。cDC2表型或cDC2活性的功能方面,诸如诱导的cDC2分泌细胞因子的能力,可以由本领域技术人员使用本领域已知的常规方法容易地确定。在本文描述的方面的一些实施方案中,可以使用鉴定重新程序化因子的功能测定。例如,在一些实施方案中,细胞因子分泌可以用于证实使用本文所述的组合物、构建体、载体、方法和试剂盒产生的诱导的cDC2的免疫调节特性。
本文中使用的“细胞参数”、“DC参数”或“细胞因子分泌”表示内源或天然DC的可测量组分或质量,特别是可以准确测量的组分。细胞参数可以是与细胞的表型、功能或行为相关的任何可测量参数。这样的细胞参数包括DC或DC群体的特征和标志物的变化,包括、但不限于活力、细胞生长、一种或多种标志物或标志物组合的表达的变化,所述标志物诸如细胞表面决定簇诸如受体、蛋白(包括其构象或翻译后修饰)、脂质、碳水化合物、有机或无机分子、核酸(例如mRNA、DNA)、总基因表达模式等。使用本领域技术人员已知的多种测定中的任一种,可以测量这样的细胞参数。例如,通过测定诸如锥虫蓝排除、CFSE稀释和3H-胸苷掺入,可以测量活力和细胞生长。例如,使用流式细胞计数测定、Western印迹技术或显微术方法,可以测量蛋白或多肽标志物的表达。例如,使用RNA-测序方法和定量或半定量实时PCR测定,可以测定基因表达概况。细胞参数还可以表示功能参数或功能活性。虽然大多数细胞参数将提供定量读出值,但在一些情况下,半定量或定性结果可以是可接受的。读出值可以包括单个确定值,或可以包括平均值、中位值或方差等。特有地,可以从多个相同测定中为每种参数获得一系列参数读出值。预见到变异性,并且将使用标准统计方法和用于提供单个值的通用统计方法来获得每组测试参数的一系列值。
在本文所述的组合物、方法和试剂盒的一些实施方案中,可以使用另外的因子和试剂来增强诱导的cDC2重新程序化。例如,修饰表观遗传途径的因子和试剂可以用于促进重新程序化为诱导的cDC2。
根据本文描述的组合物、方法和试剂盒,基本上任何原代体细胞类型都可以用于产生诱导的cDC2或将体细胞重新程序化为诱导的cDC2。这样的原代体细胞类型还包括其它干细胞类型,包括多能干细胞,诸如诱导的多能干细胞(iPS细胞);其它多潜能干细胞;寡能干细胞;和单能干细胞。可用于本文所述方法的各个方面和实施方案中的原代体细胞的一些非限制性例子包括、但不限于成纤维细胞、上皮细胞、内皮细胞、神经元细胞、脂肪细胞、心脏细胞、骨骼肌细胞、造血细胞或免疫细胞、肝细胞、脾细胞、肺细胞、循环血细胞、胃肠细胞、肾细胞、骨髓细胞和胰腺细胞,以及衍生出那些细胞的干细胞。所述细胞可以是从任何体组织分离的原代细胞,所述体组织包括、但不限于脾、骨髓、血液、脑、肝、肺、肠道、胃、肠、脂肪、肌肉、子宫、皮肤、脾、内分泌器官、骨骼等。在一些实施方案中,术语“体细胞”进一步涵盖在培养物中生长的原代细胞,前提条件是,体细胞不是永生化的。当将细胞维持在体外条件下时,可以使用常规的组织培养条件和方法,并且它们是本领域技术人员已知的。各种原代体细胞的分离和培养方法完全是在本领域技术人员的能力范围内。
在本文描述的这些方面和所有这样的方面的一些实施方案中,所述体细胞是成纤维细胞。
在本文描述的这些方面和所有这样的方面的一些实施方案中,所述体细胞可以是造血谱系细胞。
在本文描述的这些方面和所有这样的方面的一些实施方案中,所述体细胞可以是癌细胞或肿瘤细胞。
在本文所述的组合物、方法和试剂盒的一些实施方案中,待重新程序化或制成诱导的cDC2细胞的体细胞是造血来源的细胞。本文中使用的术语“造血衍生的细胞”、“造血的衍生的分化细胞”、“造血谱系细胞”和“造血来源的细胞”表示衍生自或分化自多潜能造血干细胞(HSC)的细胞。因此,用于与本文所述的组合物、方法和试剂盒一起使用的造血谱系细胞包括多潜能、寡能和谱系限制的造血祖细胞、粒细胞(例如,早幼粒细胞、嗜中性粒细胞、嗜酸性粒细胞、嗜碱性粒细胞)、红细胞(例如,网织红细胞、红细胞)、凝血细胞(例如,幼巨核细胞、产生血小板的巨核细胞、血小板)、单核细胞(例如,单核细胞、巨噬细胞)、树突细胞和淋巴细胞(例如,T-淋巴细胞(其携带T-细胞受体(TCR))、B-淋巴细胞或B细胞(其表达免疫球蛋白并产生抗体)、NK细胞、NKT细胞和先天性淋巴细胞)。本文中使用的术语“造血祖细胞”表示能够分化为造血系统的两种或更多种细胞类型的多潜能、寡能和谱系限制的造血细胞,包括、但不限于,粒细胞、单核细胞、红细胞、巨核细胞和淋巴细胞B-细胞和T-细胞。造血祖细胞涵盖多能祖细胞(MPP)、共同髓样祖细胞(CMP)、共同淋巴样祖细胞(CLP)、粒细胞-单核细胞祖细胞(GMP)和前巨核细胞-红细胞祖细胞。谱系限制的造血祖细胞包括巨核细胞-红细胞祖细胞(MEP)、原B细胞、前B细胞、前原B细胞、原T细胞、双阴性T细胞、原-NK细胞、前粒细胞/巨噬细胞、粒细胞/巨噬细胞祖(GMP)细胞和原-肥大细胞(ProMC)。
用于本文所述的组合物、方法和试剂盒中的造血来源的细胞可以获得自已知包含这些细胞的任何来源,诸如胎儿组织、脐带血、骨髓、外周血、动员的外周血、脾、肝、胸腺、淋巴液等。从这些来源获得的细胞可以使用本领域技术人员可接受的任何方法离体扩增,然后与本文描述的用于制备诱导的cDC2的组合物、方法和试剂盒一起使用。例如,使用本领域技术人员可接受的任何程序,可以对细胞进行分选、分级分离、处理以除去特定细胞类型,或以其它方式操作以获得用于本文所述的方法中的细胞群体。可以收集单核淋巴细胞,例如,通过如美国专利号4,690,915中所述的使用连续流动室分离器的重复淋巴细胞除去法,或使用CLP方法的亲和纯化步骤分离,诸如使用细胞计数器的流式细胞计量术、磁性分离、使用抗体或蛋白包被的珠子、亲和色谱法或固体支持亲和分离(其中根据细胞对特定蛋白或蛋白类型的表达或表达缺乏将细胞保留在基质上)、或批式纯化(使用一种或多种针对目标细胞类型特异性表达的一种或多种表面抗原的抗体)。造血来源的细胞也可以从外周血获得。在从外周血收获细胞之前,可以用细胞因子诸如例如粒细胞-集落刺激因子治疗受试者,以促进细胞从骨髓向血液隔室的迁移和/或促进目标群体的活化和/或增殖。例如,可以采用适用于鉴定表面蛋白的任何方法从异质群体分离造血来源的细胞。在一些实施方案中,获得造血来源的细胞(诸如淋巴细胞)的克隆群体。在一些实施方案中,造血来源的细胞不是克隆群体。
此外,关于本文所述的组合物、方法和试剂盒的各个方面和实施方案,可以从任何哺乳动物物种获得体细胞,非限制性例子包括鼠、牛、猿猴、猪、马、羊或人细胞。在一些实施方案中,所述体细胞是人细胞。在一些实施方案中,所述细胞来自非人生物体,诸如非人哺乳动物。
一般而言,本文所述的制备诱导的cDC2的方法涉及在本领域普通技术人员可获得和众所周知的任何培养基中培养或扩增体细胞,诸如造血来源的细胞。这样的培养基包括、但不限于Dulbecco氏改良的伊格尔培养基(DMEM)、DMEM F12培养基/>Eagle氏最低基础培养基/>F-12K培养基/>Iscove氏改良的Dulbecco氏培养基/>RPMI-1640培养基/>以及用于培养和扩增DC的无血清培养基。许多培养基也可作为含钠或不含钠的低葡萄糖制剂获得。在一些实施方案中,可以给与本文所述的方法一起使用的培养基补充一种或多种免疫刺激性细胞因子。常用生长因子包括、但不限于G-CSF、GM-CSF、TNF-α、IL-4、IL-3、Flt-3配体和kit配体。另外,在优选的实施方案中,所述免疫刺激性细胞因子选自白介素(例如,IL-1α、IL-1β、IL-2、IL-3、IL-4、IL-6、IL-8、IL-9、IL-10、IL-12、IL-18、IL-19、IL-20)、干扰素(例如,IFN-α、IFN-β、IFN-γ)、肿瘤坏死因子(TNF)、转化生长因子-β(TGF-β)、粒细胞集落刺激因子(G-CSF)、巨噬细胞集落刺激因子(M-CSF)、粒细胞-巨噬细胞集落刺激因子(GM-CSF)、Flt-3配体和kit配体。
例如,可以将培养中的细胞维持在悬浮液中或附着至固体支持物(诸如细胞外基质组分),或铺板在饲养细胞上。在本文所述的方法中使用的细胞可以需要促进它们附着到固体支持物的额外因子,在一些实施方案中,诸如I型和II型胶原、硫酸软骨素、纤连蛋白、“超纤连蛋白”和纤连蛋白样聚合物、明胶、聚-D和聚-L-赖氨酸、凝血酶敏感蛋白和玻璃体结合蛋白。在一些实施方案中,细胞适合在悬浮培养物中生长。适合悬浮的宿主细胞通常是单分散的或以松散聚集体形式生长而没有大量聚集。适合悬浮的宿主细胞包括适用于悬浮培养而无需适应或操作的细胞(例如,造血来源的细胞,诸如淋巴样细胞)和通过附着依赖性细胞(例如,上皮细胞、成纤维细胞)的改变或适应而成为适合悬浮的细胞。
在本文描述的这些方面和所有这样的方面的一些实施方案中,所述分离的诱导的cDC2进一步包含药学上可接受的载体以施用给有需要的受试者。
在一些方面,本文还提供了治疗需要治疗的受试者以诱导抗原特异性免疫应答以消除癌细胞或传染原或产生对自体抗原的免疫耐受的方法,其使用本文所述的cDC2诱导组合物和制备诱导的cDC2的方法,或使用分离的诱导的cDC2及其细胞克隆,其使用本文所述的DC诱导因子、DC诱导组合物或制备诱导的cDC2的方法的任意组合产生。在这样的治疗方法中,可以首先从受试者中分离体细胞,诸如成纤维细胞或造血谱系细胞,并且如本文所述用分别包含表达载体或合成mRNA的DC诱导组合物转导或转染分离的细胞。然后可以将使用本文所述的cDC2诱导因子、cDC2诱导组合物或制备诱导的cDC2的方法的任何组合产生的分离的诱导的cDC2施用给受试者,诸如通过将诱导的cDC2全身注射给受试者。
在一些方面,本文还提供了使用本文描述的cDC2诱导组合物以及cDC2诱导因子的任何组合治疗需要治疗的受试者以诱导抗原特异性免疫应答以消除癌细胞或传染原的方法。在这样的治疗方法中,如本文所述,用包含表达载体的cDC2诱导组合物转导癌细胞。可以首先将癌细胞从受试者中分离,用包含表达载体的cDC2诱导组合物转导,并且然后施用给受试者,诸如通过全身注射。可替换地,可以用包含病毒表达载体的cDC2诱导组合物原位或在体内转导癌细胞。
在本文所述的治疗方法的一些实施方案中,可以将使用本文所述的组合物、方法和试剂盒产生的重新程序化的诱导的cDC2直接使用或施用给需要免疫疗法的受试者。因此,本文所述方法的各个实施方案涉及将有效量的使用本文所述的组合物、载体、方法和试剂盒中的任一种产生的诱导的cDC2或诱导的cDC2群体施用给需要细胞疗法的个体或受试者。被施用的细胞或细胞群体可以是自体群体,或衍生自一种或多种异源来源。此外,可以以允许它们迁移到淋巴结并活化效应T细胞的方式施用这样的诱导的cDC2。
在本文所述的治疗方法的一些实施方案中,使用本文所述的组合物、方法和试剂盒产生的重新程序化诱导的cDC2可以直接使用或施用给患有自身免疫病症或超敏反应病症的受试者。因此,本文所述方法的各个实施方案涉及将有效量的使用本文所述的组合物、方法和试剂盒中的任一种产生的诱导的cDC2或诱导的cDC2群体施用给需要细胞疗法的个体或受试者。被施用的细胞或细胞群体可以是自体群体或衍生自一种或多种异源来源。此外,这样的诱导的cDC2可以装载自体抗原并以允许它们迁移至胸腺并促进自体反应性T细胞的阴性选择、迁移至淋巴结并限制效应T细胞或促进Treg分化的方式施用。
本领域技术人员已知多种将细胞施用给受试者的方法。这样的方法可以包括全身注射,例如,静脉内注射,或者将细胞植入进受试者的靶部位。可以将细胞插入递送装置中,该递送装置有助于通过注射或植入进受试者而引入。这样的递送装置可以包括用于将细胞和流体注射到接受受试者的身体中的管子,例如导管。在一个优选的实施方案中,所述管子还另外具有针头,例如,通过该针头可以将细胞在期望的位置处引入受试者中。细胞可以以多种不同的形式制备用于递送。例如,当被包含在这样的递送装置中时,可以将细胞悬浮在溶液或凝胶中或嵌入支持基质中。可以将细胞与细胞在其中保持活力的药学上可接受的载体或稀释剂混合。
因此,通过本文所述的方法产生的细胞可以用于制备细胞以治疗或减轻几种癌症和肿瘤,包括、但不限于乳腺癌、前列腺癌、淋巴瘤、皮肤癌、胰腺癌、结肠癌、黑素瘤、恶性黑素瘤、卵巢癌、脑癌、原发性脑癌、头颈癌、神经胶质瘤、胶质母细胞瘤、肝癌、膀胱癌、非小细胞肺癌、头或颈癌、乳腺癌、卵巢癌、肺癌、小细胞肺癌、威尔曼瘤、宫颈癌、睾丸癌、膀胱癌、胰腺癌、胃癌、结肠癌、前列腺癌、生殖泌尿癌、甲状腺癌、食管癌、骨髓瘤、多发性骨髓瘤、肾上腺癌、肾细胞癌、子宫内膜癌、肾上腺皮质癌、恶性胰腺胰岛素瘤、恶性类癌、绒毛膜癌、蕈样肉芽肿病、恶性高钙血症、宫颈增生、白血病、急性淋巴细胞白血病、慢性淋巴细胞白血病、急性髓性白血病、慢性髓性白血病、慢性粒细胞白血病、急性粒细胞白血病、毛细胞白血病、神经母细胞瘤、横纹肌肉瘤、卡波西氏肉瘤、真性红细胞增多症、原发性血小板增多症、霍奇金病、非霍奇金淋巴瘤、软组织肉瘤、成骨性肉瘤、原发性巨球蛋白血症和视网膜母细胞瘤等。
除了上述之外,本公开内容的方法可以用于预防或消除已知会诱发某些癌症的病原体的感染。用于本文提供的癌症疫苗中的特别感兴趣的病原体包括乙型肝炎病毒(肝细胞癌)、丙型肝炎病毒(肝细胞瘤)、爱泼斯坦-巴尔病毒(EBV)(伯基特淋巴瘤、鼻咽癌、免疫抑制个体中的PTLD)、HTLVL(成体T细胞白血病)、致癌人乳头瘤病毒16、18、33、45型(成人宫颈癌)和细菌幽门螺杆菌(Helicobacter pylori)(B细胞胃淋巴瘤)。在文献中广泛描述了可以在哺乳动物和更特别是人类中充当抗原的其它医学相关微生物,例如,C.G.A Thomas,Medical Microbiology,Bailliere Tindall,(1983)。
除了上述之外,本公开内容的方法可以用于病毒感染。示例性的病毒病原体包括、但不限于感染哺乳动物和更特别是人类的感染性病毒。感染性病毒的例子包括、但不限于:逆转录病毒科(例如,人免疫缺陷病毒,诸如HIV-I(也被称作HTLV-III,LAV或HTLV-III/LAV,或HIV-III;和其它分离株,诸如HIV-LP);细小RNA病毒科(例如脊髓灰质炎病毒、甲型肝炎病毒;肠道病毒、人柯萨奇病毒、鼻病毒、埃可病毒);杯状病毒科(例如引起胃肠炎的毒株);披膜病毒科(例如马脑炎病毒、风疹病毒);黄病毒科(例如登革热病毒、脑炎病毒、黄热病病毒);冠状病毒科(例如冠状病毒诸如SARS冠状病毒);弹状病毒科(例如水疱性口炎病毒、狂犬病病毒);纤丝病毒科(例如埃博拉病毒);副粘病毒(例如副流感病毒、腮腺炎病毒、麻疹病毒、呼吸道合胞体病毒);正粘病毒科(例如流感病毒);布尼亚病毒科(例如汉坦病毒、bunga病毒、白蛉病毒和Nairo病毒);沙粒病毒科(出血热病毒);呼肠孤病毒科(例如里奥病毒、环状病毒和轮状病毒);双RNA病毒科;嗜肝病毒科(乙型肝炎病毒);细小病毒科(细小病毒);乳多空病毒科(乳头状瘤病毒、多瘤病毒);腺病毒科(大多数腺病毒);疱疹病毒科单纯疱疹病毒(HSV)1和2、水痘带状疱疹病毒、巨细胞病毒(CMV)、疱疹病毒;痘病毒科(天花病毒、痘苗病毒、痘病毒);和虹彩病毒科(例如非洲猪瘟病毒);和未分类的病毒(例如海绵状脑病的病原体、丁型肝炎的病原体(被认为是乙型肝炎病毒的缺陷卫星)、非甲非乙型肝炎的病原体(1类=血液(internally)传播;2类=胃肠外传播(即丙型肝炎);诺瓦克病毒(Norwalk)和有关的病毒和星状病毒)。
除了上述之外,本公开内容的方法可以用于靶向脊椎动物中的革兰氏阴性和革兰氏阳性细菌。这样的革兰氏阳性细菌包括、但不限于巴斯德氏菌属种(Pasteurella sp.)、葡萄球菌属种(Staphylococci sp.)和链球菌属种(Streptococcus sp.)。革兰氏阴性细菌包括、但不限于大肠杆菌(Escherichia coli)、假单胞菌属种(Pseudomonas sp.)和沙门氏菌属种(Salmonella sp.)。感染性细菌的具体例子包括、但不限于:幽门螺杆菌(Helicobacter pyloris)、伯氏疏螺旋体(Borella burgdorferi)、嗜肺军团菌(Legionella pneumophilia)、分枝杆菌属种(例如结核分枝杆菌(M.tuberculosis)、鸟分枝杆菌(M.avium)、胞内分枝杆菌(M.intracellulare)、堪萨斯分枝杆菌(M.kansaii)、戈登分枝杆菌(M.gordonae))、金黄色葡萄球菌(Staphylococcus aureus)、淋病奈瑟球菌(Neisseria gonorrhoeae)、脑膜炎奈瑟球菌(Neisseria meningitidis)、单核细胞增生李斯特菌(Listeria monocytogenes)、酿脓链球菌(Streptococcus pyogenes)(A群链球菌)、无乳链球菌(Streptococcus agalactiae)(B群链球菌)、链球菌属(viridans group)、粪链球菌(Streptococcus faecalis)、牛链球菌(Streptococcus bovis)、链球菌属(厌氧消化种)、肺炎链球菌(Streptococcus pneumoniae)、病原性弯曲杆菌属种(Campylobactersp.)、肠球菌属种(Enterococcus sp.)、流感嗜血杆菌(Haemophilus infuenzae)、炭疽芽孢杆菌(Bacillus antracis)、白喉棒杆菌(Corynebacterium diphtheriae)、棒杆菌属种(Corynebacterium sp.)、猪红斑丹毒丝菌(Erysipelothrix rhusiopathiae)、产气英膜梭菌(Clostridium perfringers)、破伤风梭菌(Clostridium tetani)、产气肠杆菌(Enterobacter aerogenes)、肺炎克雷伯氏菌(Klebsiella pneumoniae)、多杀巴斯德氏菌(Pasturella multocida)、拟杆菌属种(Bacteroides sp.)、具核梭杆菌(Fusobacteriumnucleatum)、念珠状链杆菌(Streptobacillus moniliformis)、苍白密螺旋体(Treponemapallidium)、极细密螺旋体(Treponema pertenue)、钩端螺旋体属(Leptospira)、立克次氏体属(Rickettsia)和以色列放线菌(Actinomyces israelii)。
除了上述之外,本公开内容的方法可以用于靶向病原体,所述病原体包括、但不限于感染哺乳动物和更特别是人类的感染性真菌和寄生虫。感染性真菌的例子包括、但不限于:新型隐球菌(Cryptococcus neoformans)、荚膜组织胞浆菌(Histoplasmacapsulatum)、粗球孢子菌(Coccidioides immitis)、皮炎芽生菌(Blastomycesdermatitidis)、砂眼衣原体(Chlamydia trachomatis)和白色假丝酵母(Candidaalbicans)
除了上述之外,本公开内容的方法可以用于靶向寄生虫,诸如细胞内寄生虫和专性细胞内寄生虫。寄生虫的例子包括、但不限于恶性疟原虫(Plasmodium falciparum)、卵形疟原虫(Plasmodium ovale)、三日疟原虫(Plasmodium malariae)、间日疟原虫(Plasmodium vivax)、诺氏疟原虫(Plasmodium knowlesi)、果氏巴贝虫(Babesiamicroti)、分歧巴贝虫(Babesia divergens)、克氏锥虫(Trypanosoma cruzi)、刚地弓形虫(Toxoplasma gondii)、旋毛形线虫(Trichinella spiralis)、硕大利什曼原虫(Leishmania major)、杜氏利什曼原虫(Leishmania donovani)、巴西利什曼原虫(Leishmania braziliensis)、热带利什曼原虫(Leishmania tropica)、冈比亚锥虫(Trypanosoma gambiense)、罗德西亚锥虫(Trypanosoma rhodesiense)、班氏吴策线虫(Wuchereria bancrofti)、马来布鲁线虫(Brugia malayi)、帝汶布鲁线虫(Brugiatimori)、人蛔虫(Ascaris lumbricoides)、旋盘尾丝虫(Onchocerca volvulus)和曼森血吸虫(Schistosoma mansoni)。
修饰的诱导的cDC2可以用于诱导对一种或多种靶抗原的致耐受性应答,包括未来或现有的免疫应答的抑制。因此,诱导的cDC2可用于治疗或预防不期望的免疫应答,包括例如移植排斥、移植物抗宿主病、变态反应、寄生虫疾病、炎性疾病和自身免疫性疾病。可以根据本公开内容治疗或预防的移植排斥的例子包括与骨髓和器官(诸如心脏、肝、胰腺、肾、肺、眼、皮肤等)的移植相关的排斥。变态反应的例子包括季节性呼吸变态反应;对气源性变应原的变态反应诸如花粉热;通过减少血清IgE可治疗的变态反应和嗜酸性粒细胞增多症;哮喘;湿疹;动物变态反应、食物变态反应;胶乳变态反应;皮炎;或通过变应性脱敏可治疗的变态反应。可以通过本公开内容治疗或预防的自身免疫性疾病包括例如银屑病、系统性红斑狼疮、重症肌无力、僵人综合征、甲状腺炎、西德纳姆舞蹈病、类风湿性关节炎、糖尿病和多发性硬化。炎性疾病的例子包括克罗恩氏病、慢性炎性眼疾病、慢性炎性肺疾病和慢性炎性肝疾病、自身免疫性溶血性贫血、特发性白细胞减少症、溃疡性结肠炎、皮肌炎、硬皮病、混合性结缔组织病、肠易激综合征、系统性红斑狼疮(SLE)、多发性硬化、重症肌无力、Guillan-Barre综合征(抗磷脂综合征)、原发性粘液性水肿、甲状腺毒症、恶性贫血、自身免疫性萎缩性胃炎、阿狄森氏病、胰岛素依赖型糖尿病(IDDM)、古德帕斯彻氏综合征、贝赫切特综合征、舍格伦综合征、类风湿性关节炎、交感性眼炎、桥本氏病/甲状腺功能减退症、乳糜泻/疱疹样皮炎和脱髓鞘病、原发性胆汁性肝硬化、混合性结缔组织病、慢性活动性肝炎、格雷夫斯病/甲状腺机能亢进、硬皮病、慢性特发性血小板减少性紫癜、糖尿病性神经病和脓毒性休克。
药学上可接受的载体和稀释剂包括盐水、缓冲水溶液、溶剂和/或分散介质。这样的载体和稀释剂的应用是本领域众所周知的。溶液优选地是无菌的和流体。优选地,在引入细胞之前,溶液在制造和储存条件下是稳定的,并且通过使用例如对羟基苯甲酸酯、三氯叔丁醇、苯酚、抗坏血酸、硫柳汞等针对微生物诸如细菌和真菌的污染作用进行防腐。
优选的是,细胞施用模式是相对非侵袭性的,例如通过静脉内注射、通过吸入的肺部递送、局部或鼻内施用。但是,细胞施用途径将取决于待治疗的组织,并且可能包括植入。用于细胞递送的方法是本领域技术人员已知的,并且可以由医学领域的技术人员外推以与本文所述的方法和组合物一起使用。
在一些方面,本文还提供了用于制备诱导的cDC2的试剂盒,所述试剂盒包含本文描述的含有一种或多种表达载体组分的任何DC诱导组合物。
在一些方面,本文还提供了包含本文所述的一种或多种cDC2诱导因子作为组分的试剂盒,其用于本文描述的制备诱导的cDC2的方法。
因此,在一些方面,本文提供了用于制备诱导的树突细胞的试剂盒,其包含以下组分:(a)一种或多种表达载体,其编码至少1、2、3、4、5、6种或更多种选自以下的cDC2诱导因子:PU.1(SEQ.ID.1,SEQ.ID.2,SEQ.ID.4,SEQ.ID.5)、IRF4(SEQ.ID.7,SEQ.ID.8,SEQ.ID.10,SEQ.ID.11)、PRDM1(SEQ.ID.13,SEQ.ID.14,SEQ.ID.16,SEQ.ID.17)、IRF2(SEQ.ID.19,SEQ.ID.20,SEQ.ID.22,SEQ.ID.23)、POU2F2(SEQ.ID.25,SEQ.ID.26,SEQ.ID.28,SEQ.ID.29)、TGIF1(SEQ.ID.31,SEQ.ID.32,SEQ.ID.34,SEQ.ID.35)、RBPJ(SEQ.ID.43,SEQ.ID.44,SEQ.ID.46,SEQ.ID.47)、RELB(SEQ.ID.37,SEQ.ID.38,SEQ.ID.40,SEQ.ID.41),和(b)其包装和说明书。
在一些实施方案中,本文所述的试剂盒可以进一步提供合成的mRNA或一种或多种编码DC诱导因子的表达载体,其呈混合物或作为单独等分试样。
在一些实施方案中,所述试剂盒可以进一步包含增强重新程序化效率的试剂。在一些实施方案中,所述试剂盒可以进一步包含一种或多种抗体或引物试剂以检测细胞类型特异性标志物以鉴定被诱导到cDC2状态的细胞。
在一些实施方案中,所述试剂盒可以进一步包含缓冲液。在一些这样的实施方案中,所述缓冲液是pH 7.0的无RNA酶的TE缓冲液。在一些实施方案中,所述试剂盒进一步包含含有细胞培养基的容器。
本文描述的所有试剂盒可以进一步包含缓冲液、细胞培养基、转导或转染培养基和/或培养基补充剂。在优选的实施方案中,所述缓冲液、细胞培养基、转染培养基和/或培养基补充剂不含DNA酶和RNA酶。在一些实施方案中,在所述试剂盒中提供的合成的、修饰的RNA可以呈特定量或质量(例如,20μg)的非溶液形式,诸如低压冻干的粉末形式,使得最终用户添加合适量的缓冲液或培养基以使组分达到期望的浓度,例如,100ng/μl。
本文描述的所有试剂盒可以进一步包含装置以促进使用本文描述的试剂盒组分生成的细胞的单次施用或重复或频繁输注,诸如不可植入的递送装置,例如,针头、注射器、笔式装置,或可植入递送装置,例如,泵、半永久性支架(例如,静脉内、腹膜内、脑池内或囊内)或蓄池。在一些这样的实施方案中,所述递送装置可以包括机构以分配单位剂量的包含诱导的cDC2的药物组合物。在一些实施方案中,所述装置连续地释放组合物,例如,通过扩散。在一些实施方案中,所述装置可以包括监测受试者体内的参数的传感器。例如,所述装置可以包括例如泵和任选的相关电子元件。
在一个实施方案中,通过例如修饰体细胞、多能细胞、祖细胞或干细胞的至少一种本文所公开因子的基因表达,或通过使这些细胞类型中的任一种暴露于至少一种蛋白或产生至少一种如本文所公开蛋白的RNA,人工制造诱导的cDC2。可以进一步通过使其暴露于开启至少一种本文所公开因子的小分子来制造所述细胞。在一些方面,使用至少2、3、4、5、6种因子来制造诱导的cDC2。
在一个实施方案中,以如下方式分离并纯化小鼠胚胎成纤维细胞(MEF):使Clec9aCre/Cre动物(Schraml等人,2013)与Rosa26-stopflox-tdTomato报告物小鼠(TheJackson Laboratory)杂交以生成双重纯合的Clec9aCre/Cre RosatdTomato/tdTomato(Clec9a-tdTomato)小鼠。将所有动物圈养在受控温度(23±2℃)下,经历固定的12-h光照/黑暗周期,自由获得食物和水。
在一个实施方案中,从Clec9a-tdTomato或C57BL/6小鼠的E13.5胚胎分离MEF的原代培养物。移除头、胎儿肝脏和所有内部器官,并将剩余组织机械解离。将解剖的组织使用0.12%胰蛋白酶/0.1mM乙二胺四乙酸(EDTA)溶液(3mL/胚胎)进行酶消化,并在37℃温育15min。每个胚胎另外添加3mL相同溶液,随后是另外15min温育时段。获得单细胞悬浮液并铺板于0.1%明胶包被的10-cm组织培养皿中的生长培养基中。使细胞生长2-3天直到汇合为止,用Tryple Express解离并在胎牛血清(FBS)+10%二甲亚砜(DMSO)中冷冻。在铺板用于慢病毒转导之前,分选MEF以移除残留的CD45+和tdTomato+细胞,其可以代表具有造血潜力的细胞。用于筛选和以下实验中的MEF是tdTomato-CD45-,其中纯度高于99%,且繁殖最多4代。
在一个实施方案中,将HEK293T细胞和MEF维持在生长培养基[补充有10%(v/v)FBS、2mM L-谷氨酰胺和抗生素(10μg/ml青霉素和链霉素)的Dulbecco氏改良的伊格尔培养基(DMEM)]中。将所有细胞维持在37℃和5%(v/v)CO2下。除非另有说明,否则所有组织培养试剂均来自Thermo Fisher Scientific。
在一个实施方案中,以如下方式进行病毒转导和重新程序化实验:将Clec9a-tdTomato MEF以40,000个细胞/孔的密度接种于0.1%明胶包被的6-孔平板上。将细胞以1:1FUW-TetO-TF和FUW-M2rtTA慢病毒颗粒的比率在补充有8μg/mL聚凝胺的生长培养基中温育过夜。在测试TF的组合时,应用相等MOI的每种单独病毒颗粒。将细胞在连续几天中转导两次,并且在过夜温育后,用新鲜的生长培养基更换培养基。在第二次转导后,向生长培养基中补充多西环素(1μg/mL)-第0天。每2-3天更换培养基,持续培养的持续时间。在转导后5-9天分析新出现的tdTomato+细胞。
在一个实施方案中,以如下方式进行流式细胞计量术分析:将转导的Clec9a-tdTomato MEF用TrypLE Express解离,重新悬浮于200μL PBS 5%FBS中并保持在4℃,然后在BD Accuri C6(BD Biosciences)中进行分析。为了分析MHC-II、CD45和CD11b细胞表面标志物表达,将解离的细胞与分别在PBS 5%FBS中稀释的APC-缀合物大鼠抗-小鼠I-A/I-E、抗-小鼠CD45和抗-小鼠CD11b抗体(Biolegend)一起在阻断非特异性结合的大鼠血清(1/100,GeneTex)存在的情况下在4℃温育30分钟。将细胞用PBS 5%FBS洗涤,重新悬浮于PBS5%FBS中并在BD Accuri C6中进行分析。使用FlowJo软件(FLOWJO,LLC,版本7.6)分析流式细胞计量术数据。
在一个实施方案中,以如下方式进行荧光激活细胞分选术(FACS):为了纯化Clec9a-tdTomato MEF,将细胞与在PBS 5%FBS中稀释的APC-Cy7-缀合的抗-CD45抗体(Biolegend)一起在4℃温育30分钟。随后,将MEF用PBS 5%FBS洗涤,重新悬浮在PBS 5%FBS中,并将tdTomato-CD45-MEF在BD FACSAria III(BD Biosciences)中纯化。
在一个实施方案中,以如下方式进行细胞因子分泌分析:将由PI4P过表达产生的tdT+细胞在重新程序化的第9天进行FACS分选。第二天,通过向培养基中添加LPS(100ng/mL)、PiC(1μg/mL)、R848(1μg/mL)或CpG ODN 1585(0.5μM)(Invivogen)进行过夜刺激。然后收集培养物上清液,根据生产商的说明书通过LEGENDplexTMMouse Th Cytokine Panel(13-plex)进一步分析。在BD Accuri C6中进行采集,并且然后使用LEGENDplexTM v8.0软件(BioLegend)分析数据。
在一个实施方案中,从C57BL6小鼠分离骨髓并用于产生骨髓衍生的树突细胞。简而言之,通过用研棒和研钵粉碎,从长骨(胫骨和股骨)收获总骨髓(BM)细胞。将细胞收获在补充有2%FBS的磷酸盐缓冲盐水(PBS)中,并通过70-μm细胞过滤网(BD Biosciences)过滤。将红血细胞用BD Pharm Lyse(BD Biosciences)在室温裂解8分钟。通过添加≥5体积的含2%FBS的PBS,停止裂解。在补充有Flt3l(200ng/ml)和GM-CSF(5ng/ml)的RPMI完全培养基中,将总BM细胞铺板在培养皿中(15x106个细胞/10-cm平板)。培养5天后,加入5ml完全RPMI培养基,并在第9天,将3x106个细胞重新铺板在10ml含有Flt3l和GM-CSF的新鲜培养基中。在培养15天后使用BM-DC。
在一个实施方案中,以如下方式进行抗原呈递测定:通过收获OT-II小鼠的脾,随后用Miltenyi Naive CD4+ T细胞分离试剂盒进行MACS纯化,获得CD4+ T细胞。将纯化的CD4+ T细胞用5mM CTV(Thermo Fisher)在室温标记20分钟,洗涤并计数。将FACS分选的tdT+PIP产生的细胞、MEF或BM-DC先前与OVA 323-339肽(10μg/ml)一起培养过夜。充分洗涤后,在TLR刺激LPS(100ng/mL)、PiC(1μg/mL)、R848(1μg/mL)或CpG ODN 1585(0.5μM)(Invivogen)存在或不存在的情况下,将20,000个tdT+PIP产生的细胞、MEF或BM-DC与100,000个CTV-标记的CD4+ T细胞一起在96-孔U-底培养板中共培养。培养5天后,将T细胞收集、染色和在BDLSRFortessaTM中分析。通过在活的单个TCRβ+CD4+ T-细胞中门控来确定T细胞增殖。
本领域技术人员将认识到或仅仅使用例行实验就能够确定许多与本文所述的发明的具体实施方案等效的方案。本发明的范围无意限于上面的描述,而是如所附权利要求所述。
如果没有特别排除,当在权利要求的说明书中使用单数形式的要素或特征时,也包括复数形式,反之亦然。例如,术语“一种转录因子”或“所述转录因子”还包括复数形式的“多种转录因子”或“所述转录因子”,反之亦然。在权利要求中,冠词诸如“一个/种(a)”、“一个/种(an)”和“所述”可以是指一个/种或超过一个/种,除非指示相反情形或以其它方式从上下文显而易见。如果组成员中的一个、超过一个或所有成员存在于给定的产品或过程中、在给定的产品或过程中使用、或以其它方式与给定的产品或过程相关,则认为满足了在一组的一个或多个成员之间包括“或”的权利要求或描述,除非指示相反情形或以其它方式从上下文显而易见。本发明包括这样的实施方案,其中所述组的刚好一个成员存在于给定产品或方法中、用于给定产品或方法中或者以其它方式与给定产品或方法相关。本发明还包括这样的实施方案,其中所述组成员中的超过一个或全部都存在于给定产品或方法中、用于给定产品或方法中或者以其它方式与给定产品或方法相关。
此外,应当理解,本发明涵盖所有变体、组合和排列,其中将来自一项或多项权利要求或来自说明书的相关部分的一个或多个限制、要素、条款、描述性术语等引入另一项权利要求中。例如,从属于另一项权利要求的任何权利要求可以修改为包括在从属于同一基本权利要求的任何其它权利要求中发现的一个或多个限制。
此外,当权利要求叙述一种组合物时,应当理解,包括为了本文中公开的任何目的而使用所述组合物的方法,并且包括根据本文中公开的任意制备方法来制备所述组合物的方法或本领域已知的其它方法,除非另外指出或除非本领域的普通技术人员明白会出现矛盾或不一致。
在给出范围的情况下,包括端点。此外,应当理解,除非另外指出或以其它方式从上下文和/或本领域普通技术人员的理解显而易见,否则被表达为范围的值可以采用在本发明的不同实施方案中的所述范围内的任何具体值,其精确至该范围的下限单位的十分之一,除非上下文另外清楚地指明。还应当理解,除非另外指出或以其它方式从上下文和/或本领域普通技术人员的理解显而易见,否则被表达为范围的值可以采用在给定范围内的任何子范围,其中将所述子范围的端点表达至与所述范围的下限单位的1/10相同的精确度。
本公开内容不应以任何方式被视为限于所描述的实施方案,并且本领域普通技术人员将预见到对其修改的许多可能性。
上述实施方案是可组合的。
以下权利要求进一步阐述了本公开内容的特定实施方案。
实施例
为了进一步表征在该组合物中描述的诱导的细胞及其与真正的DC子集的相似性,可能在群体水平上进行mRNA-测序。通常采用以下方式进行群体RNA-seq:用TRIzol试剂提取总RNA,通过特定RNA试剂盒(例如Takara SMARTSeq Ultra低输入RNA试剂盒)生成cDNA并进一步扩增。然后使用适当的试剂(例如Agilent高灵敏度DNA试剂盒)分析得到的cDNA。制备得到的文库之后进行cDNA标记,通过PCR添加正向和反向引物(index),并在适当的设备(例如Illumina NextSeq 500)上测序。
然后可以分析所得数据以进行差异表达分析、DC2特异性基因富集并与现有的公共可用数据集整合。
可替换地,单细胞RNA测序(scRNA-seq)可能通过提供单个细胞的表达概况来补充这种分析,这是更好地定义此处描述的生成细胞的表型和细胞状态的理想选择。
参考文献
Brown,C.C.,Gudjonson,H.,Pritykin,Y.,Deep,D.,Lavallée,V.P.,Mendoza,A.,Fromme,R.,Mazutis,L.,Ariyan,C.,Leslie,C.,Pe'er,D.,和Rudensky,A.Y.(2019).Transcriptional Basis of Mouse and Human Dendritic Cell Heterogeneity.Cell,179(4),846-863.e24.https://doi.org/10.1016/j.cell.2019.09.035
Datta,J.,Terhune,J.H.,Lowenfeld,L.,Cintolo,J.A.,Xu,S.,Roses,R.E.,和Czerniecki,B.J.(2014).Optimizing dendritic cell-based approaches for cancerimmunotherapy.The Yale Journal of Biology and Medicine,87(4),491-518.从http://www.ncbi.nlm.nih.gov/pubmed/25506283找到
Gabriele,L.,和Ozato,K.(2007).The role of the interferon regulatoryfactor(IRF)family in dendritic cell development and function.Cytokine&GrowthFactor Reviews,18(5-6),503-510.https://doi.org/10.1016/j.cytogfr.2007.06.008
Kopf,M.,Gros,G.Le,Bachmann,M.,Lamers,M.C.,Bluethmann,H.,和G.(1993).Disruption of the murine IL-4gene blocks Th2 cytokineresponses.Nature,362(6417),245-248.https://doi.org/10.1038/362245a0
Laoui,D.,Keirsse,J.,Morias,Y.,Van Overmeire,E.,Geeraerts,X.,Elkrim,Y.,…Van Ginderachter,J.A.(2016).The tumour microenvironment harboursontogenically distinct dendritic cell populations with opposing effects ontumour immunity.Nature Communications,7,13720.https://doi.org/10.1038/ncomms13720
Merad,M.,Sathe,P.,Helft,J.,Miller,J.,和Mortha,A.(2013).The dendriticcell lineage:ontogeny and function of dendritic cells and their subsets inthe steady state and the inflamed setting.Annual Review of Immunology,31,563-604.https://doi.org/10.1146/annurev-immunol-020711-074950
Mosmann,T.R.,和Coffman,R.L.(1989).TH1 and TH2 Cells:DifferentPatterns of Lymphokine Secretion Lead to Different FunctionalProperties.Annual Review of Immunology,7(1),145-173.https://doi.org/10.1146/annurev.iy.07.040189.001045
Pereira,C.-F.,Chang,B.,Gomes,A.,Bernitz,J.,Papatsenko,D.,Niu,X.,…Moore,K.A.(2016).Hematopoietic Reprogramming In Vitro Informs In VivoIdentification of Hemogenic Precursors to Definitive Hematopoietic StemCells.Developmental Cell,36(5),525-539.https://doi.org/10.1016/j.devcel.2016.02.011
Pereira,C.-F.,Chang,B.,Qiu,J.,Niu,X.,Papatsenko,D.,Hendry,C.E.,…Moore,K.(2013).Induction of a Hemogenic Program in Mouse Fibroblasts.CellStem Cell,13(2),205-218.https://doi.org/10.1016/j.stem.2013.05.024
Pereira,C.-F.,Lemischka,I.R.,和Moore,K.(2012).Reprogramming cellfates:insights from combinatorial approaches.Annals of the New York Academyof Sciences,1266(1),7-17.https://doi.org/10.1111/j.1749-6632.2012.06508.x
Plantinga,M.,Guilliams,M.,Vanheerswynghels,M.,Deswarte,K.,Branco-Madeira,F.,Toussaint,W.,…Lambrecht,B.N.(2013).Conventional and Monocyte-Derived CD11b+Dendritic Cells Initiate and Maintain T Helper 2Cell-MediatedImmunity to House Dust Mite Allergen.Immunity,38(2),322-335.https://doi.org/10.1016/j.immuni.2012.10.016
Rosa,F.F.,Pires,C.F.,Kurochkin,I.,Ferreira,A.G.,Gomes,A.M.,Palma,L.G.,…Pereira,C.-F.(2018).Direct reprogramming of fibroblasts into antigen-presenting dendritic cells.Science Immunology,3(30),eaau4292.https://doi.org/10.1126/sciimmunol.aau4292
Sakaguchi,S.(2004).N aturally A rising CD4+R egulatory T C ells forImmunologic S elf-T olerance and N egative C ontrol of I mmune Responses.Annual Review of Immunology,22(1),531-562.https://doi.org/10.1146/annurev.immunol.21.120601.141122
Schraml,B.U.,van Blijswijk,J.,Zelenay,S.,Whitney,P.G.,Filby,A.,Acton,S.E.,…Reis e Sousa,C.(2013).Genetic tracing via DNGR-1expression historydefines dendritic cells as a hematopoietic lineage.Cell,154(4),843-858.https://doi.org/10.1016/j.cell.2013.07.014
Subklewe,M.,Geiger,C.,Lichtenegger,F.S.,Javorovic,M.,Kvalheim,G.,Schendel,D.J.,和Bigalke,I.(2014).New generation dendritic cell vaccine forimmunotherapy of acute myeloid leukemia.Cancer Immunology,Immunotherapy:CII,63(10),1093-1103.https://doi.org/10.1007/s00262-014-1600-5
Takahashi,K.,Tanabe,K.,Ohnuki,M.,Narita,M.,Ichisaka,T.,Tomoda,K.,和Yamanaka,S.(2007).Induction of Pluripotent Stem Cells from Adult HumanFibroblasts by Defined Factors.Cell,131(5),861-872.https://doi.org/10.1016/j.cell.2007.11.019
Takahashi,K.,和Yamanaka,S.(2006).Induction of Pluripotent Stem Cellsfrom Mouse Embryonic and Adult Fibroblast Cultures by Defined Factors.Cell,126(4),663-676.https://doi.org/10.1016/j.cell.2006.07.024
Weaver,C.T.,Harrington,L.E.,Mangan,P.R.,Gavrieli,M.,和Murphy,K.M.(2006).Th17:An Effector CD4 T Cell Lineage with Regulatory T CellTies.Immunity,24(6),677-688.https://doi.org/10.1016/j.immuni.2006.06.002
Xie,H.,Ye,M.,Feng,R.,和Graf,T.(2004).Stepwise reprogramming of Bcellsinto macrophages.Cell,117(5),663-676.https://doi.org/10.1016/s0092-8674(04)00419-2
Xu,J.,Du,Y.,和Deng,H.(2015).Direct lineage reprogramming:strategies,mechanisms,and applications.Cell Stem Cell,16(2),119-134.https://doi.org/10.1016/j.stem.2015.01.013
Zhu,J.,和Paul,W.E.(2008).CD4 T cells:fates,functions,andfaults.Blood,112(5),1557-1569.https://doi.org/10.1182/blood-2008-05-078154
序列表
<110> BRT血液重编技术有限公司
<120> 用于将细胞重新程序化为能够呈递抗原的树突细胞2型的组合物、方法及其用途
<130> P5663PC00
<160> 48
<170> PatentIn 3.5版
<210> 1
<211> 1377
<212> DNA
<213> 智人
<220>
<221> misc_feature
<222> (1)..(1377)
<223> PU.1人- 完整序列
<400> 1
aaaatcagga acttgtgctg gccctgcaat gtcaagggag ggggctcacc cagggctcct 60
gtagctcagg gggcaggcct gagccctgca cccgccccac gaccgtccag cccctgacgg 120
ggcaccccat cctgaggggc tctgcattgg cccccaccga ggcaggggat ctgaccgact 180
cggagcccgg ctggatgtta caggcgtgca aaatggaagg gtttcccctc gtcccccctc 240
agccatcaga agacctggtg ccctatgaca cggatctata ccaacgccaa acgcacgagt 300
attaccccta tctcagcagt gatggggaga gccatagcga ccattactgg gacttccacc 360
cccaccacgt gcacagcgag ttcgagagct tcgccgagaa caacttcacg gagctccaga 420
gcgtgcagcc cccgcagctg cagcagctct accgccacat ggagctggag cagatgcacg 480
tcctcgatac ccccatggtg ccaccccatc ccagtcttgg ccaccaggtc tcctacctgc 540
cccggatgtg cctccagtac ccatccctgt ccccagccca gcccagctca gatgaggagg 600
agggcgagcg gcagagcccc ccactggagg tgtctgacgg cgaggcggat ggcctggagc 660
ccgggcctgg gctcctgcct ggggagacag gcagcaagaa gaagatccgc ctgtaccagt 720
tcctgttgga cctgctccgc agcggcgaca tgaaggacag catctggtgg gtggacaagg 780
acaagggcac cttccagttc tcgtccaagc acaaggaggc gctggcgcac cgctggggca 840
tccagaaggg caaccgcaag aagatgacct accagaagat ggcgcgcgcg ctgcgcaact 900
acggcaagac gggcgaggtc aagaaggtga agaagaagct cacctaccag ttcagcggcg 960
aagtgctggg ccgcgggggc ctggccgagc ggcgccaccc gccccactga gcccgcagcc 1020
cccgccgggc cccgccaggc ctccccgctg gccatagcat taagccctcg cccggcccgg 1080
acacagggag gacgctcccg gggcccagag gcaggactgt ggcgggccgg gcctcgcctc 1140
acccgccccc tccccccact ccaggccccc tccacatccc gcttcgcctc cctccaggac 1200
tccaccccgg ctcccggacg ccagctgggc gtcagacccc accggggcaa ccttgcagag 1260
gacgacccgg ggtactgcct tgggagtctc aagtccgtat gtaaatcaga tctcccctct 1320
cacccctccc acccattaac ctcctcccaa aaaacaagta aagttattct caatcca 1377
<210> 2
<211> 816
<212> DNA
<213> 智人
<220>
<221> misc_feature
<222> (1)..(816)
<223> PU.1人- CDS - >NM_001080547.2:195-1010智人Spi-1原癌基因(SPI1),
转录物变体1, mRNA
<400> 2
atgttacagg cgtgcaaaat ggaagggttt cccctcgtcc cccctcagcc atcagaagac 60
ctggtgccct atgacacgga tctataccaa cgccaaacgc acgagtatta cccctatctc 120
agcagtgatg gggagagcca tagcgaccat tactgggact tccaccccca ccacgtgcac 180
agcgagttcg agagcttcgc cgagaacaac ttcacggagc tccagagcgt gcagcccccg 240
cagctgcagc agctctaccg ccacatggag ctggagcaga tgcacgtcct cgataccccc 300
atggtgccac cccatcccag tcttggccac caggtctcct acctgccccg gatgtgcctc 360
cagtacccat ccctgtcccc agcccagccc agctcagatg aggaggaggg cgagcggcag 420
agccccccac tggaggtgtc tgacggcgag gcggatggcc tggagcccgg gcctgggctc 480
ctgcctgggg agacaggcag caagaagaag atccgcctgt accagttcct gttggacctg 540
ctccgcagcg gcgacatgaa ggacagcatc tggtgggtgg acaaggacaa gggcaccttc 600
cagttctcgt ccaagcacaa ggaggcgctg gcgcaccgct ggggcatcca gaagggcaac 660
cgcaagaaga tgacctacca gaagatggcg cgcgcgctgc gcaactacgg caagacgggc 720
gaggtcaaga aggtgaagaa gaagctcacc taccagttca gcggcgaagt gctgggccgc 780
gggggcctgg ccgagcggcg ccacccgccc cactga 816
<210> 3
<211> 271
<212> PRT
<213> 智人
<220>
<221> MISC_FEATURE
<222> (1)..(271)
<223> PU.1人- 蛋白>NP_001074016.1转录因子PU.1同种型1 [智人]
<400> 3
Met Leu Gln Ala Cys Lys Met Glu Gly Phe Pro Leu Val Pro Pro Gln
1 5 10 15
Pro Ser Glu Asp Leu Val Pro Tyr Asp Thr Asp Leu Tyr Gln Arg Gln
20 25 30
Thr His Glu Tyr Tyr Pro Tyr Leu Ser Ser Asp Gly Glu Ser His Ser
35 40 45
Asp His Tyr Trp Asp Phe His Pro His His Val His Ser Glu Phe Glu
50 55 60
Ser Phe Ala Glu Asn Asn Phe Thr Glu Leu Gln Ser Val Gln Pro Pro
65 70 75 80
Gln Leu Gln Gln Leu Tyr Arg His Met Glu Leu Glu Gln Met His Val
85 90 95
Leu Asp Thr Pro Met Val Pro Pro His Pro Ser Leu Gly His Gln Val
100 105 110
Ser Tyr Leu Pro Arg Met Cys Leu Gln Tyr Pro Ser Leu Ser Pro Ala
115 120 125
Gln Pro Ser Ser Asp Glu Glu Glu Gly Glu Arg Gln Ser Pro Pro Leu
130 135 140
Glu Val Ser Asp Gly Glu Ala Asp Gly Leu Glu Pro Gly Pro Gly Leu
145 150 155 160
Leu Pro Gly Glu Thr Gly Ser Lys Lys Lys Ile Arg Leu Tyr Gln Phe
165 170 175
Leu Leu Asp Leu Leu Arg Ser Gly Asp Met Lys Asp Ser Ile Trp Trp
180 185 190
Val Asp Lys Asp Lys Gly Thr Phe Gln Phe Ser Ser Lys His Lys Glu
195 200 205
Ala Leu Ala His Arg Trp Gly Ile Gln Lys Gly Asn Arg Lys Lys Met
210 215 220
Thr Tyr Gln Lys Met Ala Arg Ala Leu Arg Asn Tyr Gly Lys Thr Gly
225 230 235 240
Glu Val Lys Lys Val Lys Lys Lys Leu Thr Tyr Gln Phe Ser Gly Glu
245 250 255
Val Leu Gly Arg Gly Gly Leu Ala Glu Arg Arg His Pro Pro His
260 265 270
<210> 4
<211> 1473
<212> DNA
<213> 小家鼠
<220>
<221> misc_feature
<222> (1)..(1473)
<223> PU.1小鼠- 完整序列>NM_011355.2小家鼠脾病灶形成病毒(SFFV)
原病毒整合致癌基因(Spi1),转录物变体2, mRNA
<400> 4
aagagattta tgcaaacggg ctggggcggt gatgtcaccc caaggggact atctcccagt 60
ggcaggccct tcgataaaat caggaacttg tgctggccct gcaatgtcaa gggagggggc 120
tcacccaggg ctcctgtagc tcagggggca ggcctgagcc ctgcgtctga cccacgaccg 180
tccagtcccc cgacggggca cctggtcctg agggggatcc gccttgatcc ccaccgaagc 240
aggggatctg accaacctgg agctcagctg gatgttacag gcgtgcaaaa tggaagggtt 300
ttccctcacc gcccctccat cggatgactt ggttacttac gattcagagc tataccaacg 360
tccaatgcat gactactact ccttcgtggg cagcgatgga gaaagccata gcgatcacta 420
ctgggatttc tccgcacacc atgtccacaa caacgagttt gagaacttcc ctgagaacca 480
cttcacagag ctgcagagtg tgcagccccc gcagctacag cagctctatc gccacatgga 540
gctggaacag atgcacgtcc tcgatactcc catggtgcca ccccacaccg gcctcagtca 600
ccaggtttcc tacatgcccc ggatgtgctt cccttatcaa accttgtccc cagcccacca 660
gcagagctca gatgaggagg agggtgagag gcagagccct cccctggagg tgtctgatgg 720
agaagctgat ggcttggagc ctgggccagg tcttctgcac ggggagacag gcagcaagaa 780
aaagattcgc ctgtaccagt tcctgctgga cctgctgcgc agcggcgaca tgaaggacag 840
catctggtgg gtggacaagg acaaaggtac cttccagttc tcgtccaagc acaaggaggc 900
gctggcgcac cgctggggca tccagaaggg caaccgcaag aagatgacct accagaagat 960
ggcgcgcgcg ctgcgcaact acggcaagac aggcgaggtg aagaaagtca agaagaagct 1020
cacctaccag ttcagcggcg aggtgctggg ccgtgggggc ctggccgagc ggcgcctccc 1080
gccccactga tcgcccgcag agaccgccag gctcctggac cccgccggcc atagcattaa 1140
cccgtcgccc ggcccggaca cagggaggac attcccaggg ccgaggcagg actgggggcc 1200
cggcctcgcc ctcccatgcc cggcctggcc cgccccaccc gctttgcctc ccaccaggac 1260
tctagcccgc tccaagggcc gcctgggcct cggacctcaa ccgagggtca gcctggctta 1320
gtggccacgg tgcttccttg ggagtctggc gctggcacct ttttgtatat tgaatgcttt 1380
ttaaaaagct cttcctcccc accccctcat tagtcactaa agacaagtaa aattattgac 1440
agctattctc ccagaaaaaa aaaaaaaaaa aaa 1473
<210> 5
<211> 819
<212> DNA
<213> 小家鼠
<220>
<221> misc_feature
<222> (1)..(819)
<223> PU.1小鼠CDS >NM_011355.2:272-1090小家鼠脾病灶形成病毒(SFFV)
原病毒整合致癌基因(Spi1),转录物变体2, mRNA
<400> 5
atgttacagg cgtgcaaaat ggaagggttt tccctcaccg cccctccatc ggatgacttg 60
gttacttacg attcagagct ataccaacgt ccaatgcatg actactactc cttcgtgggc 120
agcgatggag aaagccatag cgatcactac tgggatttct ccgcacacca tgtccacaac 180
aacgagtttg agaacttccc tgagaaccac ttcacagagc tgcagagtgt gcagcccccg 240
cagctacagc agctctatcg ccacatggag ctggaacaga tgcacgtcct cgatactccc 300
atggtgccac cccacaccgg cctcagtcac caggtttcct acatgccccg gatgtgcttc 360
ccttatcaaa ccttgtcccc agcccaccag cagagctcag atgaggagga gggtgagagg 420
cagagccctc ccctggaggt gtctgatgga gaagctgatg gcttggagcc tgggccaggt 480
cttctgcacg gggagacagg cagcaagaaa aagattcgcc tgtaccagtt cctgctggac 540
ctgctgcgca gcggcgacat gaaggacagc atctggtggg tggacaagga caaaggtacc 600
ttccagttct cgtccaagca caaggaggcg ctggcgcacc gctggggcat ccagaagggc 660
aaccgcaaga agatgaccta ccagaagatg gcgcgcgcgc tgcgcaacta cggcaagaca 720
ggcgaggtga agaaagtcaa gaagaagctc acctaccagt tcagcggcga ggtgctgggc 780
cgtgggggcc tggccgagcg gcgcctcccg ccccactga 819
<210> 6
<211> 272
<212> PRT
<213> 小家鼠
<220>
<221> MISC_FEATURE
<222> (1)..(272)
<223> PU.1小鼠- 蛋白>NP_035485.1转录因子PU.1同种型2 [小家鼠]
<400> 6
Met Leu Gln Ala Cys Lys Met Glu Gly Phe Ser Leu Thr Ala Pro Pro
1 5 10 15
Ser Asp Asp Leu Val Thr Tyr Asp Ser Glu Leu Tyr Gln Arg Pro Met
20 25 30
His Asp Tyr Tyr Ser Phe Val Gly Ser Asp Gly Glu Ser His Ser Asp
35 40 45
His Tyr Trp Asp Phe Ser Ala His His Val His Asn Asn Glu Phe Glu
50 55 60
Asn Phe Pro Glu Asn His Phe Thr Glu Leu Gln Ser Val Gln Pro Pro
65 70 75 80
Gln Leu Gln Gln Leu Tyr Arg His Met Glu Leu Glu Gln Met His Val
85 90 95
Leu Asp Thr Pro Met Val Pro Pro His Thr Gly Leu Ser His Gln Val
100 105 110
Ser Tyr Met Pro Arg Met Cys Phe Pro Tyr Gln Thr Leu Ser Pro Ala
115 120 125
His Gln Gln Ser Ser Asp Glu Glu Glu Gly Glu Arg Gln Ser Pro Pro
130 135 140
Leu Glu Val Ser Asp Gly Glu Ala Asp Gly Leu Glu Pro Gly Pro Gly
145 150 155 160
Leu Leu His Gly Glu Thr Gly Ser Lys Lys Lys Ile Arg Leu Tyr Gln
165 170 175
Phe Leu Leu Asp Leu Leu Arg Ser Gly Asp Met Lys Asp Ser Ile Trp
180 185 190
Trp Val Asp Lys Asp Lys Gly Thr Phe Gln Phe Ser Ser Lys His Lys
195 200 205
Glu Ala Leu Ala His Arg Trp Gly Ile Gln Lys Gly Asn Arg Lys Lys
210 215 220
Met Thr Tyr Gln Lys Met Ala Arg Ala Leu Arg Asn Tyr Gly Lys Thr
225 230 235 240
Gly Glu Val Lys Lys Val Lys Lys Lys Leu Thr Tyr Gln Phe Ser Gly
245 250 255
Glu Val Leu Gly Arg Gly Gly Leu Ala Glu Arg Arg Leu Pro Pro His
260 265 270
<210> 7
<211> 5314
<212> DNA
<213> 智人
<220>
<221> misc_feature
<222> (1)..(5314)
<223> IRF4人- 完整序列>NM_002460.4智人干扰素调节因子4 (IRF4),
转录物变体1, mRNA
<400> 7
agtttcaccg ctcgatcttg ggacccaccg ctgccctcag ctccgagtcc agggcgagtg 60
cagagcagag cgggcggagg accccgggcg cgggcgcgga cggcacgcgg ggcatgaacc 120
tggagggcgg cggccgaggc ggagagttcg gcatgagcgc ggtgagctgc ggcaacggga 180
agctccgcca gtggctgatc gaccagatcg acagcggcaa gtaccccggg ctggtgtggg 240
agaacgagga gaagagcatc ttccgcatcc cctggaagca cgcgggcaag caggactaca 300
accgcgagga ggacgccgcg ctcttcaagg cttgggcact gtttaaagga aagttccgag 360
aaggcatcga caagccggac cctcccacct ggaagacgcg cctgcggtgc gctttgaaca 420
agagcaatga ctttgaggaa ctggttgagc ggagccagct ggacatctca gacccgtaca 480
aagtgtacag gattgttcct gagggagcca aaaaaggagc caagcagctc accctggagg 540
acccgcagat gtccatgagc cacccctaca ccatgacaac gccttaccct tcgctcccag 600
cccagcaggt tcacaactac atgatgccac ccctcgaccg aagctggagg gactacgtcc 660
cggatcagcc acacccggaa atcccgtacc aatgtcccat gacgtttgga ccccgcggcc 720
accactggca aggcccagct tgtgaaaatg gttgccaggt gacaggaacc ttttatgctt 780
gtgccccacc tgagtcccag gctcccggag tccccacaga gccaagcata aggtctgccg 840
aagccttggc gttctcagac tgccggctgc acatctgcct gtactaccgg gaaatcctcg 900
tgaaggagct gaccacgtcc agccccgagg gctgccggat ctcccatgga catacgtatg 960
acgccagcaa cctggaccag gtcctgttcc cctacccaga ggacaatggc cagaggaaaa 1020
acattgagaa gctgctgagc cacctggaga ggggcgtggt cctctggatg gcccccgacg 1080
ggctctatgc gaaaagactg tgccagagca ggatctactg ggacgggccc ctggcgctgt 1140
gcaacgaccg gcccaacaaa ctggagagag accagacctg caagctcttt gacacacagc 1200
agttcttgtc agagctgcaa gcgtttgctc accacggccg ctccctgcca agattccagg 1260
tgactctatg ctttggagag gagtttccag accctcagag gcaaagaaag ctcatcacag 1320
ctcacgtaga acctctgcta gccagacaac tatattattt tgctcaacaa aacagtggac 1380
atttcctgag gggctacgat ttaccagaac acatcagcaa tccagaagat taccacagat 1440
ctatccgcca ttcctctatt caagaatgaa aaatgtcaag atgagtggtt ttctttttcc 1500
tttttttttt tttttttttg atacggggat acggggtctt gctctgtctc ccaggctgga 1560
gtgcagtgac acaatctcag ctcactgtga cctccgcctc ctgggttcaa gagactctcc 1620
tgcctcagcc tccctggtag ctgggattac aggtgtgagc cactgcaccc acccaagaca 1680
agtgattttc attgtaaata tttgacttta gtgaaagcgt ccaattgact gccctcttac 1740
tgttttgagg aattcagaag tggagatttc agttcagcgg ttgaggagaa ttgcggcgag 1800
acaagcatgg aaaatcagtg acatctgatt ggcagatgag cttatttcaa aaggaagggt 1860
ggctttgcat ttcttgtgtt ctgtagactg ccatcattga tgatcactgt gaaaattgac 1920
caagtgatgt gtttacattt actgaaatgc gctctttaat ttgttgtaga ttaggtcttg 1980
ctggaagaca gagaaaactt gcctttcagt attgacactg actagagtga tgactgcttg 2040
taggtatgtc tgtgccattt ctcagggaag taagatgtaa attgaagaag cctcacacgt 2100
aaaagaaatg tattaatgta tgtaggagct gcagttcttg tggaagacac ttgctgagtg 2160
aaggaaatga atctttgact gaagccgtgc ctgtagcctt ggggaggccc atcccccacc 2220
tgccagcggt ttcctggtgt gggtccctct gccccgccct ccttcccatt ggctttctct 2280
ccttggcctt tcctggaagc cagttagtaa acttcctatt ttcttgagtc aaaaaacatg 2340
agcgctactc ttggatggga catttttgtc tgtcctacaa tctagtaatg tctaagtaat 2400
ggttaagttt tcttgtttct gcatcttttt gaccctcatt ctttagagat gctaaaattc 2460
ttcgcataaa gaagaagaaa ttaaggaaca taaatcttaa tacttgaact gttgcccttc 2520
tgtccaagta cttaactatc tgttcccttc ctctgtgcca cgctcctctg tttgtttggc 2580
tgtccagcga tcagccatgg cgacactaaa ggaggaggag ccggggactc ccaggctgga 2640
gagcactgcc aggacccacc actggaagca ggatggagct gactacggaa ctgcacactc 2700
agtgggctgt ttctgcttat ttcatctgtt ctatgcttcc tcgtgccaat tatagtttga 2760
cagggcctta aaattacttg gctttttcca aatgcttcta tttatagaat cccaaagacc 2820
tccacttgct taagtatacc tatcacttac atttttgtgg ttttgagaaa gtacagcagt 2880
agactggggc gtcacctcca ggccgtttct catactacag gatatttact attactccca 2940
ggatcagcag aagattgcgt agctctcaaa tgtgtgttcc tgcttttcta atggatattt 3000
taaattcatt caacaagcac ctagtaagtg cctgctgtat ccctacatta cacagttcag 3060
cctttatcaa gcttagtgag cagtgagcac tgaaacatta ttttttaatg tttaaaaagt 3120
ttctaatatt aaagtcagaa tattaataca attaatatta atattaacta cagaaaagac 3180
aaacagtaga gaacagcaaa aaaataaaaa ggatctcctt ttttcccagc ccaaattctc 3240
ctctctaaaa gtgtccacaa gaaggggtgt ttattcttcc aacacatttc acttttctgt 3300
aaatatacat aaacttaaaa agaaaacctc atggagtcat cttgcacaca ctttcatgca 3360
gtgctctttg tagctaacag tgaagattta cctcgttctg ctcagaggcc ttgctgtgga 3420
gctccactgc catgtaccca gtagggtttg acatttcatt agccatgcaa catggatatg 3480
tattgggcag cagactgtgt ttcgtgaact gcagtgatgt atacatctta tagatgcaaa 3540
gtattttggg gtatattatc ctaagggaag ataaagatga tattaagaac tgctgtttca 3600
cggggccctt acctgtgacc ctctttgctg aagaatattt aaccccacac agcacttcaa 3660
agaagctgtc ttggaagtct gtctcaggag caccctgtct tcttaattct ccaagcggat 3720
gctccatttc aattgctttg tgacttcttc ttctttgttt ttttaaatat tatgctgctt 3780
taacagtgga gctgaatttt ctggaaaatg cttcttggct ggggccacta cctcctttcc 3840
tatctttaca tctatgtgta tgttgacttt ttaaaattct gagtgatcca gggtatgacc 3900
tagggaatga actagctatg aaatactcag ggttaggaat cctagcactt gtctcaggac 3960
tctgaaaagg aacggcttcc tcattccttg tcttgataaa gtggaattgg caaactagaa 4020
tttagtttgt actcagtgga cagtgctgtt gaagatttga ggacttgtta aagagcactg 4080
ggtcatatgg aaaaaatgta tgtgtctccc aggtgcattt cttggtttat gtcttgttct 4140
tgagattttg tatatttagg aaaacctcaa gcagtaatta atatctcctg gaacactata 4200
gagaaccaag tgaccgactc atttacaact gaaacctagg aagcccctga gtcctgagcg 4260
aaaacaggag agttagtcgc cctacagaaa acccagctag actattgggt atgaactaaa 4320
aagagactgt gccatggtga gaaaaatgta aaatcctaca gtgaaatgag cagcccttac 4380
agtattgtta ccaccaaggg caggtaggta ttagtgtttg aaaaagctgg tctttgagcg 4440
agggcataaa tacagctagc cccaggggtg gaacaactct gggagtcttg ggtactcgca 4500
cctcttggct ttgttgatgc tccgccagga aggccacttg tgtgtgcgtg tcagttactt 4560
ttttagtaac aattcagatc cagtgtaaac ttccgttcat tgctctccag tcacatgccc 4620
ccacttcccc acaggtgaaa gtttttctga aagtgttggg attggttaag gtctttattt 4680
gtattacgta tctccccaag tcctctgtgg ccagctgcat ctgtctgaat ggtgcgtgaa 4740
ggctctcaga ccttacacac cattttgtaa gttatgtttt acatgccccg tttttgagac 4800
tgatctcgat gcaggtggat ctccttgaga tcctgatagc ctgttacagg aatgaagtaa 4860
aggtcagttt ttttttgtat tgattttcac agctttgagg aacatgcata agaaatgtag 4920
ctgaagtaga ggggacgtga gagaagggcc aggccggcag gccaaccctc ctccaatgga 4980
aattcccgtg ttgcttcaaa ctgagacaga tgggacttaa caggcaatgg ggtccacttc 5040
cccctcttca gcatcccccg taccccactt tctgctgaaa gaactgccag caggtaggac 5100
cccagaggcc cccaaatgaa agcttgaatt tcccctactg gctctgcgtt ttgctgagat 5160
ctgtaggaaa ggatgcttca caaactgagg tagataatgc tatgctgtcg ttggtataca 5220
tcatgaattt ttatgtaaat tgctctgcaa agcaaattga tatgtttgat aaatttatgt 5280
ttttaggtaa ataaaaactt ttaaaaattt gtta 5314
<210> 8
<211> 1356
<212> DNA
<213> 智人
<220>
<221> misc_feature
<222> (1)..(1356)
<223> IRF4人CDS >NM_002460.4:114-1469智人干扰素调节因子4 (IRF4),
转录物变体1, mRNA
<400> 8
atgaacctgg agggcggcgg ccgaggcgga gagttcggca tgagcgcggt gagctgcggc 60
aacgggaagc tccgccagtg gctgatcgac cagatcgaca gcggcaagta ccccgggctg 120
gtgtgggaga acgaggagaa gagcatcttc cgcatcccct ggaagcacgc gggcaagcag 180
gactacaacc gcgaggagga cgccgcgctc ttcaaggctt gggcactgtt taaaggaaag 240
ttccgagaag gcatcgacaa gccggaccct cccacctgga agacgcgcct gcggtgcgct 300
ttgaacaaga gcaatgactt tgaggaactg gttgagcgga gccagctgga catctcagac 360
ccgtacaaag tgtacaggat tgttcctgag ggagccaaaa aaggagccaa gcagctcacc 420
ctggaggacc cgcagatgtc catgagccac ccctacacca tgacaacgcc ttacccttcg 480
ctcccagccc agcaggttca caactacatg atgccacccc tcgaccgaag ctggagggac 540
tacgtcccgg atcagccaca cccggaaatc ccgtaccaat gtcccatgac gtttggaccc 600
cgcggccacc actggcaagg cccagcttgt gaaaatggtt gccaggtgac aggaaccttt 660
tatgcttgtg ccccacctga gtcccaggct cccggagtcc ccacagagcc aagcataagg 720
tctgccgaag ccttggcgtt ctcagactgc cggctgcaca tctgcctgta ctaccgggaa 780
atcctcgtga aggagctgac cacgtccagc cccgagggct gccggatctc ccatggacat 840
acgtatgacg ccagcaacct ggaccaggtc ctgttcccct acccagagga caatggccag 900
aggaaaaaca ttgagaagct gctgagccac ctggagaggg gcgtggtcct ctggatggcc 960
cccgacgggc tctatgcgaa aagactgtgc cagagcagga tctactggga cgggcccctg 1020
gcgctgtgca acgaccggcc caacaaactg gagagagacc agacctgcaa gctctttgac 1080
acacagcagt tcttgtcaga gctgcaagcg tttgctcacc acggccgctc cctgccaaga 1140
ttccaggtga ctctatgctt tggagaggag tttccagacc ctcagaggca aagaaagctc 1200
atcacagctc acgtagaacc tctgctagcc agacaactat attattttgc tcaacaaaac 1260
agtggacatt tcctgagggg ctacgattta ccagaacaca tcagcaatcc agaagattac 1320
cacagatcta tccgccattc ctctattcaa gaatga 1356
<210> 9
<211> 451
<212> PRT
<213> 智人
<220>
<221> MISC_FEATURE
<222> (1)..(451)
<223> IRF4人蛋白>NP_002451.2干扰素调节因子4同种型1 [智人]
<400> 9
Met Asn Leu Glu Gly Gly Gly Arg Gly Gly Glu Phe Gly Met Ser Ala
1 5 10 15
Val Ser Cys Gly Asn Gly Lys Leu Arg Gln Trp Leu Ile Asp Gln Ile
20 25 30
Asp Ser Gly Lys Tyr Pro Gly Leu Val Trp Glu Asn Glu Glu Lys Ser
35 40 45
Ile Phe Arg Ile Pro Trp Lys His Ala Gly Lys Gln Asp Tyr Asn Arg
50 55 60
Glu Glu Asp Ala Ala Leu Phe Lys Ala Trp Ala Leu Phe Lys Gly Lys
65 70 75 80
Phe Arg Glu Gly Ile Asp Lys Pro Asp Pro Pro Thr Trp Lys Thr Arg
85 90 95
Leu Arg Cys Ala Leu Asn Lys Ser Asn Asp Phe Glu Glu Leu Val Glu
100 105 110
Arg Ser Gln Leu Asp Ile Ser Asp Pro Tyr Lys Val Tyr Arg Ile Val
115 120 125
Pro Glu Gly Ala Lys Lys Gly Ala Lys Gln Leu Thr Leu Glu Asp Pro
130 135 140
Gln Met Ser Met Ser His Pro Tyr Thr Met Thr Thr Pro Tyr Pro Ser
145 150 155 160
Leu Pro Ala Gln Gln Val His Asn Tyr Met Met Pro Pro Leu Asp Arg
165 170 175
Ser Trp Arg Asp Tyr Val Pro Asp Gln Pro His Pro Glu Ile Pro Tyr
180 185 190
Gln Cys Pro Met Thr Phe Gly Pro Arg Gly His His Trp Gln Gly Pro
195 200 205
Ala Cys Glu Asn Gly Cys Gln Val Thr Gly Thr Phe Tyr Ala Cys Ala
210 215 220
Pro Pro Glu Ser Gln Ala Pro Gly Val Pro Thr Glu Pro Ser Ile Arg
225 230 235 240
Ser Ala Glu Ala Leu Ala Phe Ser Asp Cys Arg Leu His Ile Cys Leu
245 250 255
Tyr Tyr Arg Glu Ile Leu Val Lys Glu Leu Thr Thr Ser Ser Pro Glu
260 265 270
Gly Cys Arg Ile Ser His Gly His Thr Tyr Asp Ala Ser Asn Leu Asp
275 280 285
Gln Val Leu Phe Pro Tyr Pro Glu Asp Asn Gly Gln Arg Lys Asn Ile
290 295 300
Glu Lys Leu Leu Ser His Leu Glu Arg Gly Val Val Leu Trp Met Ala
305 310 315 320
Pro Asp Gly Leu Tyr Ala Lys Arg Leu Cys Gln Ser Arg Ile Tyr Trp
325 330 335
Asp Gly Pro Leu Ala Leu Cys Asn Asp Arg Pro Asn Lys Leu Glu Arg
340 345 350
Asp Gln Thr Cys Lys Leu Phe Asp Thr Gln Gln Phe Leu Ser Glu Leu
355 360 365
Gln Ala Phe Ala His His Gly Arg Ser Leu Pro Arg Phe Gln Val Thr
370 375 380
Leu Cys Phe Gly Glu Glu Phe Pro Asp Pro Gln Arg Gln Arg Lys Leu
385 390 395 400
Ile Thr Ala His Val Glu Pro Leu Leu Ala Arg Gln Leu Tyr Tyr Phe
405 410 415
Ala Gln Gln Asn Ser Gly His Phe Leu Arg Gly Tyr Asp Leu Pro Glu
420 425 430
His Ile Ser Asn Pro Glu Asp Tyr His Arg Ser Ile Arg His Ser Ser
435 440 445
Ile Gln Glu
450
<210> 10
<211> 4760
<212> DNA
<213> 小家鼠
<220>
<221> misc_feature
<222> (1)..(4760)
<223> IRF4小鼠- 完整序列>NM_013674.2小家鼠干扰素调节因子4 (Irf4),
转录物变体1, mRNA
<400> 10
tttcctagtt tcaccacttg aacttgggac cctttgctgc cctcagctaa gagtgcgggt 60
gagcgcacaa gcccaggagg aggtccgcac gcgtcatgaa cttggagacg ggcagccggg 120
gctcagagtt cggcatgagc gcagtgagct gcggcaatgg gaaactccga cagtggttga 180
tcgaccagat cgacagcggc aagtaccccg ggctggtgtg ggagaacgag gagaagagcg 240
tcttccgcat cccgtggaaa cacgcgggca agcaggacta caatcgtgag gaggacgctg 300
ccctcttcaa ggcttgggca ttgtttaaag gcaagttccg agaagggatc gacaagccag 360
atcctcctac ttggaagaca agattacgat gtgctctgaa caagagcaat gactttgagg 420
aattggtcga gaggagccag ctggatatct ctgacccata caaggtgtac aggattgttc 480
cagagggagc caaaaaagga gcaaagcagc tcactttgga tgacacacag atggccatgg 540
gccaccccta ccccatgaca gcaccttatg gctctctgcc agcccagcag gttcataact 600
acatgatgcc accccatgac aggagctgga gggattatgc ccctgaccag tcacacccag 660
aaatcccata tcaatgtcct gtgacgtttg gcccacgagg ccaccactgg caaggcccat 720
cttgtgaaaa tggttgccag gtgacaggaa ccttttatgc ttgtgcccca cctgagtccc 780
aggctcctgg aatccccatt gagccaagca taaggtctgc tgaagccttg gcgctctcag 840
actgccggct gcatatctgc ctgtattacc gggacatcct cgtgaaagag ctgaccacga 900
cgagccctga aggctgccgg atctcccacg gacacaccta tgatgttagc aacctggacc 960
aggtcctgtt tccctacccg gacgacaatg gacagaggaa gaacattgag aagttgctga 1020
gccacctgga gaggggactg gtcctctgga tggctccaga tgggctttat gccaaaagac 1080
tctgccagag taggatctac tgggatgggc ccctggcact gtgcagcgat cggcccaaca 1140
agctagaaag agaccagact tgcaagctct ttgacacaca gcagtttcta tcagagctgc 1200
aagtgtttgc tcaccatggc cggccagcac cgagattcca ggtgactctg tgctttggtg 1260
aggagtttcc agaccctcag agacagagga agctcatcac agctcatgtg gaacctctgc 1320
tagccagaca actgtattac tttgctcaac aaaacactgg acatttcctg aggggctacg 1380
agttacctga acacgttacc actccagatt accaccgctc cctccgtcat tcttccatcc 1440
aagagtgaga agaaatactc tgacagggca gccggctgct gccctttctc tttggaagag 1500
ctaagaagtg agtgggtttc cacttgaaga caacaacagg gctttgtgag gaaaaacagc 1560
tgtatctgct caacagagga gcttccccca gaagagtgcc tgtcatccag gtcttgacaa 1620
gtgccaggac ttgggtgact gtgccctggc ttataactgt gaaacttgat cagtgttgtg 1680
tttacatgta cttgaatgct ggctttagcc tggtatagat ggacttttgc ttgaagactg 1740
aaaacctgtg ccagcatgaa tccctgacaa gagaagacat acgtattatt ggtccatttc 1800
tcagggaagt aaagtctaga ctagagacta cgctgtccac tcacagagaa acgcattcct 1860
ggatagagga gctgtaggtt gcccagatca tgtccactga gtgaagggac ttggttctta 1920
ggtccagtct aggctatcat gtcctgatcc tgcactctct ctgagtagac cctacatcct 1980
gtttgctccc tcttcttgcc cttccctaaa agccagctgg tggactcttg ttgccacata 2040
cttgagccaa aaaaccatca gggacatata agatgagaca tgtttgcatc tcctctgatc 2100
taacaacggc tacaaagttt tttgttgttg ttgtctcccc tttctgtttt gcctctcatt 2160
ctttagaatt gttgcttttt tgtaaagaaa ggaagagaga gagaccgaat atactagaga 2220
tagattttta aaattctgtg ccatttcccc ttctatttgt tttgttggcc atcagtggtc 2280
cccatggcaa cactggaagg gcggagcctt tgagcaggtt gggaactgtc ccggacacca 2340
gtggttcagt gtacagctgt agactcagtg agctgtttct gcttattttg tatattagat 2400
gcttccttgt gccaataata gtttgacaga gtctcaaagc taccgggctt cttttccaac 2460
aacacattct ttcacagaat cccaaagagt ctcacttgcc ttcacaaccg tctgatcctt 2520
tattgttact gttttgttta ggtggctttg ccatgctctg ggatttttct cctaccacaa 2580
catcattcct gactctttcg ggctcagaaa gagttaagcg ctcttgcaca cttggcactt 2640
acttcttttt aatttttttt ttaacatgcc actcttaata tttcttctta agtctttcag 2700
cagcatacat tcaataagca atggaccatt gggtgtctca tttcattgtt ttatttttaa 2760
aaaggaacct ttgacaaata atacacatta cagaaaatgg aagcattttt catagagaag 2820
gtaagaactg aaatgcattt tatgttcgaa tcctccttgc tagtattagc cccaagagtt 2880
cagtctgtcc tttaagcatg ttttagtttt caatacatat atatgaagtt gccctttctt 2940
agagagagag agagagatat gtgatgacac tcttatgtac ccaactatac agatttacct 3000
catcttttga gtggcatgtg gacttccact gctatgtgcc taggtgcagt ttagacatct 3060
tcttagctac ccatcccagc agcccactga gtctcgtgaa ctaaagtcac ttatacatct 3120
tatagatgca aaacattctg gggtatatta tcccaaggga agatgaaggc aaaatcggga 3180
agcactcttc ctgaggcccc aagccacagc tgtcctcctg gagtatctta accatatctg 3240
gcacttcacc aaggtccctt gcctggtcct gtatgaccat gactcttcag tccataagca 3300
gatgctttgt ctcaattgcc atgtgaagtc ttttggtttt tggttttttt ccttaatgtt 3360
aagctgtttt aacagtgaag ctgaattttc tggaaaatgc tgtatggttc gagcccccag 3420
ttcatgtttc actttacagc catgcatgct gatgccttcc atcttgacaa atggaatgat 3480
gggtaaacac tcctgacaca aagaggaaca ccttcctcgt cccttgctga aacaaagtgc 3540
aatgcacaca gtagaattta gtttgcttgc actggaagat gccgttgaag aggtaggctg 3600
aggaacactg tatcctaggg agaaaatatg caagtctcac gggtgctttc tgttggcttg 3660
tttctggtcc ttgagcaggc ttaagtatct agtttcaaac tagaagcccc aaagccctca 3720
gtcgttgtcc tggacagagt tggtagaccc cacctgtaga aatccagatg cactctggga 3780
ttggagcaca gagaggcttg ctctatcgtg agaatgccca gcctgcagag gcccaggtgc 3840
ggtgcctgag tggctgtatg ccagacacag gaacgccagc aggctgagtg tggtgactga 3900
tatgtgaggt gcccttgccc tgcagacagg ctgaatgtga gccccagcag cagaggattc 3960
tgagagtcag ggtggtgtca tgtcctacac aggaagagcc ttcaatatta ctttttaaac 4020
gaactttccc attgctgata ttttgataaa aacccacaca aggtttaatt acatgccccc 4080
tcatccctct ccacacagtc ttgcccatcc cactggtgaa ttgttgttgt tcttgttgtt 4140
gttttttgtt attggtgaaa attgtgaaat tgttaaggtc tctgtttttg ttatttatct 4200
ccctaaatca catggctgct gcatacattg caatgttgcc ctaccacatg tatgtttttg 4260
caagttgctt cttcactcat tgaagaagcc tgcaccgtaa gtgaaactca cttagtcctt 4320
gctaattgct gctcactgtc acgtgaacgc agagaaggct gattttcctg gacccatttt 4380
cagtgctttg aggaacatga gtccagagta tattgaagta gaagacagtg tttggcaagc 4440
cagttcttat tagacactgc tacaggcttc acaatcttca aggtggaccc acctcatgtg 4500
gaagggcatc attgcccctc tgctcccatg cgctgtgcac ccccttcaca gtgaaagacc 4560
tgctggggac caggactcag agatctctgg cttcgagctc atatcttggc ttcctgtcat 4620
gtctaagccc gacaggaaag tgtggcacca tcctgcctgc ggcgttggat ggataggcca 4680
tggattttgc tgtaaacttg cttcacaagg aaagctgtgt gttttaaaat atttgataaa 4740
taaaggattt ctagaaagat 4760
<210> 11
<211> 1353
<212> DNA
<213> 小家鼠
<220>
<221> misc_feature
<222> (1)..(1353)
<223> IRF4小鼠CDS >NM_013674.2:96-1448小家鼠干扰素调节因子4 (Irf4),
转录物变体1, mRNA
<400> 11
atgaacttgg agacgggcag ccggggctca gagttcggca tgagcgcagt gagctgcggc 60
aatgggaaac tccgacagtg gttgatcgac cagatcgaca gcggcaagta ccccgggctg 120
gtgtgggaga acgaggagaa gagcgtcttc cgcatcccgt ggaaacacgc gggcaagcag 180
gactacaatc gtgaggagga cgctgccctc ttcaaggctt gggcattgtt taaaggcaag 240
ttccgagaag ggatcgacaa gccagatcct cctacttgga agacaagatt acgatgtgct 300
ctgaacaaga gcaatgactt tgaggaattg gtcgagagga gccagctgga tatctctgac 360
ccatacaagg tgtacaggat tgttccagag ggagccaaaa aaggagcaaa gcagctcact 420
ttggatgaca cacagatggc catgggccac ccctacccca tgacagcacc ttatggctct 480
ctgccagccc agcaggttca taactacatg atgccacccc atgacaggag ctggagggat 540
tatgcccctg accagtcaca cccagaaatc ccatatcaat gtcctgtgac gtttggccca 600
cgaggccacc actggcaagg cccatcttgt gaaaatggtt gccaggtgac aggaaccttt 660
tatgcttgtg ccccacctga gtcccaggct cctggaatcc ccattgagcc aagcataagg 720
tctgctgaag ccttggcgct ctcagactgc cggctgcata tctgcctgta ttaccgggac 780
atcctcgtga aagagctgac cacgacgagc cctgaaggct gccggatctc ccacggacac 840
acctatgatg ttagcaacct ggaccaggtc ctgtttccct acccggacga caatggacag 900
aggaagaaca ttgagaagtt gctgagccac ctggagaggg gactggtcct ctggatggct 960
ccagatgggc tttatgccaa aagactctgc cagagtagga tctactggga tgggcccctg 1020
gcactgtgca gcgatcggcc caacaagcta gaaagagacc agacttgcaa gctctttgac 1080
acacagcagt ttctatcaga gctgcaagtg tttgctcacc atggccggcc agcaccgaga 1140
ttccaggtga ctctgtgctt tggtgaggag tttccagacc ctcagagaca gaggaagctc 1200
atcacagctc atgtggaacc tctgctagcc agacaactgt attactttgc tcaacaaaac 1260
actggacatt tcctgagggg ctacgagtta cctgaacacg ttaccactcc agattaccac 1320
cgctccctcc gtcattcttc catccaagag tga 1353
<210> 12
<211> 450
<212> PRT
<213> 小家鼠
<220>
<221> MISC_FEATURE
<222> (1)..(450)
<223> IRF4小鼠蛋白>NP_038702.1干扰素调节因子4同种型a [小家鼠]
<400> 12
Met Asn Leu Glu Thr Gly Ser Arg Gly Ser Glu Phe Gly Met Ser Ala
1 5 10 15
Val Ser Cys Gly Asn Gly Lys Leu Arg Gln Trp Leu Ile Asp Gln Ile
20 25 30
Asp Ser Gly Lys Tyr Pro Gly Leu Val Trp Glu Asn Glu Glu Lys Ser
35 40 45
Val Phe Arg Ile Pro Trp Lys His Ala Gly Lys Gln Asp Tyr Asn Arg
50 55 60
Glu Glu Asp Ala Ala Leu Phe Lys Ala Trp Ala Leu Phe Lys Gly Lys
65 70 75 80
Phe Arg Glu Gly Ile Asp Lys Pro Asp Pro Pro Thr Trp Lys Thr Arg
85 90 95
Leu Arg Cys Ala Leu Asn Lys Ser Asn Asp Phe Glu Glu Leu Val Glu
100 105 110
Arg Ser Gln Leu Asp Ile Ser Asp Pro Tyr Lys Val Tyr Arg Ile Val
115 120 125
Pro Glu Gly Ala Lys Lys Gly Ala Lys Gln Leu Thr Leu Asp Asp Thr
130 135 140
Gln Met Ala Met Gly His Pro Tyr Pro Met Thr Ala Pro Tyr Gly Ser
145 150 155 160
Leu Pro Ala Gln Gln Val His Asn Tyr Met Met Pro Pro His Asp Arg
165 170 175
Ser Trp Arg Asp Tyr Ala Pro Asp Gln Ser His Pro Glu Ile Pro Tyr
180 185 190
Gln Cys Pro Val Thr Phe Gly Pro Arg Gly His His Trp Gln Gly Pro
195 200 205
Ser Cys Glu Asn Gly Cys Gln Val Thr Gly Thr Phe Tyr Ala Cys Ala
210 215 220
Pro Pro Glu Ser Gln Ala Pro Gly Ile Pro Ile Glu Pro Ser Ile Arg
225 230 235 240
Ser Ala Glu Ala Leu Ala Leu Ser Asp Cys Arg Leu His Ile Cys Leu
245 250 255
Tyr Tyr Arg Asp Ile Leu Val Lys Glu Leu Thr Thr Thr Ser Pro Glu
260 265 270
Gly Cys Arg Ile Ser His Gly His Thr Tyr Asp Val Ser Asn Leu Asp
275 280 285
Gln Val Leu Phe Pro Tyr Pro Asp Asp Asn Gly Gln Arg Lys Asn Ile
290 295 300
Glu Lys Leu Leu Ser His Leu Glu Arg Gly Leu Val Leu Trp Met Ala
305 310 315 320
Pro Asp Gly Leu Tyr Ala Lys Arg Leu Cys Gln Ser Arg Ile Tyr Trp
325 330 335
Asp Gly Pro Leu Ala Leu Cys Ser Asp Arg Pro Asn Lys Leu Glu Arg
340 345 350
Asp Gln Thr Cys Lys Leu Phe Asp Thr Gln Gln Phe Leu Ser Glu Leu
355 360 365
Gln Val Phe Ala His His Gly Arg Pro Ala Pro Arg Phe Gln Val Thr
370 375 380
Leu Cys Phe Gly Glu Glu Phe Pro Asp Pro Gln Arg Gln Arg Lys Leu
385 390 395 400
Ile Thr Ala His Val Glu Pro Leu Leu Ala Arg Gln Leu Tyr Tyr Phe
405 410 415
Ala Gln Gln Asn Thr Gly His Phe Leu Arg Gly Tyr Glu Leu Pro Glu
420 425 430
His Val Thr Thr Pro Asp Tyr His Arg Ser Leu Arg His Ser Ser Ile
435 440 445
Gln Glu
450
<210> 13
<211> 5148
<212> DNA
<213> 智人
<220>
<221> misc_feature
<222> (1)..(5148)
<223> PRDM1人- 完整序列>NM_001198.4智人PR/SET结构域1 (PRDM1),
转录物变体1, mRNA
<400> 13
aacacagaca aagtgctgcc gtgacactcg gccctccagt gttgcggaga ggcaagagca 60
gcgaccgcgg cacctgtccg cccggagctg ggacgcgggc gcccgggcgg ccggacgaag 120
cgaggaggga ccgccgaggt gcgcgtctgt gcggctcagc ctggcggggg acgcggggag 180
aatgtggact gggtagagat gaacgagact tttctcagat gttggatatt tgcttggaaa 240
aacgtgtggg tacgaccttg gctgccccca agtgtaactc cagcactgtg aggtttcagg 300
gattggcaga ggggaccaag gggaccatga aaatggacat ggaggatgcg gatatgactc 360
tgtggacaga ggctgagttt gaagagaagt gtacatacat tgtgaacgac cacccctggg 420
attctggtgc tgatggcggt acttcggttc aggcggaggc atccttacca aggaatctgc 480
ttttcaagta tgccaccaac agtgaagagg ttattggagt gatgagtaaa gaatacatac 540
caaagggcac acgttttgga cccctaatag gtgaaatcta caccaatgac acagttccta 600
agaacgccaa caggaaatat ttttggagga tctattccag aggggagctt caccacttca 660
ttgacggctt taatgaagag aaaagcaact ggatgcgcta tgtgaatcca gcacactctc 720
cccgggagca aaacctggct gcgtgtcaga acgggatgaa catctacttc tacaccatta 780
agcccatccc tgccaaccag gaacttcttg tgtggtattg tcgggacttt gcagaaaggc 840
ttcactaccc ttatcccgga gagctgacaa tgatgaatct cacacaaaca cagagcagtc 900
taaagcaacc gagcactgag aaaaatgaac tctgcccaaa gaatgtccca aagagagagt 960
acagcgtgaa agaaatccta aaattggact ccaacccctc caaaggaaag gacctctacc 1020
gttctaacat ttcacccctc acatcagaaa aggacctcga tgactttaga agacgtggga 1080
gccccgaaat gcccttctac cctcgggtcg tttaccccat ccgggcccct ctgccagaag 1140
actttttgaa agcttccctg gcctacggga tcgagagacc cacgtacatc actcgctccc 1200
ccattccatc ctccaccact ccaagcccct ctgcaagaag cagccccgac caaagcctca 1260
agagctccag ccctcacagc agccctggga atacggtgtc ccctgtgggc cccggctctc 1320
aagagcaccg ggactcctac gcttacttga acgcgtccta cggcacggaa ggtttgggct 1380
cctaccctgg ctacgcaccc ctgccccacc tcccgccagc tttcatcccc tcgtacaacg 1440
ctcactaccc caagttcctc ttgcccccct acggcatgaa ttgtaatggc ctgagcgctg 1500
tgagcagcat gaatggcatc aacaactttg gcctcttccc gaggctgtgc cctgtctaca 1560
gcaatctcct cggtgggggc agcctgcccc accccatgct caaccccact tctctcccga 1620
gctcgctgcc ctcagatgga gcccggaggt tgctccagcc ggagcatccc agggaggtgc 1680
ttgtcccggc gccccacagt gccttctcct ttaccggggc cgccgccagc atgaaggaca 1740
aggcctgtag ccccacaagc gggtctccca cggcgggaac agccgccacg gcagaacatg 1800
tggtgcagcc caaagctacc tcagcagcga tggcagcccc cagcagcgac gaagccatga 1860
atctcattaa aaacaaaaga aacatgaccg gctacaagac ccttccctac ccgctgaaga 1920
agcagaacgg caagatcaag tacgaatgca acgtttgcgc caagactttc ggccagctct 1980
ccaatctgaa ggtccacctg agagtgcaca gtggagaacg gcctttcaaa tgtcagactt 2040
gcaacaaggg ctttactcag ctcgcccacc tgcagaaaca ctacctggta cacacgggag 2100
aaaagccaca tgaatgccag gtctgccaca agagatttag cagcaccagc aatctcaaga 2160
cccacctgcg actccattct ggagagaaac cataccaatg caaggtgtgc cctgccaagt 2220
tcacccagtt tgtgcacctg aaactgcaca agcgtctgca cacccgggag cggccccaca 2280
agtgctccca gtgccacaag aactacatcc atctctgtag cctcaaggtt cacctgaaag 2340
ggaactgcgc tgcggccccg gcgcctgggc tgcccttgga agatctgacc cgaatcaatg 2400
aagaaatcga gaagtttgac atcagtgaca atgctgaccg gctcgaggac gtggaggatg 2460
acatcagtgt gatctctgta gtggagaagg aaattctggc cgtggtcaga aaagagaaag 2520
aagaaactgg cctgaaagtg tctttgcaaa gaaacatggg gaatggactc ctctcctcag 2580
ggtgcagcct ttatgagtca tcagatctac ccctcatgaa gttgcctccc agcaacccac 2640
tacctctggt acctgtaaag gtcaaacaag aaacagttga accaatggat ccttaagatt 2700
ttcagaaaac acttattttg tttcttaagt tatgacttgg tgagtcaggg tgcctgtagg 2760
aagtggcttg tacataatcc cagctctgca aagctctctc gacagcaaat ggtttcccct 2820
cacctctgga attaaagaag gaactccaaa gttactgaaa tctcagggca tgaacaaggc 2880
aaaggccata tatatatata tatatatatc tgtatacata ttatatatac ttatttacac 2940
ctgtgtctat atatttgccc ctgtgtattt tgaatatttg tgtggacatg tttgcatagc 3000
cttcccatta ctaagactat tacctagtca taattatttt ttcaatgata atccttcata 3060
atttattata caatttatca ttcagaaagc aataattaaa aaagtttaca atgactggaa 3120
agattccttg taatttgagt ataaatgtat ttttgtcttg tggccattct ttgtagataa 3180
tttctgcaca tctgtataag tacctaagat ttagttaaac aaatatatga cttcagtcaa 3240
cctctctctc taataatggt ttgaaaatga ggtttgggta attgccaatg ttggacagtt 3300
gatgtgttca ttcctgggat cctatcattt gaacagcatt gtacataact tgggggtatg 3360
tgtgcaggat tacccaagaa taacttaagt agaagaaaca agaaagggaa tcttgtatat 3420
ttttgttgat agttcatgtt tttcccccag ccacaatttt accggaaggg tgacaggaag 3480
gctttaccaa cctgtctctc cctccaaaag agcagaatcc tcccaccgcc ctgccctccc 3540
caccgagtcc tgtggccatt cagagcggcc acatgacttt tgcatccatt gtattatcag 3600
aaaatgtgaa gaagaaaaaa atgccatgtt ttaaaaccac tgcgaaaatt tccccaaagc 3660
ataggtggct ttgtgtgtgt gcgatttggg ggcttgagtc tgggtggtgt tttgttgttg 3720
gtttttgttg cttttttttt tttttttttt ttaatgtcaa aattgcacaa acatggtgct 3780
ctaccaggaa ggattcgagg tagataggct caggccacac tttaaaaaca aacacacaaa 3840
caacaaaaaa cgggtattct agtcatcttg gggtaaaagc gggtaatgaa cattcctatc 3900
cccaacacat caattgtatt ttttctgtaa aactcagatt ttcctcagta tttgtgtttt 3960
tacattttat ggttaattta atggaagatg aaagggcatt gcaaagttgt tcaacaacag 4020
ttacctcatt gagtgtgtcc agtagtgcag gaaatgatgt cttatctaat gatttgcttc 4080
tctagaggag aaaccgagta aatgtgctcc agcaagatag actttgtgtt attctatctt 4140
ttattctgct aagcccaaag attacatgtt ggtgttcaaa gtgtagcaaa aaatgatgta 4200
tatttataaa tctatttata ccactatatc atatgtatat atatttataa ccacttaaat 4260
tgtgagccaa gccatgtaaa agatctactt tttctaaggg caaaaaaaaa aaaaaaaaaa 4320
aaagaacact cctttctgag actttgctta atacttggtg acctcacaat cacgtcggta 4380
tgattgggca cccttgccta ctgtaagaga ccctaaaacc ttggtgcagt ggtggggacc 4440
acaaaacaac cagggaggaa gagatacatc attttttagt attaaggacc atctaagaca 4500
gctctatttt ttttttgcca ctttatgatt atgtggtcac acccaagtca cagaaataaa 4560
aaactgactt taccgctgca atttttctgt tttcctcctt actaaatact gatacattac 4620
tccaatctat tttataatta tatttgacat tttgttcaca tcaactaatg ttcacctgta 4680
gaagagaaca aatttcgaat aatccaggga aacccaagag ccttactggt cttctgtaac 4740
ttccaagact gacagctttt tatgtatcag tgtttgataa acacagtcct taactgaagg 4800
taaaccaaag catcacgttg acattagacc aaatactttt gattcccaac tactcgtttg 4860
ttctttttct ccttttgtgc tttcccatag tgagaatttt tataaagact tcttgcttct 4920
ctcaccatcc atccttctct tttctgcctc ttacatgtga atgttgagcc cacaatcaac 4980
agtggtttta ttttttcctc tactcaaagt taaaactgac caaagttact ggctttttac 5040
tttgctagaa caacaaacta tcttatgttt acatactggt ttacaatgtt atttatgtgc 5100
aaattgtcaa aatgtaaatt aaatataaat gttcatgctt taccaaaa 5148
<210> 14
<211> 2478
<212> DNA
<213> 智人
<220>
<221> misc_feature
<222> (1)..(2478)
<223> PRDM1人CDS >NM_001198.4:219-2696智人PR/SET结构域1 (PRDM1),
转录物变体1, mRNA
<400> 14
atgttggata tttgcttgga aaaacgtgtg ggtacgacct tggctgcccc caagtgtaac 60
tccagcactg tgaggtttca gggattggca gaggggacca aggggaccat gaaaatggac 120
atggaggatg cggatatgac tctgtggaca gaggctgagt ttgaagagaa gtgtacatac 180
attgtgaacg accacccctg ggattctggt gctgatggcg gtacttcggt tcaggcggag 240
gcatccttac caaggaatct gcttttcaag tatgccacca acagtgaaga ggttattgga 300
gtgatgagta aagaatacat accaaagggc acacgttttg gacccctaat aggtgaaatc 360
tacaccaatg acacagttcc taagaacgcc aacaggaaat atttttggag gatctattcc 420
agaggggagc ttcaccactt cattgacggc tttaatgaag agaaaagcaa ctggatgcgc 480
tatgtgaatc cagcacactc tccccgggag caaaacctgg ctgcgtgtca gaacgggatg 540
aacatctact tctacaccat taagcccatc cctgccaacc aggaacttct tgtgtggtat 600
tgtcgggact ttgcagaaag gcttcactac ccttatcccg gagagctgac aatgatgaat 660
ctcacacaaa cacagagcag tctaaagcaa ccgagcactg agaaaaatga actctgccca 720
aagaatgtcc caaagagaga gtacagcgtg aaagaaatcc taaaattgga ctccaacccc 780
tccaaaggaa aggacctcta ccgttctaac atttcacccc tcacatcaga aaaggacctc 840
gatgacttta gaagacgtgg gagccccgaa atgcccttct accctcgggt cgtttacccc 900
atccgggccc ctctgccaga agactttttg aaagcttccc tggcctacgg gatcgagaga 960
cccacgtaca tcactcgctc ccccattcca tcctccacca ctccaagccc ctctgcaaga 1020
agcagccccg accaaagcct caagagctcc agccctcaca gcagccctgg gaatacggtg 1080
tcccctgtgg gccccggctc tcaagagcac cgggactcct acgcttactt gaacgcgtcc 1140
tacggcacgg aaggtttggg ctcctaccct ggctacgcac ccctgcccca cctcccgcca 1200
gctttcatcc cctcgtacaa cgctcactac cccaagttcc tcttgccccc ctacggcatg 1260
aattgtaatg gcctgagcgc tgtgagcagc atgaatggca tcaacaactt tggcctcttc 1320
ccgaggctgt gccctgtcta cagcaatctc ctcggtgggg gcagcctgcc ccaccccatg 1380
ctcaacccca cttctctccc gagctcgctg ccctcagatg gagcccggag gttgctccag 1440
ccggagcatc ccagggaggt gcttgtcccg gcgccccaca gtgccttctc ctttaccggg 1500
gccgccgcca gcatgaagga caaggcctgt agccccacaa gcgggtctcc cacggcggga 1560
acagccgcca cggcagaaca tgtggtgcag cccaaagcta cctcagcagc gatggcagcc 1620
cccagcagcg acgaagccat gaatctcatt aaaaacaaaa gaaacatgac cggctacaag 1680
acccttccct acccgctgaa gaagcagaac ggcaagatca agtacgaatg caacgtttgc 1740
gccaagactt tcggccagct ctccaatctg aaggtccacc tgagagtgca cagtggagaa 1800
cggcctttca aatgtcagac ttgcaacaag ggctttactc agctcgccca cctgcagaaa 1860
cactacctgg tacacacggg agaaaagcca catgaatgcc aggtctgcca caagagattt 1920
agcagcacca gcaatctcaa gacccacctg cgactccatt ctggagagaa accataccaa 1980
tgcaaggtgt gccctgccaa gttcacccag tttgtgcacc tgaaactgca caagcgtctg 2040
cacacccggg agcggcccca caagtgctcc cagtgccaca agaactacat ccatctctgt 2100
agcctcaagg ttcacctgaa agggaactgc gctgcggccc cggcgcctgg gctgcccttg 2160
gaagatctga cccgaatcaa tgaagaaatc gagaagtttg acatcagtga caatgctgac 2220
cggctcgagg acgtggagga tgacatcagt gtgatctctg tagtggagaa ggaaattctg 2280
gccgtggtca gaaaagagaa agaagaaact ggcctgaaag tgtctttgca aagaaacatg 2340
gggaatggac tcctctcctc agggtgcagc ctttatgagt catcagatct acccctcatg 2400
aagttgcctc ccagcaaccc actacctctg gtacctgtaa aggtcaaaca agaaacagtt 2460
gaaccaatgg atccttaa 2478
<210> 15
<211> 825
<212> PRT
<213> 智人
<220>
<221> MISC_FEATURE
<222> (1)..(825)
<223> PRDM1人蛋白>NP_001189.2 PR结构域锌指蛋白1同种型1 [智人]
<400> 15
Met Leu Asp Ile Cys Leu Glu Lys Arg Val Gly Thr Thr Leu Ala Ala
1 5 10 15
Pro Lys Cys Asn Ser Ser Thr Val Arg Phe Gln Gly Leu Ala Glu Gly
20 25 30
Thr Lys Gly Thr Met Lys Met Asp Met Glu Asp Ala Asp Met Thr Leu
35 40 45
Trp Thr Glu Ala Glu Phe Glu Glu Lys Cys Thr Tyr Ile Val Asn Asp
50 55 60
His Pro Trp Asp Ser Gly Ala Asp Gly Gly Thr Ser Val Gln Ala Glu
65 70 75 80
Ala Ser Leu Pro Arg Asn Leu Leu Phe Lys Tyr Ala Thr Asn Ser Glu
85 90 95
Glu Val Ile Gly Val Met Ser Lys Glu Tyr Ile Pro Lys Gly Thr Arg
100 105 110
Phe Gly Pro Leu Ile Gly Glu Ile Tyr Thr Asn Asp Thr Val Pro Lys
115 120 125
Asn Ala Asn Arg Lys Tyr Phe Trp Arg Ile Tyr Ser Arg Gly Glu Leu
130 135 140
His His Phe Ile Asp Gly Phe Asn Glu Glu Lys Ser Asn Trp Met Arg
145 150 155 160
Tyr Val Asn Pro Ala His Ser Pro Arg Glu Gln Asn Leu Ala Ala Cys
165 170 175
Gln Asn Gly Met Asn Ile Tyr Phe Tyr Thr Ile Lys Pro Ile Pro Ala
180 185 190
Asn Gln Glu Leu Leu Val Trp Tyr Cys Arg Asp Phe Ala Glu Arg Leu
195 200 205
His Tyr Pro Tyr Pro Gly Glu Leu Thr Met Met Asn Leu Thr Gln Thr
210 215 220
Gln Ser Ser Leu Lys Gln Pro Ser Thr Glu Lys Asn Glu Leu Cys Pro
225 230 235 240
Lys Asn Val Pro Lys Arg Glu Tyr Ser Val Lys Glu Ile Leu Lys Leu
245 250 255
Asp Ser Asn Pro Ser Lys Gly Lys Asp Leu Tyr Arg Ser Asn Ile Ser
260 265 270
Pro Leu Thr Ser Glu Lys Asp Leu Asp Asp Phe Arg Arg Arg Gly Ser
275 280 285
Pro Glu Met Pro Phe Tyr Pro Arg Val Val Tyr Pro Ile Arg Ala Pro
290 295 300
Leu Pro Glu Asp Phe Leu Lys Ala Ser Leu Ala Tyr Gly Ile Glu Arg
305 310 315 320
Pro Thr Tyr Ile Thr Arg Ser Pro Ile Pro Ser Ser Thr Thr Pro Ser
325 330 335
Pro Ser Ala Arg Ser Ser Pro Asp Gln Ser Leu Lys Ser Ser Ser Pro
340 345 350
His Ser Ser Pro Gly Asn Thr Val Ser Pro Val Gly Pro Gly Ser Gln
355 360 365
Glu His Arg Asp Ser Tyr Ala Tyr Leu Asn Ala Ser Tyr Gly Thr Glu
370 375 380
Gly Leu Gly Ser Tyr Pro Gly Tyr Ala Pro Leu Pro His Leu Pro Pro
385 390 395 400
Ala Phe Ile Pro Ser Tyr Asn Ala His Tyr Pro Lys Phe Leu Leu Pro
405 410 415
Pro Tyr Gly Met Asn Cys Asn Gly Leu Ser Ala Val Ser Ser Met Asn
420 425 430
Gly Ile Asn Asn Phe Gly Leu Phe Pro Arg Leu Cys Pro Val Tyr Ser
435 440 445
Asn Leu Leu Gly Gly Gly Ser Leu Pro His Pro Met Leu Asn Pro Thr
450 455 460
Ser Leu Pro Ser Ser Leu Pro Ser Asp Gly Ala Arg Arg Leu Leu Gln
465 470 475 480
Pro Glu His Pro Arg Glu Val Leu Val Pro Ala Pro His Ser Ala Phe
485 490 495
Ser Phe Thr Gly Ala Ala Ala Ser Met Lys Asp Lys Ala Cys Ser Pro
500 505 510
Thr Ser Gly Ser Pro Thr Ala Gly Thr Ala Ala Thr Ala Glu His Val
515 520 525
Val Gln Pro Lys Ala Thr Ser Ala Ala Met Ala Ala Pro Ser Ser Asp
530 535 540
Glu Ala Met Asn Leu Ile Lys Asn Lys Arg Asn Met Thr Gly Tyr Lys
545 550 555 560
Thr Leu Pro Tyr Pro Leu Lys Lys Gln Asn Gly Lys Ile Lys Tyr Glu
565 570 575
Cys Asn Val Cys Ala Lys Thr Phe Gly Gln Leu Ser Asn Leu Lys Val
580 585 590
His Leu Arg Val His Ser Gly Glu Arg Pro Phe Lys Cys Gln Thr Cys
595 600 605
Asn Lys Gly Phe Thr Gln Leu Ala His Leu Gln Lys His Tyr Leu Val
610 615 620
His Thr Gly Glu Lys Pro His Glu Cys Gln Val Cys His Lys Arg Phe
625 630 635 640
Ser Ser Thr Ser Asn Leu Lys Thr His Leu Arg Leu His Ser Gly Glu
645 650 655
Lys Pro Tyr Gln Cys Lys Val Cys Pro Ala Lys Phe Thr Gln Phe Val
660 665 670
His Leu Lys Leu His Lys Arg Leu His Thr Arg Glu Arg Pro His Lys
675 680 685
Cys Ser Gln Cys His Lys Asn Tyr Ile His Leu Cys Ser Leu Lys Val
690 695 700
His Leu Lys Gly Asn Cys Ala Ala Ala Pro Ala Pro Gly Leu Pro Leu
705 710 715 720
Glu Asp Leu Thr Arg Ile Asn Glu Glu Ile Glu Lys Phe Asp Ile Ser
725 730 735
Asp Asn Ala Asp Arg Leu Glu Asp Val Glu Asp Asp Ile Ser Val Ile
740 745 750
Ser Val Val Glu Lys Glu Ile Leu Ala Val Val Arg Lys Glu Lys Glu
755 760 765
Glu Thr Gly Leu Lys Val Ser Leu Gln Arg Asn Met Gly Asn Gly Leu
770 775 780
Leu Ser Ser Gly Cys Ser Leu Tyr Glu Ser Ser Asp Leu Pro Leu Met
785 790 795 800
Lys Leu Pro Pro Ser Asn Pro Leu Pro Leu Val Pro Val Lys Val Lys
805 810 815
Gln Glu Thr Val Glu Pro Met Asp Pro
820 825
<210> 16
<211> 5221
<212> DNA
<213> 小家鼠
<220>
<221> misc_feature
<222> (1)..(5221)
<223> PRDM1小鼠- 完整序列>NM_007548.4小家鼠含PR结构域1,
具有ZNF结构域(Prdm1), mRNA
<400> 16
gggggaagag tagtcagtcg ctcgctcact cgctcgctcg cacagacact gctgcagtga 60
cactcggccc tccagtgtcg cggagacgca agagcagcgc gcagcacctg tccgcccgga 120
gcgagcccgg cccgcggccg tagaaaagga gggaccgccg aggtgcgcgt cagtactgct 180
cagcccggca gggacgcggg aggatgtgga ctgggtggac atgagagagg cttatctcag 240
atgttggatc ttctcttgga aaaacgtgtg ggtacgacct tggctgcccc caagtctagc 300
tccggctccg tgaagtttca aggactggca gagactggga tcatgaaaat ggacatggag 360
gacgctgata tgactttgtg gacagaggcc gagtttgaag agaagtgtac atacatagtg 420
aacgaccacc cctgggattc cggcgctgac gggggtactt ctgttcaagc cgaggcatcc 480
ttaccaagga acctgctttt caagtatgct gccaacaaca gcaaagaggt tattggcgtg 540
gtaagtaagg agtacatacc gaagggaaca cgctttggac ccctcatcgg tgaagtctac 600
actaatgaca cagttcccaa gaatgccaac aggaagtatt tttggcggat ctattccaga 660
gaggagttcc accacttcat tgatggcttt aatgaggaga aaagcaactg gatgcgctac 720
gtgaatccag ctcactctgc ccgggagcaa aacctggctg cctgtcagaa cgggatgaac 780
atctacttct acactattaa gcctatccct gccaaccagg aacttcttgt gtggtattgt 840
cgggactttg cggagaggct ccactaccct tatcctggag agctcacagt gataaatctc 900
acacaaacgg aaagcaaccc aaagcaatac agtagtgaga aaaatgaact ctacccaaag 960
agtgtcccca agagagagta cagcgtgaaa gaaattctaa aactggactc caatccctcc 1020
aaaaggaagg acatctaccg ttccaacatt tcacccttca ctttagaaaa ggacatggat 1080
ggctttcgga aaaatgggag ccccgacatg cccttctacc ctcgggtggt ttatcctatc 1140
cgggcacctc tgccagaaga ctttttgaaa gcgtccctgg cctatgggat ggagagaccc 1200
acctacataa ctcacagtcc ccttccgtct tccacaactc caagtccccc tgcgagcagc 1260
agcccggagc agagccttaa gagctccagc ccccacagca gcccgggaaa cacggtgtca 1320
cccctggcgc caggcctccc agaacaccgg gactcctact cctacttgaa tgtttcctat 1380
ggttccgagg gcctgggctc ctaccctggc tatgcacctg ccccccacct cccaccagct 1440
ttcattcctt cttacaatgc tcactacccc aagttcctgt tgccaccgta cggcattagt 1500
tccaatggct tgagcaccat gaacaacatc aatggtatca acaacttcag cctcttccct 1560
aggttgtatc ccgtctacag taacctcctt agtggcagca gcctgcctca tcccatgctc 1620
aatccagctt ccctaccgag ttccctgcct accgatggag cccggaggct gcttccaccg 1680
gagcacccca aagaggtgct tatcccagca ccccacagtg ccttctccct taccggggct 1740
gcagccagca tgaaggacga gagtagtccc cccagcggat ctccaacggc gggaactgca 1800
gccacgtcag aacacgtggt acaacccaaa gctacctcat cagtgatggc ggcccccagc 1860
actgacggag ccatgaatct cattaaaaac aaacgaaaca tgactggtta caagactctt 1920
ccttaccctc tgaagaaaca gaatggcaag atcaagtatg agtgcaatgt ctgtgccaag 1980
acgttcggtc agctctccaa cctgaaggtc cacctgagag tgcacagtgg agaacggcct 2040
ttcaagtgcc agacctgcaa caagggtttt actcagctcg cccacctgca gaaacactac 2100
ttggtacaca caggagagaa gccacatgag tgccaggtct gccacaagag atttagcagc 2160
acaagcaatc tcaagaccca ccttcgattg cattctggag aaaaacctta ccaatgtaag 2220
gtgtgccctg ccaagtttac gcaatttgtg cacctgaagc tgcacaagcg actgcatacc 2280
cgggagcggc ctcacaagtg tgcccagtgt cacaagagct acatccatct ctgcagcctc 2340
aaggtccacc tgaagggcaa ctgccctgcg ggcccagctg ctgggctgcc tttggaggat 2400
ctgacccgaa tcaatgaaga aattgagagg ttcgacatca gcgacaatgc agaccgtctt 2460
gaggacatgg aggacagtgt cgatgtgacc tccatggtgg agaaggagat tctagctgtg 2520
gtcagaaaag agaaagaaga aaccagtctg aaagtgtctt tgcaaagaaa catggggaac 2580
ggcctcctct cctcagggtg cagcctctat gagtcatcgg acctgtccct catgaagttg 2640
cctcacagca acccactacc tctggtgcct gtaaaggtca aacaagaaac agttgaaccg 2700
atggatcctt aagattttca gaaaataagt gtttcgtgtt gcttcttagg gtatggcttg 2760
gtgaatcagg gtgcctttag caaattgctt gtacatgact ccagatctgc aaagctccgc 2820
tggcaccggg tgcttccctg cacctctctg gaattaaaga aggactccaa tgttaccaaa 2880
atctcagggc ataaatgagg caaagactca ctatatatac atatatacat atatacatat 2940
tataaatata tatatactta tttacagcca tgtctatata tttgaacctg tgtattttga 3000
atatttgtgt ggatatgttt gcatagcgcc ttcctattac taaaactatt gcctagccat 3060
aattattttt tcaatgataa ttcttcataa tttattatac agtttatctt tcaaaaagca 3120
ataattaaag aagtttacaa tgactggaaa gattctttgt aatttgagta taaatgttgt 3180
atctttgtcc tgtggccatt ctttgtagat aatttctgca catctgttta aatgcctgag 3240
acttagaaga tagctctgtg atttcaggca acctttctct atgataatgc tttaaaatga 3300
ggttttgata ttgccaaagt catgtggttg gtgtgttaac tcagaagatc acacaatctg 3360
agtgacattc tctaagttgg ggatacatgt gcagaattgc tcagcaataa tttgagggga 3420
aggaagaaga aaaatatttt atgtttcaga atgatggttt ggttttcctc ctcctagtca 3480
caattttacc aaacagtgac aggaaggctt tgccaacctg tctcccaatg tcacatgacc 3540
attctgagtg gccatatgac tttggcatcc ctgggtgtta tctgaaaatg tgaagaagat 3600
aaaaaagccg tgttcagaag atctgtcgta aagcacagat gttgtgtgtg tgtgtgtgtg 3660
ggttgggggg tttgagtctg gctgtcattt tgctgttggc ttgtttttgt ttttttaata 3720
tcaaaattgc acaaagctgg tgccctacca agaaggattt gatatagaaa ggctcaggcc 3780
acacttaaaa tacaagcaag caaagagaac agaaaaaaat aaaagtaaaa acgggtattc 3840
ttatcatctt aggttaagcg ggtaatgaac attcctgtcc ccaacgcatc aactgtattg 3900
tatctgtaaa actcagcttt tctcagtatt tgtgtttttg cattgtataa ttaacttaat 3960
taaagatgaa agggcattgc aaaagtgttc aacaattacc tcattgagtg tatccagtag 4020
gagtgcagga attaatgtcg tatctcatga gttgctaccc agctgagcgt gtgtgcttcc 4080
aaatggtagg ctgggtggtt cggtcctgta ttctcctaag cccaaaggtt acctgttggt 4140
gttcaaggtg taataaagaa tgctgtatat ttatgaacct atttatacca gtataccatg 4200
tgtatatatg atatatttat aaccacttaa attgtgagcc aagccatgta aaagaaccta 4260
tttttcctaa gagcaaaaag aatctctctg aagttttgct taaaactcca tgacctcgct 4320
atgactttgg tgcttgggca ccaccctgcc tactaccaga gagcagagca cctcagtgca 4380
gaggtgaggg tgtgtagcat cttgggatgg atagaaacac cacaccatcc agtcgcattt 4440
gatggccttg ctacatgtgt gtcagttggg tcacagaata aaaatcattt ttctatttct 4500
gctctcctct tcctcttcct cttcttcttc ctctttctct ccctcctcta gaaccctgac 4560
tcatgctcac tgctcagtct gatgcttacc ttagagtttt gtatatatag atcaacttac 4620
aaagagggaa aacttcagat cctctggggg aaacccaaga gccttactga cctgttgctg 4680
tgactagcta gatgggtttc tctttacctt ccaaggatca aaaccagaga ttccacacat 4740
gctagcaagc aagcaagctg tcactgggct gcagccccaa caagactgac atttctggat 4800
gcatctgtat ttgagaaaaa tactcactta attgtaggtt aaccaaagca tgacctgaca 4860
ttgacaccaa atacaaatac gatttctttg cagtgaactt gggttgtttt cctcctgtgc 4920
ttttcttgtg ttgggaggat ttttacaagg acaattgctt ttcttgccat ctgtcttcct 4980
cttaggcctc ttacatgaga gtgttgagcc cacaatgaac agtggttggt tggttggttg 5040
gttggtttgt ttgtttgttt tttcctcaga gttaaaactg accaaagtta ttggcttttt 5100
actttgctag aacaacaaac tatcttatgt ttacgtactg gtttacattg ttatttatgt 5160
gcaaattgtc aaaatgtaaa ttaaaatata aatgttcatg ctttaaaaaa aaaaaaaaaa 5220
a 5221
<210> 17
<211> 2472
<212> DNA
<213> 小家鼠
<220>
<221> misc_feature
<222> (1)..(2472)
<223> PRDM1小鼠CDS >NM_007548.4:241-2712小家鼠含PR结构域1,
具有ZNF结构域(Prdm1), mRNA
<400> 17
atgttggatc ttctcttgga aaaacgtgtg ggtacgacct tggctgcccc caagtctagc 60
tccggctccg tgaagtttca aggactggca gagactggga tcatgaaaat ggacatggag 120
gacgctgata tgactttgtg gacagaggcc gagtttgaag agaagtgtac atacatagtg 180
aacgaccacc cctgggattc cggcgctgac gggggtactt ctgttcaagc cgaggcatcc 240
ttaccaagga acctgctttt caagtatgct gccaacaaca gcaaagaggt tattggcgtg 300
gtaagtaagg agtacatacc gaagggaaca cgctttggac ccctcatcgg tgaagtctac 360
actaatgaca cagttcccaa gaatgccaac aggaagtatt tttggcggat ctattccaga 420
gaggagttcc accacttcat tgatggcttt aatgaggaga aaagcaactg gatgcgctac 480
gtgaatccag ctcactctgc ccgggagcaa aacctggctg cctgtcagaa cgggatgaac 540
atctacttct acactattaa gcctatccct gccaaccagg aacttcttgt gtggtattgt 600
cgggactttg cggagaggct ccactaccct tatcctggag agctcacagt gataaatctc 660
acacaaacgg aaagcaaccc aaagcaatac agtagtgaga aaaatgaact ctacccaaag 720
agtgtcccca agagagagta cagcgtgaaa gaaattctaa aactggactc caatccctcc 780
aaaaggaagg acatctaccg ttccaacatt tcacccttca ctttagaaaa ggacatggat 840
ggctttcgga aaaatgggag ccccgacatg cccttctacc ctcgggtggt ttatcctatc 900
cgggcacctc tgccagaaga ctttttgaaa gcgtccctgg cctatgggat ggagagaccc 960
acctacataa ctcacagtcc ccttccgtct tccacaactc caagtccccc tgcgagcagc 1020
agcccggagc agagccttaa gagctccagc ccccacagca gcccgggaaa cacggtgtca 1080
cccctggcgc caggcctccc agaacaccgg gactcctact cctacttgaa tgtttcctat 1140
ggttccgagg gcctgggctc ctaccctggc tatgcacctg ccccccacct cccaccagct 1200
ttcattcctt cttacaatgc tcactacccc aagttcctgt tgccaccgta cggcattagt 1260
tccaatggct tgagcaccat gaacaacatc aatggtatca acaacttcag cctcttccct 1320
aggttgtatc ccgtctacag taacctcctt agtggcagca gcctgcctca tcccatgctc 1380
aatccagctt ccctaccgag ttccctgcct accgatggag cccggaggct gcttccaccg 1440
gagcacccca aagaggtgct tatcccagca ccccacagtg ccttctccct taccggggct 1500
gcagccagca tgaaggacga gagtagtccc cccagcggat ctccaacggc gggaactgca 1560
gccacgtcag aacacgtggt acaacccaaa gctacctcat cagtgatggc ggcccccagc 1620
actgacggag ccatgaatct cattaaaaac aaacgaaaca tgactggtta caagactctt 1680
ccttaccctc tgaagaaaca gaatggcaag atcaagtatg agtgcaatgt ctgtgccaag 1740
acgttcggtc agctctccaa cctgaaggtc cacctgagag tgcacagtgg agaacggcct 1800
ttcaagtgcc agacctgcaa caagggtttt actcagctcg cccacctgca gaaacactac 1860
ttggtacaca caggagagaa gccacatgag tgccaggtct gccacaagag atttagcagc 1920
acaagcaatc tcaagaccca ccttcgattg cattctggag aaaaacctta ccaatgtaag 1980
gtgtgccctg ccaagtttac gcaatttgtg cacctgaagc tgcacaagcg actgcatacc 2040
cgggagcggc ctcacaagtg tgcccagtgt cacaagagct acatccatct ctgcagcctc 2100
aaggtccacc tgaagggcaa ctgccctgcg ggcccagctg ctgggctgcc tttggaggat 2160
ctgacccgaa tcaatgaaga aattgagagg ttcgacatca gcgacaatgc agaccgtctt 2220
gaggacatgg aggacagtgt cgatgtgacc tccatggtgg agaaggagat tctagctgtg 2280
gtcagaaaag agaaagaaga aaccagtctg aaagtgtctt tgcaaagaaa catggggaac 2340
ggcctcctct cctcagggtg cagcctctat gagtcatcgg acctgtccct catgaagttg 2400
cctcacagca acccactacc tctggtgcct gtaaaggtca aacaagaaac agttgaaccg 2460
atggatcctt aa 2472
<210> 18
<211> 823
<212> PRT
<213> 小家鼠
<220>
<221> MISC_FEATURE
<222> (1)..(823)
<223> PRDM1小鼠蛋白NP_031574.2 PR结构域锌指蛋白1 [小家鼠]
<400> 18
Met Leu Asp Leu Leu Leu Glu Lys Arg Val Gly Thr Thr Leu Ala Ala
1 5 10 15
Pro Lys Ser Ser Ser Gly Ser Val Lys Phe Gln Gly Leu Ala Glu Thr
20 25 30
Gly Ile Met Lys Met Asp Met Glu Asp Ala Asp Met Thr Leu Trp Thr
35 40 45
Glu Ala Glu Phe Glu Glu Lys Cys Thr Tyr Ile Val Asn Asp His Pro
50 55 60
Trp Asp Ser Gly Ala Asp Gly Gly Thr Ser Val Gln Ala Glu Ala Ser
65 70 75 80
Leu Pro Arg Asn Leu Leu Phe Lys Tyr Ala Ala Asn Asn Ser Lys Glu
85 90 95
Val Ile Gly Val Val Ser Lys Glu Tyr Ile Pro Lys Gly Thr Arg Phe
100 105 110
Gly Pro Leu Ile Gly Glu Val Tyr Thr Asn Asp Thr Val Pro Lys Asn
115 120 125
Ala Asn Arg Lys Tyr Phe Trp Arg Ile Tyr Ser Arg Glu Glu Phe His
130 135 140
His Phe Ile Asp Gly Phe Asn Glu Glu Lys Ser Asn Trp Met Arg Tyr
145 150 155 160
Val Asn Pro Ala His Ser Ala Arg Glu Gln Asn Leu Ala Ala Cys Gln
165 170 175
Asn Gly Met Asn Ile Tyr Phe Tyr Thr Ile Lys Pro Ile Pro Ala Asn
180 185 190
Gln Glu Leu Leu Val Trp Tyr Cys Arg Asp Phe Ala Glu Arg Leu His
195 200 205
Tyr Pro Tyr Pro Gly Glu Leu Thr Val Ile Asn Leu Thr Gln Thr Glu
210 215 220
Ser Asn Pro Lys Gln Tyr Ser Ser Glu Lys Asn Glu Leu Tyr Pro Lys
225 230 235 240
Ser Val Pro Lys Arg Glu Tyr Ser Val Lys Glu Ile Leu Lys Leu Asp
245 250 255
Ser Asn Pro Ser Lys Arg Lys Asp Ile Tyr Arg Ser Asn Ile Ser Pro
260 265 270
Phe Thr Leu Glu Lys Asp Met Asp Gly Phe Arg Lys Asn Gly Ser Pro
275 280 285
Asp Met Pro Phe Tyr Pro Arg Val Val Tyr Pro Ile Arg Ala Pro Leu
290 295 300
Pro Glu Asp Phe Leu Lys Ala Ser Leu Ala Tyr Gly Met Glu Arg Pro
305 310 315 320
Thr Tyr Ile Thr His Ser Pro Leu Pro Ser Ser Thr Thr Pro Ser Pro
325 330 335
Pro Ala Ser Ser Ser Pro Glu Gln Ser Leu Lys Ser Ser Ser Pro His
340 345 350
Ser Ser Pro Gly Asn Thr Val Ser Pro Leu Ala Pro Gly Leu Pro Glu
355 360 365
His Arg Asp Ser Tyr Ser Tyr Leu Asn Val Ser Tyr Gly Ser Glu Gly
370 375 380
Leu Gly Ser Tyr Pro Gly Tyr Ala Pro Ala Pro His Leu Pro Pro Ala
385 390 395 400
Phe Ile Pro Ser Tyr Asn Ala His Tyr Pro Lys Phe Leu Leu Pro Pro
405 410 415
Tyr Gly Ile Ser Ser Asn Gly Leu Ser Thr Met Asn Asn Ile Asn Gly
420 425 430
Ile Asn Asn Phe Ser Leu Phe Pro Arg Leu Tyr Pro Val Tyr Ser Asn
435 440 445
Leu Leu Ser Gly Ser Ser Leu Pro His Pro Met Leu Asn Pro Ala Ser
450 455 460
Leu Pro Ser Ser Leu Pro Thr Asp Gly Ala Arg Arg Leu Leu Pro Pro
465 470 475 480
Glu His Pro Lys Glu Val Leu Ile Pro Ala Pro His Ser Ala Phe Ser
485 490 495
Leu Thr Gly Ala Ala Ala Ser Met Lys Asp Glu Ser Ser Pro Pro Ser
500 505 510
Gly Ser Pro Thr Ala Gly Thr Ala Ala Thr Ser Glu His Val Val Gln
515 520 525
Pro Lys Ala Thr Ser Ser Val Met Ala Ala Pro Ser Thr Asp Gly Ala
530 535 540
Met Asn Leu Ile Lys Asn Lys Arg Asn Met Thr Gly Tyr Lys Thr Leu
545 550 555 560
Pro Tyr Pro Leu Lys Lys Gln Asn Gly Lys Ile Lys Tyr Glu Cys Asn
565 570 575
Val Cys Ala Lys Thr Phe Gly Gln Leu Ser Asn Leu Lys Val His Leu
580 585 590
Arg Val His Ser Gly Glu Arg Pro Phe Lys Cys Gln Thr Cys Asn Lys
595 600 605
Gly Phe Thr Gln Leu Ala His Leu Gln Lys His Tyr Leu Val His Thr
610 615 620
Gly Glu Lys Pro His Glu Cys Gln Val Cys His Lys Arg Phe Ser Ser
625 630 635 640
Thr Ser Asn Leu Lys Thr His Leu Arg Leu His Ser Gly Glu Lys Pro
645 650 655
Tyr Gln Cys Lys Val Cys Pro Ala Lys Phe Thr Gln Phe Val His Leu
660 665 670
Lys Leu His Lys Arg Leu His Thr Arg Glu Arg Pro His Lys Cys Ala
675 680 685
Gln Cys His Lys Ser Tyr Ile His Leu Cys Ser Leu Lys Val His Leu
690 695 700
Lys Gly Asn Cys Pro Ala Gly Pro Ala Ala Gly Leu Pro Leu Glu Asp
705 710 715 720
Leu Thr Arg Ile Asn Glu Glu Ile Glu Arg Phe Asp Ile Ser Asp Asn
725 730 735
Ala Asp Arg Leu Glu Asp Met Glu Asp Ser Val Asp Val Thr Ser Met
740 745 750
Val Glu Lys Glu Ile Leu Ala Val Val Arg Lys Glu Lys Glu Glu Thr
755 760 765
Ser Leu Lys Val Ser Leu Gln Arg Asn Met Gly Asn Gly Leu Leu Ser
770 775 780
Ser Gly Cys Ser Leu Tyr Glu Ser Ser Asp Leu Ser Leu Met Lys Leu
785 790 795 800
Pro His Ser Asn Pro Leu Pro Leu Val Pro Val Lys Val Lys Gln Glu
805 810 815
Thr Val Glu Pro Met Asp Pro
820
<210> 19
<211> 2257
<212> DNA
<213> 小家鼠
<220>
<221> misc_feature
<222> (1)..(2257)
<223> IRF2人- 完整序列>NM_002199.4智人干扰素调节因子2 (IRF2), mRNA
<400> 19
ctctccttgt tttgctttcg atctggactg ttctcaggca agccggggag taacttttag 60
ttttgctcct gcgattattc aactgacggg ctttcatttc catttcacat accctagcaa 120
cacttatacc ttgcggaatt gtattggtag cgtgaaaaaa gcacactgag agggcaccat 180
gccggtggaa aggatgcgca tgcgcccgtg gctggaggag cagataaact ccaacacgat 240
cccggggctc aagtggctta acaaggaaaa gaagattttt cagatcccct ggatgcatgc 300
ggctagacat gggtgggatg tggaaaaaga tgcaccactc tttagaaact gggcaatcca 360
tacaggaaag catcaaccag gagtagataa acctgatccc aaaacatgga aggcgaattt 420
cagatgcgcc atgaattcct tgcctgatat tgaagaagtc aaggataaaa gcataaagaa 480
aggaaataat gccttcaggg tctaccgaat gctgccccta tcagaacggc cttctaagaa 540
aggaaagaaa ccaaagacag aaaaagaaga caaagttaag cacatcaagc aagaaccagt 600
tgagtcatct ctggggctta gtaatggagt aagtgatctt tctcctgagt atgcggtcct 660
gacttcaact ataaaaaatg aagtggatag tacggtgaac atcatagttg taggacagtc 720
ccatctggac agcaacattg agaatcaaga gattgtcacc aatccgccag acatttgcca 780
agttgtagag gtgaccactg agagcgacga gcagccggtc agcatgagcg agctctaccc 840
tctgcagatc tcccccgtgt cttcctatgc agaaagcgaa acgactgata gtgtgcccag 900
cgatgaagag agtgccgagg ggcggccaca ctggcggaag aggaatattg aaggcaaaca 960
gtacctcagc aacatgggga ctcgaggctc ctacctgctg cccggcatgg cgtccttcgt 1020
cacttccaac aaaccggacc tccaggtcac catcaaagag gagagcaatc cggtgcctta 1080
caacagctcc tggccccctt ttcaagacct ccccctttct tcctccatga ccccagcatc 1140
cagcagcagt cggccagacc gggagacccg ggccagcgtc atcaagaaaa catcggatat 1200
cacccaggcc cgcgtcaaga gctgttaagc ctctgactct ccgcggtggt tgttggggct 1260
tcttggcttt gttttgttgt ttgtttgtat tttatttttt tctctctgac acctatttta 1320
gacaaatcta agggaaaaag ccttgacaat agaacattga ttgctgtgtc caactccagt 1380
actggagctt ctctttaact caggactcca gcccattggt agacgtgtgt ttctagagcc 1440
tgctggatct cccagggcta ctcactcaag ttcaaggacc aacaagggca gtggaggtgc 1500
tgcattgcct gcggtcaagg ccagcaaggt ggagtggatg cctcagaacg gacgagataa 1560
tgtgaactag ctggaatttt ttattcttgt gaatatgtac ataggcagca ctagcgacat 1620
tgcagtctgc ttctgcacct tatcttaaag cacttacaga taggccttct tgtgatcttg 1680
ctctatctca cagcacactc agcaccccct tctctgccca ttccccagcc tctcttccta 1740
tcccatccca tcccatccca tcccatccca tcccatcccg ctcttttcct acttttcctt 1800
ccctcaaagc ttccattcca catccggagg agaagaagga aatgaatttc tctacagatg 1860
tcccattttc agactgcttt aaaaaaaatc cttctaatct gctatgcttg aatgccacgc 1920
ggtacaaagg aaaaagtatc atggaaatat tatgcaaatt cccagatttg aagacaaaaa 1980
tactctaatt ctaaccagag caagcttttt tattttttat acaggggaat attttattca 2040
aggtaaaatt ctaaataaaa tataattgtt ttttatcttt tctacagcaa atttataatt 2100
ttaagattcc ttttcttgtt tatcagcagt tgttattaca tccttgtggc acattttttt 2160
ttttaatttt gtaaaggtga aaaaagcttt tatgagctca tgtagcaatc agattttcct 2220
gtggattgat aataaatgaa tatgatatat agttaaa 2257
<210> 20
<211> 1050
<212> DNA
<213> 智人
<220>
<221> misc_feature
<222> (1)..(1050)
<223> IRF2人CDS >NM_002199.4:179-1228智人干扰素调节因子2 (IRF2), mRNA
<400> 20
atgccggtgg aaaggatgcg catgcgcccg tggctggagg agcagataaa ctccaacacg 60
atcccggggc tcaagtggct taacaaggaa aagaagattt ttcagatccc ctggatgcat 120
gcggctagac atgggtggga tgtggaaaaa gatgcaccac tctttagaaa ctgggcaatc 180
catacaggaa agcatcaacc aggagtagat aaacctgatc ccaaaacatg gaaggcgaat 240
ttcagatgcg ccatgaattc cttgcctgat attgaagaag tcaaggataa aagcataaag 300
aaaggaaata atgccttcag ggtctaccga atgctgcccc tatcagaacg gccttctaag 360
aaaggaaaga aaccaaagac agaaaaagaa gacaaagtta agcacatcaa gcaagaacca 420
gttgagtcat ctctggggct tagtaatgga gtaagtgatc tttctcctga gtatgcggtc 480
ctgacttcaa ctataaaaaa tgaagtggat agtacggtga acatcatagt tgtaggacag 540
tcccatctgg acagcaacat tgagaatcaa gagattgtca ccaatccgcc agacatttgc 600
caagttgtag aggtgaccac tgagagcgac gagcagccgg tcagcatgag cgagctctac 660
cctctgcaga tctcccccgt gtcttcctat gcagaaagcg aaacgactga tagtgtgccc 720
agcgatgaag agagtgccga ggggcggcca cactggcgga agaggaatat tgaaggcaaa 780
cagtacctca gcaacatggg gactcgaggc tcctacctgc tgcccggcat ggcgtccttc 840
gtcacttcca acaaaccgga cctccaggtc accatcaaag aggagagcaa tccggtgcct 900
tacaacagct cctggccccc ttttcaagac ctcccccttt cttcctccat gaccccagca 960
tccagcagca gtcggccaga ccgggagacc cgggccagcg tcatcaagaa aacatcggat 1020
atcacccagg cccgcgtcaa gagctgttaa 1050
<210> 21
<211> 349
<212> PRT
<213> 智人
<220>
<221> MISC_FEATURE
<222> (1)..(349)
<223> IRF2人蛋白>NP_002190.2干扰素调节因子2 [智人]
<400> 21
Met Pro Val Glu Arg Met Arg Met Arg Pro Trp Leu Glu Glu Gln Ile
1 5 10 15
Asn Ser Asn Thr Ile Pro Gly Leu Lys Trp Leu Asn Lys Glu Lys Lys
20 25 30
Ile Phe Gln Ile Pro Trp Met His Ala Ala Arg His Gly Trp Asp Val
35 40 45
Glu Lys Asp Ala Pro Leu Phe Arg Asn Trp Ala Ile His Thr Gly Lys
50 55 60
His Gln Pro Gly Val Asp Lys Pro Asp Pro Lys Thr Trp Lys Ala Asn
65 70 75 80
Phe Arg Cys Ala Met Asn Ser Leu Pro Asp Ile Glu Glu Val Lys Asp
85 90 95
Lys Ser Ile Lys Lys Gly Asn Asn Ala Phe Arg Val Tyr Arg Met Leu
100 105 110
Pro Leu Ser Glu Arg Pro Ser Lys Lys Gly Lys Lys Pro Lys Thr Glu
115 120 125
Lys Glu Asp Lys Val Lys His Ile Lys Gln Glu Pro Val Glu Ser Ser
130 135 140
Leu Gly Leu Ser Asn Gly Val Ser Asp Leu Ser Pro Glu Tyr Ala Val
145 150 155 160
Leu Thr Ser Thr Ile Lys Asn Glu Val Asp Ser Thr Val Asn Ile Ile
165 170 175
Val Val Gly Gln Ser His Leu Asp Ser Asn Ile Glu Asn Gln Glu Ile
180 185 190
Val Thr Asn Pro Pro Asp Ile Cys Gln Val Val Glu Val Thr Thr Glu
195 200 205
Ser Asp Glu Gln Pro Val Ser Met Ser Glu Leu Tyr Pro Leu Gln Ile
210 215 220
Ser Pro Val Ser Ser Tyr Ala Glu Ser Glu Thr Thr Asp Ser Val Pro
225 230 235 240
Ser Asp Glu Glu Ser Ala Glu Gly Arg Pro His Trp Arg Lys Arg Asn
245 250 255
Ile Glu Gly Lys Gln Tyr Leu Ser Asn Met Gly Thr Arg Gly Ser Tyr
260 265 270
Leu Leu Pro Gly Met Ala Ser Phe Val Thr Ser Asn Lys Pro Asp Leu
275 280 285
Gln Val Thr Ile Lys Glu Glu Ser Asn Pro Val Pro Tyr Asn Ser Ser
290 295 300
Trp Pro Pro Phe Gln Asp Leu Pro Leu Ser Ser Ser Met Thr Pro Ala
305 310 315 320
Ser Ser Ser Ser Arg Pro Asp Arg Glu Thr Arg Ala Ser Val Ile Lys
325 330 335
Lys Thr Ser Asp Ile Thr Gln Ala Arg Val Lys Ser Cys
340 345
<210> 22
<211> 2484
<212> DNA
<213> 小家鼠
<220>
<221> misc_feature
<222> (1)..(2484)
<223> IRF2小鼠- 完整序列>NM_008391.4小家鼠干扰素调节因子2 (Irf2),
转录物变体1, mRNA
<400> 22
gctcctcgca gtttcctctc cttgttttgc tttcgatctg gactgttctc aggcaagccg 60
gggactaact tttagttttg ctcctgcgat tattcaactg acgggctttc atttccattt 120
tacacaccct aacaacactc acaccttgcg ggattgtatt ggtagcgtgg aaaaaaaaaa 180
gcacattgag agggtaccat gccggtggaa cggatgcgaa tgcgcccgtg gctggaggag 240
cagataaatt ccaatacgat accagggcta aagtggctga acaaggagaa gaagattttc 300
cagatcccct ggatgcatgc ggctcggcac ggatgggacg tggaaaagga tgctccgctc 360
ttcagaaact gggcgatcca tacaggaaag catcaaccag gaatagataa accagatcca 420
aaaacatgga aagcaaattt tcgatgtgcc atgaattccc tgcccgacat tgaggaagtg 480
aaggacagaa gcataaagaa aggaaacaac gccttcagag tctaccggat gctgccctta 540
tccgaacgac cttccaagaa aggaaagaaa ccaaagacag aaaaagaaga gagagttaag 600
cacatcaagc aagaaccagt tgagtcatct ttggggctta gtaatggagt aagtggcttt 660
tctcctgagt atgcggtcct gacttcagct ataaaaaatg aagtggatag tacggtgaac 720
atcatagttg taggacagtc ccatctggac agcaacattg aagatcaaga gatcgtcact 780
aacccgccag acatctgcca ggttgtagaa gtgaccactg agagtgatga ccagccagtc 840
agcatgagtg agctctaccc tctacagatt tctcctgtgt cttcctacgc agaaagcgaa 900
actaccgaca gtgtggccag tgatgaagag aacgcagagg ggagaccaca ctggaggaag 960
aggagcatcg aaggcaagca gtacctcagc aacatgggga cacggaacac ctatctgctg 1020
cccagcatgg cgacctttgt cacctccaac aagccagatc tgcaggtcac catcaaagag 1080
gatagctgtc cgatgcctta caacagctcc tggcccccat ttacagacct tccccttcct 1140
gccccagtga cccccacgcc cagcagcagt cggccagacc gggagacccg ggccagtgtc 1200
atcaagaaga catctgatat cacccaggcc cgtgtcaaga gctgttaagc ctttgactct 1260
ccctggtggt tgttgggatt tcttagcttt gtgttgttct ttgtttgtat tatattattt 1320
tttttctcta tgatacctat cttagacaca tctaagggag aaagccttga cgatagatta 1380
ttgattgctg tgtccaactc cagagctgga gcttcttctt aactcaggac tccagccccc 1440
cccccccctc ggtagatgcg tatctctaga acctgctgga tctgccaggg ctactccctc 1500
aagttcaagg accaacagcc acacgggcag tggaggtgct gcgttgccta cggtcaaggc 1560
cagcatggtg gagtggatgc ctcagaacgg aggagaaaat gtgaactagc tggaattttt 1620
ttattcttgt gaatatgtac atagggcagt acgagcaatg tcgcgggctg cttctgcacc 1680
ttatcttgaa gcacttacaa taggccttct tgtaatcttg ctctccttca cagcacactc 1740
ggcgacccct tctgtgtcca ctaccccact acccacccct ccctcctcaa cccctccatc 1800
ccggtcctct atgcgcccct tccccccaac caatcccatc acaacctctt acctatcctt 1860
tccctcccaa ccccttctat cccagcccac cacctacccc actcctcccc aactcctcca 1920
ttctagccca ttacccacgc ctctctcctc agcccagcct accccatccc accctgttcc 1980
tttcctccag tttcctctcc tcaaaggcaa ggctctacat cttggaggag gaggaggaga 2040
agaaaatgag tttcttcacc gctgtcccat tttaagactg cttgaataat aaaaaaaaaa 2100
tctttctaat ctgctatgct tgaatggcac gcggtacaaa ggaaaactgt catggaaata 2160
ttatgcaaat tcccagatct gaagacggaa aatactctaa ttctaaccag agcaagcttt 2220
tttatttttt tatacaaggg gaatatttta ttcaaggtaa aaaaattcta aataaaatat 2280
aattgttttt tatcttttct acagcaaatt tataatttta agattccttt tcctgttcat 2340
cagcagttgt tattacatcc cttgtggcac attttttttt taattttgta aaggtgaaaa 2400
aaaaactttt atgagctcat gtagcaatca aattatcctg tggattgata ataaatgaat 2460
atggtatata gttaaagatt ttaa 2484
<210> 23
<211> 1050
<212> DNA
<213> 小家鼠
<220>
<221> misc_feature
<222> (1)..(1050)
<400> 23
atgccggtgg aacggatgcg aatgcgcccg tggctggagg agcagataaa ttccaatacg 60
ataccagggc taaagtggct gaacaaggag aagaagattt tccagatccc ctggatgcat 120
gcggctcggc acggatggga cgtggaaaag gatgctccgc tcttcagaaa ctgggcgatc 180
catacaggaa agcatcaacc aggaatagat aaaccagatc caaaaacatg gaaagcaaat 240
tttcgatgtg ccatgaattc cctgcccgac attgaggaag tgaaggacag aagcataaag 300
aaaggaaaca acgccttcag agtctaccgg atgctgccct tatccgaacg accttccaag 360
aaaggaaaga aaccaaagac agaaaaagaa gagagagtta agcacatcaa gcaagaacca 420
gttgagtcat ctttggggct tagtaatgga gtaagtggct tttctcctga gtatgcggtc 480
ctgacttcag ctataaaaaa tgaagtggat agtacggtga acatcatagt tgtaggacag 540
tcccatctgg acagcaacat tgaagatcaa gagatcgtca ctaacccgcc agacatctgc 600
caggttgtag aagtgaccac tgagagtgat gaccagccag tcagcatgag tgagctctac 660
cctctacaga tttctcctgt gtcttcctac gcagaaagcg aaactaccga cagtgtggcc 720
agtgatgaag agaacgcaga ggggagacca cactggagga agaggagcat cgaaggcaag 780
cagtacctca gcaacatggg gacacggaac acctatctgc tgcccagcat ggcgaccttt 840
gtcacctcca acaagccaga tctgcaggtc accatcaaag aggatagctg tccgatgcct 900
tacaacagct cctggccccc atttacagac cttccccttc ctgccccagt gacccccacg 960
cccagcagca gtcggccaga ccgggagacc cgggccagtg tcatcaagaa gacatctgat 1020
atcacccagg cccgtgtcaa gagctgttaa 1050
<210> 24
<211> 349
<212> PRT
<213> 小家鼠
<220>
<221> MISC_FEATURE
<222> (1)..(349)
<223> IRF2小鼠蛋白>NP_032417.3干扰素调节因子2 [小家鼠]
<400> 24
Met Pro Val Glu Arg Met Arg Met Arg Pro Trp Leu Glu Glu Gln Ile
1 5 10 15
Asn Ser Asn Thr Ile Pro Gly Leu Lys Trp Leu Asn Lys Glu Lys Lys
20 25 30
Ile Phe Gln Ile Pro Trp Met His Ala Ala Arg His Gly Trp Asp Val
35 40 45
Glu Lys Asp Ala Pro Leu Phe Arg Asn Trp Ala Ile His Thr Gly Lys
50 55 60
His Gln Pro Gly Ile Asp Lys Pro Asp Pro Lys Thr Trp Lys Ala Asn
65 70 75 80
Phe Arg Cys Ala Met Asn Ser Leu Pro Asp Ile Glu Glu Val Lys Asp
85 90 95
Arg Ser Ile Lys Lys Gly Asn Asn Ala Phe Arg Val Tyr Arg Met Leu
100 105 110
Pro Leu Ser Glu Arg Pro Ser Lys Lys Gly Lys Lys Pro Lys Thr Glu
115 120 125
Lys Glu Glu Arg Val Lys His Ile Lys Gln Glu Pro Val Glu Ser Ser
130 135 140
Leu Gly Leu Ser Asn Gly Val Ser Gly Phe Ser Pro Glu Tyr Ala Val
145 150 155 160
Leu Thr Ser Ala Ile Lys Asn Glu Val Asp Ser Thr Val Asn Ile Ile
165 170 175
Val Val Gly Gln Ser His Leu Asp Ser Asn Ile Glu Asp Gln Glu Ile
180 185 190
Val Thr Asn Pro Pro Asp Ile Cys Gln Val Val Glu Val Thr Thr Glu
195 200 205
Ser Asp Asp Gln Pro Val Ser Met Ser Glu Leu Tyr Pro Leu Gln Ile
210 215 220
Ser Pro Val Ser Ser Tyr Ala Glu Ser Glu Thr Thr Asp Ser Val Ala
225 230 235 240
Ser Asp Glu Glu Asn Ala Glu Gly Arg Pro His Trp Arg Lys Arg Ser
245 250 255
Ile Glu Gly Lys Gln Tyr Leu Ser Asn Met Gly Thr Arg Asn Thr Tyr
260 265 270
Leu Leu Pro Ser Met Ala Thr Phe Val Thr Ser Asn Lys Pro Asp Leu
275 280 285
Gln Val Thr Ile Lys Glu Asp Ser Cys Pro Met Pro Tyr Asn Ser Ser
290 295 300
Trp Pro Pro Phe Thr Asp Leu Pro Leu Pro Ala Pro Val Thr Pro Thr
305 310 315 320
Pro Ser Ser Ser Arg Pro Asp Arg Glu Thr Arg Ala Ser Val Ile Lys
325 330 335
Lys Thr Ser Asp Ile Thr Gln Ala Arg Val Lys Ser Cys
340 345
<210> 25
<211> 6720
<212> DNA
<213> 智人
<220>
<221> misc_feature
<222> (1)..(6720)
<223> POU2F2人- 完整序列>NM_001247994.2智人POU
2类同源盒2 (POU2F2), 转录物变体4, mRNA
<400> 25
acagttgttc ccccagccct ggcggggcgg gcagcatggt tcactccagc atgggggctc 60
cagaaataag aatgtctaag cccctggagg ccgagaagca aggtctggac tccccatcag 120
agcacacaga caccgaaaga aatggaccag acactaatca tcagaacccc caaaataaga 180
cctccccatt ctccgtgtcc ccaactggcc ccagtacaaa gatcaaggct gaagacccca 240
gtggcgattc agccccagca gcacccctgc cccctcagcc ggcccagcct catctgcccc 300
aggcccaact catgttgacg ggcagccagc tagctgggga catacagcag ctcctccagc 360
tccagcagct ggtgcttgtg ccaggccacc acctccagcc acctgctcag ttcctgctac 420
cgcaggccca gcagagccag ccaggcctgc taccgacacc aaatctattc cagctacctc 480
agcaaaccca gggagctctt ctgacctccc agccccgggc cgggcttccc acacagcccc 540
ccaaatgctt ggagccacca tcccaccccg aggagcccag tgatctggag gagctggagc 600
aattcgcccg caccttcaag caacgccgca tcaagctggg cttcacgcag ggtgatgtgg 660
gcctggccat gggcaagctc tacggcaacg acttcagcca gacgaccatt tcccgcttcg 720
aggccctcaa cctgagcttc aagaacatgt gcaaactcaa gcccctcctg gagaagtggc 780
tcaacgatgc agagactatg tctgtggact caagcctgcc cagccccaac cagctgagca 840
gccccagcct gggtttcgac ggcctgcccg gccggagacg caagaagagg accagcatcg 900
agacaaacgt ccgcttcgcc ttagagaaga gttttctagc gaaccagaag cctacctcag 960
aggagatcct gctgatcgcc gagcagctgc acatggagaa ggaagtgatc cgcgtctggt 1020
tctgcaaccg gcgccagaag gagaaacgca tcaacccctg cagtgcggcc cccatgctgc 1080
ccagcccagg gaagccggcc agctacagcc cccatatggt cacaccccaa gggggcgcgg 1140
ggaccttacc gttgtcccaa gcttccagca gtctgagcac aacagcacaa accccagccc 1200
tcaaggcagc cactcggcta tcggcttgtc aggcctgaac cccagcacgg gccctggcct 1260
ctggtggaac cctgcccctt accagccttg atggcagcgg gaatctggtg ctgggggcag 1320
ccggtgcagc cccggggagc cctggcctgg tgacctcgcc gctcttcttg aatcatgctg 1380
ggctgcccct gctcagcacc ccgcctggtg tgggcctggt ctcagcagcg gctgcggctg 1440
tggcagcctc catctccagc aagtctcctg gcctctcctc ctcatcctct tcatcctcat 1500
cctcctcctc ctccacttgc agcgagacgg cagcacagac ccctggaggt ccaggggggc 1560
ccgaggcagg gtccaaacct gagtgagggc cagccatgcc tcccctccca ttcctctggt 1620
ccctgccttg gtcccttgcc tgggaagagg gcgaggaggc cagtggtggg gacgcagagg 1680
gtcctcagag caggagtgac aagggaggaa agaccaaaaa aacaaccaac caaaaaaaaa 1740
aaaaaaaagg aaagaaacta accaacaaaa gagaaaacca aaaataatca caacagaaac 1800
cagctgcccc aaaggaacca gaggtgaaaa acaaacaaaa aaaaaccaaa aacaaaccaa 1860
aaaaaaaaaa aacccacaaa atcaaacaaa ccaaaaaacc agtcgcgagc cagacctcag 1920
cgtgctcacc ctcactgcta cgacgccaaa taaaaacccc agccaggggc ggagaagcct 1980
ccagcaggtc agacctcatg ccaccgagcc ctggctgtgg gaccaacccc caaccctgcc 2040
tcccccgtgg gggctacaga aggaaaaaga gaagatgcca gcttcctaat cccagccccc 2100
agccctgggc cagcgaagac agggcacagc ctgggcagat gggctggggc ttagcaccac 2160
ccaccaaatg ttcttttcca gaaggtgaaa gagaaagggc ctgaacaacc ttacaccaaa 2220
tattcagtag cttcatccaa aggatgtaca gaatttttag cattgtgctc aacagaatgt 2280
gtccctacca tgtgtcccct tctcccctgg cccccagctc tccccacctg ggcagggggt 2340
cttgctttaa cctcctccct ccccccagca ggggagagtt caagggaagg cctgcggaca 2400
acttttcatc ccctgttcct ccctcttttc ccctttgaag ggtgggctag gccattttgc 2460
caagttctag ctctcacaag tccctcctca accctgtcac cctcctctgc tctggaactg 2520
actccctccc cagcctatgg gaaggtggaa atttcaggca agaggggatg aagacattca 2580
gttgagggca ttcagttgtc tttttccatc ctgtctgttt ccctgaaaaa aaaaaaaatt 2640
catattccag tgcctatccg tgggatcctt cacgttcttt gacattaacc agaaaccaaa 2700
aagagaactc acctcgcttt tcccaccctg tgcccctctg gtactggcga atgcctctcc 2760
ctccccccac atgcacgcac gcacacaccc cagtggtggg gttcctcgag atggcatccc 2820
catcagggtc catgtggtgg ggacagtgcc acagcctgtg gtccccatct gagagggcgc 2880
ggtggtggcc accactccca caagacctcc acaactcttg gctggaccct gtgtttgacc 2940
agccccggac acgtctccac tgacgacgga cagaggagag acaccactgg gagccaccct 3000
gccctcctgc tattgtggag gccaacaaag ttttgaaccg aaaaccaaaa aaacagaaac 3060
aaacaaacaa taaatttaaa ctagaaaaaa aatttatata tatatataaa catatatata 3120
tatatatata taaaagggaa agaagatgag gactcttcaa gaataagatg gaaccgcgag 3180
gcggagccaa gcccctgcac cggtggtcca ggcccttgtt ccccaaggcg gatggaagga 3240
cggatgcttc tttctcttca caaatacctc atggacgttg tctttgggaa agccggaggg 3300
ggcggctggg gcagggcctg tccctcggcc agggcagatg gaagcgggtg ggcggggctg 3360
agagagggta tcagggacag gggtgaagag ccccaaactc ccctcccaaa tcaactgaaa 3420
aagctttaaa aaaagacagg aaaaaaagga gtaagaaagc aaaaagaaca gatgaaggaa 3480
aacttaacaa cttttgggtg gtttgattcc tcctttgatt tctttttcgg ttctcttttg 3540
tctgacctct ctcctcccct ccttcctctc tcctcccctc cttcctctct cctctcctcc 3600
ttcctgtctc ctctctctcc ctttgctctc ctctcttctc tctcaatctt gttctttctg 3660
tgtctctcct caaaccaaag tgttgcagtg aaggacacga gccattgaaa tcagggtggg 3720
gcctgacagc cgtagtgtgg cccctgcccc ttgttgggca ggagcagaga gagaggagag 3780
ccctgagacc gcagggcttt ggcctgggca gcttccactt tctgccggac actctgagga 3840
gaggcagatg gagctccagg cctcagcgtt ctcttttttc cttgtctccc ttctcccacc 3900
cgggaaatga aactgttggg ctgggcgacg gaggcagcag agactcggcc attgcagggg 3960
cctctgggtg ccttgtccgg gcagtggtga agtagccgcc cctcccaccc cggcacgatg 4020
gggcctggca cctcgtctac ccaacctcac cggaatgtaa gcatctccgc tgaacgactc 4080
cctgccctta ccctacctct gagtttgtcc atgtttattt cctgaagaga gaagggactg 4140
gcaagagggc ttgggcccca gcccaagggt ggagaggggg ccaagggctc ctgaccaaat 4200
aggaaggaca tttgagcaga gcaaaatgaa aggaattaca accaaaaacc ccctccgaga 4260
agacaggcag catggaaggc atggtggaga tgactcaaac aaaaactatg attctagacc 4320
aaaaaggaaa aaaaaaaaaa aaaaaggaaa gaaaatccag gactcaccac cacccacttc 4380
atcctgcttc ctccccatca agtcccacca cctgggacct ctccccaccc cagtatgtgg 4440
gagtggggta aagaggagag gaggagcagg gggcagggat ggggcagatg ccctgatgtt 4500
ggtggaggga cccctgtgca gccgggggtg ctggggaccc tccccatcca accctcacac 4560
ctaggaaaag aaaaacaaga aaaaaaaatt cctgggaggg ggaaaaataa ccaaatccca 4620
agctttaact acagctgtga aaccaaatat tgggaagggg gtaggaggga gggaggggca 4680
ggtgagcccc aggtctcccc ctgggccccc ctcccctgag gactcgagat aaggcaccaa 4740
atacctcata gaactttaat gttaaaaaaa ggaaaccaaa tatcgttggc tgggggccag 4800
ccagggcaag gggctggggg gctggaggga gccagggttg ggaaggtgat gggggaattc 4860
atgcccccca cccccatctg ccccttacac tctggtcccc cgtctcgact ccgggaaaca 4920
aaactatctc atcgttttta ttttgtggtc tgtacagagc ctgtgtccgt gtgtctgtgt 4980
atgagcgaga gagttggagg gctggggagc tttatgtggg ggatggggag ggcaacccca 5040
ggctaggcta tgggctaggt tagatgtccg aggttggggg ccaagggcct gggctagaaa 5100
gagaggagag atgagtgggt tggagcagtg gggagccctc tgaatttctc ttctccccaa 5160
ccctgacctc ctacttaagg gagggacaga gactgggtct accctaatgg tgggcctttt 5220
cattgtgcca aaaaaaaaaa aaaaaaaatt gatgccagta actaaaccac ctctctctgt 5280
tctcttctgg ggggtggggt gggtgtgtgt gtgggggtgg gtagggatgg ggaacccaag 5340
ccccagtagg agttggggct gagatcagga ggaggggaag gatatgagtt ttccatccat 5400
ctccctggag aaggccatga cgggctctcc aagccctggg gagatctggc gagggactag 5460
ccaagtcagg gtgacctctt gacctgacta gtttttctaa cccacatgaa caccagggtg 5520
cctctctctt cctctagccc agggtagggg gacctgagtg agagagagag tgagacagac 5580
agcaagattg agacacgcgg gcccccactg gtctctgagt gggaaaggca agtgcgagag 5640
ataaaggcct ccaagagaaa aaagaaacaa accaaagatt taaccattta aaaaaaaaaa 5700
aaaacccaca gacaactccg ctaccttttt atatgttgga aaaaaagtga aaaaaaatta 5760
aaaataaaaa taaaaaaaaa tacacaaaaa ctccaagccg caagtccttc atgcggttta 5820
aactttgacc tcagcagtct ccaaagcaag acgacgatga caaaggcgga aacaaagaaa 5880
aaataatgta tatttttaag tatttgtcct tgaattgtta gctccttgtt aaacaaacaa 5940
aaggagagaa aaatccagag agaagtgttg ctgggatgga gacaactatt tactatgtct 6000
gtgacccctc ccttctgcct ccctcccctt cctctctttc ctgtcctcta tccttgctgc 6060
ccctccccca atatgtcccc caaacccagg ggctcagtat ttatttattt atataattgc 6120
agggtgactg ggggcctctc tcggcaactc gtagagagcc tttggataat ggcagggtgg 6180
ggaatgggcg aggagtcttt gggaagagca ggggcctgaa ctttcaactc ctttctggcc 6240
tcttgtccca ctgcttccac catcaccttc caaatccagg gctgcactag ggcaaggtga 6300
ccaccctact tggtaaagac ctgattggcg cagcccattc tggtgtgggt gtctttcacc 6360
acccagaatg accaggttgg aatgggggtg ggtgaagttg gagggggtgg caagtacaag 6420
gcaagggtgc ccaggaaccg gcatccagaa atcagccctg gagtagacct gcctttcctc 6480
catccttacc aagttagcct ccctttcagt tccgtctgac ccctgtgttg gtagcccctt 6540
ccttcccttc ctgtaccccc tccgtctctt ctcgtggtag cgggttagtg tttcagtgtt 6600
cttaactcca atctgcttgt tcattgtaca atgtgcttct tttaaggccc catttttgta 6660
acttgagtgt gtcattcatc tgaacaacaa acatcaaaaa ataaaaaatt aaaaactgta 6720
<210> 26
<211> 1203
<212> DNA
<213> 智人
<220>
<221> misc_feature
<222> (1)..(1203)
<223> POU2F2人CDS NM_001247994.2:36-1238智人POU
2类同源盒2 (POU2F2), 转录物变体4, mRNA
<400> 26
atggttcact ccagcatggg ggctccagaa ataagaatgt ctaagcccct ggaggccgag 60
aagcaaggtc tggactcccc atcagagcac acagacaccg aaagaaatgg accagacact 120
aatcatcaga acccccaaaa taagacctcc ccattctccg tgtccccaac tggccccagt 180
acaaagatca aggctgaaga ccccagtggc gattcagccc cagcagcacc cctgccccct 240
cagccggccc agcctcatct gccccaggcc caactcatgt tgacgggcag ccagctagct 300
ggggacatac agcagctcct ccagctccag cagctggtgc ttgtgccagg ccaccacctc 360
cagccacctg ctcagttcct gctaccgcag gcccagcaga gccagccagg cctgctaccg 420
acaccaaatc tattccagct acctcagcaa acccagggag ctcttctgac ctcccagccc 480
cgggccgggc ttcccacaca gccccccaaa tgcttggagc caccatccca ccccgaggag 540
cccagtgatc tggaggagct ggagcaattc gcccgcacct tcaagcaacg ccgcatcaag 600
ctgggcttca cgcagggtga tgtgggcctg gccatgggca agctctacgg caacgacttc 660
agccagacga ccatttcccg cttcgaggcc ctcaacctga gcttcaagaa catgtgcaaa 720
ctcaagcccc tcctggagaa gtggctcaac gatgcagaga ctatgtctgt ggactcaagc 780
ctgcccagcc ccaaccagct gagcagcccc agcctgggtt tcgacggcct gcccggccgg 840
agacgcaaga agaggaccag catcgagaca aacgtccgct tcgccttaga gaagagtttt 900
ctagcgaacc agaagcctac ctcagaggag atcctgctga tcgccgagca gctgcacatg 960
gagaaggaag tgatccgcgt ctggttctgc aaccggcgcc agaaggagaa acgcatcaac 1020
ccctgcagtg cggcccccat gctgcccagc ccagggaagc cggccagcta cagcccccat 1080
atggtcacac cccaaggggg cgcggggacc ttaccgttgt cccaagcttc cagcagtctg 1140
agcacaacag cacaaacccc agccctcaag gcagccactc ggctatcggc ttgtcaggcc 1200
tga 1203
<210> 27
<211> 400
<212> PRT
<213> 智人
<220>
<221> MISC_FEATURE
<222> (1)..(400)
<223> POU2F2人蛋白>NP_001234923.1 POU结构域,
2类,转录因子2同种型4 [智人]
<400> 27
Met Val His Ser Ser Met Gly Ala Pro Glu Ile Arg Met Ser Lys Pro
1 5 10 15
Leu Glu Ala Glu Lys Gln Gly Leu Asp Ser Pro Ser Glu His Thr Asp
20 25 30
Thr Glu Arg Asn Gly Pro Asp Thr Asn His Gln Asn Pro Gln Asn Lys
35 40 45
Thr Ser Pro Phe Ser Val Ser Pro Thr Gly Pro Ser Thr Lys Ile Lys
50 55 60
Ala Glu Asp Pro Ser Gly Asp Ser Ala Pro Ala Ala Pro Leu Pro Pro
65 70 75 80
Gln Pro Ala Gln Pro His Leu Pro Gln Ala Gln Leu Met Leu Thr Gly
85 90 95
Ser Gln Leu Ala Gly Asp Ile Gln Gln Leu Leu Gln Leu Gln Gln Leu
100 105 110
Val Leu Val Pro Gly His His Leu Gln Pro Pro Ala Gln Phe Leu Leu
115 120 125
Pro Gln Ala Gln Gln Ser Gln Pro Gly Leu Leu Pro Thr Pro Asn Leu
130 135 140
Phe Gln Leu Pro Gln Gln Thr Gln Gly Ala Leu Leu Thr Ser Gln Pro
145 150 155 160
Arg Ala Gly Leu Pro Thr Gln Pro Pro Lys Cys Leu Glu Pro Pro Ser
165 170 175
His Pro Glu Glu Pro Ser Asp Leu Glu Glu Leu Glu Gln Phe Ala Arg
180 185 190
Thr Phe Lys Gln Arg Arg Ile Lys Leu Gly Phe Thr Gln Gly Asp Val
195 200 205
Gly Leu Ala Met Gly Lys Leu Tyr Gly Asn Asp Phe Ser Gln Thr Thr
210 215 220
Ile Ser Arg Phe Glu Ala Leu Asn Leu Ser Phe Lys Asn Met Cys Lys
225 230 235 240
Leu Lys Pro Leu Leu Glu Lys Trp Leu Asn Asp Ala Glu Thr Met Ser
245 250 255
Val Asp Ser Ser Leu Pro Ser Pro Asn Gln Leu Ser Ser Pro Ser Leu
260 265 270
Gly Phe Asp Gly Leu Pro Gly Arg Arg Arg Lys Lys Arg Thr Ser Ile
275 280 285
Glu Thr Asn Val Arg Phe Ala Leu Glu Lys Ser Phe Leu Ala Asn Gln
290 295 300
Lys Pro Thr Ser Glu Glu Ile Leu Leu Ile Ala Glu Gln Leu His Met
305 310 315 320
Glu Lys Glu Val Ile Arg Val Trp Phe Cys Asn Arg Arg Gln Lys Glu
325 330 335
Lys Arg Ile Asn Pro Cys Ser Ala Ala Pro Met Leu Pro Ser Pro Gly
340 345 350
Lys Pro Ala Ser Tyr Ser Pro His Met Val Thr Pro Gln Gly Gly Ala
355 360 365
Gly Thr Leu Pro Leu Ser Gln Ala Ser Ser Ser Leu Ser Thr Thr Ala
370 375 380
Gln Thr Pro Ala Leu Lys Ala Ala Thr Arg Leu Ser Ala Cys Gln Ala
385 390 395 400
<210> 28
<211> 3323
<212> DNA
<213> 小家鼠
<220>
<221> misc_feature
<222> (1)..(3323)
<223> POU2F2小鼠- 完整序列>NM_001163554.1小家鼠POU结构域,
2类, 转录因子2 (Pou2f2), 转录物变体4, mRNA
<400> 28
cgggcagcat ggttcattcc agcatggggg ctccagaaat aagaatgtct aagcccctgg 60
aggccgagaa gcaaagtctg gactccccgt cagagcacac agacaccgaa agaaatggac 120
ccgacattaa ccatcagaac ccccagaata aagcgtcccc attctctgtg tccccaactg 180
gccccagcac caagatcaag gctgaagacc ccagtggcga ttcagcccca gcagcacccc 240
cgccccccca gccggctcag cctcatctgc cccaggccca actcatgctg acgggcagcc 300
agctagctgg ggacatacag caactcctcc agctccagca gctggtgctt gtccccggcc 360
accacctcca gccacctgct cagttcctgc tgccacaggc acagcagagt cagccaggcc 420
tgctaccaac gccaaatcta ttccagctac ctcaacaaac ccagggagct ctcctgacct 480
cccagccccg ggctgggctt cctacacagc ccccgaaatg cttggagccg ccctcccacc 540
cggaggagcc cagcgatctg gaggagctgg aacagtttgc tcgcaccttc aagcaacgcc 600
gcatcaagct gggcttcaca cagggtgatg tgggcctggc catgggcaag ctctatggca 660
acgacttcag ccaaacgacc atttcccgct tcgaggccct caacctgagc ttcaagaaca 720
tgtgtaaact caagcccctc ctggagaagt ggctcaacga cgcagagact atgtctgtgg 780
attcaagcct acccagccca aaccagctga gcagccccag cctgggtttc gacgggctgc 840
cggggcggag acgcaagaag aggaccagca tcgagacgaa tgtccgcttc gccttagaga 900
agagtttcct agcgaaccag aagcctacct cagaggagat cctgctgatc gcagagcagc 960
tgcacatgga gaaggaagtg atccgcgtct ggttctgcaa ccggcgccag aaggagaaac 1020
gcatcaaccc ttgcagtgcg gcccccatgc tgcccagccc gggaaagccg accagctaca 1080
gccctcacct ggtcacaccc caagggggcg cagggacctt accattgtcc caagcttcta 1140
gcagtctgag cacaacagtt actaccttat cctcagctgt ggggacgctc catcccagcc 1200
ggacagcagg agggggtggg ggtgggggcg gagctgcgcc ccccctcaat tccatcccct 1260
ctgtcactcc cccacccccg gccaccacca acagcacaaa cccgagccct caaggcagcc 1320
actcggctat tggcttgtcg ggcctgaacc ccagcgcggg aagcacaatg gtggggttga 1380
gctctgggct gagtccagcc ctcatgagca acaacccttt ggccactatc caagccctgg 1440
cctctggtgg aaccctgccc cttaccagcc ttgatggcag cgggaacctg gtgctggggg 1500
cagccggtgc ggccccaggg agtcccagct tagtaacctc gcctctcttc ttgaaccaca 1560
ccggtctgcc gctgctcagt gccccaccag gcgtgggcct ggtctcagcg gcggctgcag 1620
ccgtagcagc atccatctcc agcaagtctc ctggcctctc ctcgtcttct tcatcctcat 1680
catcctccac gtgcagtgat gtggcagcac agacccctgg aggccccgga ggacccgagg 1740
cggggtccaa ggctgagtga gagccggcca tgcctcccct cctactcctc tgaatctccc 1800
gaccttggtc ccctgcccaa gagagggtaa ggaggctggt gggggatacg gtgggtcctt 1860
ggagcaggag tgacaaagga ggaaaagacc aaaaatccaa ccaaaaagaa aagaaaagaa 1920
aaaacgacaa aacaaaaaaa ataaataaat aaacaaaaaa acaaagagac caaccaacaa 1980
aaaagaaaac caaaaatatt cacaacagaa accagctgcc ctaaaggaaa cagatggaaa 2040
acaaagaaac gaacaactac aacaaaacaa caacaacaac aacaaaaaac acctccaaac 2100
taaccaaaca cccccacccc caaaccaaac aaaccaaaaa ccagtcttga gccagatctt 2160
ggagtgctca ctgctatgac accaaataag aaccttcagc caggagcgga gcagcctcca 2220
gcaggtcgga cctcatgccg cggagccctg gctgtgggac caactcctaa ccttacccca 2280
ctgagggtgg gcagagaagg aaccagagaa tgtgccagct tgctggccca gccccagctc 2340
tgggccaatg gagacaggta ctgcctgggg caggaggccc agggcccaga gccacctacc 2400
aaatggtctt ttccacaagg taaaagagaa aggacccgaa caaccttaca ccaaatactc 2460
agtagcttca tccaaaggat gtacagagtt ttcagcgttg cgctcaacag aatgtgtgtc 2520
ccttctacgt gtttcctccc cccccagccc cccagctctc cccaacaggg cagggggtct 2580
cgctttaacc cctccccccg ccacccccag tctggggaga gttcagagga aggcctggga 2640
atgggctttt ttgtcctctg cccctttctt tctccctttg aagggtgggc taggccattt 2700
tttgccgagt tccagctctc aaaagcccct cctgagctcc cccatcacct ttctgctctg 2760
gagtctacct cctccccaga ctgtgggaag gtggtcggtt caagcaaagg gggatacagt 2820
gagggcaacc aagttgtctt tcttcccatc ctgtctgtca gtttccctga aaaaaaaaat 2880
catattccag tgcctatccg tgggatcttc acgttctttg acattaacca gaaaccaaaa 2940
agattaactc gcctcgctct gcccacccta caccccctat gatactggca gatacctctc 3000
ctgtcccctt ccctccccac cttccaatgc acacacgcac acaccccagt ggtgggctcc 3060
tccggatggc atctccatca gaaccacatg gtggtgacat cactatgacc tgtggttccc 3120
catgcaagtg gctaccactc ccacgagacc tccacgagta ttggctggac cctgtatttg 3180
cctagctctg gacaaatctc aacagtggac agaggagatg ctactgggag ccaccatgcc 3240
ctcctaccat ggaggccaac aaagttttga accaaaaaaa aaaaaccaaa ccaaacaaac 3300
aaacacacac aaaacccaac aaa 3323
<210> 29
<211> 1752
<212> DNA
<213> 小家鼠
<220>
<221> misc_feature
<222> (1)..(1752)
<223> POU2F2小鼠CDS >NM_001163554.1:9-1760小家鼠POU结构域,
2类, 转录因子2 (Pou2f2), 转录物变体4, mRNA
<400> 29
atggttcatt ccagcatggg ggctccagaa ataagaatgt ctaagcccct ggaggccgag 60
aagcaaagtc tggactcccc gtcagagcac acagacaccg aaagaaatgg acccgacatt 120
aaccatcaga acccccagaa taaagcgtcc ccattctctg tgtccccaac tggccccagc 180
accaagatca aggctgaaga ccccagtggc gattcagccc cagcagcacc cccgcccccc 240
cagccggctc agcctcatct gccccaggcc caactcatgc tgacgggcag ccagctagct 300
ggggacatac agcaactcct ccagctccag cagctggtgc ttgtccccgg ccaccacctc 360
cagccacctg ctcagttcct gctgccacag gcacagcaga gtcagccagg cctgctacca 420
acgccaaatc tattccagct acctcaacaa acccagggag ctctcctgac ctcccagccc 480
cgggctgggc ttcctacaca gcccccgaaa tgcttggagc cgccctccca cccggaggag 540
cccagcgatc tggaggagct ggaacagttt gctcgcacct tcaagcaacg ccgcatcaag 600
ctgggcttca cacagggtga tgtgggcctg gccatgggca agctctatgg caacgacttc 660
agccaaacga ccatttcccg cttcgaggcc ctcaacctga gcttcaagaa catgtgtaaa 720
ctcaagcccc tcctggagaa gtggctcaac gacgcagaga ctatgtctgt ggattcaagc 780
ctacccagcc caaaccagct gagcagcccc agcctgggtt tcgacgggct gccggggcgg 840
agacgcaaga agaggaccag catcgagacg aatgtccgct tcgccttaga gaagagtttc 900
ctagcgaacc agaagcctac ctcagaggag atcctgctga tcgcagagca gctgcacatg 960
gagaaggaag tgatccgcgt ctggttctgc aaccggcgcc agaaggagaa acgcatcaac 1020
ccttgcagtg cggcccccat gctgcccagc ccgggaaagc cgaccagcta cagccctcac 1080
ctggtcacac cccaaggggg cgcagggacc ttaccattgt cccaagcttc tagcagtctg 1140
agcacaacag ttactacctt atcctcagct gtggggacgc tccatcccag ccggacagca 1200
ggagggggtg ggggtggggg cggagctgcg ccccccctca attccatccc ctctgtcact 1260
cccccacccc cggccaccac caacagcaca aacccgagcc ctcaaggcag ccactcggct 1320
attggcttgt cgggcctgaa ccccagcgcg ggaagcacaa tggtggggtt gagctctggg 1380
ctgagtccag ccctcatgag caacaaccct ttggccacta tccaagccct ggcctctggt 1440
ggaaccctgc cccttaccag ccttgatggc agcgggaacc tggtgctggg ggcagccggt 1500
gcggccccag ggagtcccag cttagtaacc tcgcctctct tcttgaacca caccggtctg 1560
ccgctgctca gtgccccacc aggcgtgggc ctggtctcag cggcggctgc agccgtagca 1620
gcatccatct ccagcaagtc tcctggcctc tcctcgtctt cttcatcctc atcatcctcc 1680
acgtgcagtg atgtggcagc acagacccct ggaggccccg gaggacccga ggcggggtcc 1740
aaggctgagt ga 1752
<210> 30
<211> 583
<212> PRT
<213> 小家鼠
<220>
<221> MISC_FEATURE
<222> (1)..(583)
<223> POU2F2小鼠蛋白>NP_001157026.1 POU结构域, 2类,
转录因子2同种型4 [小家鼠]
<400> 30
Met Val His Ser Ser Met Gly Ala Pro Glu Ile Arg Met Ser Lys Pro
1 5 10 15
Leu Glu Ala Glu Lys Gln Ser Leu Asp Ser Pro Ser Glu His Thr Asp
20 25 30
Thr Glu Arg Asn Gly Pro Asp Ile Asn His Gln Asn Pro Gln Asn Lys
35 40 45
Ala Ser Pro Phe Ser Val Ser Pro Thr Gly Pro Ser Thr Lys Ile Lys
50 55 60
Ala Glu Asp Pro Ser Gly Asp Ser Ala Pro Ala Ala Pro Pro Pro Pro
65 70 75 80
Gln Pro Ala Gln Pro His Leu Pro Gln Ala Gln Leu Met Leu Thr Gly
85 90 95
Ser Gln Leu Ala Gly Asp Ile Gln Gln Leu Leu Gln Leu Gln Gln Leu
100 105 110
Val Leu Val Pro Gly His His Leu Gln Pro Pro Ala Gln Phe Leu Leu
115 120 125
Pro Gln Ala Gln Gln Ser Gln Pro Gly Leu Leu Pro Thr Pro Asn Leu
130 135 140
Phe Gln Leu Pro Gln Gln Thr Gln Gly Ala Leu Leu Thr Ser Gln Pro
145 150 155 160
Arg Ala Gly Leu Pro Thr Gln Pro Pro Lys Cys Leu Glu Pro Pro Ser
165 170 175
His Pro Glu Glu Pro Ser Asp Leu Glu Glu Leu Glu Gln Phe Ala Arg
180 185 190
Thr Phe Lys Gln Arg Arg Ile Lys Leu Gly Phe Thr Gln Gly Asp Val
195 200 205
Gly Leu Ala Met Gly Lys Leu Tyr Gly Asn Asp Phe Ser Gln Thr Thr
210 215 220
Ile Ser Arg Phe Glu Ala Leu Asn Leu Ser Phe Lys Asn Met Cys Lys
225 230 235 240
Leu Lys Pro Leu Leu Glu Lys Trp Leu Asn Asp Ala Glu Thr Met Ser
245 250 255
Val Asp Ser Ser Leu Pro Ser Pro Asn Gln Leu Ser Ser Pro Ser Leu
260 265 270
Gly Phe Asp Gly Leu Pro Gly Arg Arg Arg Lys Lys Arg Thr Ser Ile
275 280 285
Glu Thr Asn Val Arg Phe Ala Leu Glu Lys Ser Phe Leu Ala Asn Gln
290 295 300
Lys Pro Thr Ser Glu Glu Ile Leu Leu Ile Ala Glu Gln Leu His Met
305 310 315 320
Glu Lys Glu Val Ile Arg Val Trp Phe Cys Asn Arg Arg Gln Lys Glu
325 330 335
Lys Arg Ile Asn Pro Cys Ser Ala Ala Pro Met Leu Pro Ser Pro Gly
340 345 350
Lys Pro Thr Ser Tyr Ser Pro His Leu Val Thr Pro Gln Gly Gly Ala
355 360 365
Gly Thr Leu Pro Leu Ser Gln Ala Ser Ser Ser Leu Ser Thr Thr Val
370 375 380
Thr Thr Leu Ser Ser Ala Val Gly Thr Leu His Pro Ser Arg Thr Ala
385 390 395 400
Gly Gly Gly Gly Gly Gly Gly Gly Ala Ala Pro Pro Leu Asn Ser Ile
405 410 415
Pro Ser Val Thr Pro Pro Pro Pro Ala Thr Thr Asn Ser Thr Asn Pro
420 425 430
Ser Pro Gln Gly Ser His Ser Ala Ile Gly Leu Ser Gly Leu Asn Pro
435 440 445
Ser Ala Gly Ser Thr Met Val Gly Leu Ser Ser Gly Leu Ser Pro Ala
450 455 460
Leu Met Ser Asn Asn Pro Leu Ala Thr Ile Gln Ala Leu Ala Ser Gly
465 470 475 480
Gly Thr Leu Pro Leu Thr Ser Leu Asp Gly Ser Gly Asn Leu Val Leu
485 490 495
Gly Ala Ala Gly Ala Ala Pro Gly Ser Pro Ser Leu Val Thr Ser Pro
500 505 510
Leu Phe Leu Asn His Thr Gly Leu Pro Leu Leu Ser Ala Pro Pro Gly
515 520 525
Val Gly Leu Val Ser Ala Ala Ala Ala Ala Val Ala Ala Ser Ile Ser
530 535 540
Ser Lys Ser Pro Gly Leu Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser
545 550 555 560
Thr Cys Ser Asp Val Ala Ala Gln Thr Pro Gly Gly Pro Gly Gly Pro
565 570 575
Glu Ala Gly Ser Lys Ala Glu
580
<210> 31
<211> 3631
<212> DNA
<213> 智人
<220>
<221> misc_feature
<222> (1)..(3631)
<223> TGIF1人- 完整序列>NM_170695.5
智人TGFB诱导的因子同源盒1 (TGIF1), 转录物变体1, mRNA
<400> 31
acattctttc agacccggcc cgctgcgggg cgttcctggg gggtagcctc aaggccagcg 60
gggttccttc ggctgcgttt ctgtgggagg ccctgaaacg cgcggagctt ccctctgcct 120
ccaggctttc ccagcgagag tgaaattaaa cttgaaactc ggatcaactg gcagtcgttg 180
ttggtagaac gccctaagga cccctccccg cgggacggag ggaggactcg ggacagggaa 240
ttggccctgg gagaaaacgc gcggggggcg tccgagacgc cccgtgaaag ccgtgccgac 300
ccttgggagg actgacaggt ctagagacac gcgctgtctg ttgtggtggg cctcccggga 360
ataagtgagg ggctctgtgt ttcgaggatg gttctagcgc agagccgggt gtctgccggg 420
gtgggctccc cgcattgttc gggctccggc gggggcggct ctgattcctt tccatggccc 480
gcctcccacc ccgggaatcc ccagtgctcc ttttccacgg cttttctggc gtccccccga 540
ctctcccgcg gcactttggc ctaccttccc ccagcgccgt ggtcctccct ggcgaccccc 600
tctgcgctcc tggggtcctc ctgcgccccc cctcctccac cggcgcgctg cccacagccg 660
cgtgccctct ccccggagct ggggaccaag gctgggcccc gccggccgca tcggtgggaa 720
cttccgcggt ccccatccca gggcgcacag ggtccagctc ctcggcgccg actcctggaa 780
acaatgaaag gtattgttgc agcatctggc agtgagactg aggatgagga cagcatggac 840
attcccttgg acctttcttc atccgctggc tcaggcaaga gaaggagaag gggcaaccta 900
cccaaggagt ctgtgcagat tcttcgggat tggctgtatg agcaccgtta caatgcctat 960
ccttcagagc aagaaaaagc gttgctgtcc cagcaaacac acctgtctac gctacaggtc 1020
tgtaactggt tcatcaacgc ccgccgcagg ctcctccctg acatgctgag aaaggatggc 1080
aaagatccaa atcagttcac aatttcccgc cgtggggcca agatttctga aacgagctct 1140
gtggagtccg tgatgggcat caaaaacttc atgccagctc tagaggagac cccatttcat 1200
tcctgtacag ctgggccaaa cccaacccta gggaggccac tgtctcctaa gccgtcatcc 1260
ccgggatcag ttttggctcg tccatcagtg atctgccata ccactgtgac tgcattgaaa 1320
gatgtccctt tctctctctg ccagtcggtc ggtgtgggac aaaacacaga tatacagcag 1380
atagcggcca aaaacttcac agacacctct ctcatgtacc cagaggacac ttgtaaatct 1440
ggaccaagta cgaatacaca gagtggtctt ttcaacactc ctccccctac tccaccggac 1500
ctcaaccagg acttcagtgg atttcagctt ctagtggatg ttgcactcaa acgggctgca 1560
gagatggagc ttcaggcaaa acttacagct taacccattt tcaagcaaaa cagttctcag 1620
aaatgtcatg attgccgggg tgaaggcaag agatgaattg cattatttta tatatttttt 1680
attaatattt gcacatggga ttgctaaaac agcttcctgt tactgagatg tcttcaatgg 1740
aatacagtca ttccaagaac tataaactta aagctactgt agaaacaaag ggttttcttt 1800
tttaaatgtt tcttggtaga ttattcataa tgtgagatgg ttcccaatat catgtgattt 1860
tttttttcct ccccttccct ttttttgtta ttttttcaga ctgtgcaata cttagagaac 1920
ctatagcatc ttctcattcc catgtggaac aggatgccca catactgtct aattaataaa 1980
ttttccattt tttttcaaac aagtatgaat ctagttggtt gatgcctttt ttttcatgac 2040
ataataaagt attttcttta aaaattgttg taattcagag tatttctgtt gagggaggtg 2100
cttcttaaaa ataagtagga atatagcacc ccagtgagca ggaagctggg ggggtagggt 2160
gcagtgttag ggggtgtcac cagctctttg aaaacctgtg gacaacaagc cagtttgcat 2220
aaacaggatg tgtgatattt actcttgata ggaggcatag caggccctta gagctttact 2280
taaactgcat ggcaaattga aatgaatcat tttagtgttt agctagccac taaatccctt 2340
gctgatctgt cctgcgtagt ttaaaacgtg gccatgttat aaagaaaagt cttgaagcat 2400
ccggttattg ctgagttttc agaagaaaac aacgcaagca atctcttgcc ctttttccac 2460
gttacccttt aaaataacct ttgttcagtg gtacatttag ttaaggcatt tgtattcaaa 2520
tgtagcatag tatttgcaga gaaattacct tttaaaacat ctcaaaagtg ttttctaaaa 2580
tgatgaaata aacggtgggg ttctgtctat ggtgggattg tcagtttatt aggtgagtat 2640
tttccaaggt ctcagtttgt gaaaaaatgg tgctatgtac aagtggagtt tgttgggata 2700
gacaatcact tacaaagccc agatggccag aagggaattg gtgttgctcc tttcacttgt 2760
atggttgagc agatttgtca catgccacaa aaccattctt tgtagtcatg cttagtatga 2820
gcttgctttg ttgttacggt catgttcttt ccagccctgc cccagctatg aattgtctaa 2880
gtgtgaggga gcatcagagc ttcagtattg tttaaagaat gtagaaaaat acaaatttgt 2940
ctaattttga aactatttta gaaagtccct actgtagaaa gagctaaact caaagacttt 3000
ccttgtaagt gaaagaaaaa aactgctgat gattaccaag aggcgtaaga aaattaggtg 3060
agggaatgtt gtgctctagc ctgtggggtg cagcttcgta gggcacatca gcttagaatg 3120
gtctggcctc acctcagaga gtcacagaaa tagacaaaaa cccaaaaagt actagtaaag 3180
taaatacaag gaacaaaata aacattaata aaaatgaatg acgcaaaggt gatccagaat 3240
cattacttgc tccaacagct agatcaaagg ctgaacattt tgctttagta tagatgaggg 3300
aaatgagcaa gtcctaaaag gtttgccctg ttacggtgtt tcatcactca tactggaagg 3360
agaaggaaag gagccatatg actcatttaa aagaaaaact gtaagtcacc ttaaattccc 3420
tatgttttac ttcatttttt ccctattgag gttgttacag gtcatggttg ataatagttg 3480
gtgttgaatg aatgtaatcc attatttaaa aacaggtaca tttaagtgaa aagataaatg 3540
taaaaagctg ttcatatcaa agccaaatag aaatgatttt tatgcttata atttacatgg 3600
tgcaatatat ctttttaatt tgtttacttt t 3631
<210> 32
<211> 759
<212> DNA
<213> 智人
<220>
<221> misc_feature
<222> (1)..(759)
<223> TGIF1人CDS >NM_170695.5:835-1593
智人TGFB诱导的因子同源盒1 (TGIF1), 转录物变体1, mRNA
<400> 32
atggacattc ccttggacct ttcttcatcc gctggctcag gcaagagaag gagaaggggc 60
aacctaccca aggagtctgt gcagattctt cgggattggc tgtatgagca ccgttacaat 120
gcctatcctt cagagcaaga aaaagcgttg ctgtcccagc aaacacacct gtctacgcta 180
caggtctgta actggttcat caacgcccgc cgcaggctcc tccctgacat gctgagaaag 240
gatggcaaag atccaaatca gttcacaatt tcccgccgtg gggccaagat ttctgaaacg 300
agctctgtgg agtccgtgat gggcatcaaa aacttcatgc cagctctaga ggagacccca 360
tttcattcct gtacagctgg gccaaaccca accctaggga ggccactgtc tcctaagccg 420
tcatccccgg gatcagtttt ggctcgtcca tcagtgatct gccataccac tgtgactgca 480
ttgaaagatg tccctttctc tctctgccag tcggtcggtg tgggacaaaa cacagatata 540
cagcagatag cggccaaaaa cttcacagac acctctctca tgtacccaga ggacacttgt 600
aaatctggac caagtacgaa tacacagagt ggtcttttca acactcctcc ccctactcca 660
ccggacctca accaggactt cagtggattt cagcttctag tggatgttgc actcaaacgg 720
gctgcagaga tggagcttca ggcaaaactt acagcttaa 759
<210> 33
<211> 275
<212> PRT
<213> 智人
<220>
<221> MISC_FEATURE
<222> (1)..(275)
<223> TGIF1人蛋白>NP_001265611.1同源盒蛋白TGIF1同种型e[智人]
<400> 33
Met Ala Ala Ala His Ser Gly Arg Ser Ile Val Ala Ala Ser Gly Ser
1 5 10 15
Glu Thr Glu Asp Glu Asp Ser Met Asp Ile Pro Leu Asp Leu Ser Ser
20 25 30
Ser Ala Gly Ser Gly Lys Arg Arg Arg Arg Gly Asn Leu Pro Lys Glu
35 40 45
Ser Val Gln Ile Leu Arg Asp Trp Leu Tyr Glu His Arg Tyr Asn Ala
50 55 60
Tyr Pro Ser Glu Gln Glu Lys Ala Leu Leu Ser Gln Gln Thr His Leu
65 70 75 80
Ser Thr Leu Gln Val Cys Asn Trp Phe Ile Asn Ala Arg Arg Arg Leu
85 90 95
Leu Pro Asp Met Leu Arg Lys Asp Gly Lys Asp Pro Asn Gln Phe Thr
100 105 110
Ile Ser Arg Arg Gly Ala Lys Ile Ser Glu Thr Ser Ser Val Glu Ser
115 120 125
Val Met Gly Ile Lys Asn Phe Met Pro Ala Leu Glu Glu Thr Pro Phe
130 135 140
His Ser Cys Thr Ala Gly Pro Asn Pro Thr Leu Gly Arg Pro Leu Ser
145 150 155 160
Pro Lys Pro Ser Ser Pro Gly Ser Val Leu Ala Arg Pro Ser Val Ile
165 170 175
Cys His Thr Thr Val Thr Ala Leu Lys Asp Val Pro Phe Ser Leu Cys
180 185 190
Gln Ser Val Gly Val Gly Gln Asn Thr Asp Ile Gln Gln Ile Ala Ala
195 200 205
Lys Asn Phe Thr Asp Thr Ser Leu Met Tyr Pro Glu Asp Thr Cys Lys
210 215 220
Ser Gly Pro Ser Thr Asn Thr Gln Ser Gly Leu Phe Asn Thr Pro Pro
225 230 235 240
Pro Thr Pro Pro Asp Leu Asn Gln Asp Phe Ser Gly Phe Gln Leu Leu
245 250 255
Val Asp Val Ala Leu Lys Arg Ala Ala Glu Met Glu Leu Gln Ala Lys
260 265 270
Leu Thr Ala
275
<210> 34
<211> 1720
<212> DNA
<213> 小家鼠
<220>
<221> misc_feature
<222> (1)..(1720)
<223> TGIF1小鼠- 完整序列>NM_001164075.1
小家鼠TGFB-诱导的因子同源盒1 (Tgif1), 转录物变体1,mRNA
<400> 34
gcgctgtgcc gcgcccgcgt gtcaatggca gctgcgcctt ctggaagaag ttgattcttg 60
tctggaatac ttttccttcg gtgccccacc ccttgaatca gaagctgctg cctccggcgg 120
ctggagcaga gacagtgaag tttggggcag ctgtcaaaaa caaggatggg ggtgtctttt 180
cttctctggg gtggacagag gagatgtcct taaacctctc tccatccttt acaaactaaa 240
tgcactcttc gctagcccgg ggaggaaagg ccccaaaaga gaaaccagtg tggccgctga 300
gagctgaggg atggagatgg tgctctcgcc acgggcaggc aggcatgacc ctgcttctgg 360
gttccgccag cgcgtagagg gacgggtgat ctcggccccc aggcccaggg ttgtggagct 420
ggaaggtctt gttgcagcat ctggcagtga ctctgaggat gaagacagca tggacagtcc 480
cctggacctt tcctcatcag cagcctctgg caagagaagg aggagaggca atctgcccaa 540
ggagtcagtc cagattctgc gagactggct gtatgaacac agatacaacg cctatccctc 600
agagcaagag aaagcactgc tgtcccagca gacacacctg tccacactac aggtctgtaa 660
ctggttcatc aacgcccgcc gcaggctcct tcctgacatg ctgagaaagg atggcaaaga 720
tccaaatcag ttcacgattt cccgccgtgg ggccaagatt tcagaagcta gctctattga 780
agctgcaatg ggtatcaaaa acttcatgcc aactctagaa gagagcccat ttcattcctg 840
cgtagttgga cccaacccaa ccctagggag accagtgtct cccaaacctc cctccccagg 900
atccattttg gctcgcccgt cagtgatctg ccataccact gtgactgcat tgaaggatgg 960
gcctttctct ctctgtcagc cgattggtgt gggacagagt acagatgtac cgcaaatagc 1020
acccagcaac tttacagaca cctctctcgt gtacccagag gacacttgca aatctggacc 1080
cagtccaaac cctcagagtg gtcttttcaa cactcctccc cctactccac cagacctcaa 1140
ccaggatttt agtggattcc agcttctagt ggatgttgca ctcaaacgag cggcagagat 1200
ggagcttcag gccaaactca cagcttaacc gttttttcaa acaaaacagt tctccaaaat 1260
acggtcctga ttgccggggg tgatggcaag agatgcatta ttttatatat ttttctatta 1320
atatttgcac atgggattgc tcagacgaag cttcctgtta ctaagatgtc ttaagtggaa 1380
tagagtcatt ccaagaacta caaactaaag ctactgtaga aacaaagggt tttcttttcg 1440
aatgtttctt ggtagtttct cataatgtga gacggttccc agtatcatgt gatcttctcc 1500
tccagactcc tcttctttat gttccaagac tgtgcaatac tttagacgcc ctcgcacctc 1560
tctcttccca tgtggaatgg gacgcccacc tacagtctaa tgagtaaact ttcagttttt 1620
tgtttgtttg ttttttttta gattcaagca agtatgaatc tagttgttgg ataccttttt 1680
tcatgatgta ataaagtatt ttctttaaaa gttattgcaa 1720
<210> 35
<211> 918
<212> DNA
<213> 小家鼠
<220>
<221> misc_feature
<222> (1)..(918)
<223> TGIF1小鼠CDS >NM_001164075.1:311-1228
小家鼠TGFB-诱导的因子同源盒1 (Tgif1), 转录物变体1,mRNA
<400> 35
atggagatgg tgctctcgcc acgggcaggc aggcatgacc ctgcttctgg gttccgccag 60
cgcgtagagg gacgggtgat ctcggccccc aggcccaggg ttgtggagct ggaaggtctt 120
gttgcagcat ctggcagtga ctctgaggat gaagacagca tggacagtcc cctggacctt 180
tcctcatcag cagcctctgg caagagaagg aggagaggca atctgcccaa ggagtcagtc 240
cagattctgc gagactggct gtatgaacac agatacaacg cctatccctc agagcaagag 300
aaagcactgc tgtcccagca gacacacctg tccacactac aggtctgtaa ctggttcatc 360
aacgcccgcc gcaggctcct tcctgacatg ctgagaaagg atggcaaaga tccaaatcag 420
ttcacgattt cccgccgtgg ggccaagatt tcagaagcta gctctattga agctgcaatg 480
ggtatcaaaa acttcatgcc aactctagaa gagagcccat ttcattcctg cgtagttgga 540
cccaacccaa ccctagggag accagtgtct cccaaacctc cctccccagg atccattttg 600
gctcgcccgt cagtgatctg ccataccact gtgactgcat tgaaggatgg gcctttctct 660
ctctgtcagc cgattggtgt gggacagagt acagatgtac cgcaaatagc acccagcaac 720
tttacagaca cctctctcgt gtacccagag gacacttgca aatctggacc cagtccaaac 780
cctcagagtg gtcttttcaa cactcctccc cctactccac cagacctcaa ccaggatttt 840
agtggattcc agcttctagt ggatgttgca ctcaaacgag cggcagagat ggagcttcag 900
gccaaactca cagcttaa 918
<210> 36
<211> 305
<212> PRT
<213> 小家鼠
<220>
<221> MISC_FEATURE
<222> (1)..(305)
<223> TGIF1小鼠蛋白>NP_001157547.1同源盒蛋白TGIF1同种型a [小家鼠]
<400> 36
Met Glu Met Val Leu Ser Pro Arg Ala Gly Arg His Asp Pro Ala Ser
1 5 10 15
Gly Phe Arg Gln Arg Val Glu Gly Arg Val Ile Ser Ala Pro Arg Pro
20 25 30
Arg Val Val Glu Leu Glu Gly Leu Val Ala Ala Ser Gly Ser Asp Ser
35 40 45
Glu Asp Glu Asp Ser Met Asp Ser Pro Leu Asp Leu Ser Ser Ser Ala
50 55 60
Ala Ser Gly Lys Arg Arg Arg Arg Gly Asn Leu Pro Lys Glu Ser Val
65 70 75 80
Gln Ile Leu Arg Asp Trp Leu Tyr Glu His Arg Tyr Asn Ala Tyr Pro
85 90 95
Ser Glu Gln Glu Lys Ala Leu Leu Ser Gln Gln Thr His Leu Ser Thr
100 105 110
Leu Gln Val Cys Asn Trp Phe Ile Asn Ala Arg Arg Arg Leu Leu Pro
115 120 125
Asp Met Leu Arg Lys Asp Gly Lys Asp Pro Asn Gln Phe Thr Ile Ser
130 135 140
Arg Arg Gly Ala Lys Ile Ser Glu Ala Ser Ser Ile Glu Ala Ala Met
145 150 155 160
Gly Ile Lys Asn Phe Met Pro Thr Leu Glu Glu Ser Pro Phe His Ser
165 170 175
Cys Val Val Gly Pro Asn Pro Thr Leu Gly Arg Pro Val Ser Pro Lys
180 185 190
Pro Pro Ser Pro Gly Ser Ile Leu Ala Arg Pro Ser Val Ile Cys His
195 200 205
Thr Thr Val Thr Ala Leu Lys Asp Gly Pro Phe Ser Leu Cys Gln Pro
210 215 220
Ile Gly Val Gly Gln Ser Thr Asp Val Pro Gln Ile Ala Pro Ser Asn
225 230 235 240
Phe Thr Asp Thr Ser Leu Val Tyr Pro Glu Asp Thr Cys Lys Ser Gly
245 250 255
Pro Ser Pro Asn Pro Gln Ser Gly Leu Phe Asn Thr Pro Pro Pro Thr
260 265 270
Pro Pro Asp Leu Asn Gln Asp Phe Ser Gly Phe Gln Leu Leu Val Asp
275 280 285
Val Ala Leu Lys Arg Ala Ala Glu Met Glu Leu Gln Ala Lys Leu Thr
290 295 300
Ala
305
<210> 37
<211> 2258
<212> DNA
<213> 智人
<220>
<221> misc_feature
<222> (1)..(2258)
<223> RELB人完整序列>NM_006509.4 RELB原癌基因, NF-kB亚基, mRNA [智人]
<400> 37
gcagccccgg gcgccgcgcg tcctgcccgg cctgcggccc cagcccttgc gccgctcgtc 60
cgacccgcga tcgtccacca gaccgtgcct cccggccgcc cggccggccc gcgtgcatgc 120
ttcggtctgg gccagcctct gggccgtccg tccccactgg ccgggccatg ccgagtcgcc 180
gcgtcgccag accgccggct gcgccggagc tgggggcctt agggtccccc gacctctcct 240
cactctcgct cgccgtttcc aggagcacag atgaattgga gatcatcgac gagtacatca 300
aggagaacgg cttcggcctg gacgggggac agccgggccc gggcgagggg ctgccacgcc 360
tggtgtctcg cggggctgcg tccctgagca cggtcaccct gggccctgtg gcgcccccag 420
ccacgccgcc gccttggggc tgccccctgg gccgactagt gtccccagcg ccgggcccgg 480
gcccgcagcc gcacctggtc atcacggagc agcccaagca gcgcggcatg cgcttccgct 540
acgagtgcga gggccgctcg gccggcagca tccttgggga gagcagcacc gaggccagca 600
agacgctgcc cgccatcgag ctccgggatt gtggagggct gcgggaggtg gaggtgactg 660
cctgcctggt gtggaaggac tggcctcacc gagtccaccc ccacagcctc gtggggaaag 720
actgcaccga cggcatctgc agggtgcggc tccggcctca cgtcagcccc cggcacagtt 780
ttaacaacct gggcatccag tgtgtgagga agaaggagat tgaggctgcc attgagcgga 840
agattcaact gggcattgac ccctacaacg ctgggtccct gaagaaccat caggaagtag 900
acatgaatgt ggtgaggatc tgcttccagg cctcatatcg ggaccagcag ggacagatgc 960
gccggatgga tcctgtgctt tccgagcccg tctatgacaa gaaatccaca aacacatcag 1020
agctgcggat ttgccgaatt aacaaggaaa gcgggccgtg caccggtggc gaggagctct 1080
acttgctctg cgacaaggtg cagaaagagg acatatcagt ggtgttcagc agggcctcct 1140
gggaaggtcg ggctgacttc tcccaggccg acgtgcaccg ccagattgcc attgtgttca 1200
agacgccgcc ctacgaggac ctggagattg tcgagcccgt gacagtcaac gtcttcctgc 1260
agcggctcac cgatggggtc tgcagcgagc cattgccttt cacgtacctg cctcgcgacc 1320
atgacagcta cggcgtggac aagaagcgga aacgggggat gcccgacgtc cttggggagc 1380
tgaacagctc tgacccccat ggcatcgaga gcaaacggcg gaagaaaaag ccggccatcc 1440
tggaccactt cctgcccaac cacggctcag gcccgttcct cccgccgtca gccctgctgc 1500
cagaccctga cttcttctct ggcaccgtgt ccctgcccgg cctggagccc cctggcgggc 1560
ctgacctcct ggacgatggc tttgcctacg accctacggc ccccacactc ttcaccatgc 1620
tggacctgct gcccccggca ccgccacacg ctagcgctgt tgtgtgcagc ggaggtgccg 1680
gggccgtggt tggggagacc cccggccctg aaccactgac actggactcg taccaggccc 1740
cgggccccgg ggatggaggc accgccagcc ttgtgggcag caacatgttc cccaatcatt 1800
accgcgaggc ggcctttggg ggcggcctcc tatccccggg gcctgaagcc acgtagcccc 1860
gcgatgccag aggaggggca ctgggtgggg agggaggtgg aggagccgtg caatcccaac 1920
caggatgtct agcaccccca tccccttggc ccttcctcat gcttctgaag tggacatatt 1980
cagccttggc gagaagctcc gttgcacggg tttccccttg agcccatttt acagatgagg 2040
aaactgagtc cggagaggaa aagggacatg gctcccgtgc actagcttgt tacagctgcc 2100
tctgtcccca catgtggggg caccttctcc agtaggattc ggaaaagatt gtacatatgg 2160
gaggaggggg cagattcctg gccctccctc cccagacttg aaggtggggg gtaggttggt 2220
tgttcagagt cttcccaata aagatgagtt tttgagcc 2258
<210> 38
<211> 1740
<212> DNA
<213> 智人
<220>
<221> misc_feature
<222> (1)..(1740)
<223> RELB人CDS > NM_006509.4:117-1856 RELB原癌基因,
NF-kB亚基(RELB), mRNA, CDS [智人]
<400> 38
atgcttcggt ctgggccagc ctctgggccg tccgtcccca ctggccgggc catgccgagt 60
cgccgcgtcg ccagaccgcc ggctgcgccg gagctggggg ccttagggtc ccccgacctc 120
tcctcactct cgctcgccgt ttccaggagc acagatgaat tggagatcat cgacgagtac 180
atcaaggaga acggcttcgg cctggacggg ggacagccgg gcccgggcga ggggctgcca 240
cgcctggtgt ctcgcggggc tgcgtccctg agcacggtca ccctgggccc tgtggcgccc 300
ccagccacgc cgccgccttg gggctgcccc ctgggccgac tagtgtcccc agcgccgggc 360
ccgggcccgc agccgcacct ggtcatcacg gagcagccca agcagcgcgg catgcgcttc 420
cgctacgagt gcgagggccg ctcggccggc agcatccttg gggagagcag caccgaggcc 480
agcaagacgc tgcccgccat cgagctccgg gattgtggag ggctgcggga ggtggaggtg 540
actgcctgcc tggtgtggaa ggactggcct caccgagtcc acccccacag cctcgtgggg 600
aaagactgca ccgacggcat ctgcagggtg cggctccggc ctcacgtcag cccccggcac 660
agttttaaca acctgggcat ccagtgtgtg aggaagaagg agattgaggc tgccattgag 720
cggaagattc aactgggcat tgacccctac aacgctgggt ccctgaagaa ccatcaggaa 780
gtagacatga atgtggtgag gatctgcttc caggcctcat atcgggacca gcagggacag 840
atgcgccgga tggatcctgt gctttccgag cccgtctatg acaagaaatc cacaaacaca 900
tcagagctgc ggatttgccg aattaacaag gaaagcgggc cgtgcaccgg tggcgaggag 960
ctctacttgc tctgcgacaa ggtgcagaaa gaggacatat cagtggtgtt cagcagggcc 1020
tcctgggaag gtcgggctga cttctcccag gccgacgtgc accgccagat tgccattgtg 1080
ttcaagacgc cgccctacga ggacctggag attgtcgagc ccgtgacagt caacgtcttc 1140
ctgcagcggc tcaccgatgg ggtctgcagc gagccattgc ctttcacgta cctgcctcgc 1200
gaccatgaca gctacggcgt ggacaagaag cggaaacggg ggatgcccga cgtccttggg 1260
gagctgaaca gctctgaccc ccatggcatc gagagcaaac ggcggaagaa aaagccggcc 1320
atcctggacc acttcctgcc caaccacggc tcaggcccgt tcctcccgcc gtcagccctg 1380
ctgccagacc ctgacttctt ctctggcacc gtgtccctgc ccggcctgga gccccctggc 1440
gggcctgacc tcctggacga tggctttgcc tacgacccta cggcccccac actcttcacc 1500
atgctggacc tgctgccccc ggcaccgcca cacgctagcg ctgttgtgtg cagcggaggt 1560
gccggggccg tggttgggga gacccccggc cctgaaccac tgacactgga ctcgtaccag 1620
gccccgggcc ccggggatgg aggcaccgcc agccttgtgg gcagcaacat gttccccaat 1680
cattaccgcg aggcggcctt tgggggcggc ctcctatccc cggggcctga agccacgtag 1740
<210> 39
<211> 579
<212> PRT
<213> 智人
<220>
<221> MISC_FEATURE
<222> (1)..(579)
<223> RELB人蛋白>NP_006500.2转录因子RelB [智人]
<400> 39
Met Leu Arg Ser Gly Pro Ala Ser Gly Pro Ser Val Pro Thr Gly Arg
1 5 10 15
Ala Met Pro Ser Arg Arg Val Ala Arg Pro Pro Ala Ala Pro Glu Leu
20 25 30
Gly Ala Leu Gly Ser Pro Asp Leu Ser Ser Leu Ser Leu Ala Val Ser
35 40 45
Arg Ser Thr Asp Glu Leu Glu Ile Ile Asp Glu Tyr Ile Lys Glu Asn
50 55 60
Gly Phe Gly Leu Asp Gly Gly Gln Pro Gly Pro Gly Glu Gly Leu Pro
65 70 75 80
Arg Leu Val Ser Arg Gly Ala Ala Ser Leu Ser Thr Val Thr Leu Gly
85 90 95
Pro Val Ala Pro Pro Ala Thr Pro Pro Pro Trp Gly Cys Pro Leu Gly
100 105 110
Arg Leu Val Ser Pro Ala Pro Gly Pro Gly Pro Gln Pro His Leu Val
115 120 125
Ile Thr Glu Gln Pro Lys Gln Arg Gly Met Arg Phe Arg Tyr Glu Cys
130 135 140
Glu Gly Arg Ser Ala Gly Ser Ile Leu Gly Glu Ser Ser Thr Glu Ala
145 150 155 160
Ser Lys Thr Leu Pro Ala Ile Glu Leu Arg Asp Cys Gly Gly Leu Arg
165 170 175
Glu Val Glu Val Thr Ala Cys Leu Val Trp Lys Asp Trp Pro His Arg
180 185 190
Val His Pro His Ser Leu Val Gly Lys Asp Cys Thr Asp Gly Ile Cys
195 200 205
Arg Val Arg Leu Arg Pro His Val Ser Pro Arg His Ser Phe Asn Asn
210 215 220
Leu Gly Ile Gln Cys Val Arg Lys Lys Glu Ile Glu Ala Ala Ile Glu
225 230 235 240
Arg Lys Ile Gln Leu Gly Ile Asp Pro Tyr Asn Ala Gly Ser Leu Lys
245 250 255
Asn His Gln Glu Val Asp Met Asn Val Val Arg Ile Cys Phe Gln Ala
260 265 270
Ser Tyr Arg Asp Gln Gln Gly Gln Met Arg Arg Met Asp Pro Val Leu
275 280 285
Ser Glu Pro Val Tyr Asp Lys Lys Ser Thr Asn Thr Ser Glu Leu Arg
290 295 300
Ile Cys Arg Ile Asn Lys Glu Ser Gly Pro Cys Thr Gly Gly Glu Glu
305 310 315 320
Leu Tyr Leu Leu Cys Asp Lys Val Gln Lys Glu Asp Ile Ser Val Val
325 330 335
Phe Ser Arg Ala Ser Trp Glu Gly Arg Ala Asp Phe Ser Gln Ala Asp
340 345 350
Val His Arg Gln Ile Ala Ile Val Phe Lys Thr Pro Pro Tyr Glu Asp
355 360 365
Leu Glu Ile Val Glu Pro Val Thr Val Asn Val Phe Leu Gln Arg Leu
370 375 380
Thr Asp Gly Val Cys Ser Glu Pro Leu Pro Phe Thr Tyr Leu Pro Arg
385 390 395 400
Asp His Asp Ser Tyr Gly Val Asp Lys Lys Arg Lys Arg Gly Met Pro
405 410 415
Asp Val Leu Gly Glu Leu Asn Ser Ser Asp Pro His Gly Ile Glu Ser
420 425 430
Lys Arg Arg Lys Lys Lys Pro Ala Ile Leu Asp His Phe Leu Pro Asn
435 440 445
His Gly Ser Gly Pro Phe Leu Pro Pro Ser Ala Leu Leu Pro Asp Pro
450 455 460
Asp Phe Phe Ser Gly Thr Val Ser Leu Pro Gly Leu Glu Pro Pro Gly
465 470 475 480
Gly Pro Asp Leu Leu Asp Asp Gly Phe Ala Tyr Asp Pro Thr Ala Pro
485 490 495
Thr Leu Phe Thr Met Leu Asp Leu Leu Pro Pro Ala Pro Pro His Ala
500 505 510
Ser Ala Val Val Cys Ser Gly Gly Ala Gly Ala Val Val Gly Glu Thr
515 520 525
Pro Gly Pro Glu Pro Leu Thr Leu Asp Ser Tyr Gln Ala Pro Gly Pro
530 535 540
Gly Asp Gly Gly Thr Ala Ser Leu Val Gly Ser Asn Met Phe Pro Asn
545 550 555 560
His Tyr Arg Glu Ala Ala Phe Gly Gly Gly Leu Leu Ser Pro Gly Pro
565 570 575
Glu Ala Thr
<210> 40
<211> 2194
<212> DNA
<213> 小家鼠
<220>
<221> misc_feature
<222> (1)..(2194)
<223> RELB小鼠完整序列>NM_001290457.1
小家鼠鸟网状内皮组织增殖病毒(v-rel)致癌基因相关的B (Relb),转录物变体2, mRNA[小家鼠]
<400> 40
gcccgccgcc cgctcggccc ggcgcccccc gcagccccgg gcgccgtgcg tccagcccgg 60
ccttcggccc tagccttgca ccgcttgccc ggccggtgat cgtcccagcc gaccgggctg 120
tccgactgcc cggcttgccc tcgtgtgcgc tcgccctgag ctggccttcg ggctgtcggt 180
ccctacgggc cgggccatgc cgagtcgccg cgctgccaga gagtccgcgc ccgagctagg 240
ggccttgggt tccagtgacc tctcttccct gtcactaacg gtctccagga ccacagaaat 300
catcgacgaa tacattaagg agaacggctt tggcctggac gggacacagc tgagtgagat 360
gccgcgcctg gtgccccgcg ggcccgcctc actgagcagc gtcacgctgg gccctgctgc 420
accaccgcct ccggccacgc cgtcctggag ctgcacactg ggcaggctgg tgtcacccgg 480
cccgtgccca cggccgtacc tggtcatcac agagcagcca aagcagcgtg gcatgcgctt 540
ccgctacgag tgcgagggcc gctcggccgg cagcatcctc ggggagagca gcaccgaagc 600
cagcaagacc ctgcccgcca tcgagcttcg agactgtggc gggctgcggg aggtggaggt 660
gacggcgtgc ctggtgtgga aggactggcc acaccgggta cacccacata gcctcgtggg 720
gaaagactgc acggacggcg tctgcagggt gcggctgcgg cctcacgtca gcccccggca 780
cagctttaac aacctgggca tccagtgtgt taggaagaag gaaattgaag ctgccattga 840
gcggaagatc cagctgggaa ttgaccccta caatgctggc tccctgaaga accatcagga 900
ggtcgacatg aatgtcgtca ggatctgctt ccaggcctcc tatcgggacc agcagggaca 960
tctgcaccgc atggacccca tcctctctga gcctgtctac gacaagaagt ccaccaacac 1020
atcggagctg cggatttgcc gaatcaacaa ggagagcggg ccgtgcacag gtggtgagga 1080
gctgtacttg ctctgtgaca aggtgcaaaa agaggacata tccgtggtgt tcagcacagc 1140
ttcctgggaa ggccgtgccg acttctctca agctgatgtg caccggcaga tcgccattgt 1200
gttcaaaacg ccaccctacg aggacctgga gatctcagag cccgtgactg tcaatgtgtt 1260
cttgcagcgg ctcacggatg gggtgtgcag cgagccgctg cccttcacgt acctgcctcg 1320
ggatcatgac agctacggtg tggacaagaa gcgaaagcgg ggactgcctg atgtccttgg 1380
agagttgagc agctctgatc cacatggaat cgagagcaaa cgaaggaaaa agaaaccagt 1440
gttcttggac cacttcctgc ctggccacag ctcaggcctg ttcctcccac catcggctct 1500
gcagccggca gactctgatt tcttccctgc ttccatatcc cttcctgggc tggagcctcc 1560
tggtggaccc gatctcctgg acgatggctt tgcctatgat ccttctgccc ccacgctctt 1620
cactatgttg gacctgctgc ccccagcacc accacttgcc agtgctgtgg tgggtagcgg 1680
gggtgcaggg gccaccgttg tggagtcttc tggcccagag cccctatcac tggactcttt 1740
tgcagcgccg ggccccgggg atgttggtac tgctagcctt gtgggcagca acatgtttcc 1800
caaccagtac cgagaggcag ctttcggggg tggcctccta tctccagggc ctgaagccac 1860
gtagcctctg aggtaacaga ggaggcactg ggtgaggtat gtggtatagc actccattcc 1920
gaagccaacc ttgatcagtc ttccagcttc ctcatcctga atcggacatc tgcagcgctg 1980
gtgggaagat ggggagcact ccggttctct ttgagcccat tttacagaat gctgagtccg 2040
aagaggaaaa ggggctcctg cagatggacc ccttctcagg acagattctc agagattgta 2100
cataggggag gagggagcag gtccccagcc ttctccccta atcctgaaga aggcagtgga 2160
ttgttcagtt ttcccaataa aaattagttt tttg 2194
<210> 41
<211> 1668
<212> DNA
<213> 小家鼠
<220>
<221> misc_feature
<222> (1)..(1668)
<223> RELB小鼠CDS >NM_001290457.1:197-1864小家鼠鸟网状内皮组织增殖病毒(v-rel)
致癌基因相关的B (Relb),转录物变体2, mRNA [小家鼠]
<400> 41
atgccgagtc gccgcgctgc cagagagtcc gcgcccgagc taggggcctt gggttccagt 60
gacctctctt ccctgtcact aacggtctcc aggaccacag aaatcatcga cgaatacatt 120
aaggagaacg gctttggcct ggacgggaca cagctgagtg agatgccgcg cctggtgccc 180
cgcgggcccg cctcactgag cagcgtcacg ctgggccctg ctgcaccacc gcctccggcc 240
acgccgtcct ggagctgcac actgggcagg ctggtgtcac ccggcccgtg cccacggccg 300
tacctggtca tcacagagca gccaaagcag cgtggcatgc gcttccgcta cgagtgcgag 360
ggccgctcgg ccggcagcat cctcggggag agcagcaccg aagccagcaa gaccctgccc 420
gccatcgagc ttcgagactg tggcgggctg cgggaggtgg aggtgacggc gtgcctggtg 480
tggaaggact ggccacaccg ggtacaccca catagcctcg tggggaaaga ctgcacggac 540
ggcgtctgca gggtgcggct gcggcctcac gtcagccccc ggcacagctt taacaacctg 600
ggcatccagt gtgttaggaa gaaggaaatt gaagctgcca ttgagcggaa gatccagctg 660
ggaattgacc cctacaatgc tggctccctg aagaaccatc aggaggtcga catgaatgtc 720
gtcaggatct gcttccaggc ctcctatcgg gaccagcagg gacatctgca ccgcatggac 780
cccatcctct ctgagcctgt ctacgacaag aagtccacca acacatcgga gctgcggatt 840
tgccgaatca acaaggagag cgggccgtgc acaggtggtg aggagctgta cttgctctgt 900
gacaaggtgc aaaaagagga catatccgtg gtgttcagca cagcttcctg ggaaggccgt 960
gccgacttct ctcaagctga tgtgcaccgg cagatcgcca ttgtgttcaa aacgccaccc 1020
tacgaggacc tggagatctc agagcccgtg actgtcaatg tgttcttgca gcggctcacg 1080
gatggggtgt gcagcgagcc gctgcccttc acgtacctgc ctcgggatca tgacagctac 1140
ggtgtggaca agaagcgaaa gcggggactg cctgatgtcc ttggagagtt gagcagctct 1200
gatccacatg gaatcgagag caaacgaagg aaaaagaaac cagtgttctt ggaccacttc 1260
ctgcctggcc acagctcagg cctgttcctc ccaccatcgg ctctgcagcc ggcagactct 1320
gatttcttcc ctgcttccat atcccttcct gggctggagc ctcctggtgg acccgatctc 1380
ctggacgatg gctttgccta tgatccttct gcccccacgc tcttcactat gttggacctg 1440
ctgcccccag caccaccact tgccagtgct gtggtgggta gcgggggtgc aggggccacc 1500
gttgtggagt cttctggccc agagccccta tcactggact cttttgcagc gccgggcccc 1560
ggggatgttg gtactgctag ccttgtgggc agcaacatgt ttcccaacca gtaccgagag 1620
gcagctttcg ggggtggcct cctatctcca gggcctgaag ccacgtag 1668
<210> 42
<211> 555
<212> PRT
<213> 小家鼠
<220>
<221> MISC_FEATURE
<222> (1)..(555)
<223> RELB小鼠蛋白>NP_001277386.1转录因子RelB同种型2 [小家鼠]
<400> 42
Met Pro Ser Arg Arg Ala Ala Arg Glu Ser Ala Pro Glu Leu Gly Ala
1 5 10 15
Leu Gly Ser Ser Asp Leu Ser Ser Leu Ser Leu Thr Val Ser Arg Thr
20 25 30
Thr Glu Ile Ile Asp Glu Tyr Ile Lys Glu Asn Gly Phe Gly Leu Asp
35 40 45
Gly Thr Gln Leu Ser Glu Met Pro Arg Leu Val Pro Arg Gly Pro Ala
50 55 60
Ser Leu Ser Ser Val Thr Leu Gly Pro Ala Ala Pro Pro Pro Pro Ala
65 70 75 80
Thr Pro Ser Trp Ser Cys Thr Leu Gly Arg Leu Val Ser Pro Gly Pro
85 90 95
Cys Pro Arg Pro Tyr Leu Val Ile Thr Glu Gln Pro Lys Gln Arg Gly
100 105 110
Met Arg Phe Arg Tyr Glu Cys Glu Gly Arg Ser Ala Gly Ser Ile Leu
115 120 125
Gly Glu Ser Ser Thr Glu Ala Ser Lys Thr Leu Pro Ala Ile Glu Leu
130 135 140
Arg Asp Cys Gly Gly Leu Arg Glu Val Glu Val Thr Ala Cys Leu Val
145 150 155 160
Trp Lys Asp Trp Pro His Arg Val His Pro His Ser Leu Val Gly Lys
165 170 175
Asp Cys Thr Asp Gly Val Cys Arg Val Arg Leu Arg Pro His Val Ser
180 185 190
Pro Arg His Ser Phe Asn Asn Leu Gly Ile Gln Cys Val Arg Lys Lys
195 200 205
Glu Ile Glu Ala Ala Ile Glu Arg Lys Ile Gln Leu Gly Ile Asp Pro
210 215 220
Tyr Asn Ala Gly Ser Leu Lys Asn His Gln Glu Val Asp Met Asn Val
225 230 235 240
Val Arg Ile Cys Phe Gln Ala Ser Tyr Arg Asp Gln Gln Gly His Leu
245 250 255
His Arg Met Asp Pro Ile Leu Ser Glu Pro Val Tyr Asp Lys Lys Ser
260 265 270
Thr Asn Thr Ser Glu Leu Arg Ile Cys Arg Ile Asn Lys Glu Ser Gly
275 280 285
Pro Cys Thr Gly Gly Glu Glu Leu Tyr Leu Leu Cys Asp Lys Val Gln
290 295 300
Lys Glu Asp Ile Ser Val Val Phe Ser Thr Ala Ser Trp Glu Gly Arg
305 310 315 320
Ala Asp Phe Ser Gln Ala Asp Val His Arg Gln Ile Ala Ile Val Phe
325 330 335
Lys Thr Pro Pro Tyr Glu Asp Leu Glu Ile Ser Glu Pro Val Thr Val
340 345 350
Asn Val Phe Leu Gln Arg Leu Thr Asp Gly Val Cys Ser Glu Pro Leu
355 360 365
Pro Phe Thr Tyr Leu Pro Arg Asp His Asp Ser Tyr Gly Val Asp Lys
370 375 380
Lys Arg Lys Arg Gly Leu Pro Asp Val Leu Gly Glu Leu Ser Ser Ser
385 390 395 400
Asp Pro His Gly Ile Glu Ser Lys Arg Arg Lys Lys Lys Pro Val Phe
405 410 415
Leu Asp His Phe Leu Pro Gly His Ser Ser Gly Leu Phe Leu Pro Pro
420 425 430
Ser Ala Leu Gln Pro Ala Asp Ser Asp Phe Phe Pro Ala Ser Ile Ser
435 440 445
Leu Pro Gly Leu Glu Pro Pro Gly Gly Pro Asp Leu Leu Asp Asp Gly
450 455 460
Phe Ala Tyr Asp Pro Ser Ala Pro Thr Leu Phe Thr Met Leu Asp Leu
465 470 475 480
Leu Pro Pro Ala Pro Pro Leu Ala Ser Ala Val Val Gly Ser Gly Gly
485 490 495
Ala Gly Ala Thr Val Val Glu Ser Ser Gly Pro Glu Pro Leu Ser Leu
500 505 510
Asp Ser Phe Ala Ala Pro Gly Pro Gly Asp Val Gly Thr Ala Ser Leu
515 520 525
Val Gly Ser Asn Met Phe Pro Asn Gln Tyr Arg Glu Ala Ala Phe Gly
530 535 540
Gly Gly Leu Leu Ser Pro Gly Pro Glu Ala Thr
545 550 555
<210> 43
<211> 6212
<212> DNA
<213> 智人
<220>
<221> misc_feature
<222> (1)..(6212)
<223> RBPJ人完整序列>NM_001363577.2重组免疫球蛋白κJ区域
信号结合蛋白(RBPJ),转录物变体5, mRNA [智人]
<400> 43
gccgtggctc aggctccagc gcgtgagggc gggcggcggc ggcggcggcg aggccggcgg 60
gtgcgggcgt gagtgcggca gggcgcgcgg ggccctcggg cgccgtgcct cccgcgcgcc 120
tccagcgggc tctgcctggg ggccccgtgc atctttgttc ctcgcggccg ttgtcgccgc 180
ttcgccccgc tagtgccccc ggccccgggc gctttgtctt tcaccgctac ccgctgcgac 240
tctctacccc cggccctttg tttcgccgcc ttgggcgctt tgtccgaccc gtggcgtccc 300
gcatcccctc tccatctcct cccctccgct gtccttcctc ccgtgcccct ttgttgtgcc 360
acactcttgt cgttagccca gtgacctgcc acagcccggg caccagattt ctgccttaat 420
tgttcttcca ttgtctttct cctgtgggtc ccctctcacc tttctgtatg gtcctggatc 480
accccccgag gctttgtctc ccccatccac gggcttattc tctcggcacc cccttcctct 540
cccgtcatcg gttgatttat ccacaaccca cggtgtacgt ctgctgaact gtctcagccc 600
cggggactcc atctgctgct tctaagtgca cacaagactc tacggaaatt tggtgagcgg 660
cctccaccta aacgacttac tagggaagct atgcgaaatt atttaaaaga gcgaggggat 720
caaacagtac ttattcttca tgcaaaagtt gcacagaagt catatggaaa tgaaaaaagg 780
tttttttgcc cacctccttg tgtatatctt atgggcagtg gatggaagaa aaaaaaagaa 840
caaatggaac gcgatggttg ttctgaacaa gagtctcaac cgtgtgcatt tattgggata 900
ggaaatagtg accaagaaat gcagcagcta aacttggaag gaaagaacta ttgcacagcc 960
aaaacattgt atatatctga ctcagacaag cgaaagcact tcatgttgtc tgtaaagatg 1020
ttctatggca acagtgatga cattggtgtg ttcctcagca agcggataaa agtcatctcc 1080
aaaccttcca aaaagaagca gtcattgaaa aatgctgact tatgcattgc ctcaggaaca 1140
aaggtggctc tgtttaatcg actacgatcc cagacagtta gtaccagata cttgcatgta 1200
gaaggaggta attttcatgc cagttcacag cagtggggag ccttttttat tcatctcttg 1260
gatgatgatg aatcagaagg agaagaattc acagtccgag atggctacat ccattatgga 1320
caaacagtca aacttgtgtg ctcagttact ggcatggcac tcccaagatt gataattagg 1380
aaagttgata agcagaccgc attattggat gcagatgatc ctgtgtcaca actccataaa 1440
tgtgcatttt accttaagga tacagaaaga atgtatttgt gcctttctca agaaagaata 1500
attcaatttc aggccactcc atgtccaaaa gaaccaaata aagagatgat aaatgatggc 1560
gcttcctgga caatcattag cacagataag gcagagtata cattttatga gggaatgggc 1620
cctgtccttg ccccagtcac tcctgtgcct gtggtagaga gccttcagtt gaatggcggt 1680
ggggacgtag caatgcttga acttacagga cagaatttca ctccaaattt acgagtgtgg 1740
tttggggatg tagaagctga aactatgtac aggtgtggag agagtatgct ctgtgtcgtc 1800
ccagacattt ctgcattccg agaaggttgg agatgggtcc ggcaaccagt ccaggttcca 1860
gtaactttgg tccgaaatga tggaatcatt tattccacca gccttacctt tacctacaca 1920
ccagaaccag ggccgcggcc acattgcagt gcagcaggag caatccttcg agccaattca 1980
agccaggtgc cccctaacga atcaaacaca aacagcgagg gaagttacac aaacgccagc 2040
acaaattcaa ccagtgtcac atcatctaca gccacagtgg tatcctaact accgtctttt 2100
tgctaggact taaactgact tgagtgtggc aaaaagttaa caaaaaagga gaaaaaatga 2160
acaatcgttt gtggtttctt gggaaaactt ttcataccag gtgatactat tcaaaaaccc 2220
cgttgtctcc ctgcaagtgc tgatttgaaa tgcagaagcc acagtaaaaa aaaaaaaaaa 2280
aaaaaaaaaa aagaaaaaaa aatcaaaatg tataaatatt ggaaatcaag tttttcagct 2340
gttttgttgg ttggttggtt ggtttttgtt tggttttgtt taaatgggca agaagtaaat 2400
aatgtggctg gaatacaagt tgaacaaact agaagacaca aatctaacat agtttttatg 2460
gaccaaggaa cttgtatatt gtataagctt tagtaaaagg tacattttca ccataccttt 2520
ttttatatca cggtattata gtacaccttg ttaccaaata ggttgttctc ttccccaccc 2580
acctttgagc ttttgctcta aaatacattc aggttccaag cctgaccatc cttgtttaat 2640
ctatcatact cttccaggtt tttttttttt ggtctaaggc tggaactttt ttcttttttt 2700
tcagctgaag tcttatgact tttcatgagt caaaattgtt tggatttcag caagtcaaat 2760
cttgcaaagg cctgcatatt ttttttaaga ttatatgaag tctgtgcaaa agctttaaaa 2820
aaatgcctct gccttgcctg caatacatgc aatgtatgtt aacttagtct ctcttctcag 2880
acactgttgg tagttatttc tgtgttttcc tttttttaaa aaaaaatatg gacttattgt 2940
ggttatctga gaggttctaa cattcacatg caatttggtg tggccattta gctattaatg 3000
agttaatggc gcagaacttg ttgatatttg aagtgttctc tccccttttc ccatgacgta 3060
aatacatagg tgtgttccag gatttgttca ggtttttccc ccctcctaat cttgtacata 3120
acttgtatta tgtgtaagtt aaacatttta ttttgaactt ggaatgttcc cagtgatttc 3180
attcagcagg gtattttctg ccttgttggc aagtgacaaa aaatatggga agtatttgct 3240
accagttggt agatggtgcc cttaatggta gaatgaggaa aatgtccgca aaagcatgtt 3300
ttattatctt tacttttttg gggggttgga gggggtagcc tagccagaac atcattgtaa 3360
tcttaaaaca taagatgctt ttattagatg atcaactaaa atagctggaa gacagtactt 3420
tagaaacaga tagttgtaag attataaaat gcaaatgtaa cttatgtttt catttttttc 3480
tctgcctttt ttgtttgttt gttttctctt ttccagtact gagcatctcc acaaatgtct 3540
cctaactcag aaaatgtttc ttttcttttc agttgagatt tggttgcatt cagggttgta 3600
ggttggcctt gcttgctaac cccgccggtt ttaccgtgct ttcattcctg aactttgttt 3660
atgcctttgt ttggtttctt cgaaattgca gcagactcat tgggctacat ttagtacagg 3720
aaccacgtgt gtaatgttat acaacacagt ctagtaatac aatcatccct cttagagtaa 3780
aaactacctc tagattgtgg taagctttta ctgtcccata aaacaggagc cacagtacct 3840
tatgaatgca aaactgtaac ttcctacagt gtttccctac agaacattgt ctttctggtg 3900
tcctgggctg ttttgaaaaa gtttccatta atagactttt tagaaattat tattagtagc 3960
attttttttc cagctttgct gtcttcatca ctcactctat gctcagacta tgccactgta 4020
aatattcttc ctaacatctt taaatcgcct tttcctcagt tttcaagggg aaggtcattt 4080
gtaaagcacg ttaggtggtt aaatcagtta ttgcggtttt ctcttactgc aagccttttt 4140
aatcaccccc aggctgcatt ttattctata tcgccttttt tcttcaaatc tgctccaatc 4200
actcacttct ctcttataag ctaatcctgc ctcacacctt aaatctgttt cagtgatcaa 4260
gggcagaact cattgtggcc ttatctttct ttgttgtaat tgttcactgt ctctttctta 4320
cagaccactt atttctgagt agtagttatt cctctctatg gagtcatggc aggaatcatt 4380
acacagtgct tttgttcaga gcatggacat gttcctagtg ctgctttgct ttaacggcca 4440
caagtttcct ccacttccta ggtttggtat ttagttaagg aatcatatta aattaaccaa 4500
taacaaaaga gatacttttg aagaacaaac tattccttac ccatttttgt agctcaaaaa 4560
taatttttca agttcatgac cttattaaaa tgaacttgtg tttttttaac aaacatgtat 4620
gttttatttt gatagtttct ttccgtaaga taattgaaat attatactgt aaaccctttt 4680
cttttctttt tttgaaaagt ccaagaatgt acttatacag gcatttttcc ccacctattt 4740
ttggccattc tcataccaca gactaaagag tgaaatgatt tgtccattgt agcttattgt 4800
ttatcagtag ttcttttgtc agctgcttac attttttctt tcatggtttt gtgaatcatt 4860
ttcagtatgt aatttatagg aaccttgtcc tctggttata gtagactgtg tgccctcctc 4920
cagtgatggc attattagac atgctggtca tttaccctca gaaagactct cttattagaa 4980
tggtgagtgc ttcagttata gtatgtttga atttttaaaa aattctgttt tagaaatgta 5040
tcttatgctc tcatgactat gcagtttcta aacatacaca tagaagctga gtctctgatc 5100
caatatgttt ttatttgttc catttaattt atcacataga ttgggaaggc aagctaaaag 5160
ccttaaaaat gccctttata ttttgagtga tttcagcgtt gaacactagt atactatcta 5220
aatttgctgc tcactttctt taaactgtgg caattaaagg catgtttata catgacttaa 5280
tcgtgaaatg tttgtcactc ttactgcaca gacttatctg caatcataac tggttagttt 5340
ttttgttttg ttttgtttta ttgtttttaa tgaaactggt accatctgtg ctttcacaaa 5400
aaacttccaa tgccattttt gagaactaac ctaactagtc atgctaacca gaaaatccac 5460
tggggaggag gttccttttg aaacaaaatg ctgttcagtt agtaaccaag ttactttgat 5520
tgcaaaagca gctgtgtttc tgataagtac tgaacaaatg tgtgtaattt tctgtgccag 5580
acttatgact ttgttttcaa gcactgtaat gtgggatgga tggttagaaa caataatata 5640
ttagggtttc tgtttaaccc tttcaggact gaactgtatc tccttttgtt aattttcccc 5700
tgtgttgtga taaatgtttg ccagcattca gtactgtgtt ggtccagatg taggtttata 5760
tgctcatttt tagcttattt cttgtacctt gcagcatgct ctacgcattc agtccttaag 5820
gggtttattt tacaaactgt gcgcctgtaa ggtttattag caataagata gaaaattgag 5880
caagtttata ccataatttt gtagaaaaaa agaatctgct cagttccata tttcatccgt 5940
gaaaaacttg caatacgagc agtttcaagg aataaataaa aaggaaatgt aaaccattgt 6000
aaaagtcttc tgtcgaatgt gcctgatgca tgtattatcg tcttttattt cagaatactt 6060
cataaagata aaattaaatt ctatattata gttggtgtat ttacaatctt accatgtaca 6120
tcacatcaaa gtgatagctc tactaattta atttccttgt caatgttttt aactatatag 6180
tgctttaaag agattttttt tccctgtgta aa 6212
<210> 44
<211> 1398
<212> DNA
<213> 智人
<220>
<221> misc_feature
<222> (1)..(1398)
<223> RBPJ人CDS >NM_001363577.2:691-2088重组免疫球蛋白
κJ区域信号结合蛋白(RBPJ),转录物变体5, mRNA [智人]
<400> 44
atgcgaaatt atttaaaaga gcgaggggat caaacagtac ttattcttca tgcaaaagtt 60
gcacagaagt catatggaaa tgaaaaaagg tttttttgcc cacctccttg tgtatatctt 120
atgggcagtg gatggaagaa aaaaaaagaa caaatggaac gcgatggttg ttctgaacaa 180
gagtctcaac cgtgtgcatt tattgggata ggaaatagtg accaagaaat gcagcagcta 240
aacttggaag gaaagaacta ttgcacagcc aaaacattgt atatatctga ctcagacaag 300
cgaaagcact tcatgttgtc tgtaaagatg ttctatggca acagtgatga cattggtgtg 360
ttcctcagca agcggataaa agtcatctcc aaaccttcca aaaagaagca gtcattgaaa 420
aatgctgact tatgcattgc ctcaggaaca aaggtggctc tgtttaatcg actacgatcc 480
cagacagtta gtaccagata cttgcatgta gaaggaggta attttcatgc cagttcacag 540
cagtggggag ccttttttat tcatctcttg gatgatgatg aatcagaagg agaagaattc 600
acagtccgag atggctacat ccattatgga caaacagtca aacttgtgtg ctcagttact 660
ggcatggcac tcccaagatt gataattagg aaagttgata agcagaccgc attattggat 720
gcagatgatc ctgtgtcaca actccataaa tgtgcatttt accttaagga tacagaaaga 780
atgtatttgt gcctttctca agaaagaata attcaatttc aggccactcc atgtccaaaa 840
gaaccaaata aagagatgat aaatgatggc gcttcctgga caatcattag cacagataag 900
gcagagtata cattttatga gggaatgggc cctgtccttg ccccagtcac tcctgtgcct 960
gtggtagaga gccttcagtt gaatggcggt ggggacgtag caatgcttga acttacagga 1020
cagaatttca ctccaaattt acgagtgtgg tttggggatg tagaagctga aactatgtac 1080
aggtgtggag agagtatgct ctgtgtcgtc ccagacattt ctgcattccg agaaggttgg 1140
agatgggtcc ggcaaccagt ccaggttcca gtaactttgg tccgaaatga tggaatcatt 1200
tattccacca gccttacctt tacctacaca ccagaaccag ggccgcggcc acattgcagt 1260
gcagcaggag caatccttcg agccaattca agccaggtgc cccctaacga atcaaacaca 1320
aacagcgagg gaagttacac aaacgccagc acaaattcaa ccagtgtcac atcatctaca 1380
gccacagtgg tatcctaa 1398
<210> 45
<211> 465
<212> PRT
<213> 智人
<220>
<221> MISC_FEATURE
<222> (1)..(465)
<223> RBPJ人蛋白NP_001350506.1重组无毛结合蛋白抑制因子同种型5 [智人]
<400> 45
Met Arg Asn Tyr Leu Lys Glu Arg Gly Asp Gln Thr Val Leu Ile Leu
1 5 10 15
His Ala Lys Val Ala Gln Lys Ser Tyr Gly Asn Glu Lys Arg Phe Phe
20 25 30
Cys Pro Pro Pro Cys Val Tyr Leu Met Gly Ser Gly Trp Lys Lys Lys
35 40 45
Lys Glu Gln Met Glu Arg Asp Gly Cys Ser Glu Gln Glu Ser Gln Pro
50 55 60
Cys Ala Phe Ile Gly Ile Gly Asn Ser Asp Gln Glu Met Gln Gln Leu
65 70 75 80
Asn Leu Glu Gly Lys Asn Tyr Cys Thr Ala Lys Thr Leu Tyr Ile Ser
85 90 95
Asp Ser Asp Lys Arg Lys His Phe Met Leu Ser Val Lys Met Phe Tyr
100 105 110
Gly Asn Ser Asp Asp Ile Gly Val Phe Leu Ser Lys Arg Ile Lys Val
115 120 125
Ile Ser Lys Pro Ser Lys Lys Lys Gln Ser Leu Lys Asn Ala Asp Leu
130 135 140
Cys Ile Ala Ser Gly Thr Lys Val Ala Leu Phe Asn Arg Leu Arg Ser
145 150 155 160
Gln Thr Val Ser Thr Arg Tyr Leu His Val Glu Gly Gly Asn Phe His
165 170 175
Ala Ser Ser Gln Gln Trp Gly Ala Phe Phe Ile His Leu Leu Asp Asp
180 185 190
Asp Glu Ser Glu Gly Glu Glu Phe Thr Val Arg Asp Gly Tyr Ile His
195 200 205
Tyr Gly Gln Thr Val Lys Leu Val Cys Ser Val Thr Gly Met Ala Leu
210 215 220
Pro Arg Leu Ile Ile Arg Lys Val Asp Lys Gln Thr Ala Leu Leu Asp
225 230 235 240
Ala Asp Asp Pro Val Ser Gln Leu His Lys Cys Ala Phe Tyr Leu Lys
245 250 255
Asp Thr Glu Arg Met Tyr Leu Cys Leu Ser Gln Glu Arg Ile Ile Gln
260 265 270
Phe Gln Ala Thr Pro Cys Pro Lys Glu Pro Asn Lys Glu Met Ile Asn
275 280 285
Asp Gly Ala Ser Trp Thr Ile Ile Ser Thr Asp Lys Ala Glu Tyr Thr
290 295 300
Phe Tyr Glu Gly Met Gly Pro Val Leu Ala Pro Val Thr Pro Val Pro
305 310 315 320
Val Val Glu Ser Leu Gln Leu Asn Gly Gly Gly Asp Val Ala Met Leu
325 330 335
Glu Leu Thr Gly Gln Asn Phe Thr Pro Asn Leu Arg Val Trp Phe Gly
340 345 350
Asp Val Glu Ala Glu Thr Met Tyr Arg Cys Gly Glu Ser Met Leu Cys
355 360 365
Val Val Pro Asp Ile Ser Ala Phe Arg Glu Gly Trp Arg Trp Val Arg
370 375 380
Gln Pro Val Gln Val Pro Val Thr Leu Val Arg Asn Asp Gly Ile Ile
385 390 395 400
Tyr Ser Thr Ser Leu Thr Phe Thr Tyr Thr Pro Glu Pro Gly Pro Arg
405 410 415
Pro His Cys Ser Ala Ala Gly Ala Ile Leu Arg Ala Asn Ser Ser Gln
420 425 430
Val Pro Pro Asn Glu Ser Asn Thr Asn Ser Glu Gly Ser Tyr Thr Asn
435 440 445
Ala Ser Thr Asn Ser Thr Ser Val Thr Ser Ser Thr Ala Thr Val Val
450 455 460
Ser
465
<210> 46
<211> 5590
<212> DNA
<213> 小家鼠
<220>
<221> misc_feature
<222> (1)..(5590)
<223> 小鼠RBPJ完整序列>NM_001277116.1重组免疫球蛋白
κJ区域信号结合蛋白(Rbpj),转录物变体4, mRNA [小家鼠]
<400> 46
agtaatgccc tccggttttc ctcagtctcc acgtacgtcc ccgagggcgc gtcccaaaac 60
ccggataacc ggagcgctcc ccatggacta ctcggagggc ttgtccgcgg aggagcggcc 120
tgcgcatgct ccatcggcgg gctcctggcc ttgtgccctt gaggcaagca ttctacagat 180
tacatacttc tacactccag aagagggcgt cagatctcgt tacggatggt tgtgagccac 240
catgtggttg ctgggatttg aactcctgac cttcagaaga gcagtcgggt gctcttaccc 300
attgagccat ctcaccagcc cgaagtttgg tgagcggcct ccacccaaac gactcactag 360
ggaagctatg cgaaattatt taaaagaacg aggggatcaa acagtgctca ttcttcatgc 420
aaaagttgca cagaagtctt acggaaatga aaaacgattt ttttgccctc ctccttgtgt 480
gtatcttatg ggcagtggtt ggaagaaaaa aaaagagcaa atggaacgag atggatgttc 540
tgaacaagag tctcaaccct gtgcgtttat tggaatagga aacagtgacc aagaaatgca 600
gcagctgaac ttggaaggga agaactactg tacagccaaa acattgtaca tatctgattc 660
agacaagaga aaacatttca tgttgtctgt aaagatgttc tatggcaaca gcgatgacat 720
tggtgtgttc ctcagcaagc ggataaaggt catctccaaa ccctccaaaa agaagcagtc 780
actgaagaat gctgacttgt gcattgcttc aggaacgaag gtggcactgt tcaatcgcct 840
tcggtcccag acagttagta ccaggtacct gcatgtagaa ggagggaatt tccacgccag 900
ttcacaacag tggggagcat tttacatcca tctcttggac gacgacgagt cggaaggaga 960
ggagttcaca gttagagatg gctacatcca ttacgggcag actgtcaagc ttgtgtgctc 1020
agtgactggc atggcactcc caagattgat aattaggaaa gttgataagc agacggcatt 1080
actggatgca gacgaccctg tatcacaact ccacaaatgt gcattttacc ttaaggatac 1140
agaaagaatg tacttgtgcc tttctcaaga aagaataatc caatttcagg ccactccatg 1200
tccaaaagaa caaaataagg aaatgataaa cgatggagct tcctggacaa tcattagcac 1260
agacaaggcc gagtacacgt tctatgaggg aatgggcccc gtccttgccc cagtcacccc 1320
tgtgcctgtc gtagaaagtc ttcagttgaa tggcggcggg gacgtagcaa tgcttgaact 1380
tacaggacaa aactttactc caaatttaag agtgtggttt ggggatgtag aagccgaaac 1440
aatgtacaga tgtggagaga gcatgctctg tgtggtccca gacatttctg cattccggga 1500
aggttggaga tgggtccgcc agccagtcca ggttccagtg actttggtcc gaaatgacgg 1560
ggtcatttac tccaccagcc ttaccttcac ctacacacca gagccagggc cgaggccaca 1620
ctgcagcgct gcaggagcga ttctcagagc caactccagc caagtgccct ccaatgagtc 1680
aaacacaaac agcgagggga attacacaaa tgccagcaca aattctacca gtgtcacatc 1740
gtccacagca accgtggtgt cctaactacc gtctttttgc tgggactgaa actgacttga 1800
atgcagcaaa aagttgacag agaaaggcaa ggataaagtg aacaatcttt tgtggtttct 1860
tggggaaacg tttcgtacca ggtgatctat tcaaaactgc cagtgcaggt gtgacatgca 1920
gaggccacag tacaatgaac aacatcagaa tgcacaggct gatggaaatc caggttttca 1980
gctgtccctc tcacacgcac ccctttttgg ttgttttggt ttggttttga tttaatggcc 2040
aggaaacaga tatgtggctg gagcacaggc taagctagaa gacacaaacc taacatagac 2100
ttatggacca agggactcgc ataagcttta gtgaaaggta cattttcacc atgccttttt 2160
tatatcacag tgtgtactac accttgttac caaacaggtt gcgctcccca ccctttgagg 2220
gttggctcta aagtacattc attcaggttc gagcctcacc atgtttgttc agtctgaaca 2280
ctgaacactt ccaggttctt gtggtctaag gctggaactt ttttaagctg aagtcttatg 2340
actttttcat aagttggaat tattttgatt tcagcaagtc aaattttgta aaggcctgca 2400
tatttttttt aagattatat gaagtctgca aaagctttaa aaatgcctct gccttgcctg 2460
caatacatgc aatgtgtgtt aactgagtct gttctcagaa tctgttggta gttatttctg 2520
tgttttccct ttttttaaag aaatgtattt atagtggtaa tctaagagat gctaacagct 2580
ccgtgtggcc atttagccct ttctgagttc atggcagtga gcactgccgt tcgaaggctg 2640
tgcgcctctg cctgttcatc agttcatctg tgttccagga tttgttcagg tttccccttc 2700
ccccaaatct tgtacataac ttgtattatg tgtaagttaa acattttatc tttacttgga 2760
atgttcccag tgattacatt tcacagggtg tttcccacct tgttggcaaa tgacaaaaaa 2820
tatcatgcga attatttgct accaagatgg tgcccgtagg gtaaaatgag gaacatctct 2880
gcaggatgat ttattgtatg ttttcctttt tttttagcct agccagaaca tcattataat 2940
tttaaagacc ataagatggc ttttattaga tgataactaa ataaactgaa ctaagtagcc 3000
agaagacagt accttagaga ctgatagttg taagtttata aaaatacaaa tgtaacttat 3060
atttccattt tcctctttgc ccctttgttt attttcccca gtagaaatgt tctgtccaga 3120
tttttttccc cctttttagt taagccttgc ttgcattcag gcttgtgtgt cagccctgtg 3180
tgccaatatg ccagattcat tttgtttcca tccctgcact ttggttattc cttgttcagt 3240
ttcttgggaa ttgaagcaga ctcattgggc tacatttagt acaagaacca catgtgtgat 3300
gttaaaggac acagtctagt gatgccatca tccttgagta aaaactacct ctagattgtg 3360
gtaagctttt actgtccctt aaaacaggag ccactagtac cttatgaatg cagactgtaa 3420
cttctatggc attttctggt gtctttctgg ctctctgggc ttctgaaagt tttcacgtag 3480
acccttagac tcctctgtca ggaccattgc tttccagcgt tgctgctttc actcgctgtc 3540
tcgaagactg taccactgta agtgccactg cgaacatctt taaacggctc ctccttggct 3600
tgtcaagggg aaggccacct gtaagatgtg gtggtgacag ccttcctgtc caccccaggc 3660
tgctccttac cagctgttct agccgattct cttcagatca gctacagccc acctcctccc 3720
cgccccgcct cacccctcca ggctgtctca gtggtccagg gcagagctca ctgtggcctt 3780
actgatctat ctctgcagta actgttctcc ctcagttgcg atggaagcag gcccctgcga 3840
tctgttcaga gcctgagcag tttgcagttc cgttgtgctt tctgaggtcc tagattcagg 3900
atgtagttaa gtcgataact gagaacagat gagatgcttt taaagaacaa actcttcctt 3960
acccagcttt gtagcttaaa gtaagttttc aagttcatgg ctttattaaa atgaactttg 4020
actttttaaa atgcttatat ctcatttcgg ttcttttgtt tattttagta agaattgaaa 4080
cagactataa gtagttttcc tgtttgtttg tttgtttgtt tgttttttac aaagttcaag 4140
aatgtaacaa taaaggcatt ttgctatggt tctatgtcag actcctacca cagaatagtg 4200
tgaacagata tgtctgtctg ctttaccatg ttattgatag ttcttttgtc agctgctttt 4260
attaaaagtc ctcttttgtg aatttgtaaa tcaatttcaa tatatgatgt acaggagtct 4320
tgtcccccgt tactgtggaa tgttcccccc tgttgatgct ggtcccttta tcctcttatc 4380
ttcttagaac cgggaattct tcagttgtgg tgtgtggctg tatttttcta ttctgtttac 4440
agtgcgtcta cacagcttct taaaacatgt cgaaatgcag aatccaagat ccaccatgaa 4500
tttgtcaaga gattggaagg caggccaaaa cctgaaaatg tggggtgccc tttgtacaat 4560
tgcagagtca agcactagtg ggagtttgaa ctggctactg accgctctta aactgtggca 4620
gtgtccttgg gtacagttaa ctagcagaac ctgctgtcac tcttcctgcc accatcttaa 4680
ccttgacagt tttgtttctt ttctgtgtgg tgctggggtc aaatccagag ccttaggctt 4740
gctaaggctt tttgtttgtt tgatttttcc ttgagacaag tgtctcaccc atggagcccc 4800
tcagcattct tgaaagtctc tgccttagcc tccaggtcct gtggtaccag gtgtggcagg 4860
atgcttttgt cattagagaa actggtctgt gtccaaccac agaaaaattc caatgcagtt 4920
tttaataatt aatccactag ttatttaatc aaaatctttt tgtagggaga gattcctgtt 4980
tgaaagaaaa tgctatttga ttattaacca aggtaacaga cactgctttc caaagcagct 5040
gtgtttctgg tgagtactga acaaatgtgt ggttttctgt gccacctttg attcattttt 5100
caagcactgt tactcgggat ggatggttag aaacatgata ttttagtgtt tctacttaac 5160
cctttcagga ccgagctgta ttgcctttct attaatttcc ccctgtgttg tgataaatgt 5220
ttgctcgtgt tcagtcctgt gtcagtccag acgtaggttt gtgtaaaagc tcagttgtag 5280
cttatttctt gtaccttgca gcatgttcta cgcatccagt ccttaagggg tttactctac 5340
aaactgtgca cctgcgtgta agggtcatta gcaataagat agaaaattga gcaagtttat 5400
accataattt tgtagaaaaa ggaatctgct caattccata tttcatccat gaaaactctg 5460
caatacgggc agtttctagg aataaataaa aaggagatgc aaaccattgt agatgtcttc 5520
tgtgggatgt gcccgatgcg tgcatcatcg tctttgattc caggtacttc ataaagataa 5580
aattaaattc 5590
<210> 47
<211> 1524
<212> DNA
<213> 小家鼠
<220>
<221> misc_feature
<222> (1)..(1524)
<223> 小鼠RBPJ CDS >NM_001277116.1:242-1765重组免疫球蛋白
κJ区域信号结合蛋白(Rbpj),转录物变体4, mRNA [小家鼠]
<400> 47
atgtggttgc tgggatttga actcctgacc ttcagaagag cagtcgggtg ctcttaccca 60
ttgagccatc tcaccagccc gaagtttggt gagcggcctc cacccaaacg actcactagg 120
gaagctatgc gaaattattt aaaagaacga ggggatcaaa cagtgctcat tcttcatgca 180
aaagttgcac agaagtctta cggaaatgaa aaacgatttt tttgccctcc tccttgtgtg 240
tatcttatgg gcagtggttg gaagaaaaaa aaagagcaaa tggaacgaga tggatgttct 300
gaacaagagt ctcaaccctg tgcgtttatt ggaataggaa acagtgacca agaaatgcag 360
cagctgaact tggaagggaa gaactactgt acagccaaaa cattgtacat atctgattca 420
gacaagagaa aacatttcat gttgtctgta aagatgttct atggcaacag cgatgacatt 480
ggtgtgttcc tcagcaagcg gataaaggtc atctccaaac cctccaaaaa gaagcagtca 540
ctgaagaatg ctgacttgtg cattgcttca ggaacgaagg tggcactgtt caatcgcctt 600
cggtcccaga cagttagtac caggtacctg catgtagaag gagggaattt ccacgccagt 660
tcacaacagt ggggagcatt ttacatccat ctcttggacg acgacgagtc ggaaggagag 720
gagttcacag ttagagatgg ctacatccat tacgggcaga ctgtcaagct tgtgtgctca 780
gtgactggca tggcactccc aagattgata attaggaaag ttgataagca gacggcatta 840
ctggatgcag acgaccctgt atcacaactc cacaaatgtg cattttacct taaggataca 900
gaaagaatgt acttgtgcct ttctcaagaa agaataatcc aatttcaggc cactccatgt 960
ccaaaagaac aaaataagga aatgataaac gatggagctt cctggacaat cattagcaca 1020
gacaaggccg agtacacgtt ctatgaggga atgggccccg tccttgcccc agtcacccct 1080
gtgcctgtcg tagaaagtct tcagttgaat ggcggcgggg acgtagcaat gcttgaactt 1140
acaggacaaa actttactcc aaatttaaga gtgtggtttg gggatgtaga agccgaaaca 1200
atgtacagat gtggagagag catgctctgt gtggtcccag acatttctgc attccgggaa 1260
ggttggagat gggtccgcca gccagtccag gttccagtga ctttggtccg aaatgacggg 1320
gtcatttact ccaccagcct taccttcacc tacacaccag agccagggcc gaggccacac 1380
tgcagcgctg caggagcgat tctcagagcc aactccagcc aagtgccctc caatgagtca 1440
aacacaaaca gcgaggggaa ttacacaaat gccagcacaa attctaccag tgtcacatcg 1500
tccacagcaa ccgtggtgtc ctaa 1524
<210> 48
<211> 507
<212> PRT
<213> 小家鼠
<220>
<221> MISC_FEATURE
<222> (1)..(507)
<223> 小鼠RBPJ蛋白>NP_001264045.1
重组无毛结合蛋白抑制因子同种型4 [小家鼠]
<400> 48
Met Trp Leu Leu Gly Phe Glu Leu Leu Thr Phe Arg Arg Ala Val Gly
1 5 10 15
Cys Ser Tyr Pro Leu Ser His Leu Thr Ser Pro Lys Phe Gly Glu Arg
20 25 30
Pro Pro Pro Lys Arg Leu Thr Arg Glu Ala Met Arg Asn Tyr Leu Lys
35 40 45
Glu Arg Gly Asp Gln Thr Val Leu Ile Leu His Ala Lys Val Ala Gln
50 55 60
Lys Ser Tyr Gly Asn Glu Lys Arg Phe Phe Cys Pro Pro Pro Cys Val
65 70 75 80
Tyr Leu Met Gly Ser Gly Trp Lys Lys Lys Lys Glu Gln Met Glu Arg
85 90 95
Asp Gly Cys Ser Glu Gln Glu Ser Gln Pro Cys Ala Phe Ile Gly Ile
100 105 110
Gly Asn Ser Asp Gln Glu Met Gln Gln Leu Asn Leu Glu Gly Lys Asn
115 120 125
Tyr Cys Thr Ala Lys Thr Leu Tyr Ile Ser Asp Ser Asp Lys Arg Lys
130 135 140
His Phe Met Leu Ser Val Lys Met Phe Tyr Gly Asn Ser Asp Asp Ile
145 150 155 160
Gly Val Phe Leu Ser Lys Arg Ile Lys Val Ile Ser Lys Pro Ser Lys
165 170 175
Lys Lys Gln Ser Leu Lys Asn Ala Asp Leu Cys Ile Ala Ser Gly Thr
180 185 190
Lys Val Ala Leu Phe Asn Arg Leu Arg Ser Gln Thr Val Ser Thr Arg
195 200 205
Tyr Leu His Val Glu Gly Gly Asn Phe His Ala Ser Ser Gln Gln Trp
210 215 220
Gly Ala Phe Tyr Ile His Leu Leu Asp Asp Asp Glu Ser Glu Gly Glu
225 230 235 240
Glu Phe Thr Val Arg Asp Gly Tyr Ile His Tyr Gly Gln Thr Val Lys
245 250 255
Leu Val Cys Ser Val Thr Gly Met Ala Leu Pro Arg Leu Ile Ile Arg
260 265 270
Lys Val Asp Lys Gln Thr Ala Leu Leu Asp Ala Asp Asp Pro Val Ser
275 280 285
Gln Leu His Lys Cys Ala Phe Tyr Leu Lys Asp Thr Glu Arg Met Tyr
290 295 300
Leu Cys Leu Ser Gln Glu Arg Ile Ile Gln Phe Gln Ala Thr Pro Cys
305 310 315 320
Pro Lys Glu Gln Asn Lys Glu Met Ile Asn Asp Gly Ala Ser Trp Thr
325 330 335
Ile Ile Ser Thr Asp Lys Ala Glu Tyr Thr Phe Tyr Glu Gly Met Gly
340 345 350
Pro Val Leu Ala Pro Val Thr Pro Val Pro Val Val Glu Ser Leu Gln
355 360 365
Leu Asn Gly Gly Gly Asp Val Ala Met Leu Glu Leu Thr Gly Gln Asn
370 375 380
Phe Thr Pro Asn Leu Arg Val Trp Phe Gly Asp Val Glu Ala Glu Thr
385 390 395 400
Met Tyr Arg Cys Gly Glu Ser Met Leu Cys Val Val Pro Asp Ile Ser
405 410 415
Ala Phe Arg Glu Gly Trp Arg Trp Val Arg Gln Pro Val Gln Val Pro
420 425 430
Val Thr Leu Val Arg Asn Asp Gly Val Ile Tyr Ser Thr Ser Leu Thr
435 440 445
Phe Thr Tyr Thr Pro Glu Pro Gly Pro Arg Pro His Cys Ser Ala Ala
450 455 460
Gly Ala Ile Leu Arg Ala Asn Ser Ser Gln Val Pro Ser Asn Glu Ser
465 470 475 480
Asn Thr Asn Ser Glu Gly Asn Tyr Thr Asn Ala Ser Thr Asn Ser Thr
485 490 495
Ser Val Thr Ser Ser Thr Ala Thr Val Val Ser
500 505
Claims (37)
1.包含编码三种转录因子的组合的三种多核苷酸序列的构建体或载体,其中编码的转录因子的组合选自:
PU.1、IRF4和PRDM1;
PU.1、IRF4和POU2F2;
PU.1、IRF4和TGIF1;和
PU.1、IRF4和RBPJ;
并且,其中所述转录因子PU.1是SEQ.ID.3或SEQ.ID.6,转录因子IRF4是SEQ.ID.9或SEQ.ID.12,转录因子PRDM1是SEQ.ID.15或SEQ.ID.18,转录因子POU2F2是SEQ.ID.27或SEQ.ID.30,转录因子TGIF1是SEQ.ID.33或SEQ.ID.36和转录因子RBPJ是SEQ.ID.45或SEQ.ID.48。
2.根据权利要求1所述的构建体或载体,其中所述编码的转录因子的组合从5’至3’遵循以下序列顺序:
PU.1、IRF4和PRDM1。
3.根据权利要求1-2中的任一项所述的构建体或载体,其中所述载体是病毒载体。
4.根据权利要求3所述的构建体或载体,其中所述病毒载体是逆转录病毒载体、腺病毒载体、慢病毒载体、疱疹病毒载体、痘病毒载体、副粘病毒载体、棒状病毒载体、甲病毒载体、黄病毒载体或腺相关病毒载体。
5.根据权利要求1-2中的任一项所述的构建体或载体,其中所述载体或构建体是合成的mRNA、裸甲病毒RNA复制子或裸黄病毒RNA复制子。
6.一种组合物,其包含一种或多种根据权利要求1-5中的任一项所述的构建体或载体。
7.根据权利要求6所述的组合物,其中所述转录因子分别由下述序列的多核苷酸编码:由SEQ.ID.1、SEQ.ID.2、SEQ.ID.4或SEQ.ID.5编码的PU.1,由SEQ.ID.7、SEQ.ID.8、SEQ.ID.10或SEQ.ID.11编码的IRF4,由SEQ.ID.13、SEQ.ID.14、SEQ.ID.16或SEQ.ID.17编码的PRDM1,由SEQ.ID.25、SEQ.ID.26、SEQ.ID.28或SEQ.ID.29编码的POU2F2,由SEQ.ID.31、SEQ.ID.32、SEQ.ID.34或SEQ.ID.35编码的TGIF1和由SEQ.ID.43、SEQ.ID.44、SEQ.ID.46或SEQ.ID.47编码的RBPJ。
8.根据权利要求6所述的组合物,其中所述编码的转录因子的组合选自以下组合:
PU.1、IRF4和PRDM1;
PU.1、IRF4和POU2F2;
PU.1、IRF4和TGIF1;和
PU.1、IRF4和RBPJ。
9.根据权利要求6所述的组合物,其中所述编码的转录因子的组合是PU.1、IRF4和PRDM1。
10.根据权利要求6所述的组合物,其中所述一种或多种载体是病毒载体。
11.根据权利要求10所述的组合物,其中所述病毒载体是逆转录病毒载体、腺病毒载体、慢病毒载体、疱疹病毒载体、痘病毒载体、副粘病毒、棒状病毒载体、甲病毒载体、黄病毒载体或腺相关病毒载体。
12.根据权利要求6所述的组合物,其中所述一种或多种载体或构建体是合成的mRNA、裸甲病毒RNA复制子或裸黄病毒RNA复制子。
13.根据权利要求6所述的组合物,其中所述组合物用于将干细胞或分化的细胞重新程序化或诱导为常规树突细胞2型,并且其中所述干细胞或分化的细胞选自:多能干细胞、多潜能干细胞、分化的细胞、肿瘤细胞、癌细胞及其混合物。
14.根据权利要求6-13中的任一项所述的组合物用于制造用于治疗癌症、感染性疾病或自身免疫性疾病的药物的用途。
15.一种将干细胞或分化的细胞重新程序化或诱导为常规树突细胞2型的体外或离体方法,所述方法包括以下步骤:
用一种或多种载体转导细胞,所述细胞选自:干细胞或分化的细胞、及其混合物,
所述载体包含编码三种转录因子的三种多核苷酸序列,第一种转录因子和第二种转录因子是PU.1和IRF4且第三种转录因子选自PRDM1、POU2F2、TGIF1和RBPJ,其中转录因子PU.1是SEQ.ID.3或SEQ.ID.6,转录因子IRF4是SEQ.ID.9或SEQ.ID.12,转录因子PRDM1是SEQ.ID.15或SEQ.ID.18,转录因子POU2F2是SEQ.ID.27或SEQ.ID.30,转录因子TGIF1是SEQ.ID.33或SEQ.ID.36和转录因子RBPJ是SEQ.ID.45或SEQ.ID.48;以及
在支持树突细胞或抗原呈递细胞的生长的细胞培养基中培养所述转导的细胞。
16.根据权利要求15所述的方法,培养所述转导的细胞至少2天。
17.根据权利要求15所述的方法,培养所述转导的细胞至少5天。
18.根据权利要求15所述的方法,培养所述转导的细胞至少8天。
19.根据权利要求15所述的方法,培养所述转导的细胞至少9天。
20.根据权利要求15所述的方法,培养所述转导的细胞至少10天。
21.根据权利要求15所述的方法,其中所述细胞选自:多能干细胞或多潜能干细胞、分化的细胞、及其混合物。
22.根据权利要求21所述的方法,其中所述细胞是哺乳动物细胞。
23.根据权利要求21所述的方法,其中所述多能干细胞、多潜能干细胞或分化的细胞选自:内胚层衍生的细胞、中胚层衍生的细胞或外胚层衍生的细胞。
24.根据权利要求21所述的方法,其中所述多潜能干细胞包括间质干细胞、造血干细胞、肠干细胞、多能干细胞和细胞系。
25.根据权利要求21所述的方法,其中所述细胞是非人细胞或人细胞。
26.根据权利要求21所述的方法,其中所述细胞是小鼠细胞。
27.根据权利要求21所述的方法,其中所述细胞是人或小鼠成纤维细胞。
28.根据权利要求21所述的方法,其中所述细胞是哺乳动物脐带血干细胞。
29.根据权利要求15所述的方法,其中所述转导步骤进一步包含至少一种选自以下的载体:编码IL-12的核酸序列;编码IL-4的核酸序列;编码IFN-α的核酸序列;编码IFN-β的核酸序列;编码IFN-γ的核酸序列;编码TNF的核酸序列;编码GM-CSF的核酸序列;编码靶向IL-10RNA的siRNA的核酸序列,及其混合物。
30.根据权利要求15所述的方法,其中所述转导步骤进一步包含至少一种包含编码免疫刺激性细胞因子的多核酸序列的载体。
31.诱导的树突细胞,其包含根据权利要求1-5中的任一项所述的构建体或载体或根据权利要求6-13中任一项所述的组合物。
32.一种组合物,其包含治疗有效量的根据权利要求31所述的诱导的树突细胞,且进一步包含药学上可接受的赋形剂。
33.根据权利要求32所述的组合物,进一步包含抗病毒剂、镇痛剂、抗炎剂、化疗剂、放疗剂、抗生素、利尿剂、填充剂、粘合剂、崩解剂或润滑剂。
34.根据权利要求32-33中任一项所述的组合物用于制造用于治疗癌症、感染性疾病或自身免疫性疾病的药物的用途。
35.癌症的疫苗或可注射制剂,其包含根据权利要求32-33中的任一项所述的组合物或根据权利要求31所述的诱导的树突细胞。
36.根据权利要求35所述的疫苗或可注射制剂,其中所述疫苗或可注射制剂是原位注射剂。
37.一种试剂盒,其包含下述组分中的至少一种:
根据权利要求31所述的诱导的树突细胞;
根据权利要求32-33中的任一项所述的组合物;
根据权利要求1-5中的任一项所述的载体或构建体;
根据权利要求6-13中的任一项所述的组合物;
或其混合物。
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PT11593319 | 2019-11-25 | ||
PT115933 | 2019-11-25 | ||
EP20190159.2 | 2020-08-07 | ||
EP20190159 | 2020-08-07 | ||
PCT/EP2020/083400 WO2021105234A1 (en) | 2019-11-25 | 2020-11-25 | Compositions for reprogramming cells into dendritic cells type 2 competent for antigen presentation, methods and uses thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114729320A CN114729320A (zh) | 2022-07-08 |
CN114729320B true CN114729320B (zh) | 2023-12-15 |
Family
ID=73497794
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202080078541.XA Active CN114729320B (zh) | 2019-11-25 | 2020-11-25 | 用于将细胞重新程序化为能够呈递抗原的树突细胞2型的组合物、方法及其用途 |
Country Status (8)
Country | Link |
---|---|
US (1) | US20240139239A1 (zh) |
EP (1) | EP4065697A1 (zh) |
JP (1) | JP2023506707A (zh) |
KR (1) | KR20220105161A (zh) |
CN (1) | CN114729320B (zh) |
CA (1) | CA3156954A1 (zh) |
IL (1) | IL292894B2 (zh) |
WO (1) | WO2021105234A1 (zh) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP4477749A1 (en) | 2022-03-18 | 2024-12-18 | National Institute Of Advanced Industrial Science and Technology | Method for reprogramming fibroblast or fibroblast-like cell to conventional type-2 dendritic cell |
WO2024030930A2 (en) * | 2022-08-01 | 2024-02-08 | The Johns Hopkins University | Compositions and methods for modifying muller glial cells |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3385373A1 (en) * | 2017-04-05 | 2018-10-10 | Centro de Neurociências e Biologia Celular | Compositions for reprogramming cells into dendritic cells or antigen presenting cells, methods and uses thereof |
WO2018185709A1 (en) * | 2017-04-05 | 2018-10-11 | Centro De Neurociencias E Biologia Celular | Compositions for reprogramming cells into dendritic cells or antigen presenting cells, methods and uses thereof |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4690915A (en) | 1985-08-08 | 1987-09-01 | The United States Of America As Represented By The Department Of Health And Human Services | Adoptive immunotherapy as a treatment modality in humans |
US5219740A (en) | 1987-02-13 | 1993-06-15 | Fred Hutchinson Cancer Research Center | Retroviral gene transfer into diploid fibroblasts for gene therapy |
EP2558571A4 (en) | 2010-04-16 | 2014-09-24 | Immune Disease Inst Inc | DELAYED POLYPEPTIDE EXPRESSION FROM MODIFIED SYNTHETIC RNAS AND USES THEREOF |
WO2012019168A2 (en) | 2010-08-06 | 2012-02-09 | Moderna Therapeutics, Inc. | Engineered nucleic acids and methods of use thereof |
DE12722942T1 (de) | 2011-03-31 | 2021-09-30 | Modernatx, Inc. | Freisetzung und formulierung von manipulierten nukleinsäuren |
-
2020
- 2020-11-25 EP EP20810982.7A patent/EP4065697A1/en active Pending
- 2020-11-25 JP JP2022530281A patent/JP2023506707A/ja active Pending
- 2020-11-25 KR KR1020227019462A patent/KR20220105161A/ko active Pending
- 2020-11-25 CA CA3156954A patent/CA3156954A1/en active Pending
- 2020-11-25 CN CN202080078541.XA patent/CN114729320B/zh active Active
- 2020-11-25 WO PCT/EP2020/083400 patent/WO2021105234A1/en active Search and Examination
- 2020-11-25 US US17/778,901 patent/US20240139239A1/en active Pending
-
2022
- 2022-05-10 IL IL292894A patent/IL292894B2/en unknown
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3385373A1 (en) * | 2017-04-05 | 2018-10-10 | Centro de Neurociências e Biologia Celular | Compositions for reprogramming cells into dendritic cells or antigen presenting cells, methods and uses thereof |
WO2018185709A1 (en) * | 2017-04-05 | 2018-10-11 | Centro De Neurociencias E Biologia Celular | Compositions for reprogramming cells into dendritic cells or antigen presenting cells, methods and uses thereof |
Non-Patent Citations (1)
Title |
---|
Fábio F Rosa 等.Direct reprogramming of fibroblasts into antigen-presenting dendritic cells.Sci Immunol ..2018,第3卷(第30期),eaau4292. * |
Also Published As
Publication number | Publication date |
---|---|
IL292894B2 (en) | 2023-07-01 |
IL292894B1 (en) | 2023-03-01 |
EP4065697A1 (en) | 2022-10-05 |
US20240139239A1 (en) | 2024-05-02 |
CN114729320A (zh) | 2022-07-08 |
KR20220105161A (ko) | 2022-07-26 |
CA3156954A1 (en) | 2021-06-03 |
IL292894A (en) | 2022-07-01 |
JP2023506707A (ja) | 2023-02-20 |
WO2021105234A1 (en) | 2021-06-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110691844B (zh) | 从多能干细胞产生模拟的先天免疫细胞的方法和试剂盒 | |
Bontkes et al. | Dendritic cells transfected with interleukin-12 and tumor-associated antigen messenger RNA induce high avidity cytotoxic T cells | |
JP7303743B2 (ja) | 細胞を樹状細胞または抗原提示細胞にリプログラムするための組成物、その方法および使用 | |
KR20190102259A (ko) | 신규의 유전자 스위치 발현 시스템을 통한 폴리펩티드의 발현 조절 | |
CN114729320B (zh) | 用于将细胞重新程序化为能够呈递抗原的树突细胞2型的组合物、方法及其用途 | |
US20180256645A1 (en) | Cxcr6-transduced t cells for targeted tumor therapy | |
JP2024133642A (ja) | 活性dnaトランスポゾンシステム及びその使用方法 | |
CN104892770B (zh) | 一种对t细胞和造血干细胞具有高效感染和促增殖能力的慢病毒载体 | |
US20240052312A1 (en) | Composition for reprogramming cells into plasmacytoid dendritic cells or interferon type i-producing cells, methods and uses thereof | |
KR20240026905A (ko) | 림프구의 단일 용기 확장 | |
CN111971394A (zh) | 嵌合抗原受体的肿瘤环境特异性表达 | |
CN102002493B (zh) | 小rna-326制备药物的应用 | |
CN111676199B (zh) | 一种呈递并激活hsv-1型溶瘤病毒的car-t技术及其应用 | |
CN114134182A (zh) | 一种新型免疫细胞的制备方法及其应用 | |
CN111676245B (zh) | 一种含有HSV-1型溶瘤病毒的NFAT-Cre-CAR-T细胞及其应用 | |
JP2002524040A (ja) | 造血の制御における新規遺伝子 | |
CN116783284A (zh) | 用于细胞重编程的组合物、构建体和载体 | |
WO2021206104A1 (ja) | がんの処置に用いるpp65含有人工アジュバントベクター細胞 | |
US20220273714A1 (en) | T-cell receptors and methods of use thereof | |
KR20240008332A (ko) | 세포의 1형 통상적 수지상 세포 또는 항원-제시 세포로의 재프로그래밍 | |
CN117580947A (zh) | 将细胞重编程为1型经典树突状细胞或抗原呈递细胞 | |
CN113005090A (zh) | 共表达趋化因子和共刺激分子的dc细胞及其应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |